devkit/L1-Qwen-1.5B-Max:f16
68 Downloads · Updated 9 months ago
Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
template · 369ca498f347 · 387B
{{- if .System }}{{ .System }}{{ end }}
{{- range $i, $_ := .Messages }}
{{- $last := eq (len (slice $.Messages $i)) 1}}
{{- if eq .Role "user" }}<|User|>{{ .Content }}
{{- else if eq .Role "assistant" }}<|Assistant|>{{ .Content }}{{- if not $last }}<|end▁of▁sentence|>{{- end }}
{{- end }}
{{- if and $last (ne .Role "assistant") }}<|Assistant|>{{- end }}
{{- end }}
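
To make the template above easier to follow, here is a minimal Python sketch that mirrors its logic by hand. It is an illustration, not the runtime's implementation: the function name `render_prompt` and the message dictionaries with `role` and `content` keys are assumptions made for this example. The sketch relies on one property of Go templates: every `{{-` marker trims the whitespace before it, so the line breaks in the template source do not appear in the rendered prompt.

```python
# Hypothetical helper that mirrors the chat template above, branch by branch.
# The name render_prompt and the dict-based message shape are assumptions.

EOS = "<|end▁of▁sentence|>"


def render_prompt(messages, system=""):
    """Build the prompt string the template would produce for `messages`."""
    parts = []

    # {{- if .System }}{{ .System }}{{ end }}
    if system:
        parts.append(system)

    for i, msg in enumerate(messages):
        # $last is true only for the final message in the list.
        last = i == len(messages) - 1

        if msg["role"] == "user":
            parts.append("<|User|>" + msg["content"])
        elif msg["role"] == "assistant":
            parts.append("<|Assistant|>" + msg["content"])
            # A finished assistant turn is closed with the end-of-sentence
            # token; a trailing assistant message is left open so the model
            # can continue it.
            if not last:
                parts.append(EOS)

        # If the conversation does not end on an assistant message, open a
        # new assistant turn for the model to fill in.
        if last and msg["role"] != "assistant":
            parts.append("<|Assistant|>")

    # No separators: the template's trim markers remove all line breaks.
    return "".join(parts)


if __name__ == "__main__":
    chat = [
        {"role": "user", "content": "Hi"},
        {"role": "assistant", "content": "Hello!"},
        {"role": "user", "content": "2+2?"},
    ]
    print(render_prompt(chat, system="You are helpful."))
    # You are helpful.<|User|>Hi<|Assistant|>Hello!<|end▁of▁sentence|><|User|>2+2?<|Assistant|>
```

Two behaviors are worth noting. Earlier assistant turns are terminated with `<|end▁of▁sentence|>`, while a final assistant message is left unterminated, which allows a partial reply to be prefilled. When the last message comes from any other role, the prompt ends with a bare `<|Assistant|>` tag.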