devkit/L1-Qwen-1.5B-Max:f16/template

Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

template

369ca498f347 · 387B

{{- if eq .Role "user" }}<｜User｜>{{ .Content }}
{{- else if eq .Role "assistant" }}<｜Assistant｜>{{ .Content }}{{- if not $last }}<｜end▁of▁sentence｜>{{- end }}
{{- if and $last (ne .Role "assistant") }}<｜Assistant｜>{{- end }}
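
To see what prompt string these template lines produce, the following is a minimal sketch that renders them with Go's standard `text/template` package. The excerpt above uses `$last` without defining it and shows no enclosing loop, so the `range` over `.Messages`, the `$last` definition, and the closing `{{- end }}` actions are assumptions added to make the fragment executable; they are not taken from the listing. The `Message` type and `RenderPrompt` function are likewise hypothetical names used only for illustration.

```go
package main

import (
	"fmt"
	"strings"
	"text/template"
)

// Message mirrors the two fields the template reads: .Role and .Content.
// (Hypothetical type, for illustration only.)
type Message struct {
	Role    string
	Content string
}

// promptTemplate embeds the three template lines from the listing.
// ASSUMPTION: the range loop, the $last definition, and the closing
// {{- end }} actions are not in the excerpt; they are filled in here
// so that the fragment parses and runs.
const promptTemplate = `{{- range $i, $_ := .Messages }}
{{- $last := eq (len (slice $.Messages $i)) 1 }}
{{- if eq .Role "user" }}<｜User｜>{{ .Content }}
{{- else if eq .Role "assistant" }}<｜Assistant｜>{{ .Content }}{{- if not $last }}<｜end▁of▁sentence｜>{{- end }}
{{- end }}
{{- if and $last (ne .Role "assistant") }}<｜Assistant｜>{{- end }}
{{- end }}`

// RenderPrompt executes the template over a conversation and returns
// the resulting prompt string.
func RenderPrompt(messages []Message) (string, error) {
	tmpl, err := template.New("prompt").Parse(promptTemplate)
	if err != nil {
		return "", err
	}
	var sb strings.Builder
	data := struct{ Messages []Message }{Messages: messages}
	if err := tmpl.Execute(&sb, data); err != nil {
		return "", err
	}
	return sb.String(), nil
}

func main() {
	out, err := RenderPrompt([]Message{
		{Role: "user", Content: "Hi"},
		{Role: "assistant", Content: "Hello!"},
		{Role: "user", Content: "What is 2+2?"},
	})
	if err != nil {
		panic(err)
	}
	fmt.Println(out)
}
```

Under these assumptions, an earlier assistant turn is closed with `<｜end▁of▁sentence｜>`, while a conversation that ends on a user turn gets a trailing `<｜Assistant｜>` tag so the model continues from there. A final assistant message is left open, which allows a partial reply to be continued.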