
A specialized medical model fine-tuned from Qwen3 with supervised fine-tuning (SFT) and Group Relative Policy Optimization (GRPO) for advanced clinical case analysis.

323f8fe82249 · 283B
{{- if .System }}
{{ .System }}|endoftext|
{{- end }}
{{- range .Messages }}
{{- if eq .Role "user" }}
{{ .Content }}
{{- end }}
{{- if eq .Role "assistant" }}
{{ .Content }}|endoftext|
{{- end }}
{{- end }}
{{- if .Prompt }}
{{ .Prompt }}<start_working_out>
{{- end -}}
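
To see what prompt string the template above produces, it can be rendered locally with Go's standard `text/template` package. This is a minimal sketch and not Ollama's own rendering pipeline: the `Message` and `promptData` types and the `render` helper are hypothetical stand-ins that only mirror the field names the template references (`.System`, `.Messages`, `.Role`, `.Content`, `.Prompt`). The template text is copied verbatim from the listing above.

```go
package main

import (
	"fmt"
	"strings"
	"text/template"
)

// Message is a hypothetical stand-in for one chat turn; only the
// Role and Content fields are used by the template.
type Message struct {
	Role    string
	Content string
}

// promptData is a hypothetical container for the values the template reads.
type promptData struct {
	System   string
	Messages []Message
	Prompt   string
}

// chatTemplate is the template from the listing above, copied verbatim.
const chatTemplate = `{{- if .System }}
{{ .System }}|endoftext|
{{- end }}
{{- range .Messages }}
{{- if eq .Role "user" }}
{{ .Content }}
{{- end }}
{{- if eq .Role "assistant" }}
{{ .Content }}|endoftext|
{{- end }}
{{- end }}
{{- if .Prompt }}
{{ .Prompt }}<start_working_out>
{{- end -}}`

// render parses the template and executes it against data,
// returning the resulting prompt string.
func render(data promptData) (string, error) {
	tmpl, err := template.New("chat").Parse(chatTemplate)
	if err != nil {
		return "", err
	}
	var sb strings.Builder
	if err := tmpl.Execute(&sb, data); err != nil {
		return "", err
	}
	return sb.String(), nil
}

func main() {
	out, err := render(promptData{
		System: "You are a clinical assistant.",
		Messages: []Message{
			{Role: "user", Content: "Patient presents with chest pain."},
			{Role: "assistant", Content: "Consider an ECG first."},
		},
		Prompt: "What is the next step?",
	})
	if err != nil {
		panic(err)
	}
	// %q makes the newlines inserted by the template visible.
	fmt.Printf("%q\n", out)
}
```

Because every action except the value lines uses a leading `{{-` trim marker, each emitted segment starts with a single newline and no blank lines appear between turns. The system and assistant segments are terminated by `|endoftext|`, and a non-empty `.Prompt` is followed directly by `<start_working_out>`, which presumably cues the model to begin its reasoning.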