376 11 months ago

A specialized medical model fine-tuned from Qwen3 using SFT and Group Relative Policy Optimization (GRPO) for advanced clinical case analysis.

41854258a067 · 67B
{
"stop": [
"|endoftext|"
],
"temperature": 0.7,
"top_k": 20,
"top_p": 0.95
}