376 11 months ago

A specialized medical model fine-tuned from Qwen3 using SFT and Group Relative Policy Optimization (GRPO) for advanced clinical case analysis.