-
llama3.2-chinese
llama3.2 countinue pretrain with wiki-zh and sft with chinese
504 Pulls 1 Tag Updated 1 year ago
-
Qwen3_Medical_GRPO
A specialized medical model fine-tuned from Qwen3 using SFT and Group Relative Policy Optimization (GRPO) for advanced clinical case analysis.
376 Pulls 1 Tag Updated 11 months ago
-
llama3.1-health
llama3.1 SFT with DataSet Flmc/DISC-Med-SFT
tools4 Pulls 1 Tag Updated 1 year ago