Qwen2.5-N2 is a fine-tune of Qwen2.5-0.5B-Instruct and Qwen2.5-1.5B-Instruct.
Qwen2.5-N2 is the newer version of Qwen2.5-N. Even though it was trained on less data, it performs noticeably better.
| Training Stage | Epochs | Easy Data | Medium Data | Hard Data |
|---|---|---|---|---|
| Stage 1: Hard | 1–3 | 0% | 20% | 80% |
| Stage 2: Medium Transition | 4–6 | 10% | 50% | 40% |
| Stage 3: Medium Dominance | 7–9 | 25% | 60% | 15% |
| Stage 4: Easy Reinforcement | 10 | 40% | 50% | 10% |
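
The schedule in the table above can be read as a per-epoch sampling mix over three difficulty pools. The following is a minimal sketch of that idea, not the actual training code: the names `SCHEDULE`, `mix_for_epoch`, and `sample_epoch` are hypothetical, and it assumes the training data has already been split into `easy`, `medium`, and `hard` lists and that examples are drawn with replacement.

```python
import random

# Mixing ratios (easy, medium, hard) per epoch range, copied from the table.
# The structure and names here are illustrative assumptions.
SCHEDULE = [
    (range(1, 4), (0.00, 0.20, 0.80)),    # Stage 1: Hard
    (range(4, 7), (0.10, 0.50, 0.40)),    # Stage 2: Medium Transition
    (range(7, 10), (0.25, 0.60, 0.15)),   # Stage 3: Medium Dominance
    (range(10, 11), (0.40, 0.50, 0.10)),  # Stage 4: Easy Reinforcement
]


def mix_for_epoch(epoch):
    """Return the (easy, medium, hard) ratios for a 1-based epoch number."""
    for epochs, ratios in SCHEDULE:
        if epoch in epochs:
            return ratios
    raise ValueError(f"epoch {epoch} is outside the 10-epoch schedule")


def sample_epoch(pools, epoch, n_samples, seed=0):
    """Draw n_samples examples (with replacement) using the epoch's mix.

    pools maps "easy", "medium", and "hard" to non-empty lists of examples.
    """
    rng = random.Random(seed + epoch)
    easy, medium, hard = mix_for_epoch(epoch)
    # Pick a difficulty level for each slot, then one example from that pool.
    levels = rng.choices(
        ["easy", "medium", "hard"], weights=[easy, medium, hard], k=n_samples
    )
    return [rng.choice(pools[level]) for level in levels]


if __name__ == "__main__":
    pools = {
        "easy": ["e1", "e2"],
        "medium": ["m1", "m2"],
        "hard": ["h1", "h2"],
    }
    for epoch in (1, 5, 10):
        print(epoch, mix_for_epoch(epoch), sample_epoch(pools, epoch, 8))
```

Because the draw is random, the realized proportions in any one epoch only approximate the table's percentages; an exact split would instead take fixed counts from each pool.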
All work is released under the original Qwen2.5 Apache 2.0 license.