1 Download Updated 12 hours ago
ollama run DedeProGames/Astral-2.7:20b
Astral-2.7 is a 20B-parameter AI model based on gpt-oss:20b, designed for high performance on general reasoning tasks.
| Benchmark | Métrica | gpt-oss:20b | Astral-2.7 |
|---|---|---|---|
| AIME 2024 (no tools) | accuracy % | 80.0 | 92.1 |
| AIME 2024 (with tools) | accuracy % | 86.0 | 96.0 |
| AIME 2025 (no tools) | accuracy % | 72.1 | 91.7 |
| AIME 2025 (with tools) | accuracy % | 90.4 | 98.7 |
| GPQA Diamond (no tools) | accuracy % | 66.0 | 71.5 |
| GPQA Diamond (with tools) | accuracy % | 67.1 | 74.2 |
| HLE (no tools) | score % | 7.0 | 10.9 |
| HLE (with tools) | score % | 8.8 | 17.3 |
| MMLU | accuracy % | 84.0 | 85.3 |
| MMMLU (Average) | accuracy % | 73.5 | 75.7 |
| SWE-Bench Verified | pass@1 % | 53.2 | 60.7 |
| Codeforces (no tools) | Elo | 1998 | 2230 |
| Codeforces (with tools) | Elo | 2064 | 2516 |
| Aider Polyglot | pass@1 % | 26.6 | 34.2 |
| Tau-Bench Retail | score % | 47.3 | 54.8 |
| Tau-Bench Airline | score % | 42.6 | 38.0 |
| HealthBench | score % | 41.8 | 42.5 |
| HealthBench Hard | score % | 12.9 | 10.8 |
| HealthBench Consensus | score % | 83.0 | 82.6 |
ollama pull DedeProGames/Astral-2.7:20b
ollama pull DedeProGames/Astral-2.7:20b-plus