167 Downloads Updated 6 months ago
Updated 6 months ago
6 months ago
bd16bcc6bb7f · 14GB
base_model: - mistralai/Mistral-Small-24B-Instruct-2501
library_name: transformers
license: apache-2.0
Arcee-Blitz (24B) is a new Mistral-based 24B model distilled from DeepSeek, designed to be both fast and efficient. We view it as a practical “workhorse” model that can tackle a range of tasks without the overhead of larger architectures.
GGUF quants are available here
AWQ quants are available here
Arcee-Blitz shows large improvements to performance on MMLU-Pro versus the original Mistral-Small-3, reflecting a dramatic increase in world knowledge.
We carefully examined our training data and pipeline to avoid contamination. While we’re confident in the validity of these gains, we remain open to further community validation and testing (one of the key reasons we release these models as open-source).
Benchmark | mistral‑small‑3 | arcee‑blitz |
---|---|---|
MixEval | 81.6% | 85.1% |
GPQADiamond | 42.4% | 43.1% |
BigCodeBench Complete | 44.4% | 45.5% |
BigCodeBench Instruct | 34.7% | 35.9% |
BigCodeBench Complete-hard | 16.2% | 19.6% |
BigCodeBench Instruct-hard | 15.5% | 15.5% |
IFEval | 77.44 | 80.60 |
BBH | 64.46 | 65.00 |
GPQA | 33.90 | 36.70 |
MMLU Pro | 44.70 | 60.20 |
MuSR | 40.90 | 50.00 |
Math Level 5 | 12.00 | 38.60 |
Arcee-Blitz (24B) is released under the Apache-2.0 License. You are free to use, modify, and distribute this model in both commercial and non-commercial applications, subject to the terms and conditions of the license.
If you have questions or would like to share your experiences using Arcee-Blitz (24B), please connect with us on social media. We’re excited to see what you build—and how this model helps you innovate!