MathΣtral: a 7B model designed for math reasoning and scientific discovery by Mistral AI.
522.1K Pulls 17 Tags Updated 1 year ago
A family of efficient AI models under 10B parameters performant in science, math, and coding through innovative training techniques.
2.6M Pulls 17 Tags Updated 1 year ago
Phi-4-mini brings significant enhancements in multilingual support, reasoning, and mathematics, and now supports the long-awaited function calling feature.
1.2M Pulls 5 Tags Updated 1 year ago
EXAONE Deep, developed and released by LG AI Research, ranges from 2.4B to 32B parameters and exhibits superior capabilities in various reasoning tasks, including math and coding benchmarks.
740K Pulls 13 Tags Updated 1 year ago
Mistral Large 2 is Mistral's new flagship model that is significantly more capable in code generation, mathematics, and reasoning, with a 128k context window and support for dozens of languages.
1.2M Pulls 32 Tags Updated 1 year ago
Qwen2 Math is a series of specialized math language models built upon the Qwen2 LLMs, which significantly outperform open-source models and even closed-source models (e.g., GPT-4o) in mathematical capability.
1M Pulls 52 Tags Updated 1 year ago
Athene-V2 is a 72B-parameter model that excels at code completion, mathematics, and log extraction tasks.
565.2K Pulls 17 Tags Updated 1 year ago
Model focused on math and logic problems
946.9K Pulls 64 Tags Updated 2 years ago
Building upon Mistral Small 3, Mistral Small 3.1 (2503) adds state-of-the-art vision understanding and enhances long context capabilities up to 128k tokens without compromising text performance.
743.6K Pulls 5 Tags Updated 1 year ago
MiniMax-M2.5 is a state-of-the-art large language model designed for real-world productivity and coding tasks.
2.1M Pulls 1 Tag Updated 3 months ago
🎩 Magicoder is a family of 7B-parameter models trained on 75K synthetic instruction examples using OSS-Instruct, a novel approach to enlightening LLMs with open-source code snippets.
530.7K Pulls 18 Tags Updated 2 years ago
Kimi K2 Thinking, Moonshot AI's best open-source thinking model.
2M Pulls 1 Tag Updated 6 months ago
Google Gemma 2 is a high-performing and efficient model available in three sizes: 2B, 9B, and 27B.
23.7M Pulls 94 Tags Updated 1 year ago
State-of-the-art large embedding model from mixedbread.ai
10.8M Pulls 4 Tags Updated 2 years ago
A state-of-the-art 12B model with 128k context length, built by Mistral AI in collaboration with NVIDIA.
4.6M Pulls 17 Tags Updated 10 months ago
SmolLM2 is a family of compact language models available in three sizes: 135M, 360M, and 1.7B parameters.
3.4M Pulls 49 Tags Updated 1 year ago
Skole-AI
3 Pulls 1 Tag Updated 3 weeks ago
Sailor2 are multilingual language models made for South-East Asia. Available in 1B, 8B, and 20B parameter sizes.
390.8K Pulls 13 Tags Updated 1 year ago
7B parameter text-to-SQL model made by MotherDuck and Numbers Station.
503.9K Pulls 17 Tags Updated 2 years ago
1,489 Pulls 1 Tag Updated 1 year ago