SmolLM2 is a family of compact language models available in three size: 135M, 360M, and 1.7B parameters.
495.8K Pulls 49 Tags Updated 5 months ago
🪐 A family of small models with 135M, 360M, and 1.7B parameters, trained on a new high-quality dataset.
208.4K Pulls 94 Tags Updated 7 months ago
StarCoder is a code generation model trained on 80+ programming languages.
190.9K Pulls 100 Tags Updated 17 months ago
Stable Code 3B is a coding model with instruct and code completion variants on par with models such as Code Llama 7B that are 2.5x larger.
125.1K Pulls 36 Tags Updated 12 months ago
Starling is a large language model trained by reinforcement learning from AI feedback focused on improving chatbot helpfulness.
82.3K Pulls 36 Tags Updated 12 months ago
Solar Pro Preview: an advanced large language model (LLM) with 22 billion parameters designed to fit into a single GPU
33.7K Pulls 18 Tags Updated 6 months ago
DeepSeek Coder is a capable coding model trained on two trillion code and natural language tokens.
637.6K Pulls 102 Tags Updated 15 months ago
Sentence-transformers model that can be used for tasks like clustering or semantic search.
56.2K Pulls 3 Tags Updated 8 months ago
A state-of-the-art 12B model with 128k context length, built by Mistral AI in collaboration with NVIDIA.
1.5M Pulls 17 Tags Updated 8 months ago
A suite of text embedding models by Snowflake, optimized for performance.
704.3K Pulls 16 Tags Updated 11 months ago
Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.
620.2K Pulls 5 Tags Updated 2 months ago
Aya 23, released by Cohere, is a new family of state-of-the-art, multilingual models that support 23 languages.
139.1K Pulls 33 Tags Updated 10 months ago
Stable LM 2 is a state-of-the-art 1.6B and 12B parameter language model trained on multilingual data in English, Spanish, German, Italian, French, Portuguese, and Dutch.
109.9K Pulls 84 Tags Updated 11 months ago
SQLCoder is a code completion model fined-tuned on StarCoder for SQL generation tasks
98.8K Pulls 48 Tags Updated 14 months ago
A companion assistant trained in philosophy, psychology, and personal relationships. Based on Mistral.
91.3K Pulls 49 Tags Updated 17 months ago
A compact, yet powerful 10.7B large language model designed for single-turn conversation.
80.2K Pulls 32 Tags Updated 15 months ago
Llama 2 based model fine tuned on an Orca-style dataset. Originally called Free Willy.
59.6K Pulls 49 Tags Updated 17 months ago
Snowflake's frontier embedding model. Arctic Embed 2.0 adds multilingual support without sacrificing English performance or scalability.
54.5K Pulls 3 Tags Updated 4 months ago
A new small reasoning model fine-tuned from the Qwen 2.5 3B Instruct model.
51.2K Pulls 5 Tags Updated 3 months ago
ShieldGemma is set of instruction tuned models for evaluating the safety of text prompt input and text output responses against a set of defined safety policies.
38K Pulls 49 Tags Updated 5 months ago