New state of the art 70B model. Llama 3.3 70B offers similar performance compared to the Llama 3.1 405B model.
821.5K Pulls 14 Tags Updated 6 weeks ago
Meta's Llama 3.2 goes small with 1B and 3B models.
7.2M Pulls 63 Tags Updated 3 months ago
Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.
19.6M Pulls 93 Tags Updated 7 weeks ago
The 7B model released by Mistral AI, updated to version 0.3.
8M Pulls 84 Tags Updated 6 months ago
Qwen2 is a new series of large language models from Alibaba group
4M Pulls 97 Tags Updated 4 months ago
Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset, encompassing up to 18 trillion tokens. The model supports up to 128K tokens and has multilingual support.
3.3M Pulls 133 Tags Updated 4 months ago
The latest series of Code-Specific Qwen models, with significant improvements in code generation, code reasoning, and code fixing.
1.6M Pulls 196 Tags Updated 2 months ago
A state-of-the-art 12B model with 128k context length, built by Mistral AI in collaboration with NVIDIA.
979.6K Pulls 17 Tags Updated 5 months ago
A set of Mixture of Experts (MoE) model with open weights by Mistral AI in 8x7b and 8x22b parameter sizes.
528.3K Pulls 70 Tags Updated 4 weeks ago
Command R is a Large Language Model optimized for conversational interaction and long context tasks.
262.8K Pulls 32 Tags Updated 4 months ago
QwQ is an experimental research model focused on advancing AI reasoning capabilities.
148.3K Pulls 5 Tags Updated 7 weeks ago
SmolLM2 is a family of compact language models available in three size: 135M, 360M, and 1.7B parameters.
115.7K Pulls 49 Tags Updated 2 months ago
Mistral Large 2 is Mistral's new flagship model that is significantly more capable in code generation, mathematics, and reasoning with 128k context window and support for dozens of languages.
115.7K Pulls 32 Tags Updated 2 months ago
Command R+ is a powerful, scalable large language model purpose-built to excel at real-world enterprise use cases.
115.4K Pulls 21 Tags Updated 4 months ago
Hermes 3 is the latest version of the flagship Hermes series of LLMs by Nous Research
73.6K Pulls 65 Tags Updated 5 weeks ago
Athene-V2 is a 72B parameter model which excels at code completion, mathematics, and log extraction tasks.
62.9K Pulls 17 Tags Updated 2 months ago
Mistral Small is a lightweight model designed for cost-effective use in tasks like translation and summarization.
62.4K Pulls 17 Tags Updated 4 months ago
Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA to improve the helpfulness of LLM generated responses to user queries.
57K Pulls 17 Tags Updated 3 months ago
A commercial-friendly small language model by NVIDIA optimized for roleplay, RAG QA, and function calling.
51.8K Pulls 17 Tags Updated 4 months ago
A series of models from Groq that represent a significant advancement in open-source AI capabilities for tool use/function calling.
44.4K Pulls 33 Tags Updated 6 months ago
The IBM Granite 2B and 8B models are designed to support tool-based use cases and support for retrieval augmented generation (RAG), streamlining code generation, translation and bug fixing.
38.4K Pulls 33 Tags Updated 2 months ago
The IBM Granite 2B and 8B models are text-only dense LLMs trained on over 12 trillion tokens of data, demonstrated significant improvements over their predecessors in performance and speed in IBM’s initial testing.
34.6K Pulls 33 Tags Updated 4 days ago
Cohere For AI's language models trained to perform well across 23 different languages.
25.7K Pulls 33 Tags Updated 2 months ago
The IBM Granite 1B and 3B models are the first mixture of experts (MoE) Granite models from IBM designed for low latency usage.
24.7K Pulls 33 Tags Updated 2 months ago
An open weights function calling model based on Llama 3, competitive with GPT-4o function calling capabilities.
17K Pulls 17 Tags Updated 6 months ago
The IBM Granite 1B and 3B models are long-context mixture of experts (MoE) Granite models from IBM designed for low latency usage.
11.8K Pulls 33 Tags Updated 4 days ago
The smallest model in Cohere's R series delivers top-tier speed, efficiency, and quality to build powerful AI applications on commodity GPUs and edge devices.
5,752 Pulls 5 Tags Updated 5 days ago