Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.
107.6M Pulls 93 Tags Updated 1 year ago
Meta's Llama 3.2 goes small with 1B and 3B models.
49.9M Pulls 63 Tags Updated 1 year ago
Meta Llama 3: The most capable openly available LLM to date
13M Pulls 68 Tags Updated 1 year ago
Llama 3.2 Vision is a collection of instruction-tuned image reasoning generative models in 11B and 90B sizes.
3.3M Pulls 9 Tags Updated 6 months ago
New state of the art 70B model. Llama 3.3 70B offers similar performance compared to the Llama 3.1 405B model.
2.8M Pulls 14 Tags Updated 1 year ago
A model from NVIDIA based on Llama 3 that excels at conversational question answering (QA) and retrieval-augmented generation (RAG).
154.2K Pulls 35 Tags Updated 1 year ago
This model extends LLama-3 8B's context length from 8k to over 1m tokens.
140.9K Pulls 35 Tags Updated 1 year ago
A series of models from Groq that represent a significant advancement in open-source AI capabilities for tool use/function calling.
124.9K Pulls 33 Tags Updated 1 year ago
Llama Guard 3 is a series of models fine-tuned for content safety classification of LLM inputs and responses.
119K Pulls 33 Tags Updated 1 year ago
A LLaVA model fine-tuned from Llama 3 Instruct with better scores in several benchmarks.
2.1M Pulls 4 Tags Updated 1 year ago
Dolphin 2.9 is a new model with 8B and 70B sizes by Eric Hartford based on Llama 3 that has a variety of instruction, conversational, and coding skills.
635.1K Pulls 53 Tags Updated 1 year ago
Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.
3.5M Pulls 5 Tags Updated 11 months ago
The TinyLlama project is an open endeavor to train a compact 1.1B Llama model on 3 trillion tokens.
3.2M Pulls 36 Tags Updated 1 year ago
A strong multi-lingual general language model with competitive performance to Llama 3.
190K Pulls 32 Tags Updated 1 year ago
Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA to improve the helpfulness of LLM generated responses to user queries.
119.4K Pulls 17 Tags Updated 1 year ago
An open weights function calling model based on Llama 3, competitive with GPT-4o function calling capabilities.
59.1K Pulls 17 Tags Updated 1 year ago
A new small LLaVA model fine-tuned from Phi 3 Mini.
154.6K Pulls 4 Tags Updated 1 year ago
SmolLM2 is a family of compact language models available in three size: 135M, 360M, and 1.7B parameters.
2.2M Pulls 49 Tags Updated 1 year ago
Olmo is a series of Open language models designed to enable the science of language models. These models are pre-trained on the Dolma 3 dataset and post-trained on the Dolci datasets.
3,655 Pulls 15 Tags Updated yesterday
1,755 Pulls 10 Tags Updated yesterday