Ollama

llama3.3

New state of the art 70B model. Llama 3.3 70B offers similar performance compared to the Llama 3.1 405B model.

tools 70b

821.5K Pulls 14 Tags Updated 6 weeks ago

llama3.2

Meta's Llama 3.2 goes small with 1B and 3B models.

tools 1b 3b

7.2M Pulls 63 Tags Updated 3 months ago

llama3.1

Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.

tools 8b 70b 405b

19.6M Pulls 93 Tags Updated 7 weeks ago

mistral

The 7B model released by Mistral AI, updated to version 0.3.

tools 7b

8M Pulls 84 Tags Updated 6 months ago

qwen2

Qwen2 is a new series of large language models from Alibaba group

tools 0.5b 1.5b 7b 72b

4M Pulls 97 Tags Updated 4 months ago

qwen2.5

Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset, encompassing up to 18 trillion tokens. The model supports up to 128K tokens and has multilingual support.

tools 0.5b 1.5b 3b 7b 14b 32b 72b

3.3M Pulls 133 Tags Updated 4 months ago

qwen2.5-coder

The latest series of Code-Specific Qwen models, with significant improvements in code generation, code reasoning, and code fixing.

tools 0.5b 1.5b 3b 7b 14b 32b

1.6M Pulls 196 Tags Updated 2 months ago

mistral-nemo

A state-of-the-art 12B model with 128k context length, built by Mistral AI in collaboration with NVIDIA.

tools 12b

979.6K Pulls 17 Tags Updated 5 months ago

mixtral

A set of Mixture of Experts (MoE) model with open weights by Mistral AI in 8x7b and 8x22b parameter sizes.

tools 8x7b 8x22b

528.3K Pulls 70 Tags Updated 4 weeks ago

command-r

Command R is a Large Language Model optimized for conversational interaction and long context tasks.

tools 35b

262.8K Pulls 32 Tags Updated 4 months ago

qwq

QwQ is an experimental research model focused on advancing AI reasoning capabilities.

tools 32b

148.3K Pulls 5 Tags Updated 7 weeks ago

smollm2

SmolLM2 is a family of compact language models available in three size: 135M, 360M, and 1.7B parameters.

tools 135m 360m 1.7b

115.7K Pulls 49 Tags Updated 2 months ago

mistral-large

Mistral Large 2 is Mistral's new flagship model that is significantly more capable in code generation, mathematics, and reasoning with 128k context window and support for dozens of languages.

tools 123b

115.7K Pulls 32 Tags Updated 2 months ago

command-r-plus

Command R+ is a powerful, scalable large language model purpose-built to excel at real-world enterprise use cases.

tools 104b

115.4K Pulls 21 Tags Updated 4 months ago

hermes3

Hermes 3 is the latest version of the flagship Hermes series of LLMs by Nous Research

tools 3b 8b 70b 405b

73.6K Pulls 65 Tags Updated 5 weeks ago

athene-v2

Athene-V2 is a 72B parameter model which excels at code completion, mathematics, and log extraction tasks.

tools 72b

62.9K Pulls 17 Tags Updated 2 months ago

mistral-small

Mistral Small is a lightweight model designed for cost-effective use in tasks like translation and summarization.

tools 22b

62.4K Pulls 17 Tags Updated 4 months ago

nemotron

Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA to improve the helpfulness of LLM generated responses to user queries.

tools 70b

57K Pulls 17 Tags Updated 3 months ago

nemotron-mini

A commercial-friendly small language model by NVIDIA optimized for roleplay, RAG QA, and function calling.

tools 4b

51.8K Pulls 17 Tags Updated 4 months ago

llama3-groq-tool-use

A series of models from Groq that represent a significant advancement in open-source AI capabilities for tool use/function calling.

tools 8b 70b

44.4K Pulls 33 Tags Updated 6 months ago

granite3-dense

The IBM Granite 2B and 8B models are designed to support tool-based use cases and support for retrieval augmented generation (RAG), streamlining code generation, translation and bug fixing.

tools 2b 8b

38.4K Pulls 33 Tags Updated 2 months ago

granite3.1-dense

The IBM Granite 2B and 8B models are text-only dense LLMs trained on over 12 trillion tokens of data, demonstrated significant improvements over their predecessors in performance and speed in IBM’s initial testing.

tools 2b 8b

34.6K Pulls 33 Tags Updated 4 days ago

aya-expanse

Cohere For AI's language models trained to perform well across 23 different languages.

tools 8b 32b

25.7K Pulls 33 Tags Updated 2 months ago

granite3-moe

The IBM Granite 1B and 3B models are the first mixture of experts (MoE) Granite models from IBM designed for low latency usage.

tools 1b 3b

24.7K Pulls 33 Tags Updated 2 months ago

firefunction-v2

An open weights function calling model based on Llama 3, competitive with GPT-4o function calling capabilities.

tools 70b

17K Pulls 17 Tags Updated 6 months ago

granite3.1-moe

The IBM Granite 1B and 3B models are long-context mixture of experts (MoE) Granite models from IBM designed for low latency usage.

tools 1b 3b

11.8K Pulls 33 Tags Updated 4 days ago

command-r7b

The smallest model in Cohere's R series delivers top-tier speed, efficiency, and quality to build powerful AI applications on commodity GPUs and edge devices.

tools 7b

5,752 Pulls 5 Tags Updated 5 days ago