Meta's latest collection of multimodal models.
699.4K Pulls 11 Tags Updated 3 months ago
Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.
103.1M Pulls 93 Tags Updated 9 months ago
New state of the art 70B model. Llama 3.3 70B offers similar performance compared to the Llama 3.1 405B model.
2.5M Pulls 14 Tags Updated 9 months ago
An open weights function calling model based on Llama 3, competitive with GPT-4o function calling capabilities.
33.6K Pulls 17 Tags Updated 1 year ago
The Llama 4 models are Meta's flagship LLMs. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text understanding and generation.
5,994 Pulls 5 Tags Updated 5 months ago
The Llama 4 collection of models are natively multimodal AI models that enable text and multimodal experiences. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.
3,250 Pulls 3 Tags Updated 4 months ago
Works, thanks ollama team for supporting Llama4!
907 Pulls 1 Tag Updated 5 months ago
unsloth's v2.0 dynamic quants of Llama-4-Scout-17B-16E-Instruct-GGUF, Q2_K_XL(2.71-bit)
281 Pulls 1 Tag Updated 4 months ago
56 Pulls 1 Tag Updated 3 months ago
MiniCPM-V surpasses proprietary models such as GPT-4V, Gemini Pro, Qwen-VL and Claude 3 in overall performance, and support multimodal conversation for over 30 languages.
44.6K Pulls 8 Tags Updated 1 year ago
New state of the art 70B model. Llama 3.3 70B offers similar performance compared to Llama 3.1 405B model.
22K Pulls 10 Tags Updated 9 months ago
The ollama model for the 4bit-quantized GGUF version of llama3-8b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-4bit).
7,442 Pulls 2 Tags Updated 1 year ago
This model requires Ollama v0.6.6 or later
3,677 Pulls 1 Tag Updated 5 months ago
It uses this one Q4_K_M-imat (4.89 BPW) quant for up to 12288 context sizes. for less than 8gb vram
3,219 Pulls 1 Tag Updated 1 year ago
The ollama model for the 4bit-quantized GGUF version of llama3-70b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-70B-Chinese-Chat-GGUF-4bit).
2,752 Pulls 1 Tag Updated 1 year ago
NousResearch/Hermes-3-Llama-3.1-405B
2,720 Pulls 2 Tags Updated 1 year ago
reasoning model that is post trained for reasoning, human chat preferences, and tasks, such as RAG and tool calling.
2,212 Pulls 6 Tags Updated 6 months ago
Llama3.2 1b trained on data distilled from gpt4o, claude3.5 and claude opus
939 Pulls 2 Tags Updated 2 months ago
https://habr.com/ru/articles/830332/
791 Pulls 1 Tag Updated 1 year ago
minicpm-llama3-2.5-8b-16-v With only 8B parameters, it surpasses widely used proprietary models like GPT-4V-1106, Gemini Pro, Claude 3 and Qwen-VL-Max and greatly outperforms other Llama 3-based MLLMs
481 Pulls 1 Tag Updated 1 year ago