Meta's latest collection of multimodal models.
898.8K Pulls 11 Tags Updated 6 months ago
Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.
107.6M Pulls 93 Tags Updated 1 year ago
New state of the art 70B model. Llama 3.3 70B offers similar performance compared to the Llama 3.1 405B model.
2.8M Pulls 14 Tags Updated 1 year ago
An open weights function calling model based on Llama 3, competitive with GPT-4o function calling capabilities.
59.1K Pulls 17 Tags Updated 1 year ago
The Llama 4 models are Meta's flagship LLMs. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text understanding and generation.
6,147 Pulls 5 Tags Updated 8 months ago
The Llama 4 collection of models are natively multimodal AI models that enable text and multimodal experiences. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.
3,792 Pulls 3 Tags Updated 7 months ago
Works, thanks ollama team for supporting Llama4!
1,306 Pulls 1 Tag Updated 8 months ago
unsloth's v2.0 dynamic quants of Llama-4-Scout-17B-16E-Instruct-GGUF, Q2_K_XL(2.71-bit)
352 Pulls 1 Tag Updated 6 months ago
67 Pulls 1 Tag Updated 5 months ago
MiniCPM-V surpasses proprietary models such as GPT-4V, Gemini Pro, Qwen-VL and Claude 3 in overall performance, and support multimodal conversation for over 30 languages.
44.9K Pulls 8 Tags Updated 1 year ago
New state of the art 70B model. Llama 3.3 70B offers similar performance compared to Llama 3.1 405B model.
23.5K Pulls 10 Tags Updated 1 year ago
The ollama model for the 4bit-quantized GGUF version of llama3-8b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-4bit).
7,569 Pulls 2 Tags Updated 1 year ago
This model requires Ollama v0.6.6 or later
4,294 Pulls 1 Tag Updated 7 months ago
It uses this one Q4_K_M-imat (4.89 BPW) quant for up to 12288 context sizes. for less than 8gb vram
3,490 Pulls 1 Tag Updated 1 year ago
NousResearch/Hermes-3-Llama-3.1-405B
3,433 Pulls 2 Tags Updated 1 year ago
The ollama model for the 4bit-quantized GGUF version of llama3-70b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-70B-Chinese-Chat-GGUF-4bit).
2,757 Pulls 1 Tag Updated 1 year ago
reasoning model that is post trained for reasoning, human chat preferences, and tasks, such as RAG and tool calling.
2,457 Pulls 6 Tags Updated 9 months ago
Llama3.2 1b trained on data distilled from gpt4o, claude3.5 and claude opus
1,687 Pulls 2 Tags Updated 5 months ago
https://habr.com/ru/articles/830332/
857 Pulls 1 Tag Updated 1 year ago
554 Pulls 1 Tag Updated 8 months ago