Meta's latest collection of multimodal models.
725.6K Pulls 11 Tags Updated 3 months ago
Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.
103.9M Pulls 93 Tags Updated 10 months ago
New state of the art 70B model. Llama 3.3 70B offers similar performance compared to the Llama 3.1 405B model.
2.6M Pulls 14 Tags Updated 10 months ago
An open weights function calling model based on Llama 3, competitive with GPT-4o function calling capabilities.
34.5K Pulls 17 Tags Updated 1 year ago
The Llama 4 models are Meta's flagship LLMs. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text understanding and generation.
6,018 Pulls 5 Tags Updated 5 months ago
The Llama 4 collection of models are natively multimodal AI models that enable text and multimodal experiences. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.
3,286 Pulls 3 Tags Updated 5 months ago
unsloth's v2.0 dynamic quants of Llama-4-Scout-17B-16E-Instruct-GGUF, Q2_K_XL(2.71-bit)
296 Pulls 1 Tag Updated 4 months ago
56 Pulls 1 Tag Updated 3 months ago
Works, thanks ollama team for supporting Llama4!
936 Pulls 1 Tag Updated 6 months ago
MiniCPM-V surpasses proprietary models such as GPT-4V, Gemini Pro, Qwen-VL and Claude 3 in overall performance, and support multimodal conversation for over 30 languages.
44.6K Pulls 8 Tags Updated 1 year ago
New state of the art 70B model. Llama 3.3 70B offers similar performance compared to Llama 3.1 405B model.
22.1K Pulls 10 Tags Updated 9 months ago
The ollama model for the 4bit-quantized GGUF version of llama3-8b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-4bit).
7,445 Pulls 2 Tags Updated 1 year ago
It uses this one Q4_K_M-imat (4.89 BPW) quant for up to 12288 context sizes. for less than 8gb vram
3,397 Pulls 1 Tag Updated 1 year ago
The ollama model for the 4bit-quantized GGUF version of llama3-70b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-70B-Chinese-Chat-GGUF-4bit).
2,752 Pulls 1 Tag Updated 1 year ago
NousResearch/Hermes-3-Llama-3.1-405B
2,748 Pulls 2 Tags Updated 1 year ago
Llama3.2 1b trained on data distilled from gpt4o, claude3.5 and claude opus
981 Pulls 2 Tags Updated 3 months ago
https://habr.com/ru/articles/830332/
804 Pulls 1 Tag Updated 1 year ago
minicpm-llama3-2.5-8b-16-v With only 8B parameters, it surpasses widely used proprietary models like GPT-4V-1106, Gemini Pro, Claude 3 and Qwen-VL-Max and greatly outperforms other Llama 3-based MLLMs
484 Pulls 1 Tag Updated 1 year ago
https://huggingface.co/taide/Llama3-TAIDE-LX-8B-Chat-Alpha1-4bit
390 Pulls 1 Tag Updated 1 year ago
379 Pulls 1 Tag Updated 1 year ago