The most powerful vision-language model in the Qwen model family to date.
798.1K Pulls 59 Tags Updated 1 month ago
Uncensored version of the Wizard LM model.
114.5K Pulls 18 Tags Updated 2 years ago
🌋 LLaVA is a novel end-to-end trained large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding. Updated to version 1.6.
11.9M Pulls 98 Tags Updated 1 year ago
A series of multimodal LLMs (MLLMs) designed for vision-language understanding.
4.1M Pulls 17 Tags Updated 1 year ago
Llama 3.2 Vision is a collection of instruction-tuned image reasoning generative models in 11B and 90B sizes.
3.3M Pulls 9 Tags Updated 7 months ago
A strong Mixture-of-Experts (MoE) language model with 671B total parameters, of which 37B are activated for each token.
3M Pulls 5 Tags Updated 11 months ago
BGE-M3 is a new model from BAAI distinguished for its versatility in Multi-Functionality, Multi-Linguality, and Multi-Granularity.
2.9M Pulls 3 Tags Updated 1 year ago
CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.
1.7M Pulls 85 Tags Updated 1 year ago
An open-source Mixture-of-Experts code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks.
1.3M Pulls 64 Tags Updated 1 year ago
Flagship vision-language model of Qwen and also a significant leap from the previous Qwen2-VL.
1.1M Pulls 17 Tags Updated 7 months ago
A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.
561.4K Pulls 5 Tags Updated 9 months ago
moondream2 is a small vision language model designed to run efficiently on edge devices.
454.8K Pulls 18 Tags Updated 1 year ago
Hermes 3 is the latest version of the flagship Hermes series of LLMs by Nous Research.
370.7K Pulls 65 Tags Updated 1 year ago
General-use chat model based on Llama and Llama 2 with 2K to 16K context sizes.
230.4K Pulls 111 Tags Updated 2 years ago
A strong, economical, and efficient Mixture-of-Experts language model.
227.2K Pulls 34 Tags Updated 1 year ago
Orca 2 is built by Microsoft Research and is a fine-tuned version of Meta's Llama 2 models. The model is designed to excel particularly in reasoning.
102.6K Pulls 33 Tags Updated 2 years ago
DeepSeek-OCR is a vision-language model that can perform token-efficient OCR.
65.5K Pulls 3 Tags Updated 4 weeks ago
Wizard Vicuna is a 13B parameter model based on Llama 2 trained by MelodysDreamj.
64.3K Pulls 17 Tags Updated 2 years ago
A new state-of-the-art version of the lightweight Command R7B model that excels in advanced Arabic language capabilities for enterprises in the Middle East and Northern Africa.
42.5K Pulls 5 Tags Updated 9 months ago
The Cogito v2.1 LLMs are instruction-tuned generative models. All models are released under the MIT license for commercial use.
18.7K Pulls 6 Tags Updated 3 weeks ago