As the strongest model in the 30B class, GLM-4.7-Flash offers a new option for lightweight deployment that balances performance and efficiency.
1M Pulls 4 Tags Updated 2 months ago
A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.
155.3K Pulls 1 Tag Updated 1 month ago
GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture.
177.9K Pulls 3 Tags Updated 1 month ago
Advancing the Coding Capability
81.8K Pulls 1 Tag Updated 3 months ago
Advanced agentic, reasoning and coding capabilities.
98.8K Pulls 1 Tag Updated 5 months ago
A strong multi-lingual general language model with competitive performance to Llama 3.
878.1K Pulls 32 Tags Updated 1 year ago
Gemma 4 models are designed to deliver frontier-level performance at each size. They are well-suited for reasoning, agentic workflows, coding, and multimodal understanding.
187.8K Pulls 16 Tags Updated yesterday
IBM Granite 2B and 8B models are 128K context length language models that have been fine-tuned for improved reasoning and instruction-following capabilities.
970.8K Pulls 3 Tags Updated 11 months ago
Gemma is a family of lightweight, state-of-the-art open models built by Google DeepMind. Updated to version 1.1
6.6M Pulls 102 Tags Updated 1 year ago
The IBM Granite 1B and 3B models are long-context mixture of experts (MoE) Granite models from IBM designed for low latency usage.
2.7M Pulls 33 Tags Updated 1 year ago
A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.
860.2K Pulls 5 Tags Updated 1 year ago
The IBM Granite 2B and 8B models are text-only dense LLMs trained on over 12 trillion tokens of data, demonstrated significant improvements over their predecessors in performance and speed in IBM’s initial testing.
776.7K Pulls 33 Tags Updated 1 year ago
Granite-3.2 is a family of long-context AI models from IBM Granite fine-tuned for thinking capabilities.
383.8K Pulls 9 Tags Updated 1 year ago
A language model created by combining two fine-tuned Llama 2 70B models into one.
377.3K Pulls 16 Tags Updated 2 years ago
Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.
25.7M Pulls 58 Tags Updated 5 months ago
Granite 4 features improved instruction following (IF) and tool-calling capabilities, making them more effective in enterprise applications.
1M Pulls 17 Tags Updated 5 months ago
Model to analyze log files and help troubleshoot errors based on ministral-3:3b model
32 Pulls 1 Tag Updated 1 month ago
The current, most capable model that runs on a single GPU.
34.9M Pulls 29 Tags Updated 3 months ago
Google Gemma 2 is a high-performing and efficient model available in three sizes: 2B, 9B, and 27B.
19.7M Pulls 94 Tags Updated 1 year ago
CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.
2.7M Pulls 85 Tags Updated 1 year ago