GLM-5.1 is our next-generation flagship model for agentic engineering, with significantly stronger coding capabilities than its predecessor. It achieves state-of-the-art performance on SWE-Bench Pro and leads GLM-5 by a wide margin.
2M Pulls 1 Tag Updated 1 month ago
A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.
2M Pulls 1 Tag Updated 3 months ago
Advancing the Coding Capability
1.9M Pulls 1 Tag Updated 4 months ago
As the strongest model in the 30B class, GLM-4.7-Flash offers a new option for lightweight deployment that balances performance and efficiency.
1.2M Pulls 4 Tags Updated 3 months ago
GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture.
672.1K Pulls 3 Tags Updated 3 months ago
Advanced agentic, reasoning and coding capabilities.
1.9M Pulls 1 Tag Updated 7 months ago
A strong multi-lingual general language model with competitive performance to Llama 3.
1.1M Pulls 32 Tags Updated 1 year ago
Gemma 4 models are designed to deliver frontier-level performance at each size. They are well-suited for reasoning, agentic workflows, coding, and multimodal understanding.
9.5M Pulls 30 Tags Updated 2 weeks ago
Gemma is a family of lightweight, state-of-the-art open models built by Google DeepMind. Updated to version 1.1
7.1M Pulls 102 Tags Updated 2 years ago
The IBM Granite 1B and 3B models are long-context mixture of experts (MoE) Granite models from IBM designed for low latency usage.
2.9M Pulls 33 Tags Updated 1 year ago
IBM Granite 2B and 8B models are 128K context length language models that have been fine-tuned for improved reasoning and instruction-following capabilities.
1M Pulls 3 Tags Updated 1 year ago
A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.
912.9K Pulls 5 Tags Updated 1 year ago
The IBM Granite 2B and 8B models are text-only dense LLMs trained on over 12 trillion tokens of data, demonstrated significant improvements over their predecessors in performance and speed in IBM’s initial testing.
962.7K Pulls 33 Tags Updated 1 year ago
Granite-3.2 is a family of long-context AI models from IBM Granite fine-tuned for thinking capabilities.
434.2K Pulls 9 Tags Updated 1 year ago
A language model created by combining two fine-tuned Llama 2 70B models into one.
458.4K Pulls 16 Tags Updated 2 years ago
Monolithic AGI Special Operations Model. Powered by Qwen 3.6 Plus. Developed by Niko Software under CEO Berkay." (Veya Türkçe istersen: "Niko Software tarafından CEO Berkay liderliğinde geliştirilmiş, Qwen 3.6 Plus tabanlı monolitik AGI modeli.
6 Pulls 1 Tag Updated 4 days ago
IBM Granite Models are a family of enterprise-ready, open foundation models that support multilingual capabilities, coding, retrieval-augmented generation (RAG), tool use, and structured JSON output. Released under Apache 2.0 license.
93.2K Pulls 48 Tags Updated yesterday
Granite 4 features improved instruction following (IF) and tool-calling capabilities, making them more effective in enterprise applications.
1.2M Pulls 17 Tags Updated 6 months ago
Model to analyze log files and help troubleshoot errors based on ministral-3:3b model
45 Pulls 1 Tag Updated 2 months ago
The current, most capable model that runs on a single GPU.
36.9M Pulls 29 Tags Updated 5 months ago