Advanced agentic, reasoning and coding capabilities.
9,237 Pulls 1 Tag Updated 1 week ago
A strong multi-lingual general language model with competitive performance to Llama 3.
161.4K Pulls 32 Tags Updated 1 year ago
Gemma is a family of lightweight, state-of-the-art open models built by Google DeepMind. Updated to version 1.1
5.4M Pulls 102 Tags Updated 1 year ago
The IBM Granite 1B and 3B models are long-context mixture of experts (MoE) Granite models from IBM designed for low latency usage.
973.6K Pulls 33 Tags Updated 9 months ago
IBM Granite 2B and 8B models are 128K context length language models that have been fine-tuned for improved reasoning and instruction-following capabilities.
650.1K Pulls 3 Tags Updated 6 months ago
A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.
402.6K Pulls 5 Tags Updated 8 months ago
Granite-3.2 is a family of long-context AI models from IBM Granite fine-tuned for thinking capabilities.
160.8K Pulls 9 Tags Updated 8 months ago
A language model created by combining two fine-tuned Llama 2 70B models into one.
35.1K Pulls 16 Tags Updated 1 year ago
The IBM Granite 2B and 8B models are text-only dense LLMs trained on over 12 trillion tokens of data, demonstrated significant improvements over their predecessors in performance and speed in IBM’s initial testing.
123.5K Pulls 33 Tags Updated 9 months ago
The current, most capable model that runs on a single GPU.
22.5M Pulls 26 Tags Updated 2 months ago
Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.
11.2M Pulls 58 Tags Updated 2 weeks ago
Google Gemma 2 is a high-performing and efficient model available in three sizes: 2B, 9B, and 27B.
8.5M Pulls 94 Tags Updated 1 year ago
A family of open foundation models by IBM for Code Intelligence
288.5K Pulls 162 Tags Updated 1 year ago
The IBM Granite Embedding 30M and 278M models models are text-only dense biencoder embedding models, with 30M available in English only and 278M serving multilingual use cases.
107.8K Pulls 6 Tags Updated 10 months ago
The IBM Granite 1B and 3B models are the first mixture of experts (MoE) Granite models from IBM designed for low latency usage.
89.2K Pulls 33 Tags Updated 11 months ago
Granite 4 features improved instruction following (IF) and tool-calling capabilities, making them more effective in enterprise applications.
57.2K Pulls 5 Tags Updated 3 weeks ago
890 Pulls 1 Tag Updated 1 year ago
AI agent specialized in generating and adapting SQLAlchemy code using Python. Supports ORM models, async queries, and integration with FastAPI and relational databases.
40 Pulls 1 Tag Updated 4 months ago
Formulários HTML completos com base em schemas Pydantic ou modelos SQLAlchemy.
8 Pulls 1 Tag Updated 4 months ago
LongWriter-glm4-9b is trained based on glm-4-9b, and is capable of generating 10,000+ words at once.
2,636 Pulls 1 Tag Updated 1 year ago