54 Downloads Updated 1 week ago
Name
16 models
unsloth_granite-4.0-h-350m-GGUF:latest
223MB · 1M context window · Text · 1 week ago
unsloth_granite-4.0-h-350m-GGUF:Q3_K_XL
191MB · 1M context window · Text · 1 week ago
unsloth_granite-4.0-h-350m-GGUF:Q4_K_XL
225MB · 1M context window · Text · 1 week ago
unsloth_granite-4.0-h-350m-GGUF:Q5_K_XL
253MB · 1M context window · Text · 1 week ago
unsloth_granite-4.0-h-350m-GGUF:Q6_K_XL
311MB · 1M context window · Text · 1 week ago
unsloth_granite-4.0-h-350m-GGUF:Q8_K_XL
461MB · 1M context window · Text · 1 week ago
unsloth_granite-4.0-h-350m-GGUF:Q3_K_M
189MB · 1M context window · Text · 1 week ago
unsloth_granite-4.0-h-350m-GGUF:Q4_K_M
223MB · 1M context window · Text · 1 week ago
unsloth_granite-4.0-h-350m-GGUF:Q5_K_M
252MB · 1M context window · Text · 1 week ago
unsloth_granite-4.0-h-350m-GGUF:Q6_K
284MB · 1M context window · Text · 1 week ago
unsloth_granite-4.0-h-350m-GGUF:IQ3_XXS
160MB · 1M context window · Text · 1 week ago
unsloth_granite-4.0-h-350m-GGUF:IQ4_XS
208MB · 1M context window · Text · 1 week ago
unsloth_granite-4.0-h-350m-GGUF:IQ4_NL
216MB · 1M context window · Text · 1 week ago
unsloth_granite-4.0-h-350m-GGUF:BF16
685MB · 1M context window · Text · 1 week ago
unsloth_granite-4.0-h-350m-GGUF:Base-Q6_K
284MB · 1M context window · Text · 1 week ago
unsloth_granite-4.0-h-350m-GGUF:Base-Q8_0
366MB · 1M context window · Text · 1 week ago
This Modelfile checkpoint documents the configuration and lineage of the model.
It is intended for reproducibility, sharing, and reference across deployments.
Granite-4.0-H-350M is a lightweight instruct model finetuned from Granite-4.0-H-350M-Base using a combination of open-source instruction datasets with permissive license and internally collected synthetic datasets. This model is developed using a diverse set of techniques including supervised finetuning, reinforcement learning, and model merging.
English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, and Chinese. Users may fine-tune Granite 4.0 Nano models to support languages beyond those included in this list.
Granite 4.0 Nano instruct models feature strong instruction following capabilities bringing advanced AI capabilities within reach for on-device deployments and research use cases. Additionally, their compact size makes them well-suited for fine-tuning on specialized domains without requiring massive compute resources.
Granite-4.0-H-350M baseline is based on a decoder-only dense transformer architecture. Core components of this architecture are: GQA, Mamba2, MLP with SwiGLU, RMSNorm, and shared input/output embeddings.
Granite 4.0 Nano Instruct Models are primarily finetuned using instruction-response pairs mostly in English, but also multilingual data covering multiple languages. Although this model can handle multilingual dialog use cases, its performance might not be similar to English tasks. In such case, introducing a small number of examples (few-shot) can help the model in generating more accurate outputs. While this model has been aligned by keeping safety in consideration, the model may in some cases produce inaccurate, biased, or unsafe responses to user prompts. So we urge the community to use this model with proper safety testing and tuning tailored for their specific tasks.
Developers: Granite Team, IBM
Function Calling: IBM Docs
GitHub Repository: Granite 4.0 Nano
Website: Granite Docs
Research: Granite Research
Release Date: October 28, 2025
License: Apache 2.0