35 Downloads Updated 2 weeks ago
ollama run batiai/nemotron3-nano-omni-text:iq4
Reasoning-tuned text backbone extracted from NVIDIA’s Nemotron 3 Nano Omni multimodal model. NemotronH MoE 30B-A3B (Mamba+Attention hybrid).
Vision and audio encoders are stripped from this GGUF (llama.cpp doesn’t yet support the Omni multimodal architecture). Watching for upstream support — multimodal release coming when ready.
| Tag | Size | RAM target | Use Case |
|---|---|---|---|
| iq3 | 17GB | 24GB Mac | Compact reasoning |
| iq4 | 17GB | 32GB Mac | Recommended |
| q5 | 25GB | 36GB+ Mac | Highest quality |
ollama run batiai/nemotron3-nano-omni-text:iq4
Same NemotronH MoE 30B-A3B backbone, but Omni is reasoning-focused:
| nemotron3-nano | nemotron3-nano-omni-text | |
|---|---|---|
| Tuning | General agentic | Reasoning-focused |
| Step-by-step | Standard | Stronger |
| Tool calling | ✅ | ✅ |
| Your Mac RAM | iq3 (17GB) | iq4 (17GB) | q5 (25GB) |
|---|---|---|---|
| 16GB | ⚠️ Heavy swap | ⚠️ Heavy swap | ❌ |
| 24GB | ✅ | ✅ | ❌ |
| 32GB | ✅ Fast | ✅ Recommended | ⚠️ Tight |
| 36GB | ✅ | ✅ | ✅ |
| 48GB+ | ✅ | ✅ | ✅ Headroom |
| Your Mac | Recommended |
|---|---|
| 16GB | batiai/gemma4-e4b:q4 |
| 24GB | batiai/nemotron3-nano-omni-text:iq3 (this) |
| 32GB | batiai/nemotron3-nano-omni-text:iq4 (this, recommended) |
| 48GB | batiai/nemotron3-nano-omni-text:q5 |
| 128GB | batiai/minimax-m2.7:iq3 (229B Dense frontier) |
general.author=BatiAI)Free, on-device AI automation for Mac. 5MB app, 100% local, unlimited.