247 1 year ago

A commercial-friendly small language model by NVIDIA optimized for roleplay, RAG QA, and function calling.

tools
ollama run tripolskypetr/nemotron-mini

Applications

Claude Code
Claude Code ollama launch claude --model tripolskypetr/nemotron-mini
Codex
Codex ollama launch codex --model tripolskypetr/nemotron-mini
OpenCode
OpenCode ollama launch opencode --model tripolskypetr/nemotron-mini
OpenClaw
OpenClaw ollama launch openclaw --model tripolskypetr/nemotron-mini

Models

View all →

Readme

Nemotron-Mini-4B-Instruct

Nemotron-Mini Logo

Nemotron-Mini-4B-Instruct is a model for generating responses for roleplaying, retrieval augmented generation, and function calling. It is a small language model (SLM) optimized through distillation, pruning and quantization for speed and on-device deployment.

This instruct model is optimized for roleplay, RAG QA, and function calling in English. It supports a context length of 4,096 tokens. This model is ready for commercial use.

References

Blog

HuggingFace