3,660 Downloads Updated 2 weeks ago
ollama run tomng/nanbeige4.1
ollama launch claude --model tomng/nanbeige4.1
ollama launch codex --model tomng/nanbeige4.1
ollama launch opencode --model tomng/nanbeige4.1
ollama launch openclaw --model tomng/nanbeige4.1

Nanbeige4.1-3B is built upon Nanbeige4-3B-Base and represents an enhanced iteration of our previous reasoning model, Nanbeige4-3B-Thinking-2511, achieved through further post-training optimization with supervised fine-tuning (SFT) and reinforcement learning (RL). As a highly competitive open-source model at a small parameter scale, Nanbeige4.1-3B illustrates that compact models can simultaneously achieve robust reasoning, preference alignment, and effective agentic behaviors.

Specifically, Nanbeige4.1-3B exhibits the following key strengths:
Technical Report: Link