tomng/nanbeige4.1

tomng/ nanbeige4.1

3,660 Downloads Updated 2 weeks ago

Nanbeige4.1-3B illustrates that compact models can simultaneously achieve robust reasoning, preference alignment, and effective agentic behaviors

tools thinking 3b

ollama run tomng/nanbeige4.1

curl http://localhost:11434/api/chat \
  -d '{
    "model": "tomng/nanbeige4.1",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='tomng/nanbeige4.1',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'tomng/nanbeige4.1',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Applications

Claude Code

Claude Code ollama launch claude --model tomng/nanbeige4.1

Codex

Codex ollama launch codex --model tomng/nanbeige4.1

OpenCode

OpenCode ollama launch opencode --model tomng/nanbeige4.1

OpenClaw

OpenClaw ollama launch openclaw --model tomng/nanbeige4.1

Models

Name

6 models

Size

Context

Input

nanbeige4.1:latest

4.2GB · 256K context window · Text · 2 weeks ago

nanbeige4.1:latest

4.2GB

256K

Text

nanbeige4.1:3b

4.2GB · 256K context window · Text · 2 weeks ago

nanbeige4.1:3b latest

4.2GB

256K

Text

Readme

From mradermacher/Nanbeige4.1-3B-GGUF

Nanbeige4.1-3B is built upon Nanbeige4-3B-Base and represents an enhanced iteration of our previous reasoning model, Nanbeige4-3B-Thinking-2511, achieved through further post-training optimization with supervised fine-tuning (SFT) and reinforcement learning (RL). As a highly competitive open-source model at a small parameter scale, Nanbeige4.1-3B illustrates that compact models can simultaneously achieve robust reasoning, preference alignment, and effective agentic behaviors.

Specifically, Nanbeige4.1-3B exhibits the following key strengths:

Strong Reasoning: Nanbeige4.1-3B is capable of solving complex, multi-step problems through sustained and coherent reasoning within a single forward pass, and reliably produces correct final answers on challenging tasks such as LiveCodeBench-Pro, IMO-Answer-Bench, and AIME 2026 I.
Robust Preference Alignment: Nanbeige4.1-3B achieves solid alignment performance, outperforming not only same-scale models such as Qwen3-4B-2507 and Nanbeige4-3B-2511, but also substantially larger models including Qwen3-30B-A3B and Qwen3-32B on Arena-Hard-v2 and Multi-Challenge.
Agentic Capability: Nanbeige4.1-3B is the first general small model to natively support deep-search tasks and reliably sustain complex problem solving involving more than 500 rounds of tool invocations. It fills a long-standing gap in the small-model ecosystem where models are typically optimized for either general reasoning or agentic scenarios, but rarely excel at both.

Technical Report: Link

![Nanbeige Logo](https://huggingface.co/Nanbeige/Nanbeige4.1-3B/resolve/b32903a1ac96cf16ef3f63f03d3eac96c0218850/figures/nbg.png)

> From [mradermacher/Nanbeige4.1-3B-GGUF](https://huggingface.co/mradermacher/Nanbeige4.1-3B-GGUF)

Nanbeige4.1-3B is built upon Nanbeige4-3B-Base and represents an enhanced iteration of our previous reasoning model, Nanbeige4-3B-Thinking-2511, achieved through further post-training optimization with supervised fine-tuning (SFT) and reinforcement learning (RL). As a highly competitive open-source model at a small parameter scale, Nanbeige4.1-3B illustrates that compact models can simultaneously achieve robust **reasoning**, **preference alignment**, and **effective agentic behaviors**.

![](https://huggingface.co/Nanbeige/Nanbeige4.1-3B/resolve/b32903a1ac96cf16ef3f63f03d3eac96c0218850/figures/model_performance_comparison.png)

Specifically, Nanbeige4.1-3B exhibits the following key strengths:

* **Strong Reasoning:** Nanbeige4.1-3B is capable of solving complex, multi-step problems through sustained and coherent reasoning within a single forward pass, and reliably produces correct final answers on challenging tasks such as LiveCodeBench-Pro, IMO-Answer-Bench, and AIME 2026 I.
* **Robust Preference Alignment:** Nanbeige4.1-3B achieves solid alignment performance, outperforming not only same-scale models such as Qwen3-4B-2507 and Nanbeige4-3B-2511, but also substantially larger models including Qwen3-30B-A3B and Qwen3-32B on Arena-Hard-v2 and Multi-Challenge.
* **Agentic Capability:** Nanbeige4.1-3B is the first general small model to natively support deep-search tasks and reliably sustain complex problem solving involving more than 500 rounds of tool invocations. It fills a long-standing gap in the small-model ecosystem where models are typically optimized for either general reasoning or agentic scenarios, but rarely excel at both.
  
> **Technical Report:** [Link](https://huggingface.co/Nanbeige/Nanbeige4.1-3B/blob/b32903a1ac96cf16ef3f63f03d3eac96c0218850/Nanbeige4.1-3B-Report.pdf)

Paste, drop or click to upload images (.png, .jpeg, .jpg, .svg, .gif)