141 3 months ago

A 4B-parameter, 8-bit quantized inference model with /think (reasoning) and /no_think (fast response) modes.
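The two modes are selected by appending a switch to the user message. The sketch below shows one way to do this against a locally running Ollama server's `/api/chat` endpoint. It makes several assumptions: the model tag `qwen3-4b-q8` is a hypothetical placeholder (this page does not state the tag), the server listens on Ollama's default port, and `build_chat_payload` and `send_chat` are illustrative helper names, not part of any library.

```python
import json
import urllib.request

# Assumption: Ollama's default local endpoint; adjust if your server differs.
OLLAMA_CHAT_URL = "http://localhost:11434/api/chat"
# Hypothetical tag; substitute the name under which you pulled this model.
MODEL_NAME = "qwen3-4b-q8"


def build_chat_payload(prompt: str, thinking: bool, model: str = MODEL_NAME) -> dict:
    """Build a non-streaming chat request with the mode switch appended to the prompt."""
    switch = "/think" if thinking else "/no_think"
    return {
        "model": model,
        "messages": [{"role": "user", "content": f"{prompt} {switch}"}],
        "stream": False,
    }


def send_chat(payload: dict, url: str = OLLAMA_CHAT_URL) -> str:
    """POST the payload to a running Ollama server and return the reply text."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return body["message"]["content"]
```

With a server running, `send_chat(build_chat_payload("Summarize ChatML in one line.", thinking=False))` would request the fast-response mode, and passing `thinking=True` would request the reasoning mode instead.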

tools thinking


460978f23904 · 4.3GB ·

qwen3 · 4.02B · Q8_0
{{- $lastUserIdx := -1 -}} {{- range $idx, $msg := .Messages -}} {{- if eq $msg.Role "user" }}{{ $la
You are Qwen3, trained and released by Alibaba. You are happy to help users solve their problems.
{ "repeat_penalty": 1, "stop": [ "<|im_start|>", "<|im_end|>" ], "te

Readme

No readme