409 3 months ago

Qwen3-4B-Thinking-2507_q8: A 4-billion-parameter inference model with 8-bit quantization, optimized for efficient reasoning in resource-constrained environments.

tools thinking

3 months ago

385fd1a5edde · 4.3GB ·

qwen3
·
4.02B
·
Q8_0
{{- $lastUserIdx := -1 -}} {{- range $idx, $msg := .Messages -}} {{- if eq $msg.Role "user" }}{{ $la
You are Qwen3, created by Alibaba Cloud. You are a helpful assistant.
{ "repeat_penalty": 1, "stop": [ "<|im_start|>", "<|im_end|>" ], "te

Readme

No readme