pielee/qwen3-4b-thinking-2507_q8:latest
409 Downloads · Updated 3 months ago
Qwen3-4B-Thinking-2507_q8: A 4-billion-parameter inference model with 8-bit quantization, optimized for efficient reasoning in resource-constrained environments.
tools
thinking
385fd1a5edde · 4.3GB
model: arch qwen3 · parameters 4.02B · quantization Q8_0 · 4.3GB
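The listed size can be sanity-checked from the parameter count and the quantization type. The sketch below assumes every weight is stored in Q8_0 blocks (32 8-bit values plus one 16-bit scale per block, so about 8.5 bits per weight) and that "GB" means 10^9 bytes. Real model files usually keep a few tensors at higher precision and carry metadata, so treat the result as an approximation rather than an exact figure.

```python
# Rough size estimate for a Q8_0-quantized model.
# Assumption: all weights use Q8_0 blocks; actual files may mix precisions
# and include metadata, so this only approximates the on-disk size.

BLOCK_WEIGHTS = 32        # weights per Q8_0 block
BLOCK_BYTES = 32 + 2      # 32 int8 values + one 16-bit scale


def q8_0_size_gb(num_params: float) -> float:
    """Approximate size in GB (10^9 bytes) under the assumption above."""
    bytes_per_weight = BLOCK_BYTES / BLOCK_WEIGHTS  # 1.0625 bytes = 8.5 bits
    return num_params * bytes_per_weight / 1e9


estimate_gb = q8_0_size_gb(4.02e9)  # parameter count from the model listing
print(f"{estimate_gb:.2f} GB")
```

Under these assumptions the estimate comes to roughly 4.27 GB, which is consistent with the 4.3GB shown for this model.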
template (1.5kB)
{{- $lastUserIdx := -1 -}} {{- range $idx, $msg := .Messages -}} {{- if eq $msg.Role "user" }}{{ $la
system (69B)
You are Qwen3, created by Alibaba Cloud. You are a helpful assistant.
params (120B)
{ "repeat_penalty": 1, "stop": [ "<|im_start|>", "<|im_end|>" ], "te
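As an illustration of how these parameters map onto a request, here is a minimal sketch that builds a chat payload for a local Ollama server. It assumes the server listens on the default address `http://localhost:11434` and that the model has already been pulled. Only the two parameters visible in the listing (`repeat_penalty` and `stop`) are set; the rest of the params block is truncated on the page and is deliberately left out. The helper names `build_chat_request` and `send_chat` are hypothetical.

```python
import json
import urllib.request

MODEL = "pielee/qwen3-4b-thinking-2507_q8:latest"
# Assumption: a local Ollama server on its default port.
OLLAMA_CHAT_URL = "http://localhost:11434/api/chat"


def build_chat_request(prompt: str) -> dict:
    """Build a non-streaming chat payload with the visible params."""
    return {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,
        "options": {
            # Values copied from the visible part of the params block.
            "repeat_penalty": 1,
            "stop": ["<|im_start|>", "<|im_end|>"],
        },
    }


def send_chat(prompt: str) -> dict:
    """POST the payload and return the decoded JSON response."""
    body = json.dumps(build_chat_request(prompt)).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_CHAT_URL,
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

With a server running, `send_chat("Why is the sky blue?")["message"]["content"]` should hold the assistant's reply. Since these values are already the model's stored defaults, passing them explicitly is redundant and is shown only to make the mapping visible.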
Readme
No readme