Models
Docs
Pricing
Sign in
Download
Models
Download
Docs
Pricing
Sign in
mychen76
/
qwen2.5-3b-think-r1
257
Downloads
Updated
1 year ago
A regular model convert into Reasoning/Think Model fine-tuned using DeepSeek GRPO algorithm without using distilled data from R1.
A regular model convert into Reasoning/Think Model fine-tuned using DeepSeek GRPO algorithm without using distilled data from R1.
Cancel
tools
Name
2 models
Size
Context
Input
qwen2.5-3b-think-r1:latest
0937053a5fde
• 3.3GB • 32K context window •
Text input • 1 year ago
Text input • 1 year ago
qwen2.5-3b-think-r1:latest
3.3GB
32K
Text
0937053a5fde
· 1 year ago
qwen2.5-3b-think-r1:q8
latest
0937053a5fde
• 3.3GB • 32K context window •
Text input • 1 year ago
Text input • 1 year ago
qwen2.5-3b-think-r1:q8
latest
3.3GB
32K
Text
0937053a5fde
· 1 year ago