Deductive Reasoning Qwen 32B is a reinforcement fine-tune of Qwen 2.5 32B Instruct to solve challenging deduction problems
tools
47 Pulls Updated 6 weeks ago
bf328696c54f · 34B
{
"stop": [
"<|im_end|>"
]
}