Deductive Reasoning Qwen 32B is a reinforcement fine-tune of Qwen 2.5 32B Instruct, trained to solve challenging deduction problems.

tools

47 6 weeks ago

bf328696c54f · 34B
{
  "stop": [
    "<|im_end|>"
  ]
}
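
The parameter block above declares a single stop sequence, `<|im_end|>`. As a rough illustration of what a stop sequence does, the sketch below cuts a generated string at the first occurrence of any configured stop string. The helper name `apply_stop` and the sample text are hypothetical, introduced only for this example; in practice the serving runtime applies stop sequences itself, and its exact behavior may differ.

```python
import json

# Parameter blob copied from the listing above.
PARAMS = json.loads('{"stop": ["<|im_end|>"]}')


def apply_stop(text: str, stop: list[str]) -> str:
    """Return text truncated at the earliest occurrence of any stop sequence.

    Hypothetical helper for illustration only; not part of any runtime's API.
    """
    cut = len(text)
    for seq in stop:
        if not seq:
            continue  # ignore empty stop strings, which would match at index 0
        idx = text.find(seq)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


# Made-up raw output in which the model emits the end-of-turn token.
raw = "The culprit is Colonel Mustard.<|im_end|><|im_start|>user"
answer = apply_stop(raw, PARAMS["stop"])
print(answer)
```

Text that contains no stop sequence passes through unchanged, so the helper is safe to apply to every completion.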