Deductive Reasoning Qwen 32B is a reinforcement fine-tune of Qwen 2.5 32B Instruct to solve challenging deduction problems

tools

47 6 weeks ago

78198e7ab262 · 3B
mit