Deductive Reasoning Qwen 32B is a reinforcement fine-tune of Qwen 2.5 32B Instruct to solve challenging deduction problems
tools
47 Pulls Updated 6 weeks ago
78198e7ab262 · 3B
mit
47 Pulls Updated 6 weeks ago