35 3 weeks ago

My (Gökdeniz Gülmez) first reasoning model fine-tuned on a custom distill dataset

tools thinking