My (Gökdeniz Gülmez) first reasoning model, fine-tuned on a custom distillation dataset.

tools thinking