Models trained on my Thinker dataset.

tools 2b 7b

7 3 weeks ago

Readme

Collection of models trained on my Thinker dataset. Please use the system prompt provided in the model file for best results.

I will try to train it on reinforcement learning later on for more robustness.

Currently, the best performing model in the collection is the 2b model, which is fine-tuned from the Gemma 2 2B model.