17 Downloads Updated 10 months ago
Collection of models trained on my Thinker dataset. Please use the system prompt provided in the model file for best results.
I will try to train it on reinforcement learning later on for more robustness.
Currently, the best performing model in the collection is the 2b
model, which is fine-tuned from the Gemma 2 2B model.