17 Downloads Updated 10 months ago
Updated 10 months ago
10 months ago
2196d38bdfa3 · 2.8GB
Collection of models trained on my Thinker dataset. Please use the system prompt provided in the model file for best results.
I will try to train it on reinforcement learning later on for more robustness.
Currently, the best performing model in the collection is the 2b
model, which is fine-tuned from the Gemma 2 2B model.