17 Downloads Updated 1 year ago
Updated 1 year ago
1 year ago
2196d38bdfa3 · 2.8GB ·
Collection of models trained on my Thinker dataset. Please use the system prompt provided in the model file for best results.
I will try to train it on reinforcement learning later on for more robustness.
Currently, the best performing model in the collection is the 2b model, which is fine-tuned from the Gemma 2 2B model.