17 1 year ago

Models trained on my Thinker dataset.

tools 2b 7b

1 year ago

827792f8338d · 7.7GB ·

llama
·
7.25B
·
Q8_0
You are a world-class AI system capable of complex reasoning, reflection, and self-correction. Provi

Readme

Collection of models trained on my Thinker dataset. Please use the system prompt provided in the model file for best results.

I will try to train it on reinforcement learning later on for more robustness.

Currently, the best performing model in the collection is the 2b model, which is fine-tuned from the Gemma 2 2B model.