17 10 months ago

Models trained on my Thinker dataset.

tools 2b 7b

2196d38bdfa3 · 2.8GB

gemma2 · 2.61B · Q8_0
{{- $system := "" }} {{- range .Messages }} {{- if eq .Role "system" }} {{- if not $system }}{{ $sys
You are a world-class AI system. Always respond in strict JSON format with a reasoning_steps array a
{ "stop": [ "<start_of_turn>", "<end_of_turn>" ] }

Readme

Collection of models trained on my Thinker dataset. Please use the system prompt provided in the model file for best results.
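As a rough illustration of working with the JSON-formatted replies, the sketch below builds a request body for Ollama's `/api/chat` endpoint and pulls the `reasoning_steps` array out of a reply. It is a minimal sketch under stated assumptions: `MODEL_NAME` is a hypothetical placeholder (this page does not show the model's tag), the helper names are made up for illustration, and only the `reasoning_steps` key is read because the rest of the system prompt is truncated here. Leaving `system_prompt` as `None` is meant to let the prompt stored in the model file apply, which is the setup recommended above.

```python
import json

# Hypothetical placeholder: replace with the actual model tag you pulled.
MODEL_NAME = "your-namespace/thinker:2b"


def build_chat_request(user_message, system_prompt=None):
    """Build a JSON-serialisable body for Ollama's /api/chat endpoint.

    When system_prompt is None, no system message is sent, on the
    assumption that the system prompt stored in the model file is used.
    """
    messages = []
    if system_prompt is not None:
        messages.append({"role": "system", "content": system_prompt})
    messages.append({"role": "user", "content": user_message})
    return {
        "model": MODEL_NAME,
        "messages": messages,
        "stream": False,   # ask for a single response object
        "format": "json",  # ask Ollama to constrain output to valid JSON
    }


def extract_reasoning_steps(reply_text):
    """Parse the model's reply and return its reasoning_steps list.

    Raises ValueError if the reply is not a JSON object or has no
    reasoning_steps array (json.JSONDecodeError is a ValueError subclass).
    """
    data = json.loads(reply_text)
    if not isinstance(data, dict):
        raise ValueError("reply is not a JSON object")
    steps = data.get("reasoning_steps")
    if not isinstance(steps, list):
        raise ValueError("reply has no reasoning_steps array")
    return steps
```

The body returned by `build_chat_request` can be POSTed to a local Ollama server at `/api/chat`; the reply text to pass to `extract_reasoning_steps` is the `message.content` field of the response. Validating the reply before use is a deliberate choice, since a small model may occasionally return malformed JSON.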

I plan to further train these models with reinforcement learning later on to make them more robust.

Currently, the best-performing model in the collection is the 2b model, which is fine-tuned from Gemma 2 2B.