A small language model that simulates a QuietStar like behaviour in a simplified agentic way.

3B

4 Pulls Updated 8 weeks ago

Readme

ThoughtStream-4B-v0.1

This model is based on h2oai/h2o-danube3-4b-base and fine-tuned using LoRA+ and BAdam with LLama-Factory. It uses the ChatML template, without a system message, and was trained on the ThoughtfulAssistant-v01 dataset.

The idea is to abstract the thoughts away or into a thought bubble when chatting.

HF repo: trollek/ThoughtStream-4B-v0.1
Quants: mradermacher/ThoughtStream-4B-v0.1-GGUF