20 Downloads Updated 1 year ago
This model is based on h2oai/h2o-danube3-4b-base and fine-tuned using LoRA+ and BAdam with LLama-Factory. It uses the ChatML template, without a system message, and was trained on the ThoughtfulAssistant-v01 dataset.
The idea is to abstract the thoughts away or into a thought bubble when chatting.
HF repo: trollek/ThoughtStream-4B-v0.1 Quants: mradermacher/ThoughtStream-4B-v0.1-GGUF