trollek/thoughtstream:4b-v01-q6_K

20 Downloads · Updated 1 year ago

A small language model that simulates Quiet-STaR-like behaviour in a simplified, agentic way.

1b7145223393 · 3.3GB · llama · 3.96B · Q6_K
<|im_start|>user {{ .Prompt }}<|im_end|> <|im_start|>assistant
{ "num_ctx": 8192, "num_predict": 2048, "stop": [ "<|im_end|>", "<|im_st

Readme

ThoughtStream-4B-v0.1

This model is based on h2oai/h2o-danube3-4b-base and was fine-tuned with LoRA+ and BAdam using LLaMA-Factory. It uses the ChatML template without a system message and was trained on the ThoughtfulAssistant-v01 dataset.
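As a rough illustration of the prompt format described above, here is a minimal sketch that renders a conversation as ChatML with no system message. The function name `build_chatml_prompt` is hypothetical, and the newline placement follows the usual ChatML convention, which is an assumption: the template shown on this page is flattened and does not show line breaks.

```python
def build_chatml_prompt(turns):
    """Render (role, text) pairs as ChatML and leave an open assistant turn.

    Assumption: a newline follows the role name and each <|im_end|> marker,
    as in the common ChatML layout.
    """
    parts = []
    for role, text in turns:
        # The model uses no system message, so only these two roles are accepted.
        if role not in ("user", "assistant"):
            raise ValueError(f"unsupported role: {role!r}")
        parts.append(f"<|im_start|>{role}\n{text}<|im_end|>\n")
    # Open the assistant turn so the model continues from here.
    parts.append("<|im_start|>assistant\n")
    return "".join(parts)


if __name__ == "__main__":
    print(build_chatml_prompt([("user", "What is a thought bubble?")]))
```

Rejecting a system role outright keeps the prompt close to what the model is described as having been trained on; silently dropping it would also work, but would hide the mismatch from the caller.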

The idea is to separate the model's thoughts from its reply when chatting, either abstracting them away entirely or placing them in a thought bubble.

HF repo: trollek/ThoughtStream-4B-v0.1
Quants: mradermacher/ThoughtStream-4B-v0.1-GGUF
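The parameters listed at the top of this page (`num_ctx`, `num_predict`, `stop`) can also be passed explicitly per request. The following is a minimal sketch, assuming the model is served through Ollama's REST API and called via its `/api/generate` endpoint. The helper name `build_generate_request` is hypothetical, and because the stop list on this page is truncated, only its first entry is included.

```python
import json

# Tag as shown at the top of this page.
MODEL_TAG = "trollek/thoughtstream:4b-v01-q6_K"


def build_generate_request(prompt, num_ctx=8192, num_predict=2048):
    """Build a JSON-serialisable body for a non-streaming generate request."""
    return {
        "model": MODEL_TAG,
        "prompt": prompt,
        "stream": False,
        "options": {
            "num_ctx": num_ctx,          # context window, as listed on the page
            "num_predict": num_predict,  # maximum number of tokens to generate
            # Only the first stop token is visible in the page's truncated list.
            "stop": ["<|im_end|>"],
        },
    }


if __name__ == "__main__":
    # The body would be POSTed to the server, e.g. http://localhost:11434/api/generate
    # (default local address; adjust for your setup).
    print(json.dumps(build_generate_request("Why is the sky blue?"), indent=2))
```

Since these values mirror the defaults shipped with the model, sending them is only needed when you want to override one of them, for example a smaller `num_predict` for short replies.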