A small language model that simulates a QuietStar like behaviour in a simplified agentic way.
11 Pulls Updated 3 months ago
Updated 3 months ago
3 months ago
1b7145223393 · 3.3GB
model
archllama
·
parameters3.96B
·
quantizationQ6_K
3.3GB
template
<|im_start|>user
{{ .Prompt }}<|im_end|>
<|im_start|>assistant
63B
params
{"num_ctx":8192,"num_predict":2048,"stop":["\u003c|im_end|\u003e","\u003c|im_start|\u003e"],"tempera
135B
Readme
ThoughtStream-4B-v0.1
This model is based on h2oai/h2o-danube3-4b-base and fine-tuned using LoRA+ and BAdam with LLama-Factory. It uses the ChatML template, without a system message, and was trained on the ThoughtfulAssistant-v01 dataset.
The idea is to abstract the thoughts away or into a thought bubble when chatting.
HF repo: trollek/ThoughtStream-4B-v0.1 Quants: mradermacher/ThoughtStream-4B-v0.1-GGUF