4 5 days ago

Schematron-3B is a 3.2B-parameter Llama-architecture chat model (inference-net/Schematron-3B) converted to GGUF for llama.cpp/Ollama, with two published quantizations: Q4_K_M (recommended balance) and IQ4_XS (smaller). It supports up to 131,072 context

aeeca073b2d7 · 247B
<|begin_of_text|>{{ if .System }}<|start_header_id|>system<|end_header_id|>
{{ .System }}<|eot_id|>{{ end }}<|start_header_id|>user<|end_header_id|>
{{ .Prompt }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
{{ .Response }}<|eot_id|>