
A lightweight local intent-analysis model built for the OpenViking search API. It detects whether a conversation turn needs context retrieval, skips chitchat to reduce unnecessary memory injection and token usage, and emits structured retrieval queries.
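As a rough illustration of how a caller might act on the model's output, the Go sketch below gates retrieval on a decoded result. The output schema is not documented on this page, so the `intentResult` struct, its JSON field names (`needs_retrieval`, `queries`), and the `planRetrieval` helper are all hypothetical and only show the "skip chitchat, otherwise run the emitted queries" flow.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// intentResult is a HYPOTHETICAL output shape used for illustration only.
// The model's real schema is not documented on this page.
type intentResult struct {
	NeedsRetrieval bool     `json:"needs_retrieval"`
	Queries        []string `json:"queries"`
}

// planRetrieval decodes the model's reply and returns the queries to run.
// It returns nil for chitchat turns so the caller can skip retrieval
// and avoid injecting memory into the prompt.
func planRetrieval(raw string) ([]string, error) {
	var r intentResult
	if err := json.Unmarshal([]byte(raw), &r); err != nil {
		return nil, fmt.Errorf("decode intent result: %w", err)
	}
	if !r.NeedsRetrieval {
		return nil, nil
	}
	return r.Queries, nil
}

func main() {
	// Two made-up replies: one chitchat turn, one turn that needs context.
	replies := []string{
		`{"needs_retrieval": false, "queries": []}`,
		`{"needs_retrieval": true, "queries": ["deployment notes for project X"]}`,
	}
	for _, raw := range replies {
		queries, err := planRetrieval(raw)
		if err != nil {
			fmt.Println("error:", err)
			continue
		}
		fmt.Printf("retrieve=%t queries=%q\n", len(queries) > 0, queries)
	}
}
```

Returning `nil` for chitchat keeps the caller's logic to a single length check. A real integration would replace the struct with whatever fields the model actually emits.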

tools thinking
13584952422b · 131B
{{ if .System }}<|im_start|>system
{{ .System }}<|im_end|>
{{ end }}<|im_start|>user
{{ .Prompt }}<|im_end|>
<|im_start|>assistant
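
The template above uses Go `text/template` syntax with two variables, `.System` and `.Prompt`. To see what prompt string it produces, a minimal sketch is to render it with the standard library. The `promptVars` struct and `render` helper are illustrative names, not part of any runtime, and the trailing newline after `assistant` is an assumption (it would make the template 131 bytes, which is consistent with the size listed above).

```go
package main

import (
	"fmt"
	"strings"
	"text/template"
)

// chatTemplate reproduces the template shown above.
// The final "\n" is an assumption; see the note in the lead-in.
const chatTemplate = "{{ if .System }}<|im_start|>system\n" +
	"{{ .System }}<|im_end|>\n" +
	"{{ end }}<|im_start|>user\n" +
	"{{ .Prompt }}<|im_end|>\n" +
	"<|im_start|>assistant\n"

// promptVars holds the two fields the template reads (illustrative type).
type promptVars struct {
	System string
	Prompt string
}

// render executes the template. text/template does not HTML-escape,
// so the <|im_start|> and <|im_end|> markers pass through unchanged.
func render(v promptVars) (string, error) {
	tmpl, err := template.New("chat").Parse(chatTemplate)
	if err != nil {
		return "", err
	}
	var sb strings.Builder
	if err := tmpl.Execute(&sb, v); err != nil {
		return "", err
	}
	return sb.String(), nil
}

func main() {
	// An empty System is falsy, so the whole system block is omitted.
	out, err := render(promptVars{Prompt: "What did we decide about the cache?"})
	if err != nil {
		panic(err)
	}
	fmt.Print(out)
}
```

With an empty `System`, the `{{ if .System }}` branch is skipped and the output starts directly at the user turn. With a non-empty `System`, a system turn is emitted first. In both cases the prompt ends with an open assistant turn for the model to complete.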