49 Downloads Updated 5 days ago
ollama run SetneufPT/hermes79_2b_q4_128k_8gb-gpu
Updated 5 days ago
5 days ago
f5e13a33e759 · 1.9GB ·
Custom Ollama model, fine-tuned from Qwen3.5-2B, configured for Hermes Agent and local personal-assistant workflows.
This model is based on a 2B parameter LLM, quantized in Q4, and configured with a very large context window for long assistant sessions. It is intended for local AI assistant experiments where privacy, offline operation, tool use, and extended conversations are important.
This model is designed for: