huihui_ai/huihui-4:8b-q4_K

652 1 week ago

Huihui4-8B-A4B is a lightweight MoE (Mixture of Experts) conversational model optimized from Google's gemma-4-26B-A4B-it architecture

vision tools thinking 8b
ollama run huihui_ai/huihui-4:8b-q4_K

Details

1 week ago

6351e4a67645 · 6.6GB · gemma4 · 8.67B · Q4_K_M
Apache License Version 2.0, January 2004 http://www.apache.org/licenses/ TERMS AND CONDITIONS FOR US
{ "temperature": 1, "top_k": 64, "top_p": 0.95 }
{{ .Prompt }}

Readme

Huihui4-8B-A4B is a lightweight MoE (Mixture of Experts) conversational model optimized from Google’s gemma-4-26B-A4B-it architecture. Through expert pruning and supervised fine-tuning on high-quality dialogue data, it significantly reduces computational overhead while preserving core reasoning and interaction capabilities. It is designed for deployment on consumer-grade hardware and for code-related conversational tasks.

This model is not an ablation variant.

Everyone is welcome to test it.
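
For testing from a script rather than the interactive `ollama run` prompt, the following is a minimal sketch that sends a prompt to a locally running Ollama server over its REST API, passing the sampling parameters listed under Details (`temperature` 1, `top_k` 64, `top_p` 0.95) explicitly. It assumes the server is listening on the default `http://localhost:11434` and that the model has already been pulled; the helper names `build_payload` and `generate` are hypothetical and chosen for illustration only.

```python
import json
import urllib.request

MODEL = "huihui_ai/huihui-4:8b-q4_K"
# Assumption: a local Ollama server on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"

# Sampling defaults as listed in the model's parameters.
DEFAULT_OPTIONS = {"temperature": 1, "top_k": 64, "top_p": 0.95}


def build_payload(prompt, **overrides):
    """Build a non-streaming /api/generate request body (hypothetical helper)."""
    options = {**DEFAULT_OPTIONS, **overrides}
    return {"model": MODEL, "prompt": prompt, "stream": False, "options": options}


def generate(prompt, **overrides):
    """Send the prompt to the local server and return the generated text."""
    body = json.dumps(build_payload(prompt, **overrides)).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]


if __name__ == "__main__":
    print(generate("Write a Python function that reverses a string."))
```

Keyword arguments override individual defaults, so `generate(prompt, temperature=0.2)` would lower the temperature for a single request while leaving `top_k` and `top_p` unchanged.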

References

https://huggingface.co/huihui-ai/Huihui4-8B-A4B