13 Downloads Updated 4 hours ago
ollama run rubinmaximilian/Monk-Router-Gemma4e2b
Updated 4 hours ago
4 hours ago
86572402fe95 · 7.2GB ·
Monk-Router-gemma4e2b is a performance-first router designed for the Monk AI assistant. Leveraging Google’s 2026 E2B (Effective 2 Billion) architecture, it offers the fastest possible decision-making speed for edge computing environments (versus the similar model based on Phi4-mini).
This model is ideal for users prioritizing latency and VRAM efficiency on devices like the Jetson Orin Nano or MacBook Air. Please let me know if I should make an even larger model for scaled applications!
”`json { “logic”: “General logic task. Keeping on local Jetson.”, “tool_call”: { “name”: “switch_model”, “parameters”: { “model_name”: “gemma4-e2b” } } }