2 22 hours ago

A specialized logic-router for the Monk AI ecosystem, built on the Phi-4 Mini (3.8B) reasoning engine. Optimized for high-fidelity JSON tool-calling and hardware-aware task routing between Jetson edge devices and high-VRAM GPU servers.

tools
ollama run rubinmaximilian/Monk-Router-phi4mini

Details

20aad8bcec97 · 2.5GB · phi3 · 3.84B · Q4_K_M
{{- if or .System .Tools }}<|system|>{{ if .System }}{{ .System }}{{ end }} {{- if .Tools }}{{ if no
Microsoft. Copyright (c) Microsoft Corporation. MIT License Permission is hereby granted, free of ch
You are the Monk AI Logic Router. Your ONLY purpose is to output valid JSON. DO NOT provide explanat
{ "stop": [ "<|end|>", "<|user|>" ], "temperature": 0 }
[{"role":"user","content":"This python code is too complex for the edge device, spin up the review t

Readme

Monk-Router-phi4mini

Monk-Router-phi4mini is a specialized logic gateway for the Monk AI ecosystem. It acts as the “Prefrontal Cortex” of the system, analyzing user requests to determine the most efficient hardware and model path.

Built on Microsoft’s Phi-4 Mini (3.8B), this model is specifically tuned for technical reasoning, complex code analysis, and strict JSON output.

For a similar router with faster, lower-latency responses, see my other model based on Phi-4 Mini. Let me know if I should build a larger model for scaled applications!

Key Features

  • Hardware-Aware: Intelligently routes tasks based on local Jetson Orin Nano constraints.
  • Precision Logic: High-fidelity decision-making for complex tasks like security audits and large-file analysis.
  • Strict JSON: Tool-call output constrained by a JSON-only system prompt and temperature 0, for seamless integration with Python/C++ backends.
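Even with a JSON-only system prompt, a backend may want to confirm that a reply really is a single JSON object before acting on it. The following is a minimal sketch of such a check; the helper name `parse_strict_json` is hypothetical and not part of the model or of Ollama.

```python
import json


def parse_strict_json(raw: str) -> dict:
    """Decode a router reply, rejecting anything that is not one JSON object.

    Hypothetical helper for illustration; it is not shipped with the model.
    """
    try:
        decoded = json.loads(raw.strip())
    except json.JSONDecodeError as exc:
        # Covers replies with leading prose, trailing text, or broken syntax.
        raise ValueError(f"router reply is not valid JSON: {exc}") from exc
    if not isinstance(decoded, dict):
        # Arrays, strings, and numbers are valid JSON but not a command object.
        raise ValueError("router reply must be a JSON object")
    return decoded
```

Rejecting the reply outright, rather than trying to salvage JSON from surrounding text, keeps any drift from the JSON-only contract visible to the caller.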

Tools & Routing Logic

The model evaluates incoming prompts and outputs a JSON command to:

  1. switch_model: Swap local models (Gemma, Phi, Qwen).
  2. set_server: Offload tasks to a Main PC GPU or Cloud API.
  3. activate_swarm: Trigger multi-model agents (Research Squad, Code Review).
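On the backend side, the three tools above map naturally onto a dispatch table. The sketch below assumes the command has already been decoded into a dict with a `tool` key and an `arguments` key. That shape, the argument names (`model`, `target`, `squad`), and the handler bodies are all assumptions for illustration, since the README does not document the exact schema.

```python
from typing import Any, Callable, Dict

Args = Dict[str, Any]


# Hypothetical handlers: real ones would swap models, switch servers, or start agents.
def switch_model(args: Args) -> str:
    return f"switching local model to {args.get('model')}"


def set_server(args: Args) -> str:
    return f"offloading to {args.get('target')}"


def activate_swarm(args: Args) -> str:
    return f"activating swarm {args.get('squad')}"


# Tool names come from the README; the key names used below are assumed.
HANDLERS: Dict[str, Callable[[Args], str]] = {
    "switch_model": switch_model,
    "set_server": set_server,
    "activate_swarm": activate_swarm,
}


def dispatch(command: Dict[str, Any]) -> str:
    """Route a decoded router command to its handler, rejecting unknown tools."""
    name = command.get("tool")
    handler = HANDLERS.get(name)
    if handler is None:
        raise ValueError(f"unknown tool: {name!r}")
    args = command.get("arguments", {})
    if not isinstance(args, dict):
        raise ValueError("arguments must be a JSON object")
    return handler(args)
```

For example, `dispatch({"tool": "set_server", "arguments": {"target": "main-pc"}})` would call `set_server` with the given arguments, while an unrecognized tool name raises `ValueError` instead of being silently ignored.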

Usage example

```bash
ollama run rubinmaximilian/Monk-Router-phi4mini "Analyze this 500-line C++ file for memory leaks."
```
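The same request can be sent from a Python backend through Ollama's local REST API. This is a minimal sketch, assuming an Ollama server on the default `http://localhost:11434` with this model already pulled. The helper names `build_chat_payload` and `ask_router` are hypothetical. Whether the routing command arrives in the message's `content` or in `tool_calls` depends on the model's template, which the page does not spell out, so the sketch returns the whole message.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/chat"  # assumes Ollama's default local port
MODEL = "rubinmaximilian/Monk-Router-phi4mini"


def build_chat_payload(prompt: str) -> dict:
    """Build a non-streaming /api/chat request body for a single user prompt."""
    return {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,  # ask for one complete JSON response instead of chunks
    }


def ask_router(prompt: str, url: str = OLLAMA_URL, timeout: float = 60.0) -> dict:
    """Send the prompt to a running Ollama server and return the reply message."""
    body = json.dumps(build_chat_payload(prompt)).encode("utf-8")
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)["message"]
```

Calling `ask_router("Analyze this 500-line C++ file for memory leaks.")` requires the server to be running; `build_chat_payload` can be inspected on its own without any network access.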