19 Downloads Updated 1 week ago
ollama run dcostenco/prism-ide:32b
4b92f661f6a9 · 20GB ·
Fine-tuned on Qwen3 for healthcare-grade TypeScript codebases. Matches Claude Sonnet 4.6 on a 22-check coding eval while running fully offline.
| Tag | Base | Size | Best for |
|---|---|---|---|
| 1.7b | Qwen3-1.7B | 1.1 GB | On-device / iOS |
| 8b | Qwen3-8B | 5.2 GB | Laptop / fast iteration |
| 14b | Qwen3-14B | 9.0 GB | Daily driver |
| 32b | Qwen3-32B | 20 GB | Highest quality |

| Metric | Score |
|---|---|
| Routing accuracy (BFCL) | 100% |
| Coding eval (22 checks) | 22/22 |
| vs Claude Sonnet 4.6 | Tied |
| vs Claude Opus 4 | Beats on coding |

```bash
ollama run dcostenco/prism-ide:14b
```

## What it knows

- TypeScript / Next.js App Router patterns
- Healthcare audit logging (`withAudit`, HIPAA non-blocking `.then`)
- Supabase RLS, UUID validation, JSONB safety
- General ledger (CO/PR entries, CAS patterns)
- Tool-call routing (picks the right model/tool for the task)

## Thinking suppression

Qwen3 thinking is disabled by default in the Modelfile for deterministic, fast responses. Ideal for IDE autocomplete and agent pipelines.

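For editor or agent integrations, the model can also be reached over Ollama's local HTTP API instead of the CLI. The following is a minimal sketch, assuming Ollama is serving on its default address (`http://localhost:11434`) and a runtime with a global `fetch` (Node 18+ or similar). The helper names `buildGenerateRequest` and `complete` are illustrative and not part of this model or of Ollama.

```typescript
// Shape of the JSON body sent to Ollama's /api/generate endpoint (subset of fields).
interface GenerateRequest {
  model: string;
  prompt: string;
  stream: boolean;
}

// Hypothetical helper: builds the request body for a given prism-ide tag.
function buildGenerateRequest(prompt: string, tag: string = "14b"): GenerateRequest {
  return {
    model: `dcostenco/prism-ide:${tag}`,
    prompt,
    stream: false, // request one JSON object instead of a stream of chunks
  };
}

// Hypothetical helper: sends the prompt and returns the generated text.
// baseUrl is an assumption (Ollama's default local address).
async function complete(
  prompt: string,
  tag: string = "14b",
  baseUrl: string = "http://localhost:11434",
): Promise<string> {
  const res = await fetch(`${baseUrl}/api/generate`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(buildGenerateRequest(prompt, tag)),
  });
  if (!res.ok) {
    throw new Error(`Ollama request failed with status ${res.status}`);
  }
  const data = (await res.json()) as { response: string };
  return data.response;
}
```

A call such as `await complete("Add UUID validation to this route handler", "8b")` would target the `8b` tag. No extra option is passed to disable thinking, since the Modelfile already does so by default.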
## Companion routing model

Use `dcostenco/prism-coder` for pure tool/model routing (smaller, faster). Use `prism-ide` when you need code generation quality.

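In a pipeline that uses both models, the split described above could be expressed as a small selector. This is only a sketch: the `Task` labels and the `pickModel` function are hypothetical names introduced for illustration, and the default tag of `14b` is an assumption.

```typescript
// Hypothetical task labels for illustration only.
type Task = "routing" | "codegen";

// Returns the Ollama model name to use for a task:
// prism-coder for pure tool/model routing, prism-ide for code generation.
function pickModel(task: Task, ideTag: string = "14b"): string {
  if (task === "routing") {
    return "dcostenco/prism-coder";
  }
  return `dcostenco/prism-ide:${ideTag}`;
}
```

The returned string can be passed wherever a model name is expected, for example `ollama run` or the `model` field of an Ollama API request.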
## License

MIT. Weights derived from Qwen3 (Apache 2.0).