19 1 week ago

Local-first AI tool router + coder. 4 sizes. 100% routing accuracy. 22/22 coding eval. 97% free. Beats Opus.

ollama run dcostenco/prism-ide:14b

Details

39ef5e42ef90 · 9.0GB · qwen3 · 14.8B · Q4_K_M
{{ .Prompt }}
{ "num_ctx": 8192, "stop": [ "<|im_end|>", "<|endoftext|>" ], "tempe

Readme

Prism IDE — Local-First AI Coding Assistant

Fine-tuned on Qwen3 for healthcare-grade TypeScript codebases. Matches Claude Sonnet 4.6 on a 22-check coding eval while running fully offline.

Models

| Tag | Base | Size | Best for |
| --- | --- | --- | --- |
| 1.7b | Qwen3-1.7B | 1.1 GB | On-device / iOS |
| 8b | Qwen3-8B | 5.2 GB | Laptop / fast iteration |
| 14b | Qwen3-14B | 9.0 GB | Daily driver |
| 32b | Qwen3-32B | 20 GB | Highest quality |

Performance

| Metric | Score |
| --- | --- |
| Routing accuracy (BFCL) | 100% |
| Coding eval (22 checks) | 22/22 |
| vs Claude Sonnet 4.6 | Tied |
| vs Claude Opus 4 | Beats on coding |

Quick Start

```bash
ollama run dcostenco/prism-ide:14b
```

What it knows

- TypeScript / Next.js App Router patterns
- Healthcare audit logging (withAudit, HIPAA non-blocking .then)
- Supabase RLS, UUID validation, JSONB safety
- General ledger (CO/PR entries, CAS patterns)
- Tool-call routing (picks the right model/tool for the task)

Thinking suppression

Qwen3 thinking is disabled by default in the Modelfile for deterministic, fast responses. Ideal for IDE autocomplete and agent pipelines.
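
For agent pipelines, the model can also be called over Ollama's local HTTP API instead of the CLI. The following is a minimal TypeScript sketch, assuming Ollama is listening on its default address (http://localhost:11434) and the runtime provides a global `fetch` (for example Node 18+). The helper names `buildGenerateRequest` and `generate` are hypothetical and are not part of Prism IDE or Ollama.

```typescript
// Sketch only: helper names are illustrative, not shipped with Prism IDE.
interface GenerateRequest {
  model: string;
  prompt: string;
  stream: boolean;
  options?: { num_ctx?: number; temperature?: number };
}

// Build the JSON body for Ollama's /api/generate endpoint.
function buildGenerateRequest(prompt: string, tag: string = "14b"): GenerateRequest {
  return {
    model: `dcostenco/prism-ide:${tag}`,
    prompt,
    stream: false, // request one JSON object instead of a token stream
    options: { num_ctx: 8192 }, // same context size as the published model parameters
  };
}

// Send the request to a locally running Ollama server (assumed default port).
async function generate(prompt: string, tag?: string): Promise<string> {
  const res = await fetch("http://localhost:11434/api/generate", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(buildGenerateRequest(prompt, tag)),
  });
  if (!res.ok) {
    throw new Error(`Ollama returned HTTP ${res.status}`);
  }
  const data = (await res.json()) as { response: string };
  return data.response;
}
```

Calling `generate("Write a UUID validator in TypeScript", "8b")` would target the 8b tag; omitting the tag falls back to 14b in this sketch.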

Companion routing model

Use dcostenco/prism-coder for pure tool/model routing (smaller, faster). Use prism-ide when you need code generation quality.
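
The split between the two models could be expressed as a small selector in client code. This is a sketch under assumptions: the `Task` categories and the `pickModel` function are hypothetical, and only the two model names come from this README (no tag is given for prism-coder, so none is specified here).

```typescript
// Hypothetical task categories for illustration.
type Task = "route" | "generate";

// Choose the smaller routing model for pure tool/model routing,
// and prism-ide when code generation quality matters.
function pickModel(task: Task): string {
  return task === "route"
    ? "dcostenco/prism-coder"
    : "dcostenco/prism-ide:14b";
}
```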

License

MIT. Weights are derived from Qwen3 (Apache 2.0).