436 2 months ago

CPU-optimized gpt-oss:20b for Kubernetes and VM environments. Runs without GPU, tuned for predictable CPU inference, balanced quality, and reduced memory usage.

tools thinking