299 2 weeks ago

gpt-oss optimized for coding, now with an increased 128k context length

tools thinking 20b

Models

View all →

Readme

Make sure to use ollama version 0.11.5 or later to avoid out-of-memory issues. The new 128k version running on 0.11.5 has roughly the same memory usage as the previous 64k version on 0.11.4.