Generic all purpose model. Occasionally may have notable logic, usually Llama-3_3-Nemotron-Super-49B-v1_5 is preferred.

Applications

Claude Code ollama launch claude --model mirage335/gpt-oss-120b-virtuoso

Codex App ollama launch codex-app --model mirage335/gpt-oss-120b-virtuoso

OpenClaw ollama launch openclaw --model mirage335/gpt-oss-120b-virtuoso

Hermes Agent ollama launch hermes --model mirage335/gpt-oss-120b-virtuoso

Codex ollama launch codex --model mirage335/gpt-oss-120b-virtuoso

OpenCode ollama launch opencode --model mirage335/gpt-oss-120b-virtuoso

Apache License Version 2.0

NOTICE

Design

Generic all purpose model. Occasionally may have notable logic, usually Llama-3_3-Nemotron-Super-49B-v1_5 is preferred.

Usage

ollama_pull_virtuoso() {
ollama pull mirage335/"$1"
ollama cp mirage335/"$1" "$1"
ollama rm mirage335/"$1"
}

ollama_pull_virtuoso gpt-oss-120b-virtuoso

echo "FROM gpt-oss-120b-virtuoso" > Modelfile-128k
echo "PARAMETER num_ctx 131072" >> Modelfile-128k
echo "PARAMETER num_keep 131072" >> Modelfile-128k
echo "PARAMETER num_predict 131072" >> Modelfile-128k
ollama create gpt-oss-120b-128k-virtuoso -f Modelfile-128k
rm -f Modelfile-128k

Recommended environment variables. KV_CACHE quantization “q4_0” in particular RECOMMENDED, unless “q8_0” is needed (eg. by Qwen-2_5-VL-7B-Instruct-virtuoso, etc).

export OLLAMA_NUM_THREADS=18
export OLLAMA_FLASH_ATTENTION=1
export OLLAMA_KV_CACHE_TYPE="q4_0"
export OLLAMA_NEW_ENGINE=true
export OLLAMA_NOHISTORY=true
export OLLAMA_NUM_PARALLEL=1
export OLLAMA_MAX_LOADED_MODELS=1

Adjust OLLAMA_NUM_THREADS and/or disable HyperThreading, etc, to prevent crippling performance loss.

CAUTION - Preservation

Pulling the model this way relies on the ollama repository, and more generally, reliability of internet services, which has been rather significantly fragile.

If possible, you should use the “Llama-3-virtuoso” project, which automatically caches an automatically installable backup copy.

https://github.com/mirage335-colossus/Llama-3-virtuoso

Generic all purpose model. Occasionally may have notable logic, usually Llama-3_3-Nemotron-Super-49B-v1_5 is preferred.

Applications

Models

Readme

Design

Usage

CAUTION - Preservation