mirage335/ Qwen3-Coder-30b-virtuoso

320 Downloads Updated 6 months ago

From qwen3-coder:30b . Compatible with opencode (both).

tools

ollama run mirage335/Qwen3-Coder-30b-virtuoso

curl http://localhost:11434/api/chat \
  -d '{
    "model": "mirage335/Qwen3-Coder-30b-virtuoso",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='mirage335/Qwen3-Coder-30b-virtuoso',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'mirage335/Qwen3-Coder-30b-virtuoso',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Models

Name

1 model

Size / Usage

Context

Input

Qwen3-Coder-30b-virtuoso:latest

19GB · 256K context window · Text · 6 months ago

Qwen3-Coder-30b-virtuoso:latest

19GB

256K

Text

Readme

Apache License Version 2.0

https://ollama.com/library/qwen3-coder:30b

NOTICE

Design

Recommended for opencode .

Usage

ollama_pull_virtuoso() {
ollama pull mirage335/"$1"
ollama cp mirage335/"$1" "$1"
ollama rm mirage335/"$1"
}

ollama_pull_virtuoso Qwen3-Coder-30b-virtuoso

echo "FROM Qwen3-Coder-30b-virtuoso:latest" > Modelfile-128k
echo "PARAMETER num_ctx 131072" >> Modelfile-128k
echo "PARAMETER num_keep 131072" >> Modelfile-128k
echo "PARAMETER num_predict 49152" >> Modelfile-128k
ollama create Qwen3-Coder-30b-128k-virtuoso -f Modelfile-128k

echo "FROM Qwen3-Coder-30b-virtuoso:latest" > Modelfile-256k
echo "PARAMETER num_ctx 262144" >> Modelfile-256k
echo "PARAMETER num_keep 262144" >> Modelfile-256k
echo "PARAMETER num_predict 49152" >> Modelfile-256k
ollama create Qwen3-Coder-30b-256k-virtuoso -f Modelfile-256k

Recommended environment variables. KV_CACHE quantization “q4_0” in particular RECOMMENDED, unless “q8_0” is needed (eg. by Qwen-2_5-VL-7B-Instruct-virtuoso, etc).

export OLLAMA_NUM_THREADS=18
export OLLAMA_FLASH_ATTENTION=1
export OLLAMA_KV_CACHE_TYPE="q4_0"
export OLLAMA_NEW_ENGINE=true
export OLLAMA_NOHISTORY=true
export OLLAMA_NUM_PARALLEL=1
export OLLAMA_MAX_LOADED_MODELS=1

Adjust OLLAMA_NUM_THREADS and/or disable HyperThreading, etc, to prevent crippling performance loss.

CAUTION - Preservation

Pulling the model this way relies on the ollama repository, and more generally, reliability of internet services, which has been rather significantly fragile.

If possible, you should use the “Llama-3-virtuoso” project, which automatically caches an automatically installable backup copy.

https://github.com/mirage335-colossus/Llama-3-virtuoso

Apache License Version 2.0

https://ollama.com/library/qwen3-coder:30b

NOTICE

# Design

Recommended for opencode .

# Usage

```bash
ollama_pull_virtuoso() {
ollama pull mirage335/"$1"
ollama cp mirage335/"$1" "$1"
ollama rm mirage335/"$1"
}

ollama_pull_virtuoso Qwen3-Coder-30b-virtuoso
```
```
echo "FROM Qwen3-Coder-30b-virtuoso:latest" > Modelfile-128k
echo "PARAMETER num_ctx 131072" >> Modelfile-128k
echo "PARAMETER num_keep 131072" >> Modelfile-128k
echo "PARAMETER num_predict 49152" >> Modelfile-128k
ollama create Qwen3-Coder-30b-128k-virtuoso -f Modelfile-128k

echo "FROM Qwen3-Coder-30b-virtuoso:latest" > Modelfile-256k
echo "PARAMETER num_ctx 262144" >> Modelfile-256k
echo "PARAMETER num_keep 262144" >> Modelfile-256k
echo "PARAMETER num_predict 49152" >> Modelfile-256k
ollama create Qwen3-Coder-30b-256k-virtuoso -f Modelfile-256k
```

Recommended environment variables. KV_CACHE quantization “q4_0” in particular RECOMMENDED, unless “q8_0” is needed (eg. by Qwen-2_5-VL-7B-Instruct-virtuoso, etc).
```bash
export OLLAMA_NUM_THREADS=18
export OLLAMA_FLASH_ATTENTION=1
export OLLAMA_KV_CACHE_TYPE="q4_0"
export OLLAMA_NEW_ENGINE=true
export OLLAMA_NOHISTORY=true
export OLLAMA_NUM_PARALLEL=1
export OLLAMA_MAX_LOADED_MODELS=1
```
Adjust OLLAMA_NUM_THREADS and/or disable HyperThreading, etc, to prevent crippling performance loss.

# CAUTION - Preservation

Pulling the model this way relies on the ollama repository, and more generally, reliability of internet services, which has been rather significantly fragile.

If possible, you should use the "Llama-3-virtuoso" project, which automatically caches an automatically installable backup copy.

https://github.com/mirage335-colossus/Llama-3-virtuoso

Paste, drop or click to upload images (.png, .jpeg, .jpg, .svg, .gif)