danielsheep/Qwen3-Coder-30B-A3B-Instruct-1M-Unsloth

danielsheep/ Qwen3-Coder-30B-A3B-Instruct-1M-Unsloth:latest

10.4K Downloads Updated 10 months ago

Unsloth Dynamic 2.0 Quants achieves 1M tokens & superior accuracy & SOTA quantization performance. Select UD-IQ3_XXS for 16GB VRAM, UD-Q4_K_XL for 24GB VRAM, or UD-Q5_K_XL/UD-Q6_K_XL for 32GB VRAM.

tools

ollama run danielsheep/Qwen3-Coder-30B-A3B-Instruct-1M-Unsloth

curl http://localhost:11434/api/chat \
  -d '{
    "model": "danielsheep/Qwen3-Coder-30B-A3B-Instruct-1M-Unsloth",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='danielsheep/Qwen3-Coder-30B-A3B-Instruct-1M-Unsloth',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'danielsheep/Qwen3-Coder-30B-A3B-Instruct-1M-Unsloth',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

Updated 10 months ago

10 months ago

56720d55401e · 18GB ·

model

archqwen3moe

parameters30.5B

quantizationQ4_K_M

18GB

params

{ "min_p": 0, "repeat_penalty": 1.05, "stop": [ "<|im_start|>", "<|im_en

178B

license

Apache License Version 2.0, January 2004 http://www.apache.org/licenses/ TERMS AND CONDITIONS FOR US

11kB

template

{{- $lastUserIdx := -1 -}} {{- range $idx, $msg := .Messages -}} {{- if eq $msg.Role "user" }}{{ $la

1.4kB

Readme

Qwen3 Coder 30B model for consumer-grade graphics cards, Unsloth Dynamic 2.0 quantized version with 1M tokens. For graphics cards with 16GB of VRAM, the UD-IQ3_XXS is recommended. For graphics cards with 24GB of VRAM, the UD-Q4_K_XL is recommended. For graphics cards with 32GB of VRAM, the UD-Q5_K_XL is recommended. Remember using Environment Variable OLLAMA_CONTEXT_LENGTH to adjust context length. With 16GB of VRAM and UD-IQ3_XXS, the recommended context length is 8192.