sammcj/devstral-small-24b-2505-ud

1,005 Downloads · Updated 1 year ago

Built from Unsloth's UD Q6_K_XL quant

tools

ollama run sammcj/devstral-small-24b-2505-ud:64k-q6_k_xl

curl http://localhost:11434/api/chat \
  -d '{
    "model": "sammcj/devstral-small-24b-2505-ud:64k-q6_k_xl",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='sammcj/devstral-small-24b-2505-ud:64k-q6_k_xl',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'sammcj/devstral-small-24b-2505-ud:64k-q6_k_xl',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Applications

Claude Code: ollama launch claude --model sammcj/devstral-small-24b-2505-ud:64k-q6_k_xl

Codex App: ollama launch codex-app --model sammcj/devstral-small-24b-2505-ud:64k-q6_k_xl

OpenClaw: ollama launch openclaw --model sammcj/devstral-small-24b-2505-ud:64k-q6_k_xl

Hermes Agent: ollama launch hermes --model sammcj/devstral-small-24b-2505-ud:64k-q6_k_xl

Codex: ollama launch codex --model sammcj/devstral-small-24b-2505-ud:64k-q6_k_xl

OpenCode: ollama launch opencode --model sammcj/devstral-small-24b-2505-ud:64k-q6_k_xl

Models

5 models

Name                                             Size   Context   Input   Updated
devstral-small-24b-2505-ud:64k-q6_k_xl           21GB   128K      Text    1 year ago
devstral-small-24b-2505-ud:128k-q6_k_xl          21GB   128K      Text    1 year ago
devstral-small-24b-2505-ud:cline-128k-q6_k_xl    21GB   128K      Text    1 year ago
devstral-small-24b-2505-ud:cline-64k-q4_k_xl     15GB   128K      Text    1 year ago
devstral-small-24b-2505-ud:cline-64k-q6_k_xl     21GB   128K      Text    1 year ago

Readme

GGUF: https://huggingface.co/unsloth/Devstral-Small-2505-GGUF/blob/main/Devstral-Small-2505-UD-Q6_K_XL.gguf

128k tags default to a 128k context, which requires around 35GB of VRAM.
64k tags default to a 64k context, which requires around 27GB of VRAM. num_batch is set to 1024 (up from 512) for performance; you can tune num_batch to trade off speed against VRAM requirements.
cline tags are optimised for agentic coding with Cline and Roo Code.
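
The num_batch trade-off described above can also be applied per request rather than by editing the tag defaults. The following is a minimal sketch, assuming the ollama Python client's chat function accepts an options dictionary with num_batch and num_ctx keys. The helper names build_chat_request and send_chat are hypothetical and not part of this model's page, and the values shown are illustrative rather than tuned recommendations.

```python
from typing import Any, Dict, Optional

MODEL = "sammcj/devstral-small-24b-2505-ud:64k-q6_k_xl"


def build_chat_request(
    prompt: str,
    num_batch: int = 1024,
    num_ctx: Optional[int] = None,
) -> Dict[str, Any]:
    """Build keyword arguments for ollama.chat with per-request option overrides.

    num_batch defaults to 1024, the value the 64k tags ship with.
    num_ctx is only sent when given, so the tag's own default applies otherwise.
    """
    options: Dict[str, int] = {"num_batch": num_batch}
    if num_ctx is not None:
        options["num_ctx"] = num_ctx
    return {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "options": options,
    }


def send_chat(request: Dict[str, Any]) -> str:
    """Send the request to a local Ollama server and return the reply text."""
    # Imported lazily so build_chat_request works without the client installed.
    from ollama import chat

    response = chat(**request)
    return response.message.content


# Example (requires a running Ollama server with the model pulled):
#   print(send_chat(build_chat_request("Hello!", num_batch=512)))
```

Going back to a num_batch of 512 should lower VRAM use at some cost in speed, in line with the note above. The actual savings depend on your hardware and Ollama version, so measure before relying on them.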