devstral-small-2

924.5K downloads · Updated 7 months ago

24B model that excels at using tools to explore codebases, editing multiple files, and powering software engineering agents.

vision tools 24b

ollama run devstral-small-2

curl http://localhost:11434/api/chat \
  -d '{
    "model": "devstral-small-2",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='devstral-small-2',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'devstral-small-2',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Applications

Claude Code

ollama launch claude --model devstral-small-2

OpenCode

ollama launch opencode --model devstral-small-2

Hermes Agent

ollama launch hermes --model devstral-small-2

OpenClaw

ollama launch openclaw --model devstral-small-2

Models

6 models

| Name                          | Size | Context | Input       | Updated      |
|-------------------------------|------|---------|-------------|--------------|
| devstral-small-2:latest       | 15GB | 384K    | Text, Image | 7 months ago |
| devstral-small-2:24b (latest) | 15GB | 384K    | Text, Image | 7 months ago |
| devstral-small-2:24b-cloud    | --   | 256K    | Text, Image | 7 months ago |
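Since every tag lists Text and Image as inputs, a request can carry an image alongside the prompt. The following is a minimal sketch, assuming the Ollama REST API's `images` message field (base64 strings) and the `num_ctx` option for the context length. The file name `screenshot.png` and the helper names are hypothetical, and a large `num_ctx` needs correspondingly more memory.

```python
import base64
import json
import urllib.request

OLLAMA_CHAT_URL = "http://localhost:11434/api/chat"


def image_message(prompt: str, image_bytes: bytes) -> dict:
    """Build a user message carrying one base64-encoded image."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {"role": "user", "content": prompt, "images": [encoded]}


def build_payload(prompt: str, image_bytes: bytes, num_ctx=None) -> dict:
    """Assemble a non-streaming chat request; num_ctx is optional."""
    payload = {
        "model": "devstral-small-2",
        "messages": [image_message(prompt, image_bytes)],
        "stream": False,
    }
    if num_ctx is not None:
        # Assumption: the context length is set through options.num_ctx.
        payload["options"] = {"num_ctx": num_ctx}
    return payload


def send(payload: dict) -> str:
    """POST the payload to a local Ollama server and return the reply text."""
    request = urllib.request.Request(
        OLLAMA_CHAT_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=300) as response:
        return json.load(response)["message"]["content"]


if __name__ == "__main__":
    try:
        # Hypothetical input file; replace with a real image path.
        with open("screenshot.png", "rb") as handle:
            data = handle.read()
        print(send(build_payload("Describe this UI bug.", data, num_ctx=32768)))
    except OSError as err:
        print(f"Request not sent: {err}")
```

The payload builders are kept separate from the network call so the request shape can be inspected or logged before anything is sent.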

Readme

> Note: this model requires Ollama 0.13.3 or later. [Download Ollama](https://ollama.com/download)

# Devstral Small 2
Devstral is an agentic LLM for software engineering tasks. **Devstral 2** models excel at using tools to explore codebases, editing multiple files, and powering software engineering agents.  
Devstral Small 2 scores 65.8% on SWE Bench Verified (see Benchmark Results).

**[24B model](https://ollama.com/library/devstral-small-2)**

```
ollama run devstral-small-2
```
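Because the model requires Ollama 0.13.3 or later, it can help to check the server version before pulling it. This is a rough sketch, assuming the local server exposes `GET /api/version` and returns a JSON object with a `version` field; the helper names are illustrative. Versions are compared as integer tuples because a plain string comparison would rank `0.9.0` above `0.13.3`.

```python
import json
import urllib.request

MIN_VERSION = "0.13.3"  # minimum stated in the note above


def parse_version(text: str) -> tuple[int, ...]:
    """Turn '0.13.3' (or 'v0.13.3-rc1') into (0, 13, 3) for comparison."""
    core = text.strip().lstrip("v").split("-")[0].split("+")[0]
    return tuple(int(part) for part in core.split("."))


def meets_minimum(installed: str, minimum: str = MIN_VERSION) -> bool:
    """Return True when the installed version is at least the minimum."""
    return parse_version(installed) >= parse_version(minimum)


def server_version(host: str = "http://localhost:11434") -> str:
    """Ask a running Ollama server for its version string."""
    with urllib.request.urlopen(f"{host}/api/version", timeout=5) as response:
        return json.load(response)["version"]


if __name__ == "__main__":
    try:
        version = server_version()
    except OSError as err:
        print(f"Could not reach Ollama: {err}")
    else:
        status = "OK" if meets_minimum(version) else "too old"
        print(f"Ollama {version}: {status}")
```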

### Key Features

The Devstral 2 Instruct model offers the following capabilities:

- **Agentic Coding**: Devstral is designed to excel at agentic coding tasks, making it a great choice for software engineering agents.

- **Improved Performance**: Devstral 2 is a step up from its predecessors.

- **Better Generalization**: Generalizes better to diverse prompts and coding environments.

### Use Cases

Devstral Small 2 is suited to AI code assistants, agentic coding, and software engineering tasks that call for complex tool integration and deep codebase understanding.
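To make the tool-integration use case concrete, here is a minimal sketch of one tool-calling round trip against a local Ollama server. It assumes the `/api/chat` endpoint accepts a `tools` list of function schemas, returns `tool_calls` with `arguments` as a JSON object, and accepts `tool`-role replies with a `tool_name` field. The `list_files` tool and all helper names are hypothetical, and a real agent would validate arguments and loop until the model stops requesting tools.

```python
import json
import urllib.request
from pathlib import Path

OLLAMA_CHAT_URL = "http://localhost:11434/api/chat"


def list_files(directory: str = ".") -> str:
    """Hypothetical example tool: list entry names in a directory."""
    return "\n".join(sorted(p.name for p in Path(directory).iterdir()))


# Registry mapping tool names to local Python callables.
TOOLS = {"list_files": list_files}

# Function schema that describes the tool to the model.
TOOL_SPECS = [{
    "type": "function",
    "function": {
        "name": "list_files",
        "description": "List the files in a directory of the codebase.",
        "parameters": {
            "type": "object",
            "properties": {"directory": {"type": "string"}},
            "required": ["directory"],
        },
    },
}]


def build_request(messages: list[dict]) -> dict:
    """Assemble a non-streaming chat request that advertises the tools."""
    return {"model": "devstral-small-2", "messages": messages,
            "tools": TOOL_SPECS, "stream": False}


def run_tool_calls(message: dict) -> list[dict]:
    """Execute each tool call in an assistant message; return tool replies."""
    replies = []
    for call in message.get("tool_calls", []):
        fn = call["function"]
        handler = TOOLS.get(fn["name"])
        result = handler(**fn["arguments"]) if handler else f"unknown tool: {fn['name']}"
        replies.append({"role": "tool", "tool_name": fn["name"], "content": result})
    return replies


def chat(messages: list[dict]) -> dict:
    """Send the conversation and return the assistant message."""
    request = urllib.request.Request(
        OLLAMA_CHAT_URL,
        data=json.dumps(build_request(messages)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=300) as response:
        return json.load(response)["message"]


if __name__ == "__main__":
    messages = [{"role": "user", "content": "Which files are in the current directory?"}]
    try:
        reply = chat(messages)
        if reply.get("tool_calls"):
            # Feed the tool results back so the model can answer in prose.
            messages.append(reply)
            messages.extend(run_tool_calls(reply))
            reply = chat(messages)
        print(reply["content"])
    except OSError as err:
        print(f"Could not reach Ollama: {err}")
```

Keeping the registry (`TOOLS`) separate from the schemas (`TOOL_SPECS`) means the dispatch step can be exercised without a running server.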

### Benchmark Results

| Model/Benchmark               | Size (B Params) | SWE Bench Verified | SWE Bench Multilingual | Terminal Bench |
|-------------------------------|-----------------|--------------------|------------------------|----------------|
| **Devstral 2**                | 123             | 72.2%              | 61.3%                  | 40.5%          |
| **Devstral Small 2**          | 24              | 65.8%              | 51.6%                  | 32.0%          |
|                               |                 |                    |                        |                |
| DeepSeek v3.2                 | 671             | 73.1%              | 70.2%                  | 46.4%          |
| Kimi K2 Thinking              | 1000            | 71.3%              | 61.1%                  | 35.7%          |
| MiniMax M2                    | 230             | 69.4%              | 56.5%                  | 30.0%          |
| GLM 4.6                       | 455             | 68.0%              | --                     | 40.5%          |
| Qwen 3 Coder Plus             | 480             | 69.6%              | 54.7%                  | 37.5%          |
| Gemini 3 Pro                  | --              | 76.2%              | --                     | 54.2%          |
| Claude Sonnet 4.5             | --              | 77.2%              | 68.0%                  | 42.8%          |
| GPT 5.1 Codex Max             | --              | 77.9%              | --                     | 58.1%          |
| GPT 5.1 Codex High            | --              | 73.7%              | --                     | 52.8%          |
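One way to read the table is to set each model's size against its score gap to Devstral Small 2. The sketch below does that arithmetic for SWE Bench Verified, using only the open-weight rows whose size is listed in the Size column; the `compare` helper is illustrative and not part of any library.

```python
# Values copied from the table above: (size in billions, SWE Bench Verified %).
SCORES = {
    "Devstral 2": (123, 72.2),
    "Devstral Small 2": (24, 65.8),
    "DeepSeek v3.2": (671, 73.1),
    "Kimi K2 Thinking": (1000, 71.3),
    "MiniMax M2": (230, 69.4),
    "GLM 4.6": (455, 68.0),
    "Qwen 3 Coder Plus": (480, 69.6),
}


def compare(baseline: str = "Devstral Small 2") -> list[tuple[str, float, float]]:
    """Return (model, size ratio, score gap in points) relative to the baseline."""
    base_size, base_score = SCORES[baseline]
    rows = []
    for name, (size, score) in SCORES.items():
        if name == baseline:
            continue
        rows.append((name, round(size / base_size, 1), round(score - base_score, 1)))
    return rows


if __name__ == "__main__":
    for name, ratio, gap in compare():
        print(f"{name}: {ratio}x the size, {gap:+.1f} points")
```

For example, Devstral 2 is about 5.1 times the size of Devstral Small 2 and scores 6.4 points higher on SWE Bench Verified.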

### License

**[Devstral Small 2 - 24B](https://ollama.com/library/devstral-small-2)**

Apache 2.0

### Reference

[Devstral 2](https://ollama.com/library/devstral-2)
