ollama run nishtahir/sera


SERA (Unofficial)

Model Variants

| Model | HuggingFace | Base | Teacher | SWE-bench Verified |
|---|---|---|---|---|
| SERA-32B | allenai/SERA-32B | Qwen 3-32B | GLM-4.6 | 49.5% ± 1.9% |
| SERA-32B-GA | allenai/SERA-32B-GA | Qwen 3-32B | GLM-4.5-Air | 46.6% ± 0.7% |
| SERA-8B | allenai/SERA-8B | Qwen 3-8B | GLM-4.6 | 31.7% ± 0.9% |
| SERA-8B-GA | allenai/SERA-8B-GA | Qwen 3-8B | GLM-4.5-Air | 31.7% ± 0.4% |

All results evaluated at 32K context length. Standard deviations computed over 3 random seeds.
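The ± figures are sample standard deviations over three evaluation seeds. A minimal sketch of that computation in Python; the per-seed values below are illustrative stand-ins, not the actual run results (only the reported 49.5% mean comes from the table above):

```python
import statistics

# Illustrative per-seed SWE-bench Verified resolve rates (percent).
# These three values are invented for the sketch; the real per-seed
# numbers behind the 49.5% ± 1.9% entry are not published here.
seed_results = [47.8, 49.5, 51.2]

mean = statistics.mean(seed_results)
stdev = statistics.stdev(seed_results)  # sample std dev (n - 1) over the 3 seeds

print(f"{mean:.1f}% ± {stdev:.1f}%")  # → 49.5% ± 1.7%
```

Using the sample (rather than population) standard deviation is the usual choice when only a handful of seeds are run.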

Performance

SWE-bench Verified (32K Context)

| Model | Type | Resolve Rate |
|---|---|---|
| SkyRL-8B | Open-source | 9.4% |
| Nex-N1-8B | Open-source | 20.3% |
| SERA-8B | Open-source | 31.7% |
| Qwen 3-32B (base) | Open-weight | 24.4% |
| SWE-smith | Open-source | 32.6% |
| SkyRL-Agent | Open-source | 39.4% |
| DeepSWE | Open-source | 42.2% |
| SERA-32B | Open-source | 49.5% |
| Devstral-Small-2 (24B) | Open-weight | 50.0% |
| GLM-4.5-Air (110B) | Open-weight | 50.5% |

Open-source: code, model weights, and data publicly available. Open-weight: model weights available but training data/code not fully released.