Models
GitHub
Discord
Docs
Cloud
Sign in
Download
Models
Download
GitHub
Discord
Docs
Cloud
Sign in
mikepfunk28
/
deepseekq3_agent
:latest
270
Downloads
Updated
4 months ago
16k Context Window meaning you need less RAM to run this. It's full context windows is loaded in the deepseekq3_coder. It allocates the RAM needed for the context when loading the model.
16k Context Window meaning you need less RAM to run this. It's full context windows is loaded in the deepseekq3_coder. It allocates the RAM needed for the context when loading the model.
Cancel
tools
thinking
Updated 4 months ago
4 months ago
b37cd58f6fd3 · 5.2GB ·
model
arch
qwen3
·
parameters
8.19B
·
quantization
Q4_K_M
5.2GB
license
MIT License Copyright (c) 2023 DeepSeek Permission is hereby granted, free of charge, to any person
1.1kB
template
{{- $lastUserIndex := -1 }} {{- $hasActiveToolCall := false }} {{- range $index, $_ := .Messages }}
1.9kB
system
# Devstral - Advanced Coding Assistant System Prompt You are Devstral, an elite coding assistant eng
14kB
params
{ "num_ctx": 16000, "seed": 42, "stop": [ "<|begin▁of▁sentence|>",
216B
Readme
No readme
Write
Preview
Paste, drop or click to upload images (.png, .jpeg, .jpg, .svg, .gif)