Q6_K / Q5_K_M / Q4_K_S | mistral-small3.1:24b-instruct-2503

tools · 5 months ago

e0a0e861d441 · 18GB · mistral3 · 24B · Q5_K_M

{ "num_ctx": 4096 }

Readme

Extra quants for Mistral-Small-3.1-24B

Q6_K / Q5_K_M / Q4_K_S

These were quantized with the Ollama client, so they retain Vision support.


Approximate maximum context on an RTX 4090 with 24GB of VRAM, with the Q8 KV cache enabled and 800MB to 1GB of VRAM left free as a buffer:

- Q6_K: 35K context
- Q5_K_M: 64K context
- Q4_K_S: 100K context
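
As a sketch of how these settings might be applied: Ollama's KV-cache quantization is controlled by the `OLLAMA_KV_CACHE_TYPE` server environment variable (`q8_0` for the Q8 cache used above, and it requires flash attention), while the context window is set with the `num_ctx` parameter in a Modelfile. The tag and model names below are placeholders; substitute the actual quant tag you pulled from this page.

```shell
# Serve with a q8_0-quantized KV cache (flash attention must be enabled for this)
OLLAMA_FLASH_ATTENTION=1 OLLAMA_KV_CACHE_TYPE=q8_0 ollama serve &

# Build a variant of the Q5_K_M quant with a 64K context window
# (the FROM tag is illustrative; replace it with the actual quant tag)
cat > Modelfile <<'EOF'
FROM mistral-small3.1:24b-instruct-2503
PARAMETER num_ctx 65536
EOF
ollama create mistral-small3.1-64k -f Modelfile
```

With the Q4_K_S quant the same approach should leave room for `num_ctx 102400` (100K) within the buffer described above.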