freerainboxbox/mistral-small:24b-instruct-2501-q4_1

761 pulls · updated 9 months ago

Alternative quantization levels, no fine-tuning

Capabilities: tools

537af93c11da · 15GB · llama · 23.6B · Q4_1
Parameters: { "temperature": 0.15 }
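
The shipped parameters set only a low sampling temperature. To override it locally, a minimal Modelfile sketch (using Ollama's `FROM`/`PARAMETER` Modelfile syntax; the value shown is just the model's shipped default, adjust to taste):

```
# Hypothetical Modelfile: derive from this quant and set the temperature.
FROM freerainboxbox/mistral-small:24b-instruct-2501-q4_1
PARAMETER temperature 0.15
```

Build it with `ollama create my-mistral-small -f Modelfile` and run it as `my-mistral-small`.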

Readme

These are alternative quantization levels of Mistral’s new 24B Mistral Small 3. No fine-tuning has been done; these are purely quantized versions of the original weights.
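
To illustrate what a quant level like Q4_1 means, here is a rough Python sketch of block quantization in the Q4_1 style (assumption: llama.cpp stores a per-block scale and minimum over 32-element blocks; this is illustrative, not the actual packed kernel format):

```python
# Q4_1-style block quantization sketch: each block of floats is mapped to
# 4-bit codes q in [0, 15] plus a per-block (scale, minimum), so that
# x ~ scale * q + minimum.
def quantize_q4_1(xs, block_size=32):
    blocks = []
    for i in range(0, len(xs), block_size):
        block = xs[i:i + block_size]
        lo, hi = min(block), max(block)
        scale = (hi - lo) / 15 or 1.0   # 4 bits -> 16 levels; guard all-equal blocks
        qs = [round((x - lo) / scale) for x in block]
        blocks.append((scale, lo, qs))
    return blocks

def dequantize_q4_1(blocks):
    # Reconstruct approximate floats from (scale, minimum, codes) triples.
    return [scale * q + lo for scale, lo, qs in blocks for q in qs]

weights = [0.0, 0.1, -0.3, 0.7, 0.25]
restored = dequantize_q4_1(quantize_q4_1(weights))
```

The per-element error is bounded by half the block scale, which is why lower-bit quants trade quality for the speed and size gains benchmarked below.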

Benchmarks on M1 Max (64GB):

Quant    Tok/sec
Q8_0       13.40
Q6_K       12.20
Q5_K_M     13.35
Q5_K_S     13.91
Q5_1       15.16
Q5_0       15.23
Q4_K_M     17.99
Q4_K_S     20.05
Q4_1       20.50
Q4_0       22.09
Q3_K_L     14.35
Q3_K_M     16.18
Q3_K_S     14.96
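
To make the throughput numbers easier to compare, a small Python snippet (values transcribed from the table above, rounded to two decimals) ranks the quants fastest-first and reports each as a fraction of the fastest:

```python
# Benchmark rows from the table above, as (quant, tokens/sec) pairs.
bench = {
    "Q8_0": 13.40, "Q6_K": 12.20, "Q5_K_M": 13.35, "Q5_K_S": 13.91,
    "Q5_1": 15.16, "Q5_0": 15.23, "Q4_K_M": 17.99, "Q4_K_S": 20.05,
    "Q4_1": 20.50, "Q4_0": 22.09, "Q3_K_L": 14.35, "Q3_K_M": 16.18,
    "Q3_K_S": 14.96,
}

# Rank fastest-first and show speed relative to the fastest quant.
fastest = max(bench.values())
for quant, tps in sorted(bench.items(), key=lambda kv: -kv[1]):
    print(f"{quant:7s} {tps:6.2f} tok/s  ({tps / fastest:.0%} of fastest)")
```

Notably, the Q3 quants are slower than Q4 on this hardware despite being smaller, so there is little reason to pick them here.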

For easy prompts that can tolerate occasional mistakes, Q4_0 is the fastest choice. For balanced quality at decent speed, use Q4_K_M. Avoid Q6_K, which benchmarks slower here than even Q8_0.