sam860/gemma3:270m

sam860/

gemma3:270m

90 Downloads Updated 2 weeks ago

Google's new small model

270m 1b

Updated 2 weeks ago

2 weeks ago

b16d6d39dfbd · 241MB

model

archgemma3

parameters268M

quantizationQ4_0

241MB

template

{{- $systemPromptAdded := false }} {{- range $i, $_ := .Messages }} {{- $last := eq (len (slice $.Me

476B

license

Gemma Terms of Use Last modified: February 21, 2024 By using, reproducing, modifying, distributing,

8.4kB

params

{ "stop": [ "<end_of_turn>" ], "top_k": 64, "top_p": 0.95 }

61B

Readme

Notes

Uploading Unsloth DynamicQuant2 versions for both the 1B and the new 270m Gemma 3 models. Unsloth’s DQ2 offers better accuracy, especially at higher quants, making these small models more capable.

The default latest, 1b, and 270m tags point to the official Quantization-Aware Trained (QAT) versions, which deliver near-fp16 performance even at smaller quants like Q4_0.

Description

Google’s Gemma 3: lightweight, multimodal models built from the same research as the Gemini family.

Key features include a large context window, multilingual support for over 140 languages, and strong performance for their size. These models can handle both text and image inputs to generate text outputs.

Ideal for on-device tasks, quick summarization, chatbots, and other resource-constrained environments where latency is critical.

References

gemma3-qat on HuggingFace

HuggingFace (Unlsoth-DQ2)

HuggingFace (qat-Unsloth-DQ2)

### Notes
Uploading **Unsloth DynamicQuant2** versions for both the **1B** and the new **270m** Gemma 3 models. Unsloth's DQ2 offers better accuracy, especially at higher quants, making these small models more capable.

The default `latest`, `1b`, and `270m` tags point to the official Quantization-Aware Trained (QAT) versions, which deliver near-fp16 performance even at smaller quants like Q4_0.

---

### Description
**Google's Gemma 3:** lightweight, multimodal models built from the same research as the Gemini family.

Ideal for on-device tasks, quick summarization, chatbots, and other resource-constrained environments where latency is critical.

---

### References
[gemma3-qat on HuggingFace](https://huggingface.co/google/gemma-3-270m-it-qat-q4_0-unquantized)

[HuggingFace (Unlsoth-DQ2)](https://huggingface.co/unsloth/gemma-3-270m-it-GGUF)

[HuggingFace (qat-Unsloth-DQ2)](https://huggingface.co/unsloth/gemma-3-270m-it-qat-GGUF)

Paste, drop or click to upload images (.png, .jpeg, .jpg, .svg, .gif)