113 Downloads Updated 2 months ago
I’m uploading these gemma3n
models as Unsloth DynamicQuant2.0 versions. This is a significant point: it means that even at higher quantization levels, you’re getting much better accuracy than what you might expect from standard quants. These models are truly optimized for performance on everyday devices without compromising quality as much.
The idea here is to make these incredibly efficient models even more accessible and performant for your local setup.
Gemma 3n models are designed for efficient execution on everyday devices like laptops, tablets, or phones. These models have been trained extensively with data in over 140 spoken languages, making them incredibly versatile.
A key innovation in Gemma 3n is the use of selective parameter activation technology. This technique allows the models to operate at an effective size of 2B and 4B parameters, which is considerably lower than their total parameter count. This efficiency makes them ideal for local deployment without sacrificing too much capability.
Gemma 3n models offer a wide range of applications, democratizing access to powerful AI on personal devices. They are well-suited for: