25 1 month ago

The Google Gemma 3 models are multimodal—processing text and images—and feature a 128K context window with support for over 140 languages....

vision