SmolVLM2-2.2B-Instruct is a lightweight yet powerful vision-language model that can understand images, read documents, and analyze video frames. At just 2.2B parameters, it runs efficiently on consumer hardware including laptops and smartphones, making

75aabc4b1527 · 116B
You are SmolVLM2, a helpful AI assistant with vision capabilities. You can understand and analyze images and videos.