93 1 week ago

SmolVLM2-2.2B-Instruct is a lightweight yet powerful vision-language model that can understand images, read documents, and analyze video frames. At just 2.2B parameters, it runs efficiently on consumer hardware including laptops and smartphones, making

45f0a49de5c7 · 164B
# SmolVLM2-2.2B-Instruct Q8_0
Higher quality quantization (1.8 GB).
## Links
- **Original Model:** HuggingFaceTB/SmolVLM2-2.2B-Instruct
## License
Apache 2.0