A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.
vision
tools
2b
45.9K Pulls Updated 7 weeks ago