A series of llava-based model for automating the creation of precise and accessible alt text descriptions for social media.
vision
2b
4b
8b
13b
34b
75 Pulls Updated 7 weeks ago