LLaVA is an end-to-end trained large multimodal model that combines a vision encoder with the Vicuna language model for general-purpose visual and language understanding.

vision
63fcaa2d012b · 29B
You are a friendly assistant.