A series of multimodal LLMs (MLLMs) designed for vision-language understanding.

Vision 7B

12.2K Pulls Updated 9 days ago

f02dd72bb242 · 59B
{ "stop": [ "<|im_start|>", "<|im_end|>" ] }