A series of multimodal LLMs (MLLMs) designed for vision-language understanding.
vision
8b
42.1K Pulls Updated 3 days ago
f02dd72bb242 · 59B
{
"stop": [
"<|im_start|>",
"<|im_end|>"
]
}