795 2 months ago

A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

vision 8b
e6daeaa4f14b · 112B
{
"num_ctx": 4096,
"stop": [
"[\"<|im_start|>\",\"<|im_end|>\"]"
],
"temperature": 0.7,
"top_p": 0.9
}