A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone

vision
e6daeaa4f14b · 112B
{
  "num_ctx": 4096,
  "stop": [
    "[\"<|im_start|>\",\"<|im_end|>\"]"
  ],
  "temperature": 0.7,
  "top_p": 0.9
}
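
The `stop` list in this parameter block holds a single string that is itself a JSON-encoded array, so a consumer that compares stop entries literally would probably never match `<|im_start|>` or `<|im_end|>`. The following is a minimal sketch, not the runtime's actual behaviour, of how a client could flatten such an entry and then cut generated text at the first stop marker. The helper names `normalize_stop` and `truncate_at_stop` are hypothetical and exist only for illustration.

```python
import json


def normalize_stop(stop):
    """Flatten a stop list, decoding any entry that is itself a JSON-encoded list.

    Hypothetical helper: plain stop strings pass through unchanged.
    """
    flat = []
    for entry in stop:
        try:
            decoded = json.loads(entry)
        except (TypeError, ValueError):
            # Not valid JSON (e.g. "<|im_end|>"), so keep the entry as-is.
            decoded = entry
        if isinstance(decoded, list):
            flat.extend(str(item) for item in decoded)
        else:
            flat.append(entry)
    return flat


def truncate_at_stop(text, stops):
    """Return text up to (not including) the earliest stop sequence, if any."""
    cut = len(text)
    for marker in stops:
        index = text.find(marker)
        if index != -1:
            cut = min(cut, index)
    return text[:cut]


# Parameter values copied from the block above.
params = {
    "num_ctx": 4096,
    "stop": ["[\"<|im_start|>\",\"<|im_end|>\"]"],
    "temperature": 0.7,
    "top_p": 0.9,
}

stops = normalize_stop(params["stop"])
reply = truncate_at_stop("A cat on a sofa.<|im_end|>ignored tail", stops)
```

Decoding defensively keeps the sketch usable whether the stop list arrives as plain strings or as one encoded string; the other parameters (`num_ctx`, `temperature`, `top_p`) are passed through untouched.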