A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone

vision

12 models