-
minicpm-o2.6
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
vision 8b26.9K Pulls 13 Tags Updated 1 year ago
-
minicpm-v4.5
A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
vision 8b18.4K Pulls 11 Tags Updated 2 weeks ago
-
minicpm-v4.6
A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone
vision 1b9,607 Pulls 13 Tags Updated 2 weeks ago
-
minicpm-o4.5
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Mulitmodal Live Streaming on Your Phone
vision 8b7,304 Pulls 12 Tags Updated 4 months ago
-
minicpm-v2.6
A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
vision 8b2,666 Pulls 12 Tags Updated 1 year ago
-
minicpm5
highly efficient large language models (LLMs) designed explicitly for end-side devices
2,152 Pulls 4 Tags Updated 3 weeks ago
-
minicpm-v4
A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
vision 4b1,987 Pulls 12 Tags Updated 10 months ago
-
minicpm4.1
highly efficient large language models (LLMs) designed explicitly for end-side devices
1,325 Pulls 1 Tag Updated 9 months ago
-
minicpm-v2.5
A GPT-4V Level Multimodal LLM on Your Phone
vision 8b461 Pulls 13 Tags Updated 1 year ago