InternVL3 is a new multimodal large language model that represents a significant advancement over its predecessor, InternVL 2.5.
The model uses Native Multimodal Pre-Training, which allows it to outperform even the Qwen2.5 series on text tasks, even though its language component is initialized from Qwen2.5’s pre-trained base models.
InternVL3 combines stronger foundational capabilities with a broader range of practical applications across visual, textual, and interactive domains.