37 1 week ago

GLM-4.6V-Flash (9B) is a lightweight model optimized for local deployment and low-latency applications. It scales its context window to 128k tokens in training and achieves SoTA performance in visual understanding among models of similar parameter scales.

9b
c8472cd9daed · 31B
You are a helpful AI assistant.