37 1 week ago

GLM-4.6V-Flash (9B) is a lightweight model optimized for local deployment and low-latency applications. It scales its context window to 128k tokens in training and achieves SoTA performance in visual understanding among models of similar parameter scales.

9b
77c25c0df517 · 35B
{
"num_ctx": 4096,
"temperature": 0.7
}