Models
GitHub
Discord
Docs
Cloud
Sign in
Download
Models
Download
GitHub
Discord
Docs
Cloud
Sign in
ahmadwaqar
/
smolvlm2-2.2b-instruct
:latest
63
Downloads
Updated
2 weeks ago
SmolVLM2-2.2B-Instruct is a compact multimodal model for image and video understanding. Built on SmolLM2-1.7B with SigLIP vision encoder. Supports visual QA, OCR, and video analysis. Available in Q8 and FP16 quantizations. Apache 2.0 license.
SmolVLM2-2.2B-Instruct is a compact multimodal model for image and video understanding. Built on SmolLM2-1.7B with SigLIP vision encoder. Supports visual QA, OCR, and video analysis. Available in Q8 and FP16 quantizations. Apache 2.0 license.
Cancel
vision
smolvlm2-2.2b-instruct:latest
...
/
params
ebe0966a2ea4 · 105B
{
"num_ctx": 16384,
"stop": [
"<|im_start|>",
"<|im_end|>"
],
"temperature": 0.7,
"top_p": 0.9
}