ahmadwaqar/smolvlm2-2.2b-instruct/params

ahmadwaqar/smolvlm2-2.2b-instruct:latest

801 Downloads · Updated 7 months ago

SmolVLM2-2.2B-Instruct is a compact multimodal model for image and video understanding. It is built on SmolLM2-1.7B with a SigLIP vision encoder and supports visual QA, OCR, and video analysis. Available in Q8 and FP16 variants. Apache 2.0 license.

vision

params

ebe0966a2ea4 · 105B

{
  "num_ctx": 16384,
  "stop": [
    "<|im_start|>",
    "<|im_end|>"
  ],
  "temperature": 0.7,
  "top_p": 0.9
}
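
The parameters above are the model's defaults. As a rough sketch of how they could be reused or overridden per request, the Python below builds a request body for Ollama's `/api/generate` endpoint, which accepts these keys under `options`. The helper name `build_generate_payload`, the example prompt, and the overridden temperature are assumptions for illustration and are not part of this model page. The snippet only constructs and prints the payload; it does not contact a server.

```python
import json

# Default parameters, copied from the params layer shown above.
PARAMS_JSON = """
{
  "num_ctx": 16384,
  "stop": ["<|im_start|>", "<|im_end|>"],
  "temperature": 0.7,
  "top_p": 0.9
}
"""

MODEL = "ahmadwaqar/smolvlm2-2.2b-instruct:latest"


def build_generate_payload(prompt, overrides=None):
    """Build a request body for POST /api/generate (hypothetical helper).

    Starts from the model's default params and applies any per-request
    overrides on top of them.
    """
    options = json.loads(PARAMS_JSON)
    options.update(overrides or {})
    return {
        "model": MODEL,
        "prompt": prompt,
        "stream": False,  # ask for a single JSON response instead of a stream
        "options": options,
    }


# Example: lower the temperature for a more deterministic answer.
payload = build_generate_payload(
    "List three uses of a vision-language model.",
    {"temperature": 0.2},
)
print(json.dumps(payload, indent=2))
```

To send the request, the payload could be posted as JSON to a running Ollama server (by default `http://localhost:11434/api/generate`). For image input, that endpoint also accepts an `images` list of base64-encoded files alongside `prompt`.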