ahmadwaqar/smolvlm2-agentic-gui
171 Downloads • Updated 1 month ago
Lightweight 2.2B-parameter vision model for GUI automation: it clicks, types, and scrolls on screenshots. Fine-tuned for agentic reasoning, with output coordinates normalized to [0,1]. Available in Q4_K_M, Q8_0, and FP16 variants. Apache 2.0 license.
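Because the model reports coordinates normalized to [0,1], a caller has to scale them to the screenshot's resolution before clicking. The following is a minimal sketch of that step. The helper name `to_pixels` is hypothetical, and it assumes x runs left to right and y top to bottom from the top-left corner, which the model card does not state explicitly.

```python
def to_pixels(x_norm: float, y_norm: float, width: int, height: int) -> tuple[int, int]:
    """Map normalized [0, 1] coordinates to integer pixel coordinates.

    Assumption (not confirmed by the model card): the origin is the
    top-left corner of the screenshot.
    """
    # Clamp so slightly out-of-range predictions stay on screen.
    x = min(max(x_norm, 0.0), 1.0)
    y = min(max(y_norm, 0.0), 1.0)
    # Scale to pixels; cap at the last valid index so 1.0 does not overflow.
    px = min(int(x * width), width - 1)
    py = min(int(y * height), height - 1)
    return px, py


if __name__ == "__main__":
    # Centre of a 1920x1080 screenshot.
    print(to_pixels(0.5, 0.5, 1920, 1080))
```

Working in normalized space means the same prediction can be applied to a screenshot of any size; only this final scaling step depends on the resolution.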
vision
3 models

Name                         ID            Size   Context  Input        Updated
smolvlm2-agentic-gui:latest  9699caf78904  2.0GB  8K       Text, Image  1 month ago
smolvlm2-agentic-gui:q8_0    4fb6a1b4deab  2.8GB  8K       Text, Image  1 month ago
smolvlm2-agentic-gui:fp16    9f1f083bdab7  4.5GB  8K       Text, Image  1 month ago
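Any of the tags listed above can be selected when sending a screenshot to a locally running Ollama server. The sketch below builds a request for Ollama's `/api/generate` endpoint using only the standard library. The helper names `build_payload` and `ask_model` are hypothetical, the full model name assumes the `ahmadwaqar/` namespace shown on this page, and the instruction text is an illustration only: the prompt format this model expects is not documented here.

```python
import base64
import json
import urllib.request

# Default address of a local Ollama server; adjust if yours differs.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_payload(image_bytes: bytes, instruction: str, tag: str = "latest") -> dict:
    """Build a non-streaming generate request for one screenshot."""
    return {
        # Assumed full name: namespace from this page plus the chosen tag.
        "model": f"ahmadwaqar/smolvlm2-agentic-gui:{tag}",
        "prompt": instruction,
        # Ollama expects images as base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,
    }


def ask_model(payload: dict, url: str = OLLAMA_URL) -> str:
    """Send the payload and return the model's raw text response."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

A call might look like `ask_model(build_payload(open("screen.png", "rb").read(), "Click the OK button", tag="q8_0"))`, where `screen.png` is a placeholder path. Smaller tags trade some precision for lower memory use, so the choice of tag depends on the hardware available.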