ahmadwaqar/smolvlm2-agentic-gui
29 Downloads · Updated 1 week ago
Lightweight 2.2B-parameter vision model for GUI automation: given a screenshot, it produces click, type, and scroll actions. Fine-tuned on the AGUVIS datasets for agentic reasoning. Available in Q8 (quantized) and FP16 (half-precision) variants. Apache 2.0 license.
vision
3 models

Name                         Digest        Size   Context  Input        Updated
smolvlm2-agentic-gui:latest  6634d7cc8777  2.5GB  8K       Text, Image  2 weeks ago
smolvlm2-agentic-gui:q8_0    68b43abec263  2.5GB  8K       Text, Image  1 week ago
smolvlm2-agentic-gui:fp16    a1bcc5544900  4.2GB  8K       Text, Image  1 week ago
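
To illustrate how one of the tags listed above might be queried with a screenshot, here is a minimal sketch using only the Python standard library and Ollama's local REST endpoint (`/api/generate`). Several things here are assumptions rather than facts from this page: that an Ollama server is running on the default `http://localhost:11434`, that the model is addressed with the `ahmadwaqar/` namespace prefix, and that a plain-language instruction is an acceptable prompt. The helper names `build_payload` and `ask` are hypothetical, and the format of the model's reply is not documented here, so the sketch returns the raw response text without parsing it.

```python
import base64
import json
import urllib.request

# Assumption: default address of a locally running Ollama server.
OLLAMA_URL = "http://localhost:11434/api/generate"
# Assumption: user-published models are addressed as <user>/<model>:<tag>.
MODEL = "ahmadwaqar/smolvlm2-agentic-gui:q8_0"


def build_payload(model, prompt, image_bytes):
    """Build the JSON body for /api/generate with one base64-encoded image."""
    return {
        "model": model,
        "prompt": prompt,
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,  # ask for a single JSON object instead of a stream
    }


def ask(prompt, image_path, model=MODEL, url=OLLAMA_URL):
    """Send a screenshot and an instruction; return the raw response text."""
    with open(image_path, "rb") as f:
        payload = build_payload(model, prompt, f.read())
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

Calling `ask("Click the Save button", "screenshot.png")` would send the image and instruction to the server; the file name and prompt wording are placeholders. Switching `MODEL` to the `:fp16` tag trades the larger 4.2GB download for unquantized weights.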