A series of llava-based model for automating the creation of precise and accessible alt text descriptions for social media.

vision 2b 4b 8b 13b 34b

84 2 months ago

1750e15c298f · 97B
{
"num_ctx": 4096,
"num_keep": 4,
"num_predict": 1024,
"stop": [
"USER:",
"ASSISTANT:"
],
"temperature": 0.1
}