llava:7b-v1.6-mistral-q5_K_M

5.7M 15 months ago

๐ŸŒ‹ LLaVA is a novel end-to-end trained large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding. Updated to version 1.6.

vision 7b 13b 34b
c43332387573 ยท 67B
[INST] {{ if .System }}{{ .System }} {{ end }}{{ .Prompt }} [/INST]