
LLaVA is an end-to-end trained large multimodal model that combines a vision encoder with the Vicuna language model for general-purpose visual and language understanding.

vision
c43332387573 · 67B
[INST] {{ if .System }}{{ .System }} {{ end }}{{ .Prompt }} [/INST]
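
The template above uses Go template syntax: the system message is emitted, followed by a space, only when it is non-empty, and the user prompt is always wrapped in `[INST] ... [/INST]`. A minimal Python sketch of that expansion follows. The function name `render_prompt` and its parameters are hypothetical, chosen for illustration only; the real rendering is done by the template engine, not by this code.

```python
def render_prompt(prompt: str, system: str = "") -> str:
    """Mimic the template's expansion (illustrative helper, not a library API).

    The system text and its trailing space appear only when `system`
    is non-empty, mirroring `{{ if .System }}{{ .System }} {{ end }}`.
    """
    system_part = f"{system} " if system else ""
    return f"[INST] {system_part}{prompt} [/INST]"


if __name__ == "__main__":
    # Without a system message, only the user prompt is wrapped.
    print(render_prompt("Describe this image."))
    # With a system message, it is placed before the prompt.
    print(render_prompt("Describe this image.", system="Answer briefly."))
```

Because the conditional also controls the separating space, a prompt with no system message has exactly one space after `[INST]`.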