6 23 hours ago

is a novel end-to-end trained large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding.

vision

23 hours ago

9d11228500e6 · 4.7GB

llama
·
7.24B
·
Q4_0
clip
·
312M
·
F16
Apache License Version 2.0, January 2004 http://www.apache.org/licenses/ TERMS AND CONDITIONS FOR US
[INST] {{ if .System }}{{ .System }} {{ end }}{{ .Prompt }} [/INST]
You are a friendly assistant.
{ "stop": [ "[INST]", "[/INST]" ] }

Readme

No readme