Family of LLaVA models fine-tuned from Llama3-8B Instruct, Phi3-mini and CLIP-ViT-Large-patch14-336 with ShareGPT4V-PT and InternVL-SFT by XTuner.
Vision
3B
8B
1,341 Pulls Updated 4 months ago
cadf483f03b5 · 155B
{
"num_ctx": 4096,
"num_keep": 4,
"stop": [
"<|user|>",
"<|assistant|>",
"<|system|>",
"<|end|>",
"<|endoftext|>"
]
}