22 Downloads Updated 2 weeks ago
Apache License Version 2.0
https://ollama.com/library/qwen2.5vl
https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct
NOTICE
Recommended for transcribing and summarizing text from screenshots.
Q4_K_M
ollama_pull_virtuoso() {
ollama pull mirage335/"$1"
ollama cp mirage335/"$1" "$1"
ollama rm mirage335/"$1"
}
ollama_pull_virtuoso Qwen-2_5-VL-7B-Instruct-virtuoso
Recommended environment variables. KV_CACHE quantization “q4_0” in particular is NOT COMPATIBLE.
export OLLAMA_NUM_THREADS=18
export OLLAMA_FLASH_ATTENTION=1
export OLLAMA_KV_CACHE_TYPE="q8_0"
export OLLAMA_NEW_ENGINE=true
export OLLAMA_NOHISTORY=true
export OLLAMA_NUM_PARALLEL=1
export OLLAMA_MAX_LOADED_MODELS=1
Adjust OLLAMA_NUM_THREADS and/or disable HyperThreading, etc, to prevent crippling performance loss.
Pulling the model this way relies on the ollama repository, and more generally, reliability of internet services, which has been rather significantly fragile.
If possible, you should use the “Llama-3-virtuoso” project, which automatically caches an automatically installable backup copy.