DeepSeek-V4-Flash is a preview of the DeepSeek-V4 series, a Mixture-of-Experts model with 284B total parameters and 13B activated, built for efficient reasoning across a 1M-token context window.
121.1K Pulls 1 Tag Updated 1 month ago
Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash - is an efficient reasoning model distilled using high-quality data from DeepSeek-V4. + vision. ollama v.0.30.0-rc20 +
3,213 Pulls 1 Tag Updated 3 weeks ago
256 Pulls 1 Tag Updated 3 weeks ago
pull from hf.co/Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash-GGUF
2,691 Pulls 1 Tag Updated 1 month ago
Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash - is an efficient reasoning model distilled using high-quality data from DeepSeek-V4.
1,593 Pulls 1 Tag Updated 1 month ago
163 Pulls 1 Tag Updated 1 month ago
2 Pulls 1 Tag Updated 1 week ago