DeepSeek-V4-Flash is a preview of the DeepSeek-V4 series, a Mixture-of-Experts model with 284B total parameters and 13B activated, built for efficient reasoning across a 1M-token context window.
120.9K Pulls 1 Tag Updated 1 month ago
Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash - is an efficient reasoning model distilled using high-quality data from DeepSeek-V4. + vision. ollama v.0.30.0-rc20 +
3,191 Pulls 1 Tag Updated 3 weeks ago
255 Pulls 1 Tag Updated 3 weeks ago
pull from hf.co/Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash-GGUF
2,690 Pulls 1 Tag Updated 1 month ago
Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash - is an efficient reasoning model distilled using high-quality data from DeepSeek-V4.
1,590 Pulls 1 Tag Updated 1 month ago
163 Pulls 1 Tag Updated 1 month ago
2 Pulls 1 Tag Updated 1 week ago