234 Downloads Updated 7 months ago
Name
1 model
Size
Context
Input
DeepSeek-R1-Distill-Qwen-7B:latest
5.4GB · 128K context window · Text · 7 months ago
5.4GB
128K
Text
Configured longer sequence length. I recommend running with flash attention and kv-cache quantization if you run out of VRAM.