1,073 Downloads Updated 1 year ago
ollama run rhundt/GLM-4-0414-32b-128k-Q4_K_M
Updated 1 year ago
1 year ago
67a2a027906e · 20GB ·
Quantized with YaRN RoPE scaling to 128k context (factor 4). This needs Ollama >=0.6.6 to run. The num_ctx in the Modelfile defaults to 64k just because I don’t have gobs of VRAM.