-
glm-4.6v-flash
418 Pulls 1 Tag Updated 3 months ago
-
wedlm-7b-base
Tencent WeDLM-7B-Base converted to GGUF (Q4_K_M). A text-diffusion model based on the Qwen2.5 architecture, optimized for efficient parallel decoding.
120 Pulls 1 Tag Updated 2 months ago
-
qwen3-ro-rel-extract
Specialized Romanian relation extraction model (Qwen3 4B) with structured JSON output. Tags: f16 (high precision) and q4_k_m (fast).
33 Pulls 3 Tags Updated 2 months ago