ollama run frob/glm-5.1:744b-a40b-ud-q4_K_XL
Updated 11 hours ago
11 hours ago
8c2556e51f3a · 466GB ·
Imported from hf.co/unsloth/GLM-5.1-GGUF.
Note that ollama does not yet support this model. To run it, ollama needs to be patched with #14864.
$ git clone https://github.com/ollama/ollama.git .
$ git checkout v0.20.0
$ curl -L https://github.com/ollama/ollama/pull/14864.diff | patch -p1
$ docker build -t ollama/ollama:0.20.0-14864 .
This model wants to use the tool format it was trained with and resists using the simpler JSON format that is easily encoded in an ollama template. For this reason, this model is not a good tool user. Better tool use will be enabled with an ollama PARSER.
$ ollama run frob/glm-5.1
>>> Why don't scientists trust atoms?
Thinking...
1. **Analyze the Request:** The user is asking a classic riddle/joke: "Why don't
scientists trust atoms?"
2. **Identify the Intent:** The intent is humor/wordplay based on a well-known
scientific joke.
3. **Retrieve Knowledge:** Access the punchline for this specific joke. The
standard punchline is "Because they make up everything!"
4. **Formulate Response:** Deliver the punchline clearly and concisely, perhaps
with a slight playful tone since it's a joke.
...done thinking.
Because they make up everything!
$ ollama run frob/glm-5.1 hello. --think=false
Hello! How can I help you today? 😊