ibm/granite3.1-moe:3b-base-q5_K_S

f7830781cc25 · granitemoe · 3.3B · Q5_K_S · 2.3GB

System prompt: Knowledge Cutoff Date: April 2024. You are Granite, developed by IBM.

License: Apache License, Version 2.0

Readme

Granite mixture of experts models

The IBM Granite 1B and 3B models are long-context mixture of experts (MoE) models designed for low-latency usage.

Trained on over 10 trillion tokens of data, the Granite MoE models are ideal for deployment in on-device applications or situations requiring instantaneous inference.

Running

1B:

ollama run ibm/granite3.1-moe:1b

3B:

ollama run ibm/granite3.1-moe:3b
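
Beyond the CLI, the models can be queried programmatically through Ollama's local REST API. The following is a minimal sketch, assuming an Ollama server running on the default port 11434 with the 3B model already pulled; the prompt text is illustrative.

import json
import urllib.request

# Illustrative chat request against the local Ollama server.
payload = {
    "model": "ibm/granite3.1-moe:3b",
    "messages": [
        {"role": "user", "content": "Summarize mixture of experts models in two sentences."}
    ],
    "stream": False,  # return one JSON object rather than a token stream
}

req = urllib.request.Request(
    "http://localhost:11434/api/chat",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)

with urllib.request.urlopen(req) as resp:
    reply = json.loads(resp.read())

print(reply["message"]["content"])

Leaving stream at its default of true makes the same endpoint return tokens incrementally, which suits the low-latency use cases these models target.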

Supported Languages

English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, Chinese (Simplified)

Capabilities

  • Summarization
  • Text classification
  • Text extraction
  • Question-answering
  • Retrieval Augmented Generation (RAG)
  • Code-related tasks
  • Function-calling (see the sketch after this list)
  • Multilingual dialog use cases
  • Long-context tasks including long document/meeting summarization, long document QA, etc.
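
As a sketch of the function-calling capability: Ollama's /api/chat endpoint accepts a tools array of JSON-schema function definitions, and the model can respond with structured tool calls. In the example below, get_weather is a hypothetical tool used only to illustrate the request and response shape, and a local Ollama server on the default port 11434 is assumed.

import json
import urllib.request

# Hypothetical tool definition in the JSON-schema format the API expects.
tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",  # hypothetical tool, for illustration only
        "description": "Get the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {
                "city": {"type": "string", "description": "City name"},
            },
            "required": ["city"],
        },
    },
}]

payload = {
    "model": "ibm/granite3.1-moe:3b",
    "messages": [{"role": "user", "content": "What is the weather in Paris?"}],
    "tools": tools,
    "stream": False,
}

req = urllib.request.Request(
    "http://localhost:11434/api/chat",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)

with urllib.request.urlopen(req) as resp:
    message = json.loads(resp.read())["message"]

# If the model chose to call a tool, each entry in tool_calls names the
# function and carries its parsed arguments; the caller runs the tool and
# sends the result back in a follow-up message with role "tool".
for call in message.get("tool_calls", []):
    print(call["function"]["name"], call["function"]["arguments"])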

Granite dense models

The Granite dense models are available in 2B and 8B parameter sizes and are designed to support tool-based use cases and retrieval-augmented generation (RAG), streamlining code generation, translation, and bug fixing.


Learn more