granite3-moe

granite3-moe

930.8K Downloads Updated 1 year ago

The IBM Granite 1B and 3B models are the first mixture of experts (MoE) Granite models from IBM designed for low latency usage.

tools 1b 3b

ollama run granite3-moe

curl http://localhost:11434/api/chat \
  -d '{
    "model": "granite3-moe",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='granite3-moe',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'granite3-moe',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Applications

Claude Code

Claude Code ollama launch claude --model granite3-moe

OpenCode

OpenCode ollama launch opencode --model granite3-moe

Hermes Agent

Hermes Agent ollama launch hermes --model granite3-moe

OpenClaw

OpenClaw ollama launch openclaw --model granite3-moe

Models

Name

33 models

Size / Usage

Context

Input

granite3-moe:latest

822MB · 4K context window · Text · 1 year ago

granite3-moe:latest

822MB

4K

Text

granite3-moe:1b

822MB · 4K context window · Text · 1 year ago

granite3-moe:1b latest

822MB

4K

Text

granite3-moe:3b

2.1GB · 4K context window · Text · 1 year ago

granite3-moe:3b

2.1GB

4K

Text

Readme

Granite mixture of experts models

The IBM Granite 1B and 3B models are the first mixture of experts (MoE) Granite models from IBM designed for low latency usage.

The models are trained on over 10 trillion tokens of data, the Granite MoE models are ideal for deployment in on-device applications or situations requiring instantaneous inference.

Parameter Sizes

1B:

ollama run granite3-moe:1b

3B:

ollama run granite3-moe:3b

Supported Languages

English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, Chinese (Simplified)

Capabilities

Summarization
Text classification
Text extraction
Question-answering
Retrieval Augmented Generation (RAG)
Code related
Function-calling
Multilingual dialog use cases

Granite dense models

The Granite dense models are available in 2B and 8B parameter sizes designed to support tool-based use cases and for retrieval augmented generation (RAG), streamlining code generation, translation and bug fixing.

Learn more

Developers: IBM Research
GitHub Repository: ibm-granite/granite-3.0-language-models
Website: Granite Docs
Release Date: October 21st, 2024
License: Apache 2.0.

![An illustration of Ollama holding a beautiful flower with the IBM Rebus logo of the Eye, Bee and M, made by Paul Rand.](https://ollama.com/assets/library/granite3-moe/6ea49528-3ff2-4fcc-98b2-01f6104254d2)

### Granite mixture of experts models

The IBM Granite **1B and 3B models** are the first mixture of experts (MoE) Granite models from IBM designed for low latency usage.

The models are trained on over 10 trillion tokens of data, the Granite MoE models are ideal for deployment in on-device applications or situations requiring instantaneous inference.

## Parameter Sizes

**1B:**
  
`ollama run granite3-moe:1b`

**3B:**

`ollama run granite3-moe:3b`

## Supported Languages
English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, Chinese (Simplified)

### Capabilities
* Summarization
* Text classification
* Text extraction
* Question-answering
* Retrieval Augmented Generation (RAG)
* Code related 
* Function-calling
* Multilingual dialog use cases

### Granite dense models

The Granite dense models are available in **2B and 8B** parameter sizes designed to support tool-based use cases and for retrieval augmented generation (RAG), streamlining code generation, translation and bug fixing.

[See model page](https://ollama.com/library/granite3-dense)

### Learn more

- **Developers:** IBM Research
- **GitHub Repository:** [ibm-granite/granite-3.0-language-models](https://github.com/ibm-granite/granite-3.0-language-models)
- **Website**: [Granite Docs](https://www.ibm.com/granite/docs/)
- **Release Date**: October 21st, 2024
- **License:** [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0).

Paste, drop or click to upload images (.png, .jpeg, .jpg, .svg, .gif)