3,079 Pulls Updated 3 days ago
990a904bdaf3 · 3.6GB
Readme
Granite dense models
The IBM Granite 2B and 8B models are text-only dense LLMs trained on over 12 trillion tokens of data. In IBM’s initial testing, they demonstrated significant improvements over their predecessors in both performance and speed.
They are designed to support tool-based use cases and retrieval-augmented generation (RAG), streamlining code generation, translation, and bug fixing.
Parameter Sizes
2B:
ollama run granite3.1-dense:2b
8B:
ollama run granite3.1-dense:8b
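Beyond the CLI commands above, a pulled model can be queried programmatically through the local Ollama server. The sketch below builds a request body for Ollama's `/api/generate` endpoint; the prompt is illustrative, and the helper assumes `ollama serve` is running at the default port (`11434`).

```python
import json
import urllib.request

# Request body for Ollama's /api/generate endpoint.
# The model tag matches the 2B command above; the prompt is illustrative.
payload = {
    "model": "granite3.1-dense:2b",
    "prompt": "Summarize in one sentence: Granite models are text-only dense LLMs.",
    "stream": False,  # return one JSON object instead of a token stream
}

def generate(body, host="http://localhost:11434"):
    """POST the body to a local Ollama server and return the completion text."""
    req = urllib.request.Request(
        f"{host}/api/generate",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

With the `2b` tag pulled and the server running, `generate(payload)` returns the model's completion as a string.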
Supported Languages
English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, Chinese (Simplified)
Capabilities
- Summarization
- Text classification
- Text extraction
- Question-answering
- Retrieval Augmented Generation (RAG)
- Code related tasks
- Function-calling tasks
- Multilingual dialog use cases
- Long-context tasks including long document/meeting summarization, long document QA, etc.
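For the function-calling capability listed above, Ollama's `/api/chat` endpoint accepts an OpenAI-style tool schema. The sketch below builds such a request body; the `get_weather` tool and its parameters are hypothetical, included only to show the shape of the request.

```python
import json

# Hypothetical tool definition in the JSON-schema format Ollama accepts.
tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",  # hypothetical tool, for illustration only
        "description": "Get the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {
                "city": {"type": "string", "description": "City name"},
            },
            "required": ["city"],
        },
    },
}]

# Request body for Ollama's /api/chat endpoint with tools attached.
chat_request = {
    "model": "granite3.1-dense:8b",
    "messages": [{"role": "user", "content": "What is the weather in Tokyo?"}],
    "tools": tools,
    "stream": False,
}

# If the model chooses to call a tool, its reply message carries a
# `tool_calls` list; the caller runs the tool, appends the result as a
# message with role "tool", and sends the conversation back to /api/chat.
body = json.dumps(chat_request)
```

This only constructs the payload; sending it requires a running Ollama server with the `8b` tag pulled.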
Granite mixture of experts models
The Granite mixture of experts models are available in 1B and 3B parameter sizes and are designed for low-latency use.
Learn more
- Developers: IBM Research
- GitHub Repository: ibm-granite/granite-language-models
- Website: Granite Docs
- Release Date: December 18th, 2024
- License: Apache 2.0