ashishpatel26/ granite-8b-code:latest

334 Downloads Updated 1 year ago

This is IBM Release opensource code model.

ollama run ashishpatel26/granite-8b-code

curl http://localhost:11434/api/chat \
  -d '{
    "model": "ashishpatel26/granite-8b-code",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='ashishpatel26/granite-8b-code',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'ashishpatel26/granite-8b-code',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

Updated 1 year ago

1 year ago

59068e1ea036 · 8.6GB ·

model

archllama

·

parameters8.05B

·

quantizationQ8_0

8.6GB

params

{ "num_keep": 24, "stop": [ "<|start_header_id|>", "<|end_header_id|>",

110B

template

{{ if .System }}<|start_header_id|>system<|end_header_id|> {{ .System }}<|eot_id|>{{ end }}{{ if .Pr

260B

Readme

Granite-8B-Code-Base

Model Summary

Granite-8B-Code-Base is a decoder-only code model designed for code generative tasks (e.g., code generation, code explanation, code fixing, etc.). It is trained from scratch with a two-phase training strategy. In phase 1, our model is trained on 4 trillion tokens sourced from 116 programming languages, ensuring a comprehensive understanding of programming languages and syntax. In phase 2, our model is trained on 500 billion tokens with a carefully designed mixture of high-quality data from code and natural language domains to improve the models’ ability to reason and follow instructions.

Developers: IBM Research
GitHub Repository: ibm-granite/granite-code-models
Paper: Granite Code Models: A Family of Open Foundation Models for Code Intelligence
Release Date: May 6th, 2024
License: Apache 2.0.

Intended use

Prominent enterprise use cases of LLMs in software engineering productivity include code generation, code explanation, code fixing, generating unit tests, generating documentation, addressing technical debt issues, vulnerability detection, code translation, and more. All Granite Code Base models, including the 8B parameter model, are able to handle these tasks as they were trained on a large amount of code data from 116 programming languages.

# Granite-8B-Code-Base

## Model Summary
**Granite-8B-Code-Base** is a decoder-only code model designed for code generative tasks (e.g., code generation, code explanation, code fixing, etc.). It is trained from scratch with a two-phase training strategy. In phase 1, our model is trained on 4 trillion tokens sourced from 116 programming languages, ensuring a comprehensive understanding of programming languages and syntax. In phase 2, our model is trained on 500 billion tokens with a carefully designed mixture of high-quality data from code and natural language domains to improve the models’ ability to reason and follow instructions.

- **Developers:** IBM Research
- **GitHub Repository:** [ibm-granite/granite-code-models](https://github.com/ibm-granite/granite-code-models)
- **Paper:** [Granite Code Models: A Family of Open Foundation Models for Code Intelligence](https://arxiv.org/abs/2405.04324)
- **Release Date**: May 6th, 2024
- **License:** [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0).

### Intended use
Prominent enterprise use cases of LLMs in software engineering productivity include code generation, code explanation, code fixing, generating unit tests, generating documentation, addressing technical debt issues, vulnerability detection, code translation, and more. All Granite Code Base models, including the **8B parameter model**, are able to handle these tasks as they were trained on a large amount of code data from 116 programming languages.

Paste, drop or click to upload images (.png, .jpeg, .jpg, .svg, .gif)