codegemma:2b-code-q3_K_S

codegemma:2b-code-q3_K_S

3.1M Downloads Updated 2 years ago

CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.

2b 7b

ollama run codegemma:2b-code-q3_K_S

curl http://localhost:11434/api/chat \
  -d '{
    "model": "codegemma:2b-code-q3_K_S",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='codegemma:2b-code-q3_K_S',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'codegemma:2b-code-q3_K_S',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

Updated 2 years ago

2 years ago

6b80287fa068 · 1.3GB ·

model

archgemma

·

parameters2.51B

·

quantizationQ3_K_S

1.3GB

template

{{- if .Suffix }}<|fim_prefix|>{{ .Prompt }}<|fim_suffix|>{{ .Suffix }}<|fim_middle|> {{- else }}{{

121B

license

Gemma Terms of Use Last modified: February 21, 2024 By using, reproducing, modifying, distributing,

8.4kB

params

{ "repeat_penalty": 1, "stop": [ "<|fim_prefix|>", "<|fim_suffix|>",

142B

Readme

CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.

Variants:

instruct a 7b instruction-tuned variant for natural language-to-code chat and instruction following
code a 7b pretrained variant that specializes in code completion and generation from code prefixes and/or suffixes
2b a state of the art 2B pretrained variant that provides up to 2x faster code completion

Advantages:

Intelligent code completion and generation: Complete lines, functions, and even generate entire blocks of code, whether you’re working locally or using Google Cloud resources.
Enhanced accuracy: Trained on 500 billion tokens of primarily English language data from web documents, mathematics, and code, CodeGemma models generate code that’s not only more syntactically correct but also semantically meaningful, reducing errors and debugging time.
Multi-language proficiency: Supports Python, JavaScript, Java, Kotlin, C++, C#, Rust, Go, and other languages.
Streamlined workflows: Integrate a CodeGemma model into your development environment to write less boilerplate and focus on interesting and differentiated code that matters, faster.

Fill-in-the-middle

CodeGemma models support fill-in-the-middle (FIM), for use in autocomplete or coding assistant tooling. Below is an example using the Ollama Python library:

response = generate(
  model='codegemma:2b-code',
  prompt=f'<|fim_prefix|>{prefix}<|fim_suffix|>{suffix}<|fim_middle|>',
  options={
    'num_predict': 128,
    'temperature': 0,
    'top_p': 0.9,
    'stop': ['<|file_separator|>'],
  },
)

References

<img src="https://github.com/ollama/ollama/assets/251292/56ffc5fc-0c30-4ab5-a0e3-65fc66de17bc" width="320" />

CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.

### Variants:

* `instruct` a 7b instruction-tuned variant for natural language-to-code chat and instruction following
* `code` a 7b pretrained variant that specializes in code completion and generation from code prefixes and/or suffixes
* `2b` a state of the art 2B pretrained variant that provides up to 2x faster code completion

### Advantages:

* **Intelligent code completion and generation**: Complete lines, functions, and even generate entire blocks of code, whether you're working locally or using Google Cloud resources.

* **Enhanced accuracy**: Trained on 500 billion tokens of primarily English language data from web documents, mathematics, and code, CodeGemma models generate code that's not only more syntactically correct but also semantically meaningful, reducing errors and debugging time.

* **Multi-language proficiency**: Supports Python, JavaScript, Java, Kotlin, C++, C#, Rust, Go, and other languages.

* **Streamlined workflows**: Integrate a CodeGemma model into your development environment to write less boilerplate and focus on interesting and differentiated code that matters, faster.

![benchmarks](https://github.com/ollama/ollama/assets/251292/0d8473cb-bcee-4bd0-9214-c527ce367d88)

### Fill-in-the-middle

CodeGemma models support fill-in-the-middle (FIM), for use in autocomplete or coding assistant tooling. Below is an example using the Ollama [Python](https://github.com/ollama/ollama-python) library:

```python
response = generate(
  model='codegemma:2b-code',
  prompt=f'<|fim_prefix|>{prefix}<|fim_suffix|>{suffix}<|fim_middle|>',
  options={
    'num_predict': 128,
    'temperature': 0,
    'top_p': 0.9,
    'stop': ['<|file_separator|>'],
  },
)
```

### References

[Hugging Face](https://huggingface.co/collections/google/codegemma-release-66152ac7b683e2667abdee11)

[Report](https://storage.googleapis.com/deepmind-media/gemma/codegemma_report.pdf)

Paste, drop or click to upload images (.png, .jpeg, .jpg, .svg, .gif)