elvee/ capybarahermes-2.5-mistral:7b-q6_K

239 Downloads Updated 2 years ago

This is the llama.cpp GGUF Quantised version of CapyBaraHermes 2.5 Mistral 7B from Argilla.

ollama run elvee/capybarahermes-2.5-mistral:7b-q6_K

curl http://localhost:11434/api/chat \
  -d '{
    "model": "elvee/capybarahermes-2.5-mistral:7b-q6_K",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='elvee/capybarahermes-2.5-mistral:7b-q6_K',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'elvee/capybarahermes-2.5-mistral:7b-q6_K',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

Updated 2 years ago

2 years ago

4719eda6b89a · 5.9GB ·

model

archllama

·

parameters7.24B

·

quantizationQ6_K

5.9GB

params

{ "stop": [ "<|im_start|>", "<|im_end|>" ] }

59B

template

<|im_start|>system {{ .System }}<|im_end|> <|im_start|>user {{ .Prompt }}<|im_end|> <|im_start|>assi

106B

Readme

CapybaraHermes-2.5-Mistral-7B - GGUF

Description

This model is the launching partner of the capybara-dpo dataset build with ⚗️ distilabel. It’s a preference tuned OpenHermes-2.5-Mistral-7B.

CapybaraHermes has been preference tuned with LoRA and TRL for 3 epochs using argilla’s dpo mix 7k.

To test the impact on multi-turn performance we have used MTBench. We also include the Nous Benchmark results and Mistral-7B-Instruct-v0.2 for reference as it’s a strong 7B model on MTBench. The most interesting aspect in the context of the capybara-dpo dataset is the increased performance in MTBench Second Turn scores.

For the merge lovers, we also preference tuned Beagle14-7B with a mix of capybara-dpo and distilabel orca pairs using the same recipe as NeuralBeagle (see YALL - Yet Another LLM Leaderboard for reference)

Prompt Format

CapybaraHermes-2.5-Mistral-7B uses ChatML as the prompt format, opening up a much more structured system for engaging the LLM in multi-turn chat dialogue. It uses a system prompt to establish rules and roles, and steer the interaction between the user and the model.

Resources

# CapybaraHermes-2.5-Mistral-7B - GGUF
![Image of the Capybara Hermes 2.5 Mistral logo](https://cdn-uploads.huggingface.co/production/uploads/60420dccc15e823a685f2b03/Vmr0FtTvnny6Snm-UDM_n.png)

## Description
This model is the launching partner of the [capybara-dpo dataset](https://huggingface.co/datasets/argilla/distilabel-capybara-dpo-9k-binarized) build with ⚗️ distilabel. It's a preference tuned [OpenHermes-2.5-Mistral-7B](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B).

CapybaraHermes has been preference tuned with LoRA and TRL for 3 epochs using argilla's [dpo mix 7k](https://huggingface.co/datasets/argilla/dpo-mix-7k).

To test the impact on multi-turn performance we have used MTBench. We also include the Nous Benchmark results and Mistral-7B-Instruct-v0.2 for reference as it's a strong 7B model on MTBench. The most interesting aspect in the context of the capybara-dpo dataset is the increased performance in MTBench Second Turn scores.

For the merge lovers, we also preference tuned Beagle14-7B with a mix of capybara-dpo and distilabel orca pairs using the same recipe as NeuralBeagle (see [ YALL - Yet Another LLM Leaderboard](https://huggingface.co/spaces/mlabonne/Yet_Another_LLM_Leaderboard) for reference)

## Prompt Format
CapybaraHermes-2.5-Mistral-7B uses ChatML as the prompt format, opening up a much more structured system for engaging the LLM in multi-turn chat dialogue. It uses a system prompt to establish rules and roles, and steer the interaction between the user and the model.

## Resources
- [https://huggingface.co/TheBloke/CapybaraHermes-2.5-Mistral-7B-GGUF](https://huggingface.co/TheBloke/CapybaraHermes-2.5-Mistral-7B-GGUF)
- [https://huggingface.co/argilla/CapybaraHermes-2.5-Mistral-7B](https://huggingface.co/argilla/CapybaraHermes-2.5-Mistral-7B)

Paste, drop or click to upload images (.png, .jpeg, .jpg, .svg, .gif)