mannix/starling-lm-10.7b:q6

mannix/ starling-lm-10.7b:q6_K

158 Downloads Updated 1 year ago

Starling-LM-10.7B-beta, an open large language model (LLM) trained by Reinforcement Learning from AI Feedback (RLAIF)

ollama run mannix/starling-lm-10.7b:q6_K

curl http://localhost:11434/api/chat \
  -d '{
    "model": "mannix/starling-lm-10.7b:q6_K",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='mannix/starling-lm-10.7b:q6_K',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'mannix/starling-lm-10.7b:q6_K',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

Updated 1 year ago

1 year ago

d37a9ec6b121 · 8.8GB ·

model

archllama

parameters10.7B

quantizationQ6_K

8.8GB

params

{ "stop": [ "<|endoftext|>", "<|end_of_turn|>", "Human:", "Assis

105B

template

"{{ if .System }}GPT4 Correct System: {{ .System }}<|end_of_turn|>{{ end }}{{ if .Prompt }}GPT4 Corr

202B

Readme

This is Starling-LM-10.7B-beta, a depth-upscaled version of Nexusflow/Starling-LM-7B-beta.

This model is intended to be used as a drop-in upgrade from the original 7 billion parameter model.

We introduce Starling-LM-7B-beta, an open large language model (LLM) trained by Reinforcement Learning from AI Feedback (RLAIF). Starling-LM-7B-beta is trained from Openchat-3.5-0106 with our new reward model Nexusflow/Starling-RM-34B and policy optimization method Fine-Tuning Language Models from Human Preferences (PPO). Harnessing the power of the ranking dataset, berkeley-nest/Nectar, the upgraded reward model, Starling-RM-34B, and the new reward training and policy tuning pipeline, Starling-LM-7B-beta scores an improved 8.12 in MT Bench with GPT-4 as a judge.

Important: The model output can be verbose in rare cases. Please consider setting temperature = 0 to make this happen less. Default temperature is set to 0.1

@HuggingFace https://huggingface.co/bartowski/Starling-LM-10.7B-beta-GGUF