Starling-LM-10.7B-beta, an open large language model (LLM) trained by Reinforcement Learning from AI Feedback (RLAIF)

112 Pulls Updated 5 months ago

5e6cbd573f8f · 105B
{ "stop": [ "<|endoftext|>", "<|end_of_turn|>", "Human:", "Assistant:" ], "temperature": 0.1 }