27 1 year ago

Llama 3.2 3B trained with Reinforcement Learning with Thought Process (RLTP) to perform search

ollama run medragondot/llama-3.2-rltp-v2
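Besides the interactive command above, the model can probably be queried programmatically once it has been pulled. The following is a minimal sketch, assuming a local Ollama server listening on its default port (11434) and using Ollama's `/api/generate` endpoint in non-streaming mode. The helper names `build_request` and `generate` are hypothetical and exist only for this illustration.

```python
import json
import urllib.request

# Assumption: a local Ollama server on its default address and port.
OLLAMA_URL = "http://localhost:11434/api/generate"
MODEL = "medragondot/llama-3.2-rltp-v2"


def build_request(prompt: str) -> urllib.request.Request:
    """Build a non-streaming generate request for the model (hypothetical helper)."""
    payload = {"model": MODEL, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str) -> str:
    """Send the prompt to the server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt)) as resp:
        return json.loads(resp.read())["response"]
```

Calling `generate("Who wrote The Old Man and the Sea?")` should return the model's reply as a single string, provided the server is running and the model has been pulled.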

Details

1415657e73f7 · 6.4GB

llama · 3.21B · F16
<|begin_of_text|><|start_header_id|>system<|end_header_id|> Cutting Knowledge Date: December 2023 To
{ "stop": [ "<|start_header_id|>", "<|end_header_id|>", "<|eot_id|>",
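The `stop` parameter above lists sequences at which generation ends, so the returned text does not run on into the next chat turn. A rough sketch of that effect follows, using only the three sequences visible in the truncated listing (the full list may contain more). The function `truncate_at_stop` is a hypothetical illustration, not part of Ollama.

```python
# Only the stop sequences visible in the truncated params listing.
STOP_SEQUENCES = ["<|start_header_id|>", "<|end_header_id|>", "<|eot_id|>"]


def truncate_at_stop(text, stops=STOP_SEQUENCES):
    """Cut text at the earliest occurrence of any stop sequence."""
    cut = len(text)
    for stop in stops:
        idx = text.find(stop)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]
```

For example, `truncate_at_stop("Paris<|eot_id|><|start_header_id|>user")` keeps only the text before the first stop sequence.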

Readme

No readme