medragondot/llama-3.2-rltp

medragondot/

llama-3.2-rltp:latest

20 Downloads Updated 10 months ago

Reinforcement Learning with Thought Process Llama 3.2 3B to achieve search

Updated 10 months ago

10 months ago

764981b8bc87 · 6.4GB

archllama

·

parameters3.21B

·

quantizationF16

6.4GB

<|begin_of_text|><|start_header_id|>system<|end_header_id|> Cutting Knowledge Date: December 2023 To

389B

{ "stop": [ "<|start_header_id|>", "<|end_header_id|>", "<|eot_id|>",

158B

Readme

No readme