63 1 year ago

It is an LLM fine-tuned from Llama-3.2-3B-Instruct, capable of reasoning in the format <reasoning>...</reasoning><answer>...</answer>. Its capability might improve with further training.

tools