Llama 3.1 Swallow is a series of large language models (8B, 70B) that were built by continual pre-training on the Meta Llama 3.1 models.

8B

101 Pulls Updated 8 days ago

{ "stop": [ "<|start_header_id|>", "<|end_header_id|>", "<|eot_id|>" ] }