This model was built by applying a new Smaug recipe, designed to improve performance on real-world multi-turn conversations, to Meta-Llama-3-70B-Instruct. It has a 32k context length.

134 5 months ago

4 Tags
5a62d16a2608 • 40GB • 5 months ago
d0e922d7ab6c • 21GB • 5 months ago
ae673363eee3 • 38GB • 5 months ago
5a62d16a2608 • 40GB • 5 months ago