DeepSeek R1 Distilled model to one-fourth its original file size—without losing any accuracy.

259 6 weeks ago

{
"num_ctx": 131072,
"stop": [
"<|end▁of▁sentence|>"
]
}