Models
GitHub
Discord
Turbo
Sign in
Download
Models
Download
GitHub
Discord
Sign in
lucataco
/
deepseek-v3-64k
:latest
18
Downloads
Updated
7 months ago
A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.
A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.
Cancel
deepseek-v3-64k:latest
...
/
params
1c16f08d3b69 · 164B
{
"num_ctx": 64000,
"stop": [
"<|begin▁of▁sentence|>",
"<|end▁of▁sentence|>",
"<|User|>",
"<|Assistant|>"
]
}