yarn-llama2:13b-128k-q4_K_M

80.7K 1 year ago

An extension of Llama 2 that supports a context of up to 128k tokens.

7b 13b
1639d5c1f004 · 18B
{
"num_ctx": 131072
}