yarn-llama2:13b-64k-q4_1

81.2K 1 year ago

An extension of Llama 2 that supports a context of up to 128k tokens.

7b 13b
e9d3a814cdd6 · 17B
{
"num_ctx": 65536
}