A long-context model with a 2-million-token context window, built on Llama-3.1.

ollama run tukia/nvidia-ultralong-2M

Readme

This model is Nemotron-UltraLong-8B, from https://huggingface.co/nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct.

The context window is 2 million tokens.
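
Ollama usually loads a model with a default context length far below the model's maximum, so a longer window generally has to be requested explicitly. The sketch below shows one way to do that through Ollama's local REST API by passing `num_ctx` in `options`. The endpoint URL assumes a default local install, and the helper names `build_request` and `generate` and the `131072` default are illustrative assumptions, not part of this model's documentation. A larger window needs correspondingly more memory, so pick a value your hardware can hold.

```python
import json
import urllib.request

MODEL = "tukia/nvidia-ultralong-2M"
# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt, num_ctx=131072):
    """Build the JSON body for a non-streaming generate call.

    num_ctx is the requested context length in tokens; the default here
    is an arbitrary illustrative value, not a recommendation.
    """
    return {
        "model": MODEL,
        "prompt": prompt,
        "stream": False,
        "options": {"num_ctx": num_ctx},
    }


def generate(prompt, num_ctx=131072, url=OLLAMA_URL):
    """Send the request to a running Ollama server and return the text."""
    body = json.dumps(build_request(prompt, num_ctx)).encode("utf-8")
    req = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

With the server running and the model pulled, `generate("Summarize this document: ...", num_ctx=262144)` would request a 262,144-token window for that call.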