Fixed num_ctx to 8192. This Dolphin 2.9 Llama 3 8B model is ready to use for full model's 8k contexts window.

112 6 months ago

Readme

Dolphin-2.9-Llama-3-8B

Model Quants Size Bit Perplexity
dolphin-llama3-8b:Q4_0 4.7GB 4 +0.2166 ppl
dolphin-llama3-8b:Q4_K_M 4.9GB 4 +0.0532 ppl
dolphin-llama3-8b:Q5_K_M 5.7GB 5 +0.0122 ppl
dolphin-llama3-8b:Q6_K 6.6GB 6 +0.0008 ppl
Config

max_position_embeddings : 128258
rope_theta : 500000.0
vocab_size : 8192

Remarks
  • ‘latest’ model points to Q4_0
  • modelfile has 8192 num_ctx activated (Ollama default only 2048)