| Metric | Score | Base Model Score |
|---|---|---|
| Agentic Similarity | 86 | 84.67 |
| CoT Contextual Accuracy | 56 / 3 | 54 / 5 |
| Medical GPT Score | 65 | 51.75 |
This model scores higher than previous Llama 3.1 8B implementations on every metric in the table above, with the largest gain on Medical GPT Score (65 vs. 51.75).
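To put the differences in perspective, the gains over the base model can be computed directly from the table. The snippet below is a minimal sketch, not part of the original evaluation: the `scores` dictionary and the `gains` helper are hypothetical names introduced here, and the CoT Contextual Accuracy row is left out because its two-part values (`56 / 3` and `54 / 5`) are not defined in this section.

```python
# Scores copied from the table as (model score, base model score).
# CoT Contextual Accuracy is omitted: the meaning of its "a / b" format
# is not stated, so no single number can be compared safely.
scores = {
    "Agentic Similarity": (86.0, 84.67),
    "Medical GPT Score": (65.0, 51.75),
}


def gains(scores):
    """Return {metric: (absolute gain in points, relative gain in percent)}."""
    result = {}
    for metric, (model, base) in scores.items():
        absolute = model - base
        relative = 100.0 * absolute / base
        result[metric] = (round(absolute, 2), round(relative, 1))
    return result


for metric, (absolute, relative) in gains(scores).items():
    print(f"{metric}: +{absolute} points ({relative}% over base)")
```

Under these assumptions, the Medical GPT Score gain is much larger than the Agentic Similarity gain, in both absolute and relative terms.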