A series of models from Groq that represent a significant advancement in open-source AI capabilities for tool use/function calling.

tools

264 4 months ago

Readme

image.png

These models, developed in collaboration with Glaive, represent a significant advancement in open-source AI capabilities for tool use/function calling.

Benchmark Results

These models have achieved remarkable results, setting new benchmarks for Large Language Models with tool use capabilities:

Llama-3-Groq-70B-Tool-Use: 90.76% overall accuracy (#1 on BFCL at the time of publishing - July 2024) Llama-3-Groq-8B-Tool-Use: 89.06% overall accuracy (#3 on BFCL at the time of publishing - July 2024)

References

Hugging Face

Blog