NVIDIA DGX Spark

The latest NVIDIA DGX Spark is here! Ollama has partnered with NVIDIA to ensure it runs fast and efficiently out-of-the-box.

Powered by the NVIDIA GB10 Grace Blackwell Superchip, the NVIDIA DGX delivers 1 petaFLOP of performance for prototyping and running local language models on Ollama.

With 128GB of memory, you can run the latest models from Alibaba (Qwen), DeepSeek, Meta (Llama), Mistral, Google (Gemma), OpenAI (Gpt-oss), and many more from Ollama’s library. You can also upload and bring your own custom or fine-tuned models.

We can’t wait to see what you’ll build with the latest NVIDIA DGX Spark!

In the meantime, we’re working with NVIDIA to optimize Ollama’s performance and testing it across the use cases we see most often—chat, document processing (retrieval, OCR, modification), code tasks, and multimodal workflows.

Learn more about the NVIDIA DGX Spark.

Get started with Ollama

Download Ollama

October 13, 2025

Get started with Ollama