Llama 3.2 Vision
November 6, 2024
Llama 3.2 Vision 11B and 90B models are now available in Ollama.
IBM Granite 3.0 models
October 21, 2024
Ollama partners with IBM to bring Granite 3.0 models to Ollama.
Llama 3.2 goes small and multimodal
September 25, 2024
Ollama partners with Meta to bring Llama 3.2 to Ollama.
Reduce hallucinations with Bespoke-Minicheck
September 18, 2024
Bespoke-Minicheck is a new grounded factuality checking model developed by Bespoke Labs that is now available in Ollama. It can fact-check responses generated by other models to detect and reduce hallucinations.
Tool support
July 25, 2024
Ollama now supports tool calling with popular models such as Llama 3.1. This enables a model to answer a given prompt using tool(s) it knows about, making it possible for models to perform more complex tasks or interact with the outside world.
Google Gemma 2
June 27, 2024
Gemma 2 is now available on Ollama in 3 sizes - 2B, 9B and 27B.
An entirely open-source AI code assistant inside your editor
May 31, 2024
Continue enables you to easily create your own coding assistant directly inside Visual Studio Code and JetBrains with open-source LLMs.
Google announces Firebase Genkit with Ollama support
May 20, 2024
At Google IO 2024, Google announced Ollama support in Firebase Genkit, a new open-source framework for developers to build, deploy and monitor production-ready AI-powered apps.
Llama 3 is not very censored
April 19, 2024
Compared to Llama 2, Llama 3 feels much less censored. Meta has substantially lowered false refusal rates. Llama 3 will refuse less than 1/3 of the prompts previously refused by Llama 2.
Llama 3
April 18, 2024
Llama 3 is now available to run on Ollama. This model is the next generation of Meta's state-of-the-art large language model, and is the most capable openly available LLM to date.
Embedding models
April 8, 2024
Embedding models are available in Ollama, making it easy to generate vector embeddings for use in search and retrieval augmented generation (RAG) applications.
Ollama now supports AMD graphics cards
March 14, 2024
Ollama now supports AMD graphics cards in preview on Windows and Linux. All the features of Ollama can now be accelerated by AMD graphics cards on Ollama for Linux and Windows.
Windows preview
February 15, 2024
Ollama is now available on Windows in preview, making it possible to pull, run and create large language models in a new native Windows experience. Ollama on Windows includes built-in GPU acceleration, access to the full model library, and serves the Ollama API including OpenAI compatibility.
OpenAI compatibility
February 8, 2024
Ollama now has initial compatibility with the OpenAI Chat Completions API, making it possible to use existing tooling built for OpenAI with local models via Ollama.
Vision models
February 2, 2024
New vision models are now available: LLaVA 1.6, in 7B, 13B and 34B parameter sizes. These models support higher resolution images, improved text recognition and logical reasoning.
Python & JavaScript Libraries
January 23, 2024
The initial versions of the Ollama Python and JavaScript libraries are now available, making it easy to integrate your Python or JavaScript, or Typescript app with Ollama in a few lines of code. Both libraries include all the features of the Ollama REST API, are familiar in design, and compatible with new and previous versions of Ollama.
Building LLM-Powered Web Apps with Client-Side Technology
October 13, 2023
Recreate one of the most popular LangChain use-cases with open source, locally running software - a chain that performs Retrieval-Augmented Generation, or RAG for short, and allows you to “chat with your documents”
Ollama is now available as an official Docker image
October 5, 2023
Ollama can now run with Docker Desktop on the Mac, and run inside Docker containers with GPU acceleration on Linux.
Leveraging LLMs in your Obsidian Notes
September 21, 2023
This post walks through how you could incorporate a local LLM using Ollama in Obsidian, or potentially any note taking tool.
How to prompt Code Llama
September 9, 2023
This guide walks through the different ways to structure prompts for Code Llama and its different variations and features including instructions, code completion and fill-in-the-middle (FIM).
Run Code Llama locally
August 24, 2023
Meta's Code Llama is now available on Ollama to try.
Run Llama 2 uncensored locally
August 1, 2023
This post will give some example comparisons running Llama 2 uncensored model versus its censored model.