2,213 3 days ago

LFM2.5-8B-A1B, an edge model built for fast, reliable tool calling on consumer hardware.

tools thinking 8b
ollama run lfm2.5

Applications

Claude Code
Claude Code ollama launch claude --model lfm2.5
Codex App
Codex App ollama launch codex-app --model lfm2.5
OpenClaw
OpenClaw ollama launch openclaw --model lfm2.5
Hermes Agent
Hermes Agent ollama launch hermes --model lfm2.5
Codex
Codex ollama launch codex --model lfm2.5
OpenCode
OpenCode ollama launch opencode --model lfm2.5

Models

View all →

Readme

image.png

LFM2.5 is a new family of hybrid models designed for on-device deployment. It builds on the LFM2 architecture with extended pre-training and reinforcement learning.

  • On-device personal assistant: Designed to power real-life applications, chaining tool calls, and following complex instructions on all devices.
  • Compressed performance: Competitive with much larger dense and MoE models on instruction following and agentic tasks.
  • Unmatched throughput: Fastest in its size class on both CPU and GPU inference

benchmark