Ollama
Run and manage large language models locally on your own machine.
Ollama doubles as an MLX runtime and a local backend for coding agents
◆Recent moves
- 3d ago
Tool-call JSON parsing fix; MLX and llama.cpp bumps
A release-candidate maintenance tag: a tool-call parser fix that ignores braces inside JSON strings, plus MLX and llama.cpp dependency bumps. It fits the steady cadence of upstream syncs rather than any directional move.
View source ↗ - 7d ago
Adds Ornith 9B renderer and parser support
Adds renderer and parser support for the Ornith 9B model, the kind of per-model plumbing Ollama ships continuously to keep pace with new model releases.
View source ↗ - 8d ago
Auto-install Claude Code/opencode; MLX speculative decoding
The substantive release of this window: auto-install for Claude Code and opencode, Codex model-drift detection, MLX speculative-decoding tuning, and context-headroom fixes. It advances the agent-launcher direction while hardening the MLX engine.
View source ↗ - 15d ago
Command A and North models run on Apple Silicon via MLX
Brings the Command A and North model families to Apple Silicon via the MLX engine and bumps llama.cpp to build 9672, incremental progress on the MLX-as-first-class-engine thread.
View source ↗ - 15d ago
CI: pin Darwin release Xcode version
A CI-only change pinning the Darwin release Xcode version, with no user-visible effect.
View source ↗ - 16d ago
Updates bundled llama.cpp engine to build b9672
A single llama.cpp engine bump to build b9672 ahead of the 0.30.10 release, routine upstream maintenance.
View source ↗