Ollama

PatchBot will keep your Discord channel up-to-date on all the latest Ollama patch notes.

Servers

587

Gamers

117,418

Game details

Description This AI tool simplifies the process of running machine learning models locally, enabling developers to easily deploy and interact with various models without extensive setup. Its user-friendly interface streamlines the integration of AI into applications.
Release Date Jul 8, 2023

Latest updates

PatchBot BOT 1 day ago

v0.32.1

What's Changed

Improved Gemma 4 tool calling and multi-turn reasoning, including more reliable tool-response continuations
Fixed a recurrent MLX model cache leak that could increase memory use across requests, and improved cache snapshot performance
MLX text model loading now respects OLLAMA_LOAD_TIMEOUT
Agent web search and fetch now tell users to run ollama signin when authentication is required ...

Git tag

v0.32.1

PatchBot BOT 3 days ago

v0.32.0

What's Changed

New interactive agent experience: running ollama now launches an agent to help you code and delegate work

❯ ollama
Ollama 0.32.0
▸ Chat, Code, & Work (glm-5.2:cloud)
    Chat with models, code, search the web, and delegate real work

Renamed the Codex App integration to ChatGPT: use ollama launch chatgpt (and --restore to return to your usual ChatGPT profile) ...

Git tag

v0.32.0

PatchBot BOT 1 week ago

v0.31.2

What's Changed

Enabled flash attention on older NVIDIA GPUs (compute capability 6.x)
iGPU can now offload vision models with padding to fit available memory
Fixed structured output for thinking models when thinking is disabled
Hardened GGUF model creation
ollama launch for Claude Code now disables telemetry by default
Fixed loading models on paths with non-UTF-8 characters
Updated the MLX and llama.cpp engines New Contributors ...

Git tag

v0.31.2

PatchBot BOT 2 weeks ago

v0.31.1

What's Changed

mlx: tighten up gemma4 moe loading code by @pdevine in #16964
mlx: bump to latest version to include new small batch matmul kernel @jessegross @dhiltgen
llama.cpp: bump to b9840 @dhiltgen
improved gemma4 MTP performance @jessegross

Git tag

v0.31.1

PatchBot BOT 2 weeks ago

v0.30.11

What's Changed

launch: add thinking capability detection to opencode by @hoyyeva in #15434
launch: auto-install Claude Code by @hoyyeva in #16802
launch: auto-install opencode when missing by @hoyyeva in #16806 ...

Git tag

v0.30.11

PatchBot BOT 4 weeks ago

v0.30.10

What's Changed

models: add Cohere2MoE model by @jmorganca in #16670
llama: update llama.cpp to b9672 by @pdevine in #16775

Git tag

v0.30.10

PatchBot BOT 1 day ago

v0.32.1

What's Changed

Improved Gemma 4 tool calling and multi-turn reasoning, including more reliable tool-response continuations
Fixed a recurrent MLX model cache leak that could increase memory use across requests, and improved cache snapshot performance
MLX text model loading now respects OLLAMA_LOAD_TIMEOUT
Agent web search and fetch now tell users to run ollama signin when authentication is required ...

Git tag

v0.32.1

PatchBot BOT 1 day ago

v0.32.1

What's Changed

Improved Gemma 4 tool calling and multi-turn reasoning, including more reliable tool-response continuations
Fixed a recurrent MLX model cache leak that could increase memory use across requests, and improved cache snapshot performance
MLX text model loading now respects OLLAMA_LOAD_TIMEOUT
Agent web search and fetch now tell users to run ollama signin when authentication is required ...

Git tag

v0.32.1-rc0

PatchBot BOT 6 days ago

v0.32.0

What's Changed

create: select the qwen3.5 parser and renderer for Qwen3.5/Next by @jessegross in #17078
launch: warn before old agent models by @ParthSareen in #17063
cmd: agent UI by @ParthSareen in #17017

Git tag

v0.32.0-rc0

PatchBot BOT 1 week ago

v0.31.2

What's Changed

ci: avoid unbounded parallelism by @dhiltgen in #16966
fix cuda toolkit lookup and parallel by @MarkWard0110 in #16613
docs(cloud): update retirement list by @mxyng in #17000
discover: fall back to standard CUDA when the JetPack runner is absent by @kevinpark1217 in #16949 ...

Git tag

v0.31.2

PatchBot BOT 1 week ago

v0.31.2

What's Changed

ci: avoid unbounded parallelism by @dhiltgen in #16966
fix cuda toolkit lookup and parallel by @MarkWard0110 in #16613
docs(cloud): update retirement list by @mxyng in #17000
discover: fall back to standard CUDA when the JetPack runner is absent by @kevinpark1217 in #16949 ...

Git tag

v0.31.2-rc2

PatchBot BOT 1 week ago

v0.31.2

What's Changed

ci: avoid unbounded parallelism by @dhiltgen in #16966
fix cuda toolkit lookup and parallel by @MarkWard0110 in #16613
docs(cloud): update retirement list by @mxyng in #17000
discover: fall back to standard CUDA when the JetPack runner is absent by @kevinpark1217 in #16949 ...

Git tag

v0.31.2-rc1

FANTASY LIFE i: The Girl Who Steals Time