Description
This AI tool simplifies the process of running machine learning models locally, enabling developers to easily deploy and interact with various models without extensive setup. Its user-friendly interface streamlines the integration of AI into applications.
This version of Ollama changes the architecture to support llama.cpp directly instead of building on top of GGML, and is compatible with the GGUF file format. MLX is used to accelerate model inference on Apple Silicon.
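Because this release depends on GGUF compatibility, it can help to confirm that a local model file really is GGUF before reporting a load error. The following is a minimal sketch, not part of Ollama itself: it relies only on the GGUF header layout (the four magic bytes "GGUF" followed by a little-endian 32-bit version number). The function name read_gguf_version is hypothetical and chosen for illustration.

```python
import struct

GGUF_MAGIC = b"GGUF"  # the first four bytes of a GGUF file


def read_gguf_version(path):
    """Return the GGUF format version, or None if the file is not GGUF.

    Hypothetical helper for illustration; it inspects only the first
    eight bytes and does not validate the rest of the file.
    """
    with open(path, "rb") as f:
        header = f.read(8)
    if len(header) < 8 or header[:4] != GGUF_MAGIC:
        return None
    # The version is a little-endian uint32 stored right after the magic.
    (version,) = struct.unpack("<I", header[4:8])
    return version
```

Calling read_gguf_version on a model file path returns an integer version for a GGUF file and None for anything else, which is a quick first check before filing a crash report.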
While in pre-release, we'd love feedback on:
Performance improvements or degradation
Errors or crashes that did not previously occur
Memory utilization improvements or degradation
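To make a performance report comparable between versions, one option is to compute generation throughput from the timing fields that Ollama's /api/generate endpoint returns (eval_count, and eval_duration and load_duration in nanoseconds). The sketch below assumes a server on the default local address http://localhost:11434. The helper names are hypothetical, and it covers throughput and load time only, not memory use.

```python
import json
import urllib.request

# Assumed default local endpoint; adjust if your server listens elsewhere.
OLLAMA_URL = "http://localhost:11434/api/generate"


def generate(model, prompt, url=OLLAMA_URL):
    """Send one non-streaming request and return the parsed JSON response."""
    payload = json.dumps(
        {"model": model, "prompt": prompt, "stream": False}
    ).encode("utf-8")
    req = urllib.request.Request(
        url, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def tokens_per_second(response):
    """Generated tokens per second, from eval_count and eval_duration (ns)."""
    duration_ns = response.get("eval_duration", 0)
    if duration_ns <= 0:
        return 0.0
    return response.get("eval_count", 0) / (duration_ns / 1e9)


def summarize(response):
    """Collect the figures worth quoting in a performance report."""
    return {
        "load_seconds": response.get("load_duration", 0) / 1e9,
        "tokens_per_second": tokens_per_second(response),
    }
```

Running summarize(generate(model_name, "Why is the sky blue?")) with the same model and prompt on the previous release and on this pre-release gives two sets of numbers that can be quoted side by side. Here model_name stands for whichever model you already have pulled.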
Known issues:
...
Codex App
The OpenAI Codex App is now available on Ollama. Use any Ollama model — local or cloud — inside the desktop app to code, browse, and review.
ollama launch codex-app
Built-in browser
Codex can spin up local servers and sites in its built-in browser. Annotate directly on the page to request changes.
Review mode
Review code inside the app, leave comments, and iterate without leaving your workspace.
Choosing a model
For difficult coding and agentic tasks:
...