Run open source LLMs locally or in the cloud
Ollama lets you run large language models locally on your own hardware at no cost, or use cloud models for faster responses. Download and run Llama, Mistral, Qwen, DeepSeek, and other open source models with a simple CLI. Local models run without usage limits and keep your data completely private; cloud features add the ability to run multiple models simultaneously and to access larger models. Ollama integrates with VS Code, Claude Code, LangChain, LlamaIndex, Dify, n8n, and 40,000+ community integrations, and its OpenAI-compatible API makes it easy to swap into existing applications. It can also run fully offline in air-gapped environments.
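The OpenAI-compatible API means existing clients can be pointed at a local Ollama server just by changing the base URL. A minimal sketch using only the Python standard library, assuming Ollama's default port 11434 and an illustrative model name (`llama3.2` here stands in for whatever model you have pulled):

```python
import json
from urllib import request

# Ollama serves an OpenAI-compatible API at this base URL by default.
OLLAMA_BASE = "http://localhost:11434/v1"

def build_chat_request(model: str, prompt: str) -> request.Request:
    """Build an OpenAI-style chat completion request aimed at a local Ollama server."""
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return request.Request(
        f"{OLLAMA_BASE}/chat/completions",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_chat_request("llama3.2", "Why is the sky blue?")

# Sending the request requires a running `ollama serve` with the model pulled:
# with request.urlopen(req) as resp:
#     reply = json.loads(resp.read())["choices"][0]["message"]["content"]
```

Because the request shape matches OpenAI's chat completions endpoint, swapping an existing application over is typically just a base-URL and model-name change.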