← Back to Ai-Edu Hub

🖥️ Run AI Offline — Free & Private

Set up local LLMs with Ollama, OpenClaw, LM Studio, and more. No API keys. No cloud. No monthly bills.

⚠️ Before You Install

  • System Access: OpenClaw can read files, run terminal commands, and make changes to your computer.
  • Permissions: You'll be asked to grant various permissions during setup. Review each one carefully.
  • Local Models: Your data stays on your computer when using local models. No internet required after setup.
  • Cloud Models: If you connect cloud APIs (GPT, Claude, etc.), data is sent to those providers. Local-only is the default.
  • Your Responsibility: You are responsible for what OpenClaw does on your system. Review actions before confirming.

What You Can Do With Local AI

💬
Chat with AI
Talk to local models offline. No internet, no API keys, no usage limits. Completely private.
📁
Read & Write Files
Analyze documents, edit code, organize folders — all locally, all with natural language.
🌐
Code Generation
Qwen3-Coder and DeepSeek R1 run locally. Production-quality code, 80-150 tokens/second.
⚙️
Run Commands
OpenClaw executes terminal commands. Automate tasks, build scripts, manage your system.
🧠
Reasoning
DeepSeek R1 runs chain-of-thought reasoning locally. Think step-by-step, on your hardware.
🔌
Integrations
Connect to iMessage, Telegram, Discord, WhatsApp. Use AI everywhere, powered by local models.

💻 Hardware Requirements (April 2026)

8GB RAM — Entry Level

  • 8GB RAM (Apple Silicon or 8GB VRAM GPU)
  • macOS 13+, Windows 10, or Linux
  • 10GB free disk space
  • Models: Qwen3-Coder 8B, Llama 3.2 3B, Phi-4 Mini
  • ~20-40 tokens/second

24GB+ RAM — Power User

  • 24GB+ RAM (M-series Mac or 24GB VRAM)
  • Apple M1/M2/M3/M4 Pro/Max or RTX 4090
  • 50GB+ free disk space
  • Models: Llama 4 Scout 17B, QwQ 32B, Mistral Small 24B
  • ~60-150 tokens/second

🍎 Apple Silicon note: M-series Macs use unified memory — the GPU shares system RAM. An 8GB M2 Mac mini runs Qwen3-Coder 8B at ~40 tok/s. A 24GB M4 Pro runs Llama 4 Scout at ~60 tok/s. No discrete GPU needed.

📥 Installation (Ollama + OpenClaw)

🍎 macOS
🪟 Windows
🐧 Linux
1

Download OpenClaw

Download the latest macOS release from the official website:

2

Install the App

  1. Open the downloaded .dmg file
  2. Drag OpenClaw to your Applications folder
  3. Open OpenClaw from Applications
  4. If you see "unidentified developer" warning:
    • Go to System Preferences → Privacy & Security
    • Click "Open Anyway" next to the security warning
3

Grant Permissions

OpenClaw will request several permissions:

  • Accessibility: Required for keyboard shortcuts and automation
  • Full Disk Access: Required to read/write files in your folders
  • Automation: Required to control other apps

Go to System Preferences → Privacy & Security and enable these for OpenClaw.

4

Install Ollama (for local models)

OpenClaw uses Ollama for local AI models:

  1. Download Ollama from ollama.com
  2. Install it like any Mac app
  3. Open Terminal and run: ollama pull qwen3-coder (best local coding model)

Recommended first models (April 2026):

  • ollama pull qwen3-coder — 8B, best coding model under 10B, 8GB RAM
  • ollama pull deepseek-r1:14b — reasoning model, 16GB RAM
  • ollama pull llama3.2 — general chat, 8GB RAM
  • ollama pull phi4-mini — ultra-fast, 8GB RAM

OpenClaw detects Ollama automatically. See AI Pricing Guide for the full model comparison.

5

Configure & Start

  1. Open OpenClaw
  2. Choose your default model (local or cloud)
  3. Set up memory and preferences
  4. Try asking: "What can you help me with?"
1

Download OpenClaw

Download the Windows installer:

2

Run the Installer

  1. Run the downloaded .exe file
  2. Follow the installation wizard
  3. Choose install location (default is fine)
  4. Launch OpenClaw from the Start Menu

Windows may show a "Windows protected your PC" warning. Click "More info" then "Run anyway" if you downloaded from the official source.

3

Grant Permissions

OpenClaw may request:

  • File system access: To read and write files
  • Network access: To connect to AI services
  • Windows Terminal: For running commands

Approve these in the Windows security dialogs that appear.

4

Install Ollama (for local models)

  1. Download Ollama from ollama.com
  2. Run the installer
  3. Open PowerShell and run: ollama pull qwen3-coder
5

Configure & Start

  1. Open OpenClaw from Start Menu
  2. Choose your default model
  3. Set up preferences
  4. Start chatting!
1

Download OpenClaw

Download the Linux package (AppImage or .deb/.rpm):

2

Install

For AppImage:

chmod +x openclaw-*.AppImage
./openclaw-*.AppImage

For Debian/Ubuntu (.deb):

sudo dpkg -i openclaw-*.deb
sudo apt-get install -f  # Fix dependencies

For Fedora/RHEL (.rpm):

sudo rpm -i openclaw-*.rpm
3

Install Ollama

curl -fsSL https://ollama.com/install.sh | sh
ollama pull qwen3-coder
4

Run OpenClaw

Launch from your application menu or run:

openclaw

🏆 Best Local Models (April 2026)

DeepSeek R1 14B

  • Best for: Reasoning, analysis
  • RAM: 16GB
  • Speed: 40-55 tok/s
  • Pull: ollama pull deepseek-r1:14b
  • Chain-of-thought on budget hardware.

Llama 3.2 3B

  • Best for: General chat, quick tasks
  • RAM: 8GB
  • Speed: 40-60 tok/s
  • Pull: ollama pull llama3.2
  • Fast, reliable, well-tested.

💡 Tip: Install 2-3 models for different tasks. Qwen3-Coder for code, DeepSeek R1 for reasoning, Llama 3.2 for quick chat. Switch models in OpenClaw with one click. See the AI Pricing Guide for full comparisons including API models.

🛠️ Other Ways to Run Local AI

OpenClaw + Ollama

  • AI assistant with memory, tools, and integrations
  • Telegram, Discord, iMessage, WhatsApp
  • File access, terminal, web search, automation
  • openclaw.ai

Ollama (CLI Only)

  • Terminal-based model runner
  • Lightweight, no GUI needed
  • Also exposes OpenAI-compatible API on port 11434
  • ollama.com

⚙️ Initial Configuration

After installation, OpenClaw will guide you through setup. Here's what to configure:

Model Selection

  • Local (Ollama): Free, private, offline-capable. Try qwen3-coder for coding or deepseek-r1:14b for reasoning.
  • Cloud: More powerful but sends data to servers. Add API keys for ChatGPT, Claude, etc.

Memory Setup

OpenClaw can remember information across conversations. In settings:

  • Enable "Long-term memory" to remember facts about you
  • Set memory location (default is fine)
  • Configure what to remember/forget

Integrations (Optional)

Connect to messaging platforms in Settings → Integrations:

  • iMessage: Chat with AI via Messages app (Mac only)
  • Telegram: Create a bot for Telegram chats
  • Discord: Add AI to your Discord server

🔧 Troubleshooting

"Ollama not found" error

Make sure Ollama is installed and running:

# Check if Ollama is running
ollama list

# If not, start it
ollama serve

"Permission denied" errors

On macOS, make sure OpenClaw has:

  • Accessibility access (System Preferences → Privacy & Security)
  • Full Disk Access (System Preferences → Privacy & Security)

Slow responses

  • Use a smaller model (phi4-mini or llama3.2:3b for 8GB RAM)
  • Close other resource-intensive apps
  • Check you have enough RAM for the model size