Native Linux setup with full NVIDIA GPU acceleration for optimal performance.
Tip
🐧 Recommended for Linux: the native setup gives the best performance, with full NVIDIA GPU acceleration.
- Linux distribution (Ubuntu 20.04+, Fedora 35+, Arch, Debian, etc.)
- 8GB+ RAM (16GB+ recommended for GPU acceleration)
- 10GB free disk space
- Python 3.11 or higher
- PortAudio development libraries (required for audio features):
  - Ubuntu/Debian: `sudo apt install portaudio19-dev`
  - Fedora: `sudo dnf install portaudio-devel`
  - Arch: `sudo pacman -S portaudio`
  - openSUSE: `sudo zypper install portaudio-devel`
- NVIDIA GPU (GTX 1060+ or RTX series recommended)
- NVIDIA drivers 470+ installed
- CUDA 11.7+ installed
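
The version minimums above can be checked before running the setup script. The following is a minimal sketch, not part of the project's scripts: the helper names `version_ge` and `check_min` are hypothetical, and the comparison assumes GNU `sort -V` is available, as it is on most Linux distributions.

```bash
#!/usr/bin/env bash
# Sketch: compare installed versions against the minimums listed above.

# version_ge A B: succeeds when dotted version A >= B (relies on GNU `sort -V`).
version_ge() {
    [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

# check_min LABEL FOUND REQUIRED: print one status line per requirement.
check_min() {
    if [ -z "$2" ]; then
        echo "$1: not found (need $3+)"
    elif version_ge "$2" "$3"; then
        echo "$1: $2 OK"
    else
        echo "$1: $2 is too old (need $3+)"
    fi
}

# Each probe is allowed to fail; a missing tool simply yields an empty version.
py_ver=$(python3 --version 2>/dev/null | awk '{print $2}') || true
drv_ver=$(nvidia-smi --query-gpu=driver_version --format=csv,noheader 2>/dev/null | head -n 1) || true
cuda_ver=$(nvcc --version 2>/dev/null | sed -n 's/.*release \([0-9][0-9.]*\),.*/\1/p') || true

check_min "Python" "$py_ver" "3.11"
check_min "NVIDIA driver" "$drv_ver" "470"
check_min "CUDA" "$cuda_ver" "11.7"
```

A "not found" line for the driver or CUDA is expected on machines without an NVIDIA GPU; only the Python check applies to CPU-only setups.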
- Run the setup script:
  `scripts/setup-linux.sh`
- Start all services:
  `scripts/start-all-services.sh`
- Install agent-cli:
  `uv tool install agent-cli`
- Test the setup:
  `agent-cli autocorrect "this has an eror"`

If you're using NixOS, see the dedicated NixOS Installation Guide for system-level service integration.
If you prefer manual setup:
# 1. Install dependencies
curl -LsSf https://astral.sh/uv/install.sh | sh
curl -fsSL https://ollama.ai/install.sh | sh
# 2. Start services individually
# Terminal 1: Ollama
ollama serve
# Terminal 2: Whisper (with GPU)
agent-cli server whisper
# Terminal 3: Piper
agent-cli server tts --backend piper
# Terminal 4: OpenWakeWord
scripts/run-openwakeword.sh

| Service | Port | GPU Support | Auto-Detection |
|---|---|---|---|
| Ollama | 11434 | ✅ CUDA/ROCm | Automatic |
| Whisper | 10300 | ✅ CUDA | Automatic (falls back to CPU) |
| Piper | 10200 | N/A | N/A |
| OpenWakeWord | 10400 | N/A | N/A |
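
To see at a glance which of the services in the table are reachable, a small loop over the ports can help. This is a sketch under assumptions: the function names `service_port` and `port_open` are hypothetical, the services are assumed to listen on `127.0.0.1`, and the check uses bash's `/dev/tcp` pseudo-device, so it needs bash rather than plain `sh`.

```bash
#!/usr/bin/env bash
# Sketch: report which of the service ports from the table accept TCP connections.

# service_port NAME: print the default port for a service, or fail if unknown.
service_port() {
    case "$1" in
        ollama)       echo 11434 ;;
        whisper)      echo 10300 ;;
        piper)        echo 10200 ;;
        openwakeword) echo 10400 ;;
        *)            return 1 ;;
    esac
}

# port_open PORT: succeed if something accepts connections on 127.0.0.1:PORT.
# The connection is opened in a subshell, so the descriptor closes on exit.
port_open() {
    (exec 3<>"/dev/tcp/127.0.0.1/$1") 2>/dev/null
}

for svc in ollama whisper piper openwakeword; do
    port=$(service_port "$svc")
    if port_open "$port"; then
        echo "$svc ($port): up"
    else
        echo "$svc ($port): not reachable"
    fi
done
```

An open port only shows that something is listening there; it does not confirm that the expected service is the one answering.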
The scripts use Zellij for managing all services in one session (works on both Linux and macOS):
scripts/start-all-services.sh

- `Ctrl-O d` - Detach (services keep running)
- `zellij attach agent-cli` - Reattach to session
- `zellij list-sessions` - List all sessions
- `zellij kill-session agent-cli` - Stop all services
- `Alt + arrow keys` - Navigate between panes
- `Ctrl-Q` - Quit (stops all services)

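
The attach and start commands can be combined so that one call either reattaches to a running `agent-cli` session or launches a new one. This is a hedged sketch: `has_session` and `attach_or_start` are hypothetical helpers, and it assumes a Zellij version whose `list-sessions` accepts `--short` to print bare session names, one per line.

```bash
#!/usr/bin/env bash
# Sketch: reattach to the agent-cli session if it exists, otherwise start it.

# has_session NAME: read session names on stdin (one per line) and
# succeed if NAME appears as an exact, whole-line match.
has_session() {
    grep -Fxq -- "$1"
}

# attach_or_start: run from the repository root so the script path resolves.
attach_or_start() {
    if zellij list-sessions --short 2>/dev/null | has_session agent-cli; then
        zellij attach agent-cli
    else
        scripts/start-all-services.sh
    fi
}
```

Calling `attach_or_start` from the repository root then does the right thing in both cases. On older Zellij releases without `--short`, the session list may include extra formatting and the exact match would need adjusting.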
The scripts automatically detect and use GPU acceleration:
- Whisper: Detects an NVIDIA GPU and uses the `large-v3` model with CUDA; falls back to `tiny` on CPU
- Ollama: Automatically uses an available GPU (CUDA/ROCm)
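
The fallback described for Whisper can be pictured as a simple probe-then-choose step. The sketch below illustrates the idea only and is not the logic of the project's scripts: `pick_whisper_model` is a hypothetical function, and using a successful `nvidia-smi` run as the GPU probe is an assumption.

```bash
#!/usr/bin/env bash
# Sketch: choose a Whisper model based on whether a GPU probe succeeds.

# pick_whisper_model [PROBE...]: print the model name for the detected hardware.
# PROBE is the command used to detect a GPU; it defaults to `nvidia-smi`.
pick_whisper_model() {
    if [ "$#" -eq 0 ]; then
        set -- nvidia-smi
    fi
    if "$@" >/dev/null 2>&1; then
        echo "large-v3"   # probe succeeded: large model, intended for CUDA
    else
        echo "tiny"       # probe failed or is missing: small model for CPU
    fi
}

echo "Whisper model: $(pick_whisper_model)"
```

Passing the probe as an argument keeps the function easy to try on any machine, for example `pick_whisper_model true` or `pick_whisper_model false`.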
- Install NVIDIA drivers:
  - Ubuntu/Debian: `sudo apt install nvidia-driver-535`
  - Fedora: `sudo dnf install akmod-nvidia`
- Install CUDA toolkit:
  - Ubuntu/Debian: `sudo apt install nvidia-cuda-toolkit`
  - Fedora: `sudo dnf install cuda`
- Verify GPU setup:
  - `nvidia-smi`
  - `nvcc --version`

- Install ROCm:
  - Ubuntu/Debian: `sudo apt install rocm-dev`
  - Configure for Ollama (adjust the value for your GPU): `export HSA_OVERRIDE_GFX_VERSION=10.3.0`
- Start Ollama with ROCm:
  `ollama serve`

- Full GPU acceleration - NVIDIA CUDA support
- Automatic configuration - Scripts detect and configure GPU
- Better performance - Direct system integration
# Check if NVIDIA GPU is detected
nvidia-smi
# Check what's running on the ports
ss -tlnp | grep -E ':(11434|10300|10200|10400)'

- Make sure you have enough RAM (8GB minimum)
- Services automatically download required models
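
Both points can be checked quickly from a shell, since the downloaded models also need free disk space. This is a minimal sketch with hypothetical helper names (`mem_total_gib`, `free_disk_gib`); it reads `/proc/meminfo` and `df`, and rounds RAM to the nearest GiB because the kernel reports slightly less than the installed amount.

```bash
#!/usr/bin/env bash
# Sketch: report total RAM and free disk space against the guide's minimums.

# mem_total_gib [FILE]: print total RAM in GiB, rounded to the nearest integer,
# from a meminfo-style file (defaults to /proc/meminfo).
mem_total_gib() {
    awk '/^MemTotal:/ { printf "%d\n", $2 / 1048576 + 0.5 }' "${1:-/proc/meminfo}"
}

# free_disk_gib [DIR]: print free space in whole GiB (rounded down) for the
# filesystem that holds DIR (defaults to the current directory).
free_disk_gib() {
    df -Pk "${1:-.}" | awk 'NR == 2 { printf "%d\n", $4 / 1048576 }'
}

echo "RAM: $(mem_total_gib) GiB (guide minimum: 8)"
echo "Free disk: $(free_disk_gib) GiB (guide minimum: 10)"
```

Run it from the directory where models will be stored so that `df` reports the relevant filesystem.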
If you prefer Docker (with some performance limitations):
- Docker Setup Guide
- Note: May have reduced GPU acceleration support