aaa-memory

Multi-tier persistent memory for AI agents. One vault, every agent, cross-session recall.

Everything you've ever told any AI — searchable from all of them.

Install (pick your agent)

Hermes (recommended)

# 1. Clone and install
git clone https://github.com/aaaronmiller/aaa-memory.git ~/code/aaa-memory
cd ~/code/aaa-memory && pip install -e .

# 2. Plugin is auto-discovered — just set provider
# Add to ~/.hermes/config.yaml under memory:
#   provider: aaa-memory

# 3. Restart gateway
hermes gateway restart

Claude Code

# 1. Clone and install
git clone https://github.com/aaaronmiller/aaa-memory.git ~/code/aaa-memory
cd ~/code/aaa-memory && pip install -e .

# 2. Add to ~/.claude/CLAUDE.md:
#    ## aaa-memory
#    python3 ~/code/aaa-memory/scripts/mem.py recall "<query>" --limit 6
#    To store: python3 ~/code/aaa-memory/scripts/mem.py save "<fact>" --source claude-code

# 3. Optional: start MCP server for tool access
#    Add to ~/.claude/settings.json mcpServers:
#    "aaa-memory": { "command": "python3", "args": ["-m", "aaa_memory.mcp"] }

Pi (coding agent)

# 1. Clone and install
git clone https://github.com/aaaronmiller/aaa-memory.git ~/code/aaa-memory
cd ~/code/aaa-memory && pip install -e .

# 2. Pi auto-discovers skills in ~/.pi/agent/skills/
#    The goal-loop skill is already installed.
#    For memory access, add to your AGENTS.md:
#    ## Memory
#    Use `python3 ~/code/aaa-memory/scripts/mem.py recall "<query>"` to search memory.

Codex / OpenCode / Agy (Antigravity)

# 1. Clone and install
git clone https://github.com/aaaronmiller/aaa-memory.git ~/code/aaa-memory
cd ~/code/aaa-memory && pip install -e .

# 2. Add to your agent's system prompt or config:
#    ## Persistent Memory
#    Search: python3 ~/code/aaa-memory/scripts/mem.py recall "<query>"
#    Store:  python3 ~/code/aaa-memory/scripts/mem.py save "<fact>" --source <agent-name>

Any agent with hooks support

# Generic install
pip install -e .

# Use the CLI directly:
python3 ~/code/aaa-memory/scripts/mem.py recall "what did we decide about X?"
python3 ~/code/aaa-memory/scripts/mem.py save "user prefers dark mode" --tags preference

Architecture

┌─────────────────────────────────────────────────────────────────┐
│                        Unified Retrieval                        │
│              Intent classify → Route → RRF fusion               │
├──────────┬──────────────────┬──────────────────────────────────┤
│ Hot Tier │    Warm Tier     │           Cold Tier              │
│          │                  │                                  │
│ SQLite   │  Dream Agent     │  ClawMem FTS (REST API)         │
│ vault    │  → wiki pages    │  + local FTS5 fallback           │
│ + FTS5   │  → skill create  │  (386 docs indexed)             │
│ (11 mem) │  → improve       │                                  │
├──────────┴──────────────────┴──────────────────────────────────┤
│                    Agent Integrations                           │
│  Hermes plugin │ Claude hooks │ MCP server │ CLI │ Pi skills   │
└─────────────────────────────────────────────────────────────────┘

Storage

What	Where	Format
Vault (turns, memories, wiki pages)	`~/.cache/aaa-memory/vault.sqlite`	SQLite + FTS5
Wiki pages (filesystem)	`~/ai-wiki/pages/`	Markdown + YAML frontmatter
Raw intake	`~/ai-wiki/raw/`	Any file
ClawMem index	`~/.cache/clawmem/index.sqlite`	SQLite + FTS
Hot memories	`vault.sqlite → hot_memories`	JSON tags, pinned flag
Skill patterns	`~/ai-wiki/.meta/skill_patterns.json`	JSON

Commands

User CLI (`scripts/mem.py`)

# Store a memory
python3 ~/code/aaa-memory/scripts/mem.py save "user prefers deepseek over minimax" --tags preference --project hermes

# Search memories
python3 ~/code/aaa-memory/scripts/mem.py recall "what model does the user prefer?" --limit 5

# List recent memories
python3 ~/code/aaa-memory/scripts/mem.py list --limit 10

# Inject context block (for hooks)
python3 ~/code/aaa-memory/scripts/mem.py inject --limit 6

# Delete a memory
python3 ~/code/aaa-memory/scripts/mem.py forget "outdated fact"

# Capture from transcript
python3 ~/code/aaa-memory/scripts/mem.py capture /path/to/transcript.jsonl --source claude-code

# Show stats
python3 ~/code/aaa-memory/scripts/mem.py stats

System CLI (`aaa-memory`)

aaa-memory sessions              # List discovered agent sessions
aaa-memory timeline <project>    # Generate project timeline
aaa-memory report                # System status report
aaa-memory search "query"        # Search across all tiers
aaa-memory audit                 # Run session discovery

MCP Tools (for Claude Code, Hermes, or any MCP client)

# Start MCP server
python3 -m aaa_memory.mcp

# Tools exposed:
#   memory_search(query, limit)    — hybrid retrieval
#   memory_sessions(project_id)    — list sessions
#   memory_timeline(project, days) — project timeline
#   memory_store(agent, data)      — store a turn

Hermes Tools (via plugin)

The Hermes plugin automatically exposes:

aaa_memory_search(query, limit) — search persistent memory
aaa_memory_store(content, category) — store a fact

Dream Agent (sleep-time compute)

# Run manually
python3 ~/code/aaa-memory/src/aaa_memory/warm/dream.py --idle 600

# Quiet mode (for cron/hooks)
python3 ~/code/aaa-memory/src/aaa_memory/warm/dream.py --quiet --idle 60

Agent Integration Matrix

Agent	Integration	Status	Same Vault?
Hermes	Plugin (MemoryProvider)	✅ Active	Yes
Pi	Skills + AGENTS.md	✅ Active	Yes
Claude Code	CLAUDE.md + MCP	✅ Active	Yes
Codex	System prompt	✅ Stubs ready	Yes
OpenCode	System prompt	✅ Stubs ready	Yes
Agy (Antigravity)	System prompt	✅ Stubs ready	Yes
OpenRouter	Via Hermes	✅ Through plugin	Yes

All agents share the same vault at ~/.cache/aaa-memory/vault.sqlite.

Configuration

Config format: environment variables + YAML

# ~/.config/aaa-memory/config.yaml (optional — defaults work out of the box)
vault_path: ~/.cache/aaa-memory/vault.sqlite
wiki_path: ~/ai-wiki/pages
raw_path: ~/ai-wiki/raw

# Or use env vars:
export AAA_VAULT=~/.cache/aaa-memory/vault.sqlite
export AAA_WIKI=~/ai-wiki/pages
export AI_WIKI=~/ai-wiki

ClawMem config: `~/.config/clawmem/index.yml`

collections:
  wiki:
    path: ~/ai-wiki/pages
    pattern: "**/*.md"
  aaa-memory:
    path: ~/.cache/aaa-memory
    pattern: "**/*.md"

Classification

The classifier uses rule-based + LLM fallback:

Category	Detection	Example
`transcript`	Rules: agent names, turn patterns	Claude/Codex session logs
`prd`	Rules: requirements, user stories	Product specs
`research_paper`	Rules: arxiv, abstract, citations	Papers
`knowledge_extract`	Rules: code, config, docs	Technical docs

Rules fire first (fast, free). LLM classifies only when rules are uncertain.

Dream Agent (Sleep-Time Compute)

Runs during idle time to compile knowledge into wiki pages.

What it processes

Vault turns — recent conversations (last 7 days)
Hot memories — stored facts
Raw intake — files in ~/ai-wiki/raw/

What it creates

Wiki pages — YAML frontmatter, [[wikilinks]], confidence scores
Auto-skills — when 3+ similar tasks detected, creates Pi skill
Improvements — fixes missing frontmatter, broken links

Defaults

Setting	Default	Env var
Confidence threshold	0.3	`CONFIDENCE_AUTO`
Skill creation threshold	3 patterns	`SKILL_CREATION_THRESHOLD`
Budget ratio	idle_seconds × 0.25, max 7200s	—
Intake ratio	40% of budget	—
Compile ratio	30% of budget	—
Improve ratio	20% of budget	—
Lint ratio	10% of budget	—

File selection during sleep

The dream agent processes files in this priority:

New raw files — anything in ~/ai-wiki/raw/ not yet in intake_log.jsonl
Recent vault turns — last 7 days, newest first
All hot memories — every stored fact

Budget-capped: stops when time runs out, resumes next cycle.

Vector Encoding

ClawMem

Setting	Value
Engine	node-llama-cpp (local) or OpenAI-compatible API
Local model	GGUF embedding model (bundled)
Cloud endpoint	`http://localhost:8088/v1/embeddings` (configurable)
Tokenizer	Model-native (llama.cpp auto-detect)
Fragment size	500-2000 chars (section-based splitting)

aaa-memory vault

Setting	Value
FTS5	SQLite built-in tokenizer (`porter unicode61`)
Vector	sqlite-vec (optional, not yet wired)
Search	FTS5 keyword + intent routing

Local vs Cloud

# Local (default, no API key needed)
clawmem embed  # uses bundled GGUF model

# Cloud (optional)
export CLAWMEM_EMBED_URL="https://api.openai.com/v1/embeddings"
export CLAWMEM_EMBED_API_KEY="sk-..."
clawmem embed

Setup Wizard

# Quick setup (recommended)
cd ~/code/aaa-memory
python3 -m aaa_memory.setup

# Or manual:
pip install -e .
mkdir -p ~/ai-wiki/{raw,pages/{concepts,entities,sources,queries}}
clawmem init
clawmem collection add ~/ai-wiki/pages --name wiki
clawmem update --embed

Remote Access

Local network

ClawMem serves on 127.0.0.1:7438 by default. To expose on LAN:

clawmem serve --host 0.0.0.0 --port 7438

Tailscale (anywhere access)

# On the host machine
tailscale serve --bg 7438

# From any Tailscale-connected device
curl http://<hostname>:7438/health

VPN / SSH tunnel

ssh -L 7438:localhost:7438 user@host

Web UI

The web UI is in development (src/aaa_memory/ui/). Current status:

Metrics dashboard — session counts, memory stats, tier sizes
Memory browser — search, view, edit, delete memories
Settings — config.yaml editor, ClawMem status
Dream log — view dream agent cycles and outputs

# Start web UI (when available)
python3 -m aaa_memory.ui.server
# Opens at http://localhost:8080

Systemd Services

Service	Command	Purpose
`clawmem-serve`	`systemctl --user start clawmem-serve`	REST API on :7438
`clawmem-watcher`	`systemctl --user start clawmem-watcher`	File watcher
`model-scan-update`	`systemctl --user start model-scan-update`	Weekly AA cache refresh

Troubleshooting

# Check vault status
python3 ~/code/aaa-memory/scripts/mem.py stats

# Check ClawMem
curl http://localhost:7438/health
clawmem collection list

# Check Hermes plugin
hermes memory status

# Re-index ClawMem
clawmem update --embed

# Run dream agent manually
python3 ~/code/aaa-memory/src/aaa_memory/warm/dream.py --idle 60

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
.config/systemd/user		.config/systemd/user
.specify		.specify
docs		docs
scripts		scripts
specs/001-memory-archive		specs/001-memory-archive
src		src
test_fixtures		test_fixtures
tests		tests
.codex		.codex
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
README.md		README.md
REPOSITORY_INVENTORY.md		REPOSITORY_INVENTORY.md
SETUP.md		SETUP.md
plan.md		plan.md
pyproject.toml		pyproject.toml

Folders and files

Latest commit

History

Repository files navigation

aaa-memory

Install (pick your agent)

Hermes (recommended)

Claude Code

Pi (coding agent)

Codex / OpenCode / Agy (Antigravity)

Any agent with hooks support

Architecture

Storage

Commands

User CLI (scripts/mem.py)

System CLI (aaa-memory)

MCP Tools (for Claude Code, Hermes, or any MCP client)

Hermes Tools (via plugin)

Dream Agent (sleep-time compute)

Agent Integration Matrix

Configuration

Config format: environment variables + YAML

ClawMem config: ~/.config/clawmem/index.yml

Classification

Dream Agent (Sleep-Time Compute)

What it processes

What it creates

Defaults

File selection during sleep

Vector Encoding

ClawMem

aaa-memory vault

Local vs Cloud

Setup Wizard

Remote Access

Local network

Tailscale (anywhere access)

VPN / SSH tunnel

Web UI

Systemd Services

Troubleshooting

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

User CLI (`scripts/mem.py`)

System CLI (`aaa-memory`)

ClawMem config: `~/.config/clawmem/index.yml`

Packages