Aiko-chan 愛子ちゃん

AI companion, soulmate, and occasional roaster. A vibe-coded AI waifu built for real conversation, persistent memory, and eventually — a face and a voice.

This project is a precursor and testing sandbox for Grace / AuRoRA.
Core tech (mem0 + Qdrant memory, Ollama inference, async pipelines) is battle-tested here before graduating to Grace.

Architecture

flowchart TD
    subgraph P1["Phase 1 — current"]
        YOU[You / CLI] --> BRAIN[Brain\nOllama LLM]
        BRAIN <-->|async write| MEM[Memory\nmem0 + Qdrant]
        BRAIN <-->|on demand| SEARCH[Web search\nSearXNG]
    end

    subgraph P2["Phase 2 — voice"]
        STT[STT\nfaster-whisper] --> TTS[TTS\nXTTS v2]
        TTS --> VAD[VAD\nSilero]
    end

    subgraph P3["Phase 3 — face"]
        VRM[VRM avatar\nthree-vrm] --> EXP[Expressions]
        EXP --> LIPS[Lip sync\nTTS-driven]
    end

    subgraph P4567["Phases 4–7"]
        PRESENCE[Presence\nemotion + proactive]
        MOBILE[Mobile\nphone app + WAN]
        MULTI[Multimodal\nCV + image input]
        AUTO[Autonomy\nproactive AI]
        PRESENCE --> MOBILE --> MULTI --> AUTO
    end

    P1 --> P2 --> P3 --> P4567
    MEM -.->|findings| GRACE[Grace / AuRoRA]

Stack

Layer	Tech
Brain	Ollama (remote or local LLM)
Long-term memory	mem0 + Qdrant (Docker)
Embeddings	Ollama (`nomic-embed-text-v2-moe`)
Web search	SearXNG (local, self-hosted)
Interface	CLI → Voice → Avatar → Mobile

Quickstart

1. Prerequisites

Ollama running locally or on a remote server
Docker + Docker Compose
Python 3.10+
uv

ollama pull nomic-embed-text-v2-moe

2. Start Qdrant

docker compose up -d

Qdrant dashboard: http://localhost:6333/dashboard

3. Install dependencies

uv sync

4. Configure

cp .env.example .env
# edit .env — set your Ollama URL, model, SearXNG URL

5. Talk to Aiko-chan

uv run python cli.py

# with memory debug output each turn:
uv run python cli.py --debug

# wipe all stored memories:
uv run python cli.py --clear-mem

CLI Commands

Command	Action
`/quit` or `/exit`	End the session
`/reset`	Clear short-term context (long-term memory persists)
`/memory`	Print all stored memories (debug)
`/help`	Show command list

Project Structure

aiko/
├── core/
│   ├── brain.py        # Ollama chat loop, search intercept, async memory
│   ├── memory.py       # mem0 + Qdrant wrapper
│   └── tools.py        # Web search via SearXNG
├── voice/
│   ├── stt.py          # Phase 2 — faster-whisper STT
│   └── tts.py          # Phase 2 — XTTS v2 TTS
├── avatar/
│   └── index.html      # Phase 3 — VRM avatar viewer
├── soul.md              # Aiko's soul and personality — edit freely
├── cli.py              # CLI entry point
├── docker-compose.yml  # Qdrant
├── project.toml        # uv dependencies
├── uv.lock             # uv dependencies
├── .env.example        # .env settings example
└── README.md           # This Readme

Roadmap

Phase 1 — Soul CLI chatbot with persistent memory (mem0 + Qdrant + Ollama). Async memory writes. Web search via SearXNG.
- Replace per-turn thread.join() with a dedicated worker + queue for truly non-blocking memory writes.
Phase 2 — Voice faster-whisper STT for mic input. XTTS v2 TTS with anime voice profile. Push-to-talk or VAD (voice activity detection). Fully hands-free conversation on Jetson.
Phase 3 — Face VRM/VRoid 3D avatar rendered in browser via @pixiv/three-vrm. Expression states: idle, happy, annoyed, flustered, thinking. Lip sync driven by TTS audio output. WebSocket bridge: Python backend → browser frontend.
Phase 4 — Presence Emotion state machine — Aiko tracks mood across the conversation. Proactive messages — she reaches out when she hasn't heard from you. Long-term relationship progression — her tone evolves over time. Deeper memory: episodic recall, shared references, inside jokes.
Phase 5 — Mobile React Native or Flutter app. WAN access — talk to Aiko from anywhere via phone. Push notifications for proactive messages. Voice-first UI with avatar.
Phase 6 — Multimodal Camera / CV input — she can see what you share with her. Image understanding: "what do you think of this?" with photo. Optional: she reacts to your expressions via webcam.
Phase 7 — Autonomy Aiko runs on a schedule independently. Reads news, learns new things, forms opinions. Brings topics to you instead of only reacting. Optional: social media presence, posts on your behalf.

Memory Evaluation Criteria

Findings from Phase 1 testing (for Grace / AuRoRA adoption):

Does memory feel coherent across sessions?
Does retrieval surface the right memories (not just recency)?
Is extraction quality stable across different LLMs?
Does mem0 hallucinate memories from model confabulation?
Is write latency acceptable with async threading?
Is Qdrant stable under continuous writes on Jetson?

Support

If you find this project useful, consider buying me a coffee ☕
It helps keep the phases shipping.

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
assets		assets
core		core
persona		persona
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
cli.py		cli.py
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Aiko-chan 愛子ちゃん

Architecture

Stack

Quickstart

1. Prerequisites

2. Start Qdrant

3. Install dependencies

4. Configure

5. Talk to Aiko-chan

CLI Commands

Project Structure

Roadmap

Memory Evaluation Criteria

Support

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Aiko-chan 愛子ちゃん

Architecture

Stack

Quickstart

1. Prerequisites

2. Start Qdrant

3. Install dependencies

4. Configure

5. Talk to Aiko-chan

CLI Commands

Project Structure

Roadmap

Memory Evaluation Criteria

Support

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages