Robot Buddy

A kid-safe, expressive robot platform combining real-time motor control, an animated TFT face, and an optional networked AI planner.

How It Works

Two ESP32-S3 microcontrollers handle the deterministic, safety-critical work: one (esp32-reflex) drives the motors with PID control and enforces safety limits; the other (esp32-face) renders an animated face on a 320x240 TFT touch display. A Raspberry Pi 5 orchestrates everything at 50 Hz: reading sensors, running the state machine, applying layered safety policies, and streaming telemetry to a browser UI. An optional AI planner server on a separate machine (3090 Ti) generates expressive behavior plans via a local LLM.

Reflexes are local and deterministic. The planner is remote and optional.
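As an illustration of the fixed-rate orchestration described above, here is a minimal sketch of a drift-free tick loop in asyncio. It is not the supervisor's actual loop: `run_tick_loop`, its parameters, and the overrun handling are assumptions made for the example.

```python
import asyncio
import time


async def run_tick_loop(tick, rate_hz=50.0, max_ticks=None):
    """Call tick(n) at a fixed rate, scheduling against absolute deadlines."""
    period = 1.0 / rate_hz
    next_deadline = time.monotonic()
    n = 0
    while max_ticks is None or n < max_ticks:
        # In a real supervisor this is where telemetry is read, the state
        # machine is stepped, safety policies are applied, and commands sent.
        tick(n)
        n += 1
        next_deadline += period
        delay = next_deadline - time.monotonic()
        if delay > 0:
            await asyncio.sleep(delay)
        else:
            # Overran the tick budget: resynchronise rather than burst to
            # catch up, and still yield to other tasks.
            next_deadline = time.monotonic()
            await asyncio.sleep(0)
    return n


if __name__ == "__main__":
    seen = []
    asyncio.run(run_tick_loop(seen.append, rate_hz=50.0, max_ticks=5))
    print(seen)
```

Scheduling against an absolute deadline, rather than sleeping a fixed 20 ms after each tick, keeps the average rate at 50 Hz even when individual ticks take a variable amount of time.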

Hardware

Component	Hardware	Role
Supervisor	Raspberry Pi 5	50 Hz orchestration, safety policy, HTTP/WS API
Face MCU	ESP32-S3 (ES3C28P)	320x240 TFT face renderer + touch/buttons telemetry
Reflex MCU	ESP32-S3 WROOM	Differential drive, PID, encoders, IMU, ultrasonic, safety
AI Server	PC with 3090 Ti (off-robot)	Planner/conversation LLM + TTS on LAN (`LLM_BACKEND=ollama` or `vllm`)
Motor Driver	TB6612FNG	Dual H-bridge for differential drive
Power	2S LiPo	Split into dirty (motors) and clean 5V regulated rails

Repository Layout

robot-buddy/
├── supervisor/          # Python supervisor (Raspberry Pi 5, process-isolated workers)
│   ├── core/            # 50 Hz tick loop, state machine, safety, behavior engine
│   ├── devices/         # MCU clients (reflex, face), protocol, expressions
│   ├── io/              # Serial transport, COBS framing, CRC
│   ├── workers/         # Process-isolated workers (TTS, vision, AI)
│   ├── messages/        # NDJSON envelope, event/action types
│   ├── api/             # FastAPI HTTP/WebSocket server, param registry
│   ├── mock/            # Mock Reflex MCU for testing (PTY-based)
│   ├── tests/           # pytest test suite
│   └── pyproject.toml   # Package metadata, deps
├── server/              # AI planner server (3090 Ti, FastAPI + backend switch)
│   ├── app/             # FastAPI app, LLM/STT/TTS backends, prompts, schemas
│   ├── tests/           # pytest test suite
│   ├── Modelfile        # Legacy Ollama model config
│   └── pyproject.toml   # Package metadata, deps
├── esp32-face/          # Face MCU firmware (ESP32-S3, C/C++, ESP-IDF)
│   └── main/            # TFT face rendering + touch/buttons + USB protocol
├── esp32-reflex/        # Reflex MCU firmware (ESP32-S3, C/C++, ESP-IDF)
│   └── main/            # Differential drive, PID, IMU, safety, encoders
├── dashboard/           # React dashboard (Vite + TypeScript + Biome)
│   └── src/             # Components, hooks, stores, tabs
├── specs/               # Completed specifications (immutable reference)
├── docs/                # TODO, architecture, protocols, wiring, power, research
├── deploy/              # Deployment (systemd service, install/update scripts)
├── tools/               # Dev utilities (face sim V3, parity check)
└── training/            # Wake word model training

Architecture

┌──────────────────────────────────────────────────────┐
│ On Robot                                             │
│                                                      │
│  ┌────────────────────────────────────────────────┐  │
│  │ Raspberry Pi 5 — Supervisor                    │  │
│  │                                                │  │
│  │  50 Hz tick loop:                              │  │
│  │    read telemetry → state machine → safety     │  │
│  │    policies → send commands → broadcast        │  │
│  │                                                │  │
│  │  HTTP API (:8080)  WebSocket (:8080/ws)        │  │
│  │  Vision process (separate OS process, 10-20Hz) │  │
│  └──────┬──────────────────────┬──────────────────┘  │
│         │ USB serial (COBS)    │ USB serial (COBS)   │
│  ┌──────▼──────┐         ┌─────▼───────┐             │
│  │ Reflex MCU  │         │  Face MCU   │             │
│  │ ESP32-S3    │         │  ESP32-S3   │             │
│  │             │         │             │             │
│  │ Motors, PID │         │ 320x240 TFT │             │
│  │ Encoders    │         │ Face +      │             │
│  │ IMU, Range  │         │ Touch UI    │             │
│  │ Safety      │         │             │             │
│  └─────────────┘         └─────────────┘             │
└──────────────────────────────────────────────────────┘
         │
         │ HTTP (LAN, optional)
         │
┌────────▼───────────────────────────┐
│ AI Server (3090 Ti PC)             │
│                                    │
│ FastAPI planner server             │
│ LLM backend: ollama | vllm         │
│ TTS: Orpheus (vLLM) + espeak shed  │
│ POST /plan / WS /converse /tts     │
└────────────────────────────────────┘

State Machine

BOOT → IDLE → TELEOP / WANDER → ERROR

BOOT → IDLE: automatic when Reflex MCU connects with no faults
IDLE → TELEOP/WANDER: via set_mode command
Any → ERROR: on disconnect, ESTOP, TILT, or BROWNOUT
ERROR → IDLE: via clear_error() when Reflex is healthy
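The transitions listed above can be sketched as a pure function. This is an illustrative sketch, not the supervisor's implementation: the `Mode` enum, `next_mode`, and its argument names are hypothetical, and it assumes BOOT simply waits (rather than entering ERROR) until the Reflex MCU is connected and fault-free.

```python
from enum import Enum


class Mode(Enum):
    BOOT = "BOOT"
    IDLE = "IDLE"
    TELEOP = "TELEOP"
    WANDER = "WANDER"
    ERROR = "ERROR"


def next_mode(mode, reflex_connected, faults, requested=None, clear_error=False):
    """Return the mode for the next tick.

    faults is a set of active fault names such as {"ESTOP", "TILT", "BROWNOUT"}.
    requested is the target of a set_mode command, if one arrived this tick.
    """
    healthy = reflex_connected and not faults
    if mode is Mode.BOOT:
        # Assumption: stay in BOOT until the Reflex MCU is up and clean.
        return Mode.IDLE if healthy else Mode.BOOT
    if mode is Mode.ERROR:
        # ERROR is sticky: it needs an explicit clear and a healthy Reflex.
        return Mode.IDLE if (clear_error and healthy) else Mode.ERROR
    if not healthy:
        return Mode.ERROR
    if mode is Mode.IDLE and requested in (Mode.TELEOP, Mode.WANDER):
        return requested
    return mode


if __name__ == "__main__":
    print(next_mode(Mode.TELEOP, True, {"TILT"}))  # Mode.ERROR
```

Keeping the transition logic free of I/O makes it straightforward to unit-test every edge without hardware.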

Safety Policies (Defense in Depth)

Mode gate — no motion outside TELEOP/WANDER
Fault gate — any fault → zero twist
Reflex disconnect → zero twist
Ultrasonic range scaling (hard stop at 250 mm, 50% at 500 mm)
Stale range fallback (50% cap)
Vision confidence scaling
Stale vision timeout (500 ms)

Safety-critical enforcement also runs on the Reflex MCU itself (acceleration limits, command TTL, hard stop). The supervisor applies its additional caps on top of that.
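A rough sketch of how the supervisor-side gates could combine into a single speed multiplier is shown below. The 250 mm and 500 mm thresholds and the 50% cap come from the list above. The function name, the argument names, and the step-wise (rather than interpolated) scaling between thresholds are assumptions, and the vision policies are left out for brevity.

```python
def speed_scale(mode, faults, reflex_connected, range_mm, range_stale=False):
    """Return a 0.0-1.0 multiplier to apply to the commanded twist."""
    if mode not in ("TELEOP", "WANDER"):
        return 0.0  # mode gate: no motion outside TELEOP/WANDER
    if faults or not reflex_connected:
        return 0.0  # fault gate and Reflex disconnect: zero twist
    if range_stale or range_mm is None:
        return 0.5  # stale range fallback: cap at 50%
    if range_mm <= 250:
        return 0.0  # hard stop
    if range_mm <= 500:
        return 0.5  # assumed step: 50% anywhere between the two thresholds
    return 1.0


def apply_scale(linear, angular, scale):
    """Scale both twist components by the same factor."""
    return linear * scale, angular * scale


if __name__ == "__main__":
    scale = speed_scale("WANDER", set(), True, range_mm=400)
    print(apply_scale(0.3, 1.0, scale))
```

Evaluating the gates from most to least restrictive means any single failing check is enough to stop or slow the robot.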

Serial Protocol

Binary packets over USB serial with COBS framing:

[type:u8][seq:u8][payload:N][crc16:u16-LE]

For esp32-face, this protocol carries face state/gesture/system/talking commands and touch/button/status telemetry only. Audio transport is supervisor-side USB audio.
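To make the packet layout concrete, the sketch below builds and parses one frame in Python. Several details are assumptions rather than facts from this repository: the CRC variant (CRC-16/CCITT-FALSE via `binascii.crc_hqx`), the CRC covering type, seq, and payload, and a single `0x00` byte as the frame delimiter. The helper names are hypothetical; docs/protocols.md remains the authoritative reference.

```python
import binascii
import struct


def cobs_encode(data: bytes) -> bytes:
    """COBS-encode data so the result contains no 0x00 bytes."""
    out = bytearray()
    block = bytearray()
    for byte in data:
        if byte == 0:
            out.append(len(block) + 1)
            out += block
            block.clear()
        else:
            block.append(byte)
            if len(block) == 254:  # maximum run without a zero
                out.append(0xFF)
                out += block
                block.clear()
    out.append(len(block) + 1)
    out += block
    return bytes(out)


def cobs_decode(encoded: bytes) -> bytes:
    """Reverse cobs_encode; raises ValueError on malformed input."""
    out = bytearray()
    i = 0
    while i < len(encoded):
        code = encoded[i]
        if code == 0:
            raise ValueError("zero byte inside COBS frame")
        block = encoded[i + 1 : i + code]
        if len(block) != code - 1:
            raise ValueError("truncated COBS frame")
        out += block
        i += code
        if code != 0xFF and i < len(encoded):
            out.append(0)  # the code byte stood in for a zero
    return bytes(out)


def build_frame(pkt_type: int, seq: int, payload: bytes = b"") -> bytes:
    """[type:u8][seq:u8][payload][crc16 LE], COBS-encoded, 0x00-terminated."""
    body = struct.pack("<BB", pkt_type, seq & 0xFF) + payload
    crc = binascii.crc_hqx(body, 0xFFFF)  # assumed CRC variant
    return cobs_encode(body + struct.pack("<H", crc)) + b"\x00"


def parse_frame(frame: bytes):
    """Return (type, seq, payload) or raise ValueError on a bad frame."""
    raw = cobs_decode(frame[:-1] if frame.endswith(b"\x00") else frame)
    if len(raw) < 4:
        raise ValueError("frame too short")
    body, (crc,) = raw[:-2], struct.unpack("<H", raw[-2:])
    if binascii.crc_hqx(body, 0xFFFF) != crc:
        raise ValueError("CRC mismatch")
    return body[0], body[1], body[2:]


if __name__ == "__main__":
    frame = build_frame(0x10, 7, b"\x00\x01\x00")
    print(frame.hex(), parse_frame(frame))
```

Because COBS removes every zero byte from the encoded body, a receiver can resynchronise after line noise by discarding input up to the next `0x00`.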

Auto-reconnect with exponential backoff (0.5s–5s). See docs/protocols.md for packet definitions.
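The reconnect schedule can be pictured with a small generator. The 0.5 s floor and 5 s ceiling are from the text above; the doubling factor and the absence of jitter are assumptions, and `backoff_delays` is a hypothetical name.

```python
import itertools


def backoff_delays(initial=0.5, maximum=5.0, factor=2.0):
    """Yield reconnect delays in seconds, growing until capped at maximum."""
    delay = initial
    while True:
        yield delay
        delay = min(delay * factor, maximum)


if __name__ == "__main__":
    print(list(itertools.islice(backoff_delays(), 6)))
```

A caller would typically create a fresh generator after each successful connection so that the next outage starts again from the shortest delay.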

Tech Stack

Component	Stack
Supervisor	Python 3.11+, asyncio, FastAPI, uvicorn, pyserial, OpenCV
AI Server	Python 3.11+, FastAPI, httpx, Pydantic, Ollama (compat) + vLLM (migration target)
ESP32 Firmware	C/C++, ESP-IDF (FreeRTOS), CMake
Build (Python)	Hatchling via pyproject.toml, uv for dependency management
Build (ESP32)	`idf.py build` (CMake), `source ~/esp/esp-idf/export.sh`
Dashboard	React 19, Vite, TypeScript, Zustand, TanStack Query
Tests	pytest, pytest-asyncio, Vitest
Linting	ruff (Python), clang-format + cppcheck (C++), Biome (TypeScript)

Getting Started

Supervisor (Raspberry Pi 5)

cd supervisor
uv sync --group dev

# Run with mock hardware (no physical robot needed)
just run-mock

# Run with real hardware
just run

# Other options
uv run python -m supervisor --no-vision         # Disable vision worker
uv run python -m supervisor --http-port 8080    # Custom HTTP port
uv run python -m supervisor --planner-api http://10.0.0.20:8100 --robot-id robot-1

AI Planner Server (3090 Ti PC)

# Install and run the server
cd server
uv sync --extra dev --extra llm --extra stt --extra tts

# Recommended testing profile (vLLM planner + CPU STT + espeak)
LLM_BACKEND=vllm STT_DEVICE=cpu TTS_BACKEND=espeak \
uv run --extra llm --extra stt --extra tts python -m app.main

The server starts on port 8100. See server/README.md for full API docs and configuration.

ESP32 Firmware

Requires ESP-IDF toolchain.

cd esp32-face   # or esp32-reflex
idf.py build
idf.py flash
idf.py monitor

Dashboard

just run-dashboard         # dev server with hot reload
just build-dashboard       # production build → supervisor/static/

Development

All commands are available via just (see justfile):

just test-all              # run all tests (supervisor, server, dashboard)
just lint                  # check Python + C++ + dashboard
just lint-fix              # auto-fix formatting
just preflight             # full pre-commit check (lint + tests + parity)
just sim                   # run face simulator V3
just check-parity          # verify sim↔MCU constant alignment

Mock Mode

The supervisor includes a PTY-based mock Reflex MCU (`supervisor/mock/mock_reflex.py`) that simulates serial communication, telemetry, and fault injection. Use `just run-mock` to run the full supervisor stack without any hardware.
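For readers unfamiliar with the technique, the sketch below shows the general idea behind a PTY-based mock on a POSIX system. It is not taken from `mock_reflex.py`: `open_mock_port` is a hypothetical helper, and it assumes the mock writes to the master side while the code under test opens the slave path like an ordinary serial device.

```python
import os
import pty
import tty


def open_mock_port():
    """Create a pseudo-terminal pair and return (master_fd, slave_fd, slave_path)."""
    master_fd, slave_fd = pty.openpty()
    # Raw mode: no echo, no line buffering, and 0x00 delimiters pass through.
    tty.setraw(slave_fd)
    return master_fd, slave_fd, os.ttyname(slave_fd)


if __name__ == "__main__":
    master_fd, slave_fd, path = open_mock_port()
    # The mock MCU "sends" bytes by writing to the master side...
    os.write(master_fd, b"\x02\x11\x02\x22\x00")
    # ...and they arrive on the slave, which a serial client could open by path.
    print(path, os.read(slave_fd, 64))
    os.close(master_fd)
    os.close(slave_fd)
```

Since the slave side appears as a regular device path, the transport code under test does not need to know whether it is talking to real hardware or to the mock.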

Dashboard

When the supervisor is running, open http://<robot_ip>:8080 in a browser for:

Live telemetry display with diagnostic tree
Mode control (IDLE, TELEOP, WANDER)
E-STOP button
Face control (moods, gestures, talking, conversation state)
Parameter tuning sliders (PID gains, speed limits, safety thresholds)
Monitor tab (device health, comms, power, sensors, faults, workers)
MJPEG video stream (if vision enabled)

Configuration

Supervisor — YAML config file (schema in supervisor/config.py):

Sections: serial, control, safety, network, logging, vision
Default serial paths: /dev/robot_reflex, /dev/robot_face (via udev symlinks)

AI Server — environment variables:

LLM_BACKEND, VLLM_MODEL_NAME, LLM_MAX_INFLIGHT, PERFORMANCE_MODE
Legacy compatibility: OLLAMA_URL, MODEL_NAME, PLAN_TIMEOUT_S, TEMPERATURE, NUM_CTX
See server/README.md for the full table

ESP32 — sdkconfig.defaults + config.h constants

Supervisor API

Endpoint	Method	Description
`/status`	GET	Current robot state (JSON)
`/params`	GET	Full parameter registry
`/params`	POST	Transactional parameter updates
`/actions`	POST	RPC: `set_mode`, `e_stop`, `clear_e_stop`
`/video`	GET	MJPEG stream (if vision enabled)
`/debug/devices`	GET	Device connection state
`/debug/planner`	GET	Planner state
`/debug/mcu_benchmark`	GET	MCU benchmark run status
`/ws`	WS	Telemetry stream (20 Hz, JSON)
`/ws/logs`	WS	Live log stream
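As a hedged illustration of calling the `/actions` endpoint from a script, the sketch below builds a POST request using only the standard library. The JSON body shape (`action` plus keyword parameters) and the host name are assumptions made for the example; the real request schema is defined by the supervisor's API code.

```python
import json
import urllib.request


def build_action_request(base_url, action, **params):
    """Build (but do not send) a POST /actions request.

    The body layout {"action": ..., **params} is hypothetical.
    """
    body = json.dumps({"action": action, **params}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/actions",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    req = build_action_request("http://robot.local:8080", "set_mode", mode="TELEOP")
    print(req.get_method(), req.full_url, req.data)
    # To actually send it: urllib.request.urlopen(req, timeout=2.0)
```

Separating request construction from sending keeps the example testable without a running robot.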

AI Server API

Endpoint	Method	Description
`/health`	GET	Server + selected LLM backend status
`/plan`	POST	Accept world state + `robot_id/seq/monotonic_ts_ms`, return plan + `plan_id` echo metadata
`/converse`	WS	Conversation stream (single active session per `robot_id`)
`/tts`	POST	Direct TTS with optional metadata (`robot_id`, `seq`, `monotonic_ts_ms`)

Plan actions: say(text), emote(name, intensity), gesture(name, params), skill(name) — planner proposes intent and supervisor executes deterministic skills.
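The split between proposed intent and deterministic execution can be sketched as follows. The field names `robot_id`, `seq`, and `monotonic_ts_ms` and the four action names come from the text above. The `PlanRequest` class, the `world_state` field, and the `"action"` key used to tag each plan step are hypothetical.

```python
import time
from dataclasses import asdict, dataclass, field

# Action names the supervisor knows how to execute, per the list above.
ALLOWED_ACTIONS = {"say", "emote", "gesture", "skill"}


@dataclass
class PlanRequest:
    robot_id: str
    seq: int
    monotonic_ts_ms: int
    world_state: dict = field(default_factory=dict)  # hypothetical field name


def make_plan_request(robot_id, seq, world_state):
    """Stamp a request with a monotonic timestamp in milliseconds."""
    ts_ms = int(time.monotonic() * 1000)
    return PlanRequest(robot_id, seq, ts_ms, world_state)


def filter_plan_actions(actions):
    """Drop any step whose action name the supervisor does not recognise."""
    return [a for a in actions if a.get("action") in ALLOWED_ACTIONS]


if __name__ == "__main__":
    req = make_plan_request("robot-1", 42, {"range_mm": 800})
    print(asdict(req))
    plan = [
        {"action": "say", "text": "Hello!"},
        {"action": "drive", "speed": 1.0},  # not an allowed action: dropped
        {"action": "emote", "name": "happy", "intensity": 0.8},
    ]
    print(filter_plan_actions(plan))
```

Filtering on an allow-list means a malformed or unexpected planner response can never introduce a motion primitive the supervisor has not vetted.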

Supervisor Fallback Policy

Failure condition	Immediate supervisor action	Motion policy	Face policy	Speech policy
`/plan` unreachable / non-200	Mark planner disconnected; skip remote plan apply	Local deterministic only (`patrol_drift`/`avoid_obstacle`/safe stop)	`confused` gesture with cooldown	Cancel queued planner speech
`/converse` TTS fails mid-turn	Stop playback and clear talking flag	No change to motion authority	Show `thinking` briefly then restore previous mood	Attempt fallback backend once; if unavailable, skip speech

Project Status

Working

In Progress

Reflex MCU hardware commissioning (breadboard bring-up)
Personality engine implementation (spec complete, implementation pending)

Future

Conversation memory / interaction history
Wake word model improvements (recall 42% → 80%+)
Additional modes: LINE_FOLLOW, BALL, CRANE, CHARGING
Home Assistant integration
Voice ID / speaker identification

See docs/TODO.md for the detailed backlog and specs/ for design specifications.
