Operio Agent

Autonomous mall operations agent for lease-aware maintenance dispatch.

Operio Agent is a FastAPI + React application for handling retail center incidents end to end. It combines Gemini-powered reasoning, a multi-tiered MongoDB Atlas Search & Vector Search pipeline for lease and manual retrieval, MongoDB-backed operational state, and Arize-powered observability so each dispatch decision is evidence-backed and reviewable.

What It Does

Accepts tenant incident reports through a chat-driven workflow.
Retrieves lease clauses and equipment manual evidence before acting.
Checks available technicians and creates or updates work orders through MCP tools.
Escalates landlord-liable or high-cost incidents into a human-in-the-loop approval path.
Logs model calls and tool execution traces to Arize Phoenix, with optional AX dataset and experiment publishing.

Runtime Stack

Google Cloud Agent Builder via google-adk
Gemini on Vertex AI and direct Google Developer API
MongoDB MCP server for work orders, sessions, tenants, staff, leases, and manuals (RAG Search)
Arize Phoenix for trace capture, with optional Arize AX publishing for datasets and experiments
FastAPI backend served with a React + Vite frontend

Partner Track

Primary submission target: Arize

Why this track fits:

Trace-Level Observability: Full execution traces (AGENT -> TOOL -> MCP) are captured in Arize Phoenix so judges can inspect the reasoning path behind every dispatch decision.
LLM-as-a-Judge Evaluators: Six evaluators (liability correctness, evidence usage, ambiguity handling, workflow correctness, session coherence, resolution quality) score chat turns and log results onto traced spans.
Phoenix MCP Server Integration: The app communicates with @arizeai/phoenix-mcp to keep telemetry and evaluation workflows connected to the agent runtime.
Continuous Benchmarking: The scenario benchmark runs locally and can optionally be published into Arize AX as a reusable dataset + experiment baseline.

Repository Guide

Local Setup

Prerequisites

Node.js 22+
pnpm 9+
Python 3.11+
uv
Docker Desktop or another Docker runtime

1. Install dependencies

pnpm install

2. Configure environment variables

cp .env.example .env

Set at least:

Variable	Purpose	Default
`OPERIO_REASONING_BACKEND`	Selects the agent runtime (`adk` or `legacy`)	`legacy`
`GOOGLE_GENAI_USE_VERTEXAI`	Routes Gemini calls through Vertex AI	`false`
`GOOGLE_CLOUD_PROJECT`	Google Cloud project for Vertex AI / deployment	none
`GOOGLE_CLOUD_LOCATION`	Google Cloud region for Vertex AI	`us-central1`
`GEMINI_API_KEY`	Optional legacy fallback for direct Gemini API use	none
`MONGO_URI`	MongoDB connection string	`mongodb://localhost:27017`
`MONGO_DB`	MongoDB database name	`operio`
`PHOENIX_PROJECT_NAME`	Phoenix project name	`operio-agent`
`PHOENIX_COLLECTOR_ENDPOINT`	Phoenix collector base URL	`http://localhost:6006`

3. Start local infrastructure

docker compose up -d

This starts:

MongoDB on http://localhost:27017
Arize Phoenix on http://localhost:6006

4. Seed demo data

pnpm run seed

This loads mock tenants, technicians, work orders, lease documents, and equipment manuals, automatically generating 3072-dimensional vector embeddings using Gemini (gemini-embedding-2) to enable semantic search.

5. Run the app

pnpm run dev

Open:

App: http://localhost:3001
Phoenix traces: http://localhost:6006

6. Run the scenario evaluation flow

The repo now has a single orchestration command for the scenario baseline:

pnpm run eval:flow

Useful variants:

pnpm run eval:flow -- --no-seed --limit 1
pnpm run eval:flow -- --publish --space operio
pnpm run eval:flow -- --publish --space operio --scenario-ids 1,9,10

What it does:

optionally reseeds MongoDB
runs the local scenario benchmark
exports the AX dataset artifact
optionally publishes the dataset and experiment baseline to Arize AX

Generated artifacts land under agents/demo/:

operio_ax_baseline_results.json
operio_ax_eval_dataset.csv
operio_ax_experiment_runs.json
operio_ax_run_annotations.json

7. Optional frontend-only dev server

pnpm run frontend:dev

Use this when iterating on the SPA with Vite hot reload. The frontend proxies API traffic to the FastAPI backend on port 3001.

Demo Flow

Good scenarios for a judge or reviewer:

HVAC failure with lease-backed tenant liability and auto-dispatch.
High-cost landlord-liable repair that pauses for manager approval.
Equipment diagnosis that pulls troubleshooting context from manual search.

For the Arize-specific validation path, use docs/ARIZE_JUDGING_GUIDE.md.

Development Commands

pnpm run dev - start the FastAPI backend on http://localhost:3001
pnpm run build - type-check the TypeScript workspace and build the frontend
pnpm run frontend:build - build the frontend bundle into demo/
pnpm run frontend:review - run alias, JSDoc, and loop-discipline checks
pnpm run frontend:test - run frontend tests
pnpm run test - run MCP tests, frontend tests, and backend pytest suites
pnpm run evaluate - run the scenario benchmark pytest gate
pnpm run eval:flow - run the local scenario baseline and optionally publish it to Arize AX
pnpm run seed - reseed MongoDB with demo data and generate search embeddings

License

Distributed under the Apache License 2.0. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.agents/skills		.agents/skills
.github/workflows		.github/workflows
agents		agents
docs		docs
frontend		frontend
mcp_servers		mcp_servers
output/pdf		output/pdf
scripts		scripts
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
PRIVACY.md		PRIVACY.md
README.md		README.md
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
skills-lock.json		skills-lock.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Operio Agent

What It Does

Runtime Stack

Partner Track

Repository Guide

Local Setup

Prerequisites

1. Install dependencies

2. Configure environment variables

3. Start local infrastructure

4. Seed demo data

5. Run the app

6. Run the scenario evaluation flow

7. Optional frontend-only dev server

Demo Flow

Development Commands

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Operio Agent

What It Does

Runtime Stack

Partner Track

Repository Guide

Local Setup

Prerequisites

1. Install dependencies

2. Configure environment variables

3. Start local infrastructure

4. Seed demo data

5. Run the app

6. Run the scenario evaluation flow

7. Optional frontend-only dev server

Demo Flow

Development Commands

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages