Unified proxy for LLM inference. Pod sits in front of your AI providers and exposes a single OpenAI-compatible endpoint — with routing, fallback, caching, rate limiting, and a dashboard built in.
🚧 Under active development on v0.0.x
- Multi-provider routing — OpenAI, Anthropic, Gemini, Codex, Ollama, Blackbox, MiniMax, and more
- Compatibility APIs — OpenAI-, Anthropic-, Gemini-, and Ollama-compatible endpoints under `/v1/*`
- Semantic cache — deduplicates identical requests; streaming responses are cached too
- Conversational memory — automatic injection and extraction across sessions
- API key auth — per-key rate limiting (req/min + concurrent cap)
- Combos — model groups with fallback and round-robin strategies
- Proxy pools — per-provider proxy config with optional Vercel relay
- Tunnel support — Tailscale and Cloudflare tunnel integration
- Dashboard — full web UI for providers, usage analytics, quota tracking, logs, and health
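Because Pod speaks the OpenAI wire format, any client that honors the `OPENAI_BASE_URL` convention (the official OpenAI Python SDK does) can be pointed at it. A minimal sketch, where the key is a placeholder for one created in the dashboard:

```bash
# Point an OpenAI-compatible client at Pod instead of api.openai.com.
export OPENAI_BASE_URL=http://localhost:20128/v1
export OPENAI_API_KEY=YOUR_POD_API_KEY   # placeholder key from the dashboard
```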
```bash
docker run -d \
  --name pod \
  -p 20128:20128 \
  -v pod-data:/app/data \
  lazuardytech/pod:latest
```

Then open http://localhost:20128.
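As a quick sanity check, the model listing endpoint (documented below) should respond once the container is up:

```bash
# List the models Pod exposes; add an auth header if API key auth is enabled.
curl http://localhost:20128/v1/models
```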
With an env file:
```bash
docker run -d \
  --name pod \
  -p 20128:20128 \
  -v pod-data:/app/data \
  --env-file .env \
  lazuardytech/pod:latest
```
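A minimal `.env` might look like the following; every variable is documented in the table below, and the values here are only illustrative:

```bash
# .env — example values only
INITIAL_PASSWORD=change-me-now
BASE_URL=https://pod.example.com
SEMANTIC_CACHE_TTL_MS=3600000
```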
To run from source (requires bun v1.3.14+):

```bash
bun install
bun run dev   # starts on http://localhost:20128
```

Pod reads the following environment variables:

| Variable | Default | Description |
|---|---|---|
| `PORT` | `20128` | HTTP port |
| `DATA_DIR` | `/app/data` | SQLite data directory |
| `INITIAL_PASSWORD` | `123456` | Initial dashboard login password. Change after first login. |
| `BASE_URL` | `http://localhost:20128` | Internal base URL used for self-referencing API calls (e.g. model availability checks). Set this when running behind a reverse proxy. |
| `CLOUD_URL` | (none) | URL of your self-hosted Cloudflare Worker (cloud deployment). Overrides the value stored in settings. |
| `NEXT_TELEMETRY_DISABLED` | `1` | Disable Next.js telemetry |
| `SEMANTIC_CACHE_MAX_BYTES` | `4194304` | Semantic cache max size in bytes |
| `SEMANTIC_CACHE_MAX_SIZE` | `100` | Semantic cache max entries |
| `SEMANTIC_CACHE_TTL_MS` | `1800000` | Semantic cache TTL (ms) |
| `PROMPT_CACHE_MAX_BYTES` | `2097152` | Prompt cache max size in bytes |
| `PROMPT_CACHE_MAX_SIZE` | `50` | Prompt cache max entries |
| `PROMPT_CACHE_TTL_MS` | `300000` | Prompt cache TTL (ms) |
Pod exposes standard-compatible endpoints:
| Endpoint | Protocol |
|---|---|
| `POST /v1/chat/completions` | OpenAI |
| `POST /v1/messages` | Anthropic |
| `POST /v1/responses` | OpenAI Responses |
| `POST /v1/embeddings` | OpenAI |
| `POST /v1/audio/speech` | OpenAI TTS |
| `POST /v1/audio/transcriptions` | OpenAI STT |
| `POST /v1/images/generations` | OpenAI |
| `GET /v1/models` | OpenAI |
| `GET /v1beta/models` | Gemini |
| `POST /v1/api/chat` | Ollama |
All endpoints accept `Authorization: Bearer <key>` or `x-api-key: <key>` when API key auth is enabled.
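For instance, a minimal chat completion through the OpenAI-compatible endpoint; the model name and key below are placeholders for whatever you have configured:

```bash
curl http://localhost:20128/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_POD_API_KEY" \
  -d '{
    "model": "gpt-4o-mini",
    "messages": [{"role": "user", "content": "Hello from Pod!"}]
  }'
```

If the semantic cache is enabled, repeating an identical request within the TTL should be served from the cache instead of reaching the provider.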
```bash
bun install        # install dependencies
bun run dev        # start dev server on :20128
bun run build      # production build
bun run check      # biome format + lint + eslint
bun run test:run   # run vitest
```

Always run `bun run check` and `bun run test:run` before pushing.
See AGENTS.md for project rules (applies to both humans and AI agents). Additional agent context lives in .agents/.
See CONTRIBUTING.md for guidelines.
Pod is heavily inspired by 9router and OmniRoute. Credits to their maintainers.
See SECURITY.md for the vulnerability disclosure policy.
MIT — Copyright (c) 2024–2026 Lazuardy Technology and contributors.