LLM Relay

A desktop app that talks to AI using multiple providers through your own API keys.

What it does

Routes your messages to 15 different LLM providers. If one fails, it tries another. All data stays on your machine.

Why use this:

Connect your own API keys for each provider
Smart routing picks the best available provider
Encrypted key storage (OS keychain)
Auto-fallback when providers fail

Providers (15)

Free tier:

Provider	What you get
Google AI	Gemini 2.0 Flash
Groq	Llama 3, Mixtral (fast)
Cerebras	Llama 3.1/3.3 (fast)
NVIDIA NIM	Various models
Cohere	Command R+
Cloudflare	10k neurons/day
OpenRouter	50 req/day `:free` models

Paid: OpenAI, Anthropic, Mistral, Perplexity, Together, DeepSeek, xAI

Local: Ollama (run models on your machine)

Quick start

git clone https://github.com/your-username/llm-relay.git
cd llm-relay
npm install
npm run dev

Opens on port 5190.

Scripts

Command	What it does
`npm run dev`	Dev mode
`npm run build`	Production build
`npm run package`	Package for current OS
`npm run package:win`	Windows installer
`npm run package:mac`	macOS DMG
`npm run package:linux`	AppImage/DEB/RPM
`npm run lint`	Lint check
`npm run typecheck`	Type check
`npm run test`	Run tests

Adding keys

Open Settings (gear icon)
Enter your API key
Click "Validate & Save"

Cloudflare format: account_id:api_token

How routing works

Filters: has key, not in cooldown, circuit closed
Scores by health (latency + success rate)
Weighted random selection
Up to 6 retries on failure

Data location

OS	Path
Windows	`%APPDATA%/llm-relay/llm-relay.sqlite`
macOS	`~/Library/Application Support/llm-relay/`
Linux	`~/.config/llm-relay/`

All data local. Keys encrypted. No telemetry.

Structure

llm-relay/
├── electron-src/     # Main process (TS)
│   ├── database/     # SQLite
│   ├── providers/    # LLM adapters
│   ├── router/       # Routing logic
│   └── services/     # Memory, facts
├── src/              # React frontend
└── tests/            # Vitest

Keyboard shortcuts

Cmd/Ctrl+N - New chat
Cmd/Ctrl+K - Search
Cmd/Ctrl+, - Settings
Escape - Cancel

Troubleshooting

Port in use? Change it in vite.config.ts, package.json, and electron-src/main/index.ts.

429 error? Rate limited - key is valid, just wait.

Reset data? Delete the sqlite file.

License

Apache 2.0

You manage your keys. You're responsible for each provider's ToS.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.github/workflows		.github/workflows
build		build
electron-src		electron-src
public		public
src		src
tests		tests
.eslintrc.cjs		.eslintrc.cjs
.gitignore		.gitignore
.prettierrc		.prettierrc
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
electron-builder.yml		electron-builder.yml
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
tsconfig.node.tsbuildinfo		tsconfig.node.tsbuildinfo
vite.config.ts		vite.config.ts
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Relay

What it does

Providers (15)

Quick start

Scripts

Adding keys

How routing works

Data location

Structure

Keyboard shortcuts

Troubleshooting

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LLM Relay

What it does

Providers (15)

Quick start

Scripts

Adding keys

How routing works

Data location

Structure

Keyboard shortcuts

Troubleshooting

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages