A high-performance LLM (Large Language Model) proxy server written in Rust, designed to intelligently route requests across multiple LLM providers, with advanced features such as weighted load balancing, health checks, and connection pooling.
- Multi-Provider Support: Seamlessly proxy requests to OpenAI, Gemini, and Anthropic APIs
- Intelligent Load Balancing: Weighted provider selection algorithm for optimal resource utilization (see the sketch after this list)
- Fallback Providers: Automatic failover to backup providers when primary providers are unavailable
- Health Monitoring: Automatic health checks with configurable intervals and failure recovery
- Connection Pooling: Efficient connection reuse to minimize latency and resource usage
- Authentication & Authorization: Flexible API key management with per-provider and global authentication
- OAuth Support: Dynamic authentication with shell command execution for Anthropic OAuth tokens
- Protocol Support: Full HTTP/1.1 and HTTP/2 support with automatic protocol negotiation
- WebSocket Support: Transparent WebSocket proxying for both HTTP/1.1 upgrade and HTTP/2 Extended CONNECT (RFC 8441)
- HTTP & HTTPS Support: Run with TLS encryption (HTTPS) or plaintext HTTP for internal networks
- TLS Encryption: Secure communication using rustls with modern cipher suites
- Configuration Management: YAML-based configuration with hot-reload capability (SIGHUP)
- Structured Logging: Comprehensive logging for monitoring and debugging
- Container Ready: Docker support for easy deployment and scaling
- CI/CD Ready: GitHub Actions workflow for E2E testing
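To make the load-balancing and fallback bullets above concrete, here is a minimal Python sketch of weighted selection with fallback. It is illustrative only: the proxy itself is written in Rust and its actual selection algorithm may differ; the provider fields (weight, is_fallback, health state) mirror the configuration described later in this README.

import random

def pick_provider(providers):
    # Prefer healthy, non-fallback providers; use fallback providers only
    # when no primary provider for the host is healthy.
    primaries = [p for p in providers if p["healthy"] and not p.get("is_fallback")]
    candidates = primaries or [p for p in providers if p["healthy"] and p.get("is_fallback")]
    if not candidates:
        raise RuntimeError("no healthy providers for this host")
    # Weighted random choice: a provider with weight 2.0 is picked roughly
    # twice as often as one with weight 1.0.
    weights = [p.get("weight", 1.0) for p in candidates]
    return random.choices(candidates, weights=weights, k=1)[0]

providers = [
    {"endpoint": "api.openai.com", "weight": 1.0, "healthy": True},
    {"endpoint": "api.openai.com", "weight": 2.0, "healthy": True},
    {"endpoint": "api.openai.com", "weight": 1.0, "healthy": True, "is_fallback": True},
]
print(pick_provider(providers)["endpoint"])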
git clone https://github.com/x5iu/openproxy.git
cd openproxy
cargo build --release

# Pull and run the pre-built image
# Port mapping should match your config.yml (https_port/http_port)
docker run -d \
--name openproxy \
-p 443:443 \
-v /path/to/config.yml:/config.yml \
-v /path/to/certificate.pem:/certs/certificate.pem \
-v /path/to/private-key.pem:/certs/private-key.pem \
x5iu/openproxy:latest
# Example with custom ports (e.g., https_port: 8443, http_port: 8080 in config)
docker run -d \
--name openproxy \
-p 8443:8443 \
-p 8080:8080 \
-v /path/to/config.yml:/config.yml \
-v /path/to/certificate.pem:/certs/certificate.pem \
-v /path/to/private-key.pem:/certs/private-key.pem \
x5iu/openproxy:latest

The proxy requires TLS certificates for secure HTTPS communication. You need two files:
- certificate.pem - Your TLS certificate (public key)
- private-key.pem - Your TLS private key
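For local testing you can generate a self-signed pair yourself. The sketch below uses the Python cryptography package and is only one way to do it; production deployments should use certificates issued for the hostnames your clients put in the Host header.

# Generate a self-signed certificate.pem / private-key.pem pair for local testing.
import datetime
from cryptography import x509
from cryptography.x509.oid import NameOID
from cryptography.hazmat.primitives import hashes, serialization
from cryptography.hazmat.primitives.asymmetric import rsa

key = rsa.generate_private_key(public_exponent=65537, key_size=2048)
name = x509.Name([x509.NameAttribute(NameOID.COMMON_NAME, "localhost")])
cert = (
    x509.CertificateBuilder()
    .subject_name(name)
    .issuer_name(name)  # self-signed: issuer == subject
    .public_key(key.public_key())
    .serial_number(x509.random_serial_number())
    .not_valid_before(datetime.datetime.utcnow())
    .not_valid_after(datetime.datetime.utcnow() + datetime.timedelta(days=365))
    .add_extension(x509.SubjectAlternativeName([x509.DNSName("localhost")]), critical=False)
    .sign(key, hashes.SHA256())
)
with open("certificate.pem", "wb") as f:
    f.write(cert.public_bytes(serialization.Encoding.PEM))
with open("private-key.pem", "wb") as f:
    f.write(key.private_bytes(
        serialization.Encoding.PEM,
        serialization.PrivateFormat.TraditionalOpenSSL,
        serialization.NoEncryption(),
    ))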
Create a config.yml file with your LLM provider configurations:
# TLS Configuration (required for HTTPS)
cert_file: "/certs/certificate.pem"
private_key_file: "/certs/private-key.pem"
# Port Configuration
# - https_port: HTTPS with TLS (requires cert_file and private_key_file)
# - http_port: Plain HTTP without TLS (HTTP/1.1 only, useful for internal networks)
# At least one port must be configured. Both can be enabled simultaneously.
https_port: 443
# https_bind_address: "0.0.0.0" # Bind address for HTTPS (default: 0.0.0.0)
# http_port: 8080 # Uncomment to enable HTTP
# http_bind_address: "127.0.0.1" # Bind address for HTTP (default: 0.0.0.0)
# Global Authentication Keys
auth_keys:
- "client-api-key-1"
- "client-api-key-2"
# Health Check Configuration
health_check:
  enabled: true
  interval: 60  # seconds
# HTTP Configuration
# http_max_header_size: 8192 # Maximum HTTP header size in bytes (default: 4096, min: 1024, max: 1MB)
# Provider Configuration
providers:
# OpenAI Configuration
- type: "openai"
host: "openai.example.com" # Client uses "Host: openai.example.com" header to route to this provider
endpoint: "api.openai.com" # Actual OpenAI API endpoint
port: 443
tls: true
weight: 1.0
api_key: "sk-your-openai-api-key"
health_check:
method: "GET"
path: "/v1/models"
# Gemini Configuration
- type: "gemini"
host: "gemini.example.com" # Client uses "Host: gemini.example.com" header to route to this provider
endpoint: "generativelanguage.googleapis.com" # Actual Google API endpoint
port: 443
tls: true
weight: 1.5
api_key: "your-gemini-api-key"
health_check:
method: "GET"
path: "/v1beta/models"
# Anthropic Configuration (Standard API Key)
- type: "anthropic"
host: "anthropic.example.com" # Client uses "Host: anthropic.example.com" header to route to this provider
endpoint: "api.anthropic.com" # Actual Anthropic API endpoint
port: 443
tls: true
weight: 1.2
api_key: "sk-ant-your-anthropic-api-key"
health_check:
method: "POST"
path: "/v1/messages"
body: '{"model":"claude-3-haiku-20240307","max_tokens":1,"messages":[{"role":"user","content":"ping"}]}'
headers:
- "Content-Type: application/json"
- "anthropic-version: 2023-06-01"
# Anthropic Configuration (OAuth)
# Use the $(command) pattern to execute a shell command that returns the OAuth token
# The proxy will use "Authorization: Bearer <token>" instead of "X-API-Key"
# and automatically add "anthropic-beta: oauth-2025-04-20" header
- type: "anthropic"
host: "anthropic-oauth.example.com"
endpoint: "api.anthropic.com"
api_key: "$(cat /path/to/oauth-token.txt)" # Or any command that outputs the token
# api_key: "$(aws secretsmanager get-secret-value --secret-id anthropic-token --query SecretString --output text)"
# Multiple API Keys for Load Distribution
- type: "openai"
host: "openai-backup.example.com" # Different host name for separate routing
endpoint: "api.openai.com"
api_keys:
- key: "sk-key-1"
weight: 1.0
- key: "sk-key-2"
weight: 2.0
auth_keys:
- "provider-specific-auth-key"
# Fallback Provider
# Fallback providers are used only when no healthy non-fallback providers are available.
# Useful for backup/failover scenarios.
- type: "openai"
host: "openai.example.com" # Same host as primary provider
endpoint: "api.openai.com"
api_key: "sk-backup-key"
is_fallback: true # Mark as fallback provider# Basic usage (HTTPS on configured port)
./openproxy start -c config.yml
# With PID file (useful for systemd integration)
./openproxy start -c config.yml -p /var/run/openproxy.pid

For internal networks or development, you can run without TLS:
# config.yml - HTTP only (no TLS certificates needed)
http_port: 8080
auth_keys:
- "your-auth-key"
providers:
- type: "openai"
host: "localhost:8080"
endpoint: "api.openai.com"
api_key: "sk-your-openai-api-key"The proxy server routes requests based on the Host header:
# OpenAI API request
curl -X POST https://localhost/v1/chat/completions \
-H "Host: openai.example.com" \
-H "Authorization: Bearer client-api-key-1" \
-H "Content-Type: application/json" \
-d '{"model": "gpt-3.5-turbo", "messages": [{"role": "user", "content": "Hello!"}]}'
# Gemini API request
curl -X POST https://localhost/v1/models/gemini-pro:generateContent \
-H "Host: gemini.example.com" \
-H "x-goog-api-key: client-api-key-1" \
-H "Content-Type: application/json" \
-d '{"contents": [{"parts": [{"text": "Hello!"}]}]}'
# Anthropic API request
curl -X POST https://localhost/v1/messages \
-H "Host: anthropic.example.com" \
-H "X-API-Key: client-api-key-1" \
-H "Content-Type: application/json" \
-d '{"model": "claude-3-haiku-20240307", "max_tokens": 100, "messages": [{"role": "user", "content": "Hello!"}]}'The proxy transparently handles WebSocket connections, including OpenAI's Realtime API:
import websocket
# Connect to OpenAI Realtime API through the proxy
ws = websocket.create_connection(
"wss://localhost/v1/realtime?model=gpt-4o-realtime-preview-2024-10-01",
header=[
"Host: openai.example.com",
"Authorization: Bearer client-api-key-1",
"OpenAI-Beta: realtime=v1"
]
)
# Send and receive messages
ws.send('{"type": "response.create", "response": {"modalities": ["text"]}}')
result = ws.recv()
print(result)
ws.close()

- Concurrent Requests: Handles thousands of concurrent connections
- Low Latency: Connection pooling minimizes request overhead
- Memory Efficient: Streaming request/response processing
- CPU Optimized: Async I/O with minimal thread overhead
- TLS 1.3: Modern encryption standards with forward secrecy
- API Key Validation: Multi-layer authentication system
- No Key Logging: Secure handling of sensitive credentials
- Constant-Time Comparison: API key validation uses constant-time comparison to prevent timing attacks
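The idea behind the constant-time check can be illustrated with Python's standard library; this is only an illustration, not the proxy's Rust code:

import hmac

def keys_match(presented: str, expected: str) -> bool:
    # hmac.compare_digest takes time proportional to the length of the inputs,
    # not to the position of the first mismatch, so response timing does not
    # reveal how many leading characters of the key were guessed correctly.
    return hmac.compare_digest(presented.encode(), expected.encode())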
- SIGTERM/SIGINT: Graceful shutdown
- SIGHUP: Reload configuration without restart
- SIGUSR2: Hot upgrade (spawn new process with same arguments, then graceful shutdown)
The hot upgrade feature allows zero-downtime binary upgrades (a conceptual sketch follows the steps below). When SIGUSR2 is received, the proxy:
- Spawns a new process with the same arguments
- Waits for the new process to start and bind to the port
- Gracefully shuts down the old process
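A conceptual sketch of this handover, in Python rather than the proxy's Rust code; graceful_shutdown is a hypothetical placeholder, and a real implementation would use an explicit readiness handshake instead of a fixed delay:

import signal
import subprocess
import sys
import time

def graceful_shutdown():
    # Hypothetical placeholder: stop accepting new connections, drain
    # in-flight requests, then exit the old process.
    sys.exit(0)

def handle_sigusr2(signum, frame):
    # 1. Spawn a replacement process with the same arguments
    #    (sys.argv[0] is assumed to be the path of the running binary/script).
    child = subprocess.Popen(sys.argv)
    # 2. Wait for the new process to start and bind its port. A fixed sleep is
    #    illustration only; a readiness signal from the child is better.
    time.sleep(2.0)
    if child.poll() is not None:
        return  # the new process exited immediately; keep serving with the old one
    # 3. Gracefully shut down the old process.
    graceful_shutdown()

signal.signal(signal.SIGUSR2, handle_sigusr2)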
# Trigger hot upgrade
kill -USR2 $(cat /var/run/openproxy.pid)

For production deployments, you can use systemd to manage the proxy. The --pid-file option enables proper process tracking during hot upgrades.
Create /etc/systemd/system/openproxy.service:
[Unit]
Description=OpenProxy - High-Performance LLM Proxy Server
After=network.target
[Service]
Type=simple
ExecStart=/usr/local/bin/openproxy start -c /etc/openproxy/config.yml -p /var/run/openproxy.pid
ExecReload=/bin/kill -HUP $MAINPID
PIDFile=/var/run/openproxy.pid
Restart=on-failure
RestartSec=5
[Install]
WantedBy=multi-user.target

Manage the service:
# Enable and start
sudo systemctl enable openproxy
sudo systemctl start openproxy
# Reload configuration (SIGHUP)
sudo systemctl reload openproxy
# Hot upgrade (after replacing the binary)
sudo kill -USR2 $(cat /var/run/openproxy.pid)
# Check status
sudo systemctl status openproxy

# Run all tests
cargo test
# Run a specific test
cargo test test_name

The project includes end-to-end tests that run via GitHub Actions. The E2E tests verify the proxy works correctly with real API calls.
To run E2E tests locally:
cd e2e
pip install openai pydantic "httpx[http2]" websockets websocket-client pyyaml
# Run HTTP/HTTPS tests
python test_https.py
python test_http.py
# Run WebSocket tests
python test_websocket.py
# Run host path routing tests
python test_host_path.py
# Run Anthropic OAuth tests
python test_anthropic_oauth.py
# Run OpenAI Realtime API test (requires OPENAI_API_KEY)
OPENAI_API_KEY=sk-xxx python test_openai_realtime.py --direct

This project is licensed under the MIT License - see the LICENSE file for details.