Echo-TTS OpenAI Bridge

A high-performance FastAPI bridge that provides OpenAI-compatible TTS endpoints by forwarding requests to RunPod serverless Echo-TTS workers.

This lightweight bridge enables seamless integration with any OpenAI TTS-compatible client while leveraging the cost-efficiency and scalability of RunPod's serverless GPU infrastructure. It forwards requests to RunPod, handles voice mapping/auth/concurrency, and streams audio back to deliver low-latency text-to-speech capabilities.

✨ Features

🔄 OpenAI Compatible: Drop-in replacement for OpenAI's /v1/audio/speech endpoint
☁️ Serverless Architecture: Offloads GPU processing to RunPod workers for efficient scaling
📝 Upstream Text Chunking: Text chunking is handled by the RunPod worker endpoint (bridge forwards input as-is)
🎵 Flexible Voice Mapping: Map OpenAI voices to custom Echo-TTS speaker files
🌊 Streaming Response: Returns audio as a stream for OpenAI client compatibility
🔒 Optional Authentication: Secure the bridge with bearer token authentication
📊 Concurrency Control: Built-in rate limiting to prevent overloading RunPod endpoints
🐳 Docker Ready: Containerized deployment with Docker Compose support

🏗️ Architecture

The Echo-TTS Bridge follows a streamlined architecture designed for high performance and reliability:

Client Layer: Any OpenAI TTS-compatible client sends requests to the bridge
Bridge Layer: FastAPI service validates requests and forwards them to RunPod
Processing Layer: RunPod serverless workers handle TTS generation (including text chunking if needed)
Response Layer: Audio is returned to the client

📊 Data Flow

sequenceDiagram
    participant C as Client
    participant B as Bridge
    participant R as RunPod

    C->>B: POST /v1/audio/speech
    B->>B: Validate request
    B->>R: Submit TTS job
    R->>B: Return audio data
    B->>C: Stream audio

🚀 Quick Start

Prerequisites

Docker & Docker Compose
A RunPod account with Echo-TTS serverless endpoint
(Optional) Nginx Proxy Manager for SSL termination

Installation

Clone the repository

git clone https://github.com/yourusername/echoTTS-OpenAI.git
cd echoTTS-OpenAI

Configure environment

cp .env.example .env
# Edit .env with your RunPod credentials

Set up voice mapping (optional)

# Edit voice_map.json to map OpenAI voices to Echo-TTS files
cp voice_map.json.example voice_map.json

Create Docker network (if using Nginx Proxy Manager)
```
docker network create shared_net
```

Build and run

docker compose build --no-cache
docker compose up -d

Basic Usage

# Test the endpoint
curl -X POST http://localhost:8000/v1/audio/speech \
  -H "Content-Type: application/json" \
  -d '{
    "model": "tts-1",
    "input": "Hello, this is Echo-TTS speaking!",
    "voice": "alloy"
  }' \
  --output speech.mp3

# Python example using OpenAI client
from openai import OpenAI

client = OpenAI(
    api_key="your-key",
    base_url="https://your-domain.com/v1"
)

response = client.audio.speech.create(
    model="tts-1",
    voice="alloy",
    input="The future of AI is here!"
)

response.stream_to_file("output.mp3")

📖 Documentation

API Reference

POST /v1/audio/speech

Generate speech from text using the specified voice.

Request Body:

{
  "model": "tts-1",
  "input": "Text to convert to speech",
  "voice": "alloy",
  "response_format": "mp3",
  "speed": 1.0
}

Parameters:

model (string): Must be "tts-1" for OpenAI compatibility
input (string): Text to convert (max 4096 characters)
voice (string): Voice to use (alloy, echo, fable, onyx, nova, shimmer)
response_format (string): Audio format (mp3, opus, aac, flac, wav)
speed (float): Playback speed (0.25 to 4.0)

Response: Binary audio data in the specified format

GET /health

Health check endpoint.

Response:

{
  "status": "ok"
}

Configuration

Variable	Description	Default
`RUNPOD_ENDPOINT`	RunPod serverless endpoint URL	Required
`RUNPOD_API_KEY`	RunPod API key	Required
`MAX_CONCURRENT_REQUESTS`	Simultaneous RunPod requests	3
`REQUIRE_AUTH`	Enable bearer token auth	False
`BRIDGE_TOKEN`	Authentication token	None
`LOG_LEVEL`	Logging verbosity	INFO

Voice Mapping

Configure which Echo-TTS voice files to use for each OpenAI voice:

Option 1: JSON File (Recommended)

{
  "alloy": "EARS_p004_freeform.mp3",
  "echo": "EARS_p005.mp3",
  "fable": "EARS_p004_freeform.mp3",
  "onyx": "EARS_p005.mp3",
  "nova": "EARS_p004_freeform.mp3",
  "shimmer": "EARS_p005.mp3"
}

Option 2: Environment Variable

VOICE_MAP="alloy:EARS_p004_freeform.mp3,echo:EARS_p005.mp3"

🧪 Testing

# Run all tests
python -m pytest tests/

# Run with coverage
python -m pytest --cov=app tests/

# Run specific test
python -m pytest tests/

🐳 Docker Deployment

Production Deployment

Set up reverse proxy (Nginx Proxy Manager example)
- Domain: tts.yourdomain.com
- Forward to: echotts-openai:8000
- Enable SSL

Environment variables for production

REQUIRE_AUTH=True
BRIDGE_TOKEN=your-secure-token
LOG_LEVEL=WARNING
MAX_CONCURRENT_REQUESTS=5

Deploy

docker compose -f docker-compose.yml -f docker-compose.prod.yml up -d

Health Monitoring

# Check container status
docker compose ps

# View logs
docker compose logs -f echotts-openai

# Health check
curl http://localhost:8000/health

🔧 Development

Local Development

# Setup virtual environment
python3 -m venv venv
source venv/bin/activate  # Linux/Mac
pip install -r requirements.txt

# Set environment variables
export RUNPOD_ENDPOINT=your_endpoint
export RUNPOD_API_KEY=your_key

# Run development server
uvicorn app.main:app --reload --host 0.0.0.0 --port 8000

Code Structure

app/
├── main.py              # FastAPI application and endpoints
├── config.py            # Configuration management
├── audio_processor.py   # RunPod client and audio handling
└── models/
    ├── __init__.py
    └── schemas.py       # Pydantic models

Adding New Voices

Add your voice file to the RunPod worker

Update voice_map.json:

{
  "custom_voice": "your_voice_file.mp3"
}

Restart the service

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Fork the repository
Create your feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add some amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

Development Guidelines

Follow PEP 8 for Python code style
Add tests for new features
Update documentation as needed
Keep PRs focused and atomic

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

OpenAI for the TTS API specification
RunPod for serverless GPU infrastructure
FastAPI for the web framework
Echo-TTS for the underlying TTS model

📞 Support

🐛 Report bugs: GitHub Issues
💬 Discussions: GitHub Discussions
📧 Email: support@yourdomain.com

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.kilocode		.kilocode
app		app
docs/diagrams		docs/diagrams
scripts		scripts
tests		tests
.env.example		.env.example
.gitignore		.gitignore
ADD_VOICE.md		ADD_VOICE.md
Dockerfile		Dockerfile
IMPLEMENTATION-ORDER.md		IMPLEMENTATION-ORDER.md
LINACODEC-BE.md		LINACODEC-BE.md
LINACODEC-FE.md		LINACODEC-FE.md
LINACODEC-MW.md		LINACODEC-MW.md
README.md		README.md
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt
voice_map.json		voice_map.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Echo-TTS OpenAI Bridge

✨ Features

🏗️ Architecture

📊 Data Flow

🚀 Quick Start

Prerequisites

Installation

Basic Usage

📖 Documentation

API Reference

POST /v1/audio/speech

GET /health

Configuration

Voice Mapping

🧪 Testing

🐳 Docker Deployment

Production Deployment

Health Monitoring

🔧 Development

Local Development

Code Structure

Adding New Voices

🤝 Contributing

Development Guidelines

📄 License

🙏 Acknowledgments

📞 Support

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Echo-TTS OpenAI Bridge

✨ Features

🏗️ Architecture

📊 Data Flow

🚀 Quick Start

Prerequisites

Installation

Basic Usage

📖 Documentation

API Reference

POST /v1/audio/speech

GET /health

Configuration

Voice Mapping

🧪 Testing

🐳 Docker Deployment

Production Deployment

Health Monitoring

🔧 Development

Local Development

Code Structure

Adding New Voices

🤝 Contributing

Development Guidelines

📄 License

🙏 Acknowledgments

📞 Support

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages