Realtime HUD

A Heads-Up Display powered by the OpenAI Realtime API (gpt-realtime-1.5).

It analyzes your screen captures and audio streams in real time and surfaces supplementary information, insights, and confirmations directly in your HUD.

Features

🖥 Screen capture – select any monitor or window to analyze (single or multi-monitor)
🎙 Microphone audio – include your voice for full context
🔊 System/computer audio – capture what's playing on your computer
💡 AI HUD display – real-time insights streamed as they are generated
⌨ Steerable – collapsible text input to direct the AI when needed
🔒 Privacy first – clear recording indicator, instant stop, no data storage
⤢ Pop-out display – open the HUD in a second window for multi-monitor setups
🔄 Provider-agnostic – OpenAI now; designed for future offline/private models
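
The streamed-insight behavior in the list above can be pictured as a small reducer that appends text fragments to the insight currently being generated. This is only a sketch under assumptions: the `HudEvent` shape, the `Insight` type, and the `applyEvent` function are hypothetical names for illustration and are not taken from this repository or from the Realtime API's actual event names.

```typescript
// Hypothetical, simplified event shape; real Realtime API events differ.
type HudEvent =
  | { kind: "delta"; insightId: string; text: string }
  | { kind: "done"; insightId: string };

interface Insight {
  id: string;
  text: string;
  complete: boolean;
}

// Returns a new list with the event applied. The input list is not mutated,
// so the result can be stored directly as UI state.
function applyEvent(insights: readonly Insight[], event: HudEvent): Insight[] {
  const existing = insights.find((i) => i.id === event.insightId);
  if (!existing) {
    // A "done" event for an unknown insight has nothing to complete.
    if (event.kind === "done") return [...insights];
    return [...insights, { id: event.insightId, text: event.text, complete: false }];
  }
  return insights.map((i) => {
    if (i.id !== event.insightId) return i;
    return event.kind === "delta"
      ? { ...i, text: i.text + event.text }
      : { ...i, complete: true };
  });
}
```

Keeping the update pure makes it straightforward to render partial text as fragments arrive and to mark an insight finished once its stream ends.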

Privacy

Your API key stays on the server (kept in the server's .env file and never sent to the browser)
Screen and audio data are forwarded to OpenAI through the local proxy – nothing is stored on the server
A prominent Recording badge indicates when the session is active
You can stop the session at any time

Quick Start

1. Prerequisites

Node.js 18 or later
An OpenAI API key with Realtime API access

2. Install

git clone https://github.com/hack-r/realtime-hud.git
cd realtime-hud
npm install

3. Configure

cp .env.example .env
# Edit .env and add your OPENAI_API_KEY

4. Run (development)

npm run dev

Open http://localhost:5173 in your browser.

5. Build for production

npm run build
npm start

Usage

Click Select Screen to Capture and choose a monitor or window
Optionally enable Microphone and/or System Audio
Click Start AI Session
AI insights appear in the right panel as your screen is analyzed
Use ⤢ Pop Out to move the HUD display to a second monitor
Click ▼ Steer AI to open the text input for directing the AI
Click Stop Session to end recording
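
The capture choices in the steps above map naturally onto the constraints a browser expects. The sketch below is an assumption about how that mapping could look, not the repository's code: `CaptureToggles`, `CapturePlan`, and `planCapture` are hypothetical names.

```typescript
// The two optional inputs a user can enable before starting a session.
interface CaptureToggles {
  microphone: boolean;
  systemAudio: boolean;
}

interface CapturePlan {
  // Constraints for screen capture: video is always on, audio only on request.
  display: { video: true; audio: boolean };
  // Constraints for microphone capture, or null when the microphone is off.
  microphone: { audio: true } | null;
}

// Translates the UI toggles into media constraints without touching any
// browser API, so the mapping can be tested outside a browser.
function planCapture(toggles: CaptureToggles): CapturePlan {
  return {
    display: { video: true, audio: toggles.systemAudio },
    microphone: toggles.microphone ? { audio: true } : null,
  };
}
```

In a browser, `plan.display` could be passed to `navigator.mediaDevices.getDisplayMedia` and a non-null `plan.microphone` to `navigator.mediaDevices.getUserMedia`. Support for system audio capture varies by browser and operating system.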

Architecture

Browser (React + Vite)
  └─ WebSocket ──▶ Node.js / Express proxy
                       └─ WebSocket ──▶ OpenAI Realtime API
                                           (gpt-realtime-1.5)

The API key is stored server-side only. The browser connects to a local WebSocket proxy that forwards traffic to OpenAI.
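
To illustrate why the key can stay server-side, here is a minimal sketch of how a proxy might prepare its upstream connection. The function name `buildUpstreamRequest` and the `UpstreamRequest` type are hypothetical; the endpoint URL and the `OpenAI-Beta` header reflect OpenAI's published Realtime documentation as best understood here and should be checked against the current API reference.

```typescript
interface UpstreamRequest {
  url: string;
  headers: Record<string, string>;
}

// Builds the URL and headers for the server-to-OpenAI WebSocket.
// Only the server calls this, so the key never appears in browser traffic.
function buildUpstreamRequest(
  apiKey: string,
  model: string,
  iface: "ga" | "beta",
): UpstreamRequest {
  if (!apiKey) throw new Error("OPENAI_API_KEY is required");
  const url = `wss://api.openai.com/v1/realtime?model=${encodeURIComponent(model)}`;
  const headers: Record<string, string> = { Authorization: `Bearer ${apiKey}` };
  // Assumption: the beta interface needs an opt-in header, the GA one does not.
  if (iface === "beta") headers["OpenAI-Beta"] = "realtime=v1";
  return { url, headers };
}
```

A server could pass the result to a WebSocket client library and then relay messages in both directions between the browser socket and the upstream socket.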

Provider Roadmap

The AIProvider interface (src/types/index.ts) is designed for swappability:

| Status | Provider |
| --- | --- |
| ✅ | OpenAI Realtime (`gpt-realtime-1.5`) |
| 🗓 | Other cloud providers (Gemini Live, etc.) |
| 🗓 | Fully offline/private local model |
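
As a rough picture of what a swappable provider could look like, the sketch below defines a hypothetical `AIProvider` shape and a stand-in implementation. The real interface in src/types/index.ts is not shown in this README and may differ; every method name here, and the `EchoProvider` class, is an assumption for illustration.

```typescript
// Hypothetical shape; the actual AIProvider in src/types/index.ts may differ.
interface AIProvider {
  readonly name: string;
  connect(): Promise<void>;
  sendText(text: string): void;
  onInsight(listener: (text: string) => void): void;
  disconnect(): void;
}

// Stand-in provider that echoes steering text back as an "insight".
// It needs no network access, which makes it handy for offline testing.
class EchoProvider implements AIProvider {
  readonly name = "echo";
  private listeners: Array<(text: string) => void> = [];
  private connected = false;

  async connect(): Promise<void> {
    this.connected = true;
  }

  sendText(text: string): void {
    if (!this.connected) throw new Error("connect() must be called first");
    for (const listener of this.listeners) listener(`echo: ${text}`);
  }

  onInsight(listener: (text: string) => void): void {
    this.listeners.push(listener);
  }

  disconnect(): void {
    this.connected = false;
    this.listeners = [];
  }
}
```

Code written against the interface rather than a concrete class could switch from a cloud provider to a local model by constructing a different implementation.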

Configuration

| Variable | Default | Description |
| --- | --- | --- |
| `OPENAI_API_KEY` | (required) | Your OpenAI API key |
| `PORT` | `3001` | Server port |
| `OPENAI_MODEL` | `gpt-realtime-1.5` | Realtime model to use |
| `OPENAI_REALTIME_INTERFACE` | `ga` | Realtime protocol interface (`ga` or `beta`) |
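
The variables and defaults in the table above could be read and validated along these lines. This is a sketch, not the server's actual startup code: `HudConfig` and `loadConfig` are hypothetical names, and the validation rules (port range, trimming the key) are assumptions.

```typescript
type RealtimeInterface = "ga" | "beta";

interface HudConfig {
  apiKey: string;
  port: number;
  model: string;
  realtimeInterface: RealtimeInterface;
}

// Reads settings from an environment-like object, applying the documented
// defaults and rejecting values that cannot work.
function loadConfig(env: Record<string, string | undefined>): HudConfig {
  const apiKey = env["OPENAI_API_KEY"]?.trim();
  if (!apiKey) throw new Error("OPENAI_API_KEY is required");

  const port = Number(env["PORT"] ?? "3001");
  if (!Number.isInteger(port) || port < 1 || port > 65535) {
    throw new Error(`Invalid PORT: ${env["PORT"]}`);
  }

  const iface = env["OPENAI_REALTIME_INTERFACE"] ?? "ga";
  if (iface !== "ga" && iface !== "beta") {
    throw new Error(`OPENAI_REALTIME_INTERFACE must be "ga" or "beta", got "${iface}"`);
  }

  return {
    apiKey,
    port,
    model: env["OPENAI_MODEL"] ?? "gpt-realtime-1.5",
    realtimeInterface: iface,
  };
}
```

On the server this would typically be called once at startup with `process.env`, so that a missing key or a mistyped interface name fails immediately rather than on the first session.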
