Writing AI that never leaves your Mac.
No cloud. No accounts. No telemetry.
halen.dev · Changelog · Privacy
Halen is a writing assistant that lives in your menubar. It catches angry-sounding messages before you send them. It fixes your typos as you type. It expands shortcuts. It rewrites paragraphs. It drafts replies.
Every model runs on your Mac. Your text never goes to a server.
Download Halen — signed, notarized, free. Apple Silicon, macOS 14 or later.
- Open the DMG.
- Drag Halen to Applications.
- Launch it. Grant Accessibility and Input Monitoring when asked.
Halen updates itself. You'll never need to come back to this page.
| Plugin | What it does |
|---|---|
| ✨ Ask Halen | Press ⌃H. Ask anything. Halen knows what's on screen. |
| ✍️ Writing Assistant | Fixes typos, flags tone & clarity, and finishes your sentences as you type (Tab to accept). |
| 💬 Snippet Expander | Type ;sig, ;today, ;summary, ;reply. Or ⌃⌥R to rewrite a selection. |
| 🎙️ Voice Dictation | Press ⌃⌥Space. Speak. Press again. Apple's on-device transcription. |
| 🪄 Prompt Polish | Press ⌃⌥P. Rewrites the selected prompt in place with word-level edits for modern LLMs. |
Plus add-ons you can install from the Plugin Store: Burnout Copilot (focus suggestions from your calendar and tone history), Meeting Prep (a one-page brief 15 minutes before each meeting), Reasoning Compactor (compacts verbose LLM chain-of-thought on-device to save tokens, ⌃⌥K), and Mother (hardcore, local discipline that keeps you off the apps and sites you blocked — and means it).
Cloud writing tools see everything you type. Halen doesn't. Three reasons that matters:
Privacy. Your half-finished resignation letter, your angry reply, your password reset email — none of it leaves your Mac. Not for processing, not for "improving the model", not for ads.
Speed. Round-trips to OpenAI take a second or two. Halen's classifier is under 100ms warm. Rewrites stream in real time.
Trust. Halen is open source under MIT. You can read every line of code, build it yourself, and run it disconnected from the internet.
Halen picks the best available model on your Mac, automatically:
- Apple Intelligence if you have it (macOS 26+, supported Macs).
- A small local model — Gemma 4 E4B, plus Qwen 2.5 for classification — downloaded once on first use, if you don't.
- Your own Ollama daemon, if you've installed one.
Nothing to configure. The model picker in Settings is there if you want to.
| ⌃H | Ask Halen |
| ⌃⌥R | Rewrite the selected text |
| ⌃⌥E | Draft a reply to the focused email |
| ⌃⌥P | Polish the selected prompt in place |
| ⌃⌥Space | Start dictation |
Halen never sends your text anywhere. Inference is on-device. The only network requests Halen makes are: an update check once a day (Sparkle) and an optional one-time model download from Hugging Face if you opt in. No analytics, no telemetry, no crash reports. Read the full privacy page.
A scripted 1-minute walkthrough is in docs/DEMO.md. The web demo at halen.dev runs the same beats inline in your browser.
Halen is open source. If you want to build it yourself, contribute, or write a plugin:
git clone https://github.com/lukataylo/halen.git
cd halen
./scripts/run-dev.shArchitecture · Plugin protocol · Contributing · Roadmap · Changelog
MIT — see LICENSE. Model weights aren't bundled; they download from Hugging Face under their own licences (Gemma, Qwen).


