AI Image Caption Pro

Drag a folder of photos. Get IPTC captions and keywords written directly into every file.

Drop a folder of RAW, JPEG, or PSD images onto the window. The app sends each photo to an AI vision model, generates a caption and keyword set, and writes them back as IPTC/XMP metadata — the fields Lightroom, Capture One, and Photo Mechanic read natively.

Backends

Backend	Setup
Ollama (local, no API key)	`brew install ollama && ollama pull qwen2.5vl:7b`
Google Gemini	API key from aistudio.google.com
Anthropic Claude	API key from console.anthropic.com
OpenAI GPT-4o	API key from platform.openai.com

Quick start

brew install exiftool
pip install -r requirements.txt
python main.py

What it writes

IPTC:Caption-Abstract — AI caption, appended to any existing caption
IPTC:Keywords + XMP-dc:Subject — deduplicated, merged with existing keywords
XMP-lr:HierarchicalSubject — for Lightroom keyword hierarchy
XMP:Description, XMP:Creator, XMP:Rights — mirrored from identity settings
XMP-iptcExt:AltTextAccessibility — for CMS / screen readers
XMP sidecar .xmp alongside RAW files — Photo Mechanic reads instantly, no Cmd+R needed

Supported formats

CR3 · CR2 · ARW · NEF · NRW · DNG · RAF · ORF · RW2 · PSD · PSB · JPG · JPEG

RAW and PSD files have their embedded preview extracted for the AI — original pixel data is never touched.

AI brief

Drop a context.md into any shoot folder and the app includes it in every caption prompt for that folder. Set a global brief in Settings for your overall photography style and vocabulary.

Requirements

macOS or Linux
Python 3.11+
ExifTool (brew install exiftool / apt install libimage-exiftool-perl)
Ollama (optional — only for local AI backend)

Recommended local models: qwen2.5vl:7b, minicpm-v — both are strong vision models optimised for image understanding at modest hardware requirements.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.github/workflows		.github/workflows
app		app
assets		assets
bin		bin
.gitignore		.gitignore
AIImageCaptionPro.spec		AIImageCaptionPro.spec
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
build_linux.sh		build_linux.sh
build_mac.sh		build_mac.sh
dev.sh		dev.sh
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Image Caption Pro

Backends

Quick start

What it writes

Supported formats

AI brief

Requirements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI Image Caption Pro

Backends

Quick start

What it writes

Supported formats

AI brief

Requirements

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages