πΊπΈ English π°π· νκ΅μ΄
A desktop app that mass-generates images and videos β via Google Flow login or your own Gemini / Veo API key β and exports them to CapCut / Premiere Pro / Vrew projects in one click.
Still building AI videos one shot at a time?
AutoFlowCut automates the entire AI video production pipeline. Generate images and videos β via Google Flow login (free) or your own Gemini / Veo API key (BYOK) β then convert them into ready-to-edit projects for CapCut, Adobe Premiere Pro, or Vrew. Import your script, generate visuals, pick the best media per scene, and export in a single click.
Choose how you generate β switch anytime from the top toggle:
- Flow Login mode β Generate via Google Flow login. Beginner-friendly, free generation on a (relatively cheap) subscription. Slower (30+ min per 100 images). Just sign in with a Google account.
- API Key mode (BYOK) β Generate directly with your own Gemini / other API key. For power users and bulk / fast work. Pay-as-you-go (no free tier yet). Fast (~2β5 min per 100 images).
- Batch image generation β Generate 100+ AI images in a single batch via the official Google GenAI APIs, with automatic retry on errors.
- T2V (Text-to-Video) β Generate video clips from text prompts (Veo 3.1).
- I2V (Image-to-Video) β Convert generated images into videos.
- Per-scene media selection β Automatically pick the best media among Image / T2V / I2V (priority: I2V > T2V > Image).
- Character / background / style references β Tag-based auto-matching keeps 200+ scenes visually consistent.
- 87 style presets β Animation, photography, film, and 8 more categories.
- Auto-injected style prompts β Reference styles are automatically merged into generation prompts.
- Multi-editor export β One-click export to CapCut, Adobe Premiere Pro (
.prproj), or Vrew (.vrew). - Complete projects β Timeline, media, subtitles, and Ken Burns animation included.
- Direct project write β Writes the project file straight to disk (no manual ZIP / import step).
- Auto-launch + install guidance β Opens the target editor after export; if it isn't installed, you're pointed to the download (or store) page.
- SRT subtitles β Multilingual subtitles included in export.
- Audio package import β Automatic detection of narration, voice, and SFX files.
- SRT timecode matching β Audio placement aligned to subtitle timing.
- Multi-track timeline β Image / subtitle / narration / voice / SFX tracks with horizontal zoom, scrub, and playhead.
- Expandable groups β Voice / SFX collapse by character / category; expand sub-tracks to see individual file rows.
- File row mini-markers β Each file row shows a colored bar at its clip's position so timing context is preserved when scrolling.
- Drag-to-adjust timecode β Drag any voice / SFX clip to nudge its timing; persisted to
.audio_overrides.jsonwithout touching original files. - Resizable lanes β Per-track height and label-column width are draggable, persisted to localStorage.
- Preview panel β Current playhead position shows the matched scene image + SRT subtitle.
- Keyboard shortcuts β
Spaceplay/pause,Escstop and rewind. - Audio review system β Flag unsuitable audio with reason and bulk-clean tracks.
- Built-in MCP server β Edit scenes / references / prompts directly from Claude Code.
- HTTP API bridge β Integrate with external tools (port 3210).
- Skill system β Install and manage Claude Code skills; auto-installed on the app's first run.
- Story Engine v2 β End-to-end 9-wave pipeline from script to export.
/story-newβ Initialize an episode + discuss the topic./story-executeβ Run W1βW9 automatically (sub-agents + review loops, two user gates at W3/W7)./story-stepβ Run the next single wave only and exit. Manual mode β no in-wave prompts; the user reviews each wave's deliverables and re-invokes for the next./story-nextβ Resume after interruption (delegates to/story-execute)./story-rewriteβ Improve an existing episode (engagement-gap diagnosis β fork β partial wave re-run).
- Full-width desktop workspace β Focused React shell for batch generation and export.
- Multilingual UI β Korean, English.
- Project management β Manage multiple projects independently, backed by
project.json. - Flexible input formats β Import TXT, CSV, and SRT files.
| Category | Technology |
|---|---|
| Frontend | React 18 + Vite 6 |
| Desktop | Electron 36 |
| AI Engine | Google Flow (login) + Google Gemini / Veo APIs (BYOK) |
| Backend | Firebase (Auth, Firestore, Cloud Functions) |
| MCP | @modelcontextprotocol/sdk |
| Payments | Lemon Squeezy |
| Build | electron-builder (DMG, ZIP, NSIS, APPX) |
| Testing | Vitest |
Electron BrowserWindow
βββ [React Shell] β Full-width desktop app (Shell.jsx)
β βββ Header β project picker, Export, Settings
β βββ PromptInput β prompt entry
β βββ SceneList β scene list (image / video / subtitle)
β βββ ReferencePanel β reference management
β βββ AudioPanel β audio / SFX import
β βββ StatusBar β generation progress
β
βββ [GenAI IPC] β Gemini / Veo BYOK calls from the main process
β βββ Local encrypted API key storage + REST client
β
βββ [MCP Server] β stdio + HTTP (port 3210)
βββ Scene / reference / style / audio management tools
| Namespace | Role | File |
|---|---|---|
fs:* |
File I/O | electron/ipc/filesystem.js |
genai:* |
Google GenAI API (image / video generation) | electron/ipc/genai-api.js |
capcut:* |
CapCut path detection, project write, app launch | electron/ipc/capcut.js |
auth:* |
Google OAuth | electron/ipc/auth.js |
Phase 1: Submit β Submit video requests with a sliding in-flight concurrency window
Phase 2: Poll β Poll all generationIds in parallel (up to 20 min)
Phase 3: Download β Download + save completed videos sequentially
AutoFlowCut/
βββ electron/ # Electron main process
β βββ main.js # Main process, menus, IPC registration
β βββ preload.js # Context bridge (window.electronAPI)
β βββ ipc/ # IPC handlers
β βββ filesystem.js # File I/O
β βββ genai-api.js # Google GenAI API (image / video generation)
β βββ capcut.js # CapCut path detection, project write
β βββ auth.js # Google OAuth
β βββ layout.js # Window/layout IPC
β βββ mcp.js # MCP HTTP server bridge
β βββ googleApiError.js # Google API error formatting
β
βββ src/ # React frontend
β βββ App.jsx # Main app logic
β βββ Shell.jsx # Full-width app shell
β βββ components/ # UI components (35+)
β β βββ Header.jsx
β β βββ SceneList.jsx
β β βββ ReferencePanel.jsx
β β βββ AudioPanel.jsx
β β βββ ExportModal.jsx
β β βββ SettingsModal.jsx
β β βββ SceneDetailModal.jsx
β β βββ VideoDetailModal.jsx
β β βββ ...
β βββ hooks/ # React hooks (15+)
β β βββ useGenAPI.js # Google GenAI API wrapper (image, video)
β β βββ useAutomation.js # Batch image generation pipeline
β β βββ useVideoAutomation.js # Video generation (3-phase async)
β β βββ useSceneGeneration.js # Per-scene regeneration
β β βββ useReferenceGeneration.js # Reference generation
β β βββ useGenerationQueue.js # Unified generation queue
β β βββ useExport.js # CapCut export
β β βββ useAudioImport.js # Audio import + SRT matching
β β βββ useScenes.js # Scene state management
β β βββ useProjectData.js # project.json management
β β βββ ...
β βββ exporters/ # CapCut JSON generation + disk write
β βββ firebase/ # Auth, Firestore, Cloud Functions
β βββ contexts/ # AuthContext
β βββ config/ # Defaults, style presets (87)
β βββ locales/ # ko, en
β βββ utils/ # Utilities (parsers, tag matching, ...)
β βββ stripe/ # Payments (Lemon Squeezy)
β
βββ mcp-server/ # MCP server (Claude Code integration)
β βββ index.js # Scene / reference / style / audio / skill tools
β
βββ skills/ # Claude Code skills
β βββ story-engine/ # Story Engine v2 (9-wave pipeline)
β βββ story-new/ # /story-new episode init
β βββ story-execute/ # /story-execute W1βW9 auto-runner
β βββ story-next/ # /story-next resume
β βββ story-step/ # /story-step single-wave manual runner
β βββ story-rewrite/ # /story-rewrite episode improvement
βββ docs/ # Documentation (schemas, store descriptions, ...)
βββ tests/ # Vitest unit + integration tests (mirrors src/)
βββ scripts/ # Build helpers (electron name patch, build-number bump, ...)
βββ assets/ # App icons (icon.icns, icon.png)
βββ public/ # Static assets (style thumbnails)
βββ vite.config.js
βββ package.json
- Node.js 18+
- npm
- Google account + Google API key (Gemini / Veo access)
- CapCut desktop app
git clone https://github.com/touchizen/AutoFlowCut.git
cd AutoFlowCut
npm install# Dev mode (test environment β uses _test GCF)
npm run dev
# Dev mode (prod environment β uses _prod GCF)
npm run dev:prod# macOS distribution (DMG + ZIP, code-signed + notarized)
npm run dist:mac
# Windows distribution (NSIS + ZIP + APPX, Certum OV code-signed)
npm run dist:win
# Windows individual targets
npm run dist:win:nsis # For website download (.exe, code-signed)
npm run dist:win:appx # For MS Store (.appx)
# Linux distribution (AppImage + deb)
npm run dist:linux
# Test-environment distribution
npm run dist:test:mac
npm run dist:test:win
# Packaging test (no installers)
npm run packWindows code signing: If SimplySign Desktop is connected during build, the binary is signed automatically (Certum OV certificate).
Build artifacts are written to the release/ directory.
npm test # Watch mode
npm run test:run # One-shot run
npm run test:coverage # Coverage reportTests live in tests/ and mirror the src/ directory layout (Vitest + jsdom + @testing-library/react).
Cloud Functions are deployed with _test / _prod suffixes.
| Script | Environment | GCF suffix |
|---|---|---|
npm run dev |
test | _test |
npm run dev:prod |
prod | _prod |
npm run dist:mac / dist:win |
prod | _prod |
npm run dist:test:mac / dist:test:win |
test | _test |
- Sign in to AutoFlowCut, then add your Google API key in Settings.
- Enter prompts (type directly, or import from TXT / CSV / SRT).
- Configure reference images (character, background, style tags).
- Click Generate Images to start batch generation with your Google API key.
- (Optional) Generate videos via T2V or I2V.
- Click Export β export requires an active AutoFlowCut trial or subscription, then writes the CapCut project to disk and launches CapCut automatically.
Edit AutoFlowCut scenes / references / prompts directly from Claude Code.
| Tool | Description |
|---|---|
load_csv |
Load a CSV file + images |
list_scenes / get_scene |
Query scenes |
update_prompt / batch_update_prompts |
Edit prompts |
list_references / update_reference_prompt |
Manage references |
list_problem_scenes |
Filter problem scenes |
list_styles |
Browse style presets |
export_capcut |
Export to CapCut |
install_skill / list_skills |
Manage skills |
get_progress |
Read W_progress.json (sub-agents write it directly via file I/O) |
app_generate_scene / app_start_scene_batch |
App-integrated generation |
When the HTTP server is enabled in settings (default port 3210):
GET /api/current-project β Current project state
GET /api/scenes β Query scenes
GET /api/references β Query references
POST /api/update β Update scenes or references
POST /api/generate-scene β Trigger one scene image generation
POST /api/generate-reference β Trigger one reference image generation
POST /api/start-scene-batch β Start scene batch generation
POST /api/start-ref-batch β Start reference batch generation
GET /api/batch-status β Query batch progress
- macOS / Windows: GitHub Releases
- Windows (MS Store): Microsoft Store
- Homepage: touchizen.com
- YouTube: @touchizen
- Discord: touchizen
- Contact: gordon.ahn@touchizen.com
We actively reward community contributions to AutoFlowCut.
Find a bug in the app, fix it, and open a PR. Once it's reviewed and merged, you receive usage credits sized to the impact of your contribution β from a minimum of 10 generations up to 1 year of unlimited use.
Built a plugin that adds a revenue model? Open a PR. We review and merge well-built, secure plugins that fit the project, and work out the details with you.
Open a PR or start a discussion on Discord. All contributions are accepted under AGPL v3 β see CONTRIBUTING.md. Reward amounts are granted at the maintainer's discretion based on the quality and impact of each contribution.
Copyright (C) 2026 Touchizen
This program is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License v3 as published by the Free Software Foundation. See LICENSE for the full text.
What this means: you are free to use, modify, and self-host AutoFlowCut. If you distribute a modified version β including running it as a hosted network service β your changes must also be released under AGPL v3.
Contributions are accepted under the same AGPL v3 license β see CONTRIBUTING.md.
Disclaimer: This app is an independent product developed by Touchizen and is not affiliated with, endorsed by, or sponsored by Google or ByteDance (CapCut).