GraphReel

Turn Google Drive files into explainer videos, powered by LangGraph and Vertex AI.

Paste Drive file or folder URLs → the app walks the hierarchy, extracts text, summarizes each file in parallel, then synthesizes everything into:

Narrative Briefing — flowing prose designed for reading aloud as a video script
Structured Digest — organized headers and bullets for quick reference
Explainer Video — short video with AI-generated imagery, TTS audio, and dynamic overlays.

You can also paste in public URLs for additional context.

Overview

GraphReel is a production-ready pipeline that transforms collections of Drive documents into cohesive narratives. It's built on LangGraph (for orchestration), Google Vertex AI (for summarization and content generation), and Streamlit (for the web UI).

The pipeline intelligently:

Resolves Drive URLs into files (recursively traversing folders)
Extracts text from Docs, PDFs, and Markdown
Summarizes each file in parallel using LLMs
Synthesizes summaries into narrative prose and structured bullets
Researches topics mentioned in the narrative (optional web search)
Augments with web findings for completeness
Generates video with AI images, narration, and overlays (optional)

Watch the Demo

Watch this video on youtube see a demo of GraphReel as well as more information about the architecture.

Quick Start

Prerequisites

Python 3.11+
Google Cloud project with billing enabled
gcloud CLI installed and authenticated:
```
gcloud auth application-default login
```

1. Enable Google Cloud APIs

In the GCP Console, enable:

Google Drive API
Google Docs API
Vertex AI API
Cloud Text-to-Speech API (for video generation)
Vertex AI Imagen API (for AI image generation)

2. Create OAuth 2.0 Credentials (For Drive Access)

Go to GCP Console → APIs & Services → Credentials
Click Create Credentials → OAuth 2.0 Client ID
Application type: Desktop app
Name: GraphReel Local
Download JSON and save as credentials.json in the project root

3. Configure OAuth Consent Screen

Go to OAuth consent screen
User type: External
Fill app name and contact email
Add scope: https://www.googleapis.com/auth/drive.readonly
Add yourself as a test user
Save (no need to publish)

4. Configure Environment

cp .env.example .env

Edit .env:

GCP_PROJECT=your-project-id
GCP_LOCATION=us-central1

Note: Always use us-central1 (some Gemini models return 404 from other regions).

5. Install & Run

macOS / Linux:

python3 -m venv .venv
source .venv/bin/activate
python3 -m pip install -r requirements.txt
python3 -m streamlit run app.py

Windows (PowerShell):

py -3 -m venv .venv
.\.venv\Scripts\Activate.ps1
py -3 -m pip install -r requirements.txt
py -3 -m streamlit run app.py

On first run, click Connect Google Drive, approve access in your browser, and the app auto-refreshes. Credentials are saved to token.json.

Supported File Types

Type	Extraction Method
Google Docs	Exported as plain text via Drive API
PDF	Parsed with pypdf
Plain text / Markdown	Downloaded directly

Google Sheets, Slides, and unsupported binary files are skipped with warnings.

Documentation

Architecture — Detailed overview of the LangGraph pipeline, state management, and technology stack

Project Structure

GraphReel/
├── app.py              # Streamlit UI (thin layer)
├── pipeline/
│   ├── state.py        # LangGraph TypedDict state definitions
│   ├── prompts.py      # All LLM prompts
│   ├── nodes.py        # LangGraph node functions
│   └── graph.py        # Graph assembly + stream_graph()
├── drive/
│   ├── auth.py         # OAuth flow + token management
│   ├── resolver.py     # URL parsing + recursive folder listing
│   └── extractor.py    # File content extraction
├── credentials.json    # OAuth client secret (git-ignored, you create this)
├── token.json          # OAuth access token (git-ignored, auto-created)
├── .env                # GCP config (git-ignored, you create this)
└── requirements.txt

LangGraph Pipeline

resolve_urls → fetch_files →[Send ×N parallel]→ summarize_file(s) → synthesize → END

Each file is summarized in parallel using LangGraph's Send API (map-reduce pattern). Large files (>6,000 tokens) are automatically chunked before summarization.

Security

Credentials are local-only: credentials.json and token.json are git-ignored and never transmitted
Read-only access: The app only requests drive.readonly scope — no ability to modify or delete files
Token refresh: OAuth tokens auto-refresh; re-authenticate only if you revoke access or delete token.json

License

This project is licensed under the MIT License. See LICENSE for details.

Contributing

Contributions are welcome! Please feel free to submit issues and pull requests.

Acknowledgments

Built with LangGraph for orchestration
Powered by Google Vertex AI for LLMs and image generation
UI built with Streamlit
Video generation with moviepy and Pillow

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GraphReel

Overview

Watch the Demo

Quick Start

Prerequisites

1. Enable Google Cloud APIs

2. Create OAuth 2.0 Credentials (For Drive Access)

3. Configure OAuth Consent Screen

4. Configure Environment

5. Install & Run

Supported File Types

Documentation

Project Structure

LangGraph Pipeline

Security

License

Contributing

Acknowledgments

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.vscode		.vscode
assets		assets
docs		docs
drive		drive
pipeline		pipeline
video		video
web		web
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

GraphReel

Overview

Watch the Demo

Quick Start

Prerequisites

1. Enable Google Cloud APIs

2. Create OAuth 2.0 Credentials (For Drive Access)

3. Configure OAuth Consent Screen

4. Configure Environment

5. Install & Run

Supported File Types

Documentation

Project Structure

LangGraph Pipeline

Security

License

Contributing

Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages