An intelligent document processing application that automatically extracts structured business data from invoices, receipts, PDFs, images, and spreadsheets using OCR and Google Gemini AI.
The system transforms unstructured business documents into normalized Invoice, Product, and Customer entities for validation, analytics, and business reporting.
Frontend: live
GitHub Repository: https://github.com/your-username/invoice-intelligence
Organizations process large volumes of invoices and business documents daily. Manual data entry is time-consuming, error-prone, and difficult to scale.
This application automates extraction using OCR and Google Gemini AI, converting raw business documents into structured data that can be searched, validated, edited, analyzed, and exported.
- PDF Upload Support
- Image Upload Support
- Excel Upload Support
- Multiple File Upload
- Mixed File Upload Processing
- Multi-Page Document Processing
- OCR-Based Text Extraction
- Google Gemini 2.5 Flash Integration
- Structured JSON Generation
- Entity Recognition
- Customer Extraction
- Product Extraction
- Invoice Extraction
- Search Invoices
- Sort Invoices
- Edit Missing Fields
- CSV Export
- Missing Field Highlighting
- Search Products
- Sort Products
- Inline Editing
- Price With Tax Calculation
- CSV Export
- Search Customers
- Sort Customers
- Inline Editing
- Total Purchase Amount Calculation
- CSV Export
- Total Revenue
- Total Tax Collected
- Average Invoice Value
- Total Invoices
- Total Products
- Total Customers
- Top Customer
- Top Product
- Recent Activity Summary
- Validation Summary
- Missing Field Detection
- Missing Field Highlighting
- User Corrections
- Redux State Synchronization
- Multi-Document Merging
- Deduplication
- Normalized Data Models
- Redux Toolkit State Management
- Invalid File Validation
- Unsupported File Detection
- Gemini API Retry Logic
- Graceful Failure Handling
- User-Friendly Error Messages
- React
- Vite
- Redux Toolkit
- Tailwind CSS
- Google Gemini 2.5 Flash
- Redux Toolkit
- ESLint
- Prettier
- Git
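The feature list above mentions Gemini API retry logic and graceful failure handling without showing how they work. A minimal sketch of one common approach, retrying with exponential backoff, is given below. The names `withRetry` and `backoffDelayMs`, the default of three attempts, and the 500 ms base delay are assumptions made for illustration, not the project's actual code.

```typescript
// Hypothetical helper: delay before the next try, doubling after each failure.
// attempt is 1-based, so attempt 1 waits baseDelayMs, attempt 2 waits twice that.
function backoffDelayMs(attempt: number, baseDelayMs: number): number {
  return baseDelayMs * 2 ** (attempt - 1);
}

// Hypothetical wrapper: runs fn, retrying on any thrown error.
// After maxAttempts failures the last error is rethrown so the caller
// can show a user-friendly message instead of crashing.
async function withRetry<T>(
  fn: () => Promise<T>,
  maxAttempts = 3,
  baseDelayMs = 500,
): Promise<T> {
  let lastError: unknown;
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    try {
      return await fn();
    } catch (err) {
      lastError = err;
      if (attempt < maxAttempts) {
        const delay = backoffDelayMs(attempt, baseDelayMs);
        await new Promise<void>((resolve) => setTimeout(resolve, delay));
      }
    }
  }
  throw lastError;
}
```

A caller would wrap whatever function issues the Gemini request, for example `withRetry(() => extractFromDocument(text))`, where `extractFromDocument` is a placeholder name, and catch the final error to display a message in the UI.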
Document Upload ↓ File Processing ↓ OCR Extraction ↓ Google Gemini AI ↓ Structured JSON ↓ Validation ↓ Deduplication ↓ Redux Store ↓ Analytics + Tables
1. Upload PDF, Image, or Excel documents.
2. Raw content extraction.
3. OCR processing.
4. Google Gemini AI extraction.
5. JSON normalization.
6. Deduplication.
7. Redux state management.
8. Visualization and analytics.
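The final analytics step can be sketched as a pure function over the stored invoices. The example below computes a few of the dashboard figures named earlier (Total Revenue, Total Tax Collected, Average Invoice Value, Top Customer). The `InvoiceRecord` shape, its field names, and the choice to treat revenue as the sum of invoice totals are assumptions for illustration only.

```typescript
// Hypothetical invoice shape; the real model may use different field names.
interface InvoiceRecord {
  invoiceId: string;
  customerName: string;
  totalAmount: number; // assumed to be the invoice total
  taxAmount: number;
}

interface Summary {
  totalInvoices: number;
  totalRevenue: number;
  totalTax: number;
  averageInvoiceValue: number;
  topCustomer: string | null;
}

function summarize(invoices: InvoiceRecord[]): Summary {
  let totalRevenue = 0;
  let totalTax = 0;
  const spendByCustomer: Record<string, number> = {};

  for (const inv of invoices) {
    totalRevenue += inv.totalAmount;
    totalTax += inv.taxAmount;
    spendByCustomer[inv.customerName] =
      (spendByCustomer[inv.customerName] ?? 0) + inv.totalAmount;
  }

  // The top customer is the one with the highest summed invoice total.
  let topCustomer: string | null = null;
  let topSpend = -Infinity;
  for (const name of Object.keys(spendByCustomer)) {
    if (spendByCustomer[name] > topSpend) {
      topSpend = spendByCustomer[name];
      topCustomer = name;
    }
  }

  return {
    totalInvoices: invoices.length,
    totalRevenue,
    totalTax,
    // Avoid dividing by zero when no invoices have been processed yet.
    averageInvoiceValue: invoices.length > 0 ? totalRevenue / invoices.length : 0,
    topCustomer,
  };
}
```

Keeping the calculation free of side effects means it could run inside a Redux selector, so the dashboard recomputes whenever a user corrects a record.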
{
  "invoices": [],
  "products": [],
  "customers": []
}

{
  "customer_id": "ABC Pvt Ltd"
}

{
  "product_id": "IPHONE16"
}

Priority:
- Invoice ID
Priority:
- Product ID
Fallback:
- Product Name
Priority:
- GSTIN
- Phone Number
- Customer Name
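The priority lists above suggest that each record is identified by the first available field in a fixed order. A minimal sketch of that idea follows. The helper names (`pickKey`, `customerKey`, `productKey`, `dedupe`), the field names, and the trim-and-lowercase normalization are all assumptions, not the project's confirmed implementation.

```typescript
// Hypothetical entity shapes; every identifying field may be missing.
interface Customer { customerName?: string; phone?: string; gstin?: string; }
interface Product { productId?: string; productName?: string; }

// Returns a key built from the first non-empty candidate, in priority order.
// The field name is part of the key so a phone number never matches a name.
function pickKey(candidates: [string, string | undefined][]): string | null {
  for (const [field, value] of candidates) {
    const cleaned = (value ?? "").trim().toLowerCase();
    if (cleaned) return `${field}:${cleaned}`;
  }
  return null;
}

// Customer priority: GSTIN, then Phone Number, then Customer Name.
function customerKey(c: Customer): string | null {
  return pickKey([["gstin", c.gstin], ["phone", c.phone], ["name", c.customerName]]);
}

// Product priority: Product ID, with Product Name as the fallback.
function productKey(p: Product): string | null {
  return pickKey([["id", p.productId], ["name", p.productName]]);
}

// Keeps the first record seen for each key. Records with no usable key
// are kept as-is, since they cannot be safely matched to anything.
function dedupe<T>(records: T[], keyOf: (r: T) => string | null): T[] {
  const seen: Record<string, boolean> = {};
  const result: T[] = [];
  for (const r of records) {
    const key = keyOf(r);
    if (key === null) { result.push(r); continue; }
    if (!seen[key]) { seen[key] = true; result.push(r); }
  }
  return result;
}
```

One limitation of this sketch is that two records describing the same customer will not merge if one carries a GSTIN and the other only a name, because they resolve to different keys.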
The system automatically detects:
- Missing Customer Name
- Missing Phone Number
- Missing GSTIN
- Missing Product Name
- Missing Invoice Date
Users can edit records directly from the UI.
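Missing-field detection of this kind can be sketched as a check of each record against a list of expected fields. In the example below, the field names and the `findMissingFields` helper are hypothetical; the rule that null, undefined, and blank strings all count as missing is also an assumption.

```typescript
// Fields checked in this sketch, mirroring the list above.
const REQUIRED_FIELDS = [
  "customerName",
  "phone",
  "gstin",
  "productName",
  "invoiceDate",
] as const;

type RequiredField = typeof REQUIRED_FIELDS[number];

// A record may omit any field or carry null for it.
type ExtractedRecord = Partial<Record<RequiredField, string | null>>;

// Returns the names of fields that are absent, null, or blank,
// in the same order as REQUIRED_FIELDS.
function findMissingFields(record: ExtractedRecord): RequiredField[] {
  return REQUIRED_FIELDS.filter((field) => {
    const value = record[field];
    return value == null || value.trim() === "";
  });
}
```

The returned list could drive the highlighting in the tables, and an inline edit would simply rerun the check on the updated record.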
Objective: Process PDF invoices and extract structured business entities.
Result:
- Invoice Extraction Successful
- Product Extraction Successful
- Customer Extraction Successful
Status: ✅ Passed
Objective: Process PDF and Image documents together.
Result:
- Multi-Format Processing Successful
- Data Merging Successful
Status: ✅ Passed
Objective: Extract records from Excel documents.
Result:
- Structured Data Extracted Successfully
Status: ✅ Passed
Objective: Merge data from multiple Excel files.
Result:
- Multi-File Processing Successful
- Deduplication Successful
Status: ✅ Passed
Objective: Process PDFs, Images, and Excel files together.
Result:
- Unified Extraction Pipeline Successful
- Validation Successful
- Analytics Generated Successfully
Status: ✅ Passed
All screenshots and testing evidence are included in:
screenshots.zip
The screenshot archive contains:
- Case-1 Results
- Case-2 Results
- Case-3 Results
- Case-4 Results
- Case-5 Results
- Analytics Dashboard
- Validation Summary
- Customer Management
- Product Management
- Invoice Management
git clone <repository-url>
cd invoice-intelligence

npm install

VITE_GEMINI_API_KEY=your_api_key

npm run dev

- Authentication
- Database Integration
- Excel Export
- Cloud Storage
- Approval Workflow
- Batch Processing
- Role-Based Access Control
Boda Vamshi Kumar
AI-powered document intelligence platform for invoice processing, business entity extraction, validation, analytics, and reporting.