cyberdragon55k/Avi

Avi-Automated-Visual-Intelligence

Avi 🤖 is an AI-powered system that transforms raw screen recordings into classified, per-frame insights. By integrating FFmpeg, Firebase, and the Google Gemini API, Avi automates screen capture, frame extraction, and AI-based visual analysis, making it easy to understand what was happening on screen, when, and why.


📚 Table of Contents

  1. Overview
  2. Features
  3. Architecture & Workflow
  4. Technologies Used
  5. Installation & Setup
  6. Usage Guide
  7. Recommendations
  8. Contributing
  9. License

📌 Overview

Avi (Automated Visual Intelligence) simplifies monitoring screen usage. It captures screen activity, splits the video into frames, uploads them to Firebase, and analyzes them using Google's Gemini Vision API. Finally, it displays the analysis in a Streamlit dashboard, giving you an overview of which applications were in use and which actions were taken over time.
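As a rough illustration of the "over time" overview the dashboard is meant to give, the sketch below counts how many analyzed frames fall under each label. The record fields (`timestamp`, `label`) and the sample values are hypothetical; the actual structure of Avi's analysis output may differ.

```python
from collections import Counter


def summarize_labels(records):
    """Count how many analyzed frames fall under each label."""
    return Counter(record["label"] for record in records)


# Hypothetical records: the field names and values are assumptions made
# for this sketch, not Avi's documented output format.
records = [
    {"timestamp": 0, "label": "code editor"},
    {"timestamp": 5, "label": "web browser"},
    {"timestamp": 10, "label": "code editor"},
]

summary = summarize_labels(records)
for label, count in summary.most_common():
    print(f"{label}: {count} frame(s)")
```

A count per label is about the simplest aggregate a dashboard could chart; grouping by time window would follow the same pattern.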


✨ Features

  • 🔴 Screen recording via FFmpeg
  • 🖼️ Frame extraction at set intervals
  • ☁️ Upload frames to Firebase Storage
  • 🧠 Frame classification using Gemini Vision API
  • 📊 Insightful dashboard powered by Streamlit
  • 📝 Activity logging in JSON format
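
Frame extraction at set intervals can be sketched with FFmpeg's `fps` video filter, driven from Python. This is a minimal sketch, not Avi's actual script: the function names, the default 5-second interval, and the `frame_%04d.png` naming pattern are assumptions, and `ffmpeg` is assumed to be on the `PATH`.

```python
import subprocess
from pathlib import Path


def build_extract_command(video_path, out_dir, interval_seconds=5):
    """Build an FFmpeg command that writes one frame every `interval_seconds`."""
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    # Output naming pattern is an assumption for this sketch.
    pattern = str(Path(out_dir) / "frame_%04d.png")
    # fps=1/N tells FFmpeg to emit one frame every N seconds.
    return [
        "ffmpeg", "-i", str(video_path),
        "-vf", f"fps=1/{interval_seconds}",
        pattern,
    ]


def extract_frames(video_path, out_dir, interval_seconds=5):
    """Run FFmpeg and return the extracted frame paths in order."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    command = build_extract_command(video_path, out_dir, interval_seconds)
    subprocess.run(command, check=True)
    return sorted(Path(out_dir).glob("frame_*.png"))


# Show the command without running it.
print(" ".join(build_extract_command("recording.mp4", "frames", 10)))
```

Keeping command construction separate from execution makes the interval logic easy to check without FFmpeg installed.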

🛠 Technologies Used

  • FFmpeg: Captures desktop screen recordings
  • Python: Automates frame extraction, upload, and classification
  • OpenCV (optional): For custom frame manipulation
  • Firebase: Stores the extracted frames and, optionally, their metadata
  • Google Gemini API: Provides intelligent image descriptions
  • Streamlit: Creates an interactive dashboard to display analysis
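
For the FFmpeg recording step, the capture input depends on the operating system. The sketch below builds a recording command using FFmpeg's `x11grab`, `gdigrab`, and `avfoundation` input devices. The device strings (`:0.0`, `desktop`, `1`), the frame rate, and the function name are assumptions for illustration; Avi's own recording command may differ, and the right device on a given machine should be checked locally.

```python
import platform

# Capture device and source per OS. These are common defaults, assumed
# here for illustration. On macOS the screen device index varies; list
# devices with: ffmpeg -f avfoundation -list_devices true -i ""
_CAPTURE_INPUTS = {
    "Linux": ("x11grab", ":0.0"),
    "Windows": ("gdigrab", "desktop"),
    "Darwin": ("avfoundation", "1"),
}


def build_record_command(output_path, duration_seconds, framerate=15, system=None):
    """Build an FFmpeg screen-recording command for the given OS."""
    system = system or platform.system()
    if system not in _CAPTURE_INPUTS:
        raise ValueError(f"Unsupported platform: {system}")
    device, source = _CAPTURE_INPUTS[system]
    return [
        "ffmpeg",
        "-f", device,                  # capture device for this OS
        "-framerate", str(framerate),  # input option, so it precedes -i
        "-i", source,
        "-t", str(duration_seconds),   # stop after this many seconds
        str(output_path),
    ]


print(" ".join(build_record_command("recording.mp4", 60, system="Linux")))
```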

🔄 Architecture & Workflow

  A[Screen Recording (FFmpeg)] --> B[Extract Frames (Python)]
  B --> C[Upload to Firebase Storage]
  C --> D[Classify with Gemini API]
  D --> E[Save to frame_analysis.json]
  E --> F[Visualize in Streamlit Dashboard]
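
The upload, classify, and save stages of the workflow above can be sketched as one loop. This is an outline under stated assumptions, not Avi's implementation: `upload` and `classify` are injected callables standing in for the Firebase Storage and Gemini API calls, the record fields are hypothetical, and passing the uploaded URL to the classifier (rather than the local file) is a guess. Only the `frame_analysis.json` file name comes from the diagram.

```python
import json
import tempfile
from pathlib import Path


def run_pipeline(frame_paths, upload, classify, log_path="frame_analysis.json"):
    """Upload each frame, classify it, and write all results to a JSON log."""
    results = []
    for frame in frame_paths:
        url = upload(frame)          # stand-in for a Firebase Storage upload
        description = classify(url)  # stand-in for a Gemini API request
        # Record fields are assumptions made for this sketch.
        results.append({"frame": str(frame), "url": url, "description": description})
    Path(log_path).write_text(json.dumps(results, indent=2), encoding="utf-8")
    return results


# Hypothetical stand-ins so the sketch runs without credentials.
def fake_upload(frame):
    return f"https://example.invalid/frames/{Path(frame).name}"


def fake_classify(url):
    return f"placeholder description for {url.rsplit('/', 1)[-1]}"


with tempfile.TemporaryDirectory() as tmp:
    log_file = Path(tmp) / "frame_analysis.json"
    demo = run_pipeline(["frame_0001.png", "frame_0002.png"],
                        fake_upload, fake_classify, log_path=log_file)
    print(log_file.read_text(encoding="utf-8"))
```

Injecting the two service calls keeps the control flow testable offline; the real functions would wrap the Firebase and Gemini client libraries.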


Install the Python dependencies listed in requirements.txt:

  pip install -r requirements.txt

About

Illuminate your screen habits.
