Skip to content

Harshit-Soni78/VisionMATE

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

38 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

VisionMATE: Assistive Technology for Visually Impaired

Status License


A comprehensive web application designed to assist visually impaired students with features like object detection, text-to-speech, speech-to-text, and task management using voice commands.

📝 Table of Contents

🧐 Problem Statement

Visually impaired students face significant challenges in accessing educational materials, managing tasks, and interacting with their environment. Traditional learning materials and tools often lack accessibility features, creating barriers to education and daily activities.

💡 Solution

VisionMATE provides an integrated platform with multiple assistive features:

  • Real-time object detection for environmental awareness
  • Text-to-Speech conversion for reading documents
  • Speech-to-Text for note-taking and communication
  • Voice-controlled navigation
  • Task management system with voice commands

✨ Features

  • Object Detection using TensorFlow.js and COCO-SSD model
  • Text-to-Speech functionality for document reading
  • Speech-to-Text for live captioning
  • Voice-controlled navigation throughout the application
  • Interactive To-Do list with voice commands
  • User authentication system
  • Responsive and accessible interface

🏗 System Architecture

  1. Frontend Layer:

    • React.js based user interface
    • TensorFlow.js for object detection
    • Web Speech API integration
  2. Backend Layer:

    • Node.js/Express server
    • MongoDB database
    • RESTful API architecture

⚡ Technology Stack

Frontend:

  • React.js
  • TensorFlow.js
  • Web Speech API
  • Styled Components
  • Tailwind CSS
  • Framer Motion

Backend:

  • Node.js
  • Express.js
  • MongoDB
  • JWT Authentication

📋 Prerequisites

  • Node.js (v14 or higher)
  • MongoDB
  • Modern web browser
  • Webcam
  • Microphone

🔧 Installation

  1. Clone the repository:
git clone https://github.com/akshitjain16/VisionMATE.git
  1. Install server dependencies:
cd server
npm install
  1. Install client dependencies:
cd client
npm install
  1. Start the server:
cd server
npm start
  1. Start the client:
cd client
npm start

🎈 Usage

  1. Sign up/Login to access the application
  2. Use voice commands for navigation:
    • "Go to home"
    • "Go to object detection"
    • "Go to text to speech"
    • "Go to todo list"
  3. Access different features through the intuitive interface
  4. Use voice commands or manual controls to interact with each feature

👥 Contributors

📝 License

This project is licensed under the MIT License - see the LICENSE.md file for details

About

This Project is shown by us in SITH 2k25.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • JavaScript 98.0%
  • Other 2.0%