VoiceIQ - AI Speech Emotion Recognition Platform

📖 Project Overview

VoiceIQ is an industry-grade, production-ready Speech Emotion Recognition (SER) platform. It leverages Deep Learning (1D CNNs, LSTMs, and Hybrid architectures) and advanced Audio Signal Processing to accurately detect human emotions from raw speech audio.

Designed with a futuristic, glassmorphism-themed Streamlit frontend, VoiceIQ provides a premium SaaS-like experience.

✨ Key Features

Real-Time Inference: Predict emotions live using microphone input or by uploading audio files.
Deep Learning Architectures: Includes CNN, LSTM, and Hybrid CNN-LSTM models.
Explainable AI (XAI): Visualizes feature importance and acoustic contributions.
Advanced Audio Visualizations: Interactive Plotly-based Waveforms, Mel Spectrograms, and Radar Charts.
Automated PDF Reports: Generates downloadable, professional analysis summaries.
Premium UI/UX: Responsive, dark-mode, glassmorphism dashboard built with custom CSS.

🧠 System Architecture

Audio Input (Mic/File) → Preprocessing (Noise Reduction, Silence Trimming) 
→ Feature Extraction (MFCC, Chroma, Mel, Tonnetz) → Scaler 
→ Deep Learning Model (CNN/LSTM) → Output Probabilities 
→ UI Rendering & PDF Report Generation

🛠️ Installation & Setup

Clone the repository:

git clone https://github.com/yourusername/AI_Speech_Emotion_Recognition.git
cd AI_Speech_Emotion_Recognition

Create a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows use: venv\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```
Prepare Datasets (Optional for training):
- Download RAVDESS, TESS, and EMO-DB datasets.
- Extract them into the respective folders inside datasets/.
Run the Application:
```
streamlit run app.py
```

📊 Datasets Supported

The platform is built to handle multiple audio datasets seamlessly:

RAVDESS: Ryerson Audio-Visual Database of Emotional Speech and Song
TESS: Toronto emotional speech set
EMO-DB: Berlin Database of Emotional Speech

🚀 Deployment (Streamlit Cloud / HuggingFace Spaces)

Push this repository to GitHub.
Log into Streamlit Cloud or HuggingFace Spaces.
Connect your repository and select app.py as the entry point.
Add packages.txt if system dependencies (like libsndfile1) are required for librosa in Linux environments.

👨‍💻 Author

Developed by an elite AI Research Engineer for the CodeAlpha Internship Portfolio.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
static		static
templates		templates
.gitignore		.gitignore
README.md		README.md
config.py		config.py
explainability.py		explainability.py
main.py		main.py
predict.py		predict.py
preprocess_audio.py		preprocess_audio.py
report_generator.py		report_generator.py
requirements.txt		requirements.txt
server.py		server.py
train_model.py		train_model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VoiceIQ - AI Speech Emotion Recognition Platform

📖 Project Overview

✨ Key Features

🧠 System Architecture

🛠️ Installation & Setup

📊 Datasets Supported

🚀 Deployment (Streamlit Cloud / HuggingFace Spaces)

👨‍💻 Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

VoiceIQ - AI Speech Emotion Recognition Platform

📖 Project Overview

✨ Key Features

🧠 System Architecture

🛠️ Installation & Setup

📊 Datasets Supported

🚀 Deployment (Streamlit Cloud / HuggingFace Spaces)

👨‍💻 Author

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages