🎓 Student Performance Prediction System

A machine learning web application that predicts student exam scores based on various academic and personal factors. Built with Streamlit and Random Forest Regression.

Developed by: Ahmad Omar

📊 Dataset

This project uses a merged student performance dataset from Zenodo.

Dataset Information:

Total Records: 14,003 students
Total Features: 14 features
Target Variable: Exam Score (0-100 scale)
File Format: CSV (merged_dataset.csv)

🎯 What is Exam Score?

The model predicts a student's Exam Score on a scale of 0-100, representing their expected performance on an exam.

Exam Score Scale Interpretation:

90-100 = Excellent (نایاب)
80-89 = Very Good (زۆر باش)
70-79 = Good (باش)
60-69 = Average (مامناوەند)
Below 60 = Needs Improvement (پێویستی بە باشترکردن هەیە)

Pass/Fail Criteria:

60 or above = Pass (تێپەڕ)
Below 60 = Fail (شکست)

🔍 Features (14 Factors)

The model uses 14 factors to predict student performance, organized into the following categories:

👤 Demographic Information (4 factors)

Gender - Student's gender (Male/Female)
Age - Student's age
Learning Style - Preferred learning method (Visual, Auditory, Kinesthetic, Reading/Writing)
Motivation - Motivation level (Low, Medium, High)

📖 Study Behaviors & Engagement (6 factors)

Study Hours - Hours studied per week
Attendance - Attendance rate (%)
Assignment Completion - Assignment completion rate (%)
Online Courses - Number of online courses taken
Discussions - Participation in discussions (Yes/No)
Extracurricular - Engagement in extracurricular activities (Yes/No)

💻 Resources & Technology (4 factors)

Resources - Resource access level (Low, Medium, High)
Internet - Internet access availability (Yes/No)
EduTech - Use of educational technology (Yes/No)
Stress Level - Stress level (Low, Medium, High)

🌍 Language Support

The application supports two languages:

🇬🇧 English - Full English interface
🇹🇯 کوردی (Kurdish) - Complete Kurdish translation with RTL support

Users can switch between languages using the navigation buttons at the top of the page.

🚀 Installation

Clone the repository

git clone https://github.com/a7x3a/StudentPerformanceApp.git
cd StudentPerformanceApp

Create a virtual environment
```
python -m venv venv
```

Activate virtual environment

Windows:
```
venv\Scripts\activate
```
Linux/Mac:
```
source venv/bin/activate
```

Install dependencies
```
pip install -r requirements.txt
```
Download the dataset
- Ensure merged_dataset.csv is in the project root directory
- The dataset should contain 14 feature columns plus ExamScore and FinalGrade

📝 Usage

1. Train the Model

python train.py

This will:

Load the dataset from merged_dataset.csv
Train a Random Forest Regressor
Save the model as model.pkl
Display training and testing R² scores

Note: All features in the dataset are already numeric, so no encoding is required.

2. Run the Web Application

streamlit run app.py

The application will open in your default web browser at http://localhost:8501

3. Make Predictions

Select your preferred language (English or Kurdish)
Fill in all the required fields in the form:
- Demographic Information (Gender, Age, Learning Style, Motivation)
- Study Behaviors & Engagement (Study Hours, Attendance, Assignments, etc.)
- Resources & Technology (Resources, Internet, EduTech, Stress Level)
Click "🔮 Predict Exam Score" (or "🔮 پێشبینی نمرەی تاقیکردنەوە" in Kurdish)
View the predicted exam score (0-100) along with performance level and pass/fail status

📁 Project Structure

StudentPerformanceApp/
│
├── app.py                          # Streamlit web application
├── train.py                        # Model training script
├── model.pkl                       # Trained Random Forest model
├── merged_dataset.csv              # Dataset file (CSV format)
├── requirements.txt                # Python dependencies
├── README.md                       # This file
└── venv/                           # Virtual environment (not in git)

🛠️ Technologies Used

Python 3.x
Streamlit - Web application framework
Pandas - Data manipulation
Scikit-learn - Machine learning (Random Forest Regressor)
Joblib - Model serialization

🙏 Acknowledgments

Developer: Ahmad Omar (a7x3a.dev)

Note: Make sure the merged_dataset.csv file is in the project directory and the model has been trained (model.pkl exists) before running the application.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎓 Student Performance Prediction System

📊 Dataset

🎯 What is Exam Score?

Exam Score Scale Interpretation:

Pass/Fail Criteria:

🔍 Features (14 Factors)

👤 Demographic Information (4 factors)

📖 Study Behaviors & Engagement (6 factors)

💻 Resources & Technology (4 factors)

🌍 Language Support

🚀 Installation

📝 Usage

1. Train the Model

2. Run the Web Application

3. Make Predictions

📁 Project Structure

🛠️ Technologies Used

🙏 Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.devcontainer		.devcontainer
pages		pages
.gitignore		.gitignore
README.md		README.md
app.py		app.py
merged_dataset.csv		merged_dataset.csv
model.pkl		model.pkl
requirements.txt		requirements.txt
train.py		train.py

Folders and files

Latest commit

History

Repository files navigation

🎓 Student Performance Prediction System

📊 Dataset

🎯 What is Exam Score?

Exam Score Scale Interpretation:

Pass/Fail Criteria:

🔍 Features (14 Factors)

👤 Demographic Information (4 factors)

📖 Study Behaviors & Engagement (6 factors)

💻 Resources & Technology (4 factors)

🌍 Language Support

🚀 Installation

📝 Usage

1. Train the Model

2. Run the Web Application

3. Make Predictions

📁 Project Structure

🛠️ Technologies Used

🙏 Acknowledgments

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages