Machine Learning Projects

A collection of machine learning projects exploring regression, classification, and clustering techniques applied to real-world datasets.

Projects

1. Student Performance Prediction

Supervised Learning · Linear Regression

Predicts a student's final grade (G3) based on study habits and prior academic performance.

Features used: Study time, number of absences, first period grade (G1), second period grade (G2)

Pipeline

Load and preprocess dataset (semicolon-delimited)
Feature selection and train/test split
Feature scaling with StandardScaler
Model training with Linear Regression
Evaluation using MAE and RMSE

Metrics

Mean Absolute Error   (MAE)  : X.XX
Root Mean Squared Error (RMSE) : X.XX

2. Fraud Detection

Supervised Learning · Logistic Regression

Identifies fraudulent credit card transactions from a heavily imbalanced dataset.

Dataset — Credit Card Fraud Detection via Kaggle
Download from: https://www.kaggle.com/datasets/kartik2112/fraud-detection
Place fraudTrain.csv and fraudTest.csv in the project root before running.

Pipeline

Load pre-split train and test sets
Select numerical features and apply feature scaling
Train Logistic Regression with class_weight='balanced'
Evaluate with confusion matrix and classification report

Metrics

Confusion Matrix
Classification Report (Precision · Recall · F1-Score)

3. Customer Segmentation

Unsupervised Learning · K-Means Clustering

Groups customers into distinct segments based on age, income, and spending behavior.

Features used: Age, Annual Income, Spending Score

Pipeline

Load and explore customer dataset
Feature scaling for distance-based clustering
K-Means clustering to assign segment labels
Export results to clustered_customers.csv

Output

Cluster distribution across all customers
Sample records with assigned cluster labels
Saved: clustered_customers.csv

Tech Stack

Tool	Purpose
Python 3.x	Core language
pandas	Data loading and manipulation
scikit-learn	Model training and evaluation
numpy	Numerical operations

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
customer-segmentation		customer-segmentation
fraud-detection		fraud-detection
student-performance-prediction		student-performance-prediction
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Machine Learning Projects

Projects

1. Student Performance Prediction

2. Fraud Detection

3. Customer Segmentation

Tech Stack

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Machine Learning Projects

Projects

1. Student Performance Prediction

2. Fraud Detection

3. Customer Segmentation

Tech Stack

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages