Predicting student placement outcomes using machine learning techniques
This project focuses on building a machine learning classification model to predict whether a student will be placed or not based on academic and experiential factors. Using a cleaned college placement dataset, the project demonstrates a complete ML workflow including data preprocessing, model training, evaluation, and visualization. The goal is to understand how different student attributes influence placement outcomes and to apply Logistic Regression for binary classification.
Python
Pandas β data handling and preprocessing
NumPy β numerical operations
Scikit-learn β model building and evaluation
Matplotlib & Seaborn β data visualization
Jupyter Notebook β experimentation and analysis
Cleaned and preprocessed dataset
Encoding of categorical variables
Feature scaling using StandardScaler
Logistic Regression model implementation
Model evaluation using:
Accuracy score
Confusion Matrix
Classification Report
ROC Curve and AUC
Clear visualizations for better interpretation
Dataset Cleaning
Handled categorical variables (Yes/No β 0/1)
Removed inconsistencies and prepared a cleaned dataset
Feature Selection
Selected relevant features such as CGPA, academic performance, internship experience, and projects completed
Train-Test Split
Split the dataset into training and testing sets with stratification
Model Training
Trained a Logistic Regression model on scaled features
Evaluation
Evaluated the model using accuracy, confusion matrix, classification report, and ROC-AUC
Visualized performance using plots
How to prepare real-world data for machine learning π§Ή
Importance of feature scaling in classification models π
Implementing and evaluating Logistic Regression π

