Skip to content

feat: implement K-Means and Hierarchical clustering with evaluation o…#2

Open
codermiki wants to merge 1 commit intosoftwareWCU:mainfrom
codermiki:main
Open

feat: implement K-Means and Hierarchical clustering with evaluation o…#2
codermiki wants to merge 1 commit intosoftwareWCU:mainfrom
codermiki:main

Conversation

@codermiki
Copy link
Copy Markdown

Overview

This pull request introduces an unsupervised machine learning analysis using the wine quality (red and white) dataset. The work focuses on clustering wines based on physicochemical properties without using quality labels during training.

Key Changes

  • Added full data preprocessing and feature scaling workflow
  • Implemented K-Means clustering with optimal K selection using the Elbow Method
  • Evaluated K-Means clustering using Silhouette Score
  • Implemented Hierarchical Clustering with dendrogram visualization
  • Provided post-cluster interpretation using wine quality for validation

Algorithms Used

  • K-Means Clustering
  • Hierarchical Clustering (Ward linkage)

Outcome

The clustering results show meaningful separation of wines based on chemical composition, with clusters exhibiting distinct average quality levels, validating the effectiveness of the unsupervised approach.

Notes

  • The quality column was excluded from training and used only for post-analysis
  • All features were standardized to ensure fair distance calculations

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant