Skip to content

KimmyCosmos/dimensionality-reduction-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

Dimensionality Reduction: PCA vs t-SNE vs UMAP

Overview

This project compares three dimensionality reduction techniques—PCA, t-SNE, and UMAP—on high-dimensional datasets to evaluate how well each method preserves structure and class separability in low-dimensional embeddings.

Methods

  • PCA (Principal Component Analysis)
  • t-SNE (t-Distributed Stochastic Neighbor Embedding)
  • UMAP (Uniform Manifold Approximation and Projection)

Data

Scikit-learn benchmark datasets:

  • Digits dataset (1797 samples, 10 classes)
  • Breast Cancer dataset (569 samples, binary classification)

Tools

Python, NumPy, scikit-learn, UMAP, Matplotlib, Seaborn

About

Comparison of PCA, t-SNE, and UMAP for visualizing high-dimensional datasets using scikit-learn.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages