thyphan2025/thyphan2025

About Me

  • 👋 Hi, I’m @thyphan2025
  • 👀 I’m interested in AI & Machine Learning.
  • 🌱 I’m currently pursuing a Master of Science in Data Analytics Engineering at George Mason University.
  • 😄 Pronouns: she/her
  • ⚡ Fun fact: I love exploring different cultures, especially their amazing foods.
  • ⭐ Motivational quote: "I have no special talents. I am only passionately curious." - Albert Einstein

Currently Working On

  • Data Analytics Project (Capstone)
  • Building small passion projects to explore data workflows and new tools
  • Reading Designing Machine Learning Systems by Chip Huyen
  • Reading Machine Learning Systems by Prof. Vijay Janapa Reddi - Harvard University
  • Reading Fairness and Machine Learning by Solon Barocas, Moritz Hardt, Arvind Narayanan
  • Starting MLOps Zoomcamp course

⭐ Highlighted Projects

🔹 Bridge Material & Design Analysis (Feb 2026)

Python, PySpark, Databricks

  • Cleaned and reshaped a multi-state bridge dataset to examine material and design patterns and applied association rule mining to identify recurring relationships.

→ Bridge-Material-and-Design-Analysis


🔹 Influenza Surveillance Dashboard (Oct 2025)

Power BI

  • Explored multi-season influenza data to monitor trends, subtype distribution, and outbreak severity through an interactive dashboard.

→ Influenza Surveillance Dashboard Chicago


🔹 Air Quality Analysis (Jul 2025)

R, Time-Series Analysis, Interactive Plot, Forecasting

  • Cleaned and analyzed multi-year air quality data to examine environmental risk patterns and forecast ozone trends using an ARIMA model.
  • Published an interactive HTML report with code, Plotly visualizations, and a few static plots.

→ New York Air Quality Analysis


🔹 Electric Vehicle Analysis (Jun 2025)

Python, Data Analysis, Machine Learning

  • Analyzed electric vehicle adoption data to examine growth trends, geographic distribution, and vehicle characteristics across regions.
  • Trained a decision tree model to classify vehicles as Battery Electric Vehicles (BEVs) or Plug-in Hybrid Electric Vehicles (PHEVs).
  • Used the Synthetic Minority Oversampling Technique (SMOTE) to address class imbalance.

→ Electric-Vehicle-Analysis


🔹 Education in Danger Analysis (Dec 2024)

Python, SQL, R, NLP

  • Cleaned and analyzed global incident data to identify geographic hotspots, severity patterns, and recurring risk signals affecting education infrastructure.
  • Applied natural language processing (NLP) to extract sentiment and patterns from incident descriptions.

→ Education-in-Danger-Incidents

πŸ“ Other Projects

Bridge Damage Prediction (Group Project): PySpark ML workflow (notebook-based)

Python, PySpark, Spark MLlib, Databricks

  • Contributed code to the PySpark modeling workflow in Databricks, including feature engineering and model evaluation with Spark MLlib.

→ Bridge-Damage-Prediction
