Skip to content

Add SDV privacy classification project#494

Open
tanayaharley31 wants to merge 1 commit into
gpsaggese:masterfrom
tanayaharley31:UmdTask391_DATA605_Spring2026_Synthetic_Data_Vault
Open

Add SDV privacy classification project#494
tanayaharley31 wants to merge 1 commit into
gpsaggese:masterfrom
tanayaharley31:UmdTask391_DATA605_Spring2026_Synthetic_Data_Vault

Conversation

@tanayaharley31
Copy link
Copy Markdown

Summary

This PR adds my DATA605 Spring 2026 project on Synthetic Data Vault privacy classification.

Project contents

  • Main SDV project notebook
  • API tutorial notebook
  • README documentation
  • Docker build/run scripts
  • Project check script
  • Utility functions
  • Model comparison outputs

Main workflow

The project uses the Adult Income dataset to generate synthetic data using SDV. It evaluates synthetic data quality with SDMetrics and compares classification models trained on real data versus synthetic data.

Docker

The Docker run script executes run_project_check.py and verifies the required files and output results.

Issue

Closes #391

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

DATA605_Spring2026_YData_profiling

1 participant