Hi, I'm Ata Sadruddin Shaikh, a passionate Data Scientist with over 3 years of experience at ITKhidma, building scalable data pipelines and delivering actionable insights. I specialize in Python, SQL, Microsoft Azure, and data visualization (Power BI, Tableau). This repository showcases my projects, skills, and contributions in data science, data engineering, and machine learning.

Connect with me: LinkedIn | Email
- Master of Science in Artificial Intelligence (2022–2024)
- Bachelor of Science in Information Technology (2019–2022)
Data Science Consultant at ITKhidma (May 2022–Present), developing machine learning models and Azure-based data pipelines
- Built an end-to-end Azure data pipeline processing 100,000+ records, improving efficiency by 30%
- Created Power BI dashboards for 10+ clients, reducing reporting time by 40%
- Developed predictive models, boosting marketing campaign effectiveness by 20%
Python, SQL, PySpark, Pandas, NumPy, Power BI, Tableau, Microsoft Azure (Data Factory, Data Lake, Synapse, Databricks), HTML, CSS, JavaScript, Java, Git, GitHub
Problem Solving, Communication, Leadership, Analytical Thinking, Collaboration, Time Management
Data Visualization, Machine Learning, Open-Source Contributions, Traveling, Photography, Reading
- Education: Master of Science in Artificial Intelligence (2022–2024) | Bachelor of Science in Information Technology (2019–2022)
- Current Role: Data Scientist at ITKhidma (May 2022–Present), developing machine learning models and Azure-based data pipelines
- Key Achievements:
  - Built an end-to-end Azure data pipeline processing 100,000+ records, improving efficiency by 30%
  - Created Power BI dashboards for 100+ clients, reducing reporting time by 40%
  - Developed predictive models, boosting marketing campaign effectiveness by 20%
- Technical Skills: Python, SQL, PySpark, Pandas, NumPy, Power BI, Tableau, Microsoft Azure (Data Factory, Data Lake, Synapse, Databricks), HTML, CSS, JavaScript, Java, Git, GitHub
- Soft Skills: Problem Solving, Communication, Leadership, Analytical Thinking, Collaboration, Time Management
- Interests: Data Visualization, Machine Learning, Open-Source Contributions, Traveling, Photography, Reading
- Description: Engineered a scalable data pipeline using Azure Data Factory, Data Lake, and Databricks to process 100,000+ customer records from AdventureWorksLT2022.
- Key Features:
  - Implemented Bronze-Silver-Gold architecture, optimizing query performance by 35%
  - Secured pipeline with Azure Key Vault and RBAC, eliminating credential exposure
  - Built Power BI dashboards for customer demographics, enabling 15% faster decision-making
- Technologies: Azure (Data Factory, Data Lake, Databricks, Synapse), PySpark, SQL, Power BI
- Metrics: Reduced data processing time by 30% for 100,000+ records
- View Project Details
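
For readers unfamiliar with the Bronze-Silver-Gold layering mentioned above, here is a minimal sketch of the idea in plain Python. It is not the project's Azure or Databricks code: the sample rows, field names, and the `to_bronze`, `to_silver`, and `to_gold` helpers are all hypothetical and only illustrate how each layer refines the previous one.

```python
from collections import defaultdict

# Hypothetical raw rows, standing in for records landed from a source system.
RAW_ROWS = [
    {"customer_id": "1", "country": " us ", "amount": "120.50"},
    {"customer_id": "2", "country": "IN", "amount": "80"},
    {"customer_id": "2", "country": "IN", "amount": "80"},  # exact duplicate
    {"customer_id": "3", "country": "in", "amount": "not-a-number"},  # bad value
    {"customer_id": "4", "country": "US", "amount": "30.25"},
]


def to_bronze(rows):
    """Bronze: keep the raw records as they arrived (copied, not modified)."""
    return [dict(row) for row in rows]


def to_silver(bronze):
    """Silver: drop duplicates and unparseable rows, then normalize types."""
    seen = set()
    silver = []
    for row in bronze:
        key = tuple(sorted(row.items()))
        if key in seen:
            continue  # skip exact duplicates
        seen.add(key)
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # skip rows whose amount cannot be parsed
        silver.append({
            "customer_id": int(row["customer_id"]),
            "country": row["country"].strip().upper(),
            "amount": amount,
        })
    return silver


def to_gold(silver):
    """Gold: aggregate cleaned rows into a reporting-ready summary."""
    totals = defaultdict(float)
    for row in silver:
        totals[row["country"]] += row["amount"]
    return dict(totals)


bronze = to_bronze(RAW_ROWS)
silver = to_silver(bronze)
gold = to_gold(silver)
print(gold)
```

In a real pipeline each layer would typically be a separate table or folder in the data lake, so that the raw data stays available for reprocessing while dashboards read only from the aggregated layer.
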
- Description: Conducted EDA on 50,000+ bookstore transactions to identify purchase trends and segment customers.
- Key Features:
  - Built predictive models to improve marketing campaigns by 20%
  - Visualized insights using Tableau, reducing reporting efforts by 25%
- Technologies: SQL, Python (Pandas, NumPy), Tableau
- Metrics: Analyzed 50,000+ transactions, enhanced campaign effectiveness by 20%
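
As a rough illustration of the kind of analysis described above, the sketch below computes a monthly revenue trend and a simple spend-based customer segmentation using only the standard library. The transactions, the thresholds, and the function names are assumptions made for this example; the project itself may have used different features and methods.

```python
from collections import Counter, defaultdict

# Hypothetical transactions: (customer_id, ISO date, amount).
TRANSACTIONS = [
    ("c1", "2024-01-05", 40.0),
    ("c1", "2024-02-11", 60.0),
    ("c2", "2024-01-20", 15.0),
    ("c3", "2024-02-02", 250.0),
    ("c3", "2024-02-18", 50.0),
]


def monthly_revenue(transactions):
    """Sum revenue per YYYY-MM to expose the purchase trend over time."""
    revenue = defaultdict(float)
    for _customer, date, amount in transactions:
        revenue[date[:7]] += amount
    return dict(sorted(revenue.items()))


def segment_customers(transactions, high=200.0, medium=50.0):
    """Label each customer by total spend; the thresholds are illustrative."""
    spend = defaultdict(float)
    for customer, _date, amount in transactions:
        spend[customer] += amount
    segments = {}
    for customer, total in spend.items():
        if total >= high:
            segments[customer] = "high"
        elif total >= medium:
            segments[customer] = "medium"
        else:
            segments[customer] = "low"
    return segments


trend = monthly_revenue(TRANSACTIONS)
segments = segment_customers(TRANSACTIONS)
segment_sizes = Counter(segments.values())
print(trend, segments, dict(segment_sizes))
```

Fixed thresholds keep the example easy to follow; on real data, quantile-based cutoffs or a clustering method would usually be a better fit.
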
- Description: Optimized complex SQL queries and automated data cleaning for large datasets.
- Key Features:
  - Improved data retrieval efficiency by 15%
  - Reduced preprocessing time by 30% with automated scripts
- Technologies: SQL, Python
- Metrics: Enhanced query performance for datasets with 10,000+ rows
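
The two ideas in this project, automated cleaning and query optimization, can be sketched with Python's built-in `sqlite3` module. This is an illustrative example only: the `orders` table, the sample rows, the `clean` and `plan` helpers, and the index name are hypothetical, and the original work may have targeted a different database engine.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
)

# Hypothetical messy input: inconsistent casing, stray spaces, a missing amount.
raw_rows = [("  Alice ", "10.5"), ("BOB", "20"), ("alice", ""), ("Carol", "7.25")]


def clean(row):
    """Normalize the customer name and convert the amount (empty -> NULL)."""
    name, amount = row
    name = name.strip().title()
    amount = float(amount) if amount.strip() else None
    return name, amount


cleaned = [clean(row) for row in raw_rows]
conn.executemany("INSERT INTO orders (customer, amount) VALUES (?, ?)", cleaned)

QUERY = "SELECT SUM(amount) FROM orders WHERE customer = ?"


def plan(connection, sql, params):
    """Return SQLite's query plan description for the given statement."""
    rows = connection.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return " ".join(row[-1] for row in rows)  # last column holds the detail text


plan_before = plan(conn, QUERY, ("Alice",))  # full table scan without an index
conn.execute("CREATE INDEX idx_orders_customer ON orders (customer)")
plan_after = plan(conn, QUERY, ("Alice",))   # lookup through the new index

total = conn.execute(QUERY, ("Alice",)).fetchone()[0]
print(plan_before)
print(plan_after)
print(total)
```

Comparing the plan before and after adding the index is a quick way to confirm that a filter column is actually being used for lookups rather than forcing a scan of every row.
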
- Certified Data Scientist Analytics Specialist
- Relational Databases
- Python Programming Bootcamp
- Career Essentials in Software Development by Microsoft
- Career Essentials in Generative AI by Microsoft
- Introduction to Git & GitHub by Google
- Email: atashaikh2000@gmail.com
- LinkedIn: linkedin.com/in/theatashaikh
- Phone: +91 7385206750
- GitHub: github.com/theatashaikh
I'm always excited to collaborate on data science, machine learning, or open-source projects. Feel free to reach out to discuss ideas or opportunities!
- Exploring advancements in machine learning and data visualization
- Contributing to open-source projects
- Traveling and capturing moments through photography
- Reading (especially books on communication, business, and technology)
- Fitness and staying active
Last updated: May 2025
