Skip to content
View theatashaikh's full-sized avatar

Highlights

  • Pro

Block or report theatashaikh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
theatashaikh/README.md

Banner Image

πŸ‘‹ Welcome to My GitHub!

Hi, I'm Ata Sadruddin Shaikh, a passionate Data Scientist with over 3 years of experience at ITKhidma, building scalable data pipelines and delivering actionable insights. I specialize in Python, SQL, Microsoft Azure, and data visualization (Power BI, Tableau). This repository showcases my projects, skills, and contributions in data science, data engineering, and machine learning. πŸ”— Connect with me: LinkedIn | Email

πŸš€ About Me

πŸŽ“ Education:

  • Master of Science in Artificial Intelligence (2022–2024)
  • Bachelor of Science in Information Technology (2019–2022)

πŸ’Ό Current Role:

Data Science Consultant at ITKhidma (May 2022–Present), developing machine learning models and Azure-based data pipelines

🌟 Key Achievements:

  • Built an end-to-end Azure data pipeline processing 100,000+ records, improving efficiency by 30%
  • Created Power BI dashboards for 10+ clients, reducing reporting time by 40%
  • Developed predictive models, boosting marketing campaign effectiveness by 20%

πŸ› οΈ Technical Skills:

Python, SQL, PySpark, Pandas, NumPy, Power BI, Tableau, Microsoft Azure (Data Factory, Data Lake, Synapse, Databricks), HTML, CSS, JavaScript, Java, Git, GitHub

🀝 Soft Skills:

Problem Solving, Communication, Leadership, Analytical Thinking, Collaboration, Time Management

🎯 Interests:

Data Visualization, Machine Learning, Open-Source Contributions, Traveling, Photography, Reading

πŸ‘‹ Welcome to My GitHub!

Hi, I'm Ata Sadruddin Shaikh, a passionate Data Scientist with over 3 years of experience at ITKhidma, building scalable data pipelines and delivering actionable insights. I specialize in Python, SQL, Microsoft Azure, and data visualization (Power BI, Tableau). This repository showcases my projects, skills, and contributions in data science, data engineering, and machine learning.

πŸ”— Connect with me: LinkedIn | Email


πŸš€ About Me

  • πŸŽ“ Education: Master of Science in Artificial Intelligence (2022–2024) | Bachelor of Science in Information Technology (2019–2022)
  • πŸ’Ό Current Role: Data Scientist at ITKhidma (May 2022–Present), developing machine learning models and Azure-based data pipelines
  • 🌟 Key Achievements:
    • Built an end-to-end Azure data pipeline processing 100,000+ records, improving efficiency by 30%
    • Created Power BI dashboards for 100+ clients, reducing reporting time by 40%
    • Developed predictive models, boosting marketing campaign effectiveness by 20%
  • πŸ› οΈ Technical Skills: Python, SQL, PySpark, Pandas, NumPy, Power BI, Tableau, Microsoft Azure (Data Factory, Data Lake, Synapse, Databricks), HTML, CSS, JavaScript, Java, Git, GitHub
  • 🀝 Soft Skills: Problem Solving, Communication, Leadership, Analytical Thinking, Collaboration, Time Management
  • 🎯 Interests: Data Visualization, Machine Learning, Open-Source Contributions, Traveling, Photography, Reading

πŸ“‚ Featured Projects

  • Description: Engineered a scalable data pipeline using Azure Data Factory, Data Lake, and Databricks to process 100,000+ customer records from AdventureWorksLT2022.
  • Key Features:
    • Implemented Bronze-Silver-Gold architecture, optimizing query performance by 35%
    • Secured pipeline with Azure Key Vault and RBAC, eliminating credential exposure
    • Built Power BI dashboards for customer demographics, enabling 15% faster decision-making
  • Technologies: Azure (Data Factory, Data Lake, Databricks, Synapse), PySpark, SQL, Power BI
  • Metrics: Reduced data processing time by 30% for 100,000+ records
  • View Project Details

Customer Behavior Analysis and Exploratory Data Analysis

  • Description: Conducted EDA on 50,000+ bookstore transactions to identify purchase trends and segment customers.
  • Key Features:
    • Built predictive models to improve marketing campaigns by 20%
    • Visualized insights using Tableau, reducing reporting efforts by 25%
  • Technologies: SQL, Python (Pandas, NumPy), Tableau
  • Metrics: Analyzed 50,000+ transactions, enhanced campaign effectiveness by 20%

Advanced Data Analytics with SQL

  • Description: Optimized complex SQL queries and automated data cleaning for large datasets.
  • Key Features:
    • Improved data retrieval efficiency by 15%
    • Reduced preprocessing time by 30% with automated scripts
  • Technologies: SQL, Python
  • Metrics: Enhanced query performance for datasets with 10,000+ rows

πŸ† Certifications

  • Certified Data Scientist Analytics Specialist
  • Relational Databases
  • Python Programming Bootcamp
  • Career Essentials in Software Development by Microsoft
  • Career Essentials in Generative AI by Microsoft
  • Introduction to Git & GitHub by Google

πŸ“¬ Get in Touch

I’m always excited to collaborate on data science, machine learning, or open-source projects. Feel free to reach out to discuss ideas or opportunities!


🌟 Interests & Hobbies

  • Exploring advancements in machine learning and data visualization
  • Contributing to open-source projects
  • Traveling and capturing moments through photography
  • Reading (especially books on communication, business, and technology)
  • Fitness and staying active

Last updated: May 2025

Popular repositories Loading

  1. react-translate-app react-translate-app Public

    A web-application which can instantly translate almost all major languages spoken across the globe created using modern web development technologies such as react.

    JavaScript 2

  2. datawarehousesql datawarehousesql Public

    This repository provides a comprehensive guide for building production-ready data warehouses using SQL Server. It walks through the complete process of implementing a modern data warehouse using Me…

    TSQL 1

  3. Diabetes-Prediction Diabetes-Prediction Public

    Diabetes Prediction using Machine Learning, It was the project that I was working on for MeriSkill as an Intern.

    Jupyter Notebook

  4. codsoft codsoft Public

    All tasks that I completed as an Intern in CodSoft

    Jupyter Notebook

  5. qwiklabs-git-training qwiklabs-git-training Public

    This repository was created during the training session of Git and GitHub

    Python

  6. it-cert-automation-practice it-cert-automation-practice Public

    Forked from google/it-cert-automation-practice

    Google IT Automation with Python Professional Certificate - Practice files

    Python