Skip to content
View Hamza-Bouali's full-sized avatar
🎯
Focus
🎯
Focus

Highlights

  • Pro

Organizations

@CODE-ESI-CLUB

Block or report Hamza-Bouali

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Hamza-Bouali/README.md

Hamza Bouali

Data Engineer ETL Specialist Cloud Data Solutions

πŸ‘¨β€πŸ’» About Me

Data Engineer with hands-on expertise in designing and implementing scalable ETL/ELT pipelines, data warehousing, and cloud infrastructure. I specialize in transforming raw data into reliable, well-architected systems that power data-driven decision making.

  • πŸ”­ Currently working on building robust data pipelines, data warehouse optimization, and cloud-native data solutions
  • 🌱 Experienced with Apache Airflow, Apache Kafka, Spark, and modern data orchestration tools
  • πŸ’‘ Passionate about data quality, performance optimization, and scalable infrastructure design
  • πŸš€ Proven track record building ETL solutions that handle complex multi-source data ingestion and transformation
  • 🎯 Seeking Data Engineering internship (July 2025) to expand expertise in large-scale data systems

πŸŽ“ Education

Data Science & Knowledge Engineering β€” ESI (Γ‰cole des Sciences de l'Information), Kenitra, Morocco (2023 - 2026)

  • Curriculum: Big Data, Cloud Computing, Data Engineering, DevOps, Software Engineering, Machine Learning
  • Applied research in data systems and national hackathons

Preparatory Classes (CPGE) — OMAR IBN AL KHATAB, Meknès, Morocco (2021 - 2023)

  • Mathematics, Physics, and logical reasoning with focus on engineering preparation

Baccalaureate in Mathematical Sciences — Fès, Morocco (2021)

πŸ’Ό Professional Experience

Data Analyst Intern at Decathlon, Casablanca, Morocco (June - August 2025)

  • Built ETL pipelines for multi-source internal data extraction and transformation
  • Designed and deployed dynamic dashboards for store performance analysis and strategic insights
  • Contributed to decision-making for new store opening in SalΓ© (2025)

Database Administration & Backend Developer at OBG Incub, Rabat, Morocco (January - May 2025)

  • Developed scalable backend services and data pipelines using Django and FastAPI
  • Designed and optimized database schemas for improved query performance
  • Implemented RAG pipeline backend logic with vector database integration (Qdrant)
  • Deployed containerized services on AWS ECS with CI/CD automation using GitHub Actions
  • Collaborated with DevOps team on infrastructure and monitoring solutions

Data Engineer Intern at AILAND, Rabat, Morocco (July 2024)

  • Cleaned, transformed, and analyzed social media data at scale
  • Implemented NLP model fine-tuning for localized data processing
  • Optimized data pipeline models improving processing efficiency by 40%

πŸ› οΈ Technical Skills

Data Engineering & Pipeline Development

Python Apache Airflow Apache Kafka PySpark ETL/ELT Kestra SQL Advanced Pandas

Data Warehousing & Analytics

Data Warehouse Design Power BI Tableau SSIS Streamlit

Databases & Data Storage

PostgreSQL MySQL MongoDB SQL Server Redshift Azure Data Warehouse Neo4j

Cloud & Infrastructure

AWS Docker Docker Compose AWS ECS CI/CD Linux Bash

Data Quality & Monitoring

Data Quality Assurance Monitoring & Logging MLflow Git

Backend Development

Django FastAPI Flask REST APIs JavaScript TypeScript

πŸš€ Featured Projects

Banking BI System & Data Warehouse

Enterprise data warehouse and analytics platform for banking operations

  • Tech Stack: SQL Server, SSIS, Power BI, Python
  • Key Achievements: Designed and implemented star schema data warehouse, built ETL processes for complex banking data transformation, created interactive KPI dashboards, optimized DirectQuery connections for real-time analytics
  • Impact: Enabled business intelligence and predictive insights for strategic decision-making

MemorAI

Intelligent data management system with multi-modal support

  • Tech Stack: Django REST, AWS (S3, DynamoDB, Lambda, EC2), Qdrant Vector Database, GitHub Actions
  • Data Engineering Focus: Designed scalable data pipelines for ingestion and storage, implemented semantic search with vector indexing, built robust data persistence layer on cloud infrastructure
  • Features: Multi-turn context persistence, semantic search, RAG-based retrieval

Call center operations platform with data processing capabilities

  • Tech Stack: Python, Django, data pipeline architecture
  • Features: Real-time data processing, analytics infrastructure

Healthcare platform with data management systems

  • Tech Stack: Django, PostgreSQL, REST APIs
  • Data Features: Patient records management, secure data handling

ML & Deep Learning Models From Scratch

Mathematical implementation of machine learning algorithms

  • Built 6 ML models: Linear Regression, Logistic Regression, KNN, Decision Trees, Random Forest, SVM
  • Implemented 4 Deep Learning architectures: MLP, CNN, RNN, Autoencoder
  • Complete backpropagation and optimization algorithms

πŸ“œ Professional Certifications

Data Engineer Associate Python Data Associate SQL Associate Supervised Machine Learning

πŸ† Awards & Recognition

  • 2nd Place β€” Hackathon MDFDS (Code ESI Club)
  • 3rd Place β€” EAIC Data Competition
  • Top 25 out of 400 β€” Think AI (2nd Edition)
  • 20th Place β€” MCPC - Moroccan Competitive Programming Championship

πŸ‘₯ Leadership & Volunteering

Co-Head of Competitive Programming Cell at CODE-ESI, Rabat (September 2024 - June 2025)

  • Mentoring engineers in algorithmic problem-solving

Treasurer at JCMP-ESI, Rabat (September - December 2024)

  • Managed finances and budgeting

Sponsorship & Event Committee Member for Moroccan Days of Future Data Scientists (May 2024 - Present)

  • Coordinated partnerships for data science events

🌐 Languages

English French Arabic

πŸ’‘ Core Competencies

Pipeline Architecture: ETL/ELT design and optimization, data orchestration, workflow automation, scheduling and dependency management

Data Warehousing: Schema design (star/snowflake), dimensional modeling, fact/dimension tables, slowly changing dimensions, query optimization

Data Processing: Distributed processing with Spark, stream processing with Kafka, batch processing optimization, data transformation logic

Database Management: Schema design, query optimization, indexing strategies, performance tuning, backup/recovery

Cloud Infrastructure: AWS ecosystem (EC2, S3, RDS, Lambda), containerization with Docker, infrastructure as code, cost optimization

Data Quality: Validation frameworks, anomaly detection, data profiling, pipeline monitoring, error handling

πŸ“ˆ GitHub Stats

GitHub Stats GitHub Streak

🀝 Let's Connect

Portfolio LinkedIn Email Phone


Open to collaboration on Data Engineering, ETL, and Cloud Data Solutions

Last Updated: January 2026

### πŸ’Ό Open to Collaboration and New Opportunities!

I'm always eager to work on innovative projects, contribute to open-source, and expand my expertise. If you're looking for a dedicated professional who combines data engineering with full-stack development skills, let's connect!

πŸ” Actively seeking a 2-month Data Engineering/MLOps internship starting July 2025

Popular repositories Loading

  1. Antaeus Antaeus Public

    Jupyter Notebook 1

  2. plant-detection-v0 plant-detection-v0 Public

    TypeScript 1

  3. Hamza-Bouali Hamza-Bouali Public

    Config files for my GitHub profile.

    TypeScript

  4. Student-Jobs Student-Jobs Public

    For FLA entreprenship competition

    HTML

  5. pharmaflow pharmaflow Public

    JavaScript

  6. 4gitpod 4gitpod Public

    Jupyter Notebook