Skip to content

zeeshan1122334455/Data_Engineering_tools

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 

Repository files navigation

Data Engineering Tools for Machine Learning

Learn data engineering fundamentals by constructing a modern data stack for analytics and machine learning applications. We'll also learn how to orchestrate our data workflows and programmatically execute tasks to prepare our high quality data for downstream consumers (analytics, ML, etc.)

     

👉  This repository contains the list of all modern tools for Data Engineering

   

Tools

  • Apache Spark - Unified engine for large-scale data analytics
  • SnowFlake - A tool for Cloud Data Warehousing
  • Apache Flink - Apache Flink is a framework and distributed processing engine for stateful computations over unbounded and bounded data streams.
  • Apache Hadoop - The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models
  • MySQL - MySQL is an open-source relational database management system.
  • Add more tools...................

Rules for Contribution

  • Fork the repo
  • Make a seperate branch like data-feature
  • Update Readme.md and Commit Changes
  • Make PR for approval to main branch

Hacktoberfest Approved Repo:

This repo is part of Opensource project for Hacktoberfest2023.Please make pull requests to participate in this repo

Releases

No releases published

Packages

 
 
 

Contributors