Skip to content

Latest commit

 

History

History
9 lines (9 loc) · 545 Bytes

File metadata and controls

9 lines (9 loc) · 545 Bytes

DistributedComputingExercises

The exercises performed in the following notebooks were a part of the Cloud Computing course at Eurecom. The Distributed computing framework used is Apache Spark and the python library used for this purpose is PySpark. Please note that the main point of these notebooks is not machine learning in itself, but the use of distributed computing for machine learning problems. The libraries used in the notebooks are:

PySpark

SparkSQL

Matplotlib

Numpy

Pandas

Scipy

Scikit-learn