BIG DATA: Volume architecture

Duration:

38h

Difficulty:

4/5

Price:

1495€

Prerequisites:

Mastery of Python and advanced programming.

Database management.

Skills acquired at the end of the course:

Load data and process it in HDFS.

Transform this data with Hadoop Streaming or PySpark.

Optimize queries on structured data in Apache Hive.

Train Machine Learning algorithms on a cluster of machines with PySpark.

The curriculum:

Introduction to Apache Hadoop (15h)

Introduction to PySpark (20h)

Introduction to Apache Hive (10h)

Upcoming dates:

Bootcamp format

October 6

November 9

December 9

Continuous format

October 22

November 30

Would you like to build a tailor-made course adapted to your needs?

A member of our team can help you!