BIG DATA: Volume architecture
Mastery of Python and advanced programming.
Skills acquired at the end of the course:
Load data and process it in HDFS.
Transform this data with Hadoop Streaming or PySpark.
Optimize queries on structured data in Apache Hive.
Train Machine Learning algorithms on a cluster of machines with PySpark.
Introduction to Apache Hadoop (15h)
Introduction to Pyspark (20h)
Introduction to Apache Hive (10h)
Les prochaines dates :
You wish to build a tailor-made course adapted to your needs?
A member of our team can help you!