Big Data / Database
Duration:
25h
Difficulty:
3.5/5
Price:
1295€

Prerequisite:
Mastery of Python.
Skills acquired at the end of the course:
Read and query relational databases
Master the syntax of SQL queries
Training in the processing of massive datasets using distributed computing
Powerfully apply Machine Learning models to large databases
The curriculum:
Introduction to Data Engineering and Big Data
SQL language, introduction to advanced concepts
Data Processing and Machine Learning on large databases with PySpark
- Introduction to PySpark
- PySpark: Data Processing
- PySpark: DataFrames
- Regression with PySpark
- PySpark: ML Pipelines
- PySpark: Model Tuning
Optional
- Appropriation of PyMongo
Dates
Format Bootcamp
6 octobre
9 novembre
9 décembre
Format Continue
22 octobre
30 novembre
You wish to build a tailor-made course adapted to your needs?
A member of our team can help you!