Datasets: Top 5 places to find quality datasets

Getting started and training in data today requires solid mathematical skills and the study of a number of Machine Learning and Deep Learning algorithms. To understand them and observe their performance, you’ll often need to practice on quality datasets. It’s not always easy to find one. You may have the opportunity to work […]

Data Exploration: definition, tools, training…

Data exploration is the first step in data analysis. Find out everything you need to know about it, and how to acquire the necessary skills, thanks to DataScientest training courses. Data analysis is a process that can be broken down into several stages. Data Exploration is the first of these steps. It consists of exploring […]

AWS EC2: How to use Amazon’s tool

One of the first services offered by AWS, EC2 was launched in 2006, and there’s no doubt that AWS has made its mark on the cloud market thanks to it. But what does this service do? What are its use cases? That’s what we’re going to look at in this article. What is the AWS […]

OLAP: everything you need to know about this database organization

This article deals with a form of database organization known as OLAP. Before you start reading it, and to make sure you fully grasp its contents, you may be interested to read our article on databases. This article will explain how databases work, their architecture and the different programming languages that govern them. Databases are […]

Wordcloud: What’s it all about?

The word cloud is a visualization tool that allows you to quickly see which words are the most frequent in a text or text corpus. In this article, we’ll take a look at how it works. Why create a wordcloud? The principle of the word cloud is as follows: Within a text, we calculate the […]

Talend, the tool of choice for integrating your data

Talend is a software publisher specializing in data integration. It has been named a leader in the Gartner 2021 Magic Quadrant for Data Quality Solutions. Talend provides software and services for data integration and management, enterprise application integration, data quality optimization, cloud storage and Big Data. Launched in 2006 by Bertrand Diard and Fabrice Bonan, […]

Python programming for beginners – Episode 2

Table of Contents: Episode 1: Introduction, Variables, Types. Episode 2: Operators, Loops, Functions. Episode 3: Data Import, Data Cleaning, Data Processing. Episode 4: Importing Python Libraries, Importing Data, Handling Missing Values. Python Cheat Sheet. Welcome to the second episode in our series of introductions to Python programming. […]

How do I insert columns and rows in Excel?

When you use Microsoft Excel for your business, you often need to reorganize your spreadsheet. To do so, you need to know how to insert and delete rows and columns. Understanding how to manipulate this tool is essential for reorganizing the table and the data. In this article, you’ll learn how to insert an extra […]

Docker Tutorial: how to create your first container?

Docker is a tool for creating isolated environments called containers. The aim is to isolate an application in a particular environment so that it can be run on any OS (Windows, Linux, Mac, etc.) without compatibility problems. How does this differ from a virtual machine? A VM will contain an operating system in its […]

Demystifying Derivatives: Understanding the Core of Calculus

The notion of the derivative is widely used in data science, particularly for machine learning models. Beyond what you’ve probably seen in high school, the applications of this mathematical tool are much broader, and we’ll come back to them at the end of this article. In this article, you’ll first learn how to calculate the derivative of […]