🚀 Think you’ve got what it takes for a career in Data? Find out in just one minute!

AutoML and Machine Learning automation: a threat to Data Scientists?

AutoML and Machine Learning automation: a threat to Data Scientists?

Do Machine Learning automation and AutoML tools pose a threat to Data Scientists? It’s a question that’s on the minds of more and more Data Science professionals, as well as aspiring Data Scientists worried about their future careers. For the time being, however, the complete automation of Data Science seems unlikely. To cope with the […]

Jupyter Notebook: An indispensable code-sharing tool

Jupyter Notebook

Jupyter Notebook is a web application that lets you create electronic notebooks capable of combining text, images, computer code or equations, all in the same document. In the interests of readability and document source code utilization, it’s best to run the application from the same interface, and see changes in real time. This is exactly […]

Data Literacy: why is it important for a company?

Data Literacy: why is it important for a company?

Data literacy is a hot topic in today’s society. Indeed, companies recognize a lack of awareness of the capabilities and importance of data within their teams. Data Literacy, which is the responsibility of the Chief Data Officer, must acculturate employees to data, and initiate the company’s transformation towards a data-centric orientation. According to a study […]

Beautiful Soup: Introduction to web scraping with Python

Beautiful Soup

As an Internet browser, we have access to a lot of information about customers, offers, share prices, physical phenomena and so on. This data can be read by users, but we’d like to be able to exploit it by transforming it into an operational format so that we can analyze it and make the most […]

Distributed Architecture: definition and relationship to Big Data

Distributed architecture

Distributed architectures are information systems that distribute and use available resources that are not located in the same place or on the same machine. In this article, we will explain in detail what these architectures are, their advantages over other architectures and how they are used in practice in Data Science. What is a distributed […]

Time series plot Python: The Tutorial

python temp

A time series is an array of data showing the evolution of a variable over time. In Python, this is often processed in the form of a Series Pandas indexed by a DateTime. This format makes for easy processing and visualization. Time series are used in many fields, such as astronomy and meteorology, but are […]

Bayesian optimization: Definition and operation

Bayesian optimization: Definition and operation

To define a predictive model, data scientists call on multiple observations. But while the study of these observations leads to an optimal result, data scientists often have little time to analyze all the hypotheses.So how do you find the right model in the shortest possible time? This is where Bayesian optimization comes in. What is […]

Power BI Mobile: Everything you need to know about the mobile version of Power BI

Power BI Mobile: everything you need to know about the mobile version of power bi

Power BI Mobile is the mobile version of Microsoft’s Business Intelligence platform. This application is available on iOS, Android and Windows smartphones and tablets. Find out everything you need to know about this tool, and how to learn to use it. In the age of Big Data, it has become imperative for companies to aggregate […]

ELIF Python: What you need to know about this function

The ELIF function in Python: What you need to know

Testing conditional expressions is the very basis of programming. While the if then else structure is inherent to most languages, Python manipulates another form of condition: if elif else… What is it exactly? If you’ve ever learned to write a computer program, and whatever your level, you’ll know that an essential element of almost any […]

Pearson and Spearman Correlations: A Guide to Understanding and Applying Correlation Methods

Pearson and Spearman correlations: Understanding everything

Correlation is a statistical tool used in Machine Learning to identify dependencies between several variables.There are several types of correlation. Find out more about the The Pearson correlation and Spearman Correlation below. For data analysis, a Data Scientist has several statistical tools at his disposal. One of these tools is correlation.Correlation is a particularly useful […]

Cypher: The language for working with data in graphs

Cypher: the language for manipulating data in graphs

For humans, visual comprehension is often easier than textual comprehension, as our brains are able to process visual information faster and more efficiently. That’s why we’ve seen the emergence of graphical databases using the Cypher language instead of traditional SQL. Visual databases offer a much faster and more intuitive way of modeling and querying data. […]

Google Search Console: The essential SEO tool

Google Search Console: the essential SEO tool

In this article, delve into the key features of Google Search Console, explore the latest additions to its arsenal and discover how this essential tool can propel your site to new heights. What is GSC? Google Search Console (launched in 2006 under the name Webmaster Tools) is one of the most useful tools provided free […]