Unsupervised learning: principle and use

21 Jan 2024

min read

Machine Learning

Melanie

As far as future prospects are concerned, many hopes are pinned on unsupervised learning to improve cyber security or the identification of various diseases.

In terms of future prospects, unlike supervised learning, unsupervised learning is where the algorithm has to operate on unannotated examples. In this case, machine learning is entirely independent. The machine is fed data without being provided with examples of results.

Thus, in this learning situation, the answers we want to find are not present in the data provided: the algorithm uses unlabeled data. The machine is therefore expected to create the answers itself, through various analyses and classification of the data.

Unsupervised learning models are used in particular for :

Data classification
Approximate calculation of distribution density
Dimension reduction

In this context, all the data collected are treated as random variables. Unlike machine learning, which must find a model from labeled data: f(X) Y, it uses only unlabeled data: there is no Y variable to predict.

The use of unsupervised learning can be grouped into clustering and association problems.

Unsupervised Learning & Clustering

A clustering problem is one in which the machine is expected to group together objects present in groups of data in the most accurate and efficient way possible. This technique, although sometimes difficult for humans to understand, is widely used in marketing to place different customers in groups, for example. An example of an algorithm often used in clustering is K-means.

Unsupervised Learning & Association

The association system is used to sort and group data that can be linked by certain characteristics. The aim is to find objects that are related to each other, but not identical. For example, if the algorithm is provided with numerous images of cats and cat accessories, then the unsupervised learning algorithm would not group all the cats together, but rather, for example, a ball of wool with a cat. An example of an algorithm often used in association is the A-priori algorithm.

Unsupervised learning is very often used in the field of speech recognition, as in the use of Siri or Alexa.

The latter is used to learn the phone owner’s vocal characteristics (language, tone of voice, etc.).

Similarly, some cell phones use it to automatically arrange photos. In fact, the phone is able to identify the same person in photos, or find similar places and arrange them according to these criteria.

How does Unsupervised Learning differ from supervised learning?

Although both types of learning are part of artificial intelligence, supervised learning involves learning a prediction function from labelled examples.

Supervised learning involves monitoring the machine’s learning by presenting it with examples of what it should do. Thus, the aim of supervised learning is to create algorithms capable of receiving data sets and performing statistical analysis to predict a result.

Aspect	Supervised Learning	Unsupervised Learning
Training Data	Labeled data with input-output pairs.	Unlabeled data; no explicit output labels.
Objective	Predict a target variable based on input.	Discover patterns, structures, or clusters in data.
Task Examples	Classification, Regression.	Clustering, Dimensionality Reduction.
Guidance	Model is guided by labeled examples.	No specific guidance from labeled data.
Training Process	Requires labeled data for training.	Learning from inherent data structures.
Performance Evaluation	Accuracy, Precision, Recall, F1 Score.	Silhouette Score, Davies-Bouldin Index.
Examples	Spam detection, Image recognition.	Customer segmentation, Anomaly detection.
Model Complexity	Often more complex models.	Simpler models, focused on data patterns.
Use of Validation Data	Used for tuning and validating models.	Limited use due to unsupervised nature.
Feedback Loop	Immediate feedback on correctness.	Less direct feedback due to lack of labels.
Common Algorithms	Linear Regression, Decision Trees, SVM.	K-Means, Hierarchical Clustering, PCA.

Finally, it’s worth pointing out that there are other types of learning, such as semi-supervised learning, which is a mix of supervised and unsupervised learning.

Would you like to discover all the fields of application of supervised and unsupervised learning? Learn how to exploit them? Discover our training courses!

DataScientest News

You are not available?

Leave us your e-mail, so that we can send you your new articles when they are published!

Data Analyst

Analytics Engineer

Data Scientist

AI / Machine Learning Engineer

Data Engineer

Cloud Engineer

DevOps Engineer

Data Marketing & AI

MLOps

ETL Developer

Data Ops Engineer

Amazon Web Services (AWS)

Microsoft Power BI

Overview

Bildungsgutschein

For Employees

Unsupervised learning: principle and use

As far as future prospects are concerned, many hopes are pinned on unsupervised learning to improve cyber security or the identification of various diseases.

Unsupervised Learning & Clustering

Unsupervised Learning & Association

How does Unsupervised Learning differ from supervised learning?

You are not available?

Related articles

How Marketing Data Shapes Buying Decisions

Crypto AI agents: How AI is revolutionizing cryptocurrencies?

Oracle Infrastructure Cloud Services: Storage, Computing, Networking…

AI Insights in Power BI: AI at the service of decision-making!

Data Analyst

Analytics Engineer

Data Scientist

AI / Machine Learning Engineer

Data Engineer

Cloud Engineer

DevOps Engineer

Data Marketing & AI

MLOps

ETL Developer

Data Ops Engineer

Amazon Web Services (AWS)

Microsoft Power BI

Overview

Bildungsgutschein

For Employees

Unsupervised learning: principle and use

As far as future prospects are concerned, many hopes are pinned on unsupervised learning to improve cyber security or the identification of various diseases.

Unsupervised Learning & Clustering

Unsupervised Learning & Association

How does Unsupervised Learning differ from supervised learning?

You are not available?

Related articles

How Marketing Data Shapes Buying Decisions

Crypto AI agents: How AI is revolutionizing cryptocurrencies?

Oracle Infrastructure Cloud Services: Storage, Computing, Networking…

AI Insights in Power BI: AI at the service of decision-making!

DataNews