U5 unsupervised learning
Unsupervised Machine Learning
K. D. Polytechnic, Patan
CO5: Apply unsupervised learning algorithms based on dataset characteristics
5.1 Introduction to Unsupervised Learning
◦ Brief explanation of unsupervised machine learning
◦ Need for unsupervised learning
◦ Working of unsupervised learning
◦ Real-world examples of unsupervised learning
◦ List of unsupervised learning algorithms
In unsupervised learning, the objective is to take a dataset as input and find natural groupings or patterns within its data elements or records.
Unsupervised learning is therefore often termed a descriptive model, and the process of unsupervised learning is called pattern discovery or knowledge discovery.
Need for Unsupervised Learning
Exploratory Data Analysis (EDA): Helps us understand the underlying structure of data without any predefined labels. We gain insight into the data distribution, which supports assessing data quality, identifying outliers, and making informed decisions.
Clustering: Clustering algorithms group similar data points together based on their features, e.g. customer segmentation, image segmentation, and anomaly detection.
Dimensionality Reduction: Techniques like Principal Component Analysis (PCA) reduce the number of features while preserving the essential information.
Recommendation Systems: Helps build personalized recommendations. Collaborative filtering and matrix factorization are common techniques, e.g. movie recommendations (Netflix) and product recommendations (Amazon).
Feature Engineering: Unsupervised learning can create new features from existing ones.
Data Preprocessing: Imputing missing values, scaling features, and handling outliers. Unsupervised methods help prepare data for subsequent modeling.
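The PCA technique mentioned above can be sketched in a few lines. This is a minimal illustration, assuming NumPy is available; the 3-feature sample data is made up, and real projects would typically use a library implementation such as scikit-learn's PCA.

```python
# Minimal PCA sketch: project 3-D points onto the 2 directions of
# highest variance, reducing the number of features from 3 to 2.
import numpy as np

def pca(X, n_components):
    # Center the data so the principal axes pass through the mean.
    Xc = X - X.mean(axis=0)
    # Eigen-decompose the covariance matrix; eigh returns eigenvalues
    # in ascending order, so reverse to take the top components.
    cov = np.cov(Xc, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)
    components = eigvecs[:, ::-1][:, :n_components]
    # Project the centered data onto the principal components.
    return Xc @ components

X = np.array([[2.5, 2.4, 0.1],
              [0.5, 0.7, 0.0],
              [2.2, 2.9, 0.2],
              [1.9, 2.2, 0.1],
              [3.1, 3.0, 0.3]])
Z = pca(X, n_components=2)  # 5 samples, reduced from 3 features to 2
```

The projected data keeps the directions along which the samples vary most, which is why PCA preserves the "essential information" while dropping features.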
Applications of Unsupervised Learning
❖Segmentation of target consumer populations by an advertisement consulting agency on the basis of a few dimensions such as demography, financial data, and purchasing habits, so that advertisers can reach their target consumers efficiently.
❖Anomaly or fraud detection in the banking sector by identifying the pattern of loan defaulters.
❖Image processing and image segmentation, such as face recognition and expression identification.
❖Utilization by data scientists to reduce the dimensionality of sample data to simplify modeling.
➢Clustering algorithms split data into natural groups by finding similar structures or patterns in uncategorized data.
➢Partition method: Data is grouped in a way where a single data point can only exist in one cluster. This is also referred to as
“hard” clustering. A common example of exclusive clustering is the K-means clustering algorithm, which partitions data
points into a user-defined number K of clusters.
➢Density-based method: Finds clusters as dense regions of data points separated by sparser regions (e.g., DBSCAN).
➢Hierarchical clustering: Data points are organized into a tree of nested clusters, typically by repeatedly merging the most similar clusters (agglomerative) or repeatedly splitting larger ones (divisive).
➢Probabilistic clustering: Data is grouped into clusters based on the probability of each data point belonging to each cluster.
This approach differs from the other methods, which group data points based on their similarities to others in a cluster.
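The K-means partition method described above can be sketched in plain Python. This is a minimal illustration, not a production implementation: the sample points and K = 2 are made-up values, and the initial centroids are simply the first K points (real implementations use randomized or smarter initialization).

```python
# Minimal K-means sketch illustrating "hard" (partition) clustering:
# every point belongs to exactly one cluster.

def kmeans(points, k, iters=10):
    # Illustrative initialization: first k points as starting centroids.
    centroids = list(points[:k])
    clusters = [[] for _ in range(k)]
    for _ in range(iters):
        # Assignment step: each point joins its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            idx = min(range(k),
                      key=lambda i: (p[0] - centroids[i][0]) ** 2
                                  + (p[1] - centroids[i][1]) ** 2)
            clusters[idx].append(p)
        # Update step: move each centroid to the mean of its cluster.
        for i, c in enumerate(clusters):
            if c:
                centroids[i] = (sum(p[0] for p in c) / len(c),
                                sum(p[1] for p in c) / len(c))
    return centroids, clusters

# Two well-separated groups of 2-D points (made-up data).
points = [(1, 1), (8, 8), (1.5, 2), (9, 9), (1, 0.5), (8.5, 7.5)]
centroids, clusters = kmeans(points, k=2)
```

Because each point is assigned to exactly one cluster in the assignment step, this is the "exclusive" or hard clustering behaviour the partition method describes, in contrast to the probabilistic approach above.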
Applications of Clustering
Market Segmentation: Companies use clustering to group customers based on purchasing behavior, demographics,
and engagement levels. Segmented groups allow targeted marketing strategies and personalized recommendations.
Social Network Analysis: Clustering helps identify communities or groups within social networks. It reveals patterns of
connections, influencers, and subgroups.
Search Result Grouping: Search engines use clustering to group similar search results. Users benefit from organized
and relevant search results.
Medical Imaging: Clustering helps segment medical images (e.g., MRI, CT scans). It aids in identifying tumors,
lesions, or other anomalies.
Image Segmentation: In computer vision, clustering segments images into meaningful regions. Useful for object
detection, image recognition, and scene understanding.
Anomaly Detection: Clustering identifies unusual patterns or outliers. Examples: Fraud detection, network intrusion
detection.
Types of Unsupervised Learning: Association Analysis
•Association rule mining presents a methodology that is useful for identifying interesting relationships hidden in large data sets. It is also known as association analysis, and the discovered relationships can be represented in the form of association rules comprising sets of frequent items.
•A common application of this analysis is Market Basket Analysis, which retailers use for cross-selling their products.
•Association analysis focuses on identifying associations between data elements: it uncovers how items are associated with each other and which items appear together in a transaction or relation.
•It is widely used by retailers, grocery stores, and online marketplaces that maintain large transactional databases.
Association analysis: methods
•Common Algorithm: Apriori is a well-known algorithm for association rule learning.
•Itemset: One or more items are grouped together and are surrounded by brackets to indicate that
they form a set, or more specifically, an itemset that appears in the data with some regularity.
•Support Count: Denotes the number of transactions in which a particular itemset is present. This is a very important property of an itemset, as it denotes the frequency of occurrence of the itemset. For example, if the itemset {Bread, Milk, Egg} occurs together in 3 transactions, it has a support count of 3.
Association rule
The result of the market basket analysis is expressed as a set of association rules that specify
patterns of relationships among items.
Support and confidence are two concepts for measuring the strength of an association rule.
Support denotes how often a rule is applicable to a given data set; a low support may indicate that the rule has occurred by chance.
For a rule X→Y, confidence indicates how often the items in Y appear in the transactions that contain X. Confidence denotes the predictive power or accuracy of the rule, and thus provides a measure of the reliability of the inference made by the rule.
C({Bread, Milk}→{Egg}) = S({Bread, Milk, Egg})/S({Bread, Milk}) = 3/4 = 0.75
Note that confidence is not symmetric: C({Bread, Milk}→{Egg}) ≠ C({Egg}→{Bread, Milk}), since C({Egg}→{Bread, Milk}) = 3/5 = 0.6.
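These support and confidence calculations can be checked with a small sketch in plain Python. The six transactions below are hypothetical, constructed so that the support counts match the figures in the text: S({Bread, Milk, Egg}) = 3, S({Bread, Milk}) = 4, and S({Egg}) = 5.

```python
# Transactions as sets of items (made-up data matching the counts above).
transactions = [
    {"Bread", "Milk", "Egg"},
    {"Bread", "Milk", "Egg"},
    {"Bread", "Milk", "Egg"},
    {"Bread", "Milk"},
    {"Egg"},
    {"Egg"},
]

def support_count(itemset, transactions):
    # Number of transactions that contain every item of the itemset.
    return sum(1 for t in transactions if itemset <= t)

def confidence(X, Y, transactions):
    # C(X -> Y) = S(X union Y) / S(X)
    return support_count(X | Y, transactions) / support_count(X, transactions)

c1 = confidence({"Bread", "Milk"}, {"Egg"}, transactions)  # 3/4 = 0.75
c2 = confidence({"Egg"}, {"Bread", "Milk"}, transactions)  # 3/5 = 0.6
```

Running this reproduces the asymmetry of confidence: the same itemset {Bread, Milk, Egg} supports both rules, but the denominators differ, giving 0.75 in one direction and 0.6 in the other.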
Limitations of Unsupervised Learning
No Ground Truth for Evaluation: Without labeled data, it is difficult to evaluate the performance of unsupervised models objectively.
Metrics like accuracy or precision are not applicable, making model assessment less straightforward.
Difficulty in Generalization: Unsupervised models may struggle to generalize well to unseen examples, since they learn only from the patterns present within the training data.