ML

The document provides an overview of clustering in machine learning, explaining it as an unsupervised learning method that groups similar data points. It outlines various applications of clustering across fields such as marketing, biology, and finance, as well as methods like K-Means and Hierarchical Clustering. Additionally, it introduces association rule learning, highlighting its importance in discovering relationships between variables, particularly in market basket analysis.

Uploaded by

Satwik Saxena

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views28 pages

ML

Uploaded by

Satwik Saxena

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 28

❑ Machine learning Flowchart:-

Define Project Objectives Data Collection Data Preprocessing

Model Building Model Selection Data Visulalization

Train Model Test Model Improve Efficiency Deployment

Clustering in Machine Learning
Introduction to Clustering: It is basically a type of unsupervised learning method. An unsupervised learning
method is a method in which we draw references from datasets consisting of input data without labeled
responses. Generally, it is used as a process to find meaningful structure, explanatory underlying processes,
generative features, and groupings inherent in a set of examples.
Clustering is the task of dividing the population or data points into a number of groups such that data points in
the same groups are more similar to other data points in the same group and dissimilar to the data points in
other groups. It is basically a collection of objects on the basis of similarity and dissimilarity between them.
For example The data points in the graph below clustered together can be classified into one single group. We
can distinguish the clusters, and we can identify that there are 3 clusters in the below picture.
Applications of Clustering in different fields:
1.Marketing: It can be used to characterize & discover customer segments for marketing purposes.
2.Biology: It can be used for classification among different species of plants and animals.
3.Libraries: It is used in clustering different books on the basis of topics and information.
4.Insurance: It is used to acknowledge the customers, their policies and identifying the frauds.
5.City Planning: It is used to make groups of houses and to study their values based on their
geographical locations and other factors present.
6.Earthquake studies: By learning the earthquake-affected areas we can determine the dangerous
zones.
7.Image Processing: Clustering can be used to group similar images together, classify images based
on content, and identify patterns in image data.
8.Genetics: Clustering is used to group genes that have similar expression patterns and identify gene
networks that work together in biological processes.
9.Finance: Clustering is used to identify market segments based on customer behavior, identify
patterns in stock market data, and analyze risk in investment portfolios.
10.Customer Service: Clustering is used to group customer inquiries and complaints into categories,
identify common issues, and develop targeted solutions.
11.Traffic analysis: Clustering is used to group similar patterns of traffic data, such as peak hours,
routes, and speeds, which can help in improving transportation planning and infrastructure.
12.Social network analysis: Clustering is used to identify communities or groups within social
networks, which can help in understanding social behavior, influence, and trends.
13.Cybersecurity: Clustering is used to group similar patterns of network traffic or system
behavior, which can help in detecting and preventing cyberattacks.
14.Climate analysis: Clustering is used to group similar patterns of climate data, such as
temperature, precipitation, and wind, which can help in understanding climate change and its
impact on the environment.
15.Sports analysis: Clustering is used to group similar patterns of player or team performance
data, which can help in analyzing player or team strengths and weaknesses and making strategic
decisions.
16.Crime analysis: Clustering is used to group similar patterns of crime data, such as location,
time, and type, which can help in identifying crime hotspots, predicting future crime trends, and
improving crime prevention strategies.
Clustering Methods:
1.K-Means Clustering:
1. Divides the dataset into a predefined number of clusters (k) based on the mean of data
points in each cluster.
2.Hierarchical Clustering:
1. Builds a tree of clusters by either repeatedly merging smaller clusters (agglomerative) or
splitting larger clusters (divisive).
3.DBSCAN (Density-Based Spatial Clustering of Applications with Noise):
1. Groups together data points that are close to each other and have a sufficient number of
neighbors, while marking outliers as noise.
Association Rule Learning
Association rule learning is a type of unsupervised learning technique that checks for the
dependency of one data item on another data item and maps accordingly so that it can be
more profitable. It tries to find some interesting relations or associations among the
variables of dataset. It is based on different rules to discover the interesting relations
between variables in the database.
The association rule learning is one of the very important concepts of Machine Learning, and
it is employed in Market Basket analysis, Web usage mining, continuous production,
etc. Here market basket analysis is a technique used by the various big retailer to discover
the associations between items. We can understand it by taking an example of a
supermarket, as in a supermarket, all products that are purchased together are put together.
For example, if a customer buys bread, he most likely can also buy butter, eggs, or milk, so
these products are stored within a shelf or mostly nearby. Consider the below diagram:

Fundamentals of Data Science Unit 3
No ratings yet
Fundamentals of Data Science Unit 3
15 pages
Software Quality Concepts
No ratings yet
Software Quality Concepts
38 pages
Clustering
No ratings yet
Clustering
6 pages
Carron, Brawley
No ratings yet
Carron, Brawley
18 pages
Unsupervised Machine Learning
No ratings yet
Unsupervised Machine Learning
63 pages
Clustering in Machine Learning - Javatpoint
No ratings yet
Clustering in Machine Learning - Javatpoint
10 pages
Data Clustering Seminar
No ratings yet
Data Clustering Seminar
34 pages
3. Unit 3
No ratings yet
3. Unit 3
34 pages
Office of The Sangguniang Kabataan
No ratings yet
Office of The Sangguniang Kabataan
5 pages
Clustering
No ratings yet
Clustering
20 pages
Clustering Unit4
No ratings yet
Clustering Unit4
9 pages
Cbsyllabus Bda
No ratings yet
Cbsyllabus Bda
5 pages
Clustering
No ratings yet
Clustering
8 pages
Machine Learning Clustering AlgorithmsI
No ratings yet
Machine Learning Clustering AlgorithmsI
129 pages
Data Mining - UNIT-IV
No ratings yet
Data Mining - UNIT-IV
24 pages
Big Data Analytics
No ratings yet
Big Data Analytics
25 pages
Lecturer-1 Unit 3
No ratings yet
Lecturer-1 Unit 3
31 pages
FPA Unit 3
No ratings yet
FPA Unit 3
17 pages
Cluster Analysis
No ratings yet
Cluster Analysis
36 pages
Clustering New
No ratings yet
Clustering New
6 pages
The Origin of Paper
No ratings yet
The Origin of Paper
3 pages
Classify Clustering
No ratings yet
Classify Clustering
31 pages
Clustering in Machine Learning
No ratings yet
Clustering in Machine Learning
4 pages
Clustering U 5
No ratings yet
Clustering U 5
2 pages
ML Unit 4 Notes - NJ
No ratings yet
ML Unit 4 Notes - NJ
15 pages
Activity
No ratings yet
Activity
5 pages
Unit 2 ML
No ratings yet
Unit 2 ML
11 pages
1947 Benscoter, Stanley PDF
No ratings yet
1947 Benscoter, Stanley PDF
258 pages
A06-A Survey of Clustering Techniques
No ratings yet
A06-A Survey of Clustering Techniques
5 pages
ML Unit-3
No ratings yet
ML Unit-3
22 pages
Clustering: An Overview: Key Concepts Objective
No ratings yet
Clustering: An Overview: Key Concepts Objective
12 pages
Evolution of Media
100% (1)
Evolution of Media
8 pages
DW & DM Unit 4 Notes
No ratings yet
DW & DM Unit 4 Notes
40 pages
DWM PT 2 QB Soln
No ratings yet
DWM PT 2 QB Soln
8 pages
Clustering
No ratings yet
Clustering
57 pages
Unit-4 ML
No ratings yet
Unit-4 ML
16 pages
Unit 4
No ratings yet
Unit 4
62 pages
Unit 3 Unsupervised Learning Algorith
No ratings yet
Unit 3 Unsupervised Learning Algorith
15 pages
Cluster Analysis
No ratings yet
Cluster Analysis
18 pages
Clustering in Machine Learning
No ratings yet
Clustering in Machine Learning
7 pages
ML Unit 4 (Ab 22)
No ratings yet
ML Unit 4 (Ab 22)
39 pages
Data Mining Unit-4
No ratings yet
Data Mining Unit-4
15 pages
Screenshot 2024-05-17 at 3.30.05 PM
No ratings yet
Screenshot 2024-05-17 at 3.30.05 PM
31 pages
Sapera User
No ratings yet
Sapera User
109 pages
Clustering Notes
No ratings yet
Clustering Notes
17 pages
Unit 4
No ratings yet
Unit 4
106 pages
Unit 5
No ratings yet
Unit 5
66 pages
ML Unit-Iii
No ratings yet
ML Unit-Iii
18 pages
DM Unit 5
No ratings yet
DM Unit 5
15 pages
Unit - 4 (ML)
No ratings yet
Unit - 4 (ML)
13 pages
Final ML Unit3 May24
No ratings yet
Final ML Unit3 May24
154 pages
AIML Mod 5
No ratings yet
AIML Mod 5
39 pages
Module 5
No ratings yet
Module 5
91 pages
K-Means Clustering Algorithm Based On E-Commerce B
No ratings yet
K-Means Clustering Algorithm Based On E-Commerce B
6 pages
Machine Learning Note Modul 4 5
No ratings yet
Machine Learning Note Modul 4 5
20 pages
Clustering in Machine Learning
No ratings yet
Clustering in Machine Learning
21 pages
Ericsson Supply Chain
No ratings yet
Ericsson Supply Chain
178 pages
Unit III Clustering
No ratings yet
Unit III Clustering
47 pages
Git Basics
No ratings yet
Git Basics
75 pages
Artificial Intelligence Lec 5
No ratings yet
Artificial Intelligence Lec 5
20 pages
Unit 5
No ratings yet
Unit 5
5 pages
ML Unsupervised
No ratings yet
ML Unsupervised
35 pages
CLUSTER ANALYSIS Unit 3 Data Mining
No ratings yet
CLUSTER ANALYSIS Unit 3 Data Mining
84 pages
Unit 15
No ratings yet
Unit 15
26 pages
Unsupervised Learning: Niveditha. GH
No ratings yet
Unsupervised Learning: Niveditha. GH
10 pages
Machine Learning4
No ratings yet
Machine Learning4
39 pages
M. Ed #RD Teacher Education - I
No ratings yet
M. Ed #RD Teacher Education - I
78 pages
L10
No ratings yet
L10
55 pages
Data Mining Ii Sol
No ratings yet
Data Mining Ii Sol
106 pages
Imagen Turbo-Compresor Solar
No ratings yet
Imagen Turbo-Compresor Solar
2 pages
Find List of Oyo in Hyderabad Near Me - Justdial
No ratings yet
Find List of Oyo in Hyderabad Near Me - Justdial
46 pages
Visa Cashless Cities Report
No ratings yet
Visa Cashless Cities Report
68 pages
ML Unit4
No ratings yet
ML Unit4
19 pages
USAID - BHA RFSA M&E Technical Guidance May 2023
No ratings yet
USAID - BHA RFSA M&E Technical Guidance May 2023
143 pages
Getting - Started With Cisco Intersight
No ratings yet
Getting - Started With Cisco Intersight
12 pages
Ee8 3rdquarter Modules
No ratings yet
Ee8 3rdquarter Modules
9 pages
Computer Science 2
No ratings yet
Computer Science 2
24 pages
3is Activity Sheets Quarter 1
No ratings yet
3is Activity Sheets Quarter 1
17 pages
N - Channel Enhancement Mode " Single Feature Size " Power Mosfet
No ratings yet
N - Channel Enhancement Mode " Single Feature Size " Power Mosfet
9 pages
V1 N2 1980 Rabenhorst
No ratings yet
V1 N2 1980 Rabenhorst
6 pages
BROCHURE
No ratings yet
BROCHURE
8 pages
BCSL 63 Solved Assignment
No ratings yet
BCSL 63 Solved Assignment
10 pages
800 Hotmail Valid by Megalodon
No ratings yet
800 Hotmail Valid by Megalodon
15 pages
Remote Sensing - Detecting Moving Trucks On Roads Using Sentinel-2 Data
No ratings yet
Remote Sensing - Detecting Moving Trucks On Roads Using Sentinel-2 Data
28 pages
Agip GR SLL 00
No ratings yet
Agip GR SLL 00
1 page
Erp Manager
No ratings yet
Erp Manager
2 pages
DLL Ict 10
100% (1)
DLL Ict 10
3 pages
Overlay
No ratings yet
Overlay
3 pages
Computer Science: Basic Computer Organisation: Description of A Computer System
No ratings yet
Computer Science: Basic Computer Organisation: Description of A Computer System
5 pages

ML

Uploaded by

ML

Uploaded by

❑ Machine learning Flowchart:-

Define Project Objectives Data Collection Data Preprocessing

Model Building Model Selection Data Visulalization

Train Model Test Model Improve Efficiency Deployment

You might also like