Classify Clustering
Classify Clustering
What is classification?
• The term "classification" is usually used when there are exactly two
target classes called binary classification.
• When more than two classes may be predicted, specifically in pattern
recognition problems, this is often referred to as multinomial
classification.
• However, multinomial classification is also used for categorical
response data, where one wants to predict which category amongst
several categories has the instances with the highest probability.
• Classification is one of the most important tasks in data mining.
• It refers to a process of assigning pre-defined class labels to instances
based on their attributes.
• There is a similarity between classification and clustering, it looks
similar, but it is different.
• The major difference between classification and clustering is that
classification includes the levelling of items according to their
membership in pre-defined groups.
• Let's understand this concept with the help of an example;
• suppose you are using a self-organizing map neural network
algorithm for image recognition where there are 10 different kinds of
objects. If you label each image with one of these 10 classes, the
classification task is solved.
• On the other hand, clustering does not involve any labelling.
• Assume that you are given an image database of 10 objects and no
class labels.
• Using a clustering algorithm to find groups of similar-looking images
will result in determining clusters without object labels.
Classification of data mining
• These are given some of the important data mining classification
methods:
• Logistic Regression Method
• The logistic Regression Method is used to predict the response
variable.
• K-Nearest Neighbors Method
• K-Nearest Neighbors Method is used to classify the datasets into what
is known as a K observation. It is used to determine the similarities
between the neighbours.
• Naive Bayes Method
The Naive Bayes method is used to scan the set of data and locate the
records wherein the predictor values are equal.
• Neural Networks Method
The Neural Networks resemble the structure of our brain called the
Neuron. The sets of data pass through these networks and finally come
out as output. This neural network method compares the different
classifications. Errors that occur in the classifications are further
rectified and are fed into the networks. This is a recurring process.
What is clustering?
• Clustering refers to a technique of grouping objects so that objects with the same
functionalities come together and objects with different functionalities go apart.
• In other words, we can say that clustering is a process of portioning a data set into a
set of meaningful subclasses, known as clusters.
• Image segmentation