Non-Parametric Classification: Pattern Recognition
Non-parametric classification
The assumption that the probability density function (pdf) that generated the data has some parametric form (e.g. Gaussian) may not hold in many cases.
Non-parametric classifiers do not assume any parametric form for the pdf.
Instead, classification is based on the similarity between data samples.
Given training samples (x1, y1), (x2, y2), (x3, y3), ..., (xn, yn) and a test sample xt:
Class(xt) = class of the nearest neighbour of xt
Problem Statement
Can we LEARN to recognise a rugby player?
Features
Rugby players = short + heavy?
[Scatter plot: height (130-190 cm) vs. weight (60-90 kg)]
Features
Ballet dancers = tall + skinny?
[Scatter plot: height (130-190 cm) vs. weight (60-90 kg)]
Feature Space
Rugby players cluster separately in the space.
[Scatter plot: weight vs. height, showing the two clusters]
Who's this?
Distance Measure
Euclidean distance
d = sqrt((w - w1)^2 + (h - h1)^2)
[Plot: points (w, h) and (w1, h1) in the weight-height plane]
Advantage: Surprisingly good classifier!
Disadvantage: Have to store the entire training set in
memory
Who's this?
Distance Measure
Euclidean distance still works in 3-d, 4-d, 5-d, etc.
d = sqrt((x - x1)^2 + (y - y1)^2 + (z - z1)^2)
x = Height
y = Weight
z = Shoe size
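The nearest-neighbour rule above can be sketched in a few lines. This is a minimal illustration, not the lecture's own code; the sample heights, weights and shoe sizes are made-up values chosen to match the rugby/ballet example.

```python
import math

def euclidean(a, b):
    # d = sqrt(sum_k (a_k - b_k)^2); works in 2-d, 3-d, 4-d, etc.
    return math.sqrt(sum((ak - bk) ** 2 for ak, bk in zip(a, b)))

def nearest_neighbour(train, query):
    # train: list of (feature_vector, label); return the label of the
    # training sample closest to the query point
    _, label = min(train, key=lambda s: euclidean(s[0], query))
    return label

# Hypothetical (height cm, weight kg, shoe size) samples
train = [((190, 110, 46), "rugby"), ((185, 105, 45), "rugby"),
         ((175, 55, 39), "ballet"), ((168, 50, 38), "ballet")]
print(nearest_neighbour(train, (188, 100, 44)))  # -> rugby
```

Note the disadvantage mentioned earlier: `train` (the entire training set) must stay in memory for every query.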
Over-fitting
An Important Concept in Pattern Recognition
Over-fitting
Looks good so far
Oh no! Mistakes!
What happened?
Over-fitting
While an overly complex model may allow perfect
classification of the training samples, it is unlikely to give
good classification of novel patterns
Features
Choosing the wrong features makes classification difficult.
Too many features make it computationally intensive.
Possible features:
- Shoe size
- Height
- Age
- Weight
[Scatter plot: age vs. shoe size]
PR Problem
Now, how is this problem like handwriting recognition?
[Scatter plot: weight vs. height]
Handwriting Recognition
Let's say the axes now represent pixel values.
A two-pixel image
[Plot: pixel 1 value vs. pixel 2 value, each axis 0-255; the image is the point (190, 85)]
Handwriting Recognition
A three-pixel image
[3-D plot: pixel 1, pixel 2 and pixel 3 axes]
Handwriting Recognition
Distances between images
A three-pixel image
[3-D plot: two images as points, with the distance between them]
Handwriting Recognition
A four-pixel image.
Handwriting Recognition
16 x 16 image. How many dimensions?
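A 16 x 16 image is a point in 256-dimensional space, and the same Euclidean distance applies there. A small sketch (the blank/white images are illustrative, not from the slides):

```python
# A 16x16 grayscale image is just a point in 256-dimensional space
rows, cols = 16, 16
image_a = [[0] * cols for _ in range(rows)]    # all-black image
image_b = [[255] * cols for _ in range(rows)]  # all-white image

vec_a = [p for row in image_a for p in row]    # flatten to 256 values
vec_b = [p for row in image_b for p in row]
print(len(vec_a))  # -> 256

# Euclidean distance between the two images
dist = sum((pa - pb) ** 2 for pa, pb in zip(vec_a, vec_b)) ** 0.5
print(dist)  # sqrt(256 * 255^2) = 4080.0
```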
Handwriting Recognition
Distances between digits.
[Pairs of digit images at increasing distances: ? / maybe / probably not]
Which is closest neighbour in N-dimensions?
K-Means Clustering
1. Choose k initial cluster centres (e.g. at random).
2. Assign each sample to its nearest centre.
3. Recompute each centre as the mean of the samples assigned to it.
4. Repeat steps 2-3 until the assignments no longer change.
[Figures: successive iterations of k-means on example data]
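The four steps above can be sketched directly. This is a minimal illustration under simple assumptions (random initialisation from the data, squared Euclidean distance); the example points are made up.

```python
import random

def k_means(points, k, iters=100, seed=0):
    rng = random.Random(seed)
    # Step 1: choose k initial centres at random from the data
    centroids = rng.sample(points, k)
    for _ in range(iters):
        # Step 2: assign every point to its nearest centre
        clusters = [[] for _ in range(k)]
        for p in points:
            i = min(range(k),
                    key=lambda c: sum((pj - cj) ** 2
                                      for pj, cj in zip(p, centroids[c])))
            clusters[i].append(p)
        # Step 3: recompute each centre as the mean of its cluster
        new_centroids = [
            tuple(sum(dim) / len(cl) for dim in zip(*cl)) if cl else centroids[i]
            for i, cl in enumerate(clusters)
        ]
        # Step 4: stop once the centres no longer move
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, clusters

# Three hypothetical well-separated groups of 2-D points
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0),
          (20.0, 0.0), (20.0, 1.0), (21.0, 0.0)]
centroids, clusters = k_means(points, k=3)
```

Like all k-means implementations, this can converge to a poor local optimum; in practice it is run several times with different seeds.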
Assignment
Write code to identify the three clusters in the given image.
The image will be emailed to you.
You have to submit the code, and resulting image
by next week.
k-Nearest Neighbour (k-NN) Classification
[Figures: 1-NN and 3-NN decision examples]
The test sample (green circle) should be classified either to the first class of
blue squares or to the second class of red triangles. If k = 3 it is assigned to
the second class because there are 2 triangles and only 1 square inside the
inner circle. If k = 5 it is assigned to the first class (3 squares vs. 2 triangles
inside the outer circle).
Image Source: Wikipedia
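The k = 3 vs. k = 5 behaviour described in the caption can be reproduced with a majority vote over the k nearest neighbours. The coordinates below are made up, arranged so that the two values of k disagree just as in the figure.

```python
from collections import Counter
import math

def knn_classify(train, query, k):
    # Sort training samples by Euclidean distance to the query point
    by_dist = sorted(train, key=lambda s: math.dist(s[0], query))
    # Majority vote among the k nearest neighbours
    votes = Counter(label for _, label in by_dist[:k])
    return votes.most_common(1)[0][0]

# Two triangles close to the query, three squares a bit further away
train = [((1.0, 0.0), "triangle"), ((0.0, 1.1), "triangle"),
         ((1.2, 1.2), "square"), ((1.5, 1.5), "square"), ((1.6, 1.4), "square")]
print(knn_classify(train, (0.0, 0.0), k=3))  # -> triangle
print(knn_classify(train, (0.0, 0.0), k=5))  # -> square
```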
Distance Metric
A distance metric D(.,.) is a function that gives a generalized distance between two patterns.
For any three vectors a, b and c, a distance metric must satisfy the following properties:
Non-negativity: D(a, b) >= 0
Reflexivity: D(a, b) = 0 if and only if a = b
Symmetry: D(a, b) = D(b, a)
Triangle inequality: D(a, c) <= D(a, b) + D(b, c)
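The four properties can be spot-checked numerically for the Euclidean distance. This is an illustration on a few example vectors, not a proof.

```python
import math

def D(a, b):
    return math.dist(a, b)  # Euclidean distance

a, b, c = (0.0, 0.0), (3.0, 4.0), (6.0, 0.0)

assert D(a, b) >= 0                      # non-negativity
assert D(a, a) == 0 and D(a, b) > 0      # reflexivity: D(a,b) = 0 iff a = b
assert D(a, b) == D(b, a)                # symmetry
assert D(a, c) <= D(a, b) + D(b, c)      # triangle inequality
```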
Distance Metric
Euclidean distance possesses all four properties.

Euclidean distance in d dimensions:
D(a, b) = ||a - b|| = sqrt( sum_{k=1}^{d} (a_k - b_k)^2 )

More generally, the Minkowski metric:
L_k(a, b) = ( sum_{i=1}^{d} |a_i - b_i|^k )^(1/k)
(Euclidean distance is the k = 2 case; k = 1 gives the Manhattan distance.)
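A small sketch of the Minkowski metric, showing the familiar k = 1 and k = 2 special cases on an example pair of vectors:

```python
def minkowski(a, b, k):
    # L_k(a, b) = (sum_i |a_i - b_i|^k)^(1/k)
    return sum(abs(ai - bi) ** k for ai, bi in zip(a, b)) ** (1.0 / k)

a, b = (0.0, 0.0), (3.0, 4.0)
print(minkowski(a, b, 1))  # L1 (Manhattan)  -> 7.0
print(minkowski(a, b, 2))  # L2 (Euclidean)  -> 5.0
```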
Distance Metric
Distance metrics may not be invariant to linear
transformations (e.g. scaling)
Distance Metric
Data normalization
If there is a large disparity in the ranges of the full
data in each dimension, a common procedure is to
rescale all the data to equalize such ranges
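One common rescaling is min-max normalisation, which maps every dimension to [0, 1]. A minimal sketch (the height/shoe-size numbers are illustrative):

```python
def rescale(data):
    # Rescale each dimension independently to the [0, 1] range
    dims = list(zip(*data))
    lo = [min(d) for d in dims]
    hi = [max(d) for d in dims]
    return [tuple((x - l) / (h - l) if h > l else 0.0
                  for x, l, h in zip(p, lo, hi)) for p in data]

# Height in cm (~150-200) vs. shoe size (~38-46): very different ranges
data = [(150.0, 38.0), (200.0, 46.0), (175.0, 42.0)]
print(rescale(data))  # -> [(0.0, 0.0), (1.0, 1.0), (0.5, 0.5)]
```

Without this step, the dimension with the larger numeric range (height) would dominate any Euclidean distance.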
Decision Regions
Voronoi Diagram
Given a set of points (referred to as sites or nodes) a
Voronoi diagram is a partition of space into regions,
within which all points are closer to some particular
node than to any other node
Voronoi Editing
Each cell contains one sample, and every location within
the cell is closer to that sample than to any other
sample.
Every query point will be assigned the classification of
the sample within that cell.
Voronoi Editing
Knowledge of this boundary is sufficient to classify new
points.
The boundary itself is rarely computed; many algorithms
seek to retain only those points necessary to generate
an identical boundary.
Voronoi Editing
The boundaries of the Voronoi regions separating
those regions whose nodes are of different class
contribute to a portion of the decision boundary
Voronoi Editing
The nodes of the Voronoi regions whose boundaries
did not contribute to the decision boundary are
redundant and can be safely deleted from the training
set
Voronoi Editing
Points A1 and B1 are well separated. They are preserved by Voronoi editing to maintain a portion of the decision boundary. However, assuming that new points will come from the same distribution as the training set, the portions of the decision boundary remote from the concentration of training points are of lesser importance.
Gabriel Graph
Two points A and B are said to be Gabriel neighbours if
their diametral sphere (i.e. the sphere such that AB is
its diameter) doesn't contain any other points
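The diametral-sphere test has a simple algebraic form: a point c lies inside the sphere with diameter AB exactly when the angle acb exceeds 90 degrees, i.e. when |ac|^2 + |bc|^2 < |ab|^2. A sketch on made-up points:

```python
import math

def gabriel_neighbours(a, b, points):
    # a and b are Gabriel neighbours iff no other point lies inside the
    # sphere having segment ab as its diameter; c is inside that sphere
    # exactly when |ac|^2 + |bc|^2 < |ab|^2
    ab2 = math.dist(a, b) ** 2
    return all(math.dist(a, c) ** 2 + math.dist(b, c) ** 2 >= ab2
               for c in points if c != a and c != b)

pts = [(0.0, 0.0), (2.0, 0.0), (1.0, 0.1), (5.0, 5.0)]
print(gabriel_neighbours((0.0, 0.0), (2.0, 0.0), pts))  # (1.0, 0.1) is inside -> False
print(gabriel_neighbours((0.0, 0.0), (1.0, 0.1), pts))  # -> True
```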
Wilson Editing
Remove points that do not agree with the
majority of their k nearest neighbours
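Wilson editing as described above can be sketched directly: each sample is tested against the majority vote of its k nearest neighbours among the remaining samples. The two clusters and the single mislabelled point below are made-up example data.

```python
from collections import Counter
import math

def wilson_edit(samples, k=3):
    # Keep only the samples whose label agrees with the majority label
    # of their k nearest neighbours among the other samples
    kept = []
    for i, (x, y) in enumerate(samples):
        others = [s for j, s in enumerate(samples) if j != i]
        others.sort(key=lambda s: math.dist(s[0], x))
        votes = Counter(label for _, label in others[:k])
        if votes.most_common(1)[0][0] == y:
            kept.append((x, y))
    return kept

# Two hypothetical clean clusters plus one mislabelled point at (0.5, 0.5)
samples = [((0.0, 0.0), "a"), ((0.0, 1.0), "a"),
           ((1.0, 0.0), "a"), ((1.0, 1.0), "a"),
           ((0.5, 0.5), "b"),
           ((10.0, 10.0), "b"), ((10.0, 11.0), "b"),
           ((11.0, 10.0), "b"), ((11.0, 11.0), "b")]
edited = wilson_edit(samples, k=3)
print(len(edited))  # the mislabelled point is removed -> 8
```

This cleans up overlapping or mislabelled regions before applying a nearest-neighbour classifier.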
Overlapping classes
[Figures: original data; the consistent set after editing; the minimum consistent set]
References
Chapter 4 of Pattern Classification by Richard O. Duda, Peter E. Hart & David G. Stork
Some material from Chapter 10 of the same book