
Data Mining [CSEN 911]

GUC - Winter 2024 – Lecture 7


Model Building – kNN Classifier, Classification
Model Evaluation

Dr. Ayman Al-Serafi

TAs:
Tameem Alghazaly* (lead)
Nada Bakeer
Sarah Samir
Mariam Moustafa
Outline
1. Midterm Feedback
2. Classification using Lazy Learning: kNN
3. Classification Model Evaluation
4. Conclusion

Q&A breaks between sections; urgent Qs only in between!
Outline
1. Midterm Feedback
2. Classification using Lazy Learning: kNN
3. Classification Model Evaluation
4. Conclusion

Q&A
Midterm Review – Data Mining Conceptual Classification

Data Mining Strategy                 | Data Mining Technique          | Data Mining Algorithm
Supervised Learning - Classification | Decision Tree Learning         | Information Gain (C4.5)
Supervised Learning - Estimation     | Linear Regression              | Gradient Descent / Least-squares fitting
Unsupervised Learning - Clustering   | Partitioning-based Clustering  | K-Means
Market Basket Analysis               | Association Rule Mining        | Apriori


Outline
1. Midterm Feedback
2. Classification using Lazy Learning: kNN
3. Classification Model Evaluation
4. Conclusion

Q&A
CLASSIFYING THE CLASSIFIERS!



LAZY VS. EAGER LEARNING
 Lazy learning (e.g., instance-based learning): simply stores the training data (or does only minor processing) and waits until it is given a test tuple (it doesn't create a model!)
 Eager learning (the previously discussed methods): given a set of training tuples, constructs a classification model before receiving new (e.g., test) data to classify

Lazy: less time in training but more time in predicting.

Accuracy
 A lazy method effectively uses a richer hypothesis space, since it uses many local functions to form an implicit global approximation to the target function
 An eager method must commit to a single hypothesis (model) that covers the entire instance space
LAZY LEARNER: INSTANCE-BASED METHODS

Instance-based learning:
 Store training examples and delay the processing ("lazy evaluation") until a new instance must be classified
 Must have a database of previous examples to be able to classify future examples

Typical approach: the k-nearest neighbour (kNN) approach (see the sketch below)
1. Instances are represented as points in a Euclidean space  can only work with numerical input data!
2. Compute the distance between the new point and all other points in the database
3. Find the k closest instances in the database (e.g., by Euclidean distance)
4. Assign the most-voted class of the dependent variable as the class for the instance with the unknown classification
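A minimal sketch of the four steps above in plain Python (the names `train`, `labels`, and `query` are hypothetical, and a real implementation would normalize the inputs first):

```python
import math
from collections import Counter

def euclidean(a, b):
    # Distance between two purely numerical points (step 1)
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))

def knn_classify(train, labels, query, k=3):
    # Step 2: compute the distance from the query to every stored tuple
    order = sorted(range(len(train)), key=lambda i: euclidean(train[i], query))
    # Step 3: keep the k closest instances
    top_k = order[:k]
    # Step 4: majority vote over their class labels
    return Counter(labels[i] for i in top_k).most_common(1)[0][0]

# Hypothetical usage
train = [(25, 40), (35, 60), (45, 80), (33, 150)]
labels = ["No", "No", "No", "Yes"]
print(knn_classify(train, labels, query=(48, 142), k=3))
```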


LAZY LEARNERS
K-NEAREST NEIGHBOR (KNN) CLASSIFIERS
Delay classification until new test data is available
 Store the training data in the meantime

Use a similarity measure to compute the distance between the test data tuple and each of the training data tuples (Euclidean, Manhattan, …)
 Remember to normalize if ranges vary between attributes

k stands for the number of "closest" neighbours of a test data tuple according to the measured distance
 Majority voting over their class labels is used to determine the class of the test tuple


THE K-NEAREST NEIGHBOUR ALGORITHM
All instances correspond to points in the n-D space.
The nearest neighbours are defined in terms of Euclidean distance, dist(X1, X2).
The dependent variable could be discrete (categorical) or numerical.
For discrete-valued targets, k-NN returns the most common value among the k training examples nearest to xq.
Each of the k nearest neighbours votes for a class for xq.

[Figure: a query point xq plotted among "+" and "−" training instances; its class is decided by majority vote among its k nearest neighbours.]
DISCUSSION ON THE K-NN ALGORITHM
k-NN can also be used for numerical prediction for a given unknown tuple
 Returns the mean value of the k nearest neighbors

Robust to noisy data, by averaging over the k nearest neighbors.

Curse of dimensionality: the distance between neighbours can be dominated by irrelevant attributes
 To overcome it, apply normalization and data reduction: reduce the number of input attributes by eliminating the least relevant ones
LAZY LEARNERS
K-NEAREST NEIGHBOR – EUCLIDEAN DISTANCE + SIMILARITY

Euclidean distance between two instances x and y, over the input (independent) variables 1 to z:

$D = \sqrt{(x_1 - y_1)^2 + (x_2 - y_2)^2 + \cdots + (x_z - y_z)^2}$

Similarity is the opposite of distance:

$S = \frac{1}{1 + D}$, or $S = 1 - D$ if the distance is normalized into [0, 1] as a % distance (divide by max(D))
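The two formulas, directly in Python (a sketch; `x` and `y` stand for two instances with z numerical attributes):

```python
import math

def distance(x, y):
    # D = sqrt of the sum of squared differences over the z input variables
    return math.sqrt(sum((xi - yi) ** 2 for xi, yi in zip(x, y)))

def similarity(d, max_d=None):
    # S = 1/(1 + D), or S = 1 - D/max(D) when normalizing distances to [0, 1]
    return 1 - d / max_d if max_d else 1 / (1 + d)
```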


LAZY LEARNERS - K-NEAREST NEIGHBOR (KNN) EXAMPLE

Query tuple: age = 48, Loan = $142,000, Default = ? (max distance over the training set = 124,000)

RID | Age | Loan ($) | Default | Euclidean distance                        | Normalized (÷ 124,000) | Similarity = 1 − normalized
1   | 25  | 40,000   | No      | √((25−48)² + (40,000−142,000)²) ≈ 102,000 | 0.823 | 0.18 (18%)
2   | 35  | 60,000   | No      | √((35−48)² + (60,000−142,000)²) ≈ 82,000  | 0.661 | 0.34 (34%)
3   | 45  | 80,000   | No      | √((45−48)² + (80,000−142,000)²) ≈ 62,000  | 0.500 | 0.50 (50%)
4   | 20  | 20,000   | No      | √((20−48)² + (20,000−142,000)²) ≈ 122,000 | 0.984 | 0.02 (2%)
5   | 35  | 120,000  | No      | √((35−48)² + (120,000−142,000)²) ≈ 22,000 | 0.177 | 0.82 (82%)
6   | 52  | 18,000   | No      | √((52−48)² + (18,000−142,000)²) ≈ 124,000 | 1.000 | 0.00 (0%)
7   | 23  | 95,000   | Yes     | √((23−48)² + (95,000−142,000)²) ≈ 47,000  | 0.379 | 0.62 (62%)
8   | 40  | 62,000   | Yes     | √((40−48)² + (62,000−142,000)²) ≈ 80,000  | 0.645 | 0.36 (36%)
9   | 60  | 100,000  | Yes     | √((60−48)² + (100,000−142,000)²) ≈ 42,000 | 0.339 | 0.66 (66%)
10  | 48  | 220,000  | Yes     | √((48−48)² + (220,000−142,000)²) ≈ 78,000 | 0.629 | 0.37 (37%)
11  | 33  | 150,000  | Yes     | √((33−48)² + (150,000−142,000)²) ≈ 8,000  | 0.065 | 0.94 (94%)


LAZY LEARNERS
K-NEAREST NEIGHBOR (KNN) EXAMPLE

$D = \sqrt{(x_1 - y_1)^2 + (x_2 - y_2)^2}$

RID | Age | Loan ($) | Default | Distance
1   | 25  | 40,000   | No      | 102,000
2   | 35  | 60,000   | No      | 82,000
3   | 45  | 80,000   | No      | 62,000
4   | 20  | 20,000   | No      | 122,000
5   | 35  | 120,000  | No      | 22,000
6   | 52  | 18,000   | No      | 124,000
7   | 23  | 95,000   | Yes     | 47,000
8   | 40  | 62,000   | Yes     | 80,000
9   | 60  | 100,000  | Yes     | 42,000
10  | 48  | 220,000  | Yes     | 78,000
11  | 33  | 150,000  | Yes     | 8,000
Query | 48 | 142,000 | → Yes   |

If k = 1  the nearest neighbour is RID 11 • Default = YES
If k = 3  the nearest neighbours are RIDs 11, 5, 9 (two Yes, one No) • Default = YES
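The same example reproduced in Python (a sketch, using the raw unnormalized attributes exactly as on the slide):

```python
import math
from collections import Counter

rows = [  # (RID, age, loan, default)
    (1, 25, 40000, "No"),   (2, 35, 60000, "No"),  (3, 45, 80000, "No"),
    (4, 20, 20000, "No"),   (5, 35, 120000, "No"), (6, 52, 18000, "No"),
    (7, 23, 95000, "Yes"),  (8, 40, 62000, "Yes"), (9, 60, 100000, "Yes"),
    (10, 48, 220000, "Yes"), (11, 33, 150000, "Yes"),
]
query = (48, 142000)

# Sort all training tuples by Euclidean distance to the query
dists = sorted(
    (math.hypot(age - query[0], loan - query[1]), rid, default)
    for rid, age, loan, default in rows
)
for k in (1, 3):
    vote = Counter(label for _, _, label in dists[:k]).most_common(1)[0][0]
    print(f"k={k}: neighbours {[rid for _, rid, _ in dists[:k]]} -> {vote}")
# k=1: RID 11 -> Yes;  k=3: RIDs 11, 5, 9 -> Yes (two Yes vs one No)
```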
KNN EXAMPLE
VOTER PARTY REGISTRATION
Assume we have a training data set of voters, each tagged with three attributes: voter party registration, voter wealth, and a quantitative measure of voter religiousness.
We want to predict party registration using wealth and religiousness as predictors.


KNN EXAMPLE
VOTER PARTY REGISTRATION
Using kNN with k = 1, we can predict party registration for each voter in the training data  highly overfitted!

[Figure: decision regions for K = 1]
KNN EXAMPLE
VOTER PARTY REGISTRATION

[Figure: decision regions for K = 3 and K = 10; lighter colors indicate less certainty about the predictions]
 Reasonable fit to the data
KNN EXAMPLE
VOTER PARTY REGISTRATION

[Figure: decision regions for K = 20 and K = 80; lighter colors indicate less certainty about the predictions]
 Highly underfitted
Precautions and tips for kNN
• Choosing a reasonable number for k: usually an odd number like 3, 5, or 7
• How to handle missing values?
• Normalise the input independent variables (see the sketch below)
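Because Loan ($) dwarfs age in the earlier example, the distances are dominated by the loan amount. One common option is min-max normalization, which rescales every attribute to the same [0, 1] range (a sketch; z-score standardization is an alternative):

```python
def min_max_normalize(column):
    # Rescale each value to [0, 1]: (v - min) / (max - min)
    lo, hi = min(column), max(column)
    return [(v - lo) / (hi - lo) for v in column]

ages = [25, 35, 45, 20, 35, 52, 23, 40, 60, 48, 33]
print(min_max_normalize(ages))  # all values now lie in [0, 1]
```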


Outline
1. Midterm Feedback
2. Classification using Lazy Learning: kNN
3. Classification Model Evaluation
4. Conclusion

Q&A
EVALUATING SUPERVISED LEARNER MODELS
Performance evaluation is probably the most critical of all steps in the data mining process.
Supervised learner models are used to classify, estimate, and/or predict future outcomes.
For some applications the desire is to build a model showing consistently high predictive accuracy.

Three example applications focus on classification correctness:
 Develop a model to accept or reject credit card applicants
 Develop a model to accept or reject home mortgage applicants
 Develop a model to decide whether or not to drill for oil

Classification correctness is best calculated by presenting previously unseen data to the model and summarizing the results in a table known as a confusion matrix.
TWO-CLASS ERROR ANALYSIS
Many of the applications listed previously represent two-class problems.
 Yes / No, High / Low, etc.

For example, the cells with True Accept and True Reject represent correctly classified instances.
A cell with False Accept denotes accepted applicants that should have been rejected.
A cell with False Reject denotes rejected applicants that should have been accepted.
TWO-CLASS ERROR ANALYSIS EXPLAINED

Table 2.6 • A Simple Confusion Matrix

        | Computed Accept | Computed Reject
Accept  | True Accept     | False Reject
Reject  | False Accept    | True Reject

Equivalently, with the true class in the columns and the predicted class in the rows:

                    | True Positive             | True Negative
Predicted Positive  | True Positive Count (TP)  | False Positive Count (FP)
Predicted Negative  | False Negative Count (FN) | True Negative Count (TN)
MODEL EVALUATION METRICS

Measure                                              | Formula
accuracy, recognition rate                           | (TP + TN) / (P + N)
error rate, misclassification rate (also 1−Accuracy) | (FP + FN) / (P + N)
recall, true positive rate, sensitivity              | TP / P
specificity, true negative rate                      | TN / N
precision                                            | TP / (TP + FP)

Confusion matrix:

           | Predicted Yes | Predicted No | Total
Actual Yes | TP            | FN           | P
Actual No  | FP            | TN           | N
Total      | P′            | N′           | P + N

Positives  tuples representing the class of interest
Negatives  tuples representing the other class(es)
True Positives  positive tuples correctly labeled
False Positives  negative tuples incorrectly labeled
True Negatives  negative tuples correctly labeled
False Negatives  positive tuples incorrectly labeled
CLASSIFIER EVALUATION METRICS: ACCURACY, ERROR RATE, SENSITIVITY AND SPECIFICITY

A \ P | C  | ¬C |
C     | TP | FN | P
¬C    | FP | TN | N
      | P′ | N′ | All

Classifier accuracy, or recognition rate: the percentage of test set tuples that are correctly classified
 Accuracy = (TP + TN) / All
Error rate: 1 − accuracy, or
 Error rate = (FP + FN) / All

Class Imbalance Problem:
 One class may be rare, e.g. fraud, or COVID-positive
 Significant majority of the negative class and minority of the positive class
 Sensitivity: true positive recognition rate, Sensitivity = TP / P
 Specificity: true negative recognition rate, Specificity = TN / N
MODEL EVALUATION
METRICS FOR EVALUATING CLASSIFIER PERFORMANCE

Balanced Classes

           | Predicted Yes | Predicted No | Total  | Accuracy (%)
Actual Yes | 6,954         | 46           | 7,000  | 99.34
Actual No  | 412           | 2,588        | 3,000  | 86.27
Total      | 7,366         | 2,634        | 10,000 | 95.42

Example Buys_Computer confusion matrix
 Use accuracy and error rate
MODEL EVALUATION
METRICS FOR EVALUATING CLASSIFIER PERFORMANCE

As the data has too few cancer (positive) patients, we can't depend on accuracy alone to evaluate the model!

Imbalanced Classes

           | Predicted Yes | Predicted No | Total  | Accuracy (%)
Actual Yes | 90            | 210          | 300    | 30 (90/300 = 30%)  low sensitivity
Actual No  | 140           | 9,560        | 9,700  | 98.56 (9560/9700)  high specificity
Total      | 230           | 9,770        | 10,000 | 96.5 ((90+9560)/10000)

Example cancer confusion matrix
 Use sensitivity (true positive rate, i.e. recall) and specificity
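The cancer matrix above, evaluated in Python (a sketch; the counts are taken from the slide):

```python
TP, FN = 90, 210    # actual Yes (cancer): 300 tuples
FP, TN = 140, 9560  # actual No: 9,700 tuples
P, N = TP + FN, FP + TN

accuracy    = (TP + TN) / (P + N)  # 0.965 -- looks deceptively good
error_rate  = (FP + FN) / (P + N)  # 0.035
sensitivity = TP / P               # 0.30  -- the model misses 70% of cancers
specificity = TN / N               # ~0.9856
print(accuracy, error_rate, sensitivity, specificity)
```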


CONFUSION MATRIX FOR MULTI-CLASS EXPLAINED

Table 2.5 • A Three-Class Confusion Matrix

     | Computed C1 | Computed C2 | Computed C3
C1   | C11         | C12         | C13
C2   | C21         | C22         | C23
C3   | C31         | C32         | C33

The diagonal cells (C11, C22, C33) hold the TRUE / CORRECT classifications.
CONFUSION MATRIX
• A matrix used to summarize the results of a supervised classification.
• Entries along the main diagonal are correct classifications.
• Entries other than those on the main diagonal are classification errors.
• Rule 1: the value C11 represents the total number of C1 instances correctly classified by the model. The same logic applies to C22 and C33.
• Rule 2: values in row Ci represent instances that actually belong to class Ci. For example, with i = 2, the instances associated with cells C21, C22, and C23 are all actually members of C2. To find the total number of C2 instances misclassified as members of other classes, we compute the sum C21 + C23.
• Rule 3: values found in column Ci indicate instances that have been classified as members of Ci. With i = 2, the instances associated with cells C12, C22, and C32 were all classified as C2. To find the total number of instances incorrectly classified as C2, we compute the sum C12 + C32. (A sketch of all three rules follows.)
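The three rules in Python, for a hypothetical 3×3 matrix `m` where `m[i][j]` counts class-i instances computed as class j:

```python
m = [[50, 3, 2],   # actual C1
     [4, 40, 6],   # actual C2
     [1, 5, 60]]   # actual C3

i = 1  # class C2 (0-based index)
correct         = m[i][i]                             # Rule 1: C22
missed_as_other = sum(m[i]) - m[i][i]                 # Rule 2: C21 + C23
wrongly_labeled = sum(row[i] for row in m) - m[i][i]  # Rule 3: C12 + C32
print(correct, missed_as_other, wrongly_labeled)      # 40 10 8
```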
Is the Error Rate sufficient to judge?

For both models, Error Rate = (25 + 75) / 1000 = 10%

Table 2.7 • Two Confusion Matrices Each Showing a 10% Error Rate

Model A | Computed Accept | Computed Reject      Model B | Computed Accept | Computed Reject
Accept  | 600             | 25                   Accept  | 600             | 75
Reject  | 75              | 300                  Reject  | 25              | 300

RecallA = 600 / 625 = 96%          RecallB = 600 / 675 ≈ 88.9%
PrecisionA = 600 / 675 ≈ 88.9%     PrecisionB = 600 / 625 = 96%

F measure (F1 or F-score): the harmonic mean of precision and recall,
F1 = 2 · precision · recall / (precision + recall)
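Recall, precision, and F1 for the two matrices in Python (a sketch; Accept is treated as the positive class):

```python
def prf(tp, fn, fp):
    # Recall = TP/(TP+FN), precision = TP/(TP+FP), F1 = their harmonic mean
    recall = tp / (tp + fn)
    precision = tp / (tp + fp)
    f1 = 2 * precision * recall / (precision + recall)
    return recall, precision, f1

print(prf(tp=600, fn=25, fp=75))  # Model A: recall 0.96, precision ~0.889
print(prf(tp=600, fn=75, fp=25))  # Model B: recall ~0.889, precision 0.96
# Both models end up with the same F1 (~0.923): the harmonic mean is symmetric.
```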
Comparing Models by Measuring Lift for Binary Classification
 The hope is to select samples that will show higher response rates than the rates seen within the general population.

 Supervised learner models designed for extracting biased samples from a general population are often evaluated by a measure that comes directly from marketing, known as LIFT.

 A lift value of 3+ is considered very good: 3 times better than random selection of the positive class from the population!
 1+ means the model is better than random selection from the population
 0 to 1 means random selection from the population is better than the model
COMPUTING LIFT

$\text{Lift} = \frac{P(C_i \mid \text{Sample})}{P(C_i \mid \text{Population})}$

Lift measures the change in percent concentration of a desired class, Ci, taken from a biased sample, relative to the concentration of Ci within the entire population.
Table 2.9 • Two Confusion Matrices for Alternative Models with Lift Equal to 2.25

Model X | Computed Accept | Computed Reject      Model Y | Computed Accept | Computed Reject
Accept  | 540             | 460                  Accept  | 450             | 550
Reject  | 23,460          | 75,540               Reject  | 19,550          | 79,450

Lift (model X) = (540/24,000) / (1,000/100,000) = 2.25
Lift (model Y) = (450/20,000) / (1,000/100,000) = 2.25

Generically, for a matrix
A B
C D
Lift = (A / (A + C)) / ((A + B) / All)
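Lift computed from the generic A B / C D layout above (a sketch, with the slide's counts):

```python
def lift(a, b, c, d):
    # P(Ci | sample) / P(Ci | population)
    total = a + b + c + d
    return (a / (a + c)) / ((a + b) / total)

print(lift(540, 460, 23_460, 75_540))  # Model X: 2.25
print(lift(450, 550, 19_550, 79_450))  # Model Y: 2.25
```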
Table 2.9 (revisited) • The same two models, with lift 2.25 each, compared on accuracy

Model X | Computed Accept | Computed Reject      Model Y | Computed Accept | Computed Reject
Accept  | 540             | 460                  Accept  | 450             | 550
Reject  | 23,460          | 75,540               Reject  | 19,550          | 79,450

AccuracyX = (540 + 75,540) / 100,000 = 76.08%
AccuracyY = (450 + 79,450) / 100,000 = 79.9%
Table 2.8 • Two Confusion Matrices: No Model and an Ideal Model

No Model | Computed Accept | Computed Reject      Ideal Model | Computed Accept | Computed Reject
Accept   | 1,000           | 0                    Accept      | 1,000           | 0
Reject   | 99,000          | 0                    Reject      | 0               | 99,000

Lift (no model) = (1,000/100,000) / (1,000/100,000) = 1
Lift (ideal model) = (1,000/1,000) / (1,000/100,000) = 100
MODEL SELECTION: ROC CURVES FOR BINARY CLASSIFIERS
ROC (Receiver Operating Characteristic) curves: for visual comparison of classification models
 The true positive rate (TPR, also called sensitivity) is calculated as TP / (TP + FN) = recall
 The false positive rate (FPR) is calculated as FP / (FP + TN)

Originated from signal detection theory.

Shows the trade-off between the true positive rate and the false positive rate:
 The vertical axis represents the true positive rate
 The horizontal axis represents the false positive rate
 The plot also shows a diagonal line (random guessing)

The area under the ROC curve (AUC) is a measure of the accuracy of the model:
 Rank the test tuples in decreasing order of estimated probability of belonging to the positive class: the tuple most likely to be positive appears at the top of the list
 These scores come from the classification output, like the class percentages we had at the leaf nodes of decision trees
 A model with perfect accuracy will have an AUC of 1.0
 The closer the curve is to the diagonal line (i.e., the closer the AUC is to 0.5), the less accurate the model
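A minimal ROC/AUC sketch using scikit-learn (assuming it is installed; `y_true` and `y_score` are hypothetical labels and ranking scores, e.g. leaf-node class percentages from a decision tree):

```python
from sklearn.metrics import roc_curve, roc_auc_score

y_true  = [1, 1, 0, 1, 0, 0, 1, 0]                    # actual classes
y_score = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.85, 0.2]   # estimated P(positive)

fpr, tpr, thresholds = roc_curve(y_true, y_score)  # points of the ROC curve
auc = roc_auc_score(y_true, y_score)  # 1.0 = perfect, 0.5 = diagonal (random)
print(auc)  # 0.9375 here: one negative outranks one positive
```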
Outline
1. Midterm Feedback
2. Classification using Lazy Learning: kNN
3. Classification Model Evaluation
4. Conclusion

Q&A
SUMMARY
kNN Classifier
• The algorithm  lazy learning
• The distance metric
• The parameter k

Model Evaluation
• Metrics for Evaluating Classifier Performance
Mini-Project 2
 Due on Friday, 15th November, 23:59

 Implement a supervised classification project in Python on a new dataset

 You are expected to implement the full CRISP-DM data mining process as a data scientist
THANK YOU FOR YOUR ATTENTION
NEXT LECTURE: Unsupervised Learning: k-means clustering, evaluation

NEXT TUTORIAL: Decision Trees Lab + Assignment / Mini-Project 2