Chapter 9
Chapter 9
Machine Learning
Anna Egorova-Frster
University of Lugano
Pattern Classification Reading
Group,
January 2007
Learning
So far: different classifiers and methods presented
BUT:
Is some classifier better than all others?
How to compare classifiers?
Is comparison possible at all?
Is at least some classifier always better than random?
AND
Do techniques exist which boost all classifiers?
Features Comparison
Binary feature fi
Patterns xi in the form:
f1 and f2, f1 or f2 etc.
Rank of a predicate r: the
number of simplest patterns it
contains.
Rank 1:
x1: f1 AND NOT f2
x2: f1 AND f2
x3: f2 AND NOT f1
Rank 2: Venn diagram
x1 OR x2 : f1
Rank 3:
x1 OR x2 OR x3 : f1 OR f2 Pattern Classification, Chapter 9
6
Features Comparison
To compare two patterns: take the number of features they
share?
Blind_left = {0,1}
Blind_right = {0,1}
Is (0,1) more similar to (1,0) or to (1,1)???
Different representations also possible:
Blind_right = {0,1}
Both_eyes_same = {0,1}
r2
2
D g(x,D) F(x) D g(x,D) D g(x,D)
2
bias2 var iance
1
(i) xj
n 1 ji
Bagging
Draw n < n training points and train a different
classifier
Combine classifiers votes into end result
Classifiers are of same type: all neural networks,
decision trees etc.
Instability: small changes in the training sets leads
to significantly different classifiers and/or results
Boosting
Improve the performance of different types of
classifiers
Weak learners: the classifier has accuracy only
slightly better than random
Example: three component-classifiers for a two-
class problem
Draw three different training sets D1, D2 and D3
and train three different classifiers C1, C2 and C3
(weak learners).
Boosting
D1: randomly draw n1 < n training points from D.
Train C1 with D1
D2: most informative dataset with respect to D1.
Half of the points are classified properly by C1, half of them not.
Flip a coin: if head, find the first pattern in D/D1 misclassified by C1.
If tails, find a pattern properly classified by C1.
Continue until possible
Train C2 with D2
D3: most informative with respect to C1 and C2.
Randomly select a pattern from D/(D1,D2)
If C1 and C2 disagree, add it to D3
Train C3 with D3
Boosting