18.Overview
18.Overview
Jihoon Yang
Examples
1 T : cancer diagnosis
E : a set of diagnosed cases
P: accuracy of diagnosis on new cases
Z : noisy measurements, occasionally misdiagnosed training cases
M: a program that runs on a general purpose computer
Training regime
Batch
Online
Distributed
Vertical fragmentation
Horizontal fragmentation
Noise
Attribute noise
Classification noise
Both
Accuracy =TP/N
TP(c)
Precision/Specificity(c) = TP(c)+FP(c)
TP(c)
Recall/Sensitivity(c) = TP(c)+FN(c)
FP(c)
False Alarm(c) = TP(c)+FP(c) = 1 − Precision(c)
In other words, the set of assumptions, that together with the training
data, deductively justify the classifications assigned by the learner to
future instances
HV = version space – the subset of H not yet ruled out by the learner
Weaker bias
→ more open to experience, flexible
→ more expressive hypothesis representation
Occam’s razor
Simple hypotheses preferred
Linear fit preferred to quadratic fit assuming both yield relatively good
fit over the training examples