CH11
Machine learning.
Course of M.L. by Pr. Abdellatif El Afia 2
Example: Digits Recognition
Develop software (a machine) able to recognize handwritten digits.
Task: multi-class classification.
To build this machine, we need a set of pictures as historical data.
The objective is to let the machine learn the best model from the data.
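As a rough illustration of multi-class classification learned from historical data, the sketch below trains a nearest-centroid classifier on tiny 3x3 bitmaps. The bitmaps, the labels, and the choice of a nearest-centroid rule are assumptions made for this example; the course does not prescribe a particular model at this point.

```python
# Toy multi-class classifier: nearest centroid on tiny 3x3 "digit" bitmaps.
# The bitmaps and labels are made up for illustration only.

def centroid(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]

def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def train(examples):
    """examples: list of (pixels, label). Returns {label: centroid}."""
    by_label = {}
    for pixels, label in examples:
        by_label.setdefault(label, []).append(pixels)
    return {label: centroid(vectors) for label, vectors in by_label.items()}

def predict(model, pixels):
    """Return the label whose centroid is closest to the input."""
    return min(model, key=lambda label: squared_distance(model[label], pixels))

# Historical data: (picture, correct digit) pairs.
TRAINING_SET = [
    ([1, 1, 1, 1, 0, 1, 1, 1, 1], 0),
    ([1, 1, 1, 1, 0, 1, 1, 1, 0], 0),
    ([0, 1, 0, 0, 1, 0, 0, 1, 0], 1),
    ([0, 1, 0, 1, 1, 0, 0, 1, 0], 1),
    ([1, 1, 1, 0, 0, 1, 0, 0, 1], 7),
    ([1, 1, 1, 0, 1, 0, 0, 1, 0], 7),
]

model = train(TRAINING_SET)
prediction = predict(model, [0, 1, 0, 0, 1, 0, 0, 1, 1])  # a slightly noisy "1"
print(prediction)
```

The learned model here is just one centroid per digit; a real digit recognizer would use far larger images and a richer hypothesis class.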
Overview of this Course
Scenario: Task
Supervised Learning
- The data form is: (Inputs, Correct Outputs)
- Learn from a dataset tagged by target variables.
- Classification, regression and ranking are tasks that belong to the scenario of supervised
learning.
- Example: Credit Approval (approve / don't approve)
Unsupervised Learning
- The form of the data is: (Inputs)
- Learn from a dataset that is not tagged by target variables.
- Partitioning and dimension reduction are tasks that belong to the scenario of unsupervised
learning.
Reinforcement Learning
- The data form is: (Input, Some Outputs, Reward for each output).
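The three data forms listed in this overview can be sketched as plain Python structures. The feature names, values, and rewards below are hypothetical and serve only to show the shape of the data in each scenario.

```python
# Hypothetical credit-approval inputs; names and values are made up.
applicants = [
    {"duration_months": 12, "credit_amount": 2500},
    {"duration_months": 48, "credit_amount": 9000},
]

# Supervised learning: (inputs, correct output) pairs.
supervised_data = [
    (applicants[0], "approve"),
    (applicants[1], "don't approve"),
]

# Unsupervised learning: inputs only, no target variable.
unsupervised_data = [inputs for inputs, _ in supervised_data]

# Reinforcement learning: an input, some outputs, and a reward for each
# of those outputs (not necessarily every possible output).
reinforcement_data = [
    (applicants[0], {"approve": 1.0, "don't approve": -1.0}),
    (applicants[1], {"approve": -2.0}),
]
```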
Task
Data -> Predictive Model -> Optimal predictor
Inputs of the training algorithm:
- Training set
- Testing set
- Feature set
- Labels set
Outputs of the training algorithm:
- Optimal hypothesis: the best probability distribution using the training set
Features                                   Type
Present employment                         Qualitative
Duration in month                          Numerical
Credit history                             Qualitative
Purpose                                    Qualitative
Age in years                               Numerical
Number of existing credits at this bank    Numerical
Credit amount                              Numerical
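Before a training algorithm can use a feature set that mixes qualitative and numerical features, the qualitative ones are usually turned into numbers. The sketch below uses one-hot encoding, which is one common choice and an assumption of this example; the identifier names and the two sample records are hypothetical.

```python
# Feature types taken from the table above; identifier names are our own.
FEATURE_TYPES = {
    "present_employment": "qualitative",
    "duration_months": "numerical",
    "credit_history": "qualitative",
    "purpose": "qualitative",
    "age_years": "numerical",
    "existing_credits": "numerical",
    "credit_amount": "numerical",
}

def fit_categories(records):
    """Collect the sorted set of observed values for each qualitative feature."""
    return {
        name: sorted({record[name] for record in records})
        for name, kind in FEATURE_TYPES.items()
        if kind == "qualitative"
    }

def encode(record, categories):
    """Numerical features pass through; qualitative ones become one-hot blocks."""
    vector = []
    for name, kind in FEATURE_TYPES.items():
        if kind == "numerical":
            vector.append(float(record[name]))
        else:
            vector.extend(1.0 if record[name] == c else 0.0 for c in categories[name])
    return vector

# Two made-up applicants, for illustration only.
records = [
    {"present_employment": "1-4 years", "duration_months": 12,
     "credit_history": "paid duly", "purpose": "car",
     "age_years": 35, "existing_credits": 1, "credit_amount": 2500},
    {"present_employment": "unemployed", "duration_months": 24,
     "credit_history": "delayed", "purpose": "education",
     "age_years": 28, "existing_credits": 2, "credit_amount": 4000},
]

categories = fit_categories(records)
encoded = [encode(record, categories) for record in records]
```

Each encoded vector has one entry per numerical feature plus one entry per observed category of each qualitative feature.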
Nonlinear models
[Figure: Training Data $S$ -> Model (Approximation Capability) -> Final hypothesis $h_S$; Error measure: $L_S(h_S)$ and $L_{D,f}(h_S)$]
Predictive Model: one definition for the regression case, one for the classification case.
General error: one definition for the regression case, one for the classification case.
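For reference, the two error measures named here are commonly written as follows. This is a sketch in the notation of Shalev-Shwartz and Ben-David, assuming a training set $S = ((x_1, y_1), \dots, (x_m, y_m))$, a distribution $D$ over inputs, a target function $f$, the 0-1 loss for classification, and the squared loss for regression; the slide's own formulas may differ in detail.

```latex
% Classification case (0-1 loss)
\[
L_S(h) = \frac{1}{m}\,\bigl|\{\, i \in \{1,\dots,m\} : h(x_i) \neq y_i \,\}\bigr|,
\qquad
L_{D,f}(h) = \Pr_{x \sim D}\bigl[\, h(x) \neq f(x) \,\bigr]
\]

% Regression case (squared loss, assumed here)
\[
L_S(h) = \frac{1}{m}\sum_{i=1}^{m} \bigl(h(x_i) - y_i\bigr)^2,
\qquad
L_{D,f}(h) = \mathbb{E}_{x \sim D}\Bigl[\bigl(h(x) - f(x)\bigr)^2\Bigr]
\]
```

$L_S(h)$ is computable from the training data, while $L_{D,f}(h)$ depends on the unknown $D$ and $f$ and can only be estimated.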
Validation & Cross-Validation
[Figure: Model -> Final hypothesis $h^*$; Error measure: $L_S(h^*)$ and $L_{D,f}(h^*)$]
Regularization
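To make the link between cross-validation and regularization concrete, here is a minimal sketch that picks a regularization strength by k-fold cross-validation. The model (one-weight ridge regression through the origin), the data, and the candidate values of `lam` are all assumptions chosen to keep the example small.

```python
def fit_ridge_1d(xs, ys, lam):
    """Minimize sum((w*x - y)^2) + lam * w^2 for a single weight w (closed form)."""
    return sum(x * y for x, y in zip(xs, ys)) / (sum(x * x for x in xs) + lam)

def mean_squared_error(w, xs, ys):
    return sum((w * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

def k_fold_error(xs, ys, lam, k):
    """Average validation error over k folds (fold j holds indices j, j+k, ...)."""
    n = len(xs)
    total = 0.0
    for fold in range(k):
        val_idx = set(range(fold, n, k))
        train_x = [xs[i] for i in range(n) if i not in val_idx]
        train_y = [ys[i] for i in range(n) if i not in val_idx]
        val_x = [xs[i] for i in sorted(val_idx)]
        val_y = [ys[i] for i in sorted(val_idx)]
        w = fit_ridge_1d(train_x, train_y, lam)
        total += mean_squared_error(w, val_x, val_y)
    return total / k

# Made-up data, roughly y = 2x with noise.
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [2.3, 3.8, 6.4, 7.7, 10.5, 11.6]

candidates = [0.0, 0.1, 1.0, 10.0]
cv_errors = {lam: k_fold_error(xs, ys, lam, k=3) for lam in candidates}
best_lam = min(cv_errors, key=cv_errors.get)

# Refit on all the data with the selected regularization strength.
final_w = fit_ridge_1d(xs, ys, best_lam)
```

The validation error stands in for the general error when comparing candidates; the final hypothesis is then refit on the full training set.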
Objective
What is learning?
PAC learning.
How can a machine learn?
ERM.
Is data learnable?
Uniform convergence.
What is the amount of data needed for learning?
Uniform convergence.
How learning might fail?
No-free-lunch theorem.
How can we measure the complexity of a model?
VC dimension and covering number.
Is the model a good learner?
Tradeoff Bias/Variance.
How to deal with models having an infinite complexity measure?
Non-uniform learning, SRM.
How can we improve the model’s learning?
Regularization and cross-validation.
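Two of the ideas above, ERM and the amount of data needed for learning, can be sketched for the simplest setting: a finite hypothesis class in the realizable case. The threshold class and the sample below are hypothetical; the bound used is the standard one for finite classes, $m \ge \ln(|H|/\delta)/\epsilon$, which guarantees that with probability at least $1-\delta$ every ERM hypothesis has general error at most $\epsilon$.

```python
import math

def make_threshold(t):
    """Hypothesis h_t(x) = 1 if x >= t else 0."""
    return lambda x: 1 if x >= t else 0

def empirical_risk(h, sample):
    """Fraction of training examples that h misclassifies, i.e. L_S(h)."""
    return sum(1 for x, y in sample if h(x) != y) / len(sample)

def erm(thresholds, sample):
    """Return the threshold whose hypothesis minimizes the empirical risk."""
    return min(thresholds, key=lambda t: empirical_risk(make_threshold(t), sample))

def sample_complexity_finite(h_size, epsilon, delta):
    """Sufficient sample size for a finite class in the realizable case."""
    return math.ceil(math.log(h_size / delta) / epsilon)

# Made-up sample S of (x, y) pairs, consistent with some threshold.
sample = [(1, 0), (2, 0), (3, 0), (6, 1), (7, 1), (9, 1)]
thresholds = list(range(11))  # finite class H of 11 threshold hypotheses

best_t = erm(thresholds, sample)
best_risk = empirical_risk(make_threshold(best_t), sample)
needed = sample_complexity_finite(len(thresholds), epsilon=0.1, delta=0.05)
```

Several thresholds reach zero empirical risk on this sample; `min` simply returns the first one, which is why more data is needed to narrow down the general error.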
• Yaser S. Abu-Mostafa, Malik Magdon-Ismail, Hsuan-Tien Lin. Learning from Data.
YouTube: https://www.youtube.com/watch?v=mbyG85GZ0PI&list=PLnIDYuXHkit4LcWjDe0EwlE57WiGlBs08
• Shai Shalev-Shwartz and Shai Ben-David. Understanding Machine Learning: From Theory to Algorithms.
YouTube: https://www.youtube.com/watch?v=b5NlRg8SjZg&list=PLFze15KrfxbH8SE4FgOHpMSY1h5HiRLMm
Book: https://www.cs.huji.ac.il/~shais/UnderstandingMachineLearning/understanding-machine-learning-theory-algorithms.pdf