Introduction, Course Logistics: CS771: Introduction To Machine Learning Nisheeth Srivastava
Introduction, Course Logistics: CS771: Introduction To Machine Learning Nisheeth Srivastava
All material will be posted on the mooKIT page for the course
3
CS771: Intro to ML
Course Team
Mahesh ([email protected])
Mosab ([email protected])
Neeraj ([email protected])
Rahul ([email protected])
Yatin ([email protected])
Nisheeth ([email protected])
4
CS771: Intro to ML
5
Workload and Grading Policy
4 homework assignments (theory + programming) worth 60%
Theory part: Derivations/analysis
Programming part: Implement/use ML algos, analysis of results. Must be done in
Python (learn if not already familiar)
Must be typeset in LaTeX (learn if not already familiar)
To be submitted via Gradescope (login details will be provided)
Will also involve a viva
Quizzes and exams (mid-sem and end-sem) worth 40%
Will be held online – details later
Quizzes will be worth 5% of the total grade
Mid-sem and end-sem will be worth 10, 20% each of the total grade
CS771: Intro to ML
6
Reference materials
CS771: Intro to ML
7
Course Goals
CS771: Intro to ML
How do we classify?
Let's say we want to classify the climate in a region
We may say something like
• If the summer temperature is above 35 C for more than 20 days during May-
July, the climate is tropical
• If the total rainfall is less than 10 cm all year, the climate is desert
Basic principles
• Think of simple rules that place thresholds on some measurable attributes of
the object
• Combine rules to maximize coverage of classification
CS771: Intro to ML
Classification using expert advice
Who should be screened for COVID-19?
Attributes
1. History of international travel
2. Flu-like symptoms
3. Contact with anyone reporting 1 or 2
Advantages:
• Simple to understand and critique
Disadvantages:
• Hard to implement without data and actuators
CS771: Intro to ML
Supervised classification using machine learning
Human Computer
Instead of asking experts to generate
Triangle
rules
• Ask them to say 'wrong' or 'right' Triangle
whenever a computer program classifies
something Triangle
Program is programmed to want to
Training
Square
make as few mistakes as possible
Revises its internal rules such that the
Testing
Wrong Triangle
expert has to say 'wrong' less often
Right Triangle
Rules are learned using expert
supervision Right Triangle
CS771: Intro to ML
Advantages of ML classification
Rules are learned from data instead of generated by experts so
• Rules have a lower chance of being contaminated by an expert's bias
• Rules are not restricted in complexity
• Rules are more likely to be testable
• Rules are likely to have greater predictive accuracy
But, for the same reason,
• Rules have a higher chance of missing important aspects of the situation not
captured within data
• Rules can become over-complex and hard to interpret
• Rules become less likely to generalize well to new situations
CS771: Intro to ML
14
Machine Learning (ML)
Designing algorithms that use data to programmatically learn a model of it
The learned model can be used to
Detect patterns/structures/themes/trends etc. in the data
Make predictions about future data and make decisions
CS771: Intro to ML
16
ML: From What It Does to How It Does It?
ML enables intelligent systems to be data-driven rather than rule-driven
How: By supplying training data and building statistical models of data
Pictorial illustration of an ML model for binary classification:
A Probabilistic Classifier (the statistical model)
P(“cat”|image) P(“dog”|image)
CS771: Intro to ML
17
Generalization
The most accurate model for your training data is a lookup table
CS771: Intro to ML
20
ML: Some Success Stories
CS771: Intro to ML
21
ML: Some Success Stories
CS771: Intro to ML
22
ML: Some Success Stories
Example from “Compilation error repair: for the student programs, from the student programs”, Ahmed et al (2018) CS771: Intro to ML
ML: Some success stories
Criminals?
Not Criminals?
An image captioning Don’t want a self-driving car Don’t want a predictive policing
system should not always that is more likely to hit black system that predicts criminality
assume a specific gender people than white people using facial features
Picture courtesy: Bhargava and Forsyth (2019), https://www.thestranger.com/, Xiaolin Wu and Xi Zhang, “Automated Inference on Criminality Using Face Images” CS771: Intro to ML
Learning about ML
We will also focus on
these Data
Representations
Algorithms
Outputs
26
CS771: Intro to ML
27
Looking Back Before We Start: History of ML
CS771: Intro to ML
28
Next Class
CS771: Intro to ML