01 Introduction ML
01 Introduction ML
N TO
MACHINE
LEARNING
1
WHAT IS LEARNING?
Herbert Simon: “Learning is any process by which a
system improves performance from experience.”
What is the task?
Classification
Categorization/clustering
Problem solving / planning / control
Prediction
others
WHAT IS MACHINE
LEARNING?
“Learning is any process by which a system improves
performance from experience.”
- Herbert Simon
Data
Computer Output
Program
Machine Learning
Data
Computer Progra
m
Output 4
WHEN DO WE USE MACHINE
LEARNING?
ML is used when:
• Human expertise does not exist (navigating on Mars)
• Humans can’t explain their expertise (speech recognition)
• Models must be customized (personalized medicine)
6
SOME MORE EXAMPLES OF TASKS
THAT ARE BEST SOLVED BY
USING A LEARNING ALGORITHM
• Recognizing patterns:
– Facial identities or facial expressions
– Handwritten or spoken words
– Medical images
• Generating patterns:
– Generating images or motion sequences
• Recognizing anomalies:
– Unusual credit card transactions
– Unusual patterns of sensor readings in a nuclear power
plant
• Prediction:
– Future stock prices or currency exchange rates
7
SAMPLE APPLICATIONS
• Web search
• Computational biology
• Finance
• E-commerce
• Space exploration
• Robotics
• Information extraction
• Social networks
• Debugging software
• [Your favorite area]
8
Slide credit: Pedro Domingos
Some web-based examples of machine learning
3. Recommendation systems:
• Lots of noisy data. Million dollar prize!
4. Information retrieval:
• Find documents or images with similar content.
5. Data Visualization:
• Display a huge database in a revealing way
4. The amount of knowledge available about
certain tasks might be too large for explicit
encoding by humans (e.g., medical
diagnostic).
5. Environments change over time.
6. New knowledge about tasks is constantly
being discovered by humans. It may be
difficult to continuously re-design systems “by
hand”.
AREAS OF INFLUENCE FOR MACHINE
LEARNING
1. Statistics: How best to use samples drawn from unknown
probability distributions to help decide from which distribution
some new sample is drawn?
2. Brain Models: Non-linear elements with weighted inputs
(Artificial Neural Networks) have been suggested as simple
models of biological neurons.
3. Adaptive Control Theory: How to deal with controlling a
process having unknown parameters that must be estimated
during operation?
4. Psychology: How to model human performance on various
learning tasks?
5. Artificial Intelligence: How to write algorithms to
acquire the knowledge humans are able to acquire,
at least, as well as humans?
6.Evolutionary Models: How to model certain
aspects of biological evolution to improve the
performance of computer programs?
Designing a Learning System
• In designing a learning system, we have to deal
with (at least) the following issues:
1. Training experience
2. Target function
3. Learned function
4. Learning algorithm
Training Experience
Issues concerning the training experience:
1. Direct or indirect evidence (supervised or unsupervised).
2. Controlled or uncontrolled sequence of training examples.
3. Representatively of training data in relation to test data.
Target Function and Learned Function
• The problem of improving performance can often be
reduced to the problem of learning some particular target
function.
• In many cases we can only hope to acquire some
approximation to the ideal target function.
Learning Algorithm
In order to learn the (approximated) target function we require:
1. A set of training examples (input arguments)
2. A rule for estimating the value corresponding to each
training example (if this is not directly available)
3. An algorithm for choosing the function that best fits the
training data
TYPES OF LEARNING
Induction vs deduction
Rote learning (memorization)
Advice or instructional learning
Learning by example or practice
Most popular; many applications
Learner
Environment/
Experience Knowledge
Performanc
e
Element
TRAINING EXPERIENCE
Direct experience: Given sample input and
output pairs for a useful target function.
Checker boards labeled with the correct move, e.g.
extracted from record of expert play
Indirect experience: Given feedback which is
not direct I/O pairs for a useful target function.
Potentially arbitrary sequences of game moves and
their final game results.
Credit/Blame Assignment Problem: How to
assign credit blame to individual moves given
only indirect feedback?
TRAINING VS. TEST
DISTRIBUTION
Generally assume that the training and test examples are
independently drawn from the same overall distribution
of data.
IID: Independently and identically distributed
1960s:
Neural networks: Perceptron
Pattern recognition
Learning in the limit theory
Minsky and Papert prove limitations of Perceptron
1970s:
Symbolic concept induction
Winston’s arch learner
Expert systems and the knowledge acquisition bottleneck
Quinlan’s ID3
Michalski’s AQ and soybean diagnosis
Scientific discovery with BACON
Mathematical discovery with AM
HISTORY OF MACHINE
LEARNING
1980s:
(CONT.)
Advanced decision tree and rule learning
Explanation-based Learning (EBL)
Learning and planning and problem solving
Utility problem
Analogy
Cognitive architectures
Resurgence of neural networks (connectionism,
backpropagation)
Valiant’s PAC Learning Theory
Focus on experimental methodology
1990s
Data mining
Adaptive software agents and web applications
Text learning
Reinforcement learning (RL)
Inductive Logic Programming (ILP)
Ensembles: Bagging, Boosting, and Stacking
Bayes Net learning
HISTORY OF MACHINE
LEARNING
2000s
(CONT.)
Support vector machines
Kernel methods
Graphical models
Statistical relational learning
Transfer learning
Sequence labeling
Collective classification and structured outputs
Computer Systems Applications
Compilers
Debugging
Graphics
Security (intrusion, virus, and worm detection)
E mail management
Personalized assistants that learn
Learning in robotics and vision
ROBOTICS AND ML
Areas that robots are used:
Industrial robots
Military, government and space robots
Service robots for home, healthcare, laboratory
Control technologies:
Autonomous (self-controlled), tele-operated (remote
control)
INDUSTRIAL ROBOTS
Uses for robots in
manufacturing:
Welding
Painting
Cutting
Dispensing
Assembly
Polishing/Finishing
Material Handling
Packaging, Palletizing
Machine loading
INDUSTRIAL ROBOTS
Uses for robots in Industry/Manufacturing
Automotive:
Video - Welding and handling of fuel tanks from TV show “How
It’s Made” on Discovery Channel. This is a system I worked on in
2003.
Packaging:
Video - Robots in food manufacturing.
INDUSTRIAL ROBOTS -
AUTOMOTIVE
MILITARY/GOVERNMENT
ROBOTS
iRobot PackBot
Remotec Andros
MILITARY/GOVERNMENT ROBOTS
Also… Mind-
controlled
wheelchair using
LABORATORY APPLICATIONS
Drug discovery
Test tube sorting
ALVINN
30 outputs
for steering
30x32 weights
4 hidden
into one out of
units
four hidden
30x32 pixels unit
as inputs
In other words the Types of Machine Learning may
be as follows:
• Supervised Learning
– Classification(pattern recognition)
– Regression
• Unsupervised Learning
• Reinforcement Learning
“Machine Learning: Field of study that gives
computers the ability to learn without being
explicitly programmed.” -Arthur Samuel (1959)
SAMUEL’S CHECKERS-PLAYER
9
STATE OF THE ART
APPLICATIONS OF
MACHINE LEARNING
11
AUTONOMOUS
CARS
13
AUTONOMOUS CAR
TECHNOLOGY
Path
Planning
Sebastian
Stanle
y
15
DEEP BELIEF NET ON FACE
IMAGES
object
models
object
parts
(combinati
on of
edges)
edge
s
pixel
Based on materials s 16
by Andrew Ng
LEARNING OF
OBJECT PARTS
17
Slide credit: Andrew Ng
TRAINING ON MULTIPLE
OBJECTS
18
Slide credit: Andrew Ng
SCENE LABELING VIA DEEP
LEARNING
Input images
Samples from
feedforward
Inference
(control)
Samples from
Full posterior
inference
20
Slide credit: Andrew Ng
MACHINE LEARNING
IN AUTOMATIC SPEECH
RECOGNITION
A Typical Speech Recognition System
22
Slide credit: Li Deng, MS Research
ML IN PRACTICE
48
Based on a slide by Pedro Domingos
ASSIGNMENT
QUESTIONS
How does ML affect information
science?
Natural vs artificial learning – which is
better?
Is ML needed in all problems?
What are the future directions of ML?