01 Introduction
Machine Learning
What is Machine Learning?
“Learning is any process by which a system
improves performance from experience.”
- Herbert Simon
Traditional programming: Data + Program → Computer → Output
Machine learning: Data + Output → Computer → Program
When Do We Use Machine Learning?
ML is used when:
• Human expertise does not exist (navigating on Mars)
• Humans can’t explain their expertise (speech recognition)
• Models must be customized (personalized medicine)
• Models are based on huge amounts of data (genomics)
Some more examples of tasks that are best solved by using a learning algorithm
• Recognizing patterns:
– Facial identities or facial expressions
– Handwritten or spoken words
– Medical images
• Generating patterns:
– Generating images or motion sequences
• Recognizing anomalies:
– Unusual credit card transactions
– Unusual patterns of sensor readings in a nuclear power plant
• Prediction:
– Future stock prices or currency exchange rates
Sample Applications
• Web search
• Computational biology
• Finance
• E-commerce
• Space exploration
• Robotics
• Information extraction
• Social networks
• Debugging software
• [Your favorite area]
Slide credit: Pedro Domingos
Samuel’s Checkers-Player
“Machine Learning: Field of study that gives
computers the ability to learn without being
explicitly programmed.” - Arthur Samuel (1959)
Defining the Learning Task
Improve on task T, with respect to performance metric P, based on experience E
T: Playing checkers
P: Percentage of games won against an arbitrary opponent
E: Playing practice games against itself
Autonomous Cars
Autonomous Car Technology
[Figure: Stanley, annotated with components including path planning and laser terrain mapping]
Deep Belief Net on Face Images
[Figure: learned feature hierarchy, from pixels to edges to object parts (combinations of edges) to object models]
Based on materials by Andrew Ng
Learning of Object Parts
Slide credit: Andrew Ng
Training on Multiple Objects
Slide credit: Andrew Ng
Scene Labeling via Deep Learning
[Farabet et al., ICML 2012; PAMI 2013]
Inference from Deep Learned Models
Generating posterior samples from faces by “filling in” experiments (cf. Lee and Mumford, 2003). Combine bottom-up and top-down inference.
[Figure: input images; samples from feedforward inference (control); samples from full posterior inference]
Machine Learning in Automatic Speech Recognition
[Figure: a typical speech recognition system]
Types of Learning
[Figure: an example prediction task: a quantity measured in km-based units (axis label truncated) plotted against Year, 1970–2020. Data from G. Witt, Journal of Statistics Education, Volume 21, Number 1 (2013)]
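Fitting a function to numeric targets like the plotted quantity is regression, the real-valued counterpart of the classification setting on the next slides. A minimal sketch on synthetic data (assumed for illustration; not the plotted dataset):

```python
import numpy as np

# Minimal regression sketch on synthetic year/value data
# (assumed for illustration; not the dataset plotted above).
rng = np.random.default_rng(0)
years = np.arange(1970, 2021)
values = 7.0 - 0.08 * (years - 1970) + rng.normal(0, 0.3, years.size)

# Least-squares fit of a line: value ≈ m * year + c
m, c = np.polyfit(years, values, deg=1)
pred_2025 = m * 2025 + c  # extrapolate the fitted trend to a future year
print(f"slope = {m:.3f} per year, prediction for 2025: {pred_2025:.2f}")
```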
Supervised Learning: Classification
• Given (x1, y1), (x2, y2), ..., (xn, yn)
• Learn a function f(x) to predict y given x
– y is categorical == classification
[Figure: breast cancer example, y = 1 (malignant) or 0 (benign) plotted against tumor size; a learned threshold splits the axis into “predict benign” and “predict malignant” regions]
Based on example by Andrew Ng
Supervised Learning
• x can be multi-dimensional
– Each dimension corresponds to an attribute, e.g.:
- Clump Thickness
- Uniformity of Cell Size
- Uniformity of Cell Shape
- …
[Figure: breast cancer data plotted by Age vs. Tumor Size]
Based on example by Andrew Ng
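To make the setup concrete, here is a minimal classification sketch: logistic regression over two attributes, trained by gradient descent. The attribute names echo the slide’s example, but the data and labels are entirely synthetic assumptions:

```python
import numpy as np

# Minimal classification sketch: logistic regression on two attributes
# (tumor size, age). Synthetic data, assumed for illustration only.
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, (200, 2))                  # columns: tumor size, age
y = (X[:, 0] + 0.5 * X[:, 1] > 7).astype(float)   # made-up malignant(1)/benign(0) labels

w, b, lr = np.zeros(2), 0.0, 0.1
for _ in range(2000):
    p = 1.0 / (1.0 + np.exp(-(X @ w + b)))  # predicted P(y = 1 | x)
    grad = p - y                            # gradient of the log-loss
    w -= lr * (X.T @ grad) / len(X)
    b -= lr * grad.mean()

def f(x):
    """Learned classifier: maps an attribute vector to a categorical label."""
    return int(1.0 / (1.0 + np.exp(-(x @ w + b))) > 0.5)
```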
Unsupervised Learning
• Given x1, x2, ..., xn (without labels)
• Output hidden structure behind the x’s
– E.g., clustering
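A minimal sketch of one such clustering method, k-means (Lloyd’s algorithm), on made-up two-blob data; everything here is an assumed example:

```python
import numpy as np

# Minimal k-means clustering sketch (Lloyd's algorithm) on made-up data.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, (50, 2)),     # blob around (0, 0)
               rng.normal(5, 1, (50, 2))])    # blob around (5, 5)

k = 2
centers = X[rng.choice(len(X), size=k, replace=False)]  # random initial centers
for _ in range(20):
    # assign each point to its nearest center
    labels = ((X[:, None, :] - centers) ** 2).sum(axis=2).argmin(axis=1)
    # move each center to the mean of its assigned points
    centers = np.array([X[labels == j].mean(axis=0) if (labels == j).any()
                        else centers[j] for j in range(k)])
```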
Unsupervised Learning
Genomics application: group individuals by genetic similarity
[Figure: heat map of genes (rows) by individuals (columns)]
[Source: Daphne Koller]
Unsupervised Learning
Image credit: statsoft.com
Unsupervised Learning
• Independent component analysis – separate a combined signal into its original sources
Image credit: statsoft.com
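A minimal ICA sketch using scikit-learn’s FastICA: two synthetic sources are mixed by an assumed mixing matrix, then recovered from the combined signal (sources, noise level, and matrix are all illustrative assumptions):

```python
import numpy as np
from sklearn.decomposition import FastICA

# Minimal ICA sketch: mix two synthetic sources, then try to recover them.
rng = np.random.default_rng(0)
t = np.linspace(0, 8, 2000)
S = np.c_[np.sin(2 * t),                   # source 1: sinusoid
          np.sign(np.sin(3 * t))]          # source 2: square wave
S += 0.05 * rng.standard_normal(S.shape)   # small additive noise

A = np.array([[1.0, 0.5],                  # assumed mixing matrix
              [0.5, 1.0]])                 # (unknown in a real application)
X = S @ A.T                                # observed, combined signals

ica = FastICA(n_components=2, random_state=0)
S_est = ica.fit_transform(X)               # recovered sources, up to order/scale
```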
Reinforcement Learning
• Given a sequence of states and actions with (delayed) rewards, output a policy
– A policy is a mapping from states to actions that tells you what to do in a given state
• Examples:
– Credit assignment problem
– Game playing
– Robot in a maze
– Balance a pole on your hand
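A minimal tabular Q-learning sketch on a toy corridor environment (entirely an assumed example), showing how a policy, a mapping from states to actions, emerges from delayed rewards:

```python
import numpy as np

# Minimal tabular Q-learning sketch on a toy 1-D corridor (assumed example):
# states 0..4, actions 0 = left, 1 = right; reward +1 for reaching state 4.
n_states, n_actions = 5, 2
Q = np.zeros((n_states, n_actions))
alpha, gamma, eps = 0.5, 0.9, 0.2
rng = np.random.default_rng(0)

def step(s, a):
    s2 = max(0, s - 1) if a == 0 else min(n_states - 1, s + 1)
    r = 1.0 if s2 == n_states - 1 else 0.0
    return s2, r, s2 == n_states - 1      # next state, reward, episode done?

for _ in range(300):                       # episodes
    s = int(rng.integers(n_states - 1))    # random non-goal start state
    for _ in range(100):                   # cap episode length
        # epsilon-greedy: mostly exploit current Q, occasionally explore
        a = int(rng.integers(n_actions)) if rng.random() < eps else int(Q[s].argmax())
        s2, r, done = step(s, a)
        # credit assignment: back up the delayed reward through Q-values
        Q[s, a] += alpha * (r + gamma * Q[s2].max() - Q[s, a])
        s = s2
        if done:
            break

policy = Q.argmax(axis=1)                  # learned policy: states -> actions
```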
The Agent-Environment Interface
... s_t --a_t--> r_{t+1}, s_{t+1} --a_{t+1}--> r_{t+2}, s_{t+2} --a_{t+2}--> r_{t+3}, s_{t+3} --a_{t+3}--> ...
Slide credit: Sutton & Barto
Reinforcement Learning
https://www.youtube.com/watch?v=4cgWya-wjgY
Inverse Reinforcement Learning
• Learn a policy from user demonstrations
Framing a Learning Problem
Designing a Learning System
• Choose the training experience
• Choose exactly what is to be learned
– i.e. the target function
• Choose how to represent the target function
• Choose a learning algorithm to infer the
target function from the experience
[Diagram: Environment/Experience → Training data → Learner → Knowledge → Performance Element; the Performance Element is evaluated on Testing data]
Based on slide by Ray Mooney
Training vs. Test Distribution
• We generally assume that the training and test examples are independently drawn from the same overall distribution of data
– We call this “i.i.d.”, which stands for “independent and identically distributed”
Slide credit: Pedro Domingos
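The i.i.d. assumption is what justifies the usual practice of evaluating on a random held-out split. A minimal sketch on made-up data:

```python
import numpy as np

# Minimal sketch: a random train/test split, justified by the i.i.d. assumption
# (if examples are not i.i.d., e.g. time series, a random split can mislead).
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))             # made-up features
y = rng.integers(0, 2, size=100)          # made-up labels

idx = rng.permutation(len(X))             # shuffle, then split 80/20
cut = int(0.8 * len(X))
X_train, y_train = X[idx[:cut]], y[idx[:cut]]
X_test, y_test = X[idx[cut:]], y[idx[cut:]]
```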
Various Function Representations
• Numerical functions
– Linear regression
– Neural networks
– Support vector machines
• Symbolic functions
– Decision trees
– Rules in propositional logic
– Rules in first-order predicate logic
• Instance-based functions
– Nearest-neighbor
– Case-based
• Probabilistic Graphical Models
– Naïve Bayes
– Bayesian networks
– Hidden Markov Models (HMMs)
– Probabilistic Context-Free Grammars (PCFGs)
– Markov networks
Slide credit: Ray Mooney
Various Search/Optimization Algorithms
• Gradient descent
– Perceptron
– Backpropagation
• Dynamic Programming
– HMM Learning
– PCFG Learning
• Divide and Conquer
– Decision tree induction
– Rule learning
• Evolutionary Computation
– Genetic Algorithms (GAs)
– Genetic Programming (GP)
– Neuro-evolution
Slide credit: Ray Mooney
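As a concrete instance of the first family, here is a minimal gradient-descent sketch fitting a line by squared error (synthetic data, assumed for illustration):

```python
import numpy as np

# Minimal gradient-descent sketch: fit y ≈ w*x + b by minimizing squared error.
rng = np.random.default_rng(0)
x = rng.uniform(0, 10, 100)
y = 3.0 * x + 2.0 + rng.normal(0, 1.0, 100)   # synthetic data (true w=3, b=2)

w, b, lr = 0.0, 0.0, 0.01
for _ in range(2000):
    err = (w * x + b) - y                 # residuals of the current fit
    w -= lr * 2.0 * np.mean(err * x)      # d/dw of mean squared error
    b -= lr * 2.0 * np.mean(err)          # d/db of mean squared error
```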
Evaluation
• Accuracy
• Precision and recall
• Squared error
• Likelihood
• Posterior probability
• Cost / Utility
• Margin
• Entropy
• K-L divergence
• etc.
Slide credit: Pedro Domingos
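A minimal sketch computing a few of these metrics by hand for made-up binary predictions:

```python
import numpy as np

# Minimal sketch of a few evaluation metrics on made-up binary predictions.
y_true = np.array([1, 0, 1, 1, 0, 1, 0, 0])
y_pred = np.array([1, 0, 0, 1, 0, 1, 1, 0])

tp = np.sum((y_pred == 1) & (y_true == 1))   # true positives
fp = np.sum((y_pred == 1) & (y_true == 0))   # false positives
fn = np.sum((y_pred == 0) & (y_true == 1))   # false negatives

accuracy = np.mean(y_pred == y_true)
precision = tp / (tp + fp)     # of predicted positives, fraction truly positive
recall = tp / (tp + fn)        # of true positives, fraction recovered
squared_error = np.mean((y_pred - y_true) ** 2)
```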
ML in Practice
• Understand domain, prior knowledge, and goals
• Data integration, selection, cleaning, pre-processing, etc.
• Learn models
• Interpret results
(the three steps above form a loop, repeated as needed)
• Consolidate and deploy discovered knowledge
Based on a slide by Pedro Domingos
Lessons Learned about Learning
• Learning can be viewed as using direct or indirect experience to approximate a chosen target function.
Slide credit: Ray Mooney
History of Machine Learning (cont.)
• 1980s:
– Advanced decision tree and rule learning
– Explanation-based Learning (EBL)
– Learning and planning and problem solving
– Utility problem
– Analogy
– Cognitive architectures
– Resurgence of neural networks (connectionism, backpropagation)
– Valiant’s PAC Learning Theory
– Focus on experimental methodology
• 1990s
– Data mining
– Adaptive software agents and web applications
– Text learning
– Reinforcement learning (RL)
– Inductive Logic Programming (ILP)
– Ensembles: Bagging, Boosting, and Stacking
– Bayes Net learning
Slide credit: Ray Mooney
History of Machine Learning (cont.)
• 2000s
– Support vector machines & kernel methods
– Graphical models
– Statistical relational learning
– Transfer learning
– Sequence labeling
– Collective classification and structured outputs
– Computer Systems Applications (Compilers, Debugging, Graphics, Security)
– E-mail management
– Personalized assistants that learn
– Learning in robotics and vision
• 2010s
– Deep learning systems
– Learning for big data
– Bayesian methods
– Multi-task & lifelong learning
– Applications to vision, speech, social networks, learning to read, etc.
– ???
Based on slide by Ray Mooney
What We’ll Cover in this Course
• Supervised learning
– Decision tree induction
– Linear regression
– Logistic regression
– Support vector machines & kernel methods
– Model ensembles
– Bayesian learning
– Neural networks & deep learning
– Learning theory
• Unsupervised learning
– Clustering
– Dimensionality reduction
• Reinforcement learning
– Temporal difference learning
– Q learning
• Evaluation
• Applications
Our focus will be on applying machine learning to real applications.