
Machine Learning: Overview

Jihoon Yang

Machine Learning Research Laboratory


Department of Computer Science & Engineering
Sogang University



Machine Learning

Algorithms (computation, information processing) provide for the study of cognition and life what calculus provided for physics

We have a theory of intelligent behavior when we have precise information processing models (computer programs) that produce such behavior

We will have a theory of learning when we have precise information processing models of learning (computer programs that learn from experience)



Why should machines learn?
Intelligent behavior requires knowledge
Explicitly specifying the knowledge needed for specific tasks is hard, and often infeasible
Some tasks are best specified by examples (e.g. medical diagnosis, credit risk assessment)
Buried in large volumes of data are useful predictive relationships (data mining)
Machine learning is most useful when
  The structure of the task is not well understood but a representative dataset is available
  The task (or its parameters) changes dynamically
If we can program computers to learn from experience, we can
  Dramatically enhance the usability of software (e.g. personalized information assistants)
  Dramatically reduce the cost of software development (e.g. for medical diagnosis)
  Automate data-driven discovery (e.g. bioinformatics, social informatics)
ML Applications

Medical diagnosis/image analysis (e.g. pneumonia)
Spam filtering, fraud detection (e.g. credit cards, phone calls)
Search and recommendation (e.g. Google, Amazon)
Automatic speech recognition & speaker verification
Locating/tracking/identifying objects in images & videos (e.g. faces)
Printed and handwritten text parsing
Driving computer players in games
Computational molecular biology (e.g. gene expression analysis)
Autonomous driving
...



ML in Context

(figure)


What is ML?

A program M is said to learn from experience E with respect to some class of tasks T and performance measure P if its performance on tasks in T, as measured by P, improves with experience E in an environment Z (a code sketch of these ingredients follows the examples)

Examples
1 T : cancer diagnosis
E : a set of diagnosed cases
P: accuracy of diagnosis on new cases
Z : noisy measurements, occasionally misdiagnosed training cases
M: a program that runs on a general purpose computer

2 T : annotating protein sequences with function labels
E : a data set of annotated protein sequences
P: score on a test set not seen during training (e.g. accuracy of annotations)

3 T : driving on the interstate
E : a sequence of sensor measurements and driving actions recorded while observing an expert driver
P: mean distance traveled before an error, as judged by a human expert
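
These ingredients can be written down directly. A minimal sketch in Python (all names are illustrative, not from the slides), bundling T, E, and P as a plain record:

from dataclasses import dataclass
from typing import Any, Callable, List, Sequence, Tuple

@dataclass
class LearningProblem:
    task: str                          # T, e.g. "cancer diagnosis"
    experience: List[Tuple[Any, Any]]  # E, a set of labeled cases
    performance: Callable[[Sequence, Sequence], float]  # P, scores predictions

problem = LearningProblem(
    task="cancer diagnosis",
    experience=[("case 1", "benign"), ("case 2", "malignant")],
    performance=lambda preds, truth: sum(p == t for p, t in zip(preds, truth)) / len(truth),
)
print(problem.performance(["benign", "benign"], ["benign", "malignant"]))  # 0.5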



Canonical learning problems
Supervised learning: given examples of inputs and corresponding desired outputs, predict outputs on future inputs (see the sketch after this list)
  Classification/Regression
  Time series prediction
  To address the labor-intensive labeling issue:
    Semi-SL: combines a small amount of labeled data with a large amount of unlabeled data during training via pseudo-labeling
    Self-SL or Unsupervised pre-training: labels are created by the algorithm, rather than provided externally by a human; solves “pretext” tasks that produce good features for downstream tasks
Unsupervised learning: given only inputs, automatically discover representations, features, structures, etc.
  Clustering/Outlier detection
  Compression
Reinforcement learning: given sequences of inputs, actions from a fixed set, and scalar rewards/punishments, learn to select actions in a way that maximizes expected reward
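A minimal sketch of the supervised setting (assuming scikit-learn is available; the dataset and model choice are arbitrary):

from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Synthetic labeled examples: inputs X with desired outputs y
X, y = make_classification(n_samples=200, n_features=5, random_state=0)

# Hold out "future inputs" that the learner never sees during training
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = LogisticRegression().fit(X_train, y_train)  # learn from the examples
print("accuracy on unseen inputs:", model.score(X_test, y_test))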
Machine Learning

Learning involves synthesis or adaptation of computational structures:
Classifiers
Functions
Logic Programs
Rules
Grammars
Probability distributions
Action policies

ML = Inference + Data Structures + Algorithms



Learning input-output functions

Target function f : unknown to the learner; f ∈ F
Learner’s hypothesis about what f might be: h ∈ H, the hypothesis space
Instance space X : domain of f, h
Output space Y : range of f, h
Example: an ordered pair (x, y) where x ∈ X and f(x) = y ∈ Y
F and H may or may not be the same!
Training set E : a multiset of examples
Learning algorithm L: a procedure which, given some E, outputs an h ∈ H (see the sketch below)
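
A minimal sketch of these objects in Python (the types and the deliberately naive learner are my own, not from the slides):

from typing import Callable, Iterable, Tuple

Instance = float                             # x ∈ X
Output = float                               # y ∈ Y
Example = Tuple[Instance, Output]            # (x, y) with f(x) = y
Hypothesis = Callable[[Instance], Output]    # h ∈ H

def L(E: Iterable[Example]) -> Hypothesis:
    """A naive learner: fit y = w*x through the origin by least squares."""
    pairs = list(E)
    num = sum(x * y for x, y in pairs)
    den = sum(x * x for x, _ in pairs) or 1.0
    w = num / den
    return lambda x: w * x

h = L([(1.0, 2.0), (2.0, 4.1), (3.0, 5.9)])  # training set E
print(h(4.0))  # prediction on a new instance, approximately 8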


Training regime
  Batch
  Online
  Distributed
    Vertical fragmentation
    Horizontal fragmentation

Noise
  Attribute noise
  Classification noise
  Both



Inductive learning

Premise: a hypothesis (e.g. a classifier) that is consistent with a sufficiently large number of representative training examples is likely to accurately classify novel instances drawn from the same universe

We can prove this is an optimal approach (under appropriate assumptions)

When the number of examples is limited, the learner needs to be smarter (e.g. find a concise hypothesis that is consistent with the data)



Measuring classifier performance

(figure)

N: total number of instances in the data set
TP(c): True Positives for class c, FP(c): False Positives for class c
TN(c): True Negatives for class c, FN(c): False Negatives for class c
TP: True Positives over all classes

Accuracy = TP / N
Precision(c) = TP(c) / (TP(c) + FP(c))
Recall/Sensitivity(c) = TP(c) / (TP(c) + FN(c))
Specificity(c) = TN(c) / (TN(c) + FP(c))
False Alarm(c) = FP(c) / (TP(c) + FP(c)) = 1 − Precision(c)
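
A minimal sketch of these per-class metrics in Python (the function and the example counts are my own):

def metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Per-class metrics from raw true/false positive/negative counts."""
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "precision": tp / (tp + fp),
        "recall": tp / (tp + fn),            # a.k.a. sensitivity
        "specificity": tn / (tn + fp),
        "false_alarm": fp / (tp + fp),       # = 1 - precision
    }

# e.g. 40 true positives, 10 false positives, 45 true negatives, 5 false negatives
print(metrics(tp=40, fp=10, tn=45, fn=5))
# accuracy 0.85, precision 0.8, recall ~0.889, specificity ~0.818, false alarm 0.2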



Inductive bias

Consider a concept learning algorithm L for the set of instances X. Let c be an arbitrary concept defined over X, and let Dc = {⟨x, c(x)⟩} be an arbitrary set of training examples of c. Let L(xi, Dc) denote the classification assigned to the instance xi by L after training on the data Dc.

The inductive bias of L is any minimal set of assertions B such that for any target concept c and corresponding training examples Dc

(∀xi ∈ X)[(B ∧ Dc ∧ xi) ⊢ L(xi, Dc)]

In other words, it is the set of assumptions that, together with the training data, deductively justifies the classifications assigned by the learner to future instances



Function learning and bias

(figure)


Learning and bias

Suppose H = the set of all n-input Boolean functions, and suppose the learner is unbiased. Then

|H| = 2^(2^n)

since there are 2^n possible input vectors and each can independently be assigned output 0 or 1

HV = version space: the subset of H not yet ruled out by the learner
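
A minimal sketch of the version space (my own construction): enumerate all 2^(2^n) Boolean functions for n = 2 and keep only those consistent with some observed examples.

from itertools import product

n = 2
inputs = list(product([0, 1], repeat=n))   # 2^n = 4 possible input vectors

# One hypothesis per assignment of an output bit to every input: 2^(2^n) = 16
H = [dict(zip(inputs, bits)) for bits in product([0, 1], repeat=len(inputs))]
print(len(H))   # 16

examples = [((0, 0), 0), ((1, 1), 1)]      # training examples seen so far
HV = [h for h in H if all(h[x] == y for x, y in examples)]
print(len(HV))  # 4 hypotheses remain consistent: the version space shrinks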


Weaker bias
→ more open to experience, flexible
→ more expressive hypothesis representation

Occam’s razor
Simple hypotheses are preferred: a linear fit is preferred to a quadratic fit when both fit the training examples about equally well

Learning in practice requires a trade-off between the complexity of the hypothesis and the goodness of fit; how this trade-off is made affects the learner’s ability to generalize
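
A minimal sketch of the trade-off (assuming NumPy; the data and degrees are arbitrary): on noisy linear data, a degree-9 polynomial typically fits the training set better yet generalizes worse than a linear fit.

import numpy as np

rng = np.random.default_rng(0)

def sample(m):
    """Draw m points from a linear source with additive noise."""
    x = rng.uniform(-1, 1, m)
    return x, 2 * x + rng.normal(0, 0.2, m)

x_tr, y_tr = sample(15)    # small training set
x_te, y_te = sample(200)   # fresh data from the same source

for degree in (1, 9):
    coeffs = np.polyfit(x_tr, y_tr, degree)
    train_mse = np.mean((np.polyval(coeffs, x_tr) - y_tr) ** 2)
    test_mse = np.mean((np.polyval(coeffs, x_te) - y_te) ** 2)
    print(degree, "train MSE:", round(train_mse, 4), "test MSE:", round(test_mse, 4))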
