
IAML: Support Vector Machines I

Nigel Goddard
School of Informatics

Semester 1

Outline

- Separating hyperplane with maximum margin
- Non-separable training data
- Expanding the input into a high-dimensional space
- Support vector regression
- Reading: W & F sec 6.3 (maximum margin hyperplane, nonlinear class boundaries), SVM handout. SV regression not examinable.
Overview

- Support vector machines are one of the most effective and widely used classification algorithms.
- SVMs are the combination of two ideas:
  - Maximum margin classification
  - The "kernel trick"
- SVMs are linear classifiers, like logistic regression and the perceptron.
Stuff You Need to Remember

w^T x is the length of the projection of x onto w (if w is a unit vector).

[Figure: a vector w and a point x, with b the length of the projection of x onto w, i.e., b = w^T x.]

(If you do not remember this, see the supplementary maths notes on the course Web site; a small numeric sketch is given below.)
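As a concrete illustration of the fact above, here is a minimal numpy sketch (the vectors are made-up examples, not from the slides):

```python
import numpy as np

# Made-up example: length of the projection of x onto a unit vector w.
w = np.array([3.0, 4.0])
w = w / np.linalg.norm(w)   # make w a unit vector
x = np.array([2.0, 1.0])

b = w @ x                   # b = w^T x = length of the projection of x onto w
print(b)                    # 2.0, i.e. (3*2 + 4*1) / 5
```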
Separating Hyperplane

For any linear classifier:
- Training instances (x_i, y_i), i = 1, ..., n, with y_i ∈ {−1, +1}
- Hyperplane w^T x + w_0 = 0 (the decision rule is sketched in code below)
- Notice that for this lecture we use −1 rather than 0 for the negative class. This will be convenient for the maths.

[Figure: two classes ("o" and "x") in the (x1, x2) plane, separated by a hyperplane with normal vector w.]
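For concreteness, a minimal sketch of the linear decision rule with the ±1 labelling used in this lecture (the weights below are arbitrary placeholders):

```python
import numpy as np

# Arbitrary placeholder hyperplane parameters.
w = np.array([1.0, -2.0])
w0 = 0.5

def predict(x):
    """Predict +1 if w^T x + w0 >= 0, else -1."""
    return 1 if w @ x + w0 >= 0 else -1

print(predict(np.array([3.0, 1.0])))   # +1  (3 - 2 + 0.5 = 1.5 >= 0)
print(predict(np.array([0.0, 2.0])))   # -1  (0 - 4 + 0.5 = -3.5 < 0)
```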
A Crap Decision Boundary

[Figure: two candidate separating hyperplanes (with normal vector w) for the same "o" vs "x" data in the (x1, x2) plane. Left panel: "Seems okay". Right panel: "This is crap".]
Idea: Maximize the Margin

The margin is the distance between the decision boundary (the hyperplane) and the closest training point.

[Figure: a separating hyperplane with normal vector w; the margin is the distance from the hyperplane to the closest training point.]
Computing the Margin

- The tricky part will be to get an equation for the margin.
- We'll start by getting the distance from the origin to the hyperplane.
- i.e., we want to compute the scalar b below.

[Figure: the hyperplane w^T x + w_0 = 0, its normal vector w, and the distance b from the origin to the hyperplane.]
Computing the Distance to Origin

- Define z as the point on the hyperplane closest to the origin.
- z must be proportional to w, because w is normal to the hyperplane.
- By definition of b, the norm of z is ||z|| = b.

So

    b (w / ||w||) = z

[Figure: the hyperplane w^T x + w_0 = 0, the normal vector w, and the closest point z on the hyperplane at distance b from the origin.]
Computing the Distance to Origin

- We know that (a) z is on the hyperplane, and (b) b (w / ||w||) = z.
- First, (a) means w^T z + w_0 = 0.
- Substituting (b) into (a) we get

    w^T (b w / ||w||) + w_0 = 0
    b (w^T w) / ||w|| + w_0 = 0
    b = − w_0 / ||w||

- Remember ||w|| = √(w^T w), so (w^T w) / ||w|| = ||w||.
- Now we have the distance from the origin to the hyperplane! (A numeric check follows below.)
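A quick numeric check of b = −w_0 / ||w||, using made-up values for w and w_0:

```python
import numpy as np

# Made-up hyperplane w^T x + w0 = 0.
w = np.array([3.0, 4.0])
w0 = -10.0

b = -w0 / np.linalg.norm(w)            # distance from the origin to the hyperplane
print(b)                               # 2.0

# The closest point z = b * w/||w|| does lie on the hyperplane.
z = b * w / np.linalg.norm(w)
print(np.isclose(w @ z + w0, 0.0))     # True
```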
Computing the Distance to Hyperplane

[Figure: a point x at distance c from the hyperplane; b is the distance from the origin to the hyperplane, and a is the length of the projection of x onto w.]

- Now we want c, the distance from x to the hyperplane.
- It's clear that c = |b − a|, where a is the length of the projection of x onto w. Quiz: What is a?
Computing the Distance to Hyperplane

[Figure: as before, the point x, its projection length a onto w, the origin distance b, and the distance c to the hyperplane.]

- Now we want c, the distance from x to the hyperplane.
- It's clear that c = |b − a|, where a is the length of the projection of x onto w. Quiz: What is a?

    a = w^T x / ||w||

(A numeric check of c follows below.)
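A small check (with made-up numbers) that c = |b − a| agrees with the single-formula distance given on the next slide, |w^T x + w_0| / ||w||:

```python
import numpy as np

# Made-up hyperplane and point.
w = np.array([3.0, 4.0])
w0 = -10.0
x = np.array([4.0, 7.0])

norm_w = np.linalg.norm(w)
b = -w0 / norm_w              # distance from the origin to the hyperplane
a = (w @ x) / norm_w          # length of the projection of x onto w

print(abs(b - a))                    # 6.0, the distance from x to the hyperplane
print(abs(w @ x + w0) / norm_w)      # 6.0, the same distance in one formula
```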
Equation for the Margin

- The perpendicular distance from a point x to the hyperplane w^T x + w_0 = 0 is

    (1 / ||w||) |w^T x + w_0|

- The margin is the distance from the closest training point to the hyperplane (implemented in the sketch below):

    min_i (1 / ||w||) |w^T x_i + w_0|
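A direct implementation of the margin formula on a made-up toy training set:

```python
import numpy as np

# Made-up hyperplane and toy training points.
w = np.array([1.0, 1.0])
w0 = -3.0
X = np.array([[1.0, 1.0],
              [4.0, 4.0],
              [0.0, 2.0],
              [5.0, 1.0]])

# Perpendicular distance of each point to the hyperplane w^T x + w0 = 0.
distances = np.abs(X @ w + w0) / np.linalg.norm(w)

print(distances)          # [0.707..., 3.535..., 0.707..., 2.121...]
print(distances.min())    # margin = 0.707..., set by the closest point(s)
```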
The Scaling

- Note that (w, w_0) and (c w, c w_0) define the same hyperplane. The scale is arbitrary.
- This is because we predict class y = 1 if w^T x + w_0 ≥ 0. For any c > 0, that's the same thing as saying c w^T x + c w_0 ≥ 0.
- To remove this freedom, we will put a constraint on (w, w_0):

    min_i |w^T x_i + w_0| = 1

- With this constraint, the margin is always 1 / ||w||. (Both facts are checked numerically below.)
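A quick numeric check of both claims (the scale freedom and the canonical rescaling), again with made-up numbers:

```python
import numpy as np

# Made-up hyperplane and toy training points.
w = np.array([1.0, 1.0])
w0 = -4.0
X = np.array([[1.0, 1.0],
              [4.0, 4.0],
              [0.0, 2.0]])

# (w, w0) and (c*w, c*w0) give identical predictions for any c > 0.
c = 7.0
same = np.array_equal(np.sign(X @ w + w0), np.sign(X @ (c * w) + c * w0))
print(same)                                   # True

# Rescale so that min_i |w^T x_i + w0| = 1; then the margin is 1/||w||.
s = np.min(np.abs(X @ w + w0))
w_hat, w0_hat = w / s, w0 / s
print(np.min(np.abs(X @ w_hat + w0_hat)))     # 1.0
print(1.0 / np.linalg.norm(w_hat))            # 1.414..., the margin
```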
First version of Max Margin Optimization Problem

- Here is a first version of an optimization problem to maximize the margin (we will simplify it):

    max_w  1 / ||w||
    subject to  w^T x_i + w_0 ≥ 0    for all i with y_i = 1
                w^T x_i + w_0 ≤ 0    for all i with y_i = −1
                min_i |w^T x_i + w_0| = 1

- The first two constraints are too loose. It's the same thing to say

    max_w  1 / ||w||
    subject to  w^T x_i + w_0 ≥ 1    for all i with y_i = 1
                w^T x_i + w_0 ≤ −1   for all i with y_i = −1
                min_i |w^T x_i + w_0| = 1

- Now the third constraint is redundant.
First version of Max Margin Optimization Problem

- That means we can simplify to

    max_w  1 / ||w||
    subject to  w^T x_i + w_0 ≥ 1    for all i with y_i = 1
                w^T x_i + w_0 ≤ −1   for all i with y_i = −1

- Here's a compact way to write those two constraints:

    max_w  1 / ||w||
    subject to  y_i (w^T x_i + w_0) ≥ 1   for all i

- Finally, note that maximizing 1 / ||w|| is the same thing as minimizing ||w||^2.
The SVM optimization problem

- So the SVM weights are determined by solving the optimization problem:

    min_w  ||w||^2
    s.t.   y_i (w^T x_i + w_0) ≥ +1   for all i

- Solving this will require maths that we don't have in this course. But I'll show the form of the solution next time. (A scikit-learn sketch that approximately solves it is shown below.)
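As a sketch only (not part of the lecture), scikit-learn's SVC with a linear kernel and a very large C behaves essentially like the hard-margin problem above; the toy data below are made up and linearly separable:

```python
import numpy as np
from sklearn.svm import SVC

# Made-up, linearly separable toy data.
X = np.array([[1.0, 1.0], [2.0, 0.5], [0.5, 2.0],     # class -1
              [4.0, 4.0], [5.0, 3.5], [3.5, 5.0]])    # class +1
y = np.array([-1, -1, -1, 1, 1, 1])

# A very large C makes the soft-margin SVM behave like the hard-margin
# problem: min ||w||^2  s.t.  y_i (w^T x_i + w_0) >= 1.
clf = SVC(kernel="linear", C=1e6).fit(X, y)

w = clf.coef_[0]           # learned weight vector w
w0 = clf.intercept_[0]     # learned bias w_0
print(np.all(y * (X @ w + w0) >= 1 - 1e-6))   # constraints hold (to tolerance)
print(1.0 / np.linalg.norm(w))                # the resulting margin, 1/||w||
```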
Fin (Part I)

