Session 19 - SVM

1) The document reviews optimization techniques, such as gradient descent and genetic algorithms, for minimizing error functions, and then introduces support vector machines (SVMs) for classification. 2) SVMs find the separating hyperplane that maximizes the margin between two classes of data points; the support vectors are the data points closest to the hyperplane. 3) The SVM optimization problem is to maximize the width of the margin (equivalently, minimize the norm of the weight vector) while ensuring the data points are classified correctly. Kernels are introduced to map the data to higher dimensions, allowing nonlinear decision boundaries.


Machine Learning (19CSE305)

Error Surface, Parameter Optimization & SVM

Dr. Peeta Basa Pati


Ms. Priyanka V
Department of Computer Science & Engineering,
Amrita School of Engineering, Bengaluru
Topics
• Recap of Optimization
• Support Vectors

Functions, Derivatives & Convexity

[Figure: plots illustrating functions, their derivatives, and convexity. Source: Internet]
Local & Global – Minimum, Maximum; Saddle point
• Brute force search
• Gradient descent search
• Genetic algorithm based approaches
• Evolutionary computation techniques
• Tabu search
• Simulated annealing
• Hill climbing techniques
• Ant colony optimization
• Particle swarm optimization
• Random forest optimization

Source: Engineering Optimization, S. S. Rao
Derivation of Gradient Descent Algorithm

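The derivation on this slide appears in the original deck as an image that did not survive text extraction. Below is a reconstruction of the standard least-squares derivation, consistent with the notes on the next slide (error summed over all inputs, linear activation); the symbols $\eta$ (learning rate), $t_d$ (target for input $d$), and $o_d = \bar{w} \cdot \bar{x}_d$ (output) follow the usual convention and are my assumption, not read off the slide:

$$E(\bar{w}) = \frac{1}{2} \sum_{d \in D} (t_d - o_d)^2$$

$$\frac{\partial E}{\partial w_i} = \frac{1}{2} \sum_{d \in D} 2\,(t_d - o_d)\,\frac{\partial}{\partial w_i}\big(t_d - \bar{w} \cdot \bar{x}_d\big) = \sum_{d \in D} (t_d - o_d)\,(-x_{i,d})$$

$$\Delta w_i = -\eta\,\frac{\partial E}{\partial w_i} = \eta \sum_{d \in D} (t_d - o_d)\, x_{i,d}$$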
Notes on Gradient Descent Algorithm
• Error is summed over all inputs, and then the weights are updated
• A linear (pass-through) activation function is used → f(x) = x
• Assumes a convex error space
• If there are local minima, the search may get stuck and never come out
• Since the error is summed over all inputs, convergence may be slow
• Incremental or stochastic gradient descent is a variation of the algorithm (see the sketch below)
  ✓ Weights are updated with each input
  ✓ This sometimes helps in overcoming local minima
• The same principle can be applied with other activation functions as well; however, a mathematical proof of convergence may be difficult.
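To make the batch vs. stochastic distinction above concrete, here is a minimal NumPy sketch for a linear unit with squared error; the function names, learning rate lr, and epoch count n_epochs are illustrative choices, not taken from the slide:

import numpy as np

def batch_gradient_descent(X, t, lr=0.01, n_epochs=100):
    # X: (n_samples, n_features), t: targets. Linear activation: o = X @ w
    w = np.zeros(X.shape[1])
    for _ in range(n_epochs):
        o = X @ w
        grad = -(t - o) @ X          # error summed over ALL inputs
        w -= lr * grad               # one weight update per epoch
    return w

def stochastic_gradient_descent(X, t, lr=0.01, n_epochs=100):
    w = np.zeros(X.shape[1])
    for _ in range(n_epochs):
        for x_d, t_d in zip(X, t):   # weight update with EACH input
            o_d = x_d @ w
            w += lr * (t_d - o_d) * x_d
    return w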

Support Vector Machines

Classes & Boundaries

Linear binary classifier

[Figure: two classes of points in the $(x_1, x_2)$ plane, with $\bar{x} = (x_1, x_2)$, separated by the line $\bar{w} \cdot \bar{x} + b = 0$; the weight vector $\bar{w}$ is normal to the line. A block diagram shows the input $\bar{x}$ passing through $f$ to produce the label $y'$.]

$$y' = f(\bar{x}; \bar{w}, b) = \mathrm{sign}(\bar{w} \cdot \bar{x} + b)$$

Points with $\bar{w} \cdot \bar{x} + b > 0$ denote the +ve class; points with $\bar{w} \cdot \bar{x} + b < 0$ denote the -ve class.
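This decision rule is a single line of NumPy; a sketch with illustrative names:

import numpy as np

def predict(X, w, b):
    # y' = sign(w . x + b), applied to each row x of X
    return np.sign(X @ w + b)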
Support Vectors, Gutters, Separating Hyperplane & Margin

[Figure: two classes of points in the $(x_1, x_2)$ plane. The separating hyperplane runs midway between two parallel lines, the gutters, which pass through the points of each class closest to the boundary (A and B from the +ve class, C from the -ve class). These closest points are the support vectors, and the perpendicular distance between the gutters is the margin.]
Linear binary classifier

$$y' = f(\bar{x}; \bar{w}, b) = \mathrm{sign}(\bar{w} \cdot \bar{x} + b)$$

For the support vectors lying on the gutters:

$$\bar{w} \cdot \bar{x}_A + b = +1 \qquad \bar{w} \cdot \bar{x}_B + b = +1 \qquad \bar{w} \cdot \bar{x}_C + b = -1$$

Generalization to all training points:

$$\bar{w} \cdot \bar{x}_{+ve} + b \ge +1 \qquad\qquad \bar{w} \cdot \bar{x}_{-ve} + b \le -1$$
Linear binary classifier

With the gutter equations $\bar{w} \cdot \bar{x}_A + b = +1$, $\bar{w} \cdot \bar{x}_B + b = +1$, and $\bar{w} \cdot \bar{x}_C + b = -1$, projecting the gap between the gutters onto the unit normal $\bar{w}/|\bar{w}|$ gives the margin width $M$:

$$M\,|\bar{w}| = (\bar{w} \cdot \bar{x}_A + b) - (\bar{w} \cdot \bar{x}_C + b) = (+1) - (-1) = 2$$

$$M = \frac{2}{|\bar{w}|}$$
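As a quick numeric check (the numbers are illustrative, not from the slide): if training produced $\bar{w} = (3, 4)$, then $|\bar{w}| = \sqrt{3^2 + 4^2} = 5$, so the margin is $M = 2/5 = 0.4$.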
Optimization for SVM

Constraints from the gutters, and the margin from the previous slide:

$$\bar{w} \cdot \bar{x}_{+ve} + b \ge +1 \qquad \bar{w} \cdot \bar{x}_{-ve} + b \le -1 \qquad M = \frac{2}{|\bar{w}|}$$

Multiply each inequality by $y_i$, where $y_i = +1$ for all +ve class vectors and $y_i = -1$ for all -ve class vectors. Both inequalities collapse into one:

$$(\bar{w} \cdot \bar{x}_i + b)\, y_i \ge +1$$

We want to maximize the margin M:
⇒ Minimize $|\bar{w}|$
⇒ Minimize $\bar{w}^T \bar{w}$

Formulate the optimization problem & constraints:

$$\text{minimize } \varphi(\bar{w}) = \tfrac{1}{2}\,\bar{w}^T \bar{w} \quad \text{subject to } (\bar{w} \cdot \bar{x}_i + b)\, y_i \ge +1 \text{ for all } i$$

Its solution yields the classifier

$$y' = f(\bar{x}) = \mathrm{sign}\Big(\sum_i \alpha_i\, y_i\, \bar{x}_i^T \bar{x} + b\Big)$$
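The primal problem above can be handed to an off-the-shelf convex solver. A minimal sketch using cvxpy (my choice of library, not named in the deck), assuming linearly separable data so the hard-margin constraints are feasible:

import cvxpy as cp
import numpy as np

def fit_hard_margin_svm(X, y):
    # X: (n_samples, n_features), y: labels in {-1, +1}
    n, d = X.shape
    w = cp.Variable(d)
    b = cp.Variable()
    objective = cp.Minimize(0.5 * cp.sum_squares(w))   # ½ wᵀw
    constraints = [cp.multiply(y, X @ w + b) >= 1]     # (w·xᵢ + b) yᵢ ≥ 1
    cp.Problem(objective, constraints).solve()         # fails if data not separable
    return w.value, b.value

In practice one solves the dual instead (next slide), since that is where the $\alpha_i$ and the kernel trick come from.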
Optimization for SVM

Solving the constrained problem (via Lagrange multipliers, one $\alpha_i$ per constraint) yields the condition

$$\sum_i \alpha_i\, y_i = 0$$

The $y_i$ are all scalars; hence, the $\alpha_i$ are also scalars. The classifier is then expressed in terms of the training vectors:

$$y' = f(\bar{x}) = \mathrm{sign}\Big(\sum_i \alpha_i\, y_i\, \bar{x}_i^T \bar{x} + b\Big)$$
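For completeness — the dual problem that produces these $\alpha_i$ is standard in the SVM literature, though it is not written out on the slide:

$$\max_{\alpha}\; \sum_i \alpha_i - \frac{1}{2} \sum_i \sum_j \alpha_i\, \alpha_j\, y_i\, y_j\, \bar{x}_i^T \bar{x}_j \quad \text{subject to } \alpha_i \ge 0,\; \sum_i \alpha_i\, y_i = 0$$

Only the support vectors end up with $\alpha_i > 0$, which is why the sum in $f(\bar{x})$ effectively runs over the support vectors alone.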
SVM

$$y' = f(\bar{x}) = \mathrm{sign}\Big(\sum_i \alpha_i\, y_i\, \bar{x}_i^T \bar{x} + b\Big)$$

The training vectors enter only through dot products, which can be replaced by a kernel function. A kernel function is some function which maintains the sanctity of the dot product in some expanded space.

Kernel function: $K(\bar{x}_i, \bar{x}_j) = \bar{x}_i^T\, \bar{x}_j$

Kernel function generalized as: $K(\bar{x}_i, \bar{x}_j) = \varphi(\bar{x}_i)^T\, \varphi(\bar{x}_j)$

By mapping the vectors to some expanded space, the kernel function introduces linear separability which is absent in the original space.
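A small sketch making the identity $K(\bar{x}_i, \bar{x}_j) = \varphi(\bar{x}_i)^T \varphi(\bar{x}_j)$ concrete for the quadratic kernel $K(\bar{x}, \bar{z}) = (\bar{x}^T \bar{z})^2$ in two dimensions — the specific kernel and feature map are my illustrative choices; the slide states the property in general:

import numpy as np

def phi(x):
    # Explicit feature map for the 2-D quadratic kernel:
    # phi(x) = (x1^2, sqrt(2)*x1*x2, x2^2)
    return np.array([x[0]**2, np.sqrt(2) * x[0] * x[1], x[1]**2])

def quad_kernel(x, z):
    # K(x, z) = (x . z)^2, computed WITHOUT visiting the expanded space
    return np.dot(x, z) ** 2

x, z = np.array([1.0, 2.0]), np.array([3.0, 0.5])
print(quad_kernel(x, z))           # 16.0
print(np.dot(phi(x), phi(z)))      # 16.0 -- same dot product, expanded space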
Multiclass Scenario
[Figure: points from five classes (the produce example of the next slides: tomatoes, brinjal, kiwi, green chilies, apples) scattered in the $(x_1, x_2)$ plane.]
One against another approach
[Figure: three of the pairwise two-class splits of the five-class data, each shown in its own $(x_1, x_2)$ plot.]

• Consider each pair of classes separately, ignoring the remaining classes:
  ✓ Tomato against brinjal
  ✓ Kiwi against green chilies
  ✓ Apples against tomatoes
• N * (N-1) / 2 SVM models are generated (10 models in this five-class example)
• The test pattern is run through each model
• The class value is assigned based on the maximum absolute score & sign (+/-ve) across all models
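scikit-learn's SVC implements exactly this pairwise scheme internally; a sketch on hypothetical five-class toy data (the library and dataset are my choices, not from the deck):

from sklearn.datasets import make_blobs
from sklearn.svm import SVC

# Hypothetical 5-class toy data standing in for the produce example
X, y = make_blobs(n_samples=200, centers=5, random_state=0)

clf = SVC(kernel='linear', decision_function_shape='ovo').fit(X, y)

scores = clf.decision_function(X[:1])
print(scores.shape)        # (1, 10): one score per pairwise model, 5*4/2 = 10
print(clf.predict(X[:1]))  # class chosen by aggregating the pairwise results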
One vs all others approach
[Figure: one class highlighted against the remaining classes in the $(x_1, x_2)$ plane.]

• Consider each class separately against all other classes combined:
  ✓ Tomato against all others
  ✓ Kiwi against all others
  ✓ Apples against all others
• As many SVM models are generated as there are classes (5 models in this example)
• The test pattern is run through each model:
  ✓ Score of being an apple
  ✓ Score of being a tomato
• The class value is assigned based on the maximum membership score
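The same scheme via scikit-learn's OneVsRestClassifier wrapper (again my choice of library; the data is the same hypothetical toy set):

from sklearn.datasets import make_blobs
from sklearn.multiclass import OneVsRestClassifier
from sklearn.svm import SVC

X, y = make_blobs(n_samples=200, centers=5, random_state=0)

# One binary SVM per class: 5 models for 5 classes
clf = OneVsRestClassifier(SVC(kernel='linear')).fit(X, y)

print(len(clf.estimators_))           # 5
print(clf.decision_function(X[:1]))   # one membership score per class
print(clf.predict(X[:1]))             # class with the maximum score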
Python Code

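The code listing on this slide did not survive text extraction. A minimal end-to-end sketch of the kind of example such a slide typically carries — scikit-learn, an RBF-kernel SVC, and a synthetic dataset are all my assumptions, not the original listing:

import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score

# Synthetic binary data (stand-in for whatever dataset was used in class)
X, y = make_classification(n_samples=300, n_features=2, n_redundant=0,
                           n_informative=2, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42)

# RBF kernel maps the data to an expanded space (cf. the kernel slide)
clf = SVC(kernel='rbf', C=1.0, gamma='scale')
clf.fit(X_train, y_train)

print("Support vectors:", clf.support_vectors_.shape[0])
print("Test accuracy:  ", accuracy_score(y_test, clf.predict(X_test)))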
Thank you !!!!!

