SVM Set-2

Support Vector Machines (SVM) is a supervised learning algorithm used for classification and regression, aiming to find a hyperplane that maximizes the margin between classes. The document explains key concepts such as Hinge Loss, the dual formulation of SVM, the kernel trick, and different kernel types including Polynomial and RBF kernels. It also discusses the implications of the hyperparameter C on the model's bias-variance trade-off and the choice between primal and dual forms based on dataset size.


1) Can you explain SVM?

2) What is the geometric intuition behind SVM?


3) What is Hinge Loss?
4) Explain the Dual form of SVM formulation?
5) What’s the “kernel trick” and how is it useful?
6) What is a Polynomial kernel?
7) What is RBF-Kernel?
8) Should you use the primal or the dual form of the SVM problem to train a model
on a training set with millions of instances and hundreds of features?
9) Explain SVM Regression (SVR).
10) What is the role of C in SVM? How does it affect the bias/variance trade-off?

Solutions:

1) Explanation: The Support Vector Machine (SVM) is a supervised machine learning algorithm that works on both classification and regression problems. It classifies data by finding a hyperplane that maximizes the margin between the classes in the training data. Hence, SVM is an example of a large-margin classifier.

The basic idea of support vector machines:


● Find the optimal hyperplane for linearly separable patterns
● Extend to patterns that are not linearly separable by transforming the original data to map it into a new space (i.e., the kernel trick)

2) Explanation: Suppose you are asked to separate two classes. There can be multiple hyperplanes that separate them.

SVM chooses the hyperplane that separates the data points as widely as possible. It draws one hyperplane parallel to the separating hyperplane, passing through the closest point of class A (such closest points are called support vectors), and another parallel hyperplane passing through the closest point of class B. SVM tries to maximize the margin between these parallel hyperplanes. This margin maximization is what improves the model's accuracy on unseen data.
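
As an illustration (a minimal sketch assuming scikit-learn is available; the toy dataset and parameter values are not part of the original answer), a linear SVM can be fit to two well-separated blobs and its support vectors, the points lying on the margin hyperplanes, inspected directly:

# Minimal sketch: fit a linear SVM and inspect its support vectors.
from sklearn.datasets import make_blobs
from sklearn.svm import SVC

# Two well-separated classes in 2-D.
X, y = make_blobs(n_samples=100, centers=2, random_state=0)

# A large C keeps the margin essentially "hard" on this separable toy data.
clf = SVC(kernel="linear", C=1000.0)
clf.fit(X, y)

# Support vectors are the training points that lie on the margin hyperplanes.
print("Support vectors per class:", clf.n_support_)
print("Support vectors:\n", clf.support_vectors_)
# The separating hyperplane is w^T x + b = 0.
print("w =", clf.coef_[0], "b =", clf.intercept_[0])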

3) Explanation: Hinge loss is the loss function that penalises the SVM model for inaccurate predictions.
If y_i(w^T x_i + b) ≥ 1, the hinge loss is 0, i.e. the point is correctly classified with at least unit margin.

When y_i(w^T x_i + b) < 1, the hinge loss becomes positive.

As y_i(w^T x_i + b) decreases for a misclassified point, the loss 1 - y_i(w^T x_i + b) grows linearly, so points that lie farther on the wrong side of the decision margin receive a greater loss value and are penalized more heavily.

We can formulate the hinge loss as max[0, 1 - y_i(w^T x_i + b)].
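
For concreteness, a small NumPy sketch (not part of the original answer) can evaluate this hinge loss for a given weight vector and bias:

import numpy as np

def hinge_loss(w, b, X, y):
    """Average hinge loss max(0, 1 - y_i (w^T x_i + b)); labels y must be in {-1, +1}."""
    margins = y * (X @ w + b)
    return np.mean(np.maximum(0.0, 1.0 - margins))

# One point correctly classified with margin >= 1 (loss 0) and one
# misclassified point (loss 2), so the average hinge loss is 1.0.
X = np.array([[2.0, 0.0], [-1.0, 0.0]])
y = np.array([1, 1])
w = np.array([1.0, 0.0])
print(hinge_loss(w, 0.0, X, y))  # 1.0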

4) The aim of the soft-margin formulation is to minimize

(1/2)||w||² + C Σ_i ξ_i

subject to

y_i(w^T x_i + b) ≥ 1 - ξ_i and ξ_i ≥ 0 for all i.

This is also known as the primal form of SVM.
Duality theory provides a convenient way to deal with the constraints. The dual optimization problem can be written entirely in terms of dot products between training points, thereby making it possible to use kernel functions.
In other words, it is possible to express a different but closely related problem, called the dual problem. The solution of the dual problem typically gives a lower bound on the solution of the primal problem, but under some conditions it has exactly the same solution as the primal problem. Luckily, the SVM problem meets these conditions, so you can choose to solve either the primal problem or the dual problem; both will have the same solution.
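
As a hedged illustration (assuming scikit-learn; the original answer does not prescribe a library), both routes are available in practice: LinearSVC can solve the primal problem directly, while SVC always works with the dual and can therefore accept kernels. Note that LinearSVC uses a squared hinge loss by default, so the two solutions may differ slightly:

from sklearn.datasets import make_classification
from sklearn.svm import LinearSVC, SVC

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

# Primal form: only meaningful for a linear SVM.
primal_clf = LinearSVC(C=1.0, dual=False, max_iter=10000).fit(X, y)

# Dual form: expressed in terms of dot products, so kernels can be plugged in.
dual_clf = SVC(kernel="linear", C=1.0).fit(X, y)

print(primal_clf.score(X, y), dual_clf.score(X, y))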

5) Earlier we discussed applying SVM to linearly separable data, but such data is rare in practice. This is where the kernel trick plays a huge role. The idea is to map the non-linearly separable data set into a higher-dimensional space in which we can find a hyperplane that separates the samples.

The kernel trick reduces the complexity of this step because the mapping function never has to be computed explicitly: the kernel function directly defines the inner product in the transformed space. Application of the kernel trick is not limited to the SVM algorithm; any computation involving dot products ⟨x, y⟩ can use it.
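
A small NumPy sketch (an illustration using the homogeneous degree-2 polynomial kernel, not taken from the original answer) shows the trick at work: the kernel value equals the dot product in the explicitly expanded feature space, but is computed without ever constructing that space:

import numpy as np

def phi(x):
    # Explicit feature map for the degree-2 polynomial kernel K(x, z) = (x^T z)^2 in 2-D.
    return np.array([x[0] ** 2, np.sqrt(2) * x[0] * x[1], x[1] ** 2])

x = np.array([1.0, 2.0])
z = np.array([3.0, 0.5])

explicit = phi(x) @ phi(z)   # dot product in the transformed 3-D space
kernel = (x @ z) ** 2        # kernel trick: same value, no explicit mapping
print(explicit, kernel)      # both print 16.0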
6) The polynomial kernel is a kernel function commonly used with support vector machines (SVMs) and other kernelized models. It represents the similarity of vectors (training samples) in a feature space over polynomials of the original variables, allowing non-linear models to be learned.

For degree-d polynomials, the polynomial kernel is defined as

K(x, y) = (x^T y + c)^d

where c ≥ 0 is a free parameter trading off the influence of higher-order versus lower-order terms of the polynomial.
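
For instance (an illustrative sketch assuming scikit-learn; the dataset and parameter values are arbitrary), an SVM with a polynomial kernel can be trained as follows, where degree corresponds to d and coef0 to the constant c:

from sklearn.datasets import make_moons
from sklearn.svm import SVC

X, y = make_moons(n_samples=200, noise=0.1, random_state=0)

# Degree-3 polynomial kernel: K(x, y) = (gamma * x^T y + coef0)^3
poly_clf = SVC(kernel="poly", degree=3, coef0=1.0, C=1.0).fit(X, y)
print("Training accuracy:", poly_clf.score(X, y))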

7) The RBF kernel on two samples x and x', represented as feature vectors in some input space, is defined as

K(x, x') = exp(-||x - x'||² / (2σ²))

where ||x - x'||² is the squared Euclidean distance between the two feature vectors and σ (sigma) is a free parameter.
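
A short check (assuming NumPy and scikit-learn, which are not mentioned in the original answer) computes the RBF kernel by hand and compares it with scikit-learn's implementation, which parameterizes the kernel with gamma = 1/(2σ²):

import numpy as np
from sklearn.metrics.pairwise import rbf_kernel

x = np.array([[1.0, 2.0]])
x_prime = np.array([[2.0, 0.0]])
sigma = 1.5

manual = np.exp(-np.sum((x - x_prime) ** 2) / (2 * sigma ** 2))
library = rbf_kernel(x, x_prime, gamma=1.0 / (2 * sigma ** 2))[0, 0]
print(manual, library)  # the two values are identical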

8) This question applies only to linear SVMs, since kernelized SVMs can only use the dual form. The computational complexity of the primal form of the SVM problem is roughly proportional to the number of training instances m, while the computational complexity of the dual form is proportional to a number between m² and m³. So, if there are millions of instances, you should use the primal form, because the dual form would be far too slow.
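
In practice (a sketch assuming scikit-learn; the dataset is scaled down so the example runs quickly), a primal-style solver such as SGDClassifier with hinge loss, or LinearSVC with dual=False, is the usual choice at that scale:

from sklearn.datasets import make_classification
from sklearn.linear_model import SGDClassifier

# Stand-in for a very large dataset (kept small here so the sketch runs quickly).
X, y = make_classification(n_samples=100_000, n_features=100, random_state=0)

# SGD with hinge loss optimizes the (primal) linear SVM objective one sample
# at a time, so training cost grows roughly linearly with the number of instances.
clf = SGDClassifier(loss="hinge", alpha=1e-4, random_state=0).fit(X, y)
print("Training accuracy:", clf.score(X, y))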

9) Support Vector Regression (SVR) uses the same principles as the SVM for classification, with only a few minor differences. Because the output is a real number, it has infinitely many possible values, so an exact prediction cannot be expected for every point. In the case of regression, a margin of tolerance (epsilon) is therefore set around the prediction: points that fall within this epsilon-tube incur no loss, and the model is trained to fit as many instances as possible inside the tube.
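
A brief sketch (the toy data and parameter values are arbitrary assumptions) fits an epsilon-SVR with scikit-learn, where the epsilon parameter sets the width of the no-penalty tube around the prediction:

import numpy as np
from sklearn.svm import SVR

# Noisy 1-D regression problem.
rng = np.random.default_rng(0)
X = np.sort(rng.uniform(0.0, 5.0, size=(80, 1)), axis=0)
y = np.sin(X).ravel() + rng.normal(scale=0.1, size=80)

# Points inside the epsilon-tube around the prediction incur no loss.
svr = SVR(kernel="rbf", C=10.0, epsilon=0.1).fit(X, y)
print("Number of support vectors:", len(svr.support_))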

10) In the soft-margin formulation of SVM given in answer 4, C is a hyperparameter.

The C hyperparameter adds a penalty for each margin violation (a misclassified point or a point falling inside the margin).

A large value of C implies a small margin, so the model tries to classify every training point correctly and tends to overfit the training data.
A small value of C implies a large margin, which tolerates more violations and might lead to underfitting of the model.
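
This effect can be observed directly (an illustrative sketch; the dataset and the values of C are arbitrary) by sweeping C and watching the training accuracy and the number of support vectors change:

from sklearn.datasets import make_moons
from sklearn.svm import SVC

X, y = make_moons(n_samples=200, noise=0.25, random_state=0)

# Small C -> wide margin, more violations tolerated (risk of underfitting).
# Large C -> narrow margin, fewer violations tolerated (risk of overfitting).
for C in (0.01, 1.0, 100.0):
    clf = SVC(kernel="rbf", C=C).fit(X, y)
    print(f"C={C:>7}: train accuracy={clf.score(X, y):.3f}, "
          f"support vectors={clf.n_support_.sum()}")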
