
Support Vector Machines: Kernels

CS4780/5780 – Machine Learning
Fall 2011

Thorsten Joachims
Cornell University

Reading: Schoelkopf/Smola, Chapters 7.4, 7.6, 7.8;
Cristianini/Shawe-Taylor, Sections 3.1, 3.2, 3.3.2, 3.4
Outline

• Transform a linear learner into a non-linear learner
• Kernels can make high-dimensional spaces tractable
• Kernels can make non-vectorial data tractable
Non-Linear Problems

Problem:
• some tasks have non-linear structure
• no hyperplane is sufficiently accurate
How can SVMs learn non-linear classification rules?
Extending the Hypothesis Space
Idea: add more features
⇒ Learn a linear rule in the feature space.
Example:
⇒ The separating hyperplane in feature space is a degree-two polynomial in input space.
Example

• Input Space: x = (x1, x2) (2 attributes)
• Feature Space: Φ(x) (6 attributes)
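The explicit feature map is not reproduced in this text; one standard degree-two map with six attributes, assumed here for illustration, is the following, and its inner product equals the polynomial kernel (x · z + 1)²:

\[
\Phi(x) = \big(x_1^2,\; x_2^2,\; \sqrt{2}\,x_1 x_2,\; \sqrt{2}\,x_1,\; \sqrt{2}\,x_2,\; 1\big),
\qquad
\Phi(x)\cdot\Phi(z) = (x \cdot z + 1)^2 .
\]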
Dual SVM Optimization Problem
• Primal Optimization Problem
• Dual Optimization Problem
• Theorem: If w* is the solution of the Primal and α* is the solution of the Dual, then w* = Σi αi* yi xi.
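The optimization problems themselves are not reproduced in this text; the standard soft-margin formulation (assumed here) is:

\[
\text{Primal:}\quad \min_{w,\,b,\,\xi}\;\; \tfrac{1}{2}\,w^{\top}w + C\sum_{i=1}^{n}\xi_i
\quad \text{s.t.}\quad y_i\,(w^{\top}x_i + b) \ge 1 - \xi_i,\;\; \xi_i \ge 0
\]
\[
\text{Dual:}\quad \max_{\alpha}\;\; \sum_{i=1}^{n}\alpha_i - \tfrac{1}{2}\sum_{i=1}^{n}\sum_{j=1}^{n}\alpha_i\alpha_j\,y_i y_j\,(x_i^{\top}x_j)
\quad \text{s.t.}\quad \sum_{i=1}^{n}\alpha_i y_i = 0,\;\; 0 \le \alpha_i \le C .
\]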
Kernels

Problem: Very many parameters! Polynomials of degree p over N attributes in input space lead to O(N^p) attributes in feature space!
Solution [Boser et al.]: The dual OP depends only on inner products ⇒ Kernel Functions
Example: A kernel function K(x, z) = Φ(x) · Φ(z) that can be evaluated directly in input space computes the inner product in feature space
⇒ no need to represent the feature space explicitly.
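As a concrete numeric check (an illustration, not from the slides), using the degree-two feature map assumed above: evaluating the kernel in input space gives the same value as the explicit feature-space inner product.

    import numpy as np

    def phi(x):
        # Assumed degree-two feature map: 2 input attributes -> 6 feature attributes.
        x1, x2 = x
        return np.array([x1**2, x2**2,
                         np.sqrt(2) * x1 * x2,
                         np.sqrt(2) * x1, np.sqrt(2) * x2, 1.0])

    def poly_kernel(x, z):
        # Polynomial kernel of degree 2, evaluated directly in input space.
        return (np.dot(x, z) + 1.0) ** 2

    x, z = np.array([1.0, 2.0]), np.array([3.0, -1.0])
    print(np.dot(phi(x), phi(z)))   # 4.0
    print(poly_kernel(x, z))        # 4.0 -- same value, no explicit Phi needed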
SVM with Kernel
Training:

Classification:

New hypothesis spaces through new Kernels:
• Linear:
• Polynomial:
• Radial Basis Function:
• Sigmoid:
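The formulas on this slide are not reproduced in the text. The standard kernelized forms, assumed here, are: training solves the dual with the inner product xi · xj replaced by K(xi, xj), and classification expands w through the kernel. Commonly used parameterizations of the listed kernels follow.

\[
\text{Training:}\quad \max_{\alpha}\; \sum_{i=1}^{n}\alpha_i - \tfrac{1}{2}\sum_{i,j}\alpha_i\alpha_j\,y_i y_j\,K(x_i, x_j)
\quad \text{s.t.}\quad \sum_i \alpha_i y_i = 0,\; 0 \le \alpha_i \le C
\]
\[
\text{Classification:}\quad h(x) = \mathrm{sign}\Big(\sum_{i=1}^{n}\alpha_i^{*}\,y_i\,K(x_i, x) + b^{*}\Big)
\]
\[
K_{\text{lin}}(x,z) = x \cdot z,\quad
K_{\text{poly}}(x,z) = (x \cdot z + 1)^d,\quad
K_{\text{rbf}}(x,z) = \exp\!\big(-\gamma \|x - z\|^2\big),\quad
K_{\text{sig}}(x,z) = \tanh(\gamma\, x \cdot z + c)
\]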
Examples of Kernels

[Figures: decision boundaries learned with a Polynomial kernel and with a Radial Basis Function kernel]
What is a Valid Kernel?

Definition: Let X be a nonempty set. A function K: X × X → ℝ is a valid kernel on X if for all n and all x1, …, xn ∈ X it produces a Gram matrix
Gij = K(xi, xj)
that is symmetric,
G = G^T,
and positive semi-definite,
c^T G c ≥ 0 for all c ∈ ℝ^n.
How to Construct Valid Kernels
Theorem: Let K1 and K2 be valid kernels over X × X, X ⊆ ℝ^N, α ≥ 0, 0 ≤ λ ≤ 1, f a real-valued function on X, Φ: X → ℝ^m with a kernel K3 over ℝ^m × ℝ^m, and K a symmetric positive semi-definite matrix. Then the following functions are valid kernels:
K(x,z) = λ K1(x,z) + (1-λ) K2(x,z)
K(x,z) = α K1(x,z)
K(x,z) = K1(x,z) K2(x,z)
K(x,z) = f(x) f(z)
K(x,z) = K3(Φ(x), Φ(z))
K(x,z) = x^T K z
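A quick empirical sanity check for a candidate kernel (an illustration, not from the slides): build the Gram matrix on a sample of points and confirm it is symmetric with non-negative eigenvalues.

    import numpy as np

    def rbf_kernel(x, z, gamma=1.0):
        # Radial basis function kernel, a known valid kernel.
        return np.exp(-gamma * np.sum((x - z) ** 2))

    rng = np.random.default_rng(0)
    X = rng.normal(size=(20, 3))                    # 20 sample points in R^3
    G = np.array([[rbf_kernel(a, b) for b in X] for a in X])

    print(np.allclose(G, G.T))                      # symmetric
    print(np.linalg.eigvalsh(G).min() >= -1e-10)    # positive semi-definite (up to roundoff)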
Kernels for Discrete and Structured Data
Kernels for Sequences: Two sequences are similar if they have many common and consecutive subsequences.
Example [Lodhi et al., 2000]: For 0 ≤ λ ≤ 1 consider the following feature space:

        c-a  c-t  a-t  b-a  b-t  c-r  a-r  b-r
Φ(cat)  λ²   λ³   λ²   0    0    0    0    0
Φ(car)  λ²   0    0    0    0    λ³   λ²   0
Φ(bat)  0    0    λ²   λ²   λ³   0    0    0
Φ(bar)  0    0    0    λ²   0    0    λ²   λ³

⇒ K(car, cat) = λ⁴, efficient computation via dynamic programming
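A brute-force sketch of this length-2 subsequence feature map (an illustration, not part of the slides; the dynamic program of Lodhi et al. computes the same kernel value without enumerating features):

    from collections import defaultdict

    def subseq_features(s, lam):
        # Explicit feature map: each ordered character pair (s[i], s[j]) with i < j
        # contributes lam**(j - i + 1), the weight of the spanned subsequence.
        phi = defaultdict(float)
        for i in range(len(s)):
            for j in range(i + 1, len(s)):
                phi[(s[i], s[j])] += lam ** (j - i + 1)
        return phi

    def string_kernel(s, t, lam=0.5):
        # K(s, t) = inner product of the explicit feature vectors.
        ps, pt = subseq_features(s, lam), subseq_features(t, lam)
        return sum(v * pt[k] for k, v in ps.items() if k in pt)

    print(string_kernel("car", "cat"))  # lam**4 = 0.0625, matching K(car, cat) = λ^4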
Kernels for Non-Vectorial Data
• Applications with Non-Vectorial Input Data
⇒ classify non-vectorial objects
– Protein classification (x is a string of amino acids)
– Drug activity prediction (x is a molecule structure)
– Information extraction (x is a sentence of words)
– Etc.
• Applications with Non-Vectorial Output Data
⇒ predict non-vectorial objects
– Natural Language Parsing (y is a parse tree)
– Noun-Phrase Co-reference Resolution (y is a clustering)
– Search engines (y is a ranking)
⇒ Kernels can compute inner products efficiently!
Properties of SVMs with Kernels
• Expressiveness
– SVMs with kernel can represent any boolean function (for appropriate choice of kernel)
– SVMs with kernel can represent any sufficiently "smooth" function to arbitrary accuracy (for appropriate choice of kernel)
• Computational
– Objective function has no local optima (only one global optimum)
– Training is independent of the dimensionality of the feature space
• Design decisions
– Kernel type and parameters
– Value of C
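A practical illustration of these design decisions (not from the slides), as a minimal scikit-learn sketch on a small non-linear toy problem:

    from sklearn.datasets import make_moons
    from sklearn.svm import SVC

    # Toy non-linear problem: two interleaving half-circles.
    X, y = make_moons(n_samples=200, noise=0.2, random_state=0)

    # Design decisions: kernel type and its parameters (here RBF with gamma),
    # and the regularization constant C from the soft-margin objective.
    clf = SVC(kernel="rbf", gamma=0.5, C=1.0)
    clf.fit(X, y)
    print(clf.score(X, y))  # training accuracy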
SVMs for other Problems

• Multi-class Classification
– [Schoelkopf/Smola Book, Section 7.6]
• Regression
– [Schoelkopf/Smola Book, Section 1.6]
• Outlier Detection
– D.M.J. Tax and R.P.W. Duin, "Support vector domain description", Pattern Recognition Letters, vol. 20, pp. 1191-1199, 1999.
• Structural Prediction
– B. Taskar, C. Guestrin, D. Koller - Advances in Neural
Information Processing Systems, 2003.
– I. Tsochantaridis, T. Hofmann, T. Joachims, and Y. Altun,
Support Vector Machine Learning for Interdependent and
Structured Output Spaces, Proceedings of the International
Conference on Machine Learning (ICML), 2004.
