Logistic Regression for Binary Classification

The lecture focuses on constructing binary classifiers using Logistic Regression, covering topics such as applying logistic regression, formulating likelihoods, and deriving gradients and Hessians. It introduces the Iteratively Reweighted Least Squares (IRLS) algorithm for optimization and discusses the softmax link for logistic regression. The next lecture will address automatic derivative computation through back-propagation.

Uploaded by

Tachbir Dewan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views16 pages

Logistic Regression for Binary Classification

Uploaded by

Tachbir Dewan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Outline of the lecture

This lecture describes the construction of binary classifiers using a

technique called Logistic Regression. The objective is for you to learn:

 How to apply logistic regression to discriminate between two

classes.
 How to formulate the logistic regression likelihood.
 How to derive the gradient and Hessian of logistic regression.
 How to incorporate the gradient vector and Hessian matrix into
Newton’s optimization algorithm so as to come up with an algorithm
for logistic regression, which we call IRLS.
 How to do logistic regression with the softmax link.
McCulloch-Pitts model of a neuron
PSigmoid function
sigm(´) refers to the sigmoid function, also known as the logistic or
logit function:
1 e´
sigm(´) = ¡´
= ´
1+e e +1
Linear separating hyper-plane

[Greg Shakhnarovich]
Bernoulli: a model for coins
A Bernoulli random variable r.v. X takes values in {0,1}

q if x=1
p(x|q ) =
1- q if x=0

Where q 2 (0,1). We can write this probability more succinctly as

follows:
Entropy
In information theory, entropy H is a measure of the uncertainty
associated with a random variable. It is defined as:

H(X) = - S
x
p(x|q ) log p(x|q )

Example: For a Bernoulli variable X, the entropy is:

Logistic regression
The logistic regression model speci¯es the probability of a binary output
yi 2 f0; 1g given the input xi as follows:
n
Y
p(yjX; µ) = Ber(yi jsigm(xi µ))
i=1
Yn · ¸yi · ¸1¡yi
1 1
= 1¡
i=1
1 + e¡xi µ 1 + e¡xi µ
Pd
where xi µ = µ0 + j=1 µj xij
Gradient and Hessian of binary logistic regression
The gradient and Hessian of the negative loglikelihood, J(µ) = ¡ log p(yjX; µ),
are given by:

X n
d
g(w) = J(µ) = xTi (¼i ¡ yi ) = XT (¼ ¡ y)
dµ i=1
d X
T
H = g(µ) = ¼i (1 ¡ ¼i )xi xTi = XT diag(¼i (1 ¡ ¼i ))X
dµ i

where ¼i = sigm(xi µ)

One can show that H is positive de¯nite; hence the NLL is convex and
has a unique global minimum.

To ¯nd this minimum, we turn to batch optimization.

Iteratively reweighted least squares (IRLS)
For binary logistic regression, recall that the gradient and Hessian of the
negative log-likelihood are given by

gk = XT (¼ k ¡ y)
Hk = XT Sk X
Sk := diag(¼1k (1 ¡ ¼1k ); : : : ; ¼nk (1 ¡ ¼nk ))
¼ik = sigm(xi µ k )

The Newton update at iteration k + 1 for this model is as follows (using

´k = 1, since the Hessian is exact):

µ k+1 = µ k ¡ H¡1 gk
= µ k + (XT Sk X)¡1 XT (y ¡ ¼ k )
T ¡1
£ T T
¤
= (X Sk X) (X Sk X)µ k + X (y ¡ ¼ k )
= (XT Sk X)¡1 XT [Sk Xµ k + y ¡ ¼ k ]
Softmax formulation
Likelihood function
Negative log-likelihood criterion
Neural network representation of loss
Manual gradient computation
Manual gradient computation
Next lecture
In the next lecture, we develop an automatic layer-wise way of
computing all the necessary derivatives known as back-propagation.

This is the approach used in Torch. We will review the torch nn class.

Logistic Regression and Neuron Models
No ratings yet
Logistic Regression and Neuron Models
30 pages
Logistic Regression
No ratings yet
Logistic Regression
20 pages
Chapter 02
No ratings yet
Chapter 02
9 pages
Logistic Regression Overview and Concepts
No ratings yet
Logistic Regression Overview and Concepts
25 pages
Understanding Logistic Regression Models
No ratings yet
Understanding Logistic Regression Models
10 pages
Logistic Regression and Gradient Ascent
No ratings yet
Logistic Regression and Gradient Ascent
3 pages
Logistic Regression with Gradient Descent
No ratings yet
Logistic Regression with Gradient Descent
19 pages
Logistic Regression for Binary Classification
No ratings yet
Logistic Regression for Binary Classification
18 pages
Understanding Logistic Regression Basics
No ratings yet
Understanding Logistic Regression Basics
16 pages
IRLS in Logistic Regression
No ratings yet
IRLS in Logistic Regression
8 pages
12 Logistic Regression
No ratings yet
12 Logistic Regression
84 pages
Logistic Regression and Classification Overview
No ratings yet
Logistic Regression and Classification Overview
25 pages
Logistic Regression for Classification
No ratings yet
Logistic Regression for Classification
28 pages
5 Logistic Regression
No ratings yet
5 Logistic Regression
7 pages
Logistic Regression and Regularization Explained
No ratings yet
Logistic Regression and Regularization Explained
40 pages
Logistic Regression for Binary Classification
No ratings yet
Logistic Regression for Binary Classification
7 pages
6.1 Logistic Regression
No ratings yet
6.1 Logistic Regression
41 pages
3 Logistic Regression
No ratings yet
3 Logistic Regression
20 pages
Logistic Regression Overview and Methods
No ratings yet
Logistic Regression Overview and Methods
19 pages
Logistic Regression Explained: Classifiers
No ratings yet
Logistic Regression Explained: Classifiers
94 pages
Logistic Regression Explained: Classifiers
No ratings yet
Logistic Regression Explained: Classifiers
91 pages
Understanding Logistic Regression Basics
No ratings yet
Understanding Logistic Regression Basics
10 pages
Lecture No 3
No ratings yet
Lecture No 3
10 pages
Understanding Logistic Regression Models
No ratings yet
Understanding Logistic Regression Models
23 pages
Logistic Regression Overview and Types
No ratings yet
Logistic Regression Overview and Types
9 pages
Decision Tree Depth in Logistic Regression
No ratings yet
Decision Tree Depth in Logistic Regression
66 pages
Logistic Regression in Machine Learning
No ratings yet
Logistic Regression in Machine Learning
4 pages
Logistic Regression in Machine Learning
No ratings yet
Logistic Regression in Machine Learning
65 pages
Understanding Logistic Regression Basics
No ratings yet
Understanding Logistic Regression Basics
23 pages
Logistic Regression for Credit Card Fraud
No ratings yet
Logistic Regression for Credit Card Fraud
36 pages
Logistic Regression for Breast Cancer Detection
No ratings yet
Logistic Regression for Breast Cancer Detection
20 pages
Logistic Regression Overview
No ratings yet
Logistic Regression Overview
44 pages
Logistic Regression Overview
No ratings yet
Logistic Regression Overview
44 pages
Logistic Regression in Machine Learning
No ratings yet
Logistic Regression in Machine Learning
16 pages
Logistic Regression Overview and Methods
No ratings yet
Logistic Regression Overview and Methods
25 pages
Understanding Logistic Regression Basics
No ratings yet
Understanding Logistic Regression Basics
40 pages
Classification and Logistic Regression Overview
No ratings yet
Classification and Logistic Regression Overview
28 pages
Logistic Regression for Binary Classification
No ratings yet
Logistic Regression for Binary Classification
4 pages
Understanding Softmax Regression Basics
100% (1)
Understanding Softmax Regression Basics
10 pages
Logistic Regression and Training Methods
No ratings yet
Logistic Regression and Training Methods
10 pages
Understanding Logistic Regression Basics
No ratings yet
Understanding Logistic Regression Basics
8 pages
Understanding Logistic Regression Basics
No ratings yet
Understanding Logistic Regression Basics
7 pages
Logistic Regression Overview and Workflow
No ratings yet
Logistic Regression Overview and Workflow
28 pages
Logistic Regression Basics Explained
No ratings yet
Logistic Regression Basics Explained
10 pages
Logistic Regression for Classification
No ratings yet
Logistic Regression for Classification
17 pages
Understanding Logistic Regression Basics
No ratings yet
Understanding Logistic Regression Basics
13 pages
Logistic Regression Explained with Examples
No ratings yet
Logistic Regression Explained with Examples
10 pages
Understanding the Sigmoid Function
No ratings yet
Understanding the Sigmoid Function
26 pages
Logistic Regression for Binary Outcomes
No ratings yet
Logistic Regression for Binary Outcomes
7 pages
CS229 Logistic Regression Notes
No ratings yet
CS229 Logistic Regression Notes
7 pages
Logistic Regression in Machine Learning
No ratings yet
Logistic Regression in Machine Learning
43 pages
Linear Classifiers in Machine Learning
No ratings yet
Linear Classifiers in Machine Learning
4 pages
Cost Function in Logistic Regression
No ratings yet
Cost Function in Logistic Regression
9 pages
Understanding Logistic Regression Basics
No ratings yet
Understanding Logistic Regression Basics
17 pages
Logistic Regression Explained
No ratings yet
Logistic Regression Explained
10 pages
Understanding Logistic Regression
No ratings yet
Understanding Logistic Regression
28 pages
Logistic Regression Class Notes Guide
No ratings yet
Logistic Regression Class Notes Guide
3 pages
Logistic Regression for Text Classification
No ratings yet
Logistic Regression for Text Classification
64 pages
Logistic Regression
No ratings yet
Logistic Regression
6 pages
Tetris Game Project Approval Letter
No ratings yet
Tetris Game Project Approval Letter
13 pages
Understanding URL Components and Network Protocols
No ratings yet
Understanding URL Components and Network Protocols
27 pages
Tetris Game Development Strategies
No ratings yet
Tetris Game Development Strategies
14 pages
Tetris Game Project Report - BSMRSTU
No ratings yet
Tetris Game Project Report - BSMRSTU
1 page
Image Compression Techniques Overview
No ratings yet
Image Compression Techniques Overview
23 pages
Tetris Project Report - BSMRSTU CSE
No ratings yet
Tetris Project Report - BSMRSTU CSE
1 page
Introduction to Sequence Models and LSTMs
No ratings yet
Introduction to Sequence Models and LSTMs
22 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
6 pages
Mathematical Modeling in Engineering
No ratings yet
Mathematical Modeling in Engineering
17 pages
Approximations and Round-off Errors
No ratings yet
Approximations and Round-off Errors
16 pages
Bracketing Methods for Root Finding
No ratings yet
Bracketing Methods for Root Finding
26 pages
Newton's Interpolating Polynomials Explained
No ratings yet
Newton's Interpolating Polynomials Explained
15 pages
Employee Management System Project Report
No ratings yet
Employee Management System Project Report
22 pages
ICT Data Backup and Recovery Policy 28 March 2018
No ratings yet
ICT Data Backup and Recovery Policy 28 March 2018
31 pages
Angular Developer Resume - Deepak Rawat
No ratings yet
Angular Developer Resume - Deepak Rawat
2 pages
Step-by-Step Application Guide
No ratings yet
Step-by-Step Application Guide
16 pages
Pokémon Unbound v2.0.3.2 Walkthrough
No ratings yet
Pokémon Unbound v2.0.3.2 Walkthrough
270 pages
EcoFlow Power Kit Console Manual
No ratings yet
EcoFlow Power Kit Console Manual
84 pages
C Sorting Algorithms: Quicksort, Mergesort, Selection Sort
No ratings yet
C Sorting Algorithms: Quicksort, Mergesort, Selection Sort
11 pages
Red Canary Threat Intelligence Insights
No ratings yet
Red Canary Threat Intelligence Insights
4 pages
Intellectual Property Rights Overview
No ratings yet
Intellectual Property Rights Overview
88 pages
AQA GCSE Computer Science Workbook
No ratings yet
AQA GCSE Computer Science Workbook
102 pages
Online Student Clearance DBMS Project
No ratings yet
Online Student Clearance DBMS Project
41 pages
Apple vs. Microsoft Financial Analysis
No ratings yet
Apple vs. Microsoft Financial Analysis
6 pages
Analyzing Malware Persistence Techniques
No ratings yet
Analyzing Malware Persistence Techniques
10 pages
DevOps and QA Expertise Overview
No ratings yet
DevOps and QA Expertise Overview
7 pages
Computer Evolution and Performance Insights
No ratings yet
Computer Evolution and Performance Insights
32 pages
Simple Loan Payment Calculator
No ratings yet
Simple Loan Payment Calculator
14 pages
ERP System Stakeholders and Risks 2024
No ratings yet
ERP System Stakeholders and Risks 2024
35 pages
Podcasting Success: Create, Grow, Monetize
No ratings yet
Podcasting Success: Create, Grow, Monetize
12 pages
Microsoft 365 Exam MS-900 Practice Questions
No ratings yet
Microsoft 365 Exam MS-900 Practice Questions
8 pages
GitHub Copilot: Configuration and Usage Guide
No ratings yet
GitHub Copilot: Configuration and Usage Guide
6 pages
Huffman Coding: Lossless Compression
No ratings yet
Huffman Coding: Lossless Compression
15 pages
Marriott Data Breach Analysis 2018
No ratings yet
Marriott Data Breach Analysis 2018
6 pages
CS61A Su20 - Lab 00 Setup
No ratings yet
CS61A Su20 - Lab 00 Setup
18 pages
Analyzing Array Multiplier Delays
No ratings yet
Analyzing Array Multiplier Delays
92 pages
CS101 Course Logistics Overview
No ratings yet
CS101 Course Logistics Overview
60 pages
SAP DMS Configuration Guide
No ratings yet
SAP DMS Configuration Guide
11 pages
Study G-Major February PDF
100% (2)
Study G-Major February PDF
2 pages
Laboratory Management API Guide
No ratings yet
Laboratory Management API Guide
24 pages
Android OS Development Overview
No ratings yet
Android OS Development Overview
12 pages
Cybersecurity Engineer Career Roadmap
No ratings yet
Cybersecurity Engineer Career Roadmap
3 pages
Innovus Hierarchical Design Flow Guide
No ratings yet
Innovus Hierarchical Design Flow Guide
23 pages

Logistic Regression for Binary Classification

Uploaded by

Logistic Regression for Binary Classification

Uploaded by

Outline of the lecture

This lecture describes the construction of binary classifiers using a

 How to apply logistic regression to discriminate between two

Where q 2 (0,1). We can write this probability more succinctly as

Example: For a Bernoulli variable X, the entropy is:

To ¯nd this minimum, we turn to batch optimization.

The Newton update at iteration k + 1 for this model is as follows (using

You might also like