W8 - Logistic Regression

Logistic regression is a statistical method for classification problems. It uses the logistic function to model the probability that an input belongs to a class. The model computes a hypothesis function from weights and a bias, and applies the sigmoid activation function to convert the output into a probability between 0 and 1. It uses a logistic cost function instead of mean squared error for optimization, and gradient descent updates the weights iteratively to minimize that cost, adjusting them to increase the likelihood of the training data. Once trained, the model predicts class 1 if the probability is above 0.5 and class 0 otherwise. A worked example trains a logistic regression model on sample data to classify articles as Technical or Non-technical.

LOGISTIC REGRESSION

Likelihood Vs Probability
Probability vs Statistics
• In probability theory we consider some underlying process which has
some randomness or uncertainty modeled by random variables, and
we figure out what happens.
• In statistics we observe something that has happened, and try to
figure out what underlying process would explain those observations.
• The likelihood function is a fundamental concept in statistical inference.
• It indicates how likely a particular population is to produce an observed sample.
• Probability refers to the chance of a future outcome, while likelihood measures how well particular parameter values explain data that has already been observed.
Likelihood Vs Probability
• Probability is simply how likely something is to happen.
• The occurrence of discrete values yk is expressed by the probability P(yk).
• The distribution of all possible values of discrete random variable y is
expressed as probability distribution.
• We assume that there is some a priori probability (or simply prior) P(yk) that the next feature vector belongs to class k.
• P(x | yk) is called the class likelihood: the conditional probability that a pattern belonging to class yk has the associated observation value x.
• The class yk that maximizes P(x | yk) is called the Maximum Likelihood (ML) class.
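As a toy illustration of this idea, picking the ML class amounts to taking an argmax over the class likelihoods. The values below are invented for demonstration, not taken from any dataset:

# Toy illustration: choosing the Maximum Likelihood (ML) class.
# The class-likelihood values P(x | y_k) below are invented for demonstration.
class_likelihoods = {
    "technical": 0.30,       # assumed P(x | y = technical)
    "non_technical": 0.55,   # assumed P(x | y = non_technical)
}

# The ML class is the one whose class likelihood is largest.
ml_class = max(class_likelihoods, key=class_likelihoods.get)
print(ml_class)  # -> non_technical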
Likelihood Vs Probability
• Probability follows clear parameters and computations, while likelihood is based on the observed data.
• P(data; μ, σ) means "the probability density of observing the data with model parameters μ and σ." This generalises to any number of parameters and any distribution.
• On the other hand, L(μ, σ; data) means "the likelihood of the parameters μ and σ taking certain values given that we've observed a bunch of data."
• These two quantities are numerically equal: L(μ, σ; data) = P(data; μ, σ).
• But despite being equal, the likelihood and the probability density are fundamentally asking different questions: one asks about the data, the other asks about the parameter values.
Example of Probability
• Consider a dataset containing the heights of the people of a particular country. Let's say the mean of the data is 170 cm and the standard deviation is 3.5 cm.
• When a probability is calculated from this dataset, the characteristics of the distribution are held constant, i.e. the mean and standard deviation are fixed and cannot be altered.
• Say the probability of height > 170 cm has to be calculated for a random record in the dataset. It is computed with the parameters fixed: P(height > 170; μ = 170, σ = 3.5).
• While calculating probability, the feature value can be varied, but the characteristics (mean and standard deviation) of the data distribution cannot be altered.
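A minimal sketch of this calculation in Python, assuming the heights follow a normal distribution with the fixed parameters from the example (μ = 170, σ = 3.5):

from scipy.stats import norm

# The distribution characteristics are held constant.
mu, sigma = 170, 3.5

# P(height > 170; mu, sigma) = 1 - CDF(170), i.e. the survival function.
p = norm.sf(170, loc=mu, scale=sigma)
print(p)  # 0.5, since 170 cm is exactly the mean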
Example of Likelihood
• Likelihood calculation involves finding the best distribution, or best characteristics of the data, given a particular feature value or situation.
• Consider exactly the same dataset as in the probability example. If the likelihood for height > 170 cm has to be calculated, the conditioning flips compared with the probability calculation: the data are fixed and the parameters vary, written L(μ, σ; height > 170).
• Here the dataset characteristics are varied, i.e. the mean and standard deviation of the distribution are adjusted to get the maximum likelihood for height > 170 cm.
• In very simple terms, maximizing likelihood means increasing the chance of a particular situation occurring by varying the characteristics of the dataset distribution.
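A minimal sketch of this search in Python; the candidate parameter grids are illustrative assumptions, not values from the example:

import numpy as np
from scipy.stats import norm

# The observed event (height > 170 cm) stays fixed; the distribution
# parameters are varied to see which (mu, sigma) makes it most likely.
best_mu, best_sigma, best_l = None, None, -1.0
for mu in np.arange(165.0, 181.0, 1.0):        # candidate means
    for sigma in np.arange(1.0, 6.0, 0.5):     # candidate standard deviations
        l = norm.sf(170, loc=mu, scale=sigma)  # L(mu, sigma; height > 170)
        if l > best_l:
            best_mu, best_sigma, best_l = mu, sigma, l

print(best_mu, best_sigma, best_l)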
Logistic Regression Implementation
Hypothesis function
• In logistic regression, we apply the sigmoid activation function to the hypothesis function of linear regression.
• The resulting hypothesis function for logistic regression is:
h(x) = sigmoid(wx + b)
Here, w is the weight vector,
x is the feature vector,
b is the bias, and
sigmoid(z) = 1 / (1 + e^(-z))
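A minimal sketch of the hypothesis function in Python; the weights, bias, and feature values below are arbitrary illustrations:

import numpy as np

def sigmoid(z):
    # sigmoid(z) = 1 / (1 + e^(-z)): squashes z into the range (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

def hypothesis(w, x, b):
    # h(x) = sigmoid(w . x + b): the predicted probability of class 1
    return sigmoid(np.dot(w, x) + b)

w = np.array([0.4, -0.25])   # arbitrary weight vector
x = np.array([1.9, 3.1])     # feature vector
print(hypothesis(w, x, b=-0.1))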
Cost function
• The cost function of linear regression (mean squared error) can't be used in logistic regression, because with the sigmoid it becomes a non-convex function of the weights.
• Optimization algorithms such as gradient descent are only guaranteed to converge to the global minimum of a convex function.
• So we use the simplified logistic cost function (derived in the last class):
J = -y log(h(x)) - (1 - y) log(1 - h(x))
Here, y is the real target value and h(x) = sigmoid(wx + b).
For y = 0: J = -log(1 - h(x))
For y = 1: J = -log(h(x))
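A minimal sketch of this cost for a single training example; the eps clamp is an added numerical safeguard, not part of the formula:

import numpy as np

def logistic_cost(h_x, y, eps=1e-12):
    # J = -y*log(h(x)) - (1 - y)*log(1 - h(x)); eps guards against log(0)
    h_x = np.clip(h_x, eps, 1.0 - eps)
    return -y * np.log(h_x) - (1.0 - y) * np.log(1.0 - h_x)

print(logistic_cost(0.9, 1))  # small cost: confident and correct
print(logistic_cost(0.9, 0))  # large cost: confident but wrong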
Gradient Descent Calculation
repeat until convergence {
    tmp_i = w_i - alpha * dw_i
    w_i = tmp_i
}
where alpha is the learning rate.
• The chain rule is used to calculate the gradients such as dw_i = ∂J/∂w_i.
• Here, a = sigmoid(z) and z = wx + b, which gives dw_i = (a - y) * x_i and db = a - y.
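A minimal sketch of the full update loop in Python, vectorised over a training set; the learning rate and iteration count are illustrative choices:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train(X, y, alpha=0.1, iterations=1000):
    # X: (m, n) feature matrix; y: (m,) labels in {0, 1}
    m, n = X.shape
    w, b = np.zeros(n), 0.0
    for _ in range(iterations):
        a = sigmoid(X @ w + b)   # a = sigmoid(z), z = wx + b
        dz = a - y               # error term from the chain rule
        dw = X.T @ dz / m        # averaged gradient for each w_i
        db = dz.mean()           # averaged gradient for the bias
        w -= alpha * dw          # tmp_i = w_i - alpha * dw_i; w_i = tmp_i
        b -= alpha * db
    return w, b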


Next?
• Update the weights in an iterative process.
• After completing all iterations, evaluate the hypothesis function h(x).
• Threshold the classifier output h(x) at 0.5:
If h(x) >= 0.5, predict "y = 1"
If h(x) < 0.5, predict "y = 0"
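A minimal sketch of the thresholding step, using an arbitrary weight vector and bias for illustration:

import numpy as np

def predict(X, w, b):
    # h(x) = sigmoid(wx + b); predict 1 where h(x) >= 0.5, else 0
    h = 1.0 / (1.0 + np.exp(-(X @ w + b)))
    return (h >= 0.5).astype(int)

X = np.array([[1.9, 3.1], [4.0, 1.0]])  # two illustrative samples
print(predict(X, w=np.array([0.41, -0.25]), b=-0.11))  # -> [0 1]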
Logistic Regression Numerical
Example 1
• Some samples from two classes of articles, Technical (1) and Non-technical (0), are given.
• Each sample has two features:
• Time, the average time required to read the article, in hours;
• Sentences, the number of sentences in the article.
• First, we need to train our logistic regression model.

(Training samples table not reproduced; query sample: Time = 1.9, Sentences = 3.1, Class = ?)
Example 1
• Training involves finding the optimal values of the coefficients B0, B1, and B2.
• While training, we find some values of the coefficients in one step and use those coefficients in the next step to improve them.
• We continue to do this until the model gives consistent accuracy.
• The model being fitted is:
ŷ = 1 / (1 + e^(-(B0 + B1*X1 + B2*X2)))
Example 1
• After 20 iterations, we get:
B0 = -0.1068913
B1 = 0.41444855
B2 = -0.2486209
• Thus, the decision boundary is given as:
Z = B0 + B1*X1 + B2*X2
Z = -0.1068913 + 0.41444855*Time - 0.2486209*Sentences
Example 1
• For X1 = 1.9 and X2 = 3.1, we get:
Z = -0.1068913 + 0.41444855*1.9 - 0.2486209*3.1
Z = -0.090163845
• Now we apply the sigmoid function to turn Z into a probability and predict the class of the given sample:
y = 1 / (1 + e^(-Z)) = 1 / (1 + e^(0.090163845)) ≈ 0.477
• Since y ≈ 0.477 is less than 0.5, we can safely classify the given sample as Non-technical.
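The arithmetic can be checked with a few lines of Python, using the coefficients from the slide:

import math

# Coefficients after 20 iterations (from the slide)
B0, B1, B2 = -0.1068913, 0.41444855, -0.2486209
x1, x2 = 1.9, 3.1  # query sample: Time, Sentences

z = B0 + B1 * x1 + B2 * x2
y = 1.0 / (1.0 + math.exp(-z))
print(z)  # ~ -0.0902
print(y)  # ~ 0.477 < 0.5, so predict class 0 (Non-technical)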
Examples 2 & 3
• Further worked examples can be found at the following links:
• https://machinelearningmastery.com/logistic-regression-tutorial-for-
machine-learning/
• https://courses.lumenlearning.com/introstats1/chapter/introduction-
to-logistic-regression/
