[Slide] Logistic Regression
AI VIETNAM – AI Insight Course
Quang-Vinh Dinh
Ph.D. in Computer Science
Year 2020
Outline
Sigmoid function
From Linear to Logistic Regression
Logistic Regression – Stochastic
Logistic Regression – Mini-batch
Logistic Regression – Batch
Sigmoid Function

Sigmoid function:
    y = σ(u) = 1 / (1 + e^(−u))
    u ∈ (−∞, +∞),  y ∈ (0, 1)

Property (monotonicity):
    ∀ u1, u2 ∈ [a, b] and u1 ≤ u2  →  σ(u1) ≤ σ(u2)

[Plot] The sigmoid curve: u on the horizontal axis, y = σ(u) on the vertical axis; u1 ≤ u2 implies σ(u1) ≤ σ(u2).
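As a quick illustration (not from the slides; NumPy and the variable names are mine), a minimal sketch of the sigmoid and its monotonicity:

```python
import numpy as np

def sigmoid(u):
    """Sigmoid: maps u in (-inf, +inf) to y in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-u))

u = np.array([-5.0, -1.0, 0.0, 1.0, 5.0])
print(sigmoid(u))      # strictly increasing values, all in (0, 1)
print(sigmoid(0.0))    # 0.5
# Monotonic: u1 <= u2  =>  sigmoid(u1) <= sigmoid(u2)
```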
Sigmoid Function

Linear model:
    y = θᵀx,  y ∈ (−∞, +∞)

Pass the linear output through the sigmoid:
    z = θᵀx
    y = σ(z) = 1 / (1 + e^(−z)),  y ∈ (0, 1)

[Plot] Left: the line y = θᵀx as a function of x. Right: the curve y = 1 / (1 + e^(−θᵀx)).
Sigmoid Function

Fit a sigmoid to labelled data (Feature x, Label ∈ {Category 1, Category 2}):
    z = θᵀx
    y = σ(z) = 1 / (1 + e^(−z)),  y ∈ (0, 1)

Example fit:  z = 0.535·x − 0.654

[Plot] Data points of the two categories with the fitted curve 1 / (1 + e^(−θᵀx)).
Sigmoid Function

The same data (Feature x, Label ∈ {Category 1, Category 2}) with a steeper fit:
    z = θᵀx
    y = σ(z) = 1 / (1 + e^(−z)),  y ∈ (0, 1)

Example fit:  z = 2.331·x − 5.156

[Plot] Data points of the two categories with the fitted curve 1 / (1 + e^(−θᵀx)).
Sigmoid Function

    z = θᵀx
    y = σ(z) = 1 / (1 + e^(−z)),  y ∈ (0, 1)

[Plot] Another two-category dataset (Feature x, Label ∈ {Category 1, Category 2}) with its fitted sigmoid 1 / (1 + e^(−θᵀx)).
Outline
Sigmoid function
From Linear to Logistic Regression
Logistic Regression – Stochastic
Logistic Regression – Mini-batch
Logistic Regression – Batch
Idea of Logistic Regression

Recall linear regression: a model (a line) is fit to training data (x, y).

[Plot] Training data points and the fitted line of a linear regression model.
Idea of Logistic Regression

Given a new kind of data: each sample has a Feature and a Label (Category 1 or Category 2).

Plot the data, then assign numbers to the categories.
A line is not suitable for this data.

[Plot] Feature on the horizontal axis; the two categories form two flat groups that a straight line cannot fit well.
Idea of Logistic Regression

Given the same data (Feature, Label), a sigmoid function could fit the data:
    z = θᵀx
    ŷ = σ(z) = 1 / (1 + e^(−z)),  ŷ ∈ (0, 1)

Error for one sample:
    if y = 1:  error = 1 − ŷ
    if y = 0:  error = ŷ

[Plot] The two-category data with a fitted sigmoid 1 / (1 + e^(−θᵀx)); the error is the gap between ŷ and the true label.
Idea of Logistic Regression

Construct the loss. For sample i:

    Error                            Belief
    if y_i = 1: error = 1 − ŷ_i      if y_i = 1: belief = ŷ_i
    if y_i = 0: error = ŷ_i          if y_i = 0: belief = 1 − ŷ_i

Both cases combine into a single expression:
    P_i = ŷ_i^(y_i) · (1 − ŷ_i)^(1 − y_i)

Minimize error ~ maximize belief ~ minimize (−belief).
Idea of Logistic Regression

Construct the loss over N samples. Since the samples are i.i.d.:

    belief = ∏_{i=1}^{N} P_i
    log-belief = Σ_{i=1}^{N} log P_i,   with P_i = ŷ_i^(y_i) · (1 − ŷ_i)^(1 − y_i)

Maximizing the log-belief is equivalent to minimizing

    L = (1/N) [ −yᵀ log ŷ − (1 − y)ᵀ log(1 − ŷ) ]

which is the binary cross-entropy.
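A short sketch (assuming NumPy; the ŷ values are taken from the four-sample batch example later in the deck) showing that the mean negative log-belief is exactly the binary cross-entropy:

```python
import numpy as np

y     = np.array([0.0, 0.0, 1.0, 1.0])
y_hat = np.array([0.6856, 0.6963, 0.8160, 0.8828])

# Per-sample belief P_i = y_hat^y * (1 - y_hat)^(1 - y)
P = y_hat ** y * (1 - y_hat) ** (1 - y)

# Maximizing the total belief (product of P_i) is the same as
# minimizing the mean negative log-belief, i.e. binary cross-entropy.
neg_log_belief = -np.mean(np.log(P))
bce = np.mean(-y * np.log(y_hat) - (1 - y) * np.log(1 - y_hat))
print(neg_log_belief, bce)   # both ~0.6692
```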
Idea of Logistic Regression

Model and loss:
    z = θᵀx
    ŷ = σ(z) = 1 / (1 + e^(−z))
    L = (1/N) [ −yᵀ log ŷ − (1 − y)ᵀ log(1 − ŷ) ]

Derivative (chain rule):
    ∂L/∂θ = (∂L/∂ŷ)(∂ŷ/∂z)(∂z/∂θ)
    ∂L/∂ŷ = (1/N) [ −y/ŷ + (1 − y)/(1 − ŷ) ] = (1/N) (ŷ − y) / (ŷ(1 − ŷ))
    ∂ŷ/∂z = ŷ(1 − ŷ)
    ∂z/∂θ = x
    ∂L/∂θ = (1/N) xᵀ(ŷ − y)
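A minimal sketch (NumPy, with illustrative toy values of my own) that checks the closed-form gradient (1/N)·xᵀ(ŷ − y) against finite differences:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def loss(theta, X, y):
    y_hat = sigmoid(X @ theta)
    return np.mean(-y * np.log(y_hat) - (1 - y) * np.log(1 - y_hat))

def gradient(theta, X, y):
    # Chain-rule result from the slide: dL/dtheta = (1/N) X^T (y_hat - y)
    return X.T @ (sigmoid(X @ theta) - y) / X.shape[0]

# Toy data; a finite-difference check confirms the closed-form gradient.
X = np.array([[1.0, 1.4, 0.2], [1.0, 3.0, 1.1]])
y = np.array([0.0, 1.0])
theta = np.array([0.1, 0.5, -0.1])

eps = 1e-6
numeric = np.array([
    (loss(theta + eps * np.eye(3)[i], X, y)
     - loss(theta - eps * np.eye(3)[i], X, y)) / (2 * eps)
    for i in range(3)
])
print(gradient(theta, X, y))   # analytic gradient
print(numeric)                 # numeric gradient, should match closely
```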
[Plot] The two loss components: −log(ŷ) (used when y = 1) and −log(1 − ŷ) (used when y = 0).
Idea of Logistic Regression

[Plot] Two labelled datasets (Feature, Label ∈ {Category 1, Category 2}), each with its fitted sigmoid
    z = θᵀx,  ŷ = σ(z) = 1 / (1 + e^(−θᵀx)).
Outline
Sigmoid function
From Linear to Logistic Regression
Logistic Regression – Stochastic
Logistic Regression – Mini-batch
Logistic Regression – Batch
Logistic Regression – Stochastic

Parameters and input (bias absorbed into θ):
    θᵀ = [b  w1  w2],   xᵀ = [1  x1  x2]

1) Pick a sample (x, y) from the training data.
2) Compute the output ŷ:
       z = θᵀx
       ŷ = σ(z) = 1 / (1 + e^(−z))
3) Compute the loss:
       L(θ) = −y log ŷ − (1 − y) log(1 − ŷ)

Worked example with the model b = 0.1, w1 = 0.5, w2 = −0.1:
    x = (1.4, 0.2),  y = 0
    z = w1·x1 + w2·x2 + b = 0.78
    ŷ = σ(z) = 0.6856
    L = −y log ŷ − (1 − y) log(1 − ŷ) = 1.1573
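A small sketch reproducing this forward pass and loss in NumPy (variable names are mine):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

b, w1, w2 = 0.1, 0.5, -0.1        # current model parameters
x1, x2, y = 1.4, 0.2, 0           # the picked sample

z = w1 * x1 + w2 * x2 + b         # 0.78
y_hat = sigmoid(z)                # ~0.6856
loss = -y * np.log(y_hat) - (1 - y) * np.log(1 - y_hat)   # ~1.1573
print(z, y_hat, loss)
```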
Logistic Regression – Stochastic

Backward step for the same sample. Dataset sample: x = (x1, x2) = (1.4, 0.2), y = 0.
Learning rate η = 0.01. Model: b = 0.1, w1 = 0.5, w2 = −0.1. Forward pass gave ŷ = 0.6856.

Gradient for one sample:
    L'_θ = x(ŷ − y) = [1  1.4  0.2]ᵀ · 0.6856
         = [0.6856  0.9599  0.1371]ᵀ = [L'_b  L'_w1  L'_w2]ᵀ

Update:
    b  = 0.1  − η·0.6856 ≈ 0.0931
    w1 = 0.5  − η·0.9599 ≈ 0.4904
    w2 = −0.1 − η·0.1371 ≈ −0.1014
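The corresponding gradient and parameter update for that single sample, as a NumPy sketch (names are mine):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

eta = 0.01
theta = np.array([0.1, 0.5, -0.1])    # [b, w1, w2]
x = np.array([1.0, 1.4, 0.2])         # leading 1 is the bias input
y = 0.0

y_hat = sigmoid(x @ theta)            # ~0.6856
grad = x * (y_hat - y)                # ~[0.6856, 0.9599, 0.1371]
theta = theta - eta * grad            # ~[0.0931, 0.4904, -0.1014]
print(grad, theta)
```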
Logistic Regression – Mini-batch

Take a mini-batch of m = 2 samples:
    x⁽¹⁾ = (1.5, 0.2), y⁽¹⁾ = 0   and   x⁽²⁾ = (4.1, 1.3), y⁽²⁾ = 1
Model: b = 0.1, w1 = 0.5, w2 = −0.1.  Learning rate η = 0.01.

Forward pass:
    z = w1·x1 + w2·x2 + b = [0.83  2.02]ᵀ
    ŷ = σ(z) = [0.6963  0.8828]ᵀ
    L = −yᵀ log ŷ − (1 − y)ᵀ log(1 − ŷ) = [1.1918  0.1245]ᵀ

Backward pass:
    L'_θ = (1/m) xᵀ(ŷ − y) ≈ [0.28961  0.28217  −0.0064]ᵀ

Update:
    b  = 0.1  − η·0.28961 = 0.097103
    w1 = 0.5  − η·0.28217 = 0.49717
    w2 = −0.1 + η·0.0064  = −0.09993
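A sketch of one mini-batch step with these two samples (assuming NumPy; names are mine):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

eta = 0.01
theta = np.array([0.1, 0.5, -0.1])        # [b, w1, w2]

# Mini-batch of m = 2 samples (leading 1 is the bias input)
X = np.array([[1.0, 1.5, 0.2],
              [1.0, 4.1, 1.3]])
y = np.array([0.0, 1.0])

y_hat = sigmoid(X @ theta)                # ~[0.6963, 0.8828]
loss = -y * np.log(y_hat) - (1 - y) * np.log(1 - y_hat)   # ~[1.1918, 0.1245]
grad = X.T @ (y_hat - y) / len(y)         # roughly [0.2896, 0.2820, -0.0065]
theta = theta - eta * grad                # ~[0.0971, 0.4972, -0.0999]
print(loss, grad, theta)
```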
Logistic Regression – Batch

Use all N = 4 samples at once. Model: b = 0.1, w1 = 0.5, w2 = −0.1.

        | 1  1.4  0.2 |        | 0 |
    x = | 1  1.5  0.2 |    y = | 0 |
        | 1  3.0  1.1 |        | 1 |
        | 1  4.1  1.3 |        | 1 |

Forward pass:
    z = w1·x1 + w2·x2 + b = [0.78  0.83  1.49  2.02]ᵀ
    ŷ = σ(z) = [0.6856  0.6963  0.8160  0.8828]ᵀ
    L = −yᵀ log ŷ − (1 − y)ᵀ log(1 − ŷ) = [1.1573  1.1918  0.2032  0.1245]ᵀ

Average loss = 0.6692
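The same full-batch forward pass as a NumPy sketch (names are mine):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

theta = np.array([0.1, 0.5, -0.1])        # [b, w1, w2]

# Full batch: all four samples (leading 1 is the bias input)
X = np.array([[1.0, 1.4, 0.2],
              [1.0, 1.5, 0.2],
              [1.0, 3.0, 1.1],
              [1.0, 4.1, 1.3]])
y = np.array([0.0, 0.0, 1.0, 1.0])

z = X @ theta                             # ~[0.78, 0.83, 1.49, 2.02]
y_hat = sigmoid(z)                        # ~[0.6856, 0.6963, 0.8160, 0.8828]
losses = -y * np.log(y_hat) - (1 - y) * np.log(1 - y_hat)
print(losses, losses.mean())              # mean ~0.6692
```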
Logistic Regression – Batch (Backward step)

Classify Iris flowers based on petal length and petal width (x1, x2).
Learning rate η = 0.01.  Model: b = 0.1, w1 = 0.5, w2 = −0.1.

Gradient over the batch:
    L'_θ = (1/N) xᵀ(ŷ − y)

         | 1.0  1.0  1.0  1.0 |
    xᵀ = | 1.4  1.5  3.0  4.1 | ,   ŷ − y = [0.6856  0.6963  −0.1840  −0.1172]ᵀ
         | 0.2  0.2  1.1  1.3 |

    L'_θ = (1/4) xᵀ(ŷ − y) ≈ [0.2702  0.2431  −0.0195]ᵀ = [L'_b  L'_w1  L'_w2]ᵀ

Update:
    b  = 0.1  − η·0.2702 ≈ 0.0972
    w1 = 0.5  − η·0.2431 ≈ 0.4975
    w2 = −0.1 + η·0.0195 ≈ −0.0998
Logistic Regression – Batch (Forward step)

Classify Iris flowers based on petal length and petal width.
Forward pass with the updated model: b = 0.0972, w1 = 0.4975, w2 = −0.0998.

    z = w1·x1 + w2·x2 + b = [0.77  0.82  1.48  2.00]ᵀ
    ŷ = σ(z) = [0.6843  0.6950  0.8146  0.8815]ᵀ
    L = −yᵀ log ŷ − (1 − y)ᵀ log(1 − ŷ) = [1.1531  1.1875  0.2050  0.1260]ᵀ

Average loss = 0.6679
The loss decreases from 0.6692 to 0.6679.
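One full batch iteration, reproducing the loss drop from roughly 0.6692 to 0.6679 (a sketch, assuming NumPy; names are mine):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def mean_bce(y_hat, y):
    return np.mean(-y * np.log(y_hat) - (1 - y) * np.log(1 - y_hat))

eta = 0.01
theta = np.array([0.1, 0.5, -0.1])        # [b, w1, w2]
X = np.array([[1.0, 1.4, 0.2],
              [1.0, 1.5, 0.2],
              [1.0, 3.0, 1.1],
              [1.0, 4.1, 1.3]])
y = np.array([0.0, 0.0, 1.0, 1.0])

y_hat = sigmoid(X @ theta)
print(mean_bce(y_hat, y))                 # ~0.6692 before the update

theta = theta - eta * X.T @ (y_hat - y) / len(y)   # one batch gradient step
print(theta)                              # close to the slide's [0.0972, 0.4975, -0.0998]
print(mean_bce(sigmoid(X @ theta), y))    # ~0.6679 after the update
```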
Logistic Regression – Question

Classify Iris flowers based on petal length and petal width:
    x1 = [1.4  1.5  3.0  4.1]ᵀ,   x2 = [0.2  0.2  1.1  1.3]ᵀ

After the next backward step, what are the model parameters?
    b = ?   w1 = ?   w2 = ?
Tanh Function

    tanh(x) = (e^x − e^(−x)) / (e^x + e^(−x)) = (e^(2x) − 1) / (e^(2x) + 1)
            = 1 − 2/(e^(2x) + 1)

    tanh(x) = (e^x − e^(−x)) / (e^x + e^(−x)) = (1 − e^(−2x)) / (1 + e^(−2x))
            = −(e^(−2x) − 1) / (e^(−2x) + 1) = 2/(e^(−2x) + 1) − 1
Tanh Function

Derivative via the quotient rule, using tanh(x) = (e^x − e^(−x)) / (e^x + e^(−x)):

    tanh'(x) = [ (e^x + e^(−x))(e^x + e^(−x)) − (e^x − e^(−x))(e^x − e^(−x)) ] / (e^x + e^(−x))²
             = [ (e^x + e^(−x))² − (e^x − e^(−x))² ] / (e^x + e^(−x))²
             = 1 − [ (e^x − e^(−x)) / (e^x + e^(−x)) ]²
             = 1 − tanh²(x)
AI VIETNAM
AI Insight Course
Tanh Function

Derivative via the form tanh(x) = 2/(e^(−2x) + 1) − 1:

    tanh'(x) = [ 2/(e^(−2x) + 1) − 1 ]'
             = 4e^(−2x) / (e^(−2x) + 1)²
             = 4 [ (e^(−2x) + 1) − 1 ] / (e^(−2x) + 1)²
             = 4/(e^(−2x) + 1) − 4/(e^(−2x) + 1)²
             = 1 − [ 4/(e^(−2x) + 1)² − 4/(e^(−2x) + 1) + 1 ]
             = 1 − [ 2/(e^(−2x) + 1) − 1 ]²
             = 1 − tanh²(x)
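A quick numerical check (my own NumPy snippet, not from the slides) that the derived identity tanh'(x) = 1 − tanh²(x) holds:

```python
import numpy as np

x = np.linspace(-3.0, 3.0, 7)
analytic = 1.0 - np.tanh(x) ** 2            # derivative from the derivation above

eps = 1e-6
numeric = (np.tanh(x + eps) - np.tanh(x - eps)) / (2 * eps)
print(np.max(np.abs(analytic - numeric)))   # tiny, confirming tanh'(x) = 1 - tanh^2(x)
```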
Logistic Regression – Tanh

Model and loss:
    z = θᵀx
    ŷ = tanh(z) = (e^z − e^(−z)) / (e^z + e^(−z))
    L = (1/N) [ −yᵀ log ŷ − (1 − y)ᵀ log(1 − ŷ) ]

Derivative:
    ∂L/∂θ = (∂L/∂ŷ)(∂ŷ/∂z)(∂z/∂θ)
    ∂L/∂ŷ = (1/N) [ −y/ŷ + (1 − y)/(1 − ŷ) ] = (1/N) (ŷ − y) / (ŷ(1 − ŷ))
    ∂ŷ/∂z = 1 − ŷ²
    ∂z/∂θ = x
    ∂L/∂θ = (1/N) xᵀ [ (ŷ − y)(1 + ŷ) / ŷ ]
Logistic Regression – MSE

Model and loss:
    z = θᵀx
    ŷ = σ(z) = 1 / (1 + e^(−z))
    L = (ŷ − y)²

Derivative:
    ∂L/∂θ = (∂L/∂ŷ)(∂ŷ/∂z)(∂z/∂θ)
    ∂L/∂ŷ = 2(ŷ − y)
    ∂ŷ/∂z = ŷ(1 − ŷ)
    ∂z/∂θ = x
    ∂L/∂θ = (2/N) xᵀ [ (ŷ − y) ŷ (1 − ŷ) ]
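A sketch of this MSE gradient (assuming NumPy; two of the samples from the earlier slides are reused for the call, the function name is mine):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def mse_gradient(X, y, theta):
    """dL/dtheta = (2/N) X^T [(y_hat - y) * y_hat * (1 - y_hat)] for sigmoid + MSE."""
    y_hat = sigmoid(X @ theta)
    return 2.0 * X.T @ ((y_hat - y) * y_hat * (1 - y_hat)) / X.shape[0]

X = np.array([[1.0, 1.4, 0.2],
              [1.0, 4.1, 1.3]])
y = np.array([0.0, 1.0])
theta = np.array([0.1, 0.5, -0.1])
print(mse_gradient(X, y, theta))
```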
Summary

1) Pick all the samples from the training data.
2) Compute the output ŷ:
       z = θᵀx
       ŷ = σ(z) = 1 / (1 + e^(−z))
3) Compute the loss (binary cross-entropy):
       L(θ) = (1/N) [ −yᵀ log ŷ − (1 − y)ᵀ log(1 − ŷ) ]
4) Compute the derivative:
       L'_θ = (1/N) xᵀ(ŷ − y)
5) Update the parameters:
       θ = θ − η L'_θ,   where η is the learning rate

[Plot] The sigmoid function y = 1 / (1 + e^(−x)) over (−∞, +∞).
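Putting the five steps together, a minimal full-batch training loop (a sketch, assuming NumPy and the toy Iris data from the earlier slides):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Full-batch logistic regression with gradient descent, following steps 1-5.
X = np.array([[1.0, 1.4, 0.2],
              [1.0, 1.5, 0.2],
              [1.0, 3.0, 1.1],
              [1.0, 4.1, 1.3]])           # leading 1 is the bias input
y = np.array([0.0, 0.0, 1.0, 1.0])
theta = np.array([0.1, 0.5, -0.1])        # [b, w1, w2]
eta = 0.01                                # learning rate

for epoch in range(1000):
    y_hat = sigmoid(X @ theta)                        # 2) compute output
    loss = np.mean(-y * np.log(y_hat)
                   - (1 - y) * np.log(1 - y_hat))     # 3) binary cross-entropy
    grad = X.T @ (y_hat - y) / len(y)                 # 4) gradient
    theta = theta - eta * grad                        # 5) update parameters

print(theta, loss)
```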