Lesson 13
Activation functions should be monotonic, differentiable, and should allow training to converge quickly.

Linear

f(x) = ax + b
f(x) = a_1 x_1 + a_2 x_2 + a_3 x_3 + ... + b   (perceptron form)

df(x)/dx = a

Observations:
• Constant gradient
• The gradient does not depend on changes in the input

[Figure: perceptron with inputs i1, i2, i3 feeding a hidden unit h1]
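A minimal NumPy sketch (illustrative, not from the slides; the helper names are assumptions) of the linear activation and its constant gradient:

import numpy as np

def linear(x, a=1.0, b=0.0):
    # Linear activation f(x) = a*x + b
    return a * x + b

def linear_grad(x, a=1.0):
    # df/dx = a: constant, independent of the input x
    return np.full_like(np.asarray(x, dtype=float), a)

x = np.array([-2.0, 0.0, 3.0])
print(linear(x, a=0.5, b=1.0))   # [0.  1.  2.5]
print(linear_grad(x, a=0.5))     # [0.5 0.5 0.5]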
Non-Linear
• Sigmoid (Logistic)
• Hyperbolic Tangent (Tanh)
• Rectified Linear Unit (ReLU)
• Leaky ReLU
• Parametric ReLU
• Exponential Linear Unit (ELU)
Sigmoid Activation Function (Logistic)
f(x) = 1 / (1 + e^{-x})

df(x)/dx = f(x)(1 - f(x))
Observations:
• Output: 0 to 1
• Outputs are not zero-centered
• Can saturate and kill (vanish) gradients
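A small NumPy sketch (illustrative; the helper names are assumptions) showing the sigmoid, its gradient, and how the gradient vanishes once the input saturates:

import numpy as np

def sigmoid(x):
    # f(x) = 1 / (1 + e^{-x}), output in (0, 1)
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_grad(x):
    # df/dx = f(x) * (1 - f(x)); at most 0.25, near 0 for large |x|
    s = sigmoid(x)
    return s * (1.0 - s)

x = np.array([-10.0, 0.0, 10.0])
print(sigmoid(x))       # ~[0.     0.5   1.   ]
print(sigmoid_grad(x))  # ~[0.     0.25  0.   ]  -> saturation kills the gradient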
Tanh Activation Function
f(x) = (e^x - e^{-x}) / (e^x + e^{-x})

df(x)/dx = 1 - f(x)^2
Observations:
• Output: -1 to +1
• Outputs are zero-centered
• Can saturate and kill (vanish) gradients
• The gradient is steeper than the sigmoid's, resulting in faster convergence
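A corresponding sketch for tanh (same illustrative assumptions as above); note the peak gradient of 1 versus the sigmoid's 0.25:

import numpy as np

def tanh_act(x):
    # f(x) = (e^x - e^{-x}) / (e^x + e^{-x}), output in (-1, 1)
    return np.tanh(x)

def tanh_grad(x):
    # df/dx = 1 - f(x)^2; peaks at 1, so it is steeper than sigmoid
    return 1.0 - np.tanh(x) ** 2

x = np.array([-3.0, 0.0, 3.0])
print(tanh_act(x))   # ~[-0.995  0.     0.995]
print(tanh_grad(x))  # ~[ 0.01   1.     0.01 ]  -> still saturates for large |x|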
Rectified Linear Unit (ReLU)

f(x) = max(0, x)

df(x)/dx = 1 for x > 0; 0 for x < 0
Observations:
• Greatly increases training speed compared to tanh and sigmoid
• Reduces the likelihood of killing (vanishing) the gradient
• Activations can blow up (the output is unbounded above)
• Dead nodes: units that only receive negative inputs get zero gradient and stop learning
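A ReLU sketch (illustrative; the zero gradient at x = 0 is a convention, not specified on the slide):

import numpy as np

def relu(x):
    # f(x) = max(0, x)
    return np.maximum(0.0, x)

def relu_grad(x):
    # df/dx = 1 for x > 0, 0 otherwise (value at x = 0 chosen by convention)
    return (x > 0).astype(float)

x = np.array([-2.0, 0.0, 5.0])
print(relu(x))       # [0. 0. 5.]
print(relu_grad(x))  # [0. 0. 1.]  -> units stuck with x <= 0 get no gradient ("dead nodes")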
Leaky ReLU

f(x) = max(0.01x, x)

Observations:
• Fixes the dying ReLU problem: the small slope (0.01) for x < 0 keeps the gradient non-zero
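A Leaky ReLU sketch (illustrative; the 0.01 slope matches the slide's definition):

import numpy as np

def leaky_relu(x, slope=0.01):
    # f(x) = max(slope * x, x): the small negative slope keeps the gradient alive for x < 0
    return np.maximum(slope * x, x)

x = np.array([-4.0, 0.0, 2.0])
print(leaky_relu(x))  # [-0.04  0.    2.  ]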
Parametric ReLU (PReLU)

f(x) = max(αx, x)

df(x)/dx = α for x < 0; 1 for x ≥ 0

Observations:
• Like Leaky ReLU, but the slope α is learned during training rather than fixed
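A PReLU sketch (illustrative; here α is passed in by hand, whereas in practice it is a trainable parameter and is assumed to be < 1):

import numpy as np

def prelu(x, alpha):
    # f(x) = alpha*x for x < 0, x for x >= 0 (equivalent to max(alpha*x, x) when alpha < 1)
    return np.where(x >= 0, x, alpha * x)

def prelu_grad(x, alpha):
    # df/dx = alpha for x < 0, 1 for x >= 0
    return np.where(x >= 0, 1.0, alpha)

x = np.array([-3.0, 2.0])
print(prelu(x, alpha=0.2))       # [-0.6  2. ]
print(prelu_grad(x, alpha=0.2))  # [0.2  1. ]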
Exponential Linear Unit (ELU)

f(x) = α(e^x - 1) for x < 0; x for x ≥ 0

df(x)/dx = f(x) + α for x < 0; 1 for x ≥ 0

Observations:
• Can produce negative outputs
• The activation can still blow up for large positive inputs
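An ELU sketch (illustrative; α = 1 is an assumed default, not stated on the slide):

import numpy as np

def elu(x, alpha=1.0):
    # f(x) = alpha * (e^x - 1) for x < 0, x for x >= 0
    return np.where(x >= 0, x, alpha * (np.exp(x) - 1.0))

def elu_grad(x, alpha=1.0):
    # df/dx = f(x) + alpha for x < 0, 1 for x >= 0
    return np.where(x >= 0, 1.0, elu(x, alpha) + alpha)

x = np.array([-5.0, -1.0, 3.0])
print(elu(x))       # ~[-0.993 -0.632  3.   ]  -> negative outputs are possible
print(elu_grad(x))  # ~[ 0.007  0.368  1.   ]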
Complete Chain

[Figure: single-hidden-layer network with inputs X1, X2, X3, hidden-layer weights U, output-layer weights V, and outputs compared against targets y/y' (example value pairs 0.6/1 and 0.4/0)]
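A rough forward-pass sketch of the chain above (the layer sizes, random weights, sigmoid activations, and squared-error loss are all assumptions made for illustration):

import numpy as np

rng = np.random.default_rng(0)
x = np.array([0.5, -1.0, 2.0])     # inputs X1, X2, X3
U = rng.normal(size=(4, 3))        # hidden-layer weights (4 hidden units assumed)
V = rng.normal(size=(1, 4))        # output-layer weights

sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

h = sigmoid(U @ x)                 # hidden activations
y_hat = sigmoid(V @ h)             # prediction y'
y = 1.0                            # target y
loss = 0.5 * (y - y_hat) ** 2      # error fed back through the same chain
print(y_hat, loss)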
Deep Network

[Figure: deeper network with inputs X1, X2, X3 and successive weight matrices U, W, V]
Deep Network - Vanishing/Exploding Gradient

[Figure: deep network with inputs X1, X2, X3 and weight matrices U, W, V]

δE/δU_11 = (δb_1/δU_11) × (δg_1/δb_1) × (δa_1/δg_1) × (δh_1/δa_1) × (δz_1/δh_1) × (δy_1/δz_1) × (δE/δy_1)
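A toy numerical illustration (assumed values, not from the slides) of why the long product in this chain rule makes the gradient vanish or explode as the network gets deeper:

import numpy as np

depth = 30
small_factors = np.full(depth, 0.25)   # e.g. saturated sigmoid derivatives
large_factors = np.full(depth, 1.5)    # e.g. factors coming from large weights

print(np.prod(small_factors))   # ~8.7e-19  -> vanishing gradient
print(np.prod(large_factors))   # ~1.9e+05  -> exploding gradient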
Summary
• We learned the characteristics of different activation functions and their gradients
• The choice of activation function depends on the nature of the problem, the nature of the target output, and the depth of the network