Published in Towards Data Science

SAGAR SHARMA

Sep 6, 2017 · 5 min read

Activation Functions in Neural Networks


Sigmoid, tanh, Softmax, ReLU, Leaky ReLU EXPLAINED !!!

What is Activation Function?

It’s simply the function you use to get the output of a node. It is also known as a Transfer
Function.

Why do we use Activation Functions with Neural Networks?

It is used to determine the output of a neural network, like yes or no. It maps the resulting
values into a range such as 0 to 1 or -1 to 1 (depending upon the function).

The activation functions can basically be divided into two types:

1. Linear Activation Function

2. Non-linear Activation Functions

FYI: The Cheat sheet is given below.

Linear or Identity Activation Function


As you can see, the function is a line, i.e. linear. Therefore, the output of the function is
not confined to any range.

Fig: Linear Activation Function

Equation : f(x) = x
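As a quick illustration (my own sketch, not from the original post), the identity activation simply passes its input through, and its slope is the constant 1:

```python
import numpy as np

def linear(x):
    # Identity / linear activation: f(x) = x, so the output is unbounded
    return x

def linear_derivative(x):
    # The slope is the constant 1 everywhere
    return np.ones_like(x)

x = np.array([-5.0, 0.0, 3.0])
print(linear(x))             # [-5.  0.  3.]
print(linear_derivative(x))  # [1. 1. 1.]
```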

Non-linear Activation Function


The non-linear activation functions are the most used activation functions.
Non-linearity helps the model produce a graph that looks something like this:

Fig: Non-linear Activation Function

It makes it easy for the model to generalize or adapt to a variety of data and to
differentiate between the outputs.

The main terminology needed to understand non-linear functions is:

Derivative or Differential: the change in the y-axis with respect to the change in the x-axis. It is also known as the slope.

Monotonic function: a function which is either entirely non-increasing or entirely non-decreasing.
1. Sigmoid or Logistic Activation Function

Fig: Sigmoid Function

The main reason we use the sigmoid function is that its output exists between (0, 1).
Therefore, it is especially useful for models where we have to predict a probability as
an output. Since the probability of anything exists only between 0 and 1,
sigmoid is the right choice.

The function is differentiable. That means we can find the slope of the sigmoid curve
at any point.

The function is monotonic, but the function’s derivative is not.

The logistic sigmoid function can cause a neural network to get stuck during
training.

The softmax function is a more generalized logistic activation function which is used
for multiclass classification.
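To make this concrete, here is a minimal NumPy sketch (my own example, not from the original post) of the sigmoid and its softmax generalization; the function names and inputs are illustrative assumptions:

```python
import numpy as np

def sigmoid(x):
    # sigmoid(x) = 1 / (1 + e^-x); the output always lies in (0, 1)
    return 1.0 / (1.0 + np.exp(-x))

def softmax(z):
    # Generalization of the logistic function to a vector of scores.
    # Subtracting the max keeps exp() numerically stable; outputs sum to 1.
    e = np.exp(z - np.max(z))
    return e / e.sum()

x = np.array([-2.0, 0.0, 2.0])
print(sigmoid(x))   # ~[0.119, 0.5, 0.881] -- each value looks like a probability
print(softmax(x))   # ~[0.016, 0.117, 0.867] -- a probability distribution over 3 classes
```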

2. Tanh or hyperbolic tangent Activation Function



Fig: tanh v/s Logistic Sigmoid

The advantage is that tanh squashes its input into the range (-1, 1): negative inputs are
mapped strongly negative and zero inputs are mapped near zero in the tanh graph.

The function is differentiable.

The function is monotonic while its derivative is not monotonic.

The tanh function is mainly used for classification between two classes.

Both tanh and logistic sigmoid activation functions are used in feed-forward nets.
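For comparison, here is a small sketch (again my own illustration) showing how tanh and the logistic sigmoid map the same inputs:

```python
import numpy as np

def tanh(x):
    # tanh(x) = (e^x - e^-x) / (e^x + e^-x); output lies in (-1, 1) and is zero-centered
    return np.tanh(x)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

x = np.array([-2.0, 0.0, 2.0])
print(tanh(x))     # ~[-0.964, 0.0, 0.964] -- negative inputs map strongly negative
print(sigmoid(x))  # ~[ 0.119, 0.5, 0.881] -- everything is squeezed into (0, 1)
```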

3. ReLU (Rectified Linear Unit) Activation Function


The ReLU is the most used activation function in the world right now, since it is used in
almost all convolutional neural networks and deep learning models.

Fig: ReLU v/s Logistic Sigmoid

As you can see, the ReLU is half rectified (from bottom). f(z) is zero when z is less than
zero and f(z) is equal to z when z is above or equal to zero.

Range: [0 to infinity)

The function and its derivative both are monotonic.

But the issue is that all negative values become zero immediately, which decreases
the ability of the model to fit or train on the data properly. Any negative
input given to the ReLU activation function turns into zero immediately, so negative
values are not mapped appropriately (this is the dying ReLU problem mentioned below).
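A minimal sketch of ReLU and its derivative (my own illustration; the helper names are assumptions):

```python
import numpy as np

def relu(z):
    # f(z) = 0 for z < 0, f(z) = z for z >= 0
    return np.maximum(0.0, z)

def relu_derivative(z):
    # Gradient is 0 for negative inputs and 1 for positive inputs,
    # which is why a neuron that only sees negative inputs stops learning ("dying ReLU").
    return (z > 0).astype(float)

z = np.array([-3.0, -0.5, 0.0, 2.0])
print(relu(z))             # [0. 0. 0. 2.]
print(relu_derivative(z))  # [0. 0. 0. 1.]
```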

4. Leaky ReLU

It is an attempt to solve the dying ReLU problem.

Fig : ReLU v/s Leaky ReLU

Can you see the Leak?

The leak helps to increase the range of the ReLU function. Usually, the value of a (the
slope for negative inputs) is 0.01 or so.

When a is not fixed at 0.01 but is chosen randomly, it is called Randomized ReLU.

Therefore the range of the Leaky ReLU is (-infinity to infinity).

Both the Leaky and Randomized ReLU functions are monotonic in nature. Their
derivatives are also monotonic in nature.
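Here is a small sketch of the Leaky ReLU (my own example, with a = 0.01 as mentioned above):

```python
import numpy as np

def leaky_relu(z, a=0.01):
    # f(z) = z for z >= 0, f(z) = a*z for z < 0 -- the small slope "a" is the leak
    return np.where(z >= 0, z, a * z)

z = np.array([-3.0, -0.5, 0.0, 2.0])
print(leaky_relu(z))  # [-0.03 -0.005 0. 2.] -- negative inputs are scaled down, not zeroed
```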

Why is derivative/differentiation used?

When updating the curve, the derivative tells us in which direction and by how much to
change or update the curve, depending upon the slope. That is why we use
differentiation in almost every part of Machine Learning and Deep Learning.
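To make that concrete, here is a tiny gradient-descent sketch (my own illustration, not from the post): the derivative of the loss tells us both the direction and the size of each weight update.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# One sigmoid neuron and one training example (both made up for illustration).
x, y = 2.0, 1.0          # input and target
w, lr = 0.1, 0.5         # weight and learning rate

for step in range(20):
    p = sigmoid(w * x)                  # forward pass
    loss = (p - y) ** 2                 # squared error
    # Chain rule: dloss/dw = 2*(p - y) * sigmoid'(w*x) * x, with sigmoid' = p*(1 - p)
    grad = 2 * (p - y) * p * (1 - p) * x
    w -= lr * grad                      # move against the slope

print(round(loss, 4))  # the loss shrinks as w follows the (negative) derivative
```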

Fig: Activation Function Cheat Sheet



Fig: Derivative of Activation Functions



If you liked it

So, follow me on Medium and LinkedIn to see similar posts.

If you have any comments or questions, write them in the comments.

Clap it! Share it! Follow Me!

Previous stories you will love:

What is Linear Regression and How does it work? - theffork


It is a method used for predicting future values by finding a linear
pattern in the previously given data. The linear…
theffork.com

What the Hell is “Tensor” in “TensorFlow”?


I didn’t know it…
hackernoon.com
Epoch vs Batch Size vs Iterations
Know your code…
towardsdatascience.com

Monte Carlo Tree Search
MCTS For Every Data Science Enthusiast
towardsdatascience.com
