Lecture_8.2

A step-by-step forward pass and backpropagation example

There are multiple libraries (PyTorch, TensorFlow) that can assist you in
implementing almost any neural network architecture. This article is not
about solving a neural net with one of those libraries; there are already
plenty of articles and videos on that. Instead, we'll walk through a
step-by-step forward pass (forward propagation) and backward pass
(backpropagation) example. We'll take a neural network with a single hidden
layer and solve one complete cycle of forward propagation and
backpropagation.

Getting to the point, we will work step by step to understand how weights
are updated in neural networks. A neural network learns by updating its
weight parameters during the training phase. Several concepts are needed to
fully understand the working mechanism of neural networks: linear algebra,
probability, and calculus. I'll revisit calculus for the chain rule and set
linear algebra (vectors, matrices, tensors) aside for this article. We'll
work through every computation, and in the end we'll update all the weights
of the example network for one complete cycle of forward propagation and
backpropagation. Let's get started.

Here’s a simple neural network on which we’ll be working.


Example Neural Network

I think the above example neural network is self-explanatory. There are two
units in the Input Layer, two units in the Hidden Layer, and two units in the
Output Layer. The weights w1, w2, …, w8 are the respective connection
weights; b1 and b2 are the biases for the Hidden Layer and the Output Layer,
respectively.

In this article, we'll pass in two inputs, i1 and i2, perform a forward
pass to compute the total error, and then perform a backward pass to
distribute the error through the network and update the weights accordingly.

Before getting started, let us deal with two basic concepts which should be
sufficient to comprehend this article.

Peeking inside a single neuron


Inside h1 (first unit of the hidden layer)

Inside a unit, two operations happen: (i) computation of the weighted sum and
(ii) squashing of the weighted sum using an activation function. The result
of the activation function becomes an input to the next layer (unless the
unit is in the Output Layer). In this example, we'll use the Sigmoid
function (Logistic function) as the activation function. The Sigmoid
function takes any real-valued input and squashes it into the range (0, 1).
We'll discuss activation functions in later articles. For now, what you
should note is that the two operations stated above happen inside every
neural network unit. We can think of the input layer as applying a linear
(identity) function that produces the same value as its input.
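As a minimal sketch, the two operations inside a single unit such as h1 can be written in Python. The numeric values below are hypothetical placeholders, not the values from the article's figure:

```python
import math

def sigmoid(x):
    # Logistic function: squashes any real input into the range (0, 1)
    return 1.0 / (1.0 + math.exp(-x))

def unit(inputs, weights, bias):
    # Operation (i): weighted sum of the inputs plus the bias
    weighted_sum = sum(i * w for i, w in zip(inputs, weights)) + bias
    # Operation (ii): squash the sum with the activation function
    return sigmoid(weighted_sum)

# Hypothetical numbers, purely to show the two operations in order
out_h1 = unit([0.05, 0.10], [0.15, 0.20], 0.35)
```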

Chain Rule in Calculus
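The figure that originally illustrated this section is not reproduced here; the rule it relies on is the standard chain rule from calculus. If a quantity y depends on u, and u in turn depends on x, then:

```latex
\frac{dy}{dx} = \frac{dy}{du} \cdot \frac{du}{dx}
```

Backpropagation is essentially this rule applied repeatedly, one layer at a time.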

The Forward Pass


Remember that each unit of a neural network performs two operations:
compute the weighted sum and pass that sum through an activation function.
The outcome of the activation function determines how strongly that
particular unit fires, i.e., how much it contributes to the next layer.

Let’s get started with the forward pass.

For h1,

Now we pass this weighted sum through the logistic function (sigmoid
function) to squash it into the range (0, 1). The logistic function is the
activation function for our example neural network. Similarly, for h2 we
compute the weighted sum sumh2 and its activation value outputh2.

Now, outputh1 and outputh2 will be considered as inputs to the next layer.
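Putting the whole forward pass together as a sketch (all numeric values here are hypothetical, since the figure's values are not reproduced in the text):

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

# Hypothetical inputs, weights, and biases -- placeholders for illustration
i1, i2 = 0.05, 0.10
w1, w2, w3, w4 = 0.15, 0.20, 0.25, 0.30   # input -> hidden
w5, w6, w7, w8 = 0.40, 0.45, 0.50, 0.55   # hidden -> output
b1, b2 = 0.35, 0.60

# Hidden layer: weighted sum, then activation
sum_h1 = w1 * i1 + w2 * i2 + b1
sum_h2 = w3 * i1 + w4 * i2 + b1
out_h1, out_h2 = sigmoid(sum_h1), sigmoid(sum_h2)

# Output layer: the hidden activations become the inputs
sum_o1 = w5 * out_h1 + w6 * out_h2 + b2
sum_o2 = w7 * out_h1 + w8 * out_h2 + b2
out_o1, out_o2 = sigmoid(sum_o1), sigmoid(sum_o2)

print(out_o1, out_o2)
```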

For o1,
Computing the total error
We started off supposing the expected outputs to be 0.05 and 0.95,
respectively, for outputo1 and outputo2. Now we will compute the errors
based on the outputs received so far and the expected outputs.

We’ll use the following error formula,
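The formula image is not reproduced above; a squared-error formula commonly used in walkthroughs of this kind, and consistent with the per-output errors E1 and E2 used later, is E = ½(target − output)², with the total error being the sum over the output units. As a sketch:

```python
def squared_error(target, output):
    # Half squared error: the factor of 1/2 cancels when differentiating
    return 0.5 * (target - output) ** 2

# Expected outputs are taken from the article; the actual outputs are
# hypothetical placeholders for the forward-pass results.
target_o1, target_o2 = 0.05, 0.95
out_o1, out_o2 = 0.75, 0.77
E1 = squared_error(target_o1, out_o1)
E2 = squared_error(target_o2, out_o2)
E_total = E1 + E2
```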


The Backpropagation
The aim of backpropagation (the backward pass) is to distribute the total
error back through the network so that the weights can be updated to minimize
the cost function (loss). The weights are updated in such a way that when the
next forward pass uses the updated weights, the total error is reduced by a
certain margin (until the minimum is reached).

For weights in the output layer (w5, w6, w7, w8)


For w5,

Let's compute how much contribution w5 has to E1. Once we are clear on how
w5 is updated, it will be easy to generalize the same procedure to the rest
of the weights. Looking closely at the example neural network, we can see
that E1 is affected by outputo1, outputo1 is affected by sumo1, and sumo1 is
affected by w5. It's time to recall the Chain Rule.
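Written out, the chain described above is ∂E1/∂w5 = (∂E1/∂outputo1) · (∂outputo1/∂sumo1) · (∂sumo1/∂w5). Assuming the half-squared-error E1 = ½(targeto1 − outputo1)² and a sigmoid activation, each factor has a closed form:

```python
def dE1_dw5(target_o1, out_o1, out_h1):
    # Factor 1: dE1/d(out_o1) for E1 = 0.5 * (target_o1 - out_o1)**2
    dE1_dout = -(target_o1 - out_o1)
    # Factor 2: d(out_o1)/d(sum_o1) for a sigmoid activation
    dout_dsum = out_o1 * (1.0 - out_o1)
    # Factor 3: d(sum_o1)/d(w5) is the input that w5 multiplies, out_h1
    dsum_dw5 = out_h1
    return dE1_dout * dout_dsum * dsum_dw5
```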
Component 2: partial derivative of Output w.r.t. Sum

The output section of a unit of a neural network uses non-linear activation


functions. The activation function used in this example is Logistic Function.
When we compute the derivative of the Logistic Function, we get:
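For reference (the derived expression is not reproduced above), the derivative of the logistic function σ(x) = 1/(1 + e^(−x)) is σ′(x) = σ(x)(1 − σ(x)), which is convenient because the value already computed in the forward pass can be reused:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def sigmoid_derivative(x):
    # sigma'(x) = sigma(x) * (1 - sigma(x))
    s = sigmoid(x)
    return s * (1.0 - s)

# Quick sanity check against a central finite difference
h = 1e-6
approx = (sigmoid(0.3 + h) - sigmoid(0.3 - h)) / (2 * h)
print(abs(sigmoid_derivative(0.3) - approx) < 1e-8)  # True
```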
For weights in the hidden layer (w1, w2, w3, w4)
Similar calculations are made to update the weights in the hidden layer;
however, this time the chain becomes a bit longer. It does not matter how
deep the neural network goes: all we need to find out is how much error a
particular weight contributes to the total error of the network. For that
purpose, we need the partial derivative of the Error w.r.t. that particular
weight. Let's work on updating w1, and we'll be able to generalize similar
calculations to update the rest of the weights.

For w1 (with respect to E1),

Let's quickly go through the above chain. We know that E1 is affected
by outputo1, outputo1 is affected by sumo1, sumo1 is affected by outputh1,
outputh1 is affected by sumh1, and finally sumh1 is affected by w1. It is
quite easy to comprehend, isn't it?
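Written out in full, the chain for w1 (with respect to E1) is:

```latex
\frac{\partial E_1}{\partial w_1}
= \frac{\partial E_1}{\partial \mathrm{output}_{o1}}
\cdot \frac{\partial \mathrm{output}_{o1}}{\partial \mathrm{sum}_{o1}}
\cdot \frac{\partial \mathrm{sum}_{o1}}{\partial \mathrm{output}_{h1}}
\cdot \frac{\partial \mathrm{sum}_{h1}}{\partial \mathrm{sum}_{h1}}^{-1}
\cdot \frac{\partial \mathrm{output}_{h1}}{\partial \mathrm{sum}_{h1}}
\cdot \frac{\partial \mathrm{sum}_{h1}}{\partial w_1}
```

Each factor mirrors one link in the chain of dependencies described above.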
For the first component of the above chain,

We've already computed the second component. This is one of the benefits of
the chain rule: as we go deeper into the network, previous computations
become reusable.
Once we've computed all the new weights, we replace the old weights with
them. When the weights are updated, one backpropagation cycle is finished.
Then another forward pass is performed and the new total error is computed,
and based on this newly computed total error the weights are updated again.
This goes on until the loss converges to a minimum. In this way, a neural
network starts with random values for its weights and finally converges to
optimum values.
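The update step described above is plain gradient descent. As a sketch (the learning rate value here is a hypothetical choice, not from the article):

```python
def update_weight(w_old, dE_dw, learning_rate=0.5):
    # Gradient descent: step against the gradient of the error so that
    # the next forward pass produces a smaller total error.
    # The learning rate of 0.5 is a hypothetical choice.
    return w_old - learning_rate * dE_dw

# One training cycle: forward pass -> total error -> gradients -> update
# every weight. Repeating the cycle drives the total error toward a minimum.
```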

I hope you found this article useful. I’ll see you in the next one.
