IMPLEMENT THE GRADIENT DESCENT ALGORITHM FOR A THREE-STAGE FULLY
CONNECTED NEURAL NETWORK USING PYTHON AND
BASIC PACKAGES SUCH AS NUMPY AND MATPLOTLIB
EXPERIMENT 1
STRUCTURE OF THE NEURAL NETWORK
• Three-Stage Network (a minimal code sketch follows this list):
• Input Layer: Takes the input data.
• Hidden Layer: Processes input from the input layer.
• Output Layer: Produces the final output.
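• A minimal sketch of the three stages in NumPy (the layer sizes and the sigmoid activation here are assumptions for illustration, not prescribed by the experiment):

    import numpy as np

    n_input, n_hidden, n_output = 3, 4, 1            # assumed layer sizes
    rng = np.random.default_rng(0)
    W1 = rng.standard_normal((n_input, n_hidden))    # input -> hidden weights
    W2 = rng.standard_normal((n_hidden, n_output))   # hidden -> output weights

    x = rng.standard_normal((1, n_input))            # one input sample
    hidden = 1 / (1 + np.exp(-(x @ W1)))             # hidden layer (sigmoid)
    output = hidden @ W2                             # output layer
    print(output.shape)                              # (1, 1)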
GRADIENT DESCENT
• Definition: Gradient Descent is an optimization algorithm used to
minimize the cost function by iteratively adjusting the weights.
• A first-order optimization algorithm, used primarily to find the minima of a
given function (finding maxima instead is called gradient ascent).
• Steps Involved:
1. Calculate the Predicted Output (Forward Propagation)
2. Compute the Loss/Error
3. Calculate the Gradient of the Loss (Backward Propagation)
4. Update the Weights Using the Gradients (update rule below)
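• Step 4 applies the standard update rule (η is the learning rate); each weight moves a small step against its own gradient:

    w_new = w_old − η · ∂L/∂w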
REAL-TIME WORKING OF THE GRADIENT DESCENT ALGORITHM
• Initialize Weights: Start with random weights.
• Forward Propagation: Calculate the predicted output using current weights.
• Compute Loss: Measure the error between predicted and actual output.
• Back Propagation: Calculate the gradient of the loss function with respect to the weights.
• Update Weights: Adjust weights in the direction of the negative gradient.
• Repeat: Iterate the above steps until convergence (a runnable sketch follows below).
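• A runnable sketch of this loop on the simplest possible model, a single weight w fitting y = 2x (the toy data, starting weight, and learning rate are assumptions for illustration):

    import numpy as np

    x = np.array([1.0, 2.0, 3.0, 4.0])
    y = 2.0 * x                  # targets: the true weight is 2
    w = 0.0                      # initialize the weight (often random)
    learning_rate = 0.05

    for step in range(100):
        y_pred = w * x                          # forward propagation
        loss = np.mean((y_pred - y) ** 2)       # compute loss (MSE)
        grad = np.mean(2 * (y_pred - y) * x)    # gradient of loss w.r.t. w
        w -= learning_rate * grad               # step against the gradient

    print(w)  # approaches 2.0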
THE MATHEMATICS BEHIND GRADIENT DESCENT
• We assume a parabola here, as it is helpful in visualizing the basic principles of
gradient descent.
• Parameters are updated based on the gradient.
• The core concept is moving, step by step, towards a minimum.
• A parabola has only one minimum, and that is therefore the global minimum.
• We take the cost (loss) function in Mean Squared Error (MSE) form.
• The aim is to minimize the cost.
• Along with the input and output, certain parameters (the weights) are involved;
these are what gradient descent adjusts (written out below).
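• Written out (a standard formulation, added here for reference): for predictions ŷᵢ and targets yᵢ over n samples,

    Cost (MSE):   J = (1/n) · Σᵢ (ŷᵢ − yᵢ)²
    Update:       w ← w − η · ∂J/∂w        (η is the learning rate)

• For the parabola J(w) = (w − a)², the slope ∂J/∂w = 2(w − a) is zero exactly at the single (global) minimum w = a.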
FINDING MINIMA
• Gradient Descent in Action:
• Start at an initial point.
• Compute the gradient (slope) of the cost function.
• Take steps proportional to the negative gradient.
• Continue until the slope is near zero, indicating a minimum (see the sketch after this list).
• Challenges:
• May get stuck in local minima.
• Requires careful tuning of parameters.
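• A runnable sketch on the parabola J(w) = (w − 3)², stopping when the slope is near zero (the start point, learning rate, and tolerance are illustrative assumptions):

    # Gradient descent on J(w) = (w - 3)^2, whose only (global) minimum is w = 3
    w = -5.0              # initial point (assumed)
    learning_rate = 0.1
    tolerance = 1e-6

    grad = 2 * (w - 3)               # slope of the cost at w
    while abs(grad) > tolerance:     # stop when the slope is near zero
        w -= learning_rate * grad    # step in the negative gradient direction
        grad = 2 * (w - 3)

    print(w)  # ~3.0, the minimum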
LEARNING RATE
• Definition: The learning rate is a hyperparameter that controls how much
the model changes (i.e., how big a step is taken) in response to the estimated
error each time the model weights are updated.
• Importance: Determines the size of the steps taken towards the minimum
of the cost function.
• Effects (compared in the sketch below):
• Too High: Can cause the algorithm to overshoot the minimum, converge to a
suboptimal solution, or even diverge.
• Too Low: Results in slow convergence, and the optimization may stall in a local minimum.
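• A small sketch comparing step sizes on the parabola J(w) = w² (the three rates are arbitrary choices to show the effect):

    # Effect of the learning rate on J(w) = w^2 (minimum at w = 0)
    for lr in (0.01, 0.1, 1.1):      # too low, reasonable, too high (assumed)
        w = 4.0
        for _ in range(20):
            w -= lr * 2 * w          # gradient of w^2 is 2w
        print(f"lr={lr}: w after 20 steps = {w:.4f}")
    # lr=0.01 crawls towards 0, lr=0.1 converges, lr=1.1 diverges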
BATCH SIZE
• Definition: The number of training examples utilized in one iteration.
• Variants:
• Batch Gradient Descent: Uses the entire dataset to compute the gradient.
• Stochastic Gradient Descent (SGD): Uses one training example per iteration.
• Mini-batch Gradient Descent: Uses a subset of the training data (a mini-batch) for each iteration.
• Importance: Affects the stability and speed of convergence (see the sketch below, which also shows the epoch loop).
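• A sketch of how mini-batches are drawn inside an epoch loop (the toy dataset and batch size are assumptions; the weight update itself is omitted):

    import numpy as np

    X = np.arange(12.0).reshape(6, 2)    # toy dataset: 6 examples (assumed)
    batch_size = 2                       # mini-batch size (assumed)

    for epoch in range(3):                        # each epoch = one full pass over the data
        perm = np.random.permutation(len(X))      # shuffle example order each epoch
        for start in range(0, len(X), batch_size):
            batch = X[perm[start:start + batch_size]]
            print("epoch", epoch, "batch shape", batch.shape)
            # forward pass, loss, gradient, and weight update would go here
            # batch GD: batch = X (all rows);  SGD: batch_size = 1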
NUMBER OF ITERATIONS (EPOCHS)
• Definition: The number of times the entire training dataset is passed
forward and backward through the neural network.
• Importance: Determines how many times the weights are updated.
• Effects:
• More Iterations: Typically improve training accuracy, but require more
computational resources and can eventually overfit.
• Fewer Iterations: Can lead to underfitting, where the model fails to learn
adequately from the training data.
CODING
• Importing Libraries
1. NumPy: Provides support for arrays and matrices.
2. Matplotlib: Used for plotting graphs.
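• The two imports, under their conventional aliases:

    import numpy as np                 # arrays, matrices, and vectorized math
    import matplotlib.pyplot as plt    # plotting (e.g., the loss curve during training)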
CODING HINTS
• Defining the Neural Network Class: encapsulate the neural network logic within a class for better organization.
• Initialization of Weights: Randomly initialize the weights and set a learning rate for gradient descent.
• Forward Propagation: Compute the dot product of inputs and weights, followed by the activation function.
• Backward Propagation and Gradient Calculation: Use the derivative of the sigmoid activation to calculate the gradients of the loss with respect to the weights.
• Gradient Descent Method: Iteratively update the weights to minimize the cost function.
• Training the Network: Define the dataset and train the neural network using gradient descent (an end-to-end sketch follows below).
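• Putting the hints together, a minimal end-to-end sketch. Assumptions not prescribed above: a sigmoid activation, an MSE cost, a toy XOR dataset, the layer sizes, and bias terms (added as standard practice):

    import numpy as np
    import matplotlib.pyplot as plt

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def sigmoid_derivative(a):
        # derivative of the sigmoid, written in terms of its output a = sigmoid(z)
        return a * (1.0 - a)

    class NeuralNetwork:
        def __init__(self, n_input, n_hidden, n_output, learning_rate=0.5):
            rng = np.random.default_rng(42)
            self.W1 = rng.standard_normal((n_input, n_hidden))    # input -> hidden
            self.b1 = np.zeros((1, n_hidden))
            self.W2 = rng.standard_normal((n_hidden, n_output))   # hidden -> output
            self.b2 = np.zeros((1, n_output))
            self.lr = learning_rate

        def forward(self, X):
            self.hidden = sigmoid(X @ self.W1 + self.b1)          # hidden activations
            self.output = sigmoid(self.hidden @ self.W2 + self.b2)
            return self.output

        def backward(self, X, y):
            # gradients of the MSE cost via the chain rule
            d_out = (self.output - y) * sigmoid_derivative(self.output)
            d_hid = (d_out @ self.W2.T) * sigmoid_derivative(self.hidden)
            # gradient descent updates: step against each gradient
            self.W2 -= self.lr * (self.hidden.T @ d_out)
            self.b2 -= self.lr * d_out.sum(axis=0, keepdims=True)
            self.W1 -= self.lr * (X.T @ d_hid)
            self.b1 -= self.lr * d_hid.sum(axis=0, keepdims=True)

        def train(self, X, y, epochs=5000):
            losses = []
            for _ in range(epochs):
                self.forward(X)
                losses.append(np.mean((self.output - y) ** 2))    # MSE cost
                self.backward(X, y)
            return losses

    # Toy dataset (XOR; chosen only for demonstration)
    X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
    y = np.array([[0], [1], [1], [0]], dtype=float)

    net = NeuralNetwork(n_input=2, n_hidden=4, n_output=1)
    losses = net.train(X, y)
    print(net.forward(X).round(2))   # should approach [[0], [1], [1], [0]]

    plt.plot(losses)                 # the cost should fall as training proceeds
    plt.xlabel("epoch")
    plt.ylabel("MSE cost")
    plt.show()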
CONCLUSION
• Implemented a three-stage neural network in Python using NumPy
and Matplotlib.
• Applied gradient descent to optimize weights and minimize the cost
function.
• Understanding gradient descent provides a foundation for deeper
machine learning exploration, after which we can explore more
advanced optimization techniques.
THANK YOU