Department of Artificial Intelligence & Data Science
EXPERIMENT NO: 2
Aim: Apply the following learning algorithms to learn the parameters of the supervised single
layer feed forward neural network: Batch, Stochastic, and Mini-Batch Gradient Descent.
Software Required: Jupyter Notebook, Python, TensorFlow, Keras.
Theory:
Single Layer Feed Forward Neural Network
A Single Layer Feedforward Neural Network is one of the simplest types of artificial neural
networks. It consists of an input layer connected directly to a single layer of output neurons, so only one layer of weights is learned.
Figure 1: Single Layer Feedforward Neural Network
Working Principle:
Each neuron computes y = f(Σᵢ₌₁ⁿ wᵢxᵢ + b), where xᵢ are the inputs, wᵢ the weights, b the bias, and f the activation function.
The network learns by adjusting weights using a learning algorithm like gradient descent and a
loss function.
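As a concrete illustration of this computation, here is a minimal NumPy sketch of a single neuron; the input values, weights, bias, and sigmoid activation are chosen arbitrarily for the example.
import numpy as np

def sigmoid(z):
    # activation function f
    return 1.0 / (1.0 + np.exp(-z))

# example values (chosen arbitrarily for illustration)
x = np.array([0.5, -1.2, 3.0])   # inputs x_i
w = np.array([0.4, 0.1, -0.2])   # weights w_i
b = 0.05                         # bias

# y = f(sum_i w_i * x_i + b)
y = sigmoid(np.dot(w, x) + b)
print(y)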
Feed Forward Neural Network
Feedforward neural networks are also known as Multi-layered Networks of Neurons (MLN). In these networks, information travels only forward: from the input nodes, through the hidden layers (one or more), and finally to the output nodes. An MLN has no feedback connections, i.e., the output of the network is never fed back into itself.
Gradient
A gradient gives the direction and magnitude of change computed during the training of a neural network; it is used to adjust the network weights in the right direction by the right amount. The higher the gradient, the steeper the slope and the faster a model can learn; if the slope is zero, the model stops learning. Mathematically, the gradient is the vector of partial derivatives of the loss with respect to the parameters (weights and biases).
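To make the definition concrete, the short sketch below (an illustration, not part of the experiment code) computes the partial derivative of a squared-error loss with respect to a single weight, both analytically and numerically.
import numpy as np

# single sample: prediction y_hat = w * x, loss L = (y_hat - y)^2
x, y, w = 2.0, 3.0, 0.5

def loss(w):
    return (w * x - y) ** 2

# analytic gradient: dL/dw = 2 * (w*x - y) * x
grad_analytic = 2 * (w * x - y) * x

# numerical estimate of the same partial derivative
eps = 1e-6
grad_numeric = (loss(w + eps) - loss(w - eps)) / (2 * eps)

print(grad_analytic, grad_numeric)   # both ≈ -8.0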
Gradient Descent:
Gradient Descent is an optimization algorithm used to minimize the loss function by iteratively
updating the weights in the direction of the negative gradient. This direction, defined by the slope
of the loss function, leads the model toward the minimum point, where prediction error is at its
lowest. A crucial factor in this process is the learning rate, which defines the step size toward the
minimum. A lower learning rate results in smaller steps, increasing the time to reach the minimum
but often yielding a more precise result. If the learning rate is too large, the steps become larger and the model may never reach the minimum, because it overshoots and bounces back and forth across the valley of the loss function instead of settling at it.
Figure 2: Impact of Learning Rate on Convergence Speed and Stability
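The following minimal sketch, assuming a simple one-parameter quadratic loss chosen only for illustration, shows how the learning rate affects convergence: a small rate makes slow progress, a moderate rate converges, and an overly large rate overshoots and diverges.
import numpy as np

def grad(w):
    # derivative of the loss L(w) = (w - 3)^2, whose minimum is at w = 3
    return 2 * (w - 3)

for lr in [0.01, 0.1, 1.1]:           # small, moderate, too large
    w = 0.0
    for _ in range(50):
        w = w - lr * grad(w)          # gradient descent update
    print(f"lr={lr}: w after 50 steps = {w:.4f}")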
Types of Gradient Descent
1) Batch Gradient Descent - Updates weights after computing the gradient over the entire dataset.
2) Stochastic Gradient Descent - Updates weights for each training sample individually.
3) Mini-Batch Gradient Descent - Updates weights after computing the gradient over a small batch
of training samples.
1. Batch Gradient Descent
Batch gradient descent, also called vanilla gradient descent, uses the entire dataset to calculate the gradient and updates the parameters once per epoch. Its advantage is that it produces a stable error gradient and stable convergence; however, it requires that the entire training set resides in memory and is available to the algorithm. It can also be computationally expensive, especially for large datasets, because every single update requires computing gradients over the whole dataset. As a result, batch gradient descent can be very slow and is intractable for datasets that do not fit in memory. It may also converge slowly, get stuck in local minima or saddle points, and suffer from poor generalization if not properly tuned.
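Below is a minimal NumPy sketch of batch gradient descent for a linear model with mean-squared-error loss on synthetic data; it illustrates the one-update-per-epoch idea and is not the Keras code used later in this experiment.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(400, 2))                 # synthetic inputs
y = X @ np.array([2.0, -1.0]) + 0.5           # synthetic targets

w, b, lr = np.zeros(2), 0.0, 0.1
for epoch in range(100):
    y_hat = X @ w + b                         # predictions on the FULL dataset
    error = y_hat - y
    grad_w = 2 * X.T @ error / len(X)         # gradient of MSE over all samples
    grad_b = 2 * error.mean()
    w -= lr * grad_w                          # one parameter update per epoch
    b -= lr * grad_b
print(w, b)                                   # approaches [2, -1] and 0.5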
Figure 3: Effect of Learning Rate on Gradient Descent
2. Stochastic Gradient Descent
Stochastic Gradient Descent updates the model parameters using only one training sample at a
time. It is computationally efficient and allows for faster iterations, making it suitable for
large-scale datasets. One advantage is that the frequent updates give us a detailed picture of the rate of
improvement. However, because of its frequent updates, it introduces noise in the optimization
path, leading to a more fluctuating convergence. This randomness can help escape local minima
but may also prevent reaching the exact minimum unless properly tuned.
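Using the same illustrative setup (linear model, MSE loss, synthetic data), a stochastic version performs one parameter update per training sample:
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(400, 2))
y = X @ np.array([2.0, -1.0]) + 0.5

w, b, lr = np.zeros(2), 0.0, 0.01
for epoch in range(20):
    for i in rng.permutation(len(X)):         # shuffle, then take one sample at a time
        error = (X[i] @ w + b) - y[i]
        w -= lr * 2 * error * X[i]            # update from a single sample
        b -= lr * 2 * error
print(w, b)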
3. Mini-Batch Gradient Descent
Mini-Batch Gradient Descent combines the advantages of both Batch and Stochastic Gradient
Descent. It splits the training dataset into smaller batches, then computes the gradient and updates the parameters once per batch, offering a good trade-off between convergence speed and stability. It reduces the variance of the updates compared to SGD while requiring far less computation per update than Batch Gradient Descent, balancing the robustness of stochastic gradient descent with the efficiency of batch gradient descent. Mini-batching also allows for
parallelization and efficient use of hardware (like GPUs), making it the most commonly used
method in practice.
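Continuing the same illustrative setup, the mini-batch version below performs one parameter update per small batch of samples; the batch size of 32 is an arbitrary choice for the example.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(400, 2))
y = X @ np.array([2.0, -1.0]) + 0.5

w, b, lr, batch_size = np.zeros(2), 0.0, 0.05, 32
for epoch in range(50):
    idx = rng.permutation(len(X))
    for start in range(0, len(X), batch_size):
        batch = idx[start:start + batch_size]          # indices of one mini-batch
        error = (X[batch] @ w + b) - y[batch]
        w -= lr * 2 * X[batch].T @ error / len(batch)  # gradient over the mini-batch
        b -= lr * 2 * error.mean()
print(w, b)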
Batch vs. Stochastic vs. Mini-Batch Gradient Descent
1) Which is faster (for the same number of epochs)?
• Order: Batch > Mini-Batch > Stochastic
• Reason: Batch performs a single vectorized update per epoch over the full data, so an epoch is cheapest; SGD performs one update per sample, so an epoch takes the longest.
2) Which converges faster (for the same number of epochs)?
• Order: Stochastic > Mini-Batch > Batch
• Reason: SGD updates the weights more frequently, so it approaches the minimum faster, though noisily.
3) Which converges more smoothly and stably?
• Order: Batch > Mini-Batch > Stochastic
• Reason: Batch has the smoothest gradient, while SGD fluctuates a lot.
Figure 4: Comparison of Cost Function Behaviour
Conclusion:
Thus, we successfully implemented the learning algorithms to learn the parameters of the
supervised single layer feed forward neural network.
Code:
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.preprocessing import StandardScaler

# load the dataset and keep only the columns used for training
df = pd.read_csv('Social_Network_Ads.csv')
df.head()
df = df[['Age', 'EstimatedSalary', 'Purchased']]
df.head()
df.shape
# features (Age, EstimatedSalary) and target (Purchased)
X = df.iloc[:, 0:2]
y = df.iloc[:, -1]

# standardize the features
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)
X_scaled.shape

import tensorflow as tf
from tensorflow import keras
from keras import Sequential
from keras.layers import Dense

# two hidden layers of 10 ReLU units and a sigmoid output for binary classification
model = Sequential()
model.add(Dense(10, activation='relu', input_dim=2))
model.add(Dense(10, activation='relu'))
model.add(Dense(1, activation='sigmoid'))
model.summary()
# Batch Gradient Descent
# plain SGD optimizer (assumed here, so the updates follow gradient descent)
model.compile(optimizer='sgd', loss='binary_crossentropy', metrics=['accuracy'])
# batch_size equal to the full training set -> one weight update per epoch
history = model.fit(X_scaled, y, epochs=500, batch_size=400, validation_split=0.2, verbose=0)
plt.plot(history.history['loss'])
# Stochastic Gradient Descent
model.compile(optimizer='sgd', loss='binary_crossentropy', metrics=['accuracy'])
# batch_size=1 -> one weight update per training sample
history = model.fit(X_scaled, y, epochs=500, batch_size=1, validation_split=0.2)
plt.plot(history.history['loss'])
# Mini-Batch Gradient Descent
model.compile(optimizer='sgd', loss='binary_crossentropy', metrics=['accuracy'])
# batch_size=250 -> a few updates per epoch, between the two extremes
history = model.fit(X_scaled, y, epochs=500, batch_size=250, validation_split=0.2)
plt.plot(history.history['loss'])