Comparison of Optimization Algorithms: SGD, RMSprop, and Adam

This document compares three optimization algorithms: stochastic gradient descent (SGD), RMSprop, and Adam. SGD updates model parameters iteratively using mini-batches. RMSprop adapts the learning rate of each parameter individually. Adam combines the advantages of RMSprop and momentum, maintaining per-parameter learning rates and running averages of past gradients. The document also includes plots comparing the algorithms on convergence speed, robustness, and ease of use.

Stochastic Gradient Descent (SGD):

Characteristics:

- Iteratively updates the model parameters using the gradients of the loss function with respect to the parameters.
- Randomly selects a subset of the training data (mini-batch) for each iteration.

Advantages:

- Simplicity and ease of implementation.
- Can perform well on large-scale datasets.

Drawbacks:

- Prone to getting stuck in local minima.
- Can have slow convergence, especially in the presence of noisy gradients.
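
As a minimal sketch of the update rule described above, the following NumPy snippet runs mini-batch SGD on a toy linear-regression problem (the synthetic data, learning rate, and batch size are illustrative assumptions, not values from any real experiment):

import numpy as np

# Toy linear-regression data (hypothetical, for illustration only)
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 5))
true_w = np.array([1.0, -2.0, 0.5, 3.0, 0.0])
y = X @ true_w + rng.normal(scale=0.1, size=1000)

w = np.zeros(5)            # model parameters
lr, batch_size = 0.05, 32  # assumed hyperparameters

for epoch in range(20):
    idx = rng.permutation(len(X))              # shuffle, then take mini-batches
    for start in range(0, len(X), batch_size):
        batch = idx[start:start + batch_size]  # randomly selected subset
        Xb, yb = X[batch], y[batch]
        grad = 2 * Xb.T @ (Xb @ w - yb) / len(batch)  # gradient of the MSE loss
        w -= lr * grad                                # SGD step: w <- w - lr * grad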

RMSprop (Root Mean Square Propagation):

Characteristics:

- Adapts the learning rate of each parameter individually.
- Divides the learning rate by the root mean square of an exponentially weighted moving average of squared gradients.

Advantages:

- Effective in dealing with sparse data and non-stationary objectives.
- Helps overcome some of the issues with a constant learning rate in SGD.

Drawbacks:

- May suffer from vanishing or exploding learning rates.
- Requires tuning of additional hyperparameters.
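
A minimal sketch of the RMSprop update, assuming its standard form with decay rate rho and a small epsilon for numerical stability (the toy objective and hyperparameter values are illustrative):

import numpy as np

def rmsprop_step(w, grad, sq_avg, lr=0.01, rho=0.9, eps=1e-8):
    # Exponentially weighted moving average of squared gradients
    sq_avg = rho * sq_avg + (1 - rho) * grad**2
    # Divide the learning rate by the root mean square of that average
    w = w - lr * grad / (np.sqrt(sq_avg) + eps)
    return w, sq_avg

# Toy usage on f(w) = sum(w**2), whose gradient is 2*w (hypothetical example)
w = np.array([5.0, -3.0])
sq_avg = np.zeros_like(w)
for _ in range(200):
    w, sq_avg = rmsprop_step(w, 2 * w, sq_avg, lr=0.05)
print(w)  # both components move toward the minimum at 0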

Adam (Adaptive Moment Estimation):

Characteristics:

- Combines the advantages of both RMSprop and momentum.
- Maintains a separate learning rate for each parameter via exponentially decaying averages of past gradients and past squared gradients.

Advantages:

- Fast convergence and robustness to noisy gradients.
- Automatic adjustment of the learning rate for each parameter.

Drawbacks:

- May exhibit erratic behavior on some non-convex optimization problems.
- Introduces additional hyperparameters that need tuning.
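
A minimal sketch of the Adam update, assuming the usual defaults beta1=0.9 and beta2=0.999 with bias correction of both moment estimates (the toy objective and step count are illustrative):

import numpy as np

def adam_step(w, grad, m, v, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    m = beta1 * m + (1 - beta1) * grad       # decaying average of past gradients (momentum)
    v = beta2 * v + (1 - beta2) * grad**2    # decaying average of past squared gradients (RMSprop)
    m_hat = m / (1 - beta1**t)               # bias-corrected first moment
    v_hat = v / (1 - beta2**t)               # bias-corrected second moment
    w = w - lr * m_hat / (np.sqrt(v_hat) + eps)  # per-parameter adaptive step
    return w, m, v

# Toy usage on f(w) = sum(w**2), whose gradient is 2*w (hypothetical example)
w = np.array([5.0, -3.0])
m, v = np.zeros_like(w), np.zeros_like(w)
for t in range(1, 201):
    w, m, v = adam_step(w, 2 * w, m, v, t, lr=0.1)
print(w)  # both components move toward the minimum at 0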

import matplotlib.pyplot as plt

optimizers = ['SGD', 'RMSprop', 'Adam']

convergence_speed = [3, 4, 5]  # Hypothetical scores (higher is better)
robustness = [3, 4, 5]         # Hypothetical scores (higher is better)
ease_of_use = [4, 3, 3]        # Hypothetical scores (higher is better)

fig, ax = plt.subplots()
bar_width = 0.25
index = range(len(optimizers))

bar1 = ax.bar(index, convergence_speed, bar_width, label='Convergence Speed')
bar2 = ax.bar([i + bar_width for i in index], robustness, bar_width, label='Robustness')
bar3 = ax.bar([i + 2 * bar_width for i in index], ease_of_use, bar_width, label='Ease of Use')

ax.set_xlabel('Optimizers')
ax.set_ylabel('Score')
ax.set_title('Comparison of Optimization Algorithms')
ax.set_xticks([i + bar_width for i in index])
ax.set_xticklabels(optimizers)
ax.legend()

plt.show()

import pandas as pd
import plotly.express as px

# Data for the line plot
data = {
    'Optimizer': ['SGD', 'RMSprop', 'Adam'],
    'Convergence Speed': [3, 4, 5],
    'Robustness': [3, 4, 5],
    'Ease of Use': [4, 3, 3],
}

# Create a DataFrame from the data
df = pd.DataFrame(data)

# Melt the DataFrame so each row holds one (optimizer, metric, score) triple
df_melted = pd.melt(df, id_vars='Optimizer', var_name='Metric', value_name='Score')

# Create a line plot using Plotly Express
fig = px.line(df_melted, x='Metric', y='Score', color='Optimizer',
              title='Comparison of Optimization Algorithms')

# Update layout for better visualization
fig.update_layout(
    xaxis_title='Metric',
    yaxis_title='Score',
    height=500,
    width=800
)

# Show the plot
fig.show()
