UNIT-II Regularization in Deep Learning
What is Overfitting?
• When a model trains on sample data for an excessively long time or
becomes very complicated, it may begin to learn "noise," or
unimportant information, from the dataset.
• When it memorizes this noise, the model becomes "overfitted" and cannot
generalize successfully to new data.
• A model won't be able to carry out the classification or prediction
tasks that it was designed for if it can't generalize successfully to new
data.
What is Regularization?
• When a neural network faces entirely new data, regularization acts as
a guiding principle to prevent it from becoming too focused on just
the training data.
• By slightly altering the learning process, regularization encourages the
model to generalize better, and as a result it performs better on unseen
data.
Why Regularization?
• Through regularization, large coefficients receive a "penalty", which
ultimately reduces the variance of the model; in deep learning
specifically, it is the weight matrices of the nodes that are penalized.
• With regularization, a better-optimized and more accurate model is
obtained.
How does Regularization work?
• When modeling the data, a low bias and high variance scenario is
referred to as overfitting.
• To handle this, regularization techniques trade more bias for less
variance.
• Effective regularization is one that strikes the optimal balance
between bias and variance.
• Additionally, regularization ranks candidate models from the least to
the most overfit and adds penalties to the more complicated ones.
• Regularization rests on the assumption that smaller weights lead to
simpler models and therefore help prevent overfitting, as the sketch
below illustrates.
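The following is a minimal sketch, not taken from the original notes, of how a weight penalty is added to the training loss. The model, the data batch, and the strength `lam` are all assumed placeholder values, and the penalty shown here is a simple sum of squared weights.

```python
import torch
import torch.nn as nn

# Hypothetical tiny model and random batch, just to illustrate the idea.
model = nn.Linear(10, 1)
criterion = nn.MSELoss()
x = torch.randn(32, 10)
y = torch.randn(32, 1)

lam = 0.01  # regularization strength (assumed value)

# Ordinary task loss: fitting the data alone risks low bias but high variance.
data_loss = criterion(model(x), y)

# Penalty term: sum of squared weights, so larger weights cost more and the
# optimizer is nudged toward simpler (smaller-weight) models.
penalty = sum((p ** 2).sum() for p in model.parameters())

total_loss = data_loss + lam * penalty
total_loss.backward()  # gradients now balance data fit against model simplicity
```

Trading a little bias for less variance is exactly what the extra term does: the larger `lam` is, the stronger the pull toward small weights and a simpler model.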
Techniques of Regularization
L1 Regularization (Lasso Regression)
L2 Regularization (Ridge Regression)
Early stopping
Dropout Regularization
Data Augmentation
Batch Normalization
L1 Regularization
• L1 regularization adds the absolute values of weights to the loss
function as a penalty.
• This encourages some weights to shrink to exactly zero, effectively
eliminating those parameters from the model.
• This is particularly useful for feature selection, as it helps the model
focus on only the most important inputs while ignoring irrelevant
ones.
• Mathematical representation for L1 regularization:
  Loss = Original loss + λ Σ |wᵢ|
  where λ controls the strength of the penalty and wᵢ are the model weights.
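Below is a minimal sketch of L1 regularization applied during training; the layer sizes, the random data, and the penalty weight `lam` are assumed for illustration and are not from the original notes.

```python
import torch
import torch.nn as nn

# Hypothetical model with 20 input features and a random batch for illustration.
model = nn.Linear(20, 1)
criterion = nn.MSELoss()
x = torch.randn(64, 20)
y = torch.randn(64, 1)

lam = 0.05  # L1 strength (assumed value)

data_loss = criterion(model(x), y)

# L1 penalty: sum of absolute weight values. Its gradient pushes small weights
# all the way to zero, effectively removing irrelevant input features.
l1_penalty = sum(p.abs().sum() for p in model.parameters())

loss = data_loss + lam * l1_penalty
loss.backward()
```

After training with this penalty, weights attached to uninformative inputs tend to end up exactly at zero, which is the feature-selection effect described above.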