dlweek6
Experiment No-6:
AIM:
To perform regularization on a dataset with different hyperparameter values and identify the best model.
DESCRIPTION:
Overfitting occurs when a model learns not only the underlying patterns in the training data but
also the noise, leading to poor performance on unseen data. Regularization is crucial for
enhancing the generalization capabilities of machine learning models, allowing them to
perform well on new, unseen datasets.
What is Regularization?
Regularization introduces a penalty term to the loss function used during model training. This
penalty discourages overly complex models by constraining the model's parameters, thus
controlling their ability to fit the training data. The primary goal is to improve the model’s
ability to generalize beyond the training set, enhancing its performance on unseen data.
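As a concrete illustration of the general idea, the short sketch below computes a penalized cost for a small weight vector using the L1 and L2 penalty forms defined in the list that follows. All numbers (the data loss, the weights, and λ) are made up purely for demonstration.
import numpy as np

# Toy numbers chosen only to illustrate a penalized cost
data_loss = 0.35                        # loss from the data-fitting term alone
weights = np.array([0.8, -1.2, 0.05])   # current model weights
lam = 0.01                              # regularization strength λ

l1_cost = data_loss + lam * np.sum(np.abs(weights))  # L1-penalized cost
l2_cost = data_loss + lam * np.sum(weights ** 2)     # L2-penalized cost
print(f"L1 cost: {l1_cost:.4f}, L2 cost: {l2_cost:.4f}")
Larger weights increase the penalty, so minimizing the penalized cost pushes the optimizer toward smaller (or, for L1, sparser) weights.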
Common regularization techniques in deep learning include the following (a short Keras sketch illustrating all four appears after this list):
1. L1 Regularization (Lasso Regression):
o This technique adds a penalty proportional to the sum of the absolute values of the
weights to the loss function. The regularized cost function becomes:
Cost function = Loss + λ ∑ |wᵢ|
o L1 regularization tends to produce sparse weight matrices, meaning it can drive
some weights exactly to zero. This property is useful for feature selection, as it
effectively eliminates unnecessary features from the model.
2. L2 Regularization (Ridge Regression):
o L2 regularization adds a penalty proportional to the sum of the squares of the
weights to the loss function:
Cost function = Loss + λ ∑ wᵢ²
o This technique discourages large weights but does not necessarily drive them to
zero. It is often preferred over L1 regularization because keeping all weights small,
rather than zeroing some out, generally leads to better generalization.
3. Dropout:
o Dropout is a regularization technique that randomly sets a fraction of a layer's
units to zero during training, which prevents neurons from co-adapting too much.
It is typically applied after dense or convolutional layers and helps to create more
robust features by encouraging redundancy in the network.
4. Early Stopping:
o This method involves monitoring the model’s performance on a validation set
during training and stopping the training process once the performance ceases to
improve. Early stopping helps to avoid overfitting by preventing the model from
training too long.
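The sketch below shows where each of these four techniques plugs into a Keras model. It is only an illustration under assumed settings: the layer sizes, the penalty strength 0.001, the dropout rate 0.3, and the early-stopping patience of 3 epochs are illustrative choices, not values taken from this experiment.
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Dropout
from tensorflow.keras.regularizers import l1, l2
from tensorflow.keras.callbacks import EarlyStopping

# Illustrative network combining L1/L2 penalties and dropout (assumed sizes and rates)
model = Sequential([
    Dense(128, activation='relu', input_shape=(784,),
          kernel_regularizer=l2(0.001)),   # L2 penalty on this layer's weights
    Dropout(0.3),                          # randomly zero 30% of activations
    Dense(64, activation='relu',
          kernel_regularizer=l1(0.001)),   # L1 penalty encourages sparse weights
    Dense(10, activation='softmax')
])
model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])

# Early stopping: halt training once validation loss stops improving for 3 epochs
early_stop = EarlyStopping(monitor='val_loss', patience=3, restore_best_weights=True)
# model.fit(x_train, y_train, validation_split=0.1, epochs=50, callbacks=[early_stop])
The fit call is shown commented out because it assumes the MNIST arrays prepared in the CODE section below.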
Implementation Plan
1. Dataset Preparation: We will use the well-known MNIST handwritten-digit dataset
to facilitate training and evaluation.
2. Model Design: A neural network model will be designed with the flexibility to
incorporate different regularization techniques. Layers will include activation functions,
dropout, and the regularization parameters.
3. Training the Model: The models will be trained with varying hyperparameters for the
regularization methods. For example:
CODE:
import numpy as np
import pandas as pd
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Dropout
from tensorflow.keras.datasets import mnist
from tensorflow.keras.utils import to_categorical
# Load and preprocess the MNIST dataset
(x_train, y_train), (x_test, y_test) = mnist.load_data()
x_train = x_train.reshape(-1, 784).astype('float32') / 255.0
x_test = x_test.reshape(-1, 784).astype('float32') / 255.0
y_train = to_categorical(y_train, 10)
y_test = to_categorical(y_test, 10)
# Hyperparameter configurations to compare (values taken from the runs shown under OUTPUT)
# NOTE: the network width (128 units per hidden layer) and ReLU activations are assumed choices.
param_grid = [
    {'epochs': 10, 'batch_size': 128, 'optimizer': 'adam', 'dropout': 0.2, 'hidden_layers': 3},
    {'epochs': 20, 'batch_size': 64, 'optimizer': 'adam', 'dropout': 0.3, 'hidden_layers': 4},
    {'epochs': 15, 'batch_size': 256, 'optimizer': 'rmsprop', 'dropout': 0.4, 'hidden_layers': 5},
    {'epochs': 12, 'batch_size': 128, 'optimizer': 'sgd', 'dropout': 0.3, 'hidden_layers': 4},
    {'epochs': 10, 'batch_size': 128, 'optimizer': 'adam', 'dropout': 0.5, 'hidden_layers': 5},
]

results = []
for params in param_grid:
    print(f"Training with params: {params}")

    # Build a fully connected network with dropout after each hidden layer
    model = Sequential()
    model.add(Dense(128, activation='relu', input_shape=(784,)))
    model.add(Dropout(params['dropout']))
    for _ in range(params['hidden_layers'] - 1):
        model.add(Dense(128, activation='relu'))
        model.add(Dropout(params['dropout']))
    model.add(Dense(10, activation='softmax'))

    model.compile(optimizer=params['optimizer'],
                  loss='categorical_crossentropy',
                  metrics=['accuracy'])

    # Train the model with the current hyperparameter settings
    model.fit(x_train, y_train,
              epochs=params['epochs'],
              batch_size=params['batch_size'],
              validation_data=(x_test, y_test),
              verbose=0)

    # Evaluate the model
    score = model.evaluate(x_test, y_test, verbose=0)
    print(f"Test accuracy for params {params}: {score[1]:.4f}")
    results.append({
        'epochs': params['epochs'],
        'batch_size': params['batch_size'],
        'optimizer': params['optimizer'],
        'dropout': params['dropout'],
        'hidden_layers': params['hidden_layers'],
        'test_accuracy': score[1]
    })
# Convert results to DataFrame and display
results_df = pd.DataFrame(results)
print("\nResults summary:")
print(results_df)
OUTPUT:
Training with params: {'epochs': 10, 'batch_size': 128, 'optimizer': 'adam', 'dropout': 0.2,
'hidden_layers': 3}
Test accuracy for params {'epochs': 10, 'batch_size': 128, 'optimizer': 'adam', 'dropout': 0.2,
'hidden_layers': 3}: 0.9788
Training with params: {'epochs': 20, 'batch_size': 64, 'optimizer': 'adam', 'dropout': 0.3,
'hidden_layers': 4}
Test accuracy for params {'epochs': 20, 'batch_size': 64, 'optimizer': 'adam', 'dropout': 0.3,
'hidden_layers': 4}: 0.9839
Training with params: {'epochs': 15, 'batch_size': 256, 'optimizer': 'rmsprop', 'dropout': 0.4,
'hidden_layers': 5}
Test accuracy for params {'epochs': 15, 'batch_size': 256, 'optimizer': 'rmsprop', 'dropout': 0.4,
'hidden_layers': 5}: 0.9823
Training with params: {'epochs': 12, 'batch_size': 128, 'optimizer': 'sgd', 'dropout': 0.3,
'hidden_layers': 4}
Test accuracy for params {'epochs': 12, 'batch_size': 128, 'optimizer': 'sgd', 'dropout': 0.3,
'hidden_layers': 4}: 0.9522
Training with params: {'epochs': 10, 'batch_size': 128, 'optimizer': 'adam', 'dropout': 0.5,
'hidden_layers': 5}
Test accuracy for params {'epochs': 10, 'batch_size': 128, 'optimizer': 'adam', 'dropout': 0.5,
'hidden_layers': 5}: 0.9773
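RESULT:
Among the configurations tried, the model trained with the Adam optimizer for 20 epochs, batch size 64, dropout 0.3 and 4 hidden layers achieved the highest test accuracy (0.9839) and is identified as the best model.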