0% found this document useful (0 votes)

67 views

DeepLearningForVisionSystems Ch5 ResNet

This document discusses implementing a ResNet image classifier model in TensorFlow. It includes code to import libraries, initialize parameters, load and preprocess a dataset of images, and define the ResNet model architecture. The model uses residual skip connections to allow deeper networks without gradient problems. The dataset contains 5000 training and 1000 validation images across 5 classes. The code crops images, defines a residual block, and maps and caches the training and validation datasets for use in the model.

Uploaded by

mkkadambi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

67 views

DeepLearningForVisionSystems Ch5 ResNet

Uploaded by

mkkadambi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 24

7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.

ipynb - Colaboratory

This is an implementation of ResNet Image Classi er based on the paper:

https://arxiv.org/pdf/1512.03385.pdf

This model uses residual skip connections that allows a network to go deeper without suffering
from the vanishing/ explodiing gradient problem

Initializations and Imports

# Importing Tensorflow
import tensorflow as tf
from tensorflow.keras import Input, Model
from tensorflow.keras.layers import Dense, Conv2D, BatchNormalization, MaxPool2D, Ave
from tensorflow.keras.optimizers import Adam, SGD
from tensorflow.keras.losses import CategoricalCrossentropy
from tensorflow.keras.metrics import CategoricalAccuracy

import numpy as np

# import random, os for data loading
import random, os

import matplotlib.pyplot as plt

# pandas for displaying confusion matrix
import pandas as pd

print(tf.__version__)

# Display GPU availability if any
from tensorflow.python.client import device_lib

def get_available_gpus():
local_device_protos = device_lib.list_local_devices()
return [x.name for x in local_device_protos if x.device_type == 'GPU']
print("devices =" , tf.config.list_physical_devices())

print(get_available_gpus())

2.5.0
devices = [PhysicalDevice(name='/physical_device:CPU:0', device_type='CPU'), Phy
['/device:GPU:0']

# Set the seed value for consistent results

def set_seed(seed=31415):
    np.random.seed(seed)
https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 1/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory
    tf.random.set_seed(seed)
    os.environ['PYTHONHASHSEED'] = str(seed)
    os.environ['TF_DETERMINISTIC_OPS'] = '1'
set_seed()

Parameter Initializations

# Input image dimensions
input_shape = (224, 224, 3)

# number of images to process in a batch
batch_size = 30

# Number of categories in the dataset
num_classes = 5

checkpoint_filePath = '/content/drive/MyDrive/MachineLearning/ResNet_2.h5'

Data Loading and Preprocessing

Procuring the Dataset

path='/content/Linnaeus 5 256X256'
# Check if the folder with the dataset already exists, if not copy it from the saved
if not os.path.isdir(path):
!cp '/content/drive/MyDrive/MachineLearning/Linnaeus 5 256X256.rar' '/content/'
get_ipython().system_raw("unrar x '/content/Linnaeus 5 256X256.rar'")

categories = os.listdir(os.path.join(path, 'train'))
print(len(categories), " categories found =", categories)

5 categories found = ['dog', 'bird', 'flower', 'other', 'berry']

Training and Validation Dataset

train_image_dataset = tf.keras.preprocessing.image_dataset_from_directory(
        os.path.join(path, 'train')
      , labels='inferred'
https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 2/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory
,
      , label_mode='categorical'
      , class_names=categories
      , batch_size=batch_size
      , image_size=(256, 256)
      , shuffle=True
      , seed=2
      , validation_split=0.1
      , subset= 'training'
  )

validation_image_dataset = tf.keras.preprocessing.image_dataset_from_directory(
        os.path.join(path, 'train')
      , labels='inferred'
      , label_mode='categorical'
      , class_names=categories
      , batch_size=batch_size
      , image_size=(256, 256)
      , shuffle=True
      , seed=2
      , validation_split=0.1
      , subset= 'validation'
  )

print("Training class names found =" , train_image_dataset.class_names)

def crop_images(images, labels):
  '''
  Expecting categories to be names of subfolders and the images belonging to each
  of the subfolders be stored inside them. While reading the images, they are resized
  and then cropped to 224x224x3 based on the way the paper describes (randomly betwee
  diagnostics: bool (default False), If True it will print a lot of debug information

  '''
  # In order to clip the image in either from top-left, top-right, bottom-left, botto
  # we create an array of possible start positions
  corners_list = [0, (256-input_shape[0])//2, 256-input_shape[0]]

  # Sampling one number from the list of start positions
  offset_height = offset_width = random.sample(corners_list, 1)[0]

  images = tf.image.per_image_standardization(images-127)
  images = images/tf.math.reduce_max(tf.math.abs(images))

  # Since there is an auxillary arm of the model, we have to concatenate two labels w
  return  tf.image.crop_to_bounding_box(images, offset_height, offset_width, input_sh

validation_datasource = validation_image_dataset.map(crop_images)
validation_datasource = validation_datasource.cache().prefetch(buffer_size=tf.data.AU

training datasource train image dataset map(crop images)
https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 3/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory
training_datasource = train_image_dataset.map(crop_images)
training_datasource = training_datasource.cache().prefetch(buffer_size=tf.data.AUTOTU

Found 6000 files belonging to 5 classes.

Using 5400 files for training.
Found 6000 files belonging to 5 classes.
Using 600 files for validation.
Training class names found = ['dog', 'bird', 'flower', 'other', 'berry']

for images, labels in training_datasource:
  print("images =", images.shape)
  print("labels =", type(labels))
  break

training_datasource = train_image_dataset.map(crop_images)
training_datasource = training_datasource.cache().prefetch(buffer_size=tf.data.AUTOTU

images = (30, 224, 224, 3)

labels = <class 'tensorflow.python.framework.ops.EagerTensor'>

Test Data

test_image_dataset = tf.keras.preprocessing.image_dataset_from_directory(
        os.path.join(path, 'test')
      , labels='inferred'
      , label_mode='categorical'
      , class_names=categories
      , batch_size=batch_size
      , image_size=(256, 256)
      , seed=2
  )
def test_data_crop_images(images, labels):
  '''
  Definiing separate function for test data because labels do not have to be
   concatenated during testing and the map function does not allow multiple function

  Expecting categories to be names of subfolders and the images belonging to each
  of the subfolders be stored inside them. While reading the images, they are resized
  and then cropped to 224x224x3 based on the way the paper describes (randomly betwee
  diagnostics: bool (default False), If True it will print a lot of debug information

  '''
  # In order to clip the image in either from top-left, top-right, bottom-left, botto
  # we create an array of possible start positions
  corners_list = [0, (256-input_shape[0])//2, 256-input_shape[0]]

# Sampling one number from the list of start positions
https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 4/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory
  # Sampling one number from the list of start positions
  offset_height = offset_width = random.sample(corners_list, 1)[0]

  images = tf.image.per_image_standardization(images-127)
  images = images/tf.math.reduce_max(tf.math.abs(images))
  # Since there is an auxillary arm of the model, we have to concatenate two labels w
  return  tf.image.crop_to_bounding_box(images, offset_height, offset_width, input_sh

test_datasource = test_image_dataset.map(test_data_crop_images)
test_datasource = test_datasource.cache().prefetch(buffer_size=tf.data.AUTOTUNE).shuf

Found 2000 files belonging to 5 classes.

Model Architecture

De ne the Residual Block

def residual_block(input, filter_configs, shortcut_filter_configs, name_prefix=''):
  '''
  This function is to define the residual block that is specified in the paper.
  Based on the shortcut_filteere_configs parameters, the function can either
  directly join the input with the output of the conv filters on the main path,
  or,
  can add conv filters in the shortcut and then add to the main path.

  After addition of the shortcut and the main path, the output is provided
   through an activation layer

  Parameters:
  input: input Tensor
  filter_configs: list of dictionary with convolution filter configurations in the ma
        Each item in the list is a layer inside the residual block. The
        dictionary should be of the form -
               {'filters': number of filters for the
               , 'kernel_size':  kernel size of the filter
               , 'strides': strides of the filter
               , 'padding': padding of the filter
               , 'activation': Activatioin of the convolution filter
               }
  shortcut_filter_configs: list of dictionaries with convolution filter
        configurations for the filters in the shortcut path.
        Structure of the dictionary is the same as filter_configs
  name_prefix = String that will be added as a prefix to all layers of the block
  '''
  shortcut_path = input # This is for the shortcut path

i th i t # Thi i f th i th
https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 5/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory
  main_path = input # This is for the main path
  for i, config in enumerate(filter_configs):
    # Going through the main path conv filter creations
    main_path = Conv2D(filters = config['filters']
               , kernel_size = config['kernel_size']
               , strides=config['strides']
               , padding=config['padding']
               , activation=config['activation']
               , use_bias=True
               , kernel_initializer='glorot_uniform'
               , name=name_prefix + '_main_'+ str(i+1)
          )(main_path)

  # Check if the shortcut filter configs has been defined
  if shortcut_filter_configs is not None:
    # We need to add filters in the shortcut path
    # As per the paper, there only needs to be 1x1 kernel filter with the required de
    # but the code here gives the "unwanted/ unwarranted" freedom to define multiple

    for i, config in enumerate(shortcut_filter_configs):
      # Go through the list of filters and create the conv filters
      shortcut_path = Conv2D(filters = config['filters']
               , kernel_size = config['kernel_size']
               , strides=config['strides']
               , padding=config['padding']
               , activation=config['activation']
               , use_bias=True
               , kernel_initializer='glorot_uniform'
               , name = name_prefix + '_shortcut_' + str(i+1)
          )(shortcut_path)
  # Going through the Add layer to add the input with the output of
  combined_path = Add(name=name_prefix + '_add_junction')([shortcut_path, main_path])

  # The output of the Add has to go through an activation layer
  residual_block_output = Activation('elu', name=name_prefix + '_activation_output')(
  # TODO: Make the activation method here as a configurable parameter

  return residual_block_output

De ne the architecture of the model

https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 6/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory

ResNet50 residual block con gurations

def get_resnet50_config():
  # First 3 residual blocks have in their main path, 1x1x64, 3x3x64, 1x1x256
  # Since the output of the maxpool_1 is of depth 64, only the first residual
  # block will require a 1x1x256 filter in its shortcut path

  conv2_1_2_3_filter_configs= [
        {'filters': 64, 'kernel_size': 1, 'strides': 1, 'padding': 'same', 'activatio
        {'filters': 64, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activatio
        {'filters': 256, 'kernel_size': 1, 'strides': 1, 'padding': 'same', 'activati
      ]
  conv2_1_shortcut_filter_configs = [
    {'filters': 256, 'kernel_size': 1, 'strides': 1, 'padding': 'same', 'activation':
  ]

  # Next 4 residual blocks in Conv3_x have filters 1x1x128, 3x3x128, 1x1x512
  # The output of the previous block has depth of 256, so the first of the 4 residual
  # will require a 1x1x512 conv filter in its shortcut path, the others will have ide

  # Conv3_1 will have stride = 2, so that will be defined separately
  # stride 2 will reduce the width and height from 56x56 to 28x28
  conv3_1_filter_configs= [
        {'filters': 128, 'kernel_size': 1, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 128, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 512, 'kernel_size': 1, 'strides': 2, 'padding': 'same', 'activati
      ]

  conv3_1_shortcut_filter_configs = [
    {'filters': 512, 'kernel_size': 1, 'strides': 2, 'padding': 'same', 'activation':
https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 7/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory
  ]

  # Rest of the 3 blocks of conv3 are defined below
  conv3_2_3_4_filter_configs= [
        {'filters': 128, 'kernel_size': 1, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 128, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 512, 'kernel_size': 1, 'strides': 1, 'padding': 'same', 'activati
      ]

  # There will be 6 blocks in conv4_x
  # conv4_1 will have strride = 2
  # widthxheight will be reduced from 28x28 to 14x14
  conv4_1_filter_configs= [
        {'filters': 256, 'kernel_size': 1, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 256, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 1024, 'kernel_size': 1, 'strides': 2, 'padding': 'same', 'activat
      ]
  conv4_1_shortcut_filter_configs = [
    {'filters': 1024, 'kernel_size': 1, 'strides': 2, 'padding': 'same', 'activation'
  ]
  # Rest of the conv4_x blocks:
  conv4_2_to_6_filter_configs= [
        {'filters': 256, 'kernel_size': 1, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 256, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 1024, 'kernel_size': 1, 'strides': 1, 'padding': 'same', 'activat
      ]

  # conv5_x has 3 blocks
  # conv5_1 will have stride =2
  # widthxheight will reduce from 14x14 to 7x7
  conv5_1_filter_configs= [
        {'filters': 512, 'kernel_size': 1, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 512, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 2048, 'kernel_size': 1, 'strides': 2, 'padding': 'same', 'activat
      ]
  conv5_1_shortcut_filter_configs = [
    {'filters': 2048, 'kernel_size': 1, 'strides': 2, 'padding': 'same', 'activation'
  ]

  conv5_2_3_filter_configs= [
        {'filters': 512, 'kernel_size': 1, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 512, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 2048, 'kernel_size': 1, 'strides': 1, 'padding': 'same', 'activat
      ]

  resNet50_residual_block_config = [
      (conv2_1_2_3_filter_configs, conv2_1_shortcut_filter_configs, "conv2_1"), # Con
      (conv2_1_2_3_filter_configs, None, "conv2_2"),                            # Con
      (conv2_1_2_3_filter_configs, None, "conv2_3"),                            # Con
      (conv3_1_filter_configs, conv3_1_shortcut_filter_configs, "conv3_1"),     # Con
      (conv3_2_3_4_filter_configs, None, "conv3_2"),                            # Con
      (conv3_2_3_4_filter_configs, None, "conv3_3"),                            # Con
https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 8/24
7/25/2021 _ _ _ _ _ DeepLearningForVisionSystems-Ch5-ResNet.ipynb
_ - Colaboratory

      (conv3_2_3_4_filter_configs, None, "conv3_4"),                            # Con
      (conv4_1_filter_configs, conv4_1_shortcut_filter_configs, "conv4_1"),     # Con
      (conv4_2_to_6_filter_configs, None, "conv4_2"),                           # Con
      (conv4_2_to_6_filter_configs, None, "conv4_3"),                           # con
      (conv4_2_to_6_filter_configs, None, "conv4_4"),                           # con
      (conv4_2_to_6_filter_configs, None, "conv4_5"),                           # con
      (conv4_2_to_6_filter_configs, None, "conv4_6"),                           # con
      (conv5_1_filter_configs, conv5_1_shortcut_filter_configs, "conv5_1"),     # con
      (conv5_2_3_filter_configs, None, "conv5_2"),                              # con
      (conv5_2_3_filter_configs, None, "conv5_3")                               # con
  ]
  return resNet50_residual_block_config

ResNet34 model con guration

def get_resnet34_config():
  # First 2 residual blocks have in their main path, 3x3x64
  # Since the output of the maxpool_1 is of depth 64, shortcut paths can just be iden

  conv2_1_2_3_filter_configs= [
        {'filters': 64, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activatio
        {'filters': 64, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activatio
      ]

  # Next 4 residual blocks in Conv3_x have filters 3x3x128, 3x3x128
  # The output of the previous block has depth of 64, so the first of the 4 residual
  # will require a 1x1x128 conv filter in its shortcut path, the others will have ide

  # Conv3_1 will have stride = 2, so that will be defined separately
  # stride 2 will reduce the width and height from 56x56 to 28x28
  conv3_1_filter_configs= [
        {'filters': 128, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 128, 'kernel_size': 3, 'strides': 2, 'padding': 'same', 'activati
      ]

  conv3_1_shortcut_filter_configs = [
    {'filters': 128, 'kernel_size': 1, 'strides': 2, 'padding': 'same', 'activation':
  ]

  # Rest of the 3 blocks of conv3 are defined below with stride 1
  conv3_2_3_4_filter_configs= [
        {'filters': 128, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 128, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activati
      ]

  # There will be 6 blocks in conv4_x
  # conv4_1 will have strride = 2
  # widthxheight will be reduced from 28x28 to 14x14
https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 9/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory

  conv4_1_filter_configs= [
        {'filters': 256, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 256, 'kernel_size': 3, 'strides': 2, 'padding': 'same', 'activati
      ]
  conv4_1_shortcut_filter_configs = [
    {'filters': 256, 'kernel_size': 1, 'strides': 2, 'padding': 'same', 'activation':
  ]
  # Rest of the conv4_x blocks:
  conv4_2_to_6_filter_configs= [
        {'filters': 256, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 256, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activati
      ]

  # conv5_x has 3 blocks
  # conv5_1 will have stride =2
  # widthxheight will reduce from 14x14 to 7x7
  conv5_1_filter_configs= [
        {'filters': 512, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 512, 'kernel_size': 3, 'strides': 2, 'padding': 'same', 'activati
      ]
  conv5_1_shortcut_filter_configs = [
    {'filters': 512, 'kernel_size': 1, 'strides': 2, 'padding': 'same', 'activation':
  ]

  # block 2 and 3 will have stride = 1
  conv5_2_3_filter_configs= [
        {'filters': 512, 'kernel_size': 1, 'strides': 1, 'padding': 'same', 'activati
        {'filters': 512, 'kernel_size': 3, 'strides': 1, 'padding': 'same', 'activati
      ]

  resNet34_residual_block_config = [
      (conv2_1_2_3_filter_configs, None, "conv2_1"), # Conv 2_1
      (conv2_1_2_3_filter_configs, None, "conv2_2"),                            # Con
      (conv2_1_2_3_filter_configs, None, "conv2_3"),                            # Con
      (conv3_1_filter_configs, conv3_1_shortcut_filter_configs, "conv3_1"),     # Con
      (conv3_2_3_4_filter_configs, None, "conv3_2"),                            # Con
      (conv3_2_3_4_filter_configs, None, "conv3_3"),                            # Con
      (conv3_2_3_4_filter_configs, None, "conv3_4"),                            # Con
      (conv4_1_filter_configs, conv4_1_shortcut_filter_configs, "conv4_1"),     # Con
      (conv4_2_to_6_filter_configs, None, "conv4_2"),                           # Con
      (conv4_2_to_6_filter_configs, None, "conv4_3"),                           # con
      (conv4_2_to_6_filter_configs, None, "conv4_4"),                           # con
      (conv4_2_to_6_filter_configs, None, "conv4_5"),                           # con
      (conv4_2_to_6_filter_configs, None, "conv4_6"),                           # con
      (conv5_1_filter_configs, conv5_1_shortcut_filter_configs, "conv5_1"),     # con
      (conv5_2_3_filter_configs, None, "conv5_2"),                              # con
      (conv5_2_3_filter_configs, None, "conv5_3")                               # con
  ]
  return resNet34_residual_block_config

https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 10/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory

Building the model

def build_model(residual_block_config):
  # input layer definition, input shape = 224x224x3
  input = Input (shape=input_shape, batch_size = batch_size, name='main_input')

  #First layer is a conv filter as per the paper, output = 112x112x64
  main_path = Conv2D(filters=64, kernel_size= 7, strides= 2, padding='same'
                  , activation='elu', use_bias = True, name='conv1')(input)

  # MaxPool layer output = 56x56x64
  main_path = MaxPool2D(pool_size=3, strides=2, padding='same'
                        , name='maxpool_1')(main_path)

  # Create the chain of residual blocks using the below for loop
  for filter_configs, shortcut_filter_configs, name_prefix in residual_block_config:
    main_path = residual_block(main_path
                              , filter_configs
                              , shortcut_filter_configs
                              , name_prefix= name_prefix
                              )

  #AveragePool layer and getting ready to create the output classification layer,
  main_path = AveragePooling2D(pool_size = 7, strides = 7, padding='same'
                              , name='avg_pool')(main_path)

  # Flatten the data to get it ready for FC layer
  main_path = Flatten(name="flatten")(main_path)

  #Adding final FC layer to classify the output
  main_path = Dense(num_classes, activation='softmax', name='fc_final_output')(main_p

  model= Model(inputs=input, outputs = main_path)
  return model

Create ResNet50 Model

resNet50_residual_block_config = get_resnet50_config()
model = build_model(resNet50_residual_block_config)
model.summary()

Model: "model"
________________________________________________________________________________
Layer (type) Output Shape Param # Connected to

https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 11/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory
================================================================================
main_input (InputLayer) [(30, 224, 224, 3)] 0
________________________________________________________________________________
conv1 (Conv2D) (30, 112, 112, 64) 9472 main_input[0][0
________________________________________________________________________________
maxpool_1 (MaxPooling2D) (30, 56, 56, 64) 0 conv1[0][0]
________________________________________________________________________________
conv2_1_main_1 (Conv2D) (30, 56, 56, 64) 4160 maxpool_1[0][0]
________________________________________________________________________________
conv2_1_main_2 (Conv2D) (30, 56, 56, 64) 36928 conv2_1_main_1[
________________________________________________________________________________
conv2_1_shortcut_1 (Conv2D) (30, 56, 56, 256) 16640 maxpool_1[0][0]
________________________________________________________________________________
conv2_1_main_3 (Conv2D) (30, 56, 56, 256) 16640 conv2_1_main_2[
________________________________________________________________________________
conv2_1_add_junction (Add) (30, 56, 56, 256) 0 conv2_1_shortcu
conv2_1_main_3[
________________________________________________________________________________
conv2_1_activation_output (Acti (30, 56, 56, 256) 0 conv2_1_add_jun
________________________________________________________________________________
conv2_2_main_1 (Conv2D) (30, 56, 56, 64) 16448 conv2_1_activat
________________________________________________________________________________
conv2_2_main_2 (Conv2D) (30, 56, 56, 64) 36928 conv2_2_main_1[
________________________________________________________________________________
conv2_2_main_3 (Conv2D) (30, 56, 56, 256) 16640 conv2_2_main_2[
________________________________________________________________________________
conv2_2_add_junction (Add) (30, 56, 56, 256) 0 conv2_1_activat
conv2_2_main_3[
________________________________________________________________________________
conv2_2_activation_output (Acti (30, 56, 56, 256) 0 conv2_2_add_jun
________________________________________________________________________________
conv2_3_main_1 (Conv2D) (30, 56, 56, 64) 16448 conv2_2_activat
________________________________________________________________________________
conv2_3_main_2 (Conv2D) (30, 56, 56, 64) 36928 conv2_3_main_1[
________________________________________________________________________________
conv2_3_main_3 (Conv2D) (30, 56, 56, 256) 16640 conv2_3_main_2[
________________________________________________________________________________
conv2_3_add_junction (Add) (30, 56, 56, 256) 0 conv2_2_activat
conv2_3_main_3[
________________________________________________________________________________
conv2_3_activation_output (Acti (30, 56, 56, 256) 0 conv2_3_add_jun
________________________________________________________________________________
conv3_1_main_1 (Conv2D) (30, 56, 56, 128) 32896 conv2_3_activat
________________________________________________________________________________
conv3_1_main_2 (Conv2D) (30, 56, 56, 128) 147584 conv3_1_main_1[
________________________________________________________________________________
conv3_1_shortcut_1 (Conv2D) (30, 28, 28, 512) 131584 conv2_3_activat
________________________________________________________________________________
conv3_1_main_3 (Conv2D) (30, 28, 28, 512) 66048 conv3_1_main_2[
________________________________________________________________________________
conv3_1_add_junction (Add) (30, 28, 28, 512) 0 conv3_1_shortcu
conv3_1_main_3[
________________________________________________________________________________
conv3_1_activation_output (Acti (30, 28, 28, 512) 0 conv3_1_add_jun
________________________________________________________________________________
conv3 2 main 1 (Conv2D) (30, 28, 28, 128) 65664 conv3 1 activat

https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 12/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory

Resnet50 Model gure

tf.keras.utils.plot_model(model)

https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 13/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory

Create ResNet34 Model

resNet34_residual_block_config = get_resnet34_config()
model = build_model(resNet34_residual_block_config)
model.summary()

Model: "model_1"
________________________________________________________________________________
Layer (type) Output Shape Param # Connected to
================================================================================
main_input (InputLayer) [(30, 224, 224, 3)] 0
________________________________________________________________________________
conv1 (Conv2D) (30, 112, 112, 64) 9472 main_input[0][0
________________________________________________________________________________
maxpool_1 (MaxPooling2D) (30, 56, 56, 64) 0 conv1[0][0]
________________________________________________________________________________
conv2_1_main_1 (Conv2D) (30, 56, 56, 64) 36928 maxpool_1[0][0]
________________________________________________________________________________
conv2_1_main_2 (Conv2D) (30, 56, 56, 64) 36928 conv2_1_main_1[
________________________________________________________________________________
conv2_1_add_junction (Add) (30, 56, 56, 64) 0 maxpool_1[0][0]
conv2_1_main_2[
________________________________________________________________________________
conv2_1_activation_output (Acti (30, 56, 56, 64) 0 conv2_1_add_jun
________________________________________________________________________________
conv2_2_main_1 (Conv2D) (30, 56, 56, 64) 36928 conv2_1_activat
________________________________________________________________________________
conv2_2_main_2 (Conv2D) (30, 56, 56, 64) 36928 conv2_2_main_1[
________________________________________________________________________________
conv2_2_add_junction (Add) (30, 56, 56, 64) 0 conv2_1_activat
conv2_2_main_2[
________________________________________________________________________________
conv2_2_activation_output (Acti (30, 56, 56, 64) 0 conv2_2_add_jun
________________________________________________________________________________
conv2_3_main_1 (Conv2D) (30, 56, 56, 64) 36928 conv2_2_activat
________________________________________________________________________________
conv2_3_main_2 (Conv2D) (30, 56, 56, 64) 36928 conv2_3_main_1[
________________________________________________________________________________
conv2_3_add_junction (Add) (30, 56, 56, 64) 0 conv2_2_activat
conv2_3_main_2[
________________________________________________________________________________
conv2_3_activation_output (Acti (30, 56, 56, 64) 0 conv2_3_add_jun

https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 14/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory
________________________________________________________________________________
conv3_1_main_1 (Conv2D) (30, 56, 56, 128) 73856 conv2_3_activat
________________________________________________________________________________
conv3_1_shortcut_1 (Conv2D) (30, 28, 28, 128) 8320 conv2_3_activat
________________________________________________________________________________
conv3_1_main_2 (Conv2D) (30, 28, 28, 128) 147584 conv3_1_main_1[
________________________________________________________________________________
conv3_1_add_junction (Add) (30, 28, 28, 128) 0 conv3_1_shortcu
conv3_1_main_2[
________________________________________________________________________________
conv3_1_activation_output (Acti (30, 28, 28, 128) 0 conv3_1_add_jun
________________________________________________________________________________
conv3_2_main_1 (Conv2D) (30, 28, 28, 128) 147584 conv3_1_activat
________________________________________________________________________________
conv3_2_main_2 (Conv2D) (30, 28, 28, 128) 147584 conv3_2_main_1[
________________________________________________________________________________
conv3_2_add_junction (Add) (30, 28, 28, 128) 0 conv3_1_activat
conv3_2_main_2[
________________________________________________________________________________
conv3_2_activation_output (Acti (30, 28, 28, 128) 0 conv3_2_add_jun
________________________________________________________________________________
conv3_3_main_1 (Conv2D) (30, 28, 28, 128) 147584 conv3_2_activat

ResNet34 Model Figure

tf.keras.utils.plot_model(model)

https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 15/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory

Training Prep

Callbacks Declaration

Learning Rate Scheduler

def lr_control(epoch, learning_rate):
  #The paper talks about reducing the learning rate by 4% every 8 epochs
  tf.print("inside lr_control, epoch =", epoch, " lr = ", learning_rate)
  #Checking if 8 epochs are complete
  if epoch > 7 and epoch%8 == 0 :
    # Reducing the learning rate by 10%
    return learning_rate* 0.9
  else:
i
https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 16/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory
    return learning_rate

lrScheduler = tf.keras.callbacks.LearningRateScheduler(schedule=lr_control, verbose=1

Model Checkpoint

checkpoint = tf.keras.callbacks.ModelCheckpoint(filepath = checkpoint_filePath
                                                , monitor='val_loss'
                                                , verbose = 1
                                                , save_best_only = True
                                                , save_weights_only = False
                                                )

Early Stopper

earlyStopper = tf.keras.callbacks.EarlyStopping(monitor='val_loss'
                                                , min_delta = 0.0001
                                                , patience = 9
                                                , verbose=1
                                                , restore_best_weights=True
                                                )

Model Compilation

optimizer = SGD(learning_rate=0.00001, momentum=0.9)

model.compile(optimizer=optimizer
              , loss = 'categorical_crossentropy'
              , metrics = [ 'accuracy']
              )

Train Model
Training ResNet34 to avoid heavy computation on Colab

# Start the training process and collect the metrics data for plotting
metrics = model.fit(training_datasource
          , epochs=50
https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 17/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory
          , batch_size=batch_size
          , validation_data = validation_datasource
          , callbacks = [lrScheduler, checkpoint, earlyStopper]
          )

Epoch 1/50
inside lr_control, epoch = 0 lr = 9.999999747378752e-06

Epoch 00001: LearningRateScheduler reducing learning rate to 9.999999747378752e-

180/180 [==============================] - 70s 194ms/step - loss: 1.5451 - accur

Epoch 00001: val_loss improved from inf to 1.50194, saving model to /content/dri
/usr/local/lib/python3.7/dist-packages/tensorflow/python/keras/utils/generic_uti
category=CustomMaskWarning)
Epoch 2/50
inside lr_control, epoch = 1 lr = 9.999999747378752e-06

Epoch 00002: LearningRateScheduler reducing learning rate to 9.999999747378752e-

180/180 [==============================] - 35s 194ms/step - loss: 1.4661 - accur

Epoch 00002: val_loss improved from 1.50194 to 1.47287, saving model to /content
Epoch 3/50
inside lr_control, epoch = 2 lr = 9.999999747378752e-06

Epoch 00003: LearningRateScheduler reducing learning rate to 9.999999747378752e-

180/180 [==============================] - 36s 200ms/step - loss: 1.4366 - accur

Epoch 00003: val_loss improved from 1.47287 to 1.44669, saving model to /content
Epoch 4/50
inside lr_control, epoch = 3 lr = 9.999999747378752e-06

Epoch 00004: LearningRateScheduler reducing learning rate to 9.999999747378752e-

180/180 [==============================] - 35s 196ms/step - loss: 1.4103 - accur

Epoch 00004: val_loss improved from 1.44669 to 1.42079, saving model to /content
Epoch 5/50
inside lr_control, epoch = 4 lr = 9.999999747378752e-06

Epoch 00005: LearningRateScheduler reducing learning rate to 9.999999747378752e-

180/180 [==============================] - 36s 199ms/step - loss: 1.3799 - accur

Epoch 00005: val_loss improved from 1.42079 to 1.38507, saving model to /content
Epoch 6/50
inside lr_control, epoch = 5 lr = 9.999999747378752e-06

Epoch 00006: LearningRateScheduler reducing learning rate to 9.999999747378752e-

180/180 [==============================] - 35s 197ms/step - loss: 1.3470 - accur

Epoch 00006: val_loss improved from 1.38507 to 1.35605, saving model to /content
Epoch 7/50
inside lr_control, epoch = 6 lr = 9.999999747378752e-06

Epoch 00007: LearningRateScheduler reducing learning rate to 9.999999747378752e-

180/180 [==============================] - 36s 198ms/step - loss: 1.3165 - accur

Epoch 00007: val_loss improved from 1.35605 to 1.31727, saving model to /content
Epoch 8/50
https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 18/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory
inside lr_control, epoch = 7 lr = 9.999999747378752e-06

Epoch 00008: LearningRateScheduler reducing learning rate to 9.999999747378752e-

180/180 [==============================] - 36s 198ms/step - loss: 1.2877 - accur

Epoch 00008: val_loss improved from 1.31727 to 1.29456, saving model to /content
Epoch 9/50

Plot Loss and Accuracy

import matplotlib.pyplot as plt
acc = metrics.history['accuracy']
val_acc = metrics.history['val_accuracy']

loss = metrics.history['loss']
val_loss = metrics.history['val_loss']

epochs_range = range(len(metrics.history['accuracy']))

plt.figure(figsize=(8, 4))
plt.subplot(1, 2, 1)
plt.plot(epochs_range, acc, label='Training Accuracy')
plt.plot(epochs_range, val_acc, label='Validation Accuracy')
plt.legend(loc='lower right')
plt.title('Training and Validation Accuracy')

plt.subplot(1, 2, 2)
plt.plot(epochs_range, loss, label='Training Loss')
plt.plot(epochs_range, val_loss, label='Validation Loss')
plt.legend(loc='upper right')
plt.title('Training and Validation Loss')
plt.show()

https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 19/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory

Test the Model

predictions = []
actuals=[]

for i, (images, labels) in enumerate( test_datasource):
  pred = model(images)
  for j in range(len(labels)):
    actuals.append( labels[j])
    predictions.append(pred[j])

# Printing a few labels and predictions to ensure that there are no dead-Relus
for j in range(10):
  print(labels[j].numpy(), "\t", pred[j].numpy())

[0. 1. 0. 0. 0.] [0.0235266 0.10897809 0.3271751 0.03340529 0.5069149

[0. 1. 0. 0. 0.] [0.06303899 0.3555932 0.147747 0.41043457 0.02318621
[0. 1. 0. 0. 0.] [0.12345209 0.6606989 0.02768849 0.11548411 0.07267643
[0. 1. 0. 0. 0.] [0.25657988 0.2589431 0.09734413 0.36727422 0.0198587
[0. 0. 0. 1. 0.] [0.08905374 0.2783467 0.0774235 0.5402862 0.0148899
[0. 0. 1. 0. 0.] [1.1086810e-06 2.3582480e-03 9.1496962e-01 7.3341923e-0
[0. 0. 0. 0. 1.] [0.02767825 0.00861757 0.06848286 0.001408 0.8938133
[0. 0. 1. 0. 0.] [5.0281787e-06 5.5233145e-04 9.9243379e-01 2.6762958e-0
[0. 0. 0. 1. 0.] [0.2789667 0.16835254 0.0278518 0.5011415 0.0236875
[0. 0. 0. 0. 1.] [0.01274782 0.03140369 0.17357181 0.00523592 0.7770408

Confusion Matrix

pd.DataFrame(tf.math.confusion_matrix(
    np.argmax(actuals, axis=1), np.argmax(predictions, axis=1), num_classes=num_class
    , columns = test_image_dataset.class_names
    , index =  test_image_dataset.class_names)

dog bird flower other berry

dog 278 44 16 44 18

bird 76 166 37 59 62

ﬂower 12 21 283 13 71

other 45 54 37 237 27

berry 21 28 43 16 292

Training for 50 more epochs

https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 20/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory

# Start the training process and collect the metrics data for plotting
metrics = model.fit(training_datasource
          , epochs=50
          , batch_size=batch_size
          , validation_data = validation_datasource
          , callbacks = [lrScheduler, checkpoint, earlyStopper]
          )

Epoch 1/50
inside lr_control, epoch = 0 lr = 5.314409918355523e-06

Epoch 00001: LearningRateScheduler reducing learning rate to 5.314409918355523e-

180/180 [==============================] - 37s 204ms/step - loss: 0.9105 - accur

Epoch 00001: val_loss improved from inf to 0.98180, saving model to /content/dri
/usr/local/lib/python3.7/dist-packages/tensorflow/python/keras/utils/generic_uti
category=CustomMaskWarning)
Epoch 2/50
inside lr_control, epoch = 1 lr = 5.314409918355523e-06

Epoch 00002: LearningRateScheduler reducing learning rate to 5.314409918355523e-

180/180 [==============================] - 35s 195ms/step - loss: 0.9078 - accur

Epoch 00002: val_loss did not improve from 0.98180

Epoch 3/50
inside lr_control, epoch = 2 lr = 5.314409918355523e-06

Epoch 00003: LearningRateScheduler reducing learning rate to 5.314409918355523e-

180/180 [==============================] - 35s 197ms/step - loss: 0.8972 - accur

Epoch 00003: val_loss improved from 0.98180 to 0.95182, saving model to /content
Epoch 4/50
inside lr_control, epoch = 3 lr = 5.314409918355523e-06

Epoch 00004: LearningRateScheduler reducing learning rate to 5.314409918355523e-

180/180 [==============================] - 36s 197ms/step - loss: 0.8965 - accur

Epoch 00004: val_loss did not improve from 0.95182

Epoch 5/50
inside lr_control, epoch = 4 lr = 5.314409918355523e-06

Epoch 00005: LearningRateScheduler reducing learning rate to 5.314409918355523e-

180/180 [==============================] - 36s 197ms/step - loss: 0.8928 - accur

Epoch 00005: val_loss did not improve from 0.95182

Epoch 6/50
inside lr_control, epoch = 5 lr = 5.314409918355523e-06

Epoch 00006: LearningRateScheduler reducing learning rate to 5.314409918355523e-

180/180 [==============================] - 36s 198ms/step - loss: 0.8837 - accur

Epoch 00006: val_loss did not improve from 0.95182

Epoch 7/50
inside lr_control, epoch = 6 lr = 5.314409918355523e-06

https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 21/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory

Epoch 00007: LearningRateScheduler reducing learning rate to 5.314409918355523e-

180/180 [==============================] - 35s 196ms/step - loss: 0.8830 - accur

Epoch 00007: val_loss did not improve from 0.95182

Epoch 8/50
inside lr_control, epoch = 7 lr = 5.314409918355523e-06

Epoch 00008: LearningRateScheduler reducing learning rate to 5.314409918355523e-

180/180 [==============================] - 36s 197ms/step - loss: 0.8761 - accur

Epoch 00008: val_loss improved from 0.95182 to 0.92298, saving model to /content
Epoch 9/50

https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 22/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory

Regenerating the Confusion Matrix

[0. 1. 0. 0. 0.] [0.01077838 0.34070885 0.3440716 0.05659881 0.24784239

[1. 0. 0. 0. 0.] [9.2472053e-01 2.7816322e-02 4.1169493e-04 4.6582796e-0
[0. 1. 0. 0. 0.] [0.5207156 0.25393432 0.06951063 0.10588971 0.04994976
[1. 0. 0. 0. 0.] [0.8596171 0.05024306 0.00718466 0.08116411 0.00179107
[0. 0. 1. 0. 0.] [8.9897007e-10 1.3873712e-05 9.9563569e-01 4.1923081e-0
[0. 0. 0. 0. 1.] [0.01755744 0.1187838 0.21793851 0.05667475 0.58904546
[1. 0. 0. 0. 0.] [0.4189833 0.4218033 0.04952971 0.03220975 0.07747391
[0. 0. 0. 1. 0.] [5.75252634e-05 1.39359403e-02 6.97477162e-01 2.7825748
1.02719115e-02]
[0. 0. 0. 1. 0.] [0.03660994 0.15446916 0.09701248 0.30783463 0.4040738
[0. 1. 0. 0. 0.] [0.39282748 0.5328101 0.0057617 0.00335127 0.06524937

pd.DataFrame(tf.math.confusion_matrix(
                  np.argmax(actuals, axis=1)
                  , np.argmax(predictions, axis=1)
                  , num_classes=num_classes
                  , dtype=tf.dtypes.int32).numpy()
    , columns = test_image_dataset.class_names
    , index =  test_image_dataset.class_names)

dog bird flower other berry

dog 281 44 4 67 4

bird 46 234 21 75 24

ﬂower 4 38 266 43 49

other 20 57 22 285 16

berry 12 54 39 19 276

https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 23/24
7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.ipynb - Colaboratory

https://colab.research.google.com/drive/1TbmgZbXTvHQhxgiwaS2lK9B5_8s-bfm5#printMode=true 24/24

Step by Step Learning To Sap Adobe Forms
100% (7)
Step by Step Learning To Sap Adobe Forms
27 pages
Deep Learning TensorFlow and Keras
No ratings yet
Deep Learning TensorFlow and Keras
454 pages
Jobtrac Quick Reference Guide
No ratings yet
Jobtrac Quick Reference Guide
2 pages
How To Configure Browser For R13 Using TOCF (EE) in Jboss
No ratings yet
How To Configure Browser For R13 Using TOCF (EE) in Jboss
8 pages
BreastCancer EXP
No ratings yet
BreastCancer EXP
8 pages
Deep Learning Using Python + Keras (Chapter 3) - ResNet - CodeProject
No ratings yet
Deep Learning Using Python + Keras (Chapter 3) - ResNet - CodeProject
24 pages
Applied Machine and Deep Learning
No ratings yet
Applied Machine and Deep Learning
34 pages
Biologically Inspired Deep Residual Networks
No ratings yet
Biologically Inspired Deep Residual Networks
10 pages
DeepLearningForVisionSystems Ch5 AlexNet
No ratings yet
DeepLearningForVisionSystems Ch5 AlexNet
32 pages
DEEPLEARNINGTUTORIAL.ipynb-Colaboratory
No ratings yet
DEEPLEARNINGTUTORIAL.ipynb-Colaboratory
8 pages
FALLSEM2024-25 BCSE332P LO VL2024250102168 2024-09-09 Reference-Material-I
No ratings yet
FALLSEM2024-25 BCSE332P LO VL2024250102168 2024-09-09 Reference-Material-I
9 pages
Introduction To Keras!: Vincent Lepetit!
No ratings yet
Introduction To Keras!: Vincent Lepetit!
33 pages
DLCV Ch3 Convolutional Neural Network
No ratings yet
DLCV Ch3 Convolutional Neural Network
45 pages
DL Programs
No ratings yet
DL Programs
12 pages
Resnet Model Code Explanation
No ratings yet
Resnet Model Code Explanation
2 pages
CV Assignment - Object Recognition Using CNN
No ratings yet
CV Assignment - Object Recognition Using CNN
1 page
Machine Vison Homework 10
No ratings yet
Machine Vison Homework 10
11 pages
detect
No ratings yet
detect
6 pages
Big Data Machine Learning Lab 4
No ratings yet
Big Data Machine Learning Lab 4
7 pages
Object Detection Webcam
No ratings yet
Object Detection Webcam
3 pages
Prac 1
No ratings yet
Prac 1
6 pages
361 Project Code
No ratings yet
361 Project Code
10 pages
ResNet_Deep_Learning_Presentation
No ratings yet
ResNet_Deep_Learning_Presentation
8 pages
C2_W1_Assignment
No ratings yet
C2_W1_Assignment
24 pages
VGG and Resnet
No ratings yet
VGG and Resnet
18 pages
lab 6 ml
No ratings yet
lab 6 ml
7 pages
IMPLEMENT A NEURAL NETWORK USING PYTHON
No ratings yet
IMPLEMENT A NEURAL NETWORK USING PYTHON
4 pages
Digit Recognizer Using CNN
No ratings yet
Digit Recognizer Using CNN
4 pages
Unit-5-1
No ratings yet
Unit-5-1
1 page
Vehicle Seat Vacancy Identification
No ratings yet
Vehicle Seat Vacancy Identification
4 pages
Explore the Implementation of CNNs in Python
No ratings yet
Explore the Implementation of CNNs in Python
10 pages
Deep-Learning Using Caffe Model
No ratings yet
Deep-Learning Using Caffe Model
5 pages
PS4 - Ritesh Jaiswal - Ritesh - 054
No ratings yet
PS4 - Ritesh Jaiswal - Ritesh - 054
8 pages
Csc413 Project Semantic Segmentation
No ratings yet
Csc413 Project Semantic Segmentation
84 pages
Weely Assignment-I VGG16
No ratings yet
Weely Assignment-I VGG16
5 pages
MICROPROJECT_REPORT_GROUP_2
No ratings yet
MICROPROJECT_REPORT_GROUP_2
15 pages
Ex 07
No ratings yet
Ex 07
2 pages
Res Net 4
No ratings yet
Res Net 4
23 pages
Dlv Lab Manual Print
No ratings yet
Dlv Lab Manual Print
29 pages
MVS_Expt8 Object Detection and Reconstruction Using CNN
No ratings yet
MVS_Expt8 Object Detection and Reconstruction Using CNN
5 pages
Report
No ratings yet
Report
15 pages
Deep Learning Lab With Output
No ratings yet
Deep Learning Lab With Output
12 pages
DEEP LEARNING EXPERIMENTS
No ratings yet
DEEP LEARNING EXPERIMENTS
42 pages
Kanoria Shubham Anil 2023HT01569
No ratings yet
Kanoria Shubham Anil 2023HT01569
9 pages
Python Deep Learning Lab Programs (2)
No ratings yet
Python Deep Learning Lab Programs (2)
35 pages
A First Look On Nueral Network
No ratings yet
A First Look On Nueral Network
8 pages
Handwritten Digit Recognition Using a Neural Network (2)
No ratings yet
Handwritten Digit Recognition Using a Neural Network (2)
4 pages
DSE_3141_Deep_Learning_Lab_Manual_2024_Week4
No ratings yet
DSE_3141_Deep_Learning_Lab_Manual_2024_Week4
14 pages
MICCAI Educational Challenge
No ratings yet
MICCAI Educational Challenge
3 pages
Project
No ratings yet
Project
3 pages
Deep Residual Learning For Image Recognition
No ratings yet
Deep Residual Learning For Image Recognition
16 pages
EXP5-VGG16v2
No ratings yet
EXP5-VGG16v2
7 pages
DL Ex 13
No ratings yet
DL Ex 13
5 pages
How To Use Colab
100% (1)
How To Use Colab
13 pages
Experiment 3
No ratings yet
Experiment 3
5 pages
Lab Manual
No ratings yet
Lab Manual
45 pages
C2_W1_Assignment
No ratings yet
C2_W1_Assignment
25 pages
nndlmac
No ratings yet
nndlmac
9 pages
EE292A Lecture 2.ML - Hardware - 2 - April9
No ratings yet
EE292A Lecture 2.ML - Hardware - 2 - April9
13 pages
Object Detection Webcam
No ratings yet
Object Detection Webcam
3 pages
Res Net
No ratings yet
Res Net
46 pages
PHP Package Mastery: 100 Essential Tools in One Hour - 2024 Edition
From Everand
PHP Package Mastery: 100 Essential Tools in One Hour - 2024 Edition
Kanto
No ratings yet
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
Introduction To Internet Of Things - - Unit 8 - Week 5
No ratings yet
Introduction To Internet Of Things - - Unit 8 - Week 5
1 page
PeopleSoft Technical Training - Day 1 PDF
No ratings yet
PeopleSoft Technical Training - Day 1 PDF
249 pages
Fortinet - Diag Sys Top
No ratings yet
Fortinet - Diag Sys Top
3 pages
Abap Project Demo On Wednesday 11.30 Am: BASICS (1 Day)
No ratings yet
Abap Project Demo On Wednesday 11.30 Am: BASICS (1 Day)
2 pages
Function Arguments and Keyword Arguments
No ratings yet
Function Arguments and Keyword Arguments
13 pages
Using Order of Operations: Answers
No ratings yet
Using Order of Operations: Answers
3 pages
Annual Examination Class Xi (2020-21)
No ratings yet
Annual Examination Class Xi (2020-21)
3 pages
Laboratory 2
No ratings yet
Laboratory 2
3 pages
Assignment No:-2: Object Oriented Analysis & Design
No ratings yet
Assignment No:-2: Object Oriented Analysis & Design
15 pages
Documentation Structure-Computer Science CUEA
No ratings yet
Documentation Structure-Computer Science CUEA
5 pages
Hanumanth 3+ Testing Resume
No ratings yet
Hanumanth 3+ Testing Resume
3 pages
(Ebooks PDF) Download C# 7 Quick Syntax Reference: A Pocket Guide To The Language, APIs, and Library 2nd Edition Mikael Olsson Full Chapters
100% (4)
(Ebooks PDF) Download C# 7 Quick Syntax Reference: A Pocket Guide To The Language, APIs, and Library 2nd Edition Mikael Olsson Full Chapters
52 pages
Yashwant Kanitker - VC++, COM and Beyond
100% (3)
Yashwant Kanitker - VC++, COM and Beyond
20 pages
Lab Questions: Presented By: Tekendra Nath Yogi
No ratings yet
Lab Questions: Presented By: Tekendra Nath Yogi
19 pages
Outbound Delivery Automatic Packing
No ratings yet
Outbound Delivery Automatic Packing
52 pages
Log
No ratings yet
Log
117 pages
Page No: Acknowledgement List of Tables List of Figures List of Symbols, Abbrevations Chapter No Title 1
No ratings yet
Page No: Acknowledgement List of Tables List of Figures List of Symbols, Abbrevations Chapter No Title 1
2 pages
Make A Code Book With Markdown by Christopher Topalian
No ratings yet
Make A Code Book With Markdown by Christopher Topalian
48 pages
Nuwangi_kan-1721977424597-568261-E195687-1708404515495-366222-E195687 Programming (2)
No ratings yet
Nuwangi_kan-1721977424597-568261-E195687-1708404515495-366222-E195687 Programming (2)
84 pages
Samuel Erowele CV
No ratings yet
Samuel Erowele CV
4 pages
Pasdf
No ratings yet
Pasdf
139 pages
Model Based Testing
No ratings yet
Model Based Testing
43 pages
Extracto ISO de XBOX 360
No ratings yet
Extracto ISO de XBOX 360
3 pages
JAVA PROGRAMMING
No ratings yet
JAVA PROGRAMMING
3 pages
Manual Update Instructions - Changelog
No ratings yet
Manual Update Instructions - Changelog
18 pages
Chapter 3 Function
No ratings yet
Chapter 3 Function
73 pages
r20 I-II PWC++ Lab Manual 20ecl203 22-23
No ratings yet
r20 I-II PWC++ Lab Manual 20ecl203 22-23
27 pages

DeepLearningForVisionSystems Ch5 ResNet

Uploaded by

DeepLearningForVisionSystems Ch5 ResNet

Uploaded by

7/25/2021 DeepLearningForVisionSystems-Ch5-ResNet.

This is an implementation of ResNet Image Classi er based on the paper:

Initializations and Imports

Data Loading and Preprocessing

Procuring the Dataset

5 categories found = ['dog', 'bird', 'flower', 'other', 'berry']

Training and Validation Dataset

Found 6000 files belonging to 5 classes.

images = (30, 224, 224, 3)

Found 2000 files belonging to 5 classes.

De ne the Residual Block

De ne the architecture of the model

ResNet50 residual block con gurations

ResNet34 model con guration

Building the model

Create ResNet50 Model

Resnet50 Model gure

Create ResNet34 Model

ResNet34 Model Figure

Learning Rate Scheduler

Epoch 00001: LearningRateScheduler reducing learning rate to 9.999999747378752e-

Epoch 00002: LearningRateScheduler reducing learning rate to 9.999999747378752e-

Epoch 00003: LearningRateScheduler reducing learning rate to 9.999999747378752e-

Epoch 00004: LearningRateScheduler reducing learning rate to 9.999999747378752e-

Epoch 00005: LearningRateScheduler reducing learning rate to 9.999999747378752e-

Epoch 00006: LearningRateScheduler reducing learning rate to 9.999999747378752e-

Epoch 00007: LearningRateScheduler reducing learning rate to 9.999999747378752e-

Epoch 00008: LearningRateScheduler reducing learning rate to 9.999999747378752e-

Plot Loss and Accuracy

Test the Model

[0. 1. 0. 0. 0.] [0.0235266 0.10897809 0.3271751 0.03340529 0.5069149

dog bird flower other berry

Training for 50 more epochs

Epoch 00001: LearningRateScheduler reducing learning rate to 5.314409918355523e-

Epoch 00002: LearningRateScheduler reducing learning rate to 5.314409918355523e-

Epoch 00002: val_loss did not improve from 0.98180

Epoch 00003: LearningRateScheduler reducing learning rate to 5.314409918355523e-

Epoch 00004: LearningRateScheduler reducing learning rate to 5.314409918355523e-

Epoch 00004: val_loss did not improve from 0.95182

Epoch 00005: LearningRateScheduler reducing learning rate to 5.314409918355523e-

Epoch 00005: val_loss did not improve from 0.95182

Epoch 00006: LearningRateScheduler reducing learning rate to 5.314409918355523e-

Epoch 00006: val_loss did not improve from 0.95182

Epoch 00007: LearningRateScheduler reducing learning rate to 5.314409918355523e-

Epoch 00007: val_loss did not improve from 0.95182

Epoch 00008: LearningRateScheduler reducing learning rate to 5.314409918355523e-

Regenerating the Confusion Matrix

[0. 1. 0. 0. 0.] [0.01077838 0.34070885 0.3440716 0.05659881 0.24784239

dog bird flower other berry

You might also like