DL-UNIT-3
DL-UNIT-3
Input Image
First of all, the input image will be broken down into pixels. If it is a black
and white image, it will only have one layer and pixels will be interpreted as
2D array with the value from 0 to 255. If it is colored image, it will have 3
layers (red, green, blue) and will be interpreted as 3D array.
Convolution Layer
This is the first layer that filters the input images. Its purpose is to extract
features from the image. It captures color, edges, gradient orientation, and
other features so it can be differentiated.
This layer is usually added after the convolutional layer. Pooling layer
reduces the spatial size of the output from convolutional layer and extracts
dominant features. Pooling layer can be differentiated into two types, which
are:
· Max Pooling
It returns the maximum value from the portion of the image covered by
kernel. This layer discards noisy activation and help over-fitting by providing
an abstracted form of the representation.
· Average Pooling
This type of pooling returns the average value from the portion of the image
covered by kernel.
There’s no way we can say max pooling is better than average pooling or vice
versa. The type of pooling we use depends on the input image that we “feed”
the model. From the image above, we can see that in case (a), it is better to
use average pooling but in case (b), max pooling is better.
Fully Connected Input Layer (Flatten)
Fully connected layers are layers where all the inputs from one layer are
connected to every activation unit of the next layer. The layer takes the
output of the pooling and flatten them into single vector.
The flattened matrix goes through a fully connected layer to classify the
images. The purpose of this layer is to classify the image into a label. It takes
the output of previous layer and predicts the best label by applying weights
and “voting”. The final output will be the probabilities for each label.
3. Describe convolutional neural network in detail.
4. Explain Alexnet in detail.
5. Describe VGG 16 in detail.
6. Summarize Multiple filters at one time in CNN in detail.
7. Classify ResNet and VGG.
8. Discuss any one convnet architecture in detail.