Deep Learning For Computer Vision
Deep Learning For Computer Vision
computer vision
Dr. Noman Islam
Introduction
• Convolutional neural networks, also known as
Convnets, is a type of deep-learning model
almost universally used in computer vision
applications
• Image classification is the task of classifying a
whole image as a single label
• For example, an image classification task could
label an image as a dog or a cat, given an image
is either a dog or a cat
Creating a ConvNet in Keras
Summary of the model
Convolution Operation
• The fundamental difference between a
densely connected layer and a convolution
layer is this:
– Dense layers learn global patterns in their input
feature space (for example, for a MNIST digit,
patterns involving all pixels),
– whereas convolution layers learn local patterns in
the case of images, patterns found in small 2D
windows of the inputs.
The patterns are translation invariant
• After learning a certain pattern in the lower-right
corner of a picture, a convnet can recognize it
anywhere: for example, in the upper-left corner.
• A densely connected network would have to learn
the pattern anew if it appeared at a new location.
• This makes convnets data efficient when processing
images (because the visual world is fundamentally
translation invariant): they need fewer training
samples to learn representations that have
generalization power.
Learn spatial hierarchies of patterns