Unit-V
However, for black-and-white images there is only one channel, and the concept is the same.
Here, we have considered an input image of size 28x28x3 pixels. If we feed this into a fully
connected neural network, every single neuron in the first hidden layer already needs about
2,352 weights (28 × 28 × 3).
Any generic input image will have at least 200x200x3 pixels. The number of weights per neuron in the
first hidden layer then becomes a whopping 120,000. If this is just the first hidden layer,
imagine the number of parameters needed to process an entire complex image-set.
This leads to over-fitting and isn't practical. Hence, we cannot make use of fully
connected networks.
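As a quick back-of-the-envelope check of these numbers (a minimal sketch, counting the weights feeding a single neuron of a fully connected first hidden layer):

```python
# weights feeding ONE neuron of the first fully connected hidden layer
weights_small_image   = 28 * 28 * 3      # = 2,352 for a 28x28 RGB image
weights_typical_image = 200 * 200 * 3    # = 120,000 for a 200x200 RGB image
print(weights_small_image, weights_typical_image)
```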
The whole network still has a loss function, and all the tips and tricks we developed
for ordinary neural networks still apply to Convolutional Neural Networks.
These neurons learn how to convert input signals (e.g. picture of a cat) into
corresponding output signals (e.g. the label “cat”), forming the basis of automated
recognition.
Let’s take the example of automatic image recognition. The process
of determining whether a picture contains a cat involves an activation function. If
the picture resembles prior cat images the neurons have seen before, the
label “cat” would be activated.
Hence, the more labeled images the neurons are exposed to, the better they learn
how to recognize other, unlabelled images. We call this the process
of training neurons.
There are four layered concepts we should understand in Convolutional Neural Networks:
1. Convolution,
2. ReLu,
3. Pooling and
4. Full Connectedness (Fully Connected Layer).
Example of CNN:
Consider the image below:
Here, there are multiple renditions of X's and O's. This makes it tricky for the computer
to recognize them. The idea is that if the input signal looks like previous images the
network has seen before, the "image" reference signal will be mixed into,
or convolved with, the input signal. The resulting output signal is then passed on to
the next layer.
So, the computer understands every pixel. In this case, the white pixels are said to
be -1 while the black ones are 1. This is simply how we have chosen to represent
the pixels for a basic binary classification.
Now, if we simply searched and compared the pixel values of a standard image against
another rendition of 'X', we would get a lot of mismatched pixels.
We take small patches of the pixels called filters and try to match them in the
corresponding nearby locations to see if we get a match. By doing this, the
Convolutional Neural Network gets a lot better at seeing similarity than directly
trying to match the entire image.
Convolution of an Image
Convolution has the nice property of being translation invariant. Intuitively, this
means that each convolution filter represents a feature of interest (e.g., pixels in
letters), and the Convolutional Neural Network algorithm learns
which features make up the resulting reference (i.e., the letter).
Consider the above image – as you can see, we are done with the first 2 steps. We
considered a feature image (filter) and lined it up with a patch of the existing image.
We multiplied each image pixel by the corresponding feature pixel, and the products
are stored in another buffer feature image.
With this image, we completed the last 2 steps. We added the values, which gave us
the sum. We then divide this number by the total number of pixels in the feature
image. When that is done, the final value obtained is placed at the center of
the filtered image as shown below:
Now, we can move this filter around and do the same at any pixel in the image.
For better clarity, let’s consider another example:
As you can see, here after performing the first 4 steps we have the value 0.55! We
take this value and place it in the image as explained before. This is done in the
following image:
Similarly, we move the feature to every other position in the image and see how the
feature matches that area. So after doing this, we will get the output as:
Here we considered just one filter. Similarly, we will perform the same convolution
with every other filter to get the convolution output of each filter.
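As a concrete illustration, here is a minimal NumPy sketch of this filter-matching step. The 3×3 diagonal feature is a hypothetical example, and the pixel values are assumed to be -1 (white) or 1 (black) as described above:

```python
import numpy as np

def filter_match(image, feature):
    """Slide the feature over the image; at each position multiply the image
    patch by the feature pixel-by-pixel, add the products, and divide by the
    number of pixels in the feature."""
    ih, iw = image.shape
    fh, fw = feature.shape
    out = np.zeros((ih - fh + 1, iw - fw + 1))
    for r in range(out.shape[0]):
        for c in range(out.shape[1]):
            patch = image[r:r + fh, c:c + fw]
            out[r, c] = np.sum(patch * feature) / feature.size
    return out

# hypothetical 3x3 diagonal feature taken from an 'X'
feature = np.array([[ 1, -1, -1],
                    [-1,  1, -1],
                    [-1, -1,  1]])
# score_map = filter_match(image, feature)   # image: 2-D array of -1/1 pixels
```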
The output signal strength does not depend on where the features are located, but
simply on whether the features are present. Hence, a letter could be sitting
in different positions and the Convolutional Neural Network algorithm would still
be able to recognize it.
ReLU Layer
ReLU is an activation function. But, what is an activation function?
The Rectified Linear Unit (ReLU) transform function only activates a node if the input
is above a certain threshold (zero): while the input is below zero, the output is zero,
and once the input rises above the threshold, the output has a linear relationship with
the input.
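In code, ReLU is just an element-wise maximum with zero; a minimal NumPy sketch:

```python
import numpy as np

def relu(x):
    # negative values become zero; positive values pass through unchanged
    return np.maximum(0, x)

print(relu(np.array([-0.55, 0.11, -1.0, 0.33])))   # [0.   0.11 0.   0.33]
```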
The main aim is to remove all the negative values from the convolution output. All the
positive values remain the same, but all the negative values get changed to zero as
shown below:
So after we process this particular feature we get the following output:
Now, similarly we do the same process to all the other feature images as well:
Inputs from the convolution layer can
be "smoothed" to reduce the sensitivity of
the filters to noise and variations. This smoothing process is
called subsampling, and it can be achieved by taking averages or taking
the maximum over a sample of the signal.
Pooling Layer
In this layer we shrink the image stack into a smaller size. Pooling is done after
passing through the activation layer. We do this by implementing the following 4
steps:
1. Pick a window size (usually 2 or 3),
2. Pick a stride (usually 2),
3. Walk the window across the filtered image and
4. From each window, take the maximum value.
Let us understand this with an example. Consider performing pooling with a window
size of 2 and stride being 2 as well.
So in this case, we took window size to be 2 and we got 4 values to choose from.
From those 4 values, the maximum value there is 1 so we pick 1. Also, note that
we started out with a 7×7 matrix but now the same matrix after pooling came down
to 4×4.
But we need to move the window across the entire image. The procedure is
exactly the same as above, and we need to repeat it for the entire image.
Do note that this is for one filter. We need to do it for 2 other filters as well. This is
done and we arrive at the following result:
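A minimal NumPy sketch of this max-pooling step with window size 2 and stride 2, assuming a 2-D feature map such as the 7×7 matrix above (the last window is allowed to hang over the edge, which is why 7×7 shrinks to 4×4):

```python
import numpy as np

def max_pool(x, window=2, stride=2):
    h, w = x.shape
    out_h = -(-h // stride)   # ceiling division: 7 -> 4
    out_w = -(-w // stride)
    out = np.zeros((out_h, out_w))
    for r in range(out_h):
        for c in range(out_w):
            # take the maximum value inside the current window
            patch = x[r * stride:r * stride + window,
                      c * stride:c * stride + window]
            out[r, c] = patch.max()
    return out
```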
Well the easy part of this process is over. Next up, we need to stack up all these
layers!
But can we further reduce the image from 4×4 to something smaller?
Yes, we can! We need to perform the three operations (convolution, ReLU, and pooling) in
another iteration after the first pass. So after the second pass we arrive at a 2×2 matrix
as shown below:
The last layers in the network are fully connected, meaning that neurons of
preceding layers are connected to every neuron in subsequent layers.
This mimics high level reasoning where all possible pathways from
the input to output are considered.
Also, the fully connected layer is the final layer, where the classification actually happens.
Here we take our filtered and shrunk images and put them into one single list (a vector) as
shown below:
So next, when we feed in an 'X' or an 'O', there will be some elements in the vector that
will be high. Consider the image below: as you can see, for 'X' certain
elements are high, and similarly, for 'O' different elements
are high:
Well, what did we understand from the above image?
When the 1st, 4th, 5th, 10th and 11th values are high, we can classify the image
as 'X'. The concept is similar for the other letters as well – when
certain values are arranged the way they are, they can be mapped to
an actual letter or a number which we require. Simple, right?
Well, it is really easy. We just added the values which we found to be high (1st,
4th, 5th, 10th and 11th) from the vector table of 'X' and got the sum 5. We
did the exact same thing with the input image and got the value 4.56.
Dividing 4.56 by 5 gives a match probability of 0.91 for 'X'. Doing the same with the
vector table of 'O' gives an output of 0.51. Since 0.51 is less than 0.91, the input
image is classified as 'X'.
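A small sketch of this voting step; the "high" positions and values below are placeholders taken from the example above:

```python
# positions that are high in the reference vector for 'X'
# (1st, 4th, 5th, 10th and 11th entries, written with 0-based indexing)
x_high_positions = [0, 3, 4, 9, 10]

def match_score(candidate_vector, high_positions):
    # add the candidate's values at the reference's high positions and divide
    # by the reference sum (each reference high value is 1, so len() works)
    return sum(candidate_vector[i] for i in high_positions) / len(high_positions)

# e.g. if the input image's values at those positions sum to 4.56,
# the score is 4.56 / 5 = 0.91, higher than the 0.51 score against 'O',
# so the image is classified as 'X'
```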
🔹 1. AlexNet (2012)
✅ Main Idea:
AlexNet was the first deep CNN to achieve high performance on a large-scale
dataset (ImageNet). It proved that deep learning could outperform traditional
computer vision methods.
🧠 Architecture Design:
Input size: 227×227×3 (image)
Layers: 8 total
o 5 convolutional layers
o 3 fully connected layers
Uses:
o ReLU activation (instead of sigmoid/tanh for faster training)
o Max pooling to reduce spatial size
o Dropout to reduce overfitting in FC layers
o Data augmentation
Trained on 2 GPUs in parallel
💡 Innovations:
ReLU → faster training
Dropout → prevents overfitting
GPU-based training → reduced training time
⚖️Pros and Cons:
✅ Powerful and fast for its time
❌ Very large number of parameters (~60 million)
❌ High memory usage
🔹 2. ZFNet (2013)
✅ Main Idea:
Improved AlexNet by tweaking hyperparameters and made the model more
interpretable using deconvolutional visualizations.
🧠 Architecture Design:
Similar to AlexNet in structure (8 layers)
Key change:
o First convolutional layer's filter size reduced from 11×11 to 7×7
o Reduced stride for finer feature maps
Visualized intermediate feature maps to understand what CNN is
learning
💡 Innovations:
DeconvNet to visualize what filters learn
Adjusted filter size and stride to improve accuracy
⚖️Pros and Cons:
✅ Better than AlexNet with minor changes
❌ Still limited by depth
🔹 3. VGGNet (2014)
✅ Main Idea:
VGG showed that deeper networks (16–19 layers) improve performance. It
introduced a very simple and uniform architecture using only 3×3
convolutional filters.
🧠 Architecture Design:
Input: 224×224×3
VGG-16:
o 13 convolutional layers
o 3 fully connected layers
o 2×2 max pooling after every few conv layers
Uses 3×3 filters throughout the network
Same padding to maintain size
💡 Innovations:
Uniform architecture: easier to implement and scale
Replaced large filters with stacked 3×3 ones → better feature extraction
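For intuition, a quick parameter comparison showing why two stacked 3×3 convolutions are cheaper than a single 5×5 convolution covering the same receptive field (the channel count is a hypothetical example):

```python
C = 64  # hypothetical number of input and output channels

params_one_5x5 = 5 * 5 * C * C       # 102,400 weights (ignoring biases)
params_two_3x3 = 2 * 3 * 3 * C * C   #  73,728 weights, same 5x5 receptive field
```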
⚖️Pros and Cons:
✅ Great accuracy
✅ Easy to use for transfer learning
❌ Huge number of parameters (≈138M)
❌ Computationally expensive
🔹 5. ResNet (2015)
✅ Main Idea:
Very deep networks suffer from vanishing gradients, so ResNet introduced
skip (residual) connections that let gradients flow directly.
🧠 Architecture Design:
Residual block (a code sketch follows this list):
Output = F(x) + x
where F(x) is some transformation (e.g., 2 conv layers)
Many versions:
o ResNet-18, ResNet-34, ResNet-50, ResNet-101, ResNet-152
Input size: 224×224×3
Uses Batch Normalization, ReLU
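A minimal PyTorch sketch of the residual idea (an illustrative block, not the exact ResNet-50 bottleneck design):

```python
import torch.nn as nn

class ResidualBlock(nn.Module):
    """Output = F(x) + x, where F is two 3x3 convolutions with batch norm."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU()

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + x)   # skip connection lets gradients flow past F
```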
💡 Innovations:
Residual learning to allow very deep models
First network to train 152 layers effectively
⚖️Pros and Cons:
✅ Solved vanishing gradient
✅ Enabled ultra-deep CNNs
✅ Excellent transfer learning performance
❌ Slightly complex to implement
📊 Summary Table
| Model | Year | Layers | Main Idea | Key Innovation | Params |
|---|---|---|---|---|---|
| AlexNet | 2012 | 8 | Deep CNN + GPU + ReLU | ReLU, Dropout, Data Augmentation | ~60M |
| ZFNet | 2013 | 8 | Improved AlexNet + Visualization | DeconvNet, smaller filters | ~62M |
| VGG-16 | 2014 | 16 | Deeper, uniform architecture | Stacked 3×3 conv layers | ~138M |
| GoogLeNet | 2014 | 22 | Multi-scale convs (Inception) | 1×1 conv, no FC layers | ~5M |
| ResNet-50 | 2015 | 50 | Very deep, residual blocks | Skip connections | ~25M |
🔍 What Are Pretrained Models?
Pretrained models are deep learning models that have already been trained
on large datasets (like ImageNet) and are available for reuse. Instead of
training a new model from scratch (which is time-consuming and requires a lot
of data), you can use or fine-tune these pretrained models for your specific
task.
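A minimal sketch of reusing a pretrained model with PyTorch/torchvision (assuming torchvision ≥ 0.13 and a hypothetical 10-class task):

```python
import torch.nn as nn
from torchvision import models

# ResNet-50 pretrained on ImageNet; weights are downloaded automatically
model = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V1)

# freeze the pretrained feature extractor (feature-extraction style transfer)
for param in model.parameters():
    param.requires_grad = False

# replace the final fully connected layer for the new task
model.fc = nn.Linear(model.fc.in_features, 10)
```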
🎯 Benefits of Using Pretrained Models
| Advantage | Explanation |
|---|---|
| 🔧 Saves time | You skip the training-from-scratch process |
| 🧠 Requires less data | Works well even with small datasets (using fine-tuning or feature extraction) |
| 🚀 High performance | Based on training on huge datasets (like ImageNet with 1.2M images) |
| 🔁 Transfer learning | You can transfer knowledge to new tasks (e.g., classification, detection) |
Convolutional Autoencoder (CAE):
A Convolutional Autoencoder (CAE) is a type of autoencoder specifically
designed to work with image data. Instead of using fully connected layers like
traditional autoencoders, CAEs use convolutional and pooling layers to learn
spatial hierarchies and preserve local features in images.
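A minimal PyTorch sketch of a convolutional autoencoder for 28×28 grayscale images (the layer sizes are illustrative assumptions):

```python
import torch.nn as nn

class ConvAutoencoder(nn.Module):
    def __init__(self):
        super().__init__()
        # encoder: convolution + pooling shrink the image while keeping spatial structure
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),                              # 28x28 -> 14x14
            nn.Conv2d(16, 8, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),                              # 14x14 -> 7x7
        )
        # decoder: transposed convolutions upsample back to the input size
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(8, 16, kernel_size=2, stride=2), nn.ReLU(),     # 7x7 -> 14x14
            nn.ConvTranspose2d(16, 1, kernel_size=2, stride=2), nn.Sigmoid(),  # 14x14 -> 28x28
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))
```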
2. 🎯 Object Detection
What it means:
Detects and locates different objects in an image. It answers:
What is in the image?
Where is it?
How CNN helps:
CNNs scan the image and detect key features at different locations and sizes.
They help draw bounding boxes around each object.
Example:
In a traffic camera image, CNN detects cars, bikes, and pedestrians and draws
boxes around them.
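As an illustration, a pretrained CNN-based detector can be used off the shelf; a minimal sketch with torchvision (assuming torchvision ≥ 0.13), where the random tensor stands in for a real traffic-camera frame:

```python
import torch
from torchvision.models.detection import (
    fasterrcnn_resnet50_fpn, FasterRCNN_ResNet50_FPN_Weights,
)

# Faster R-CNN with a ResNet-50 backbone, pretrained on COCO
model = fasterrcnn_resnet50_fpn(weights=FasterRCNN_ResNet50_FPN_Weights.DEFAULT).eval()

image = torch.rand(3, 480, 640)           # stand-in for a real image (C, H, W in [0, 1])
with torch.no_grad():
    prediction = model([image])[0]        # dict with 'boxes', 'labels', 'scores'
```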
🔄 Summary Table
| Application | What CNN Does | Example Use Case |
|---|---|---|
| Content-Based Image Retrieval | Finds similar images based on content | Fashion, medical, artwork search |
| Object Detection | Identifies and locates multiple objects in images | Self-driving cars, surveillance |
| Natural Language Processing | Understands and classifies text or language patterns | Sentiment analysis, spam detection |
| Sequence Learning | Finds patterns in sequential data (1D signals) | ECG analysis, DNA sequence prediction |