ImageNet Classification with Deep Convolutional Neural Networks
Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton
Motivation
Classification goals:
•Make 1 guess about the label (Top-1 error)
•Make 5 guesses about the label (Top-5 error)
•No bounding box required
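As an illustration (not part of the original slides), a minimal NumPy sketch of how top-1 and top-5 error can be computed from a matrix of class scores:

```python
import numpy as np

def topk_error(scores, labels, k):
    # scores: (num_images, num_classes); labels: true class indices.
    # An image counts as correct if its true label is among the k
    # highest-scoring classes.
    topk = np.argsort(scores, axis=1)[:, -k:]
    correct = np.any(topk == labels[:, None], axis=1)
    return 1.0 - correct.mean()

scores = np.random.rand(10, 1000)             # dummy scores over 1000 classes
labels = np.random.randint(0, 1000, size=10)
print(topk_error(scores, labels, 1), topk_error(scores, labels, 5))
```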
Database: ImageNet
•15M images
•22K categories
•Images collected from the Web
•RGB images
•Variable resolution
•Human labelers (Amazon's Mechanical Turk crowd-sourcing)
Inputs (raw pixels) x1, …, xd with weights w1, …, wd and bias b
Output: f(w·x + b), where f is a nonlinear activation (e.g., the sigmoid)
reference : http://en.wikipedia.org/wiki/Sigmoid_function#mediaviewer/File:Gjl-t(x).svg
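An illustrative NumPy sketch (not from the slides) of this single-unit computation f(w·x + b) with a sigmoid nonlinearity:

```python
import numpy as np

def sigmoid(a):
    # f(a) = 1 / (1 + exp(-a))
    return 1.0 / (1.0 + np.exp(-a))

def neuron_output(x, w, b):
    # Output of a single unit: f(w . x + b)
    return sigmoid(np.dot(w, x) + b)

x = np.array([0.5, -1.2, 3.0])    # raw pixel inputs x1..xd
w = np.array([0.1, 0.4, -0.2])    # weights w1..wd
print(neuron_output(x, w, b=0.05))
```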
Multi-Layer Neural Networks
Input layer, hidden layer, output layer
Nonlinear classifier
Learning can be done by gradient descent (Back-Propagation algorithm)
Feed Forward Operation
Input layer: d features x(1), …, x(d), plus a bias unit
Hidden layer: units connected to the inputs by weights wji
Output layer: m outputs z1, …, zm, one for each class, connected to the hidden units by weights vkj
Notation for Weights
Use wji to denote the weight between input unit i and hidden unit j:
hidden unit j receives wji x(i) and outputs yj.
Use vkj to denote the weight between hidden unit j and output unit k:
net*k = ∑_{j=1..NH} yj vkj + vk0
Network Training
1. Initialize weights wji and vkj randomly, but not to 0
2. Iterate until a stopping criterion is reached:
   choose an input sample xp and pass it through the MNN with the current weights wji and vkj to get outputs z1, …, zm
Training error: J(w, v) = (1/2) ∑_{i=1..n} ∑_{c=1..m} (tc(i) − zc(i))²
Use gradient descent: start from random v(0), w(0) and repeat until convergence:
w(t+1) = w(t) − η ∇w J(w(t))
v(t+1) = v(t) − η ∇v J(v(t))
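A minimal sketch of this gradient-descent loop, using a toy quadratic objective so the example is self-contained; in the actual network the gradients ∇w J and ∇v J come from back-propagation:

```python
import numpy as np

def grad_J(w, w_star=np.array([1.0, -2.0])):
    # Gradient of the toy objective J(w) = 0.5 * ||w - w_star||^2
    return w - w_star

eta = 0.1                     # learning rate
w = np.random.randn(2)        # random (non-zero) initialization
for t in range(200):          # iterate until a stopping criterion is met
    w = w - eta * grad_J(w)   # w(t+1) = w(t) - eta * grad J(w(t))
print(w)                      # approaches the minimizer w_star
```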
BackPropagation: Layered Model
activation at hidden unit j:   net_j = ∑_{i=1..d} x(i) wji + wj0
output at hidden unit j:       yj = f(net_j)
activation at output unit k:   net*k = ∑_{j=1..NH} yj vkj + vk0
output at output unit k:       zk = f(net*k)
objective function:            J(w, v) = (1/2) ∑_{c=1..m} (tc − zc)²
Apply the chain rule to compute ∂J/∂vkj and ∂J/∂wji.
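A NumPy sketch of this feed-forward pass (illustrative only; tanh stands in for the unspecified nonlinearity f):

```python
import numpy as np

def forward(x, W, w0, V, v0, f=np.tanh):
    net_j = W @ x + w0      # net_j = sum_i x(i) * wji + wj0
    y = f(net_j)            # yj = f(net_j)
    net_k = V @ y + v0      # net*k = sum_j yj * vkj + vk0
    z = f(net_k)            # zk = f(net*k)
    return z

d, NH, m = 4, 3, 2                               # inputs, hidden units, outputs
W, w0 = np.random.randn(NH, d), np.zeros(NH)     # input-to-hidden weights
V, v0 = np.random.randn(m, NH), np.zeros(m)      # hidden-to-output weights
print(forward(np.random.randn(d), W, w0, V, v0))
```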
BackPropagation of Errors
∂J/∂vkj = −(tk − zk) f′(net*k) yj
∂J/∂wji = −f′(net_j) x(i) ∑_{k=1..m} (tk − zk) f′(net*k) vkj
The output error (tk − zk) is propagated back from output unit k to hidden unit j and input unit i.
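The same derivatives written as code for a single training sample (illustrative sketch; tanh stands in for f, so f′(a) = 1 − f(a)²):

```python
import numpy as np

def backprop(x, t, W, w0, V, v0):
    # Forward pass
    y = np.tanh(W @ x + w0)              # hidden outputs yj
    z = np.tanh(V @ y + v0)              # network outputs zk

    # dJ/dvkj = -(tk - zk) * f'(net*k) * yj
    delta_k = (t - z) * (1.0 - z ** 2)
    grad_V = -np.outer(delta_k, y)

    # dJ/dwji = -f'(net_j) * x(i) * sum_k (tk - zk) * f'(net*k) * vkj
    delta_j = (1.0 - y ** 2) * (V.T @ delta_k)
    grad_W = -np.outer(delta_j, x)
    return grad_W, grad_V
```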
[Plot: classification error vs. training time]
This is a good time to stop training, since beyond this point we start to overfit.
The stopping criterion is part of the training phase, so the validation data is effectively part of the training data.
To assess how the network will perform on unseen examples, we still need separate test data.
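An illustrative sketch of this stopping rule; the validation-error curve here is synthetic, purely to show the bookkeeping:

```python
import numpy as np

val_error = lambda t: (t - 30) ** 2 / 1000.0 + 0.1   # falls, then rises (synthetic)
best_err, best_epoch = np.inf, 0
for epoch in range(100):
    err = val_error(epoch)                 # error on held-out validation data
    if err < best_err:
        best_err, best_epoch = err, epoch  # remember the best weights seen so far
    elif epoch - best_epoch >= 5:          # no improvement for 5 epochs: stop
        break
print("stopped at epoch", epoch, "best at epoch", best_epoch)
```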
Momentum
Gradient descent finds only a local minimum.
This is not a problem if J(w) is small at the local minimum; indeed, we do not wish to find w such that J(w) = 0, due to overfitting.
It is a problem if J(w) is large at the local minimum.
[Plots of J(w): a reasonable local minimum close to the global minimum vs. a bad local minimum far above the global minimum]
Momentum
Momentum is a popular method to avoid local minima; it also speeds up descent in plateau regions.
The weight update at time t is ∆w(t) = w(t) − w(t−1).
Add a temporal average of the direction in which the weights have been moving recently:
w(t+1) = w(t) − (1 − α) η ∂J/∂w + α ∆w(t−1)
(the first term is the steepest-descent direction, the second the previous direction)
At α = 0, this is equivalent to gradient descent.
At α = 1, the gradient is ignored and the weight update continues in the direction in which it was moving previously (momentum).
Usually α is around 0.9.
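A sketch of this momentum update on a toy gradient (illustrative only):

```python
import numpy as np

def momentum_step(w, grad, delta_w_prev, eta=0.1, alpha=0.9):
    # delta_w = -(1 - alpha) * eta * dJ/dw  +  alpha * (previous delta_w)
    delta_w = -(1.0 - alpha) * eta * grad + alpha * delta_w_prev
    return w + delta_w, delta_w

w, delta_w = np.zeros(3), np.zeros(3)
for _ in range(100):
    grad = w - np.array([1.0, 2.0, 3.0])          # toy quadratic gradient
    w, delta_w = momentum_step(w, grad, delta_w)  # alpha=0 -> plain gradient descent
print(w)                                          # moves toward [1, 2, 3]
```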
1D Convolution
Neural 1D Convolution Implementation
2D Convolution Matrix
reference : http://en.wikipedia.org/wiki/Kernel_(image_processing)
Convolutional Filter
[Figure: a convolutional filter slides over the input to produce a feature map]
reference : http://cs.nyu.edu/~fergus/tutorials/deep_learning_cvpr12/fergus_dl_tutorial_final.pptx
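An illustrative NumPy sketch of how a single filter produces a feature map from a 2D input, implemented as the sliding dot product used by convolutional layers:

```python
import numpy as np

def conv2d_valid(image, kernel):
    # Slide the kernel over the image; each output value is the dot
    # product of the kernel with the patch under it ("valid" positions).
    H, W = image.shape
    kh, kw = kernel.shape
    out = np.zeros((H - kh + 1, W - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

image = np.arange(25, dtype=float).reshape(5, 5)
kernel = np.array([[1.0, 0.0, -1.0]] * 3)   # simple 3x3 vertical-edge filter
print(conv2d_valid(image, kernel))          # 3x3 feature map
```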
Architecture
•Trained with stochastic gradient descent on two NVIDIA GPUs for about a
week (5~6 days)
•650,000 neurons, 60 million parameters, 630 million connections
•The last layer contains 1,000 neurons, which produce a distribution over the 1,000 class labels.
Architecture
Response-Normalization Layer
a_{x,y}^i : the activity of a neuron computed by applying kernel i at position (x, y)
The response-normalized activity b_{x,y}^i is given by
b_{x,y}^i = a_{x,y}^i / ( k + α ∑_{j=max(0, i−n/2)}^{min(N−1, i+n/2)} (a_{x,y}^j)² )^β
where the sum runs over n adjacent kernel maps and N is the total number of kernels in the layer.
reference : http://cs.nyu.edu/~fergus/tutorials/deep_learning_cvpr12/fergus_dl_tutorial_final.pptx
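An illustrative NumPy sketch of this normalization across adjacent kernel maps, using the hyper-parameter values reported in the paper (k = 2, n = 5, α = 10⁻⁴, β = 0.75):

```python
import numpy as np

def local_response_norm(a, k=2.0, n=5, alpha=1e-4, beta=0.75):
    # a[i, x, y]: activity of kernel i at position (x, y); shape (N, H, W).
    # b[i] = a[i] / (k + alpha * sum_j a[j]^2)^beta, where j runs over the
    # n kernel maps adjacent to i (clamped to [0, N-1]).
    N = a.shape[0]
    b = np.empty_like(a)
    for i in range(N):
        lo, hi = max(0, i - n // 2), min(N - 1, i + n // 2)
        denom = (k + alpha * np.sum(a[lo:hi + 1] ** 2, axis=0)) ** beta
        b[i] = a[i] / denom
    return b

print(local_response_norm(np.random.randn(8, 4, 4)).shape)   # (8, 4, 4)
```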
Architecture
First Layer Visualization
ReLU: f(x) = max(0, x)
Learning rule
Use stochastic gradient descent with a batch size of 128 examples, momentum of 0.9, and weight decay of 0.0005.
The update rule for weight w was
v(i+1) = 0.9 v(i) − 0.0005 ε w(i) − ε ⟨∂L/∂w⟩_{Di}  (gradient evaluated at w(i), averaged over batch Di)
w(i+1) = w(i) + v(i+1)
where v is the momentum variable and ε the learning rate.
reference : http://www.image-net.org/challenges/LSVRC/2012/supervision.pdf
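A minimal sketch of that update as code (momentum 0.9, weight decay 0.0005, initial learning rate 0.01; `grad` is the gradient averaged over the batch):

```python
import numpy as np

def sgd_step(w, v, grad, lr=0.01, momentum=0.9, weight_decay=0.0005):
    # v(i+1) = 0.9 * v(i) - 0.0005 * lr * w(i) - lr * grad
    # w(i+1) = w(i) + v(i+1)
    v = momentum * v - weight_decay * lr * w - lr * grad
    return w + v, v

w, v = np.random.randn(5), np.zeros(5)
w, v = sgd_step(w, v, grad=np.random.randn(5))
```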
Results - Classification
ILSVRC-2010 test set