Anthony Hershberger
Outline
ImageNet Competition
Object Detection
Object Localization
Current Research
TensorFlow
TensorBoard
ImageNet Large Scale Visual Recognition Challenge 2018
Tasks of Image Recognition
ImageNet Dataset
Object Localization
- For each input photo, the algorithm outputs 5 class labels $c_i$, $i = 1, 2, \ldots, 5$, in decreasing order of confidence.
- The algorithm also outputs 5 bounding boxes $b_i$, $i = 1, 2, \ldots, 5$, one for each class label.
- The ground truth for the image consists of $n$ class labels $C_k$, $k = 1, 2, \ldots, n$.
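- For each ground-truth class label $C_k$, the ground-truth bounding boxes are $B_{km}$, $m = 1, 2, \ldots, M_k$, where $M_k$ is the number of instances of the $k$-th object in the image.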
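The error of the algorithm on an image is

$$e = \frac{1}{n} \sum_{k} \min_{i} \min_{m} \max\{\, d(c_i, C_k),\ f(b_i, B_{km}) \,\}$$

where $d(c_i, C_k) = 0$ if the algorithm predicts the same class as the ground-truth class label and 1 otherwise, and $f(b_i, B_{km}) = 0$ if the algorithm's bounding box overlaps the ground-truth bounding box by more than 50% and 1 otherwise.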
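The error definition above can be sketched in a few lines of plain Python. This is an illustrative sketch, not the official evaluation code: it assumes boxes are given as `(x1, y1, x2, y2)` tuples and that "overlap" is measured as intersection over union (IoU). The function names `iou` and `localization_error` and the example data are hypothetical.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def localization_error(pred_labels, pred_boxes, gt_labels, gt_boxes):
    """Per-image error e.

    pred_labels: predicted labels c_i (in decreasing order of confidence)
    pred_boxes:  predicted boxes b_i, one per predicted label
    gt_labels:   ground-truth labels C_k
    gt_boxes:    gt_boxes[k] is the list of ground-truth boxes B_km for C_k
    """
    total = 0
    for C_k, boxes_k in zip(gt_labels, gt_boxes):
        best = 1  # worst case: no prediction matches this ground-truth label
        for c_i, b_i in zip(pred_labels, pred_boxes):
            d = 0 if c_i == C_k else 1
            for B_km in boxes_k:
                f = 0 if iou(b_i, B_km) > 0.5 else 1  # assumed IoU threshold
                best = min(best, max(d, f))
        total += best
    return total / len(gt_labels)


# Made-up example: "dog" is found and well localized, "person" is missed.
pred_labels = ["dog", "cat", "car", "tree", "bird"]
pred_boxes = [(0, 0, 10, 10), (20, 20, 30, 30), (0, 0, 5, 5),
              (50, 50, 60, 60), (1, 1, 2, 2)]
gt_labels = ["dog", "person"]
gt_boxes = [[(1, 1, 10, 10)], [(40, 40, 50, 50)]]

error = localization_error(pred_labels, pred_boxes, gt_labels, gt_boxes)
print(error)
```

A ground-truth label contributes 0 only when some prediction gets both the class and the box right, which is why the inner term is a `max` of the two penalties.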
Object Detection Challenge
Current Research in Image Recognition
Convolutional Neural Networks
Layers of Convolutional Neural Networks
Spatial Arrangements
Depth, Stride, and Zero Padding
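As a minimal sketch of how stride and zero padding determine the spatial size of a convolutional layer's output, the standard relation (W - F + 2P) / S + 1 can be written as a small helper. The symbols W (input size), F (filter size), P (padding), S (stride) and the function name `conv_output_size` are introduced here for illustration only; the output depth is simply the number of filters in the layer.

```python
def conv_output_size(input_size, filter_size, stride=1, padding=0):
    """Spatial output size along one dimension: (W - F + 2P) / S + 1."""
    span = input_size - filter_size + 2 * padding
    if span < 0 or span % stride != 0:
        # The filter does not tile the padded input evenly with this stride.
        raise ValueError("filter, stride and padding do not fit the input")
    return span // stride + 1


# A 5x5 filter with padding 2 and stride 1 preserves a 32-pixel side.
same_size = conv_output_size(32, 5, stride=1, padding=2)

# An 11x11 filter with stride 4 and no padding shrinks a 227-pixel side.
strided_size = conv_output_size(227, 11, stride=4, padding=0)

print(same_size, strided_size)
```

Raising an error when the stride does not divide evenly is one possible design choice; some frameworks round down instead.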
Activation Functions
Backpropagation
$$\theta_{t+1} = \theta_t - \alpha \frac{dE(X, \theta_t)}{d\theta}$$
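The update rule above can be illustrated on a toy problem. This is a sketch under simplifying assumptions: a single scalar parameter, a made-up error function E(θ) = (θ - 3)² that ignores the data X, and a hand-written derivative rather than one obtained by backpropagation through a network. The name `gradient_descent` is hypothetical.

```python
def gradient_descent(grad, theta0, alpha=0.1, steps=100):
    """Repeatedly apply theta <- theta - alpha * dE/dtheta."""
    theta = theta0
    for _ in range(steps):
        theta = theta - alpha * grad(theta)
    return theta


# Toy error E(theta) = (theta - 3)^2, so dE/dtheta = 2 * (theta - 3).
def toy_grad(theta):
    return 2.0 * (theta - 3.0)


theta_min = gradient_descent(toy_grad, theta0=0.0, alpha=0.1, steps=100)
print(theta_min)  # approaches the minimizer theta = 3
```

The learning rate α controls the step size: here each step shrinks the distance to the minimum by a factor of 0.8, while a value above 1.0 would make this particular iteration diverge.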
Computational Complexity of CNNs
Tools for Neural Networks
TensorFlow, PyTorch
TensorFlow installed for CUDA
CNN Code Explanation
import tensorflow as tf
Backpropagation
TensorBoard
How do you set up TensorBoard?
MNIST Data Demo