
Machine Learning with

Convolutional Neural Networks

UZ

November 29, 2017


Overview

Supervised Learning

Single-layer Neural Network

Multi-layer Neural Network

Neural Network and Convolutional Neural Network


Machine Learning - Overview

[Diagram: Training Data {input, correct output} feeds the input to a Predictive Model, which produces an output; the error is the difference between the correct output and the model output]

- Machine Learning: adjust the parameters of the Predictive Model with Training Data and iteratively reduce the error using a Learning Rule
- Tasks: prepare Training Data and develop a Predictive Model with a Learning Rule
- Topics
  1. Basics, Single-layer Neural Network, LMS Rule, MNIST
  2. Multi-layer Neural Network, Learning Rules, MNIST
  3. Convolutional Neural Network, Learning Rules, MNIST
Machine Learning - Applications 1

[Diagram: a CNN maps an input image to a task-specific output: Denoise, Super-resolve, Segment, Detect, and Classify (e.g. Patrol Boat 99%, Boat 1%)]

Machine Learning - Applications 2

[Images of application examples]
Supervised Learning with Training Data

Training and Testing

[Diagram: during training, the input is fed to the model and the output is compared against the correct output to form the error; in the application phase, only input → model → output remains]
Supervised Learning with Neural Network

[Diagram: Training Data {input, correct output} feeds the input to the Neural Network, output = f(input); the error is the difference between the correct output and the network output]

Definitions

x, d    input and correct output (desired, target, label) vector
y, e    model output and error vector

Output (elementwise and in matrix form) and error:

$$y = f(x), \qquad Y = f(X), \qquad e = d - y$$
Supervised Learning with Neural Network

Learning procedure
1. Initialize the neural network with adequate weight values
2. Take an input/correct-output pair from the training data, feed the input to the neural network, obtain the output, and calculate the error
3. Adjust the weights to reduce the error with the Gradient Descent Method

$$w_{ji}(n+1) = w_{ji}(n) - \eta \frac{\partial E}{\partial w_{ji}} = w_{ji}(n) + \Delta w_{ji}(n) \tag{1}$$

4. Repeat Steps 2-3 for all training data
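As a minimal illustration of eq. (1), the sketch below (Python, with an assumed toy error function E(w) = (w − 3)² and an assumed learning rate, neither taken from the slides) iterates the update until w approaches the minimizer:

```python
# Gradient Descent on an assumed toy error function E(w) = (w - 3)^2.
# Its derivative dE/dw = 2(w - 3) plays the role of dE/dw_ji in eq. (1).

def dE_dw(w):
    return 2.0 * (w - 3.0)

w = 0.0          # initial weight
eta = 0.1        # learning rate (assumed value)
for n in range(50):
    w = w - eta * dE_dw(w)   # eq. (1): w(n+1) = w(n) - eta * dE/dw
print(w)         # approaches 3, the minimum of E
```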
Single Layer Neural Network

[Diagram: input nodes x_i connect through weights w_ji and bias b to a neuron node y_j; left: scalar output y = f(w^T x + b), right: vector output y = f(Wx + b)]

x, y, d    input, output and desired (target) vector
W, f(·)    weight matrix and activation function

$$y = f(\mathbf{w}^T \mathbf{x} + b) = f\Big(\sum_i w_i \cdot x_i + b\Big) \tag{2}$$

$$\mathbf{y} = f(\mathbf{W}\mathbf{x} + \mathbf{b}) \tag{3}$$

$$e_j = d_j - y_j \tag{4}$$

$$w_{ji}(n+1) = w_{ji}(n) + \eta\, e_j\, x_i \qquad \text{(LMS algorithm)} \tag{5}$$
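A minimal single-layer training sketch in Python/NumPy; the sigmoid activation, toy data, and learning rate are assumptions for illustration, and the update line is the LMS rule of eq. (5):

```python
import numpy as np

def sigmoid(v):
    return 1.0 / (1.0 + np.exp(-v))

# Assumed toy data: four 3-dimensional inputs, one output node
X = np.array([[0., 0., 1.], [0., 1., 1.], [1., 0., 1.], [1., 1., 1.]])
D = np.array([[0.], [0.], [1.], [1.]])

rng = np.random.default_rng(0)
W = rng.uniform(-1.0, 1.0, size=(1, 3))   # weight matrix of eq. (3)
b = np.zeros(1)
eta = 0.9                                 # learning rate (assumed)

for epoch in range(1000):
    for x, d in zip(X, D):
        y = sigmoid(W @ x + b)            # eq. (3): y = f(Wx + b)
        e = d - y                         # eq. (4): e_j = d_j - y_j
        W += eta * np.outer(e, x)         # eq. (5): w_ji += eta * e_j * x_i
        b += eta * e
```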
Activation functions

- Linear function: $f(x) = x$
- Sigmoid function: $f(x) = \dfrac{1}{1 + e^{-x}}$
- ReLU: $f(x) = \max(0, x)$

[Plot: f(x) versus x for the Linear, Sigmoid and ReLU activation functions]
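For later use in the update rules, each activation can be paired with its derivative; a short NumPy sketch, written to apply elementwise to arrays:

```python
import numpy as np

def linear(x):
    return x

def linear_prime(x):
    return np.ones_like(x)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_prime(x):
    s = sigmoid(x)
    return s * (1.0 - s)        # f'(x) = f(x) * (1 - f(x)), used in eq. (20)

def relu(x):
    return np.maximum(0.0, x)

def relu_prime(x):
    return (x > 0).astype(float)
```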
Multi Layer Neural Network

[Diagram: input layer (I nodes, x_i) → hidden layer (J nodes, y_j) via weights w_ji; hidden layer → output layer (K nodes, y_k) via weights w_kj]

Forward propagation

$$a_j = \sum_{i=1}^{I} w_{ji} x_i + b_j \tag{6}$$

$$y_j = f_j(a_j), \quad j = 1, \ldots, J \tag{7}$$

$$a_k = \sum_{j=1}^{J} w_{kj} y_j + b_k \tag{8}$$

$$y_k = f_k(a_k), \quad k = 1, \ldots, K \tag{9}$$

$$e_k = d_k - y_k, \quad k = 1, \ldots, K \tag{10}$$
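Eqs. (6)-(10) translate directly into two matrix-vector products; in the sketch below the layer sizes, the data, and the choice of sigmoid for both layers are assumptions:

```python
import numpy as np

def sigmoid(v):
    return 1.0 / (1.0 + np.exp(-v))

I_n, J_n, K_n = 4, 5, 3                      # I, J, K node counts (assumed)
rng = np.random.default_rng(0)
W1 = rng.normal(size=(J_n, I_n)); b1 = np.zeros(J_n)   # w_ji, b_j
W2 = rng.normal(size=(K_n, J_n)); b2 = np.zeros(K_n)   # w_kj, b_k

x = rng.normal(size=I_n)                     # input vector (assumed)
d = np.array([1.0, 0.0, 0.0])                # desired output (assumed)

a_j = W1 @ x + b1                            # eq. (6)
y_j = sigmoid(a_j)                           # eq. (7)
a_k = W2 @ y_j + b2                          # eq. (8)
y_k = sigmoid(a_k)                           # eq. (9)
e_k = d - y_k                                # eq. (10)
```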
Generalized Delta Rule - 1

Error

$$E = \frac{1}{2}\sum_{k=1}^{K} e_k^2 = \frac{1}{2}\sum_{k=1}^{K} (d_k - y_k)^2 \tag{11}$$

Partial derivative of the error with respect to the output layer weights

$$\frac{\partial E}{\partial w_{kj}} = -(d_k - y_k) \cdot \frac{\partial f_k(a_k)}{\partial a_k} \cdot \frac{\partial a_k}{\partial w_{kj}} \tag{12}$$

$$\frac{\partial a_k}{\partial w_{kj}} = \frac{\partial}{\partial w_{kj}}\Big(\sum_{j=1}^{J} w_{kj} y_j + b_k\Big) = y_j \tag{13}$$

$$\frac{\partial E}{\partial w_{kj}} = -(d_k - y_k) \cdot f_k'(a_k) \cdot y_j \tag{14}$$
Generalized Delta Rule - 2

General update rule

$$w_{kj}(n+1) = w_{kj}(n) + \eta \cdot (d_k - y_k) \cdot f_k'(a_k) \cdot y_j \tag{15}$$

Linear output function

$$f_k(a_k) = a_k \tag{16}$$

$$f_k'(a_k) = 1 \tag{17}$$

$$w_{kj}(n+1) = w_{kj}(n) + \eta \cdot (d_k - y_k) \cdot y_j \tag{18}$$

Sigmoid output function

$$f_k(a_k) = \frac{1}{1 + \exp(-a_k)} \tag{19}$$

$$f_k'(a_k) = f_k(a_k) \cdot (1 - f_k(a_k)) = y_k \cdot (1 - y_k) \tag{20}$$

$$w_{kj}(n+1) = w_{kj}(n) + \eta \cdot (d_k - y_k) \cdot y_k \cdot (1 - y_k) \cdot y_j \tag{21}$$
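For the sigmoid case, eq. (21) in NumPy; the activation values below are stand-ins for one forward pass, not data from the slides:

```python
import numpy as np

eta = 0.5                              # learning rate (assumed)
y_j = np.array([0.2, 0.7, 0.5])        # hidden activations (assumed)
y_k = np.array([0.9, 0.3])             # sigmoid outputs (assumed)
d_k = np.array([1.0, 0.0])             # desired outputs (assumed)

# delta_k = (d_k - y_k) * f'(a_k) with f'(a_k) = y_k * (1 - y_k), eq. (20)
delta_k = (d_k - y_k) * y_k * (1.0 - y_k)

# w_kj(n+1) = w_kj(n) + eta * delta_k * y_j, eq. (21)
W2 = np.zeros((2, 3))                  # placeholder weight matrix
W2 += eta * np.outer(delta_k, y_j)
```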
Generalized Delta Rule - 3

General update rule

$$w_{kj}(n+1) = w_{kj}(n) + \eta \cdot (d_k - y_k) \cdot f_k'(a_k) \cdot y_j \tag{22}$$

Delta function

$$\delta_k = (d_k - y_k) \cdot f_k'(a_k) \tag{23}$$

$$\phantom{\delta_k} = e_k \cdot f_k'(a_k) \tag{24}$$

[Diagram: input layer (I nodes, x_i) → hidden layer (J nodes: w_ji, δ_j, y_j, e_j) → output layer (K nodes: w_kj, δ_k, y_k, e_k)]

Weight-update equation

$$w_{kj}(n+1) = w_{kj}(n) + \eta \cdot \delta_k \cdot y_j \tag{25}$$

Hidden Layer Weight Update - 1

Error

$$E = \frac{1}{2}\sum_{k=1}^{K} e_k^2 = \frac{1}{2}\sum_{k=1}^{K} (d_k - y_k)^2 = \frac{1}{2}\sum_{k=1}^{K} \Big(d_k - f_k\Big(\sum_{j=1}^{J} w_{kj} y_j + b_k\Big)\Big)^2 \tag{26}$$

Partial derivative of the error with respect to the hidden layer weights

$$\frac{\partial E}{\partial w_{ji}} = -\sum_k (d_k - y_k)\, \frac{\partial f_k(a_k)}{\partial a_k}\, \frac{\partial a_k}{\partial y_j}\, \frac{\partial y_j}{\partial a_j}\, \frac{\partial a_j}{\partial w_{ji}}$$

$$= -\sum_k (d_k - y_k) \cdot f_k'(a_k) \cdot w_{kj} \cdot f_j'(a_j) \cdot x_i \tag{27}$$

$$= -f_j'(a_j) \cdot x_i \sum_k (d_k - y_k) \cdot f_k'(a_k) \cdot w_{kj} \tag{28}$$

$$= -f_j'(a_j) \cdot x_i \sum_k \delta_k \cdot w_{kj} \tag{29}$$
Hidden Layer Weight Update - 2

$$\Delta w_{ji} = \eta\, f_j'(a_j) \cdot x_i \sum_k \delta_k \cdot w_{kj} \tag{30}$$

$$\delta_j = f_j'(a_j) \sum_k \delta_k \cdot w_{kj} = f_j'(a_j) \cdot e_j \tag{31}$$

$$\Delta w_{ji} = \eta \cdot \delta_j \cdot x_i \tag{32}$$

[Diagram: input layer (I nodes, x_i) → hidden layer (J nodes: w_ji, δ_j, y_j, e_j) → output layer (K nodes: w_kj, δ_k, y_k, e_k)]

Weight-update equation

$$w_{ji}(n+1) = w_{ji}(n) + \eta \cdot \delta_j \cdot x_i \tag{33}$$
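Putting eqs. (25) and (33) together gives one complete backprop step for the two-layer network; a sketch assuming sigmoid activations in both layers and illustrative shapes and data:

```python
import numpy as np

def sigmoid(v):
    return 1.0 / (1.0 + np.exp(-v))

rng = np.random.default_rng(1)
W1 = rng.normal(size=(5, 4)); b1 = np.zeros(5)   # hidden weights w_ji
W2 = rng.normal(size=(3, 5)); b2 = np.zeros(3)   # output weights w_kj
x = rng.normal(size=4)                           # input (assumed)
d = np.array([0.0, 1.0, 0.0])                    # target (assumed)
eta = 0.5                                        # learning rate (assumed)

# Forward pass, eqs. (6)-(10)
y_j = sigmoid(W1 @ x + b1)
y_k = sigmoid(W2 @ y_j + b2)
e_k = d - y_k

# Deltas: output layer eq. (24), hidden layer eq. (31)
delta_k = e_k * y_k * (1.0 - y_k)
e_j = W2.T @ delta_k                 # error propagated back through w_kj
delta_j = e_j * y_j * (1.0 - y_j)

# Weight updates, eqs. (25) and (33)
W2 += eta * np.outer(delta_k, y_j); b2 += eta * delta_k
W1 += eta * np.outer(delta_j, x);   b1 += eta * delta_j
```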


Neural Network - Forward and Backprop

[Diagram: 2 input nodes fully connected to 3 output nodes]

Forward Propagation

$$\begin{pmatrix} o_1 \\ o_2 \\ o_3 \end{pmatrix} = \begin{pmatrix} w_{11} & w_{12} \\ w_{21} & w_{22} \\ w_{31} & w_{32} \end{pmatrix} \begin{pmatrix} i_1 \\ i_2 \end{pmatrix}, \qquad \mathbf{o} = \mathbf{W} \cdot \mathbf{i}$$

Back Propagation

$$\begin{pmatrix} i_1 \\ i_2 \end{pmatrix} = \begin{pmatrix} w_{11} & w_{21} & w_{31} \\ w_{12} & w_{22} & w_{32} \end{pmatrix} \begin{pmatrix} o_1 \\ o_2 \\ o_3 \end{pmatrix}, \qquad \mathbf{i} = \mathbf{W}^T \cdot \mathbf{o}$$
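The same picture in NumPy: the forward pass multiplies by W, and the backward pass reuses the transposed weights to carry an output-side quantity (in training, the error) back to the input side; the numeric values are assumptions:

```python
import numpy as np

W = np.array([[0.1, 0.2],
              [0.3, 0.4],
              [0.5, 0.6]])             # the 3x2 weight matrix above

i = np.array([1.0, -1.0])              # input (i1, i2), values assumed
o = W @ i                              # forward: o = W . i

e_out = np.array([0.5, -0.2, 0.1])     # output-side error (assumed)
e_in = W.T @ e_out                     # backward: W^T carries it back
```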
Neural Network - Generalization

[Diagram: input x → layer L−1 (W^{L−1}, δ^{L−1}, y^{L−1}, e^{L−1}) → layer L (W^L, δ^L, y^L, e^L)]

Forward

$$\mathbf{y}^{L-1} = f(\mathbf{W}^{L-1} \cdot \mathbf{x} + \mathbf{b}^{L-1}) \tag{34}$$

$$\mathbf{y}^{L} = f(\mathbf{W}^{L} \cdot \mathbf{y}^{L-1} + \mathbf{b}^{L}) \tag{35}$$

Backward

$$\boldsymbol{\delta}^{L} = f'(\mathbf{a}^{L}) \circ \mathbf{e}^{L} \tag{36}$$

$$\mathbf{e}^{L-1} = (\mathbf{W}^{L})^{T} \cdot \boldsymbol{\delta}^{L} \tag{37}$$

$$\boldsymbol{\delta}^{L-1} = f'(\mathbf{a}^{L-1}) \circ \mathbf{e}^{L-1} \tag{38}$$
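The layered recursion of eqs. (34)-(38) fits in one forward sweep and one backward sweep; a sketch where the layer sizes, data, and the sigmoid choice for f are assumptions, and the Hadamard product '∘' becomes NumPy's elementwise '*':

```python
import numpy as np

def f(v):
    return 1.0 / (1.0 + np.exp(-v))

def f_prime(v):
    s = f(v)
    return s * (1.0 - s)

rng = np.random.default_rng(0)
sizes = [4, 6, 5, 3]                               # input, hidden..., output
Ws = [rng.normal(size=(m, n)) for n, m in zip(sizes[:-1], sizes[1:])]
bs = [np.zeros(m) for m in sizes[1:]]

x = rng.normal(size=sizes[0])                      # input (assumed)
d = np.eye(sizes[-1])[0]                           # one-hot target (assumed)

# Forward sweep, eqs. (34)-(35)
ys, As = [x], []
for W, b in zip(Ws, bs):
    As.append(W @ ys[-1] + b)
    ys.append(f(As[-1]))

# Backward sweep, eqs. (36)-(38)
e = d - ys[-1]
deltas = []
for W, a in zip(reversed(Ws), reversed(As)):
    delta = f_prime(a) * e                         # eqs. (36)/(38)
    deltas.append(delta)
    e = W.T @ delta                                # eq. (37)
deltas.reverse()                                   # deltas[l] for layer l
```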
Training and Validation/Testing

[Diagram: the Training Data (Batch) feeds the Predictive Model; one pass over the complete set is an Epoch, which may be split into Minibatches; separate Validation Data evaluates the trained model]
Weight Update: Batch, SGD, Minibatch

Batch Gradient Descent (complete dataset or batch)

$$\Delta w_{kj} = \eta \cdot \frac{1}{M} \sum_{m \in \text{dataset}} y_k^{(m)} \cdot (1 - y_k^{(m)}) \cdot (d_k^{(m)} - y_k^{(m)}) \cdot y_j^{(m)} \tag{39}$$

Stochastic Gradient Descent (one single m)

$$\Delta w_{kj} = \eta \cdot y_k^{(m)} \cdot (1 - y_k^{(m)}) \cdot (d_k^{(m)} - y_k^{(m)}) \cdot y_j^{(m)} \tag{40}$$

Minibatch Gradient Descent (complete dataset split into smaller minibatches)

$$\Delta w_{kj} = \eta \cdot \frac{1}{M} \sum_{m \in \text{minibatch}} y_k^{(m)} \cdot (1 - y_k^{(m)}) \cdot (d_k^{(m)} - y_k^{(m)}) \cdot y_j^{(m)} \tag{41}$$
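The three variants differ only in how many samples contribute to one update, so a single loop parameterized by batch size covers them all; the one-layer sigmoid model and the data below are assumptions for illustration:

```python
import numpy as np

def sigmoid(v):
    return 1.0 / (1.0 + np.exp(-v))

rng = np.random.default_rng(0)
X = rng.normal(size=(32, 4))                    # dataset inputs (assumed)
D = (rng.random((32, 2)) > 0.5).astype(float)   # desired outputs (assumed)
W = rng.normal(size=(2, 4))
eta = 0.5                                       # learning rate (assumed)

def train_epoch(batch_size):
    global W
    idx = rng.permutation(len(X))
    for start in range(0, len(X), batch_size):
        batch = idx[start:start + batch_size]
        dW = np.zeros_like(W)
        for m in batch:                          # accumulate the sum in (39)/(41)
            y = sigmoid(W @ X[m])
            dW += np.outer((D[m] - y) * y * (1 - y), X[m])
        W += eta * dW / len(batch)               # average over the M samples

train_epoch(len(X))   # batch gradient descent, eq. (39)
train_epoch(1)        # stochastic gradient descent, eq. (40)
train_epoch(8)        # minibatch gradient descent, eq. (41)
```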
Neural Network and Convolutional Neural Network

[Diagram: the fully connected case repeats the matrix picture above (forward o = W · i, backward i = W^T · o), shown next to the convolutional case]

[Diagram: Forward Correlation slides the 2×2 kernel w over the 3×3 input; Backward Correlation uses rot180(w) or rot180(out) to distribute each output element back over the input positions]
Convolutional Neural Network

[Diagram: Forward Correlation of the 3×3 input (entries 1..9) with the 2×2 kernel (w11, w12, w21, w22); Backward Correlation with rot180(w) or rot180(out)]

$$\mathbf{Y} = \mathbf{X} * \text{rot180}(\mathbf{W})$$

$$\mathbf{X} = \mathbf{Y} * \mathbf{W} = \mathbf{W} * \mathbf{Y}$$
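A NumPy sketch of the two operations on the 3×3-input / 2×2-kernel example above; the error values are stand-ins, and the full convolution of the backward pass is realized as zero-padded correlation with the 180°-rotated kernel:

```python
import numpy as np

def correlate2d_valid(X, W):
    """Cross-correlation: slide W over X with no padding."""
    h, w = W.shape
    H, C = X.shape[0] - h + 1, X.shape[1] - w + 1
    Y = np.zeros((H, C))
    for r in range(H):
        for c in range(C):
            Y[r, c] = np.sum(X[r:r + h, c:c + w] * W)
    return Y

rot180 = lambda A: np.rot90(A, 2)

X = np.arange(1.0, 10.0).reshape(3, 3)     # the 3x3 input 1..9
W = np.array([[11., 12.],
              [21., 22.]])                 # the 2x2 kernel w11..w22

# Forward: correlation with W (equivalently, convolution with rot180(W))
Y = correlate2d_valid(X, W)                # 2x2 output

# Backward: full convolution of the output-side error with W, implemented
# as zero-padded correlation with rot180(W); this spreads each output
# error element back over the input positions that produced it.
dY = np.ones_like(Y)                       # stand-in error (assumed)
dX = correlate2d_valid(np.pad(dY, 1), rot180(W))   # back to 3x3
```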
