
Deep Learning - Theory and Practice

IE 643
Lecture 6

September 1, 2020.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 1 / 76


Outline

1 Moving on from Perceptron

2 Multi Layer Perceptron


MLP-Data Perspective

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 2 / 76


Moving on from Perceptron

Perceptron - Caveat

Not suitable when linear separability assumption fails


Example: Classical XOR problem

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 3 / 76


Moving on from Perceptron

Perceptron - Caveat

Not suitable when linear separability assumption fails


Example: Classical XOR problem

Heavily criticized by M. Minsky and S. Papert in their book: Perceptrons,


MIT Press, 1969.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 4 / 76


Moving on from Perceptron

Perceptron - Caveat
Not suitable when linear separability assumption fails
Example: Classical XOR problem

x_1   x_2   y = x_1 ⊕ x_2
 0     0        -1
 0     1         1
 1     0         1
 1     1        -1

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 5 / 76


Moving on from Perceptron

Perceptron - Caveat

Not suitable when linear separability assumption fails


Example: Classical XOR problem

x_1   x_2   y = x_1 ⊕ x_2   ŷ = sign(w_1 x_1 + w_2 x_2 − θ)
 0     0        -1          sign(−θ)
 0     1         1          sign(w_2 − θ)
 1     0         1          sign(w_1 − θ)
 1     1        -1          sign(w_1 + w_2 − θ)

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 6 / 76


Moving on from Perceptron

Perceptron - Caveat
Not suitable when linear separability assumption fails
Example: Classical XOR problem

sign(−θ) = −1 =⇒ θ > 0
sign(w_2 − θ) = 1 =⇒ w_2 − θ ≥ 0
sign(w_1 − θ) = 1 =⇒ w_1 − θ ≥ 0
sign(w_1 + w_2 − θ) = −1 =⇒ −w_1 − w_2 + θ > 0

Note: This system is inconsistent. (Homework!)

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 7 / 76


Moving on from Perceptron

Perceptron - Caveat
Not suitable when linear separability assumption fails
Example: Classical XOR problem

sign(−θ) = −1 =⇒ θ > 0
sign(w_2 − θ) = 1 =⇒ w_2 − θ ≥ 0
sign(w_1 − θ) = 1 =⇒ w_1 − θ ≥ 0
sign(w_1 + w_2 − θ) = −1 =⇒ −w_1 − w_2 + θ > 0

Note: This system is inconsistent. (Homework!)


Recall: We verified this using the code for the linear separability check; a small brute-force sketch is given below.
P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 8 / 76
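As a small hedged sketch in Python (not the exact code used earlier in the course), one can brute-force many candidate (w_1, w_2, θ) triples and observe that none of them reproduces the XOR labels, consistent with the inconsistent system above:

    # Illustrative brute-force check (assumed setup, not the course's original code):
    # no perceptron weights (w1, w2, theta) reproduce the XOR labels.
    import numpy as np

    X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
    y = np.array([-1, 1, 1, -1])            # XOR labels as in the table above

    rng = np.random.default_rng(0)
    found = False
    for _ in range(100_000):
        w1, w2, theta = rng.uniform(-5, 5, size=3)
        y_hat = np.sign(X @ np.array([w1, w2]) - theta)
        y_hat[y_hat == 0] = 1               # take sign(0) = +1, as in the inequalities above
        if np.array_equal(y_hat, y):
            found = True
            break

    print("separating weights found:", found)   # expected: False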
Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 9 / 76


Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

Assume that the sample features x ∈ Rd .

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 10 / 76


Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

Assume that the sample features x ∈ R^d.

Idea: Use a transformation φ : R^d → R^q, where q ≫ d, to lift the
data samples x ∈ R^d into φ(x) ∈ R^q, hoping to see a separating
hyperplane in the transformed space.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 11 / 76


Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

Assume that the sample features x ∈ R^d.

Idea: Use a transformation φ : R^d → R^q, where q ≫ d, to lift the
data samples x ∈ R^d into φ(x) ∈ R^q, hoping to see a separating
hyperplane in the transformed space.

Forms the core idea behind kernel methods. (Will not be pursued in
this course!)

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 12 / 76
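As a hedged illustration of this lifting idea (the feature map and the hyperplane below are my own choices, not given on the slides), mapping (x_1, x_2) to (x_1, x_2, x_1 x_2) already makes the XOR data linearly separable in R^3:

    # Lifting XOR from R^2 to R^3 with phi(x1, x2) = (x1, x2, x1 * x2);
    # the map and the separating hyperplane below are illustrative choices.
    import numpy as np

    X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
    y = np.array([-1, 1, 1, -1])

    phi_X = np.column_stack([X, X[:, 0] * X[:, 1]])   # phi: R^2 -> R^3

    w, theta = np.array([1.0, 1.0, -2.0]), 0.5        # one separating hyperplane in R^3
    print(np.sign(phi_X @ w - theta))                 # [-1.  1.  1. -1.], matching y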


Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 13 / 76


Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

Idea: The separating surface need not be linear and can be assumed
to take some non-linear form.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 14 / 76


Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

Idea: The separating surface need not be linear and can be assumed
to take some non-linear form.

Hence for an input space X and output space Y, the learned map
h : X → Y can take some non-linear form.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 15 / 76


Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

Idea: The separating surface need not be linear and can be assumed
to take some non-linear form.

Hence for an input space X and output space Y, the learned map
h : X → Y can take some non-linear form.

Forms the idea behind multi-layer perceptrons!

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 16 / 76


Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 17 / 76


Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 18 / 76


Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

Some notations
n_k^ℓ denotes k-th neuron at layer ℓ.
a_k^ℓ denotes the activation of the neuron n_k^ℓ.
P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 19 / 76
Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

Activation at neuron n_1^1:

a_1^1 = max{p x_1 + q x_2 + b_1, 0}.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 20 / 76


Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

Activation at neuron n_2^1:

a_2^1 = max{r x_1 + s x_2 + b_2, 0}.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 21 / 76


Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

Activation at neuron n_1^2:

a_1^2 = sign(t a_1^1 + u a_2^1 + b_3).

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 22 / 76


Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

Activation at neuron n_1^2:

a_1^2 = sign(t a_1^1 + u a_2^1 + b_3).

Note: The activation a_1^2 is the output of the network, denoted by ŷ.
P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 23 / 76
Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

x_1   x_2   a_1^1                  a_2^1                  ŷ                                y
 0     0    max{b_1, 0}            max{b_2, 0}            sign(t a_1^1 + u a_2^1 + b_3)    -1
 0     1    max{q + b_1, 0}        max{s + b_2, 0}        sign(t a_1^1 + u a_2^1 + b_3)    +1
 1     0    max{p + b_1, 0}        max{r + b_2, 0}        sign(t a_1^1 + u a_2^1 + b_3)    +1
 1     1    max{p + q + b_1, 0}    max{r + s + b_2, 0}    sign(t a_1^1 + u a_2^1 + b_3)    -1

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 24 / 76
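One concrete weight setting that makes the table above reproduce the XOR labels is the well-known two-ReLU construction; the particular values below are an illustrative assumption and need not match the figure on the slide: p = q = 1, b_1 = 0, r = s = 1, b_2 = −1, t = 1, u = −2, b_3 = −0.5. A minimal sketch:

    # Minimal sketch of the two-ReLU + sign network above, with one weight choice
    # (assumed for illustration) that realizes XOR.
    import numpy as np

    def mlp_xor(x1, x2, p=1, q=1, b1=0, r=1, s=1, b2=-1, t=1, u=-2, b3=-0.5):
        a11 = max(p * x1 + q * x2 + b1, 0)           # a_1^1, ReLU neuron n_1^1
        a21 = max(r * x1 + s * x2 + b2, 0)           # a_2^1, ReLU neuron n_2^1
        return int(np.sign(t * a11 + u * a21 + b3))  # output neuron n_1^2

    for x1, x2 in [(0, 0), (0, 1), (1, 0), (1, 1)]:
        print((x1, x2), mlp_xor(x1, x2))             # -1, +1, +1, -1 as required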


Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 25 / 76


Moving on from Perceptron

Moving away from perceptron - Dealing with XOR problem

A different Multi Layer Perceptron (MLP) architecture is given for the XOR
problem in:
David E. Rumelhart, Geoffrey E. Hinton and Ronald J. Williams.
Learning Internal Representations by Error Propagation,
Technical Report, UCSD, 1985.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 26 / 76


Multi Layer Perceptron

Multi Layer Perceptron

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 27 / 76


Multi Layer Perceptron

Multi Layer Perceptron

Notable features:

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 28 / 76


Multi Layer Perceptron

Multi Layer Perceptron

Notable features:
Multiple layers stacked together.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 29 / 76


Multi Layer Perceptron

Multi Layer Perceptron

Notable features:
Multiple layers stacked together.
Zero-th layer usually called input layer.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 30 / 76


Multi Layer Perceptron

Multi Layer Perceptron

Notable features:
Multiple layers stacked together.
Zero-th layer usually called input layer.
Final layer usually called output layer.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 31 / 76


Multi Layer Perceptron

Multi Layer Perceptron

Notable features:
Multiple layers stacked together.
Zero-th layer usually called input layer.
Final layer usually called output layer.
Intermediate layers are called hidden layers.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 32 / 76


Multi Layer Perceptron

Multi Layer Perceptron

Notable features:
Each neuron in the hidden and output layer is like a perceptron.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 33 / 76


Multi Layer Perceptron

Multi Layer Perceptron

Notable features:
Each neuron in the hidden and output layer is like a perceptron.
However, unlike perceptron, different activation functions are used.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 34 / 76


Multi Layer Perceptron

Multi Layer Perceptron

Notable features:
Each neuron in the hidden and output layer is like a perceptron.
However, unlike perceptron, different activation functions are used.
max{x, 0} has a special name called ReLU (Rectified Linear Unit).

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 35 / 76
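For concreteness, a small sketch of the activations mentioned so far, sign (the perceptron) and ReLU max{x, 0}; the sigmoid is included only as a further common example and is not introduced on this slide:

    # Sketch of activation functions: sign (perceptron), ReLU (this slide),
    # and sigmoid (added here only as another common example).
    import numpy as np

    def sign(z):
        return np.where(z >= 0, 1.0, -1.0)   # sign(0) taken as +1

    def relu(z):
        return np.maximum(z, 0.0)            # max{z, 0}

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    z = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
    print(sign(z), relu(z), sigmoid(z), sep="\n")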


Multi Layer Perceptron

Multi Layer Perceptron

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 36 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 37 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

This MLP contains an input layer L_0, 2 hidden layers denoted by
L_1, L_2, and output layer L_3.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 38 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

Recall:
n_k^ℓ denotes k-th neuron at ℓ-th layer.
a_k^ℓ denotes activation of neuron n_k^ℓ.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 39 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

w_{ij}^ℓ denotes the weight of the connection from neuron n_j^{ℓ−1} to neuron n_i^ℓ.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 40 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

w_{ij}^ℓ denotes the weight of the connection from neuron n_j^{ℓ−1} to neuron n_i^ℓ.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 41 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

w_{ij}^ℓ denotes the weight of the connection from neuron n_j^{ℓ−1} to neuron n_i^ℓ.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 42 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

In this particular case, the inputs are x_1 and x_2 at input layer L_0.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 43 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

At layer L_1:
  At neuron n_1^1:
    a_1^1 = φ(w_{11}^1 x_1 + w_{12}^1 x_2).

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 44 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

At layer L_1:
  At neuron n_1^1:
    a_1^1 = φ(w_{11}^1 x_1 + w_{12}^1 x_2) =: φ(z_1^1).

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 45 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

At layer L_1:
  At neuron n_2^1:
    a_2^1 = φ(w_{21}^1 x_1 + w_{22}^1 x_2).

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 46 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

At layer L_1:
  At neuron n_2^1:
    a_2^1 = φ(w_{21}^1 x_1 + w_{22}^1 x_2) =: φ(z_2^1).

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 47 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

At layer L_1:

  [a_1^1]   [φ(z_1^1)]   [φ(w_{11}^1 x_1 + w_{12}^1 x_2)]
  [a_2^1] = [φ(z_2^1)] = [φ(w_{21}^1 x_1 + w_{22}^1 x_2)]

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 48 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

Letting

  W^1 = [w_{11}^1  w_{12}^1]   and   x = [x_1]
        [w_{21}^1  w_{22}^1]             [x_2] ,

we have at layer L_1:

  [a_1^1]     ([z_1^1])     ([w_{11}^1 x_1 + w_{12}^1 x_2])
  [a_2^1] = φ ([z_2^1]) = φ ([w_{21}^1 x_1 + w_{22}^1 x_2]) = φ(W^1 x)

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 49 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

Letting a^1 = [a_1^1; a_2^1], we have at layer L_1:

  a^1 = [a_1^1] = φ(W^1 x)
        [a_2^1]

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 50 / 76
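A quick numerical check (with illustrative weight values and ReLU as an assumed choice of φ) that the vectorized form a^1 = φ(W^1 x) agrees with the per-neuron formulas above:

    # Check that a^1 = phi(W^1 x) matches a_k^1 = phi(w_{k1}^1 x_1 + w_{k2}^1 x_2);
    # the weights and the ReLU choice of phi are illustrative assumptions.
    import numpy as np

    phi = lambda z: np.maximum(z, 0)

    W1 = np.array([[0.5, -1.0],               # row k holds (w_{k1}^1, w_{k2}^1)
                   [2.0,  0.3]])
    x = np.array([1.0, 2.0])

    a1_vectorized = phi(W1 @ x)
    a1_per_neuron = np.array([phi(W1[0, 0] * x[0] + W1[0, 1] * x[1]),
                              phi(W1[1, 0] * x[0] + W1[1, 1] * x[1])])
    print(np.allclose(a1_vectorized, a1_per_neuron))   # True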


Multi Layer Perceptron

Multi Layer Perceptron - More notations

At layer L_2:
  At neuron n_1^2:
    a_1^2 = φ(w_{11}^2 a_1^1 + w_{12}^2 a_2^1).

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 51 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

At layer L_2:
  At neuron n_1^2:
    a_1^2 = φ(w_{11}^2 a_1^1 + w_{12}^2 a_2^1) =: φ(z_1^2).

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 52 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

At layer L_2:
  At neuron n_2^2:
    a_2^2 = φ(w_{21}^2 a_1^1 + w_{22}^2 a_2^1).

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 53 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

At layer L_2:
  At neuron n_2^2:
    a_2^2 = φ(w_{21}^2 a_1^1 + w_{22}^2 a_2^1) =: φ(z_2^2).

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 54 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

At layer L_2:

  a^2 = [a_1^2]   [φ(z_1^2)]   [φ(w_{11}^2 a_1^1 + w_{12}^2 a_2^1)]
        [a_2^2] = [φ(z_2^2)] = [φ(w_{21}^2 a_1^1 + w_{22}^2 a_2^1)]

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 55 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

Letting

  W^2 = [w_{11}^2  w_{12}^2]
        [w_{21}^2  w_{22}^2] ,

we have at layer L_2:

  a^2 = [a_1^2]     ([z_1^2])     ([w_{11}^2 a_1^1 + w_{12}^2 a_2^1])       ([a_1^1])
        [a_2^2] = φ ([z_2^2]) = φ ([w_{21}^2 a_1^1 + w_{22}^2 a_2^1]) = φ (W^2 [a_2^1])

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 56 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

We have at layer L_2:

  a^2 = [a_1^2]     ([z_1^2])       ([a_1^1])
        [a_2^2] = φ ([z_2^2]) = φ (W^2 [a_2^1]) = φ(W^2 a^1)

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 57 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

At layer L_3:
  At neuron n_1^3:
    a_1^3 = φ(w_{11}^3 a_1^2 + w_{12}^3 a_2^2).

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 58 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

At layer L_3:
  At neuron n_1^3:
    a_1^3 = φ(w_{11}^3 a_1^2 + w_{12}^3 a_2^2) =: φ(z_1^3).

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 59 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

At layer L_3:

  a^3 = a_1^3 = φ(z_1^3) = φ(w_{11}^3 a_1^2 + w_{12}^3 a_2^2)

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 60 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

Letting W^3 = [w_{11}^3  w_{12}^3], we have at layer L_3:

  a^3 = a_1^3 = φ(z_1^3) = φ(w_{11}^3 a_1^2 + w_{12}^3 a_2^2) = φ(W^3 [a_1^2; a_2^2])

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 61 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

Letting W^3 = [w_{11}^3  w_{12}^3], we have at layer L_3:

  a^3 = a_1^3 = φ(z_1^3) = φ(W^3 [a_1^2; a_2^2]) = φ(W^3 a^2)

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 62 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

a^3 = φ(W^3 a^2)

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 63 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

a^3 = φ(W^3 a^2) = φ(W^3 φ(W^2 a^1))

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 64 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

a^3 = φ(W^3 a^2) = φ(W^3 φ(W^2 a^1)) = φ(W^3 φ(W^2 φ(W^1 x)))

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 65 / 76


Multi Layer Perceptron

Multi Layer Perceptron - More notations

ŷ = a^3 = φ(W^3 a^2) = φ(W^3 φ(W^2 a^1)) = φ(W^3 φ(W^2 φ(W^1 x)))

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 66 / 76
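The composition above translates directly into a forward pass. The sketch below assumes the 2-2-2-1 architecture of the running example, random weights, and ReLU for φ (the slides leave φ generic):

    # Forward pass y_hat = phi(W^3 phi(W^2 phi(W^1 x))) for the 2-2-2-1 example;
    # the weights are random and phi = ReLU is an assumed choice.
    import numpy as np

    def phi(z):
        return np.maximum(z, 0)

    rng = np.random.default_rng(0)
    W1 = rng.standard_normal((2, 2))   # layer L1 weights
    W2 = rng.standard_normal((2, 2))   # layer L2 weights
    W3 = rng.standard_normal((1, 2))   # layer L3 (output) weights

    def mlp(x):
        a1 = phi(W1 @ x)               # a^1 = phi(W^1 x)
        a2 = phi(W2 @ a1)              # a^2 = phi(W^2 a^1)
        a3 = phi(W3 @ a2)              # a^3 = phi(W^3 a^2) = y_hat
        return a3

    print(mlp(np.array([1.0, 0.0])))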


Multi Layer Perceptron MLP-Data Perspective

Multi Layer Perceptron - Data Perspective

Given data (x, y), the multi layer perceptron predicts:

ŷ = φ(W^3 φ(W^2 φ(W^1 x)))

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 67 / 76


Multi Layer Perceptron MLP-Data Perspective

Multi Layer Perceptron - Data Perspective

Given data (x, y), the multi layer perceptron predicts:

ŷ = φ(W^3 φ(W^2 φ(W^1 x))) =: MLP(x)

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 68 / 76


Multi Layer Perceptron MLP-Data Perspective

Multi Layer Perceptron - Data Perspective

Given data (x, y), the multi layer perceptron predicts:

ŷ = φ(W^3 φ(W^2 φ(W^1 x))) =: MLP(x)

Similar to the perceptron, if y ≠ ŷ, an error E(y, ŷ) is incurred.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 69 / 76


Multi Layer Perceptron MLP-Data Perspective

Multi Layer Perceptron - Data Perspective

Given data (x, y), the multi layer perceptron predicts:

ŷ = φ(W^3 φ(W^2 φ(W^1 x))) =: MLP(x)

Similar to the perceptron, if y ≠ ŷ, an error E(y, ŷ) is incurred.

Aim: To change the weights W^1, W^2, W^3 such that the error E(y, ŷ) is
minimized.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 70 / 76


Multi Layer Perceptron MLP-Data Perspective

Multi Layer Perceptron - Data Perspective

Given data (x, y), the multi layer perceptron predicts:

ŷ = φ(W^3 φ(W^2 φ(W^1 x))) =: MLP(x)

Similar to the perceptron, if y ≠ ŷ, an error E(y, ŷ) is incurred.

Aim: To change the weights W^1, W^2, W^3 such that the error E(y, ŷ) is
minimized.
Leads to an error minimization problem.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 71 / 76


Multi Layer Perceptron MLP-Data Perspective

Multi Layer Perceptron - Data Perspective

Input: Training Data D = {(x^s, y^s)}_{s=1}^S.

For each sample x^s the prediction ŷ^s = MLP(x^s).
Error: e^s = E(y^s, ŷ^s).
Aim: To minimize Σ_{s=1}^S e^s.

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 72 / 76


Multi Layer Perceptron MLP-Data Perspective

Multi Layer Perceptron - Data Perspective

Optimization perspective
Given training data D = {(x^s, y^s)}_{s=1}^S,

  min Σ_{s=1}^S e^s

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 73 / 76


Multi Layer Perceptron MLP-Data Perspective

Multi Layer Perceptron - Data Perspective

Optimization perspective
Given training data D = {(x^s, y^s)}_{s=1}^S,

  min Σ_{s=1}^S e^s = Σ_{s=1}^S E(y^s, ŷ^s)

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 74 / 76


Multi Layer Perceptron MLP-Data Perspective

Multi Layer Perceptron - Data Perspective

Optimization perspective
Given training data D = {(x^s, y^s)}_{s=1}^S,

  min Σ_{s=1}^S e^s = Σ_{s=1}^S E(y^s, ŷ^s) = Σ_{s=1}^S E(y^s, MLP(x^s))

P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 75 / 76


Multi Layer Perceptron MLP-Data Perspective

Multi Layer Perceptron - Data Perspective

Optimization perspective
Given training data D = {(x^s, y^s)}_{s=1}^S,

  min Σ_{s=1}^S e^s = Σ_{s=1}^S E(y^s, ŷ^s) = Σ_{s=1}^S E(y^s, MLP(x^s))

Note: The minimization is over the weights W^1, . . . , W^L of the MLP,
where L denotes the number of layers in the MLP.
P. Balamurugan Deep Learning - Theory and Practice September 1, 2020. 76 / 76
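The objective on this last slide can be written out as a short sketch; the squared-error choice of E and the random weights below are illustrative assumptions, since the slides do not yet fix a particular E or a training procedure:

    # Sketch of the training objective sum_s E(y^s, MLP(x^s)); the squared-error
    # choice of E and the random weights are illustrative assumptions.
    import numpy as np

    phi = lambda z: np.maximum(z, 0)

    def mlp(x, W1, W2, W3):
        return phi(W3 @ phi(W2 @ phi(W1 @ x)))[0]

    def total_error(D, predict, E=lambda y, y_hat: 0.5 * (y - y_hat) ** 2):
        return sum(E(y_s, predict(x_s)) for x_s, y_s in D)

    rng = np.random.default_rng(0)
    W1, W2 = rng.standard_normal((2, 2)), rng.standard_normal((2, 2))
    W3 = rng.standard_normal((1, 2))

    D = [(np.array([0.0, 0.0]), -1.0), (np.array([0.0, 1.0]), 1.0),
         (np.array([1.0, 0.0]),  1.0), (np.array([1.0, 1.0]), -1.0)]

    # This total error is what the minimization over the weights W^1, ..., W^L targets.
    print(total_error(D, lambda x: mlp(x, W1, W2, W3)))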
