EE2211 Introduction to
Machine Learning
Lecture 12
Wang Xinchao
[email protected]
!"#$%&'()*+",,-"./01"233"4()*+5"4656'7681"
Course Contents
• Introduction and Preliminaries (Xinchao)
– Introduction
– Data Engineering
– Introduction to Linear Algebra, Probability and Statistics
• Fundamental Machine Learning Algorithms I (Helen)
– Systems of linear equations
– Least squares, Linear regression
– Ridge regression, Polynomial regression
• Fundamental Machine Learning Algorithms II (Thomas)
– Over-fitting, bias/variance trade-off
– Optimization, Gradient descent
– Decision Trees, Random Forest
• Performance and More Algorithms (Xinchao)
– Performance Issues
– K-means Clustering
– Neural Networks
About this week’s lecture…
• Neural Network (NN) is a very big topic
– In NUS we have multiple full-semester modules to discuss NN
• EE4305 Fuzzy/Neural Systems for Intelligent Robotics
• EE5934/EE6934 Deep Learning
– In EE2211, we only give a very gentle introduction
• Understanding at the conceptual level is sufficient
– In the final exam, there is only 1 True/False + 1 MCQ question about NN
– No computation is required
• You will do some computation in tutorial, but final exam
will be much simpler than the questions in tutorial
Outline
• Introduction to Neural Networks
– Multi-layer perceptron
– Activation Functions
• Training and Testing of Neural Networks
– Training: Forward and Backward
– Testing: Forward
• Convolutional Neural Networks
Perceptron
• In ML, the perceptron is an algorithm for supervised learning of binary classifiers.
• It is the building block of a neural network: an artificial neuron that performs certain computations on the input data.
(Figure: inputs x1, …, xd together with a bias input +1 feed into the neuron; the neuron forms a weighted sum and passes it through an activation function σ.)
• With the input vector X = [x1, …, xd]^T and the weight vector W = [w1, …, wd]^T (plus a bias weight w0 for the +1 input), the summation is the vector product X^T W.
• Output of the neuron: σ(X^T W), i.e. σ(Σ_i x_i w_i)
• Activation function: a non-linear function used to introduce non-linearity into the neural network!
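As an illustration (not part of the slides), here is a minimal NumPy sketch of this single-neuron computation; the input values, the weights, and the choice of a sigmoid activation are made up for the example:

```python
import numpy as np

def sigmoid(z):
    """One possible activation function (see the next slides)."""
    return 1.0 / (1.0 + np.exp(-z))

# Input vector with the bias term +1 appended, and one weight per entry.
x = np.array([1.0, 0.6, 0.5, 0.7])   # [bias, x1, x2, x3]  (made-up values)
w = np.array([0.1, 0.4, -0.2, 0.3])  # [w0, w1, w2, w3]    (made-up values)

z = x @ w            # summation: X^T W = sum_i x_i * w_i
output = sigmoid(z)  # activation: sigma(X^T W)
print(output)
```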
Activation Functions
Sigmoid Activation Function
  σ(z) = 1 / (1 + e^(−z)),  applied here to z = X^T W
(Figure: the sigmoid curve squashes X^T W into the range (0, 1).)
• The output can be interpreted as the probability of the input belonging to a class.
Activation Functions
ReLU Activation Function
  σ(z) = max(0, z)
(Figure: the ReLU curve is zero for negative inputs and linear for positive inputs.)
Rectified Linear Unit (ReLU)
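A minimal sketch of the two activation functions above, evaluated at a few arbitrary sample points:

```python
import numpy as np

def sigmoid(z):
    """Sigmoid: squashes any real number into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def relu(z):
    """ReLU: zero for negative inputs, identity for positive inputs."""
    return np.maximum(0.0, z)

z = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print(sigmoid(z))  # values strictly between 0 and 1
print(relu(z))     # [0.  0.  0.  0.5 2. ]
```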
Multilayer Perceptron (Neural Network)
• Stacking layers of such neurons gives a multilayer perceptron, which can be used as a classifier; the output of each layer becomes the input of the next layer. It is a nested function!
(Figure: the input X = [x1, x2, x3]^T plus a bias input +1 enters layer 0, the input layer; hidden layers 1, 2, …, n−1 each apply a set of weights followed by an activation σ; layer n, the output layer, produces one response per class, for Class 1 through Class C, which can be read as class probabilities.)
Note: hn denotes the number of hidden neurons in layer n.
Multilayer Perceptron (Neural Network)
• Consider layer 1 with h1 neurons. Its weights can be collected into a matrix

  W^(1) = [ w^(1)_{1,1} … w^(1)_{1,h1}
            w^(1)_{2,1} … w^(1)_{2,h1}
            w^(1)_{3,1} … w^(1)_{3,h1} ]

  so that X^T W^(1) = [ Σ_i x_i w^(1)_{i,1}, …, Σ_i x_i w^(1)_{i,h1} ], one weighted sum per neuron.
(Figure: the same network as before, with layer 0 the input layer and layer n the output layer.)
• A neural network is essentially a nested function:
  f(X) = σ_n( W_n^T σ_{n−1}( W_{n−1}^T … σ_1( W_1^T X ) … ) )
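To make the nested-function view concrete, here is a minimal NumPy sketch of a forward pass through a small network with two hidden layers; the layer sizes, the random weights, and the use of sigmoid in every layer are illustrative assumptions, not the slides' settings:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)

# Layer sizes: 3 inputs -> h1 = 4 hidden -> h2 = 4 hidden -> C = 3 outputs.
# Each weight matrix has one extra row for the bias input (+1).
W1 = rng.normal(size=(3 + 1, 4))
W2 = rng.normal(size=(4 + 1, 4))
W3 = rng.normal(size=(4 + 1, 3))

def forward(x, weights):
    """Nested function: a = sigma(W^T [a_prev; 1]) applied layer by layer."""
    a = x
    for W in weights:
        a = np.append(a, 1.0)      # append the bias term +1
        a = sigmoid(a @ W)         # sigma(W^T a)
    return a

x = np.array([0.6, 0.5, 0.7])
print(forward(x, [W1, W2, W3]))    # one response per class
```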
Outline
• Introduction to Neural Networks
– Multi-layer perceptron
– Activation Functions
• Training and Testing of Neural Networks
– Training: Forward and Backward
– Testing: Forward
• Convolutional Neural Networks
Goal of Neural Network Training: Learn W
(Figure: the example input X = [0.6, 0.5, 0.7]^T is fed through the network, producing the output ŷ = f_W(X) = [0.7, 0.1, 0.2]^T.)
Specifically, W is learned through
1. Random initialization
2. Backpropagation
Neural Network Training: Backpropagation
Assume we train a NN for 3-class classification.
(Figure: the input X = [0.6, 0.5, 0.7]^T produces the prediction ŷ = f_W(X) = [0.7, 0.1, 0.2]^T, which is compared against the one-hot class label y = [0, 0, 1]^T using a loss function.)
1. Forward (weights are fixed, starting from random initialization):
   – To compute network responses
   – To compute the errors at each output; for a single sample:
     min_W Σ_{j=1}^{C} (ŷ_j − y_j)^2, or equivalently min_W ||ŷ − y||^2
2. Backward (weights are updated):
   – To pass back the error from the output to the hidden layers
   – To update all weights to optimize the network. Update W!
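A tiny sketch of the comparison step, using the prediction and the one-hot label from the figure:

```python
import numpy as np

y_hat = np.array([0.7, 0.1, 0.2])   # network prediction f_W(X)
y     = np.array([0.0, 0.0, 1.0])   # one-hot label: the true class is Class 3

# Squared-error loss for a single sample: sum_j (y_hat_j - y_j)^2
loss = np.sum((y_hat - y) ** 2)
print(loss)   # 0.49 + 0.01 + 0.64 = 1.14
```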
Neural Network Training: Backpropagation
• Recall that the parameters W are randomly initialized.
• We use Backpropagation to update W.
• In essence, Backpropagation is gradient descent!
• Assume we have N samples, each sample denoted by X_i and the corresponding NN output by ŷ_i. The loss function is then
  L = Σ_{i=1}^{N} ||ŷ_i − y_i||^2,  and we solve min_W L to learn W.
  Recall gradient descent in Lec 8: w ← w − η∇_w L
• We would therefore like to compute ∇_w L!
  – L is a function of ŷ, and ŷ is a function of w.
  – Use gradient descent and the chain rule!
Being aware of the concept is sufficient for exam. No calculation needed.
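For the curious (not required for the exam), here is a minimal sketch of gradient descent with the chain rule for a single sigmoid neuron and squared-error loss; the data, the target, and the learning rate are made up:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

x = np.array([1.0, 0.6, 0.5, 0.7])   # one input sample with bias term
y = 1.0                               # its target
w = np.zeros(4)                       # randomly initialised in practice
eta = 0.5                             # learning rate

for step in range(100):
    # Forward: compute the response and the error
    y_hat = sigmoid(x @ w)
    loss = (y_hat - y) ** 2
    # Backward: chain rule  dL/dw = dL/dy_hat * dy_hat/dz * dz/dw
    grad = 2 * (y_hat - y) * y_hat * (1 - y_hat) * x
    # Gradient descent update  w <- w - eta * grad
    w = w - eta * grad

print(y_hat, w)   # the prediction moves towards the target
```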
Neural Network Testing
Once the network is trained and its parameters are updated:
(Figure: a test input X = [0.6, 0.5, 0.7]^T is passed through the trained network, producing the output ŷ = [0.7, 0.1, 0.2]^T.)
1. Forward (weights are fixed):
   – To compute network responses
   – To predict the output labels given novel inputs
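A tiny sketch of the prediction step: the predicted label is simply the class with the largest network response (the output values are the made-up ones from the figure):

```python
import numpy as np

y_hat = np.array([0.7, 0.1, 0.2])        # network outputs for a test input
predicted_class = int(np.argmax(y_hat))  # index of the largest response
print(predicted_class)                   # 0, i.e. Class 1
```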
Supplementary materials
(Not required for exam)
1) https://www.youtube.com/watch?v=tIeHLnjs5U8
This video series includes animations that explain backpropagation
calculus.
2)
https://www.youtube.com/playlist?list=PLQVvvaa0QuDcjD5BAw2DxE6
OF2tius3V3
This video series includes hands-on coding examples in Python.
Outline
• Introduction to Neural Networks
– Multi-layer perceptron
– Activation Functions
• Training and Testing of Neural Networks
– Training: Forward and Backward
– Testing: Forward
• Convolutional Neural Networks
Convolutional Neural Network (CNN)
• A convolutional neural network (CNN) is a special type of
feed-forward network that significantly reduces the
number of parameters in a deep neural network.
• Very popular in image-related applications
• Each image is stored as a matrix in a computer
(Figure: each image is modelled as a matrix; each pixel is an entry of the matrix, and the higher the pixel value, the brighter the pixel.)
https://medium.com/lifeandtech/convert-csv-file-to-images-309b6fdb8c49
Convolutional Neural Network (CNN)
• If we model all matrix entries as inputs all at once
– Assume we have an image/matrix size of 200x200
– Assume we have 10K neurons in the first layer
– We already have 200x200x10K=400 Million parameters to learn!
(Figure: a fully connected first layer, where every one of the 10K neurons is linked to every one of the 200×200 pixels. That is a lot of parameters, so a fully connected neural network is not a good fit here.)
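A quick back-of-the-envelope check of the numbers above; the 3×3 kernel used for comparison is an assumption, matching the kernel shown on the later slides:

```python
image_pixels = 200 * 200             # inputs if every pixel feeds every neuron
first_layer = 10_000                 # neurons in the first layer
fully_connected = image_pixels * first_layer
print(fully_connected)               # 400,000,000 weights

conv_kernel = 3 * 3                  # a single sliding 3x3 kernel
print(conv_kernel)                   # 9 weights, reused at every position
```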
Convolutional Neural Network (CNN)
• Hence, we introduce CNN to reduce the number of
parameters.
(Figure: instead of linking every neuron to every pixel, design a single neuron (kernel) that only looks at a very small local region, and slide it over the whole image, scanning it position by position.)
Convolutional Neural Network (CNN)
(Figure: a 3×3 kernel slides over the image pixel values and produces one output value g[x, y] per position.)
The kernel weights, e.g.
   0  -1   0
  -1   5  -1
   0  -1   0
are the parameters (kernels) to be learned.
Image source: https://brilliant.org/wiki/convolutional-neural-network/
Convolutional Neural Network (CNN)
(Figures: the same 3×3 kernel
   0  -1   0
  -1   5  -1
   0  -1   0
is stepped across successive positions of the image, producing one output value g[x, y] per position.)
Neural Networks are Effective
(Figure: results on ImageNet.)
Summary
• Introduction to Neural Networks
– Multi-layer perceptron
– Activation Functions
• Training and Testing of Neural Networks
– Training: Forward and Backward
– Testing: Forward
• Convolutional Neural Networks