Lecture 19 NN

Neural Network

Biological Neuron

Neurons are interconnected nerve cells in the human brain
that are involved in:
–Processing and transmitting chemical and electrical signals
A human brain has billions of neurons


Biological Neurons

Dendrites are branches that
receive information from other
neurons

Cell nucleus or Soma
processes the information
received from dendrites

Axon is a cable that is used by
neurons to send information

Synapse is the connection
between an axon and other
neuron dendrites
Artificial Neuron

An artificial neuron is a
mathematical function based on
a model of biological neurons,
where each neuron takes:
–Inputs

–Weighs them separately


–Sums them up and
–Passes this sum through a
nonlinear function to produce
output
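As a minimal sketch of this computation in Python (the input values, weights, bias, and the choice of sigmoid as the nonlinear function are illustrative assumptions, not values from these slides):

```python
import math

def artificial_neuron(inputs, weights, bias):
    # Weigh each input separately and sum them up
    total = sum(w * x for w, x in zip(weights, inputs)) + bias
    # Pass the sum through a nonlinear function (sigmoid here)
    return 1 / (1 + math.exp(-total))

# Illustrative values only
print(artificial_neuron(inputs=[0.5, 0.3], weights=[0.4, 0.7], bias=-0.1))
```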
Biological Neuron vs. Artificial
Neuron
Artificial Neuron at a Glance

A neuron is a mathematical function modeled on the working of
biological neurons

It is an elementary unit in an artificial neural network

One or more inputs are separately weighted

Inputs are summed and passed through a nonlinear function to
produce output

Every neuron holds an internal state called the activation signal

Each connection link carries information about the input signal

Every neuron is connected to other neurons via connection links
Perceptron

A perceptron is a neural network unit (an artificial neuron) that does certain computations to detect features or business intelligence in the input data

The Perceptron was introduced by Frank Rosenblatt in 1957

A Perceptron is an algorithm for supervised learning of binary classifiers

This algorithm enables neurons to learn and process elements in the training set one at a time
Types of Perceptron

There are two types of Perceptrons:
–Single layer
–Multilayer

Single layer Perceptrons can learn only linearly separable patterns

Multilayer Perceptrons, or feedforward neural networks with two or more layers, have greater processing power

The Perceptron algorithm learns the weights for the input signals in order to draw a linear decision boundary
Perceptron Learning Rule

Perceptron Learning Rule states that the
algorithm would automatically learn the
optimal weight coefficients

The input features are then multiplied with
these weights to determine if a neuron fires
or not

The Perceptron receives multiple input
signals, and if the sum of the input signals
exceeds a certain threshold, it either
outputs a signal or does not return an
output

In the context of supervised learning and
classification, this can then be used to
predict the class of a sample
Perceptron Function

The Perceptron is a function that maps its input “x,” multiplied with the learned weight coefficients, to an output value “f(x)”:

f(x) = 1 if w · x + b > 0, and 0 otherwise, where w · x = w1x1 + w2x2 + … + wmxm

Where:
–“w” = vector of real-valued weights
–“b” = bias (an element that adjusts the boundary away from the origin without any dependence on the input value)
–“x” = vector of input x values
–“m” = number of inputs to the Perceptron

The output can be represented as “1” or “0.” It can also be represented as “1” or “-1,” depending on which activation function is used
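A minimal sketch of this function in Python (variable names and the example values are illustrative):

```python
def perceptron(x, w, b):
    # Multiply the input vector with the learned weight coefficients
    weighted_sum = sum(wi * xi for wi, xi in zip(w, x)) + b
    # Output "1" if the boundary is crossed, "0" otherwise
    return 1 if weighted_sum > 0 else 0

# Illustrative call with made-up weights and bias
print(perceptron(x=[1, 0], w=[0.4, 0.4], b=-0.5))  # prints 0
```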


Inputs of a Perceptron

A Perceptron accepts inputs, adjusts the weight values, and applies the transformation function to output the final result

The output has only two values: Yes and No, or True and False

The summation function “∑” multiplies all inputs “x” by their weights “w” and then adds them up
Activation Functions of Perceptron

The activation function applies a step rule
(convert the numerical output into +1 or -1)
to check if the output of the weighting
function is greater than zero or not

The Step function gets triggered above a certain value of the neuron output; otherwise it outputs zero

The Sign function outputs +1 or -1 depending on whether the neuron output is greater than zero or not

The Sigmoid is the S-curve and outputs a value between 0 and 1
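Minimal sketches of these three activations in Python (taking the threshold for the step and sign rules to be zero, as described above):

```python
import math

def step(z):
    # Fires (outputs 1) above zero, else outputs 0
    return 1 if z > 0 else 0

def sign(z):
    # +1 if the neuron output is greater than zero, otherwise -1
    return 1 if z > 0 else -1

def sigmoid(z):
    # S-curve squashing any real z into (0, 1)
    return 1 / (1 + math.exp(-z))
```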
Error and Output of Perceptron

Predicted output is compared with the known output. If it
does not match, the error is propagated backward to allow
weight adjustment to happen
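A minimal sketch of this compare-and-adjust step, following the update used in the running example later in these slides (note the slides add α · ε directly to every weight; the textbook perceptron rule additionally scales the update by each input xi):

```python
def train_step(w, x, actual, alpha=0.5, threshold=0.5):
    # Forward pass: compare the weighted sum against the threshold
    prediction = 1 if sum(wi * xi for wi, xi in zip(w, x)) >= threshold else 0
    # Compare the predicted output with the known output
    error = actual - prediction
    if error != 0:
        # Mismatch: propagate the error back into the weights
        w = [wi + alpha * error for wi in w]
    return w
```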
Perceptron Summary

Perceptron is an algorithm for Supervised Learning

Optimal weight coefficients are automatically learned

Weights are multiplied with the input features and a decision is made whether the neuron fires or not

Activation function applies a step rule to check if the output of the weighting function is
greater than zero

Linear decision boundary is drawn enabling the distinction between the two linearly
separable classes +1 and -1

If the sum of the input signals exceeds a certain threshold, it outputs a signal,
otherwise, there is no output

Types of activation functions include the sign, step, and sigmoid functions.
Running Example
We are going to work on the AND gate problem

The gate returns 1 if and only if both inputs are 1 (true)

We are going to set the weights randomly

Let’s say that w1 = 0.9 and w2 = 0.9



Round 1

We apply the 1st instance to the perceptron: x1 = 0 and x2 = 0

Sum unit: Σ = x1 * w1 + x2 * w2 = 0 * 0.9 + 0 * 0.9 = 0

Suppose that the activation threshold is 0.5

The sum unit output was 0 for the 1st instance, so the activation unit returns 0 because it is less than 0.5. The expected output is 0 as well, so we will not update the weights because there is no error in this case

Let’s focus on the 2nd instance: x1 = 0 and x2 = 1

Sum unit: Σ = x1 * w1 + x2 * w2 = 0 * 0.9 + 1 * 0.9 = 0.9
Round-1 (Error)

The activation unit will return 1 because the sum unit output (0.9) is greater than 0.5, but the output of this instance should be 0

It is not predicted correctly, so we will update the weights based on the error

ε = actual – prediction = 0 – 1 = -1

We add error times learning rate to the weights; let the learning rate α be 0.5

w1 = w1 + α * ε = 0.9 + 0.5 * (-1) = 0.9 – 0.5 = 0.4

w2 = w2 + α * ε = 0.9 + 0.5 * (-1) = 0.9 – 0.5 = 0.4
Round-1 (3rd and 4th instance)

The 3rd instance: x1 = 1 and x2 = 0

Sum unit: Σ = x1 * w1 + x2 * w2 = 1 * 0.4 + 0 * 0.4 = 0.4

The activation unit will return 0 this time because the output of the sum unit (0.4) is less than 0.5 (no update)

The 4th instance: x1 = 1 and x2 = 1

Sum unit: Σ = x1 * w1 + x2 * w2 = 1 * 0.4 + 1 * 0.4 = 0.8

The activation unit will return 1 because the output of the sum unit is 0.8, which is greater than 0.5

The actual value is 1, so it is predicted correctly. We will not update the weights
Round-2 (1st instance)

In the previous round, the 1st instance was classified correctly. Let’s apply the feed forward pass with the new weight values. Remember that the 1st instance is x1 = 0 and x2 = 0

Sum unit: Σ = x1 * w1 + x2 * w2 = 0 * 0.4 + 0 * 0.4 = 0

The activation unit will return 0 because the sum unit output is 0, which is less than the threshold value 0.5. The output of the 1st instance should be 0 as well, which means the instance is classified correctly. No update in weights

Round-2 (2nd Instance)
Feed forward for the 2nd instance: x1 = 0 and x2 = 1

Sum unit: Σ = x1 * w1 + x2 * w2 = 0 * 0.4 + 1 * 0.4 = 0.4

The activation unit will return 0 because the sum unit output is less than the threshold 0.5. The output will be 0, which means it is classified correctly, and we will not update the weights

We already applied the feed forward calculation for the 3rd and 4th instances with the current weight values in Round 1. They were classified correctly
Learning Term

Updating the weights is what learning means in the perceptron

We set the weights to 0.9 initially, but this caused some errors

Then we updated the weight values to 0.4. With these weights, we can predict all instances correctly

Luckily, we found the best weights in only 2 rounds. A compact implementation of the whole walkthrough follows below
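This sketch reproduces the slides’ numbers (threshold 0.5, learning rate 0.5, initial weights 0.9, and the slides’ simplified update that adds α * ε to both weights without scaling by the inputs):

```python
# AND gate truth table: ((x1, x2), expected output)
data = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
w = [0.9, 0.9]               # initial weights
alpha, threshold = 0.5, 0.5  # learning rate and activation threshold

for rnd in (1, 2):
    for x, actual in data:
        s = sum(wi * xi for wi, xi in zip(w, x))  # sum unit
        prediction = 1 if s >= threshold else 0   # activation unit
        error = actual - prediction
        if error != 0:
            # Slides' update: add alpha * error to every weight
            w = [wi + alpha * error for wi in w]
        print(f"Round {rnd}: x={x} sum={s:.1f} pred={prediction} w={w}")
```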
Multilayer Perceptron
Multilayer Perceptron Introduction

A single-layer perceptron cannot learn and identify non-linear patterns

The public lost interest in the perceptron, as most problems in the real world are non-linear

Fast forward almost two decades to 1986: David Rumelhart, Geoffrey Hinton, and Ronald Williams published the paper “Learning representations by back-propagating errors”, which introduced:
–Backpropagation, a procedure to repeatedly adjust the weights so as to minimize the difference between actual output and desired output
–Hidden Layers, which are neuron nodes stacked in between inputs and outputs, allowing neural networks to learn more complicated features
Neural Networks with Hidden
Layers

Adding more neurons in
between the input and output
layers

Data in the input layer is
labeled as x with subscripts
1, 2, 3, …, m

Neurons in the hidden layer
are labeled as h with
subscripts 1, 2, 3, …, n
How it Works

With m features in the input X, we need m weights to perform a dot product

With n hidden neurons in the hidden layer, we need n sets of weights (W1, W2, …, Wn) for performing the dot products

With 1 hidden layer, you perform n dot products to get the hidden output h: (h1, h2, …, hn)

Then, just like in a single-layer perceptron, we use the hidden output h: (h1, h2, …, hn) as input data with n features, and perform a dot product with 1 set of n weights (w1, w2, …, wn) to get the final output
Hidden Layer Forward Propagation
Hidden Layer Procedure
Final Output Calculation
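The three headings above refer to slide diagrams; here is a NumPy sketch of the same hidden-layer forward propagation and final output calculation (the layer sizes, random weights, and the choice of sigmoid for both layers are illustrative assumptions):

```python
import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

m, n = 3, 4                      # m input features, n hidden neurons
x = np.random.rand(m)            # input vector with m features
W_hidden = np.random.rand(n, m)  # n sets of m weights (W1, ..., Wn)
w_out = np.random.rand(n)        # 1 set of n weights for the output

h = sigmoid(W_hidden @ x)  # n dot products -> hidden output (h1, ..., hn)
y = sigmoid(w_out @ h)     # hidden output used as input to the final neuron
print(h, y)
```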
Sigmoid Neuron
Some Detail about Sigmoid

The sigmoid function produces results similar to the step function in that the output is between 0 and 1. The curve crosses 0.5 at z = 0, so we can set up rules for the activation function, such as:
–If the sigmoid neuron’s output is larger than or equal to 0.5, it outputs 1; if the output is smaller than 0.5, it outputs 0

Unlike the step function, the sigmoid curve is smooth, with no sudden jump

If z is very negative, the output is approximately 0; if z is very positive, the output is approximately 1; but around z = 0, where z is neither too large nor too small, the output changes relatively quickly as z changes
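A minimal sketch of the sigmoid neuron and the 0.5 rule described above:

```python
import math

def sigmoid(z):
    # S-curve squashing any real z into (0, 1)
    return 1 / (1 + math.exp(-z))

def sigmoid_neuron_output(z):
    # Rule from the slide: output 1 when sigmoid(z) >= 0.5 (i.e. z >= 0)
    return 1 if sigmoid(z) >= 0.5 else 0

# Very negative z -> ~0, very positive z -> ~1, fastest change around z = 0
for z in (-6, -1, 0, 1, 6):
    print(z, round(sigmoid(z), 3), sigmoid_neuron_output(z))
```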
Non-Linear Activation Options
Use ReLU whenever possible, on every hidden layer

Use Softmax on an output layer with more than two categories to be predicted

Use Sigmoid on an output layer with two categories
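As a sketch of how these guidelines look in a typical Keras model (the layer sizes, 20-feature input, and 10-category output are illustrative assumptions):

```python
import tensorflow as tf

model = tf.keras.Sequential([
    # ReLU on every hidden layer
    tf.keras.layers.Dense(64, activation="relu", input_shape=(20,)),
    tf.keras.layers.Dense(64, activation="relu"),
    # Softmax on the output layer: more than two categories (here, 10)
    tf.keras.layers.Dense(10, activation="softmax"),
])
# For two categories, the output layer would instead be:
# tf.keras.layers.Dense(1, activation="sigmoid")
```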
