
19AI411 - NEURAL NETWORK

UNIT - II
PERCEPTRON

Contents
• Single Layer Perceptron: Adaptive Filtering Problem
• Unconstrained Optimization Techniques
• Linear Least-Squares Filters
• Least-Mean-Square Algorithm
• Learning Curves, Learning-Rate Annealing Techniques
• Perceptron Convergence Theorem
• Relation Between the Perceptron and Bayes Classifier for a Gaussian Environment
• Multilayer Perceptron: Back-Propagation Algorithm
• XOR Problem
• Heuristics
• Output Representation and Decision Rule
• Feature Detection
Adaptive Filtering Problem

Dynamic system: the external behavior of the system is described by the training data
T: {x(i), d(i); i = 1, 2, …, n, …}
where x(i) = [x1(i), x2(i), …, xm(i)]^T.

x(i) can arise from:
• Spatial data: x(i) is a snapshot of data.
• Temporal data: x(i) is uniformly spaced in time.

(Figure: signal-flow graph of the adaptive filter.)

• Filtering process
  – y(i) is produced in response to x(i).
  – e(i) = d(i) − y(i)
• Adaptive process
  – Automatic adjustment of the synaptic weights in accordance with e(i).

y(i) = v(i) = Σ_{k=1}^{m} w_k(i) x_k(i) = x^T(i) w(i)
e(i) = d(i) − y(i)
where w(i) = [w_1(i), w_2(i), …, w_m(i)]^T
Important Points

• The algorithm starts from an arbitrary setting of the neuron's synaptic weights.
• Adjustments to the synaptic weights, in response to statistical variations in the system's behavior, are made on a continuous basis.
• Computation of the adjustments to the synaptic weights is completed inside a time interval.
Unconstrained Optimization Techniques

• Let C(w) be a continuously differentiable function of some unknown weight (parameter) vector w.
• C(w) maps w into real numbers.
• Goal: find an optimal solution w* that satisfies C(w*) ≤ C(w), i.e., minimize C(w) with respect to w.

Necessary condition for optimality: ∇C(w*) = 0, where ∇ is the gradient operator
∇ = [∂/∂w_1, ∂/∂w_2, …, ∂/∂w_m]^T
∇C(w) = [∂C/∂w_1, ∂C/∂w_2, …, ∂C/∂w_m]^T

A class of unconstrained optimization algorithms:
Starting with an initial guess denoted by w(0), generate a sequence of weight vectors w(1), w(2), …, such that the cost function C(w) is reduced at each iteration of the algorithm.
Method of Steepest Descent

The successive adjustments applied to w are in the direction of steepest descent, that is, in a direction opposite to the gradient vector ∇C(w).

Let g = ∇C(w).
The steepest-descent algorithm: w(n+1) = w(n) − ηg(n)
η: a positive constant called the step size or learning-rate parameter.
Δw(n) = w(n+1) − w(n) = −ηg(n)

Small η: overdamps the transient response.
Large η: underdamps the transient response.
If η exceeds a certain value, the algorithm becomes unstable.
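The update rule above can be sketched in a few lines of Python. This is an illustrative example only (the quadratic cost function, the learning rate eta, and the iteration count are assumptions, not taken from the slides):

import numpy as np

def steepest_descent(grad, w0, eta=0.1, n_iter=100):
    # Iterate w(n+1) = w(n) - eta * g(n), where g(n) is the gradient of C at w(n).
    w = np.asarray(w0, dtype=float)
    for _ in range(n_iter):
        g = grad(w)            # gradient vector g = grad C(w)
        w = w - eta * g        # step in the direction opposite to the gradient
    return w

# Example: C(w) = 0.5 * ||w - w_star||^2 has gradient (w - w_star).
w_star = np.array([1.0, -2.0])
print(steepest_descent(lambda w: w - w_star, w0=[0.0, 0.0], eta=0.2, n_iter=50))  # approx. w_star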
Newton's Method

Minimize the quadratic approximation of the cost function C(w) around the current point w(n).

Applying a second-order Taylor series expansion of C(w) around w(n):
ΔC(w(n)) = C(w(n+1)) − C(w(n))
         ≈ g^T(n) Δw(n) + (1/2) Δw^T(n) H(n) Δw(n)

where H is the Hessian matrix of C(w):
H = ∇²C(w) =
    [ ∂²C/∂w1²      ∂²C/∂w1∂w2   …  ∂²C/∂w1∂wm ]
    [ ∂²C/∂w2∂w1    ∂²C/∂w2²     …  ∂²C/∂w2∂wm ]
    [ ⋮             ⋮                ⋮          ]
    [ ∂²C/∂wm∂w1    ∂²C/∂wm∂w2   …  ∂²C/∂wm²   ]

ΔC(w(n)) is minimized when
g(n) + H(n) Δw(n) = 0
Δw(n) = −H⁻¹(n) g(n)
w(n+1) = w(n) + Δw(n) = w(n) − H⁻¹(n) g(n)

Generally speaking, Newton's method converges quickly.
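As an illustration (not from the slides), a single Newton step on a quadratic cost reaches the minimizer exactly; the matrix A and vector b below are arbitrary assumptions:

import numpy as np

def newton_step(grad, hessian, w):
    # One Newton update: w(n+1) = w(n) - H^{-1}(n) g(n).
    g = grad(w)
    H = hessian(w)
    return w - np.linalg.solve(H, g)   # solve H * delta = g rather than inverting H

# Quadratic cost C(w) = 0.5 * w^T A w - b^T w, with gradient A w - b and Hessian A.
A = np.array([[3.0, 1.0], [1.0, 2.0]])
b = np.array([1.0, 1.0])
w = newton_step(lambda w: A @ w - b, lambda w: A, np.zeros(2))
print(w, np.linalg.solve(A, b))        # the single step lands on the minimizer A^{-1} b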
Gauss-Newton Method

The Gauss-Newton method is applicable to a cost function C(w) that is the sum of error squares.

Let C(w) = (1/2) Σ_{i=1}^{n} e²(i)

Linearizing the error around the current point w(n):
e(i, w) = e(i) + [∂e(i)/∂w]^T |_{w=w(n)} (w − w(n)),   i = 1, 2, …, n

In matrix form:
e(n, w) = e(n) + J(n)(w − w(n)),  where e(n) = [e(1), e(2), …, e(n)]^T

The Jacobian J(n) is [∇e(n)]^T:
J(n) =
    [ ∂e(1)/∂w1   ∂e(1)/∂w2   …  ∂e(1)/∂wm ]
    [ ∂e(2)/∂w1   ∂e(2)/∂w2   …  ∂e(2)/∂wm ]
    [ ⋮           ⋮                ⋮        ]
    [ ∂e(n)/∂w1   ∂e(n)/∂w2   …  ∂e(n)/∂wm ]
evaluated at w = w(n).

Goal: w(n+1) = arg min_w { (1/2) ||e(n, w)||² }
Gauss-Newton Method (Cont.)

(1/2) ||e(n, w)||²
  = (1/2) ||e(n) + J(n)(w − w(n))||²
  = (1/2) ||e(n)||² + e^T(n) J(n)(w − w(n)) + (1/2) (w − w(n))^T J^T(n) J(n)(w − w(n))

(e^T(n) J(n)(w − w(n)) and (w − w(n))^T J^T(n) e(n) are both scalars and are equal, so they combine into the single cross term above.)

Differentiating this expression with respect to w and setting the result to zero:
J^T(n) e(n) + J^T(n) J(n)(w − w(n)) = 0
w(n+1) = w(n) − (J^T(n) J(n))⁻¹ J^T(n) e(n)

To guard against the possibility that J(n) is rank deficient:
w(n+1) = w(n) − (J^T(n) J(n) + δI)⁻¹ J^T(n) e(n)
Linear Least-Squares Filter

Characteristics of the linear least-squares filter:
– The single neuron around which it is built is linear.
– The cost function C(w) consists of the sum of error squares.

e(n) = d(n) − [x(1), x(2), …, x(n)]^T w(n)
     = d(n) − X(n) w(n)
where d(n) = [d(1), d(2), …, d(n)]^T and X(n) = [x(1), x(2), …, x(n)]^T.

Differentiating e(n) with respect to w(n) gives ∇e(n) = −X^T(n), so the Jacobian is J(n) = −X(n).

Substituting this into the equation derived from the Gauss-Newton method:
w(n+1) = w(n) + (X^T(n) X(n))⁻¹ X^T(n) [d(n) − X(n) w(n)]
       = (X^T(n) X(n))⁻¹ X^T(n) d(n)

Let X⁺(n) = (X^T(n) X(n))⁻¹ X^T(n) (the pseudoinverse of X(n)); then
w(n+1) = X⁺(n) d(n)
Wiener Filter

The Wiener filter is the limiting form of the linear least-squares filter for an ergodic environment.

Let R_x denote the correlation matrix of the input vector x(i):
R_x = E[x(i) x^T(i)] = lim_{n→∞} (1/n) Σ_{i=1}^{n} x(i) x^T(i) = lim_{n→∞} (1/n) X^T(n) X(n)

Let r_xd denote the cross-correlation vector of x(i) and d(i):
r_xd = E[x(i) d(i)] = lim_{n→∞} (1/n) Σ_{i=1}^{n} x(i) d(i) = lim_{n→∞} (1/n) X^T(n) d(n)

Let w_0 denote the Wiener solution to the linear optimum filtering problem:
w_0 = lim_{n→∞} w(n+1) = lim_{n→∞} (X^T(n) X(n))⁻¹ X^T(n) d(n)
    = [lim_{n→∞} (1/n) X^T(n) X(n)]⁻¹ [lim_{n→∞} (1/n) X^T(n) d(n)]
    = R_x⁻¹ r_xd
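A sketch showing that, for a long (approximately ergodic) data record, the sample estimates of R_x and r_xd give essentially the same weights as the least-squares solution (all data below are synthetic assumptions):

import numpy as np

rng = np.random.default_rng(2)
n, m = 5000, 3
X = rng.normal(size=(n, m))            # rows are input vectors x(i)^T
w_true = np.array([1.0, 0.5, -0.25])
d = X @ w_true + 0.1 * rng.normal(size=n)

R_x = (X.T @ X) / n                    # sample estimate of E[x x^T]
r_xd = (X.T @ d) / n                   # sample estimate of E[x d]
w0 = np.linalg.solve(R_x, r_xd)        # Wiener solution w0 = R_x^{-1} r_xd
print(w0)                              # close to w_true and to the least-squares weights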
Least-Mean-Square (LMS) Algorithm

LMS is based on instantaneous values of the cost function:
C(w) = (1/2) e²(n),  where e(n) is the error signal measured at time n.

∂C(w)/∂w = e(n) ∂e(n)/∂w = −x(n) e(n),  because e(n) = d(n) − x^T(n) w(n)

Using the instantaneous gradient estimate ĝ(n) = −x(n) e(n) gives
ŵ(n+1) = ŵ(n) + η x(n) e(n)

ŵ(n) is used in place of w(n) to emphasize that LMS produces an estimate of the weight vector that would result from the method of steepest descent.

Summary of the LMS algorithm:
Training sample:          input signal vector x(n), desired response d(n)
User-selected parameter:  η
Initialization:           set ŵ(0) = 0
Computation:              for n = 1, 2, …, compute
                          e(n) = d(n) − ŵ^T(n) x(n)
                          ŵ(n+1) = ŵ(n) + η x(n) e(n)
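A direct transcription of the LMS summary into Python (the synthetic data and the choice of eta are assumptions):

import numpy as np

def lms(X, d, eta=0.01):
    # LMS: w_hat(0) = 0; for each n,
    # e(n) = d(n) - w_hat^T(n) x(n),  w_hat(n+1) = w_hat(n) + eta * x(n) * e(n).
    w = np.zeros(X.shape[1])
    for x_n, d_n in zip(X, d):
        e_n = d_n - w @ x_n            # instantaneous error
        w = w + eta * x_n * e_n        # stochastic-gradient weight update
    return w

# Example on synthetic linear data (eta chosen by hand):
rng = np.random.default_rng(3)
X = rng.normal(size=(2000, 3))
w_true = np.array([1.0, -0.5, 0.25])
d = X @ w_true + 0.05 * rng.normal(size=2000)
print(lms(X, d, eta=0.05))             # approaches w_true, more slowly than least squares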
Virtues and Limitations of LMS

• Virtues
  – Simplicity
• Limitations
  – Slow rate of convergence
  – Sensitivity to variations in the eigenstructure of the input
Learning Curve

A learning curve plots the mean-square error against the number of iterations and is used to assess the convergence behavior of the algorithm.
Learning Rate Annealing

• Normal approach: η(n) = η₀ for all n.
• Stochastic approximation: η(n) = c/n, where c is a constant.
  There is a danger of parameter blowup for small n when c is large.
• Search-then-converge schedule: η(n) = η₀ / (1 + n/τ), where η₀ and τ are constants (the schedules are sketched in code below).
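The three schedules written out as functions (the parameter values eta0, c, and tau are illustrative assumptions):

import numpy as np

def eta_constant(n, eta0=0.1):
    return eta0                        # normal approach: fixed learning rate

def eta_stochastic(n, c=1.0):
    return c / n                       # stochastic approximation: c/n (can blow up for small n)

def eta_search_then_converge(n, eta0=0.1, tau=100.0):
    return eta0 / (1.0 + n / tau)      # roughly eta0 early on, decays like tau*eta0/n for n >> tau

for n in (1, 10, 100, 1000):
    print(n, eta_constant(n), eta_stochastic(n), eta_search_then_converge(n))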
Perceptron

• The simplest form of a neural network used for the classification of patterns said to be linearly separable.

(Figure: signal-flow graph of the perceptron — inputs x1, x2, …, xm weighted by w1, w2, …, wm, plus the bias b, are summed to form v, which passes through a hard limiter φ(·) to produce the output y.)

v = Σ_{i=1}^{m} w_i x_i + b

Letting x0 = 1 and b = w0:
v(n) = Σ_{i=0}^{m} w_i(n) x_i(n) = w^T(n) x(n)

• Goal: classify the set {x(1), x(2), …, x(n)} into one of two classes, C1 or C2.
• Decision rule: assign x(i) to class C1 if y = +1 and to class C2 if y = −1.
  w^T x > 0 for every input vector x belonging to class C1
  w^T x ≤ 0 for every input vector x belonging to class C2

Perceptron (Cont.)

Algorithm:
1. w(n+1) = w(n)  if w^T(n)x(n) > 0 and x(n) belongs to class C1
   w(n+1) = w(n)  if w^T(n)x(n) ≤ 0 and x(n) belongs to class C2
2. w(n+1) = w(n) − η(n)x(n)  if w^T(n)x(n) > 0 and x(n) belongs to class C2
   w(n+1) = w(n) + η(n)x(n)  if w^T(n)x(n) ≤ 0 and x(n) belongs to class C1

Let d(n) = +1 if x(n) belongs to class C1, and d(n) = −1 if x(n) belongs to class C2. Then
w(n+1) = w(n) + η[d(n) − y(n)]x(n)   (error-correction learning rule form, sketched in code below)

• A smaller η provides stable weight estimates.
• A larger η provides fast adaptation.
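A minimal training loop using the error-correction form of the update (the toy data set and the fixed eta are assumptions, not from the slides):

import numpy as np

def train_perceptron(X, d, eta=1.0, n_epochs=20):
    # Error-correction rule: w(n+1) = w(n) + eta * [d(n) - y(n)] * x(n),
    # with y(n) = sign(w^T(n) x(n)) and labels d(n) in {+1, -1}.
    # Assumes a bias input x0 = 1 has already been prepended to each pattern.
    w = np.zeros(X.shape[1])
    for _ in range(n_epochs):
        for x_n, d_n in zip(X, d):
            y_n = 1.0 if w @ x_n > 0 else -1.0   # hard limiter
            w = w + eta * (d_n - y_n) * x_n      # no change when y(n) equals d(n)
    return w

# Example on a linearly separable toy problem (x0 = 1 prepended to each row):
X = np.array([[1, 2.0, 1.0], [1, 1.5, 2.0], [1, -1.0, -1.5], [1, -2.0, -0.5]])
d = np.array([1, 1, -1, -1])
w = train_perceptron(X, d)
print(w, np.sign(X @ w))               # predictions match the labels d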
Perceptron (Cont.)

For n = n_max
Perceptron Convergence Algorithm

Perceptron (Cont.)

Two essential points to design a perceptron
Relation Between the Perceptron and Bayes Classifier

• Classical pattern classifier: the Bayes classifier
• Gaussian distribution
Relation Between the Perceptron and Bayes Classifier (Cont.)

• Likelihood function
• Threshold
Relation Between the Perceptron and Bayes Classifier (Cont.)

Key points:
• C denotes the covariance matrix.
• C is a non-diagonal and non-singular matrix (C⁻¹ exists).
Relation Between the Perceptron and Bayes Classifier (Cont.)

• The conditional probability density function is given by the Gaussian distribution.
• The misclassification costs are equal and the cost of correct classification is zero.
• Under these assumptions, the Bayes classifier reduces to a linear classifier.
Relation Between the Perceptron and Bayes Classifier (Cont.)

Perceptron vs. Bayes classifier:

1. Perceptron: operates on patterns that are linearly separable.
   Bayes classifier: operates on Gaussian distributions of the two patterns, which overlap each other.
2. Perceptron: inputs are separable.
   Bayes classifier: inputs are non-separable.
3. Perceptron: assumes no overlap, so it is difficult to minimize the error probability.
   Bayes classifier: minimizes the probability of error, independently of the overlap between the Gaussian distributions.
4. Perceptron: convergence is non-parametric.
   Bayes classifier: parametric.
5. Perceptron: no assumptions involved in the decision.
   Bayes classifier: assumptions are involved in decision making.
6. Perceptron: operates by concentrating on errors.
   Bayes classifier: operates by focusing on the probability density functions.
7. Perceptron: adaptive and simple.
   Bayes classifier: fixed, but can be made adaptive.
8. Perceptron: less computational complexity.
   Bayes classifier: more computational complexity.