On Neural Networks in Identification and Control of Dynamic Systems
Minh Phan
Jer-Nan Juang
David C. Hyland
June 1993
1. Introduction
System identification and control are two related fields that have received
considerable development in the last few decades. System identification deals with the
problem of finding a mathematical description of a physical system from experimental data.
Control theory devises ways to influence the system in a desirable and predictable manner.
Typical control objectives are pointing control, vibration suppression, and tracking control.
System identification provides the necessary mathematical model of a system for a
particular control scheme to be designed. In turn, information gathered during the control
process can be used to evaluate the validity of the assumed model. Existing system
identification and control methods are based on mathematical systems theory, which deals
first with deterministic and then with stochastic systems. For the most part, the systems under
study are idealized. They are linear, time-invariant, and often assumed to be noise-free.
When noise is present, it is assumed to be white, zero-mean, and of known
characteristics. These assumptions are often justified because less idealized assumptions
tend to render the analysis mathematically intractable.
2. The Neural Networks
A neural network is simply a set of interconnected individual units called neurons.
Depending on the connection between the neurons, there are two basic types of networks
known as the multi-layer feedforward networks and the recurrent networks, which will be
described in this section.
2.1 The Neuron. As a basic building block for a neural network, an individual
neuron has a finite number of scalar inputs and one scalar output. Associated with each
input is a scalar weighting value. The input signals are weighted by these values and added
at the summation junction. The combined signal is then passed through an activation
function, producing the output signal. The activation function y(x) can take a variety of
forms; the most common is the sigmoid function, denoted by sigm(x),

    sigm(x) = (1 - e^{-x}) / (1 + e^{-x})                                    (1)
A plot of the sigmoid function is shown in Fig. 1 below. Generally, the activation function
can be any non-decreasing differentiable function that has finite limits at both ends.
Figure 1: The sigmoid function.
Let the r inputs of a neuron be denoted by u_1, u_2, ..., u_r and the output by y.
Let the r weights be denoted by w_1, w_2, ..., w_r. The output of the neuron can then be
expressed mathematically as

    y = sigm( Σ_{i=1}^{r} w_i u_i )                                          (2)
The neuron is shown schematically in Fig. 2 below with the sigmoid function as the
activation function. For simplicity of notation, the network weights for the i-th layer may
sometimes be denoted collectively as W = {w_1, w_2, w_3, ...}.
Figure 2: A single neuron with the sigmoid activation function.
Remark 2.1.1. The activation function is a limiter that bounds the incoming
signal and serves as the non-linear element in a neuron. The activation function given in
Eq. (1) has a linear range about the origin and is bounded between -1 and 1. If the output of
a neuron is to be bounded between -a and +a, taking y(x) = a sigm(x) as its activation
function turns Eq. (2) into

    ȳ = a sigm( Σ_{i=1}^{r} w_i u_i ) = a y                                  (3)
which is the same as the output of a neuron with the original activation function multiplied
by a constant gain a. Therefore, the activation function can be taken to be between -1 and
1 provided an additional factor is inserted after the neuron. In a network, this factor is
absorbed into the weights of the following neurons that directly receive the output of this
neuron as their inputs.
Remark 2.1.2. If the activation function is the linear function y(x) = x, then the
neuron is a linear neuron. The input-output relationship of a linear neuron is then

    y = Σ_{i=1}^{r} w_i u_i                                                  (4)

Equation (4) simply says that the output signal is a weighted (linear) combination of the
input signals, with the weighting coefficients being the weights of the neuron.
2.2 Multi-Layer Feedforward Neural Network. A multi-layer feedforward
neural network consists of an input layer, a number of hidden layers, and an output layer.
In a fully connected feedforward network, every neuron in each layer accepts as its inputs
all signals coming from all neurons in the layer immediately preceding it (see Fig. 3). In a
partially connected network, some of these connections are missing. This is equivalent to
setting the corresponding network weights to zero. Figure 3 shows a typical three-layer
three-input three-output feedforward network with two hidden layers.
Figure 3: A three-layer three-input three-output feedforward neural network.
2.3 Recurrent Network. A feedforward network with time delay feedback
elements is called a recurrent network. The delay elements take the outputs of certain
neurons in the network, delay them for a certain number of time steps, and feed them back as
inputs to other neurons. In other words, in a recurrent network, time-delayed outputs of a
certain number of neurons are inputs to other neurons. A special one-layer network
in which the delayed outputs of the neurons are fed back as inputs to themselves is called a
Hopfield network (see Fig. 4).
Figure 4: A Hopfield network.
Figure 5: A two-layer three-input one-output feedforward network of linear neurons.
The network weights between the individual connections are shown in the figure.
Since the neurons are linear, each neuron is represented by a summation junction, and the
activation function, being linear, is omitted. The output of the network in Fig. 5 is simply a
weighted linear combination of the inputs.
3.2. Feedforward Linear Network and the State Space Model. This
section describes the relationship between the feedforward linear network and the state
space model which is a common form of representing linear systems. The discrete-time
state space model of an n-th order, m-input, q-output system is a set of n simultaneous first
order difference equations of the form
    x(k+1) = A x(k) + B u(k)
    y(k)   = C x(k) + D u(k)                                                 (7)
Assuming zero initial conditions, the input-output relationship of the system can be written as

    y(k) = Σ_{i=0}^{k} Y(i) u(k-i)                                           (8)

where Y(0) = D and Y(k) = C A^{k-1} B, k = 1, 2, ..., are the Markov parameters of the system
described by Eqs. (7), which are also the system pulse response samples. The Markov
parameters are expressed in terms of the system discrete state space matrices A, B, C, D.
Since the state vector is coordinate-dependent, the state space matrices are not unique for a
given system, but the Markov parameters are unique. Let the state vector be transformed by
a coordinate transformation T, z(k) = T x(k); then the relationship between u(k) and y(k) via
the new state vector z(k) can be described by a new state space representation
TAT^{-1}, TB, CT^{-1}, D. The system Markov parameters computed using the new state
space matrices are the same as before, i.e.,

    Y(k) = CT^{-1} (TAT^{-1})^{k-1} TB = C A^{k-1} B,   k = 1, 2, 3, ...
For an asymptotically stable system, the pulse response can be neglected after a finite
number of time steps, say p_s. The input-output description in Eq. (8) can then be approximated
by a finite number of Markov parameters,

    y(k) ≈ Σ_{i=0}^{p_s} Y(i) u(k-i)                                         (11)

where p_s is sufficiently large such that C A^{k-1} B ≈ 0 for k > p_s. Comparing Eq. (11) with the
structure of the linear neurons immediately leads to the following remarks.
structure of the linear neurons immediately leads to the following remarks.
Remark 3.2.1. The elements of the Markov parameters are simply the weights of a
single-layer linear network where inputs to the network include both current and past
values of the input signal. Note that the time delayed inputs do not affect the neuron
configuration because they are feedforward signals and thus can be treated as separate input
channels. This case is shown in Fig. 6.
Figure 6: Representation of linear systems by a feedforward network
with the system Markov parameters as network weights.
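As a concrete illustration of this remark, the short sketch below (hypothetical code, not from the paper; the function names and array conventions are assumptions) computes the system Markov parameters Y(0) = D, Y(k) = C A^{k-1} B from a discrete-time state space model and then uses them as the weights of a single-layer linear feedforward network acting on the current and past inputs.

    import numpy as np

    def markov_parameters(A, B, C, D, p):
        """Return [Y(0), Y(1), ..., Y(p)] = [D, CB, CAB, ..., C A^{p-1} B]."""
        params = [D]
        Ak = np.eye(A.shape[0])
        for _ in range(p):
            params.append(C @ Ak @ B)      # Y(k) = C A^{k-1} B
            Ak = Ak @ A
        return params

    def feedforward_response(params, u):
        """Single-layer linear network: y(k) = sum_{i=0}^{p} Y(i) u(k-i),
        with u given as an m-by-N array (one column per time step)."""
        q, N = params[0].shape[0], u.shape[1]
        y = np.zeros((q, N))
        for k in range(N):
            for i, Yi in enumerate(params):
                if k - i >= 0:
                    y[:, k] += Yi @ u[:, k - i]
        return y

Each entry of the weight list plays the role of one group of input channels of the network in Fig. 6.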
If M is a matrix such that A + MC is deadbeat of order p, i.e.,

    (A + MC)^k ≈ 0,   k ≥ p

then the input-output relationship of the system can be described by the auto-regressive model

    y(k) = α_1 y(k-1) + ... + α_p y(k-p) + β_0 u(k) + β_1 u(k-1) + ... + β_p u(k-p),   k ≥ p   (15)

where the coefficients are given by

    β_0 = D,   β_i = C(A + MC)^{i-1} (B + MD),   α_i = -C(A + MC)^{i-1} M,   i = 1, 2, ..., p   (16)
The matrix M in the above development can be interpreted as an observer gain. The
system considered in Eqs. (7) has an observer of the form

    x̂(k+1) = (A + MC) x̂(k) + (B + MD) u(k) - M y(k)
    ŷ(k)   = C x̂(k) + D u(k)                                                 (17)

Besides the effect of noise, ŷ(k) may differ from y(k) if the actual initial condition x(0) is
not known and a different initial condition is assumed for x̂(0). Defining the state
estimation error e(k) = x(k) - x̂(k), the equation that governs e(k) is

    e(k+1) = (A + MC) e(k)                                                   (18)
For an observable system, a matrix M exists such that the eigenvalues of A + MC may be
placed in any desired (symmetric) configuration. If the matrix M is such that A + MC is
asymptotically stable, then the estimated state x̂(k) tends to the true state x(k) as k tends to
infinity for any initial difference between the assumed observer state and the actual system
state. The matrix M can therefore be interpreted as an observer gain. The parameters
defined as

    Ȳ(k) = C(A + MC)^{k-1} [ B + MD,  -M ] = [ β_k,  α_k ],   k = 1, 2, 3, ...   (19)
are the Markov parameters of an observer system, hence they are referred to as observer
Markov parameters. Like the system Markov parameters, the observer Markov
parameters are also invariant with respect to a coordinate transformation of the state vector.
To see this, again let the state vector be transformed by a coordinate transformation T,
z(k) = T x(k); then the observer is described by a new state space representation
TAT^{-1}, TB, CT^{-1}, D and a new observer gain TM. The observer Markov parameters
computed using these new state space matrices and the new observer gain are the same as
before,
    Ȳ(k) = CT^{-1} (TAT^{-1} + TM CT^{-1})^{k-1} [ TB + TMD,  -TM ]
         = C(A + MC)^{k-1} [ B + MD,  -M ],   k = 1, 2, 3, ...               (20)
Notice that in Eq. (15), the output y(k) is the open-loop response of the system, yet
the coefficients α_k, β_k are related to an observer gain. Consider the special case where M
is a deadbeat observer gain so that all eigenvalues of A + MC are zero; the observer Markov
parameters then become identically zero after a finite number of terms. For lightly damped
structures, this means that the system can be described by a reduced number of observer
Markov parameters Ȳ(k) instead of an otherwise large number of the usual system Markov
parameters Y(k). For this reason, the observer Markov parameters are important in linear
system identification. By examining the structure of Eq. (15), the following remarks
can be made.
Remark 3.3.1. The input-output equation given in Eq. (15) can be represented by
a recurrent network with a single layer of linear neurons. The number of neurons is equal
to the number of outputs of the system. The inputs to the neurons consist of both the
feedforward time-delayed input signals and the feedback time-delayed output signals.
Figure 7 shows the configuration of such a network for a single-output system.
Remark 3.3.2. The recurrent network weights are precisely the elements of the
observer Markov parameters. The relationship between the weights of a recurrent network
and an equivalent feedforward network is the same as that between the observer Markov
parameters and the system Markov parameters. It can be shown that the system Markov
parameters, or the feedforward network weights, are related to the recurrent network weights
by

    Y(0) = β_0,   Y(k) = β_k + Σ_{i=1}^{k} α_i Y(k-i),   k = 1, 2, 3, ...    (21)

where α_k = 0, β_k = 0 for k > p. To describe a system of order n, the number of observer
Markov parameters p must be such that qp ≥ n, where q is the number of outputs.
Furthermore, the maximum order of a system that can be described with p observer
Markov parameters is qp. The implication of this result for the network configuration is
that a recurrent network generally requires fewer parameters (or weights) than an
equivalent feedforward network. The two equivalent networks, however, have the same
number of neurons. The minimum number of recurrent network weight matrices that can
describe the system is p_min, which is the smallest value of p such that q p_min ≥ n.
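The relationship in Eq. (21) can be verified numerically. The sketch below (an illustration under the definitions of Eq. (16); the random test system, the gain M, and all names are assumptions, not code from the paper) computes the observer Markov parameters for a given observer gain and recovers the system Markov parameters from them.

    import numpy as np

    def observer_markov_parameters(A, B, C, D, M, p):
        """alpha_i = -C(A+MC)^{i-1} M,  beta_i = C(A+MC)^{i-1}(B+MD),  beta_0 = D."""
        Abar, Bbar = A + M @ C, B + M @ D
        alphas, betas = [], [D]
        Ak = np.eye(A.shape[0])
        for _ in range(p):
            alphas.append(-C @ Ak @ M)
            betas.append(C @ Ak @ Bbar)
            Ak = Ak @ Abar
        return alphas, betas

    def system_markov_from_observer(alphas, betas, p):
        """Eq. (21): Y(0) = beta_0,  Y(k) = beta_k + sum_{i=1}^{k} alpha_i Y(k-i)."""
        Y = [betas[0]]
        for k in range(1, p + 1):
            Yk = betas[k].copy()
            for i in range(1, k + 1):
                Yk = Yk + alphas[i - 1] @ Y[k - i]
            Y.append(Yk)
        return Y

    # Numerical check against Y(k) = C A^{k-1} B for an arbitrary small system.
    rng = np.random.default_rng(0)
    n, m, q, p = 4, 2, 1, 6
    A = 0.5 * rng.standard_normal((n, n)); B = rng.standard_normal((n, m))
    C = rng.standard_normal((q, n));       D = rng.standard_normal((q, m))
    M = rng.standard_normal((n, q))
    al, be = observer_markov_parameters(A, B, C, D, M, p)
    Y = system_markov_from_observer(al, be, p)
    Ak = np.eye(n)
    for k in range(1, p + 1):
        assert np.allclose(Y[k], C @ Ak @ B)
        Ak = Ak @ A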
Remark 3.3.3. As mentioned previously, to represent lightly damped structures,
the feedforward representation requires a large number of weights. Furthermore, it is not
possible to represent a marginally stable or unstable system by a feedforward network.
However, it is possible to represent such a system by a recurrent network. The implication
of this fact for the system identification problem will be discussed further in later sections.
4. Identification of Linear Systems using Neural Networks
It has been shown that a general network of linear neurons is equivalent to a single
neuron with appropriate weights. The problem of linear system identification using neural
networks is therefore reduced to finding these network weights from input-output data. The
computation may be done off-line or on-line. In off-line computation, the input-output data
is already available and a network representing the system is to be determined. On-line
computation refers to the case where the network weights are continually updated as data is
made available.
4.1. Parallel vs. Series-Parallel Identification Models. From the previous
considerations, it appears that the recurrent network is more advantageous in representing
certain systems than the feedforward network. To identify the recurrent network weights,
one can simply use the feedforward network configuration with actual delayed system
outputs appearing as inputs to the feedforward network. Consider the two identification models
shown in Figs. 8 and 9 below, which are known as the parallel and series-parallel
identification models. The block denoted by D represents the time delay elements.
Figure 9: Identification using series-parallel model.
The basic difference between the two schemes is that in the parallel identification
model, the estimated output ŷ(k) is computed based on the model's own previous (estimated)
values, whereas in the series-parallel model, it is based on actual output values.
Mathematically, in the parallel model, the purpose of the identification is to obtain the
estimates α̂_k, β̂_k of the coefficients α_k, β_k that minimize the estimation error e(k) =
y(k) - ŷ(k), where the estimated output ŷ(k) is computed from

    ŷ(k) = α̂_1 ŷ(k-1) + ... + α̂_p ŷ(k-p) + β̂_0 u(k) + β̂_1 u(k-1) + ... + β̂_p u(k-p)   (22)

In the series-parallel model, the estimated output is computed from the actual output values,

    ŷ(k) = α̂_1 y(k-1) + ... + α̂_p y(k-p) + β̂_0 u(k) + β̂_1 u(k-1) + ... + β̂_p u(k-p)   (23)
The difference between the two above equations is a subtle but important one. As
discussed in the previous section, the estimated output of the model in Eq. (22) is the
estimated open-loop prediction even though the coefficients of the model are related to an
observer. On the other hand, the estimated output of the model in Eq. (23) is that of an
observer. To see this, substituting the expression for ŷ(k) into the estimated state equation in
(17), and noting that ŷ(k) = C x̂(k) + D u(k), one obtains Eq. (23) assuming zero initial
conditions for the observer. Therefore, ŷ(k) in Eq. (23) represents the estimated output
provided by the observer. The estimation error e(k) is the difference between the actual output and the
estimated output provided by the observer. On the other hand, if the actual response y(k) is
replaced by the estimated value ŷ(k) in Eq. (23), then the terms involving the observer gain
M cancel each other identically for any arbitrary initial condition x̂(0),

    x̂(k+1) = (A + MC) x̂(k) + (B + MD) u(k) - M ŷ(k)
            = A x̂(k) + B u(k)
Therefore, there is no longer any observer involved in the equation; x̂(k) now plays the
role of the state vector x(k) as in Eq. (7), and the estimated output ŷ(k) = C x̂(k) + D u(k) is
the same as that produced by the open-loop model, provided that the initial conditions for
x̂(k) and x(k) are identical. The quantity ŷ(k) now represents the predicted output
provided by the open-loop model alone, which is referred to in this paper as the open-loop
prediction. In this case, the error e(k) is the difference between the actual output and the
predicted output provided by the identified open-loop model.
Remark 4.1.1. First recall that the model structure in Eq. (15) subsumes an
observer. If the parallel identification model is used in conjunction with the model structure
of Eq. (15), then the prediction error that drives the parameter estimation scheme is simply
the open-loop prediction error, not the observer (output) estimation error. Consequently,
the observer portion of the model cannot be identified. This fact accounts for the
difficulties encountered in parallel model identification, namely, the conditions for which
the scheme will converge are presently not known.
Remark 4.1.2. In the series-parallel identification model, since the actual instead of
(open-loop) predicted output enters the model, a feedforward network with delayed input
and actual output measurements can be used to identify the system. This consideration
eliminates the use of a recurrent network, which would introduce additional but unnecessary
difficulties to the system identification problem (see Fig. 10). Each output of the system is
represented by a single linear neuron. A multiple-output system is represented by a single
layer of neurons. The identified network can be used either as a feedforward or a recurrent
network. In the former case, the network provides estimation of the response by an
observer. In the latter case, it is an open-loop predictor. Again, this depends on whether
actual or predicted output is used in computing the response.
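A sketch of this dual use of the identified weights is given below (the function and its calling convention are assumptions, not the paper's notation). With the measured past outputs supplied, it returns the observer estimate of the response (series-parallel use); with no measured outputs supplied, it feeds back its own predictions and returns the open-loop prediction (parallel use).

    import numpy as np

    def predict(alphas, betas, u, y=None):
        """alphas: list of q-by-q matrices, betas: list of q-by-m with betas[0] = D.
        u: m-by-N input history.  If y (q-by-N measured outputs) is given, the
        past measured outputs enter the regressor (series-parallel / observer
        estimation); otherwise the model's own past predictions are fed back
        (parallel / open-loop prediction).  Zero initial conditions assumed."""
        p = len(alphas)
        q, N = betas[0].shape[0], u.shape[1]
        yhat = np.zeros((q, N))
        for k in range(N):
            yk = betas[0] @ u[:, k]
            for i in range(1, p + 1):
                if k - i < 0:
                    break
                past = y[:, k - i] if y is not None else yhat[:, k - i]
                yk += alphas[i - 1] @ past + betas[i] @ u[:, k - i]
            yhat[:, k] = yk
        return yhat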
For simplicity, consider the case where the system starts from zero initial conditions.
Equation (15) can be written as
    y(k) = β_0 u(k) + Σ_{i=1}^{p} [ β_i u(k-i) + α_i y(k-i) ]               (26)

where the network weight matrices β_i, α_i are defined in Eq. (16). Writing Eq. (26) in matrix
form for a set of input-output data N+1 samples long yields

    y = Ȳ V                                                                  (27)

where y is the matrix whose columns are the output samples, Ȳ is the matrix of network
weights, and V is the matrix whose columns contain the corresponding current input together
with the p past input and output samples. The least-squares solution for the network weights
is

    Ȳ = y V^+                                                                (31)

where (.)^+ denotes the pseudo-inverse of the quantity in the parentheses. If the initial
conditions are not zero, then a slightly different equation must be used to solve for the
network weights, that is

    Ȳ = y_1 V_1^+                                                            (32)

where y_1 and V_1 are obtained by deleting the first p columns of y and V, respectively.
Remark 4.2.1. The least-squares solution in Eq. (31) or (32) minimizes the error
between the actual output and the estimated output computed using the actual input and
output data, i.e., the least-squares solution minimizes the residual ε = y - ŷ, where ŷ is
computed from ŷ = Ȳ V and Ȳ is given in Eq. (31). If Eq. (32) is used instead, then the
least-squares solution minimizes ε_1 = y_1 - ŷ_1, where ŷ_1 = Ȳ V_1. This computation,
therefore, corresponds to the series-parallel identification scheme that minimizes the
observer estimation error.
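A minimal sketch of this batch computation is given below (the particular stacking order of V and the function name are assumptions; the explicit data matrices of the paper are not reproduced). Each column of V holds the current input and the p past input-output pairs, and the weights follow from the pseudo-inverse of Eq. (31).

    import numpy as np

    def identify_weights(u, y, p):
        """u: m-by-(N+1) inputs, y: q-by-(N+1) outputs, zero initial conditions.
        Returns Ybar = [beta_0, beta_1, alpha_1, ..., beta_p, alpha_p] for the
        column ordering [u(k); u(k-1); y(k-1); ...; u(k-p); y(k-p)]."""
        m, N1 = u.shape
        q = y.shape[0]
        V = np.zeros((m + p * (m + q), N1))
        V[:m, :] = u
        for i in range(1, p + 1):
            rows = slice(m + (i - 1) * (m + q), m + i * (m + q))
            V[rows, i:] = np.vstack([u, y])[:, :N1 - i]   # u(k-i) and y(k-i)
        return y @ np.linalg.pinv(V)                      # Eq. (31): Ybar = y V^+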
Remark 4.2.2. Ideally, the error between the actual output and the predicted output
provided by the identified open-loop model is the proper error to be minimized for the
identification of the system open-loop model. The above computation minimizes the
observer estimation error instead. For a linear system, it turns out that in the absence of
noises the open-loop system can be identified exactly by minimizing the observer
estimation error. In the presence of noises, however, minimizing the observer estimation
error does not necessarily imply that the open-loop prediction error is minimized.
Therefore, it is possible that the observer model fits the data well but the open-loop model
does not. Fortunately, if the order of the regression equation is chosen to be sufficiently
large then simultaneous observer and system identification will still be achieved in the limit
as the data record tends to infinity and the noises are white, Gaussian, and zero-mean (see
Ref. 9).
Remark 4.2.3. The least-squares solution in Eq. (31) can also be obtained by an on-
line parameter estimation scheme in which the network weights are updated recursively as
each new column of V and the corresponding output sample become available.
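A sketch of such an on-line scheme is the standard recursive least-squares update shown below (the class and parameter names are assumptions; the paper's own update equations are not reproduced here). Each call processes one regression vector v(k), containing the current input and the past input-output samples, together with the measured output y(k).

    import numpy as np

    class RecursiveLeastSquares:
        def __init__(self, n_weights, n_outputs, r0=1.0e4):
            self.W = np.zeros((n_outputs, n_weights))   # arbitrary initial weight guess
            self.R = r0 * np.eye(n_weights)             # any symmetric positive definite matrix

        def update(self, v, y_meas):
            v = v.reshape(-1, 1)
            y_meas = y_meas.reshape(-1, 1)
            Rv = self.R @ v
            gain = Rv / (1.0 + float(v.T @ Rv))         # scalar normalization
            err = y_meas - self.W @ v                   # a priori prediction error
            self.W = self.W + err @ gain.T              # weight update
            self.R = self.R - gain @ Rv.T               # information update
            return err.ravel()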
5.1. One-Step Ahead Predictor. To express the observer explicitly as a
one-step ahead predictor, one simply writes the observer equations as

    x̂(k+1) = (A + MC) x̂(k) + (B + MD) u(k) - M y(k)
    ŷ(k+1) = C x̂(k+1) + D u(k+1)                                             (37)

As a predictor, the quantity of interest is ŷ(k+1). One can therefore bypass the state
equation by writing

    ŷ(k+1) = Σ_{i=1}^{p} C(A + MC)^{i-1} [ (B + MD) u(k+1-i) - M y(k+1-i) ] + D u(k+1)   (38)
The following remarks can be made regarding the forms of Eq. (37) and Eq. (38).
Remark 5.1.1. In theory, if the state space model (A, B, C, D) is known exactly,
then one can design an observer gain M such that A + MC is asymptotically stable. To use
Eq. (38) as an output predictor, one needs to include a sufficient number of terms such that
(A + MC)^i is negligible for i ≥ p. The state space representation in Eq. (37) is a better
choice since it involves no such approximation. The above comment no longer holds true
if M is such that A + MC is deadbeat, i.e., (A + MC)^i ≈ 0, i ≥ p, since the approximation
becomes exact in this case.
Remark 5.1.2. In practice, the system model cannot be known exactly. To
identify the system from input-output data using the series-parallel structure, one in fact
computes directly the coefficients in Eq. (38) rather than the state space matrices. To obtain
a minimal order state space representation from these coefficients, realization is required.
As an output predictor, therefore, Eq. (38) should be used directly, because conversion to a
state space representation is not necessary for this purpose.
Remark 5.1.3. Equation (38) clearly indicates that the one-step ahead predictor
takes the form of a single layer network of linear neurons with actual input and output
signals entering the network, and the output of the network represents the one-step ahead
prediction. Schematically, this is the same as shown in Fig. 10.
5.2. A Two-step Ahead Predictor. This section derives the equations for a
two-step ahead predictor for linear systems, and shows that it also has a linear neural
network form. First, from Eq. (7), one can write

    x(k+2) = A x(k+1) + B u(k+1)
           = A² x(k) + A B u(k) + B u(k+1)                                   (39)
    y(k+2) = C x(k+2) + D u(k+2)

Adding and subtracting the term G y(k) on the right hand side of the state equation yields
then the relationship between the input and output of the system can be described as a linear
combination of input-output data. This description holds for k ≥ 2p - 2, so that at sufficiently
large time steps the terms involving the states x(0) and x(1) vanish, i.e., C(A² + GC)^i x(0) = 0
and C(A² + GC)^i x(1) = 0 for i ≥ p, due to the imposed deadbeat condition on A² + GC.
Furthermore, only a finite number of coefficients make up this linear combination; these are
the predictor Markov parameters shown in Eq. (43). Existence of a matrix G such that
(A² + GC)^k ≈ 0, k ≥ p, is assured if the pair (A², C) is observable.
Remark 5.2.1. The above derivation justifies the form of a two-step ahead
predictor. In fact, one can identify the coefficients of this predictor from input-output data
by minimizing the two-step prediction error. The procedure is similar to that discussed in
Section 4.
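A sketch of such a two-step identification is given below (the regressor layout is an assumed over-parameterization that contains the two-step predictor structure; all names are hypothetical). The output y(k) is regressed on u(k), u(k-1), and p past input-output pairs ending at time k-2, and the coefficients are found by ordinary least squares on the two-step prediction error.

    import numpy as np

    def identify_two_step_predictor(u, y, p):
        """u: m-by-N inputs, y: q-by-N outputs.  Returns the weight matrix of a
        two-step ahead predictor that maps [u(k), u(k-1), u(k-2), y(k-2), ...,
        u(k-1-p), y(k-1-p)] to y(k)."""
        m, N = u.shape
        regressors, targets = [], []
        for k in range(p + 2, N):
            reg = [u[:, k], u[:, k - 1]]                   # inputs u(k), u(k-1)
            for i in range(p):
                reg += [u[:, k - 2 - i], y[:, k - 2 - i]]  # data up to time k-2
            regressors.append(np.concatenate(reg))
            targets.append(y[:, k])
        Phi, Yt = np.array(regressors), np.array(targets)
        W, *_ = np.linalg.lstsq(Phi, Yt, rcond=None)       # minimizes two-step error
        return W.T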
Remark 5.2.2. To obtain the two-step ahead prediction, one can also propagate the
observer, which is a one-step ahead predictor, over two successive time steps by treating the
estimated output from the first time step as the actual output for the second time step.
However, such a procedure would amount to performing open-loop prediction in the
second time step and is therefore sensitive to noise. On the other hand, if one uses the
predictor form with the coefficients directly identified from input-output data, then only
actual data enter the computation, thus minimizing the errors due to noise.
Remark 5.2.3. Again, the predictor form can be represented by a single layer of
linear neurons, and the weights of this network are simply the elements of the predictor
Markov parameters shown in Eq. (43). Results presented in this section can be easily
generalized to a general multi-step predictor. The relationship between such predictors and
the deadbeat control problem will be discussed in a later reference.
6. Control of Linear Systems using Neural Networks
As formulated in previous sections, linear systems can be represented by a single-
layer network of linear neurons. The weights of this network can be identified from input-
output data. Once identified, the network can be used as a one-step ahead predictor. This
section discusses the use of such a network directly for control applications without requiring
the state space model to be extracted from these weights.
6.1. A One-Step Ahead Controller. First, consider the case where the
linear system can be expressed in the form

    y(k+1) = α_1 y(k) + ... + α_p y(k+1-p) + β_0 u(k+1) + β_1 u(k) + ... + β_p u(k+1-p)   (44)

where the coefficients are assumed to be known. Let the desired response be denoted by
r(k). To obtain a controller directly from the above equation, one simply replaces y(k+1)
by its desired value r(k+1) and then solves for the control input u(k+1) to obtain the control
law,

    u(k+1) = β_0^{-1} [ r(k+1) - α_1 y(k) - ... - α_p y(k+1-p) - β_1 u(k) - ... - β_p u(k+1-p) ]   (45)
If the coefficients in Eq. (44) are not known exactly, then Eq. (46) represents a one-step
ahead estimate of what the system will produce based on current and past input-output data.
Define the tracking error to be the difference between the actual response and the desired
response, ε(k) = y(k) - r(k). Equation (50) then reveals that

    ε(k+1) = e(k+1)                                                          (51)
where e(k+1) is the one-step ahead prediction error. Therefore, if the predictor is such that its
prediction error vanishes in the limit, then the tracking error also vanishes in the limit.
Remark 6.1.1. The one-step ahead control law has the property that the tracking
error is the same as the prediction error. The above analysis shows that the accuracy of the
predictor model governs the accuracy of the tracking response. As long as the predictor
can perform a reasonably good one-step ahead prediction of the system response, the
control input can be computed to make the system track a desired trajectory. In the ideal
case where the system is linear and the data is noise-free, the prediction error and the
tracking error will be zero identically. Non-zero prediction and tracking error can only
come about during adaptation or when noises are present. This is different from the non-
linear case where both the estimation and tracking error are non-zero even when there are
no noise in the system. An important restriction of the one-step ahead controller is that the
open-loop system is required to be stably invertible (i.e., there are no unstable zeros in the
linear case). If this condition is not met, it is possible for the controller to produce an
unbounded input while maintaining zero tracking error.
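A sketch of this computation is given below (names are assumptions; it presumes that the direct-transmission weight β_0 = D is square and invertible, i.e., that the system has as many inputs as outputs). The predictor of Eq. (44) is solved for u(k+1) after the predicted output is replaced by the desired value r(k+1).

    import numpy as np

    def one_step_ahead_control(alphas, betas, u_hist, y_hist, r_next):
        """u_hist[:, -1] = u(k), y_hist[:, -1] = y(k).  Returns u(k+1) such that
        the one-step ahead predicted output equals r(k+1)."""
        p = len(alphas)
        rhs = np.array(r_next, dtype=float)
        for i in range(1, p + 1):
            rhs -= alphas[i - 1] @ y_hist[:, -i]    # alpha_i y(k+1-i)
            rhs -= betas[i] @ u_hist[:, -i]         # beta_i  u(k+1-i)
        return np.linalg.solve(betas[0], rhs)       # u(k+1) = beta_0^{-1} (...)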
Remark 6.1.2. To obtain the result in Eq. (51), the controller coefficients must be
the same as those of the predictor model. In the event that the coefficients of the predictor
model are updated at each time step, the controller coefficients must also be updated to match
those of the predictor model.
Remark 6.1.3. The above controller can be implemented in neural network form.
Such a controller simply copies the weights of the feedforward predictor network to
generate the control input. This is shown schematically in Fig. 11 below.
Figure 11: The controller network copies the weights of the feedforward predictor network to generate the control input.
Remark 6.1.4. Since the controller attempts to make the system track the desired
trajectory in one step, excessive control efforts are usually required. This makes the
approach unattractive in practice. To alleviate this problem, the weighted one-step ahead
controller is used, such that at each time step the control input minimizes the following
quadratic cost function

    J(k+1) = (1/2) ε(k+1)^T Q ε(k+1) + (1/2) u(k+1)^T S u(k+1)               (55)

where the tracking error ε(k+1) = y(k+1) - r(k+1). The weighting matrices Q and S are
required to be symmetric and positive definite. Substituting the expressions for ε(k+1) and
y(k+1) into the cost function, differentiating with respect to u(k+1), setting the result to zero,
and solving for the control input yields the weighted one-step ahead controller of the
adaptive control literature.^6
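Carrying out the minimization of Eq. (55) with the predictor written as ŷ(k+1) = β_0 u(k+1) + c(k), where c(k) collects all terms known at time k, gives u(k+1) = (β_0^T Q β_0 + S)^{-1} β_0^T Q [r(k+1) - c(k)]. The sketch below is a direct transcription of this formula (the symbol c(k) and the function name are assumptions, not the paper's notation).

    import numpy as np

    def weighted_one_step_control(beta0, c_known, r_next, Q, S):
        """Minimizes J(k+1) = 0.5 eps^T Q eps + 0.5 u^T S u with
        eps = beta0 u + c_known - r_next."""
        lhs = beta0.T @ Q @ beta0 + S
        rhs = beta0.T @ Q @ (r_next - c_known)
        return np.linalg.solve(lhs, rhs)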
6.2. Model Reference Controller. A different way to avoid the requirement
that the system track a desired trajectory in one step is to use a control scheme known as
model reference control. Let the control law in Eq. (47) be modified as in Eq. (58).
Substituting Eq. (48) and Eq. (58) into Eq. (46) shows that the system response y(k) no
longer follows the reference input r(k) directly, as is the case in Eq. (49). Its behavior can
be conveniently interpreted in terms of a reference model. Define y_m(k) as the response of
the reference model, Eq. (61), when driven by the reference input r(k),
and the tracking error ε_m(k) as the difference between the system response y(k) and the
reference model response y_m(k). The equation that governs the behavior of this tracking
error, Eq. (63), is obtained by subtracting Eq. (61) from Eq. (60). The requirement that the
homogeneous part of this error equation be asymptotically stable is easily satisfied, since the
coefficients γ_1, γ_2, ..., γ_p are the design variables to be selected a priori.
Remark 6.2.1. The difference between this case and the previous case is that the
desired trajectory is not specified by the reference input r(k), but rather by the response of
the reference model. Since the reference model is known, the reference input r(k) that is
needed to make the reference model produce the desired response can easily be computed.
The purpose of introducing the reference model is to slow down the convergence of the
tracking error so that excessive correction during the adaptation process does not occur.
Remark 6.2.2. The model reference control scheme can also be implemented in
neural network form. At any time step, the controller network copies the coefficients of the
predictor network and uses them in the generation of the control input. The configuration
for this control scheme is shown in Fig. 12.
Remark 6.2.3. Equation (63) shows that the prediction error acts as a driving term
for the difference equation that governs the behavior of the tracking error. If the reference
model coefficients are designed such that the homogeneous solution is asymptotically stable
then the steady state tracking error is simply the particular solution of the difference
equation. One thus has the ability to affect the steady state tracking error through the
reference model coefficients. However, this freedom is constrained by the residual
dynamics of the prediction error, through which the steady state tracking error may be
amplified or reduced. Generally speaking, the natural frequencies of the reference model
should be placed away from those dominating the residual dynamics.
Remark 6.2.4. If the coefficients of the predictor model are updated at each time
step, then the controller coefficients must match those of the predictor model at each time
step. The resulting integration between parameter estimation and control computation is
known as model reference adaptive control. The adaptive scheme is summarized in the
following equations where the ordinary least-squares algorithm is used to perform the
parameter estimation step. Again, let Y ( k )denote the estimated coefficients of the predictor
model at time step k ,
Figure 12: Model reference adaptive control configuration; the controller network copies the weights of the predictor network, and the reference model produces y_m(k).
The estimation is started with Ȳ(0) as an arbitrary initial guess. The control input is then
computed from the model reference control law, where the reference model coefficients
γ_1, γ_2, ..., γ_p are time-invariant and chosen a priori. The control input is applied to the
system, producing the response y(k+1). The predictor coefficients are then updated according
to the least-squares update rule, starting with R(0) as any symmetric positive definite matrix.
The newly estimated parameters are then used to compute the control input for the next time
step, u(k+2).
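A sketch of one cycle of this adaptive loop is given below (all names, the regressor convention, and the reference-model form y_m(k+1) = γ_1 y_m(k) + ... + γ_p y_m(k+1-p) + r(k+1) are assumptions consistent with the remarks above, not the paper's exact equations). The predictor weights are refreshed by a recursive least-squares step, and the control input is then computed so that the predicted response follows the reference model rather than r(k+1) directly.

    import numpy as np

    def rls_update(W, R, v, y_meas):
        """Recursive least-squares update of the predictor weights W with
        regression vector v and newly measured output y_meas."""
        v = v.reshape(-1, 1)
        Rv = R @ v
        gain = Rv / (1.0 + float(v.T @ Rv))
        W_new = W + (y_meas.reshape(-1, 1) - W @ v) @ gain.T
        return W_new, R - gain @ Rv.T

    def model_reference_control(beta0, c_known, gammas, y_hist, r_next):
        """Solve the current predictor, yhat(k+1) = beta0 u(k+1) + c_known, for
        u(k+1) so that yhat(k+1) matches gamma_1 y(k) + ... + gamma_p y(k+1-p)
        + r(k+1); scalar reference-model coefficients gammas are assumed."""
        target = np.array(r_next, dtype=float)
        for i, g in enumerate(gammas, start=1):
            target += g * y_hist[:, -i]             # gamma_i y(k+1-i)
        return np.linalg.solve(beta0, target - c_known)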
Remark 6.2.5. The control schemes discussed in this section deal with a one-step
ahead predictor model of the form shown in Eq. (38). The previous section shows that a
two-step ahead predictor or a multi-step ahead predictor has the same linear form.
Therefore, the results presented in this section can be easily extended to these predictors.
For example, the two-step ahead controller will compute the control u(k+2) requiring
measurements up to y(k) only.
where f(.) is some non-linear function of past input and output data. First, note that for the
response of the system to follow that of a reference model, at time step k+1 one wishes to
determine the control input u(k+1) such that Eq. (69) is satisfied. Since the relationship
between y(k+1) and u(k+1) is non-linear and is not known, one cannot solve for u(k+1)
directly. However, if the non-linear system is such that there exists a predictor of the form
given in Eq. (44) such that ŷ(k+1) = y(k+1), then the following equation is satisfied,

    y(k+1) = f( y(k), y(k-1), ..., u(k+1), u(k), u(k-1), ... )
           = N( y(k), y(k-1), ..., u(k+1), u(k), u(k-1), ... )               (74)
When a non-linear network with a sufficiently large number of hidden layers is used, it
may also qualify as an open-loop model of the non-linear system, besides being a one-step
ahead predictor. This is the fundamental difference between identification using a linear
network versus a non-linear network. Generally speaking, the theoretical advantage of
using a non-linear network for non-linear system identification is offset by the difficulties
in finding such a network in practice. Neither the number of hidden layers nor the number
of neurons in each layer is known a priori. For a chosen network configuration, the back-
propagation algorithm is often used to determine the network weights. Typically, the
convergence rate is slow and a large amount of data is needed. The back-propagation
algorithm is well known and discussed extensively in the literature.
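A minimal sketch of such a network and its training is given below (a generic one-hidden-layer illustration with assumed layer sizes, learning rate, and names; it is not the network used in the paper). It uses the sigmoid of Eq. (1) and plain gradient-descent back-propagation to fit a one-step ahead predictor whose regressors are past inputs and outputs.

    import numpy as np

    def sigm(x):
        """Sigmoid of Eq. (1); note sigm(x) = tanh(x/2)."""
        return (1.0 - np.exp(-x)) / (1.0 + np.exp(-x))

    def train_predictor(V, Y, n_hidden=20, lr=0.01, epochs=2000, seed=0):
        """V: regressors, one row per sample; Y: target outputs, one row per sample."""
        rng = np.random.default_rng(seed)
        W1 = 0.1 * rng.standard_normal((V.shape[1], n_hidden))   # input-to-hidden weights
        W2 = 0.1 * rng.standard_normal((n_hidden, Y.shape[1]))   # hidden-to-output weights
        for _ in range(epochs):
            H = sigm(V @ W1)                        # hidden-layer outputs
            E = H @ W2 - Y                          # prediction error
            dW2 = H.T @ E / len(V)
            dH = (E @ W2.T) * 0.5 * (1.0 - H ** 2)  # sigm'(x) = (1 - sigm(x)^2) / 2
            dW1 = V.T @ dH / len(V)
            W1 -= lr * dW1
            W2 -= lr * dW2
        return W1, W2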
In the model reference control problem, the theoretical advantage of a non-linear
network is somewhat diminished because the open-loop model need not be found for the
purpose of tracking control. The model reference control scheme can accommodate a non-
linear network rather easily. Assume that the network representing the non-linear system
can be expressed in a form in which the u(k+1) term is separated from the remaining
non-linear portion N_1(.), where e_1(k+1) denotes the fitting error introduced by this
separation. The control input is computed from the corresponding control law,
where γ_1, γ_2, ..., γ_p are the coefficients of the reference model representing the desired
response. The control input, when applied to the system, yields the closed-loop response.
The tracking error ε_m(k) = y(k) - y_m(k), where y_m(k) is the response of the reference
model, is governed by a difference equation driven by the fitting error e_1(k+1). In
practice, one identifies an approximation of N_1(.), denoted by N̂_1(.). The control law is
then based on N̂_1(.). Let e_2(k+1) denote the approximation error, e_2(k+1) = N_1(.) - N̂_1(.).
The tracking error is then governed by a difference equation driven by both e_1(k+1) and
e_2(k+1).
    C = [ 1.0  -0.5  0.0  1.0  0.5  0.0 ],   D = 1.5
The system is excited by the random input shown in Fig. 13, producing the response shown in
Fig. 14.
Figure 13: Excitation input time history. Figure 14: System response time history.
Using the above time histories, the network weights can be identified using Eq. (31).
First, consider the case where p = 6, for which the network weight values of Eqs. (83) are
obtained.
The above results are checked against the data by performing an open-loop prediction of the
response using the input alone, and a one-step ahead prediction (or observer estimation)
using both the actual input and output data.
It can be verified that in both cases the predicted responses match the actual data exactly.
Again, it should be emphasized that the result shown in Eqs. (83) represents a set of
weights that can be identified from any feedforward network that uses 6 past values of
input and output data to predict the current response. Specifically, if one uses a network
consisting of a single neuron, then the values listed in Eqs. (83) are precisely the weights
of this neuron. On the other hand, if a feedforward network consisting of several layers of
linear neurons is used to identify the system, then the values in Eqs. (83) are the weights of
a single neuron representation that is mathematically equivalent to the multi-layer network.
The system in Eqs. (82) in fact contains one uncontrollable mode, as revealed by the
singular values of the controllability matrix [A^5 B, A^4 B, ..., A B, B].
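A sketch of this check is given below (generic code with assumed names; the system matrices of Eqs. (82) are not repeated here). A zero, or numerically negligible, singular value of the controllability matrix indicates an uncontrollable mode.

    import numpy as np

    def controllability_singular_values(A, B):
        """Singular values of [A^{n-1} B, ..., A B, B] for an n-th order system."""
        n = A.shape[0]
        blocks, Ak = [], np.eye(n)
        for _ in range(n):
            blocks.append(Ak @ B)
            Ak = Ak @ A
        ctrb = np.hstack(blocks[::-1])
        return np.linalg.svd(ctrb, compute_uv=False)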
The model in Eq. (84) is therefore an over-parameterized model. The same system can be
modeled by using data from only 4 past time steps to predict the current response, i.e., p = 4.
Note that the over-parameterization in Eq. (84) takes the form of using more distant past
input and output data to predict the current response, corresponding to the case of a neuron
having additional input channels. This is in contrast to the case where over-parameterization
is in the form of having additional neurons added to the network.
8.2. Model Reference Adaptive Control of a Linear System. Next, we
consider the application of model reference adaptive control to the above system. The
goal is to have the system track a desired trajectory prescribed via the reference model,
where r(k) = sin(k/2π). First, consider the ideal case where disturbances and noise are
not present. Since the system has a single output, the predictor network consists of only
one linear neuron. In this example, 6 past input and output values are used to predict the
current response. Recall that this is a case of over-parameterization, since the effective order
of the system is only 4. The system is assumed to be unknown to the controller at the
beginning, and the weights are initially set to zero. Simultaneous prediction and control is
carried out, producing the results shown in Figs. 15a-d below. Figure 15a shows that the
system response (dashed curve) quickly tracks the desired response (solid curve). The
time histories of the prediction error and of the tracking error during the process are shown
in Figs. 15b and 15c, respectively. The control input time history is shown in Fig. 15d,
revealing that the adaptive mechanism quickly produces the necessary control input to make
the system track the desired response.
Figure 15a: Tracking response. Figure 15b: Prediction error.
Figure 15c: Tracking error. Figure 15d: Control input.
Figure 16a: Tracking error. Figure 16b: Control input.
8.3. Identification and Prediction of a Non-Linear System. While it is
not possible for a linear model to reproduce the open-loop response of a non-linear
system, it is possible for a linear predictor to reasonably predict the non-linear
response. The predictor model uses actual input and output data to compute the
predicted response. Consider the system whose state space matrices are shown previously,
but whose input and output are related by a non-linear relationship.
The model reference adaptive control scheme is next applied to this non-linear system.
Figures 18a-d show the tracking response, the prediction error, the tracking
error, and the control input time histories, respectively. Recall that the control method does
not require that the open-loop model be identified, but rather a predictor model that can
reasonably predict the response, which is the case illustrated in the previous example.
Figure 18a: Tracking response (non-linear system). Figure 18b: Prediction error.
Figure 18c: Tracking error. Figure 18d: Control input.
When disturbance and noise are added to the system, the resulting behavior of the
system is shown in Figs. 19a-d. Again, this reveals a certain degree of stability robustness
of the adaptive scheme to disturbances and noise, which is due to the inherent ability of the
linear predictor to predict the non-linear response.
Figure 19a: Tracking response with disturbance and noise present (non-linear system). Figure 19b: Prediction error.
Figure 19c: Tracking error. Figure 19d: Control input.
This case requires additional theoretical treatment beyond that presented in this paper. Finally,
the paper has been concerned mostly with stability rather than performance robustness issues.
Further work is required to assess this aspect of the problem.
9. References
1 Narendra, K.S. and Parthasarathy, K., "Identification and Control of Dynamical
Systems Using Neural Networks," IEEE Transactions on Neural Networks, Vol. 1,
No. 1, March 1990.
2 Hornik, K., Stinchcombe, M., and White, H., "Multilayer Feedforward Neural
Networks Are Universal Approximators," Neural Networks, Vol. 2, No. 5, 1989.
3 Billings, S.A. and Leontaritis, I.J., "Input-Output Parametric Models for Non-Linear
Systems. Part 1: Deterministic Non-Linear Systems; Part 2: Stochastic Non-Linear
Systems," International Journal of Control, Vol. 41, 1985.
4 Chen, S., Billings, S.A., and Grant, P.M., "Non-Linear System Identification Using Neural
Networks," International Journal of Control, Vol. 51, No. 6, 1990.
5 Hyland, D.C., "Neural Network Architectures for On-Line System Identification and
Adaptively Optimized Control," Proceedings of the IEEE Conference on Decision
and Control, Brighton, U.K., December 1991.
6 Goodwin, G.C. and Sin, K.S., Adaptive Filtering, Prediction, and Control, Prentice
Hall, Englewood Cliffs, New Jersey, 1984.
7 Ljung, L. and Söderström, T., Theory and Practice of Recursive Identification, The
MIT Press, Cambridge, Massachusetts, 1983.
8 Chen, C.-W., Huang, J.-K., Phan, M. and Juang, J.-N., "Integrated System
Identification and Modal State Estimation for Control of Large Flexible Space
Structures," Journal of Guidance, Control, and Dynamics, Vol. 15, No. 1, pp. 88-95,
January-February 1992.
9 Juang, J.-N., Phan, M., Horta, L.G., and Longman, R.W., "Identification of
Observer/Kalman Filter Markov Parameters: Theory and Experiments," Proceedings of
the AIAA Guidance, Navigation, and Control Conference, New Orleans, Louisiana,
August 1991; accepted for publication in the Journal of Guidance, Control, and
Dynamics.
10 Phan, M., Horta, L.G., Juang, J.-N., and Longman, R.W., "Linear System
Identification Via An Asymptotically Stable Observer," Proceedings of the AIAA
Guidance, Navigation, and Control Conference, New Orleans, Louisiana, August
1991; also, accepted for publication in the Journal of Optimization Theory and
Applications.
The paper presents a discussion on the applicability of neural networks in the identification and control
of dynamic systems. Emphasis is placed on the understanding of how the neural networks handle linear
systems and how the new approach is related to conventional system identification and control methods.
Extensions of the approach to non-linear systems are then made. The paper explains the fundamental
concepts of neural networks in their simplest terms. Among the topics discussed are feedforward and
recurrent networks in relation to the standard state-space and observer models, linear and non-linear
auto-regressive models, linear predictors, one-step ahead control, and model reference adaptive control
for linear and non-linear systems. Numerical examples are presented to illustrate the application of these
important concepts.