Models of Linear Systems
Models of Linear Systems
The nonlinear system models that are the topic of this book are all generalizations, in
one way or another, of linear models. Thus, to understand nonlinear models, it is first
necessary to appreciate the structure and behavior of the linear system models from which
they evolved. Consequently, this chapter will review a variety of mathematical models
used for linear systems.
The fundamental properties of linear systems will be defined first and then the most
commonly used models for linear systems will be introduced. Methods for identifying
models of linear systems will be presented in Chapter 5.
Linear systems must obey both the principles of proportionality (1.5) and superposition
(1.6). Thus, if N is a linear system, then
39
40 MODELS OF LINEAR SYSTEMS
Input
6
0
−1 3
X
0 0
−1 3 −1 3
6 6
X
0 0
−1 3 −1 3
6 6
X
0 0
−1 3 −1 3
Time Time
Figure 3.1 Expansion of a signal onto a basis of delayed pulses. The input signal (top) is projected
onto a series of delayed pulses (left), resulting in the decomposition shown on the right.
NONPARAMETRIC MODELS 41
Amplitude
0 0
−1 3 −1 3
(C) 6 (D) 3
Amplitude
0 0
−1 3 −1 3
(E) 6 (F) 3
Amplitude
0 0
−1 3 −1 3
(G) 6 (H) 3
Amplitude
0 0
−1 3 −1 3
Time Time
Figure 3.2 Linear systems and superposition. (A, C, E) Scaled and delayed unit pulse inputs.
(B, D, F) Responses of a linear, time-invariant system to the scaled pulses. (G) The sum of the
three scaled, delayed pulses. (H) The system’s response to the input G. By superposition, this is
the sum of the responses to the individual pulses.
as illustrated in Figure 3.2A. Let the response of the linear system, N, to a single pulse be
y(t) = N(u(t))
∞
= uk N(d(t − kt , t ))
k=−∞
∞
= uk h(t − kt , t ) (3.4)
k=−∞
The relationships among these signals are illustrated in Figure 3.2. Thus, if the input
can be represented as the weighted sum of pulses, then the output can be written as the
equivalently weighted sum of the pulse responses.
Now, consider what happens in the limit, as t → 0, and the unit-area pulses become
impulses,
The pulse response becomes the impulse response, h(t), and the sum of equation (3.4)
becomes the convolution integral,
∞
y(t) = h(τ )u(t − τ ) dτ (3.6)
−∞
Thus, the response of the system to an arbitrary input can be determined by convolving it
with the system’s impulse response. Hence, the impulse response function (IRF) provides
a complete model of a system’s dynamic response in the time domain.∗
Theoretically, the limits of the integration in (3.6) extend from τ = −∞ to τ = ∞.
However, in practice, the impulse response will usually be of finite duration so that
The value of T1 , the lower integration limit in the convolution, determines whether
the system is causal or noncausal. If T1 ≥ 0, the IRF starts at the same time or after
the impulse. In contrast, if T1 < 0, the IRF starts before the impulse is applied and the
system is noncausal. Any physically realizable system must be causal but, as discussed
in Section 1.2.2, there are important, practical situations where noncausal responses are
observed. These include behavioral experiments with predictable inputs, signals measured
from inside feedback loops, and situations where the roles of the system input and output
are deliberately reversed for analysis purposes.
The value of T2 , the upper integration limit in the convolution (3.6), determines the
memory length of the system. This defines how long the response to a single impulse
∗ The convolution operation, (3.6), is often abbreviated using an centered asterisk—that is, y(t) = h(τ ) ∗ u(t).
NONPARAMETRIC MODELS 43
lasts or, conversely, how long a “history” must be considered when computing the cur-
rent output.
Thus, for a finite memory, causal system, the convolution integral simplifies to
T
y(t) = h(τ )u(t − τ ) dτ (3.8)
0
If the sampling rate is adequate, the convolution integral can be converted to discrete
time using rectangular integration, to give the sum:
−1
T
y(t) = h(τ )u(t − τ )t (3.9)
τ =0
where t is the sampling interval. Note that in this formulation the time variable, t,
and time lag, τ , are discrete variables that are integer multiples of the sampling interval.
Notice that although the upper limit of the summation is T − 1, the memory length is
described as “T ,” the number of samples between 0 and T − 1 inclusively.
In practice, the impulse response is often scaled to incorporate the effect of the sam-
pling increment, t . Thus, equation (3.9) is often written as
−1
T
y(t) = g(τ )u(t − τ )
τ =0
0.1
Complicance (Nm/rad)
0
−0.1
−0.2
−0.3
0 0.05 0.1
Lag (s)
Figure 3.3 Impulse response of the dynamic compliance of the human ankle.
guarantee that the steady-state response to a sinusoidal input will be a sinusoid at the
same frequency, but with a different amplitude and phase. Thus, the steady-state response
of a linear system may be fully characterized in terms of how the amplitude and phase of
its sinusoidal response change with the input frequency. Normally, these are expressed as
the complex-valued frequency response of the system, H (j ω). The magnitude, |H (j ω)|,
describes how the amplitude of the input is scaled, whereas the argument (a.k.a. angle or
phase), φ(H (j ω)), defines the phase shift. Thus, the output generated by the sinusoidal
input, u(t) = sin(ωt), is given by
expands time domain signals onto an infinite basis of sinusoids. It provides a convenient
tool for using the frequency response to predict the response to an arbitrary input, u(t).
Thus, the Fourier transform of the input, U (j ω), is computed and then multiplied by the
frequency response, H (j ω), to give the Fourier transform of the output,
Y (j ω) = H (j ω)U (j ω) (3.12)
Taking the inverse Fourier transform of Y (j ω) gives the time domain response, y(t).
Methods for estimating the frequency response will be discussed in Section 5.3. The
nonlinear generalization of the frequency response, along with methods for its identifi-
cation, will be presented in Section 6.1.3.
(A) −40
Gain (dB)
−60
−80
100 101 102 103
(B) 180
Phase (degrees)
90
0
100 101 102 103
Frequency (Hz)
Figure 3.4 Frequency response of the dynamic compliance of the human ankle. (A) Transfer func-
tion magnitude |H (j ω)|. (B) Transfer function phase φ(H (j ω)). The system’s impulse response
is shown in Figure 3.3.
The phase plot shows that at low frequencies the output has the opposite sign to the
input. In contrast, at higher frequencies, the output is in phase with the input, although
greatly attenuated.
This behavior is consistent with a physical interpretation of a system having elastic,
viscous, and inertial properties.
A(D)y = B(D)u
bm s m + · · · + b1 s + b0 B(s)
H (s) = = (3.14)
s + an−1 s
n n−1 + · · · + a1 s + a0 A(s)
200
Imaginary (rad/sec)
100
−100
−200
−60 −40 −20 0 20
Real frequency (rad/sec)
Figure 3.5 Pole-zero map of a continuous-time, parametric model of the human ankle compli-
ance. Notice that the system has two poles and no zeros.
of u(t) = sin(ωt), and then taking the inverse Laplace transform. Operationally, this
corresponds to making the substitution j ω = s in the transfer function.
Gωn2
H (s) =
s 2 + 2ζ ωn s + ωn2
where ωn is the undamped natural frequency of the system, G is its DC gain, and ζ is
the damping factor. For this model we have
ωn = 163.4 rad/s = 26 Hz
G = −0.0028
ζ = 0.35
In fact, this model was constructed using values for these parameters obtained from a
review on human joint dynamics (Kearney and Hunter, 1990). This transfer function was
used to generate all the compliance models presented in this chapter.
48 MODELS OF LINEAR SYSTEMS
Even though there is an analogy between the derivative and the forward difference,
it is the backward shift operator, z−1 , that is most commonly employed in difference
equation models. Thus, by analogy to equation (3.13), any deterministic, linear, time-
invariant, discrete-time system can be modeled using a difference equation of the form
B(z−1 )
z(t) = u(t) + w(t) (3.19)
A(z−1 )
The difference equation (3.18) and output error (3.19) models are sometimes referred
to as autoregressive moving average (ARMA) models, or as ARMA models with additive
∗ Note that in this context, z and z−1 are operators that modify signals. Throughout this text, the symbol z(t)
is used in both discrete and continuous time to denote the measured output of a system. The context should
make it apparent which meaning is intended.
PARAMETRIC MODELS 49
0.5
Imaginary
0
−0.5
−1
1 0 1
Real
Figure 3.6 Pole-zero map of the discrete-time, parametric frequency domain representation
dynamic ankle compliance, at a sampling frequency of 200 Hz. Note the presence of two zeros at
z = −1.
noise in the case of an OE model. However, the term ARMA model is more correctly
used to describe the stochastic signal model described in the next section.
Finally, note that if A(z−1 ) = 1, the difference equation reduces to a finite impulse
response (FIR) filter,
where w(t) is filtered by the all-pole filter, 1/A(z−1 ). In a systems context, w(t) would
be termed a process disturbance since it excites the dynamics of a system. The origin of
the term autoregressive can best be understood by examining the difference equation of
an AR model:
which shows that the current value of the output depends on the current noise sample,
w(t), and n past output values.
50 MODELS OF LINEAR SYSTEMS
A more general signal description extends the filter to include both poles and zeros,
This structure is known as an autoregressive moving average (ARMA) model and has
the same form as the deterministic difference equation, (3.18). However, in this case the
input is an unmeasured, white-noise, process disturbance so the ARMA representation
is a stochastic model of the output signal. In contrast, equation (3.18) describes the
deterministic relationship between measured input and output signals.
u (t ) z (t )
Gc (z ) Gp (z)
(B)
u (t ) Gc(z)Gp(z )
1 + Gc(z )Gp(z )
w1(t ) Gp(z ) z (t )
1 + Gc(z )Gp(z )
w2(t ) 1
1 + Gc(z )Gp(z )
Figure 3.7 Two equivalent representations of a closed-loop control system with controlled input,
u(t), and two process disturbances, w1 (t) and w2 (t). (A) Block diagram explicitly showing the
feedback loop including the plant Gp (z) and controller Gc (z). (B) Equivalent representation com-
prising three open-loop transfer functions, one for each input. Note that the three open-loop transfer
functions all share the same denominator.
PARAMETRIC MODELS 51
incorporates terms accounting for both a measured (exogenous) input, u(t), and an unob-
served process disturbance, w(t). Consequently, its output will contain both deterministic
and stochastic components.
The ARMA model can be generalized in a similar manner to give the ARMAX model,
B(z−1 ) C(z−1 )
y(t) = −1
u(t) + w(t)
A(z ) A(z−1 )
This makes it evident that the deterministic input, u(t), and the noise, w(t), are filtered
by the same dynamics, the roots of A(z−1 ). This is appropriate if the noise is a process
disturbance. For example, consider the feedback control system illustrated in Figure 3.7.
Regardless of where the disturbance (or control signal) enters the closed-loop system, the
denominator of the transfer function will be 1+Gc (z)Gp (z). Thus, both the deterministic
and noise models will have the same denominators, and an ARX or ARMAX model
structure will be appropriate.
However, if the noise source is outside the feedback loop, or if the process generating
the disturbance input contains additional poles not found in the closed-loop dynamics,
the ARMAX structure is not appropriate. The more general Box–Jenkins structure
B(z−1 ) C(z−1 )
y(t) = u(t) + w(t) (3.25)
A(z−1 ) D(z−1 )
addresses this problem. Here the deterministic and stochastic inputs are filtered by sep-
arate dynamics, so the effects of process and measurement noise may be combined
into the single term, w(t). The Box–Jenkins model is the most general parametric linear
system model; all other linear parametric models are special cases of the Box–Jenkins
model. This is illustrated in Figure 3.8 as follows:
1. The output error model Figure (3.8B) is obtained by removing the noise filter from
the Box–Jenkins model, so that C(z−1 ) = D(z−1 ) = 1.
2. The ARMAX model Figure (3.8C), is obtained by equating the denominator poly-
nomials in the deterministic and noise models (i.e., A(z−1 ) = D(z−1 )).
3. Setting the numerator of the noise model to 1, C(z−1 ) = 1, reduces the ARMAX
model to an ARX structure Figure (3.8D).
4. The ARMA model (Figure 3.8E) can be thought of as a Box–Jenkins (or ARMAX)
structure with no deterministic component.
5. Setting the numerator of the noise model to 1, C(z−1 ) = 1, reduces the ARMA
model to an AR model Figure (3.8F)
52 MODELS OF LINEAR SYSTEMS
C (z −1)
D (z −1) w (t )
u (t ) B (z −1) z (t ) u (t ) B (z −1) z (t )
A (z −1) A (z −1)
C (z −1) 1
A (z −1) A (z −1)
u (t ) B (z −1) z (t ) u (t ) B (z −1) z (t )
A (z −1) A (z −1)
C (z −1) 1
A (z −1) A (z −1)
z (t ) z (t )
Figure 3.8 Taxonomy of the different parametric difference equation-type linear system models.
The input, u(t), and output, z(t), are assumed to be measured signals. The disturbance, w(t), is an
unobserved white noise process. (A) The Box–Jenkins model is the most general structure. (B–F)
Models that are special cases of the Box–Jenkins structure.
This representation is known as a state-space model where the state of the system
is defined by an n element vector, x(t). The n × n matrix A maps the n-dimensional
state onto its time derivative. For a single-input single-output (SISO) system, B is an
n × 1 (column) vector that maps the input onto the derivative of the n-dimensional state.
STATE-SPACE MODELS 53
Similarly, C will be a 1×n (row) vector, and D will be a scalar. Multiple-input multiple-
output (MIMO) systems may be modeled with the same structure simply by increasing
the dimensions of the matrices B, C and D. The matrices B and D will have one column
per input, while C and D will have one row per output.
In discrete time, the state-space model becomes a set of n coupled difference equations.
Note that the terms representing the process disturbance, w(t), and measurement noise,
v(t), remain distinct. This contrasts with the Box–Jenkins model, where they are com-
bined into a single disturbance term.
The impulse response of a discrete-time, state-space model may be generated by setting
the initial state, x(0), to zero, applying a discrete impulse input, and solving equations
(3.27) for successive values of t. This gives
T
h= D CB CAB CA2 B ... (3.29)
where T is an invertible matrix. The impulse response of this transformed system will be
T
hT (k) = DT CT BT CT AT BT . . .
T
= D CTT−1 B CTT−1 ATT−1 B ...
T
= D CB CAB . . .
which is the same as the original system. Hence, T is a similarity transformation that does
not alter the input–output behavior of the system. Thus, a system’s state-space model will
have many different realizations with identical input–output behaviors. Consequently, it
is possible to choose the realization that best suits a particular problem. One approach is
to seek a balanced realization where all states have the same order of magnitude; this is
best suited to fixed precision arithmetic applications. An alternative approach is to seek
54 MODELS OF LINEAR SYSTEMS
0.1
0.05
0
Complicance (Nm/rad)
−0.05
−0.1
−0.15
−0.2
−0.25
−0.3
0 0.02 0.04 0.06 0.08 0.1
Lag (s)
Figure 3.9 Impulse response computed from the discrete state-space model of the dynamic com-
pliance of the human ankle.
a minimal realization, one that minimizes the number of free parameters; this is most
efficient for computation since it minimizes the number of nonzero matrix elements.
1. Bendat and Piersol (1986) is a good reference for nonparametric models of linear
systems, in both the time and frequency domains.
THEORETICAL PROBLEMS 55
2. The relationships between continuous- and discrete-time signals and systems are
dealt with in texts on digital signal processing, such as Oppenheim and Schafer
(1989), and on digital control systems, such as Ogata (1995).
3. There are several texts dealing with discrete-time, stochastic, parametric models.
Ljung (1999), and Söderström and Stoica (1989) in particular are recommended.
4. For more information concerning state-space systems, the reader should consult
Kailath (1980).
Next, compute its impulse response. How long does it take for the impulse response
to decay to 10% of its peak value? How long does it take before it reaches less than
1%? How many lags would be required for a reasonable FIR approximation to this
system?
4. Consider the following state-space system