0% found this document useful (0 votes)

243 views178 pages

A Complex Variable Approach To The Analysis of Linear Multivariable Feedback Sy PDF

Uploaded by

rakheep123

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

243 views178 pages

A Complex Variable Approach To The Analysis of Linear Multivariable Feedback Sy PDF

Uploaded by

rakheep123

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 178

Lecture Notes in

Control and
Information Sciences
Edited by A.V. Balakrishnan and M.Thoma

lan Postlethwaite
Alistair G. J. MacFarlane

A Complex Variable Approach

to the Analysis of Linear
Multivariable Feedback Systems

Springer-Verlag
Berlin Heidelberg New York 1979
Series Editors
A.V. Balakrishnan M. Thoma

Advisory Board
A. G. J. MacFarlane H. Kwakernaak Ya. Z. Tsypkin

Authors
Dr. I. Postlethwaite,
Research Fellow, Trinity Hall, Cambridge, and
SRC Postdoctoral Research Fellow,
Engineering Department, University of Cambridge.

Professor A. G..I. MacFarlane,

Engineering Department, University of Cambridge,
Control and Management Systems Division,
Mill Laqe,
Cambridge C B 2 1RX.

ISBN 3-540-09340-0 Springer-Verlag Berlin Heidelberg NewYork

ISBN 0-387-09340-0 Springer-Verlag NewYork Heidelberg Berlin

This work is subject to copyright. All rights are reserved, whether the whole
or part of the material is concerned, specifically those of translation, re-
printing, re-use of illustrations, broadcasting, reproduction by photocopying
machine or similar means, and storage in data banks.
Under 54 of the German Copyright Law where copies are made for other
than private use, a fee is payable to the publisher, the amount of the fee to
be determined by agreement with the publisher.
by Springer-VerlagBerlin Heidelberg 1979
Printed in Germany
Printing and binding: Beltz Offsetdruck, Hemsbach/Bergstr.
2061/3020-543210
Contents

CHAPTER 1 Introducti0n 1
References 6

CHAPTER 2 Preliminaries 8
2.1 System description 8
2.2 Feedback configuration ii
2.3 Stability 13
2.3-1 Free systems 13
2.3-2 Forced systems 16
2.4 Relationship between open- and closed-loop
characteristic polynomials for the general
feedback configuration 17
References 20

CHAPTER 3 Characteristic gain functions and

characteristic f r e q u e n c y 'functions 22
3.1 Duality between open-loop gain and
closed-loop frequency 22
3.2 Algebraic functions: characteristic
gain functions and characteristic
frequency functions 25
3.3 Characteristic gain functions 27
3.3-1 Poles and zeros of a characteristic
gain function 29
3.3-2 Algebraic definition of poles and zeros
for a transfer function matrix 32
3.3-3 Relationship between algebraically defined
poles/zeros of the open-loop gain matrix
G(s) and the poles/zeros of the corresponding
set of characteristic gain functions 36
3.3-4 Riemann surface of a characteristic
gain function 39
3.3-5 Generalized root locus diagrams 46
3.3-6 Example of frequency surface and
characteristic frequency loci 47
3.4 Characteristic frequency functions 49
3.4-1 Generalized Nyquist diagram 50
3.4-2 Example of gain surface and
characteristic gain loci 51
References 57

CHAPTER 4 A ~eneralized Nyquist stability criterion 58

4.1 Generai izea Nyquist stability crfterion 58
4.2 Proof of the generalized Nyquist stability
criterion 60
4.3 Example 72
References 75

CHAPTER 5 A generalized inverse Nyquist stability

criterion 77
5.1 Inverse characteristic gain functions 77
5.2 Pole-zero relationships 78
N

5.3 Inverse characteristic gain loci -

generalized inverse Nyquist diagrams 81
5.4 Generalized inverse Nyquist criterion 81
5.5 Proof of generalized inverse Nyquist
stability criterion 86
5.6 Example 97
References 99

CHAPTER 6 Multivariable root loci iOO

6.1 Theoretical background IOO
6.2 Asymptotic behaviour 104
6.2-1 Butterworth patterns 109
6.3 Angles of departure and approach 113
6.4 Example 1 115
6.5 Asymptotic behaviour of optimal closed-
loop poles 123
6.6 Example 2 126
References 130

CHAPTER 7 On parametricstability and future

research 132
7.1 Characteristic frequency and
characteristic parameter functions 132
7.2 Gain and phase margins 135
7.3 Example 135
7.4 Future research 138
References 140

Appendix 1 Definition of an algebraic functipn 143

Appendix 2 A reduction to the irreducible rational

canonical form .......................... 143

Appendix 3 The discriminant 147

Appendix 4 .method for constructing the Riemann

surface domains of the algebraic functions
correspondins~to an open-lo.op gaip
matrix G(s) 150

Appendix 5 Extended Principle of the. Argume.nt 157

Appendix 6 Multivariable pivots from the

ch'araceristic equation A'('g,s).=O 166

Appendix 7 Association between branch, point.s

and stat.ionary points on the gain
and frequency surf.aces 168

References 169

Bibliography 170

Index 174
1. Introduction

The great success of the optimal control and optimal

filtering techniques developed for aerospace work during the

late 1950's and early 1960's n a t u r a l l y led to attempts to

apply these techniques to a wide range of e a r t h - b o u n d multi-

variable industrial processes. In many situations this was

less than immediately successful, particularly in cases where

the available plant models were not s u f f i c i e n t l y accurate or

where the p e r f o r m a n c e indices required to stipulate the controlled

plant b e h a v i o u r were m u c h less obvious in form than in the aero-

space context. Moreover, the controller w h i c h results from a

direct a p p l i c a t i o n of optimal control and optimal filtering

synthesis techniques is in general a c o m p l i c a t e d one; in fact,

if it incorporates a full K a l m a n - B u c y filter it has a

dynamical complexity equal to that of the plant which it is

controlling, since the filter e s s e n t i a l l y consists of a plant

model w i t h feedback around it. In contrast, what was needed

for many m u l t i v a r i a b l e process control problems was a relatively

simple controller w h i c h w o u l d both stabilize, about an operating

point, a plant for w h i c h only a very a p p r o x i m a t e model might be

available, and also mitigate the effect of low-frequency

disturbances by i n c o r p o r a t i n g integral action. To industrial

engineers brought up on f r e q u e n c y - r e s p o n s e ideas the s o p h i s t i c a t e d

optimal control methods seemed difficult to use; these engineers

e s s e n t i a l l y relied on a m i x t u r e of physical insight and straight-

forward techniques, such as the use of derivative and integral

action, to solve their problems. It became obvious that a

huge gap in techniques existed between the classical single-

loop f r e q u e n c y - r e s p o n s e methods, based on the work of Nyquist [i],

Bode [2] and Evans [3],which were still in use for many

industrial applications, and the elegant and powerful multi-

variable time-response methods developed for aerospace

applications.

For these reasons an interest in frequency-response

methods slowly began to revive during the mid-1960's. An

important first step towards closing the yawning gap between

an optimal control approach and the classical frequency-response

approach was taken by Kalman [4] , who studied the frequency-

domain characterization of optimality. A systematic attack

on the whole problem of developing a frequency-response analysis

and design theory for multivariable systems was begun in a

pioneering paper by Rosenbrock [5] which ushered in a decade

of increasing interest in a rejuvenated frequency-response

approach. Prior to this new point-of-departure some fairly

straightforward attacks had been made on the multivariable

control problem. Boksenbom and Hood [6] put forward the

idea of a non-interacting controller. Their procedure consisted

simply of choosing a cascaded compensator such that the overall

transfer function matrix of the compensated system had a

diagonal form. If such a compensator could be found then the

controller design could be finished off using standard single-

loop design techniques. The required compensating matrix which

usually results from such a procedure is necessarily a complicated

one, and the most succinct objection to this approach is simply

that it is not essential to go to such drastic lengths merely

to reduce interaction. A natural further step in this initial

approach to multivariable control was to see what could be

achieved by way of standard matrix calculations using rational

matrices; papers studying the problem in this way were produced

by Golomb and Usdin [7] , Raymond [8] ; Kavanagh [9] , [iO],[ii] ,

and Freeman [12] , [13] Rosenbrock [14],[15], however, opened up

a completely new line of d e v e l o p m e n t by seeking to reduce a multi-
variable p r o b l e m to one amenable to classical techniques in a more
sophisticated way. In his Inverse Nyquist Array Method [14] , [15]
the aim was to reduce interaction to an amount which would then
enable single-loop techniques to be employed, rather than to eliminate
interaction completely. The Rosenbrock approach was based upon a
careful use of a specific criterion of partial interaction - the
diagonal dominance concept. The success of this m e t h o d led other
investigators to develop ways of seeking to reduce a m u l t i v a r i a b l e
control p r o b l e m to a succession of single-loop problems, as in the
Sequential Return Difference approach of Mayne [163.

In the non-interacting, or partially non-interacting,

approach to m u l t i v a r i a b l e control the m o t i v a t i o n was the eventual
deployment of classical single-loop frequency-response techniques
during the final stages of a design study. An alternative approach,
however, is to investigate the t r a n s f e r - f u n c t i o n matrix representation

as a single object in its own right and to ask : how can the key basic
concepts of the classical single-loop frequency-response approach

be suitably extended? What are the relevant g e n e r a l i z a t i o n s to the

multivariable case of the specific concepts of pole, zero, Nyquist
diagram and root locus diagram? It is to questions of this sort
that the work p r e s e n t e d here is addressed, and it is shown that
complex-variable ideas have an important role to play in the study
of m u l t i v a r i a b l e feedback systems. An early attempt to extend

Nyquist diagram ideas to the m u l t i v a r i a b l e p r o b l e m was made by Bohn

[17] , [18] A generalization of the Nyquist stability criterion
was put forward by MacFarlane [19] and, following that heuristic
treatment, complex-variable based proofs were supplied by Barman
and K a t z e n e l s o n [20] and MacFarlane and Postlethwaite [21] . This

generalization of the Nyquist stability criterion to the m u l t i v a r i a b l e

situation was soon followed by complementary generalizations of the

root locus technique E21], E22], ~23], E24].

The aim of the work p r e s e n t e d in this text is to extend

the concepts underlying the techniques of Nyquist, Bode and Evans

to m u l t i v a r i a b l e systems. In the two classical approaches to

linear feedback system design the N y q u i s t - B o d e approach studies

gain as a function of frequency and the Evans' approach studies

frequency as a function of gain. In Chapter 3 it is shown how

the ideas of studying complex gain as a function of complex

frequency and complex frequency as a function of complex gain can

be extended to the m u l t i v a r i a b l e case by a s s o c i a t i n g with transfer

function matrices (having the same number of rows and columns) a

pair of analytic functions : a characteristic gain function and a

characteristic frequency function. These are algebraic functions

[253 and each is defined on an a p p r o p r i a t e Riemann surface [26].

Chapter 2 deals With a number of essential p r e l i m i n a r i e s such as

a d e s c r i p t i o n of the type of m u l t i v a r i a b l e feedback system being

considered; w i t h basic definitions of stability and related

theorems; and w i t h a fundamental r e l a t i o n s h i p between open- and

c l o s e d - l o o p behaviour based on the r e t u r n - d i f f e r e n c e operator.

Chapter 3 also contains a comprehensive discussion of the b a c k g r o u n d

to the g e n e r a l i z e d Nyquist stability criterion for m u l t i v a r i a b l e

feedback systems w h i c h is p r e s e n t e d in Chapter 4. The proof of

this criterion is based on the Principle of the Argument applied

to an algebraic function defined on an appropriate Riemann surface.

In Chapter 5 a generalization of the inverse Nyquist stability

criterion to the multivariable case is d e v e l o p e d which is complement-

ary to the exposition of the g e n e r a l i z e d Nyquist criterion given in

the previous chapter. Using the material developed in Chapter 3,

the Evans' root locus approach is extended to m u l t i v a r i a b l e systems

in Chapter 6; this uses well established results in algebraic

function theory. It is also shown how an algebraic-function

based approach can be used to find the asymptotic behaviour of

the closed-loop poles of a multivariable time-invariant optimal

linear regulator as the weight on the input terms of a quadratic

performance index approaches zero.

As the work presented progresses it becomes evident that

the gain variable used can be considered as a parameter of the
system, and consequently that the techniques developed are not
only applicable to gain and frequency but to any parameter and
frequency. In Chapter 7 the effect of parameter variations
on a multivariable feedback system is considered by the intro-
duction of the concepts of 'parametric' root loci and 'parametric'
Nyquist loci. This chapter concludes with a few tentative
proposals and suggestions for future research.

Information of secondary importance which would unnecessar-

ily break the flow of the text has been placed in appendices.
References are listed at the end of each chapter in which they
are cited, and also at the end of the text where a bibliography
is provided.

References

[i] H.Nyquist, "Regeneration theory", Bell Syst. Tech. J.,

ii, 126-147, 1932.

[23 H.W.Bode, "Network analysis and feedback amplifier design",

Van Nostrand, Princeton, N.J., 1945.

[3] W.R.Evans, "Graphical analysis of control systems", Trans.

AIEE, 67, 547-551, 1948.

[4] R.E.Kalman, "When is a linear control system optimal?",

Trans. ASME J.Basic Eng., Series D., 86, 51-60, 1964.

[5] H.H.Rosenbrock, "On the design of linear multivariable

control systems", Proc. Third IFAC Congress London, I,
1-16, 1966.
[6] A.S.Boksenbom and R.Hood, "General algebraic method
applied to control analysis of complex engine types",
National Advisory Committee for Aeronautics, Report
NCA-TR-980, Washington D.C., 1949.

[7] M.Golomb and E.Usdin, "A theory of multidimensional

servo systems", J.Franklin Inst., 253(1), 28-57, 1952.

[8] F.H.Raymond, "Introduction a l'~tude des asservissements

multiples simultanes", Bull. Soc. Fran. des Mecaniciens,
7, 18-25, 1953.

[9] R.J.Kavanagh, "Noninteraction in linear multivariable

systems", Trans. AIEE, 76, 95-100, 1957.

[i0 ] R.J.Kavanagh, "The application of matrix methods to

multivariable control systems", J.Franklin Inst., 262,
349-367, 1957.

[li ] R.J.Kavanagh, "Multivariable control system synthesis",

Trans. AIEE, Part 2, 77, 425-429, 1958.

[12 ] H.Freeman, "A synthesis method for multipole control

systems", Trans. AIEE, 76, 28-31, 1957.

[13 ] H.Freeman, "Stability and physical realizability con-

siderations in the synthesis of multipole control systems",
Trans.AIEE, Part 2, 77, 1-15, 1958.

[14 ] H.H.Rosenbrock, "Design of multivariable control systems

using the inverse Nyquist array, Proc. IEE, 116, 1929-1936,
1969.

[15 ] H.H.Rosenbrock, "Computer-aided control system design",

Academic Press, London, 1974.

[16 ] D.Q.Mayne, "The design of linear multivariable systems",

Automatica, 9, 201-207, 1973.

[17 ] E.V.Bohn, "Design and synthesis methods for a class of

multivariable feedback control systems based on single
variable methods", Trans.AIEE, 81, Part 2, 109--115, 1962.

[18 ] E.V.Bohn and T.Kasvand, "Use of matrix transformations and

system eigenvalues in the design of linear multivariable
control systems", Proc. IEE, iiO, 989-997, 1963.

[19 ] A.G.J.MacFarlane, "Return-difference and return-ratio

matrices and their use in the analysis and design of
multivariable feedback control systems", Proc. IEE, 117,
2037-2049, 1970.

[20 ] J.F.Barman and J.Katzenelson, "A generalized Nyquist-

type stability criterion for multivariable feedback
systems", Int.J.Control, 20, 593-622, 1974.
[21] A.G.J.MacFarlane and I. Postlethwaite, "The generalized
Nyquist stability criterion and multivariable root loci",
Int. J. Control, 25, 81-127, 1977.

[22] B.Kouvaritakis and U.Shaked, "Asymptotic behaviour of

root loci of linear multivariable systems", Int. J.
Control, 23, 297-340, 1976.

[23] I. Postlethwaite, " The asymptotic behaviour, the

angles of departure, and the angles of approach of
the characteristic frequency loci", Int. J. Control,
25, 677-695, 1977.

[24] A.G.J.MacFarlane, B.Kouvaritakis and ~ E d m u n d s ,

"Complex variable methods for multivariable feedback
systems analysis and design", Alternatives for Linear
Multivariable Control, National Engineering Consortium,
Chicago, 189-228, 1977.

[25] G.A.Bliss, "Algebraic functions", Dover, New York,

1966 (Reprint of 1933 original).

[26] G. Springer, "Introduction to Riemann surfaces",

Addison-Wesley, Reading, Mass., 1957.
2. Preliminaries

This text considers the g e n e r a l i z a t i o n of the classical

techniques of Nyquist and Evans to a linear time-invariant

dynamical feedback system which consists of several multi-

input, m u l t i - o u t p u t subsystems connected in series. In this

chapter a d e s c r i p t i o n of the m u l t i v a r i a b l e feedback system

under c o n s i d e r a t i o n is given. The chapter also includes

basic definitions of stability, some associated theorems,

and a fundamental r e l a t i o n s h i p between open- and closed-loop

b e h a v i o u r based on the return-difference operator.

2.1 System description

The basic description of a linear t i m e - i n v a r i a n t dynamical

system is taken to be the state-space model

x(t) = Ax(t) + Bu(t)

(2.1.1)
y(t) = Cx(t) + Du(t)

where x(t) is the state vector, y(t) the output vector,

u(t) the input vector; x(t) denotes the derivative of

x(t) w i t h respect to time; A,B,C, and D are constant real

matrices. For c o n v e n i e n c e the model will be d e n o t e d by

S(A,B,C,D) or S when the m e a n i n g is obvious, and r e p r e s e n t e d

diagramatically as shown in figure i.

In general S(A,B,C,D) will be considered as being the

state-space representation of several subsystems

Si (Ai,Bi,Ci,Di) : xi(t) = A . x . (t) + B.u. (t)

1 i i i (2.1.2)

i=1,2, ...... h Yi(t) = C.x.l

l(t) + Diui(t)

connected in series, as i l l u s t r a t e d in figure 2. If, for

example,S consists of two subsystems S 1 and S 2 then the

"I D ,I
+1
u(t ' t)
I
1
g''~~ fA 'c I
I
J S I
I J

Figure 1. Stote-spctce model

r" "~ 1

u{t}=u~(t} I
llYh(t)=
y(t)
I .~
I
I
'
I
S J

Figure 2. Series connection

of subsystems
10

state-space description of S is g i v e n by e q u a t i o n s (2.1.1) with

x(t) = Ix l(t)] , the c o m b i n e d states of both s u b s y s t e m s ,

!
[x2(t) ]
u(t) = ul(t) , the input to S 1 ,

y(t) = Y2(t) , the o u t p u t of S 2, (2.1.3)

B2c I A2 tB2DlJ

C = [ D2C 1 C2 ] , and D = D2D 1.

The s t a t e - s p a c e model for an i n t e r c o n n e c t i o n of s e v e r a l

subsystems can be d e r i v e d from the above formula by s u c c e s s i v e

application.

The s t a t e - s p a c e model of a s y s t e m is o f t e n referred

to as an i n t e r n a l description since it r e t a i n s a knowledge

of the s y s t e m ' s internal dynamical structure. An e x t e r n a l

or i n p u t - o u t p u t description is o b t a i n e d if in e q u a t i o n s

(2.1.1) single-sided Laplace transforms [i] are taken, to give

s~(s) - x(o) = A~(s) + B~(s)

(2.1.4)
~(s) = c~(s) + D~(s)
where ~(s) denotes the L a p l a c e transform of x(t). If the

initial conditions at time t=O are all zero so that x(o)=O,

then the input and o u t p u t transform vectors are r e l a t e d by

~(s) = G(s) ~(s) (2.1.5)

where

G(s) = C(SIn-A)-IB + D (2.1.6)

I is a unit m a t r i x of o r d e r n and ( [idenotes the i n v e r s e

n
of a matrix. G(s) is a m a t r i x - v a l u e d rational function
11

of the c o m p l e x variable s, and is c a l l e d the t r a n s f e r

function m a t r i x for the set of i n p u t - o u t p u t transforms, or

the o p e n - l o o p ~ain matrix. The t r a n s f e r function matrix

G(s) can be r e g a r d e d as d e s c r i b i n g a system's response to

an e x p o n e n t i a l input w i t h e x p o n e n t s [2], and t h e r e f o r e

the c o m p l e x variable s can be c o n s i d e r e d a complex

frequency variable as in the s i n g l e - i n p u t , single-output case.

When S(A,B,C,D) consists of h s u b s y s t e m s

{Si(Ai,Bi,Ci,Di): i=i,2, .... ,h} , as shown in figure 2,

each s u b s y s t e m has a t r a n s f e r function matrix

Gi(s ) = Ci(SIn.-Ai)
1
-1Bi + Di (2.1.7)

and the i n p u t - o u t p u t transform vectors of S are r e l a t e d by

~(s) =Sh (s) Gh_l (s) .... Gl(S) ~(s) (2.1.8)

with the o b v i o u s relationship for the o p e n - l o o p gain m a t r i x

G(s) = G h ( S ) G h _ l ( S ) .... Gl(S) (2.1.9)

For the p u r p o s e of c o n n e c t i n g outputs back to inputs to

form a f e e d b a c k loop G(s) is a s s u m e d to be a square m a t r i x

of o r d e r m.

2.2 Feedback confi@uration

The g e n e r a l feedback configuration that w i l l be c o n s i d e r e d

is shown in figure 3. The output of the f e e d b a c k system

is shown as that of the hth s u b s y s t e m but in p r a c t i c e it

m a y be the o u t p u t from an e a r l i e r subsystem in w h i c h case

the later subsystems can be t h o u g h t of as b e i n g feedback

compensators. The p a r a m e t e r k is a real gain c o n t r o l

variable c o m m o n to all the loops. The s y s t e m ' s input and

output are r e l a t e d to the r e f e r e n c e input r(t) by the e q u a t i o n s

e(t) = r(t) - y(t)

(2.2.1)
u(t) = ke(t)

and c o m b i n i n g these w i t h equations (2.1.1) the f o l l o w i n g

closed-loop state-space equations are obtained:

x(t) = AcX (t) + Bcr (t)

(2.2.2)
y(t) = CcX (t) + D c r (t)

where

Ac = A-B(k-IIm+D)-Ic
B c = k B - k B (k-iIm+D)-iD

Cc = (Im+kD) -i c

Dc = (k-iim+D) -I D

y(t)
~l 7"'ml -11 1, "7"hi

Figure 3. Feedback configuration

2.3 Stability

Stability is the most important single requirement

of a feedback system and for general time-dependent

nonlinear systems it poses very complex problems. The

stability problem for linear time-invariant dynamical

systems, however, is much simpler than in the general case.

This is because:

(i) all stability properties are constant with respect

to time, and

(ii) all stability properties are global, since any solution

for the state of the system is proportional to the state

at time zero; see equation (2.3.2).

There are many definitions of stability in the literature

and broadly speaking these can be divided into two classes.

The first class of definitions concerns stability of free

systems i.e. those in which there is no input; the second

class of definitions concerns the behaviour of forced

systems ioe. those in which there is a given input. Both

types of stability are discussed below, the definitions and

associated theorems following very closely those given by

Willems [3] .
2.3-1 Free systems

Let us consider the closed-loop dynamical system of

figure 3 described by equations (2.2.2) with r(t)=O and

Cc=I. Then the stability problem reduces to that of

considering the free system

x(t) = AcX(t) (2.3.1)

The equilibrium state for equation (2.3.1) is clearly the

origin (assuming A is non-singular), and therefore a

c
solution of (2.3.1) which passes through the origin at some

time remains there for all subsequent times; this solution

is called the null solution. The stability of the origin

equilibrium state is characterized using the following

definitions

Definition i. The origin of the free system (2.3.1) is

called stable if when the system is perturbed from the

origin all subsequent motions remain in a correspondingly

small neighbourhood of the origin

Definition 2. The origin of the free system (2.3.1) is

called asymptotically stable if when the system is perturbed

slightly from the origin all subsequent motions return to

the origin.

Definition 3. The origin of the free system (2.3.1) is

called asymptotically stable in the large, or globally

asymptotically stable, if it is stable, and if every

motion converges to the origin as t+~.

The general solution of equation (2.3.1) is [3]

x(t ;x(t O),tO) = exp [ Ac(t-t O) ].x(t O) (2.3.2)

which shows clearly that if the free system is asymptotically

stable it is also asymptotically stable in the large. If J

is the Jordan canonical form [4] of A c such that

A = TJT -I (2.3.3)
c
with

J = Jl
J2

where each Jordan block J has the form

l
IS

J. = I. 1
l l

~i ".

hi 1

and li is an eigenvalue of Ac, then it can be shown [ 3] , that

exp[ Ac(t-to) ] = T exp[J(t-to)].T-i (2.3.4)
with

exp [ J(t-t O) ] = exp[J 1 (t-to) ]

exp [ J2 (t-to)]

exp[ Jk (t-to)]

and

exp [ Ji (t-to)] = "i t t2/2 ' .... tr-I/(r-l) : exp[~ i (t-to) ]

0 1 t tr-2/(r-2)'

000 1
where ~ = (t-t O )
and r is the order of the Jordan block Ji" The general

solution of the free system can therefore be expressed as

x(t; x(t O),t O ) = T exp[J(t-to)].T-iX(to ) (2.3.5)

and from this the following theorems can be derived; see

[31 for proofs.

Theorem i. The null solution of system (2.3.1) is asymptotically

stable if and only if all eigenvalues of the matrix A c have

negative real parts.
Theorem 2. The null solution of system (2.3.1) is stable
if and only if the matrix A c has no eigenvalues with positive

real parts, and if the eigenvalues with zero real parts

correspond to Jordan blocks of order i.

2.3-2 Forced s~stems

Let us consider the closed-loop dynamical system
(2.2.2) which has the general solution [3] ,
x(t; x(t o),t o ) = exp (Act).X(to)+/texp[Ac(t-T)].Bcr (T)dT
o
(2.3.6)
To study the stability properties of this system we need to
introduce the concept of input-output stability.
Definition 4. A dynamical system is called input-output
stable if for any bounded input a bounded output results
regardless of the initial state.
By theorem i, asymptotic stability of the unforced
system (2.3.1) implies that all the eigenvalues of A c have
negative real parts, in which case there exist positive numbers
P and a such that
llexp(Act) ll < P exp (-at) ~t~O (2.3.7)
where II. II denotes the Euclidean norm of a matrix or
vector [3]. From equations (2.2.2), (2.3.6) and (2.3.7)

we then have

II y(t)ll -< H Ccx(t)II + II Dcr(t)II

4 II Dcr(t) ll + II Ccexp(Act)'x(to)II

+Cfotllexp[Ac (t- T)] IIllBcr(t)lld~

d +c llx(to)li+ cbMP/a

where b=llBcll, c=llCcl], d=llDcl], and ]]r(t)l]4M ~ t >~ O.

This result is summarized in the following theorem.

Theorem 3. If the null solution of the unforced system
(2.3.1) is asymptotically stable, then the forced system (2.2.2)
is input-output stable.
Note that input -output stability implies asymptotic stability
17

of the equilibrium state at the origin only if the system

(2.2.2) is state controllable and state observable; or if

all unobservable and/or uncontrollable modes have negative

real parts. In the remainder of this book system stabilit ~

is understood as meaning input-output stability coupled with

asymptotic stability of the equilibrium state at the origin.

Theorem 3 is important because it tells us that the

stability of a linear time-invariant system can be determined

solely from a knowledge of the eigenvalues of the system "A"

matrix. The stability conscious eigenvalues corresponding

to the closed-loop dynamical system (2.2.2) are values of 1

which satisfy the equation

det[IIn-Ac] = O (2.3.8)

The left-hand side of equation (2.3.8) is called the

closed-loop characteristi ~ polynomial, abbreviated as CLCP(1)

so that

CLCP(1)~ det [ IIn-Ac] (2.3.9)

Similarly for the open-loop system S(A,B,C,D) an open-loop

characteristic polynomial, OLCP(1), is defined as

OLCP(1) ~ det[IIn-A ]

= detflInl-Al]det [ IIn2-A 2] . . . .

.... det[iInh-Ah] (2.3.10)

In the next section it is shown how the open- and

closed-loop characteristic polynomials are related via the

return-difference operator [ 5 ] .

2.4. Relationship between open- and closed-loop characteristic

polynomials for the general feedback configuration

Let us suppose that all the feedback loops of the general

~(s)

(a)

P(s) 6(s) r--'q ~(S) r--q 9(s)

- ~ k [m~----~Gl(S)k ..........-,,pl~S)1

i 0 0 ....

(b)
Figure 4. Feedback configuration
(a) dosed-loop
(b) open-toop
19

closed-loop configuration are broken and that the subsystems

are represented by their transfer function matrices; see

figure 4. The corresponding return-difference matrix

[5] for this break point is

F(s) I + L(s) (2.4.1)

m
where

L(s) = kGh(S)Gh_l(S) .... Gl(S)

= kG(s) (2.4.2)

is called the system return-ratio matrix [5] A return-

difference operator generates the difference between injected

and returned signal transforms from the injected signal

transform. It plays a major role in feedback theory since

the essence of forging a feedback link is making two sets

of signals identically equal, thus making the difference

between them identically zero. Both F(s) and L(s) are

matrix-valued rational functions of a complex variable and

the key concepts in this text revolve around the properties

of such matrices. The importance of the return-difference

matrix is emphasized in the relationship between open- and

closed-loop characteristic polynomials which is now derived

for the general feedback configuration.

If we take determinants of equation (2.4.1) and represent

G(s) by its state-space model, we obtain

detF(s) = det[Im+kC(SIn-A)-iB+kD] (2.4.3)

which using Schur's formula [6] for the evaluation of

partitioned determinants can be rewritten as

detF(s) = det[sI -A i B l+det[SIn-A ] (2.4.4)

which is equivalent to
20

detF(s) = det
-I
% letlSn l+etSn
: Is L--~C--J, I S k
= det IsIn-A+B (k-IIm+D) -IC :--im+~
0 ]-- det[SIn-A ]
[----ic ...... -,
= det [SIn-A+B(k-iIm+D)-iC]det[Im+k~ (2.4.5)
det [SIn-A ]
Now from equations (2.2.2) we have
A c = A-B(k-IIm+D)-Ic
and it is obvious from equation (2.4.3) that
detF(~) = det[Im+k ~
and therefore under the assumption that det F(~)~O we have
from equation (2.4.5) the following relationship

detF(s) = det[SIn-Ac] = d e t [,,,,s, In-, ,A,,,,,,,,c,, ] ~ CLCP(s)

detF (~) det [Sin_ A ] det [SInh_Ah] . . ..det
..... [slnl_A~ OLCP (s)

(2.4.6)
The zeros of the open- and closed-loop characteristic
polynomials, OLCP(s) and CLCP(s), are known as the open-
and closed-loop poles or characteristic frequencies respectively.
Relationship (2.4.6) shows how the matrix-valued
rational transfer functions F(s) and G(s) are intimately
related to the stability of a dynamical feedback system.
The study of such matrices and their eigenvalues opens the way
to suitable extensions of the classical techniques of Nyquist
[7] and Evans [8;9] ; the results of such a study are given
in Chapter 3.
References

[I] R. Bracewell, "The Fourier Transform and Its Applications",

McGraw-Hill, New York, 1965.
21

[2] A.G.J. MacFarlane and N. Karcanias, "Poles and zeros of

linear multivariable systems: a survey of the algebraic,
geometric and complex variable theory", Int. J. Control, 24,
33-74, 1976.

[3] J.L. Willems, "Stability Theory of Dynamical Systems",

Nelson, London, 1970.

[4] P.M. Cohn, "Algebra", Vol. i, Wiley, London, 1974.

5] A.G.J. MacFarlane, '~eturn-difference and return-ratio matrices

and their use in analysis and design of multivariable
feedback control systems", Proc. IEE, 117, 2037-2049, 1970.

[6] F.R. Gantmacher, "Theory of M a t r i c e ~ Vol. I, Chelsea, New

York, 1959.

[7] H. Nyquist, "The Regeneration Theory", Bell System Tech. J.,

ii, 126-147, 1932.

[8] W.R. Evans, "Graphical Analysis of Control Systems", Trans.

AIEE, 67, 547-551, 1948.

[9] W.R. Evans, "Control System Synthesis by Root Locus Method",

Trans. AIEE, 69, 1-4, 1950.
3. Characteristic gain functions and
characteristic frequency functions

In the analysis and design of linear single-loop

feedback systems the two classical approaches use complex
functions to study open-loop gain as a function of imposed
frequency (theNyquist-Bode approach), and to study closed-
loop frequency as a function of imposed gain (the Evans
root locus approach). The primary purpose of this chapter
is to show how these techniques can be extended to the multi-
variable case by associating with appropriate matrix-
valued rational functions of a complex variable characteristic

gain functions and characteristic frequency function s.

3.1 Duality between open-loop vai n and closed-loop frequency

For the general feedback configuration of figure 4 we
have from section 2.4 the fundamental relationship

detF (s) det [Sln-Ac ]

detF(-~ = det[s~n, A ] (3.1.1)

where the return-difference matrix F(s) is given as

F(s) =
I + kG(s) (3.1.2)
m
If we substitute for F(s) in equation (3.1.1) we obtain

det[ SIn-Ac] _ det[ Im+kG(s) ]

det[ SIn-A ] det[ Im+kD ]

det[k-iIm+G(s~
= (3.1.3)
det[ k-iIm+D]

and substituting for the gain variable k using the expression

g= -I (3.1.4)

where g is allowed to be complex i.e. g e ~ (the complex

plane), we have
23

det[ SIn-Ac ] det[gIm-G (S)]

(3.1 .5)
det[ Sin-A ] det[ qIm-D ]
The closed-loop system matrix A c is given in equations
(2.2.2) as
A c = A - B(k-IIm+D)-Ic (3.1.6)

and substituting for k from equation (3.1.4) we have

A c = A + B(gIm-D)-Ic
S(g) (3.1.7)
The expression (3.1.5) can therefore be rewritten as

det[ SIn-S (g)] det[gIm-G(s) ]

det[ Sin-A ] det[gIm-D ]
or (3.1.8)

det[ SIn-S (g)] det[ gIm-G (s)]

det[ SIn-S (~ ~ det[ gIm-G (~)]

The form of this relationship shows a striking 'duality'

between the complex frequency variable s and the complex
gain variable g via their 'parent' matrices S(g) and G(s)
respectively. This duality between the roles of frequency
and gain forms the basis on which the classical complex
variable methods are generalized to the multivariable case.
S(g) is called the closed-loop frequency matrix; its eigen-
values are the closed-loop characteristic frequencies and
are clearly dependent on the gain variable g. The eigenvalues
of the open-loop 9ain matrix G(s) are called open-loop
characteristic ~ains and are clearly dependent on the frequency
variable s. The similarity between G(s) and S(g) is
stressed if one examines their state-space structures:
G(s) = C(SIn-A)-IB + D (3.1.9)
S(g) = B(gIm-D)-Ic + A (3.1.10)
24

In figure 5 the feedback configuration of figure 4a is

redrawn with zero reference input, the state-space representation

for G(s),and the substitution (3.1.4) for k in order to

illustrate explicity the duality between the closed-loop

characteristic frequency variable s and the open-loop

characteristic gain variable g.

~ I n _ ] ......

Figure 5. Feedback configuration illustrating

the duality between s and g

The importance of relationship (3.1.8) is that it shows,

for values of s ~ ~ (A) and values of g ~ ~(D) (this condition

is equivalent to det F(~)~O which has already been assumed),

where a(A) denotes the spectrum of A , that

I det[SIn-S(g)] = O ~ det[gIm-G(s)]=O~ (3.1.11)

This tells us that a knowledge of the open-loop characteristic

gain as a function of frequency is equivalent to a knowledge

of closed-loop characteristic frequency as a function of

gain. The inference from this is that it ought to be possible

to determine the stability of a feedback system from a

knowledge of the characteristic gain spectrum of G(s).

Note that from equation (3.1.8) we have that

CLCP(s) = det[gIm-G(s)] . OLCP(s)

(3.1.12)
det[gI m -D]

and such an expression makes it intuitively obvious that

there should be a generalization of Nyquist's stability theorem

to loci of the characteristic gains of G(s) as a function of

frequency.

3.2 A19ebraic functions: characteristic 9ain functions and

characteristic ' frequency function s.

The characteristic equations for G(s) and S(g) i.e.

d(g,s) ~ det[gI m - G(s)] = 0 (3.2.1)

and
?(s,g) ~ det[sI n - S(g)] = 0 (3.2.2)

are algebraic equations relating the complex v a r i a b l ~ s and g.

Each equation can be considered as a polynomial in g or s

with coefficients which are rational functions in s or g

respectively, and if irreducible over the field of rational

functions each equation defines a pair of algebraic functions

[i; appendix i]:

(i) a characteristic 9 a i n function g(s) which gives open-

loop characteristic gain as a function of frequency, and

(ii) a characteristic frequency function s(9) which gives

closed-loopcharacteristic frequency as a function of gain.

In general equations (3.2.1) and (3.2.2) will not be

irreducible and each equation will define a set of characteri-

stic gain and characteristic frequency functions. For

simplicity of exposition and because this is in any case the

usual situation for G(s) and S(g) arising from p r a c t i c a l

situations, it will normally be assumed that equations

(3.2.1) and (3.2.2) are irreducible over the field of rational

functions.

A l t h o u g h b o t h equation (3.2.1) and equation (3.2.2)

define the same functions g(s) and s(g), equation (3.2.2)

will in general contain more information about the system.

It is possible under certain c i r c u m s t a n c e s that ?(s,g) will

contain factors of s independent of g w h i c h are not present

in ~(g,s). These factors occur in the following situations:-

(i) When the A - m a t r i x of the open-loop system S(A,B,C,D)

has eigenvalues which correspond to modes of the system

which are u n o b s e r v a b l e and/or u n c o n t r o l l a b l e from the point

of view of considering the input as that of the first sub-

system and the output as that of the hth sybsystem. Note

that if output m e a s u r e m e n t s for earlier subsystems are

available then in practice some of the u n o b s e r v a b l e modes

of S(A,B,C,D) m a y in fact be observable.

(2) When the poles and zeros of the o p e n - l o o p gain matrix

G(s) are different from the poles and zeros of the characteristic

gain function g(s); see section 3.3-3.

These two conditions, under w h i c h equations (3.2.1) and

(3.2.2) differ, clearly p r e s e n t problems to the d e v e l o p m e n t

of a Nyquist-like stability criterion in terms-of loci of

the characteristic gains of G(s). However, by relating the

poles and zeros of g(s) to the poles and zeros of G(s), and

by careful consideration of the unobservable and uncontrollable

modes these problems can be overcome; a generalized Nyquist

stability criterion is developed in chapter 4.

In the next section a detailed study of the characteristic

gain function is given which results in a generalization

of the root locus diagram. In section 3.4 a similar study

of the characteristic frequency function results in a

generalized Nyquist diagram.

3.3 Characteristic @ain functions

The natural way to define the characteristic gain functiong(s)

is via the characteristic equation

A(g,s) ~ det[gIm-G(s) ] = O (3.3.1)

In general ~(g,s) will be reducible to the form

A(g,s) = Al(g,s)~2(g,s) ...... ~(g,s) (3.3.2)

where the factors {Ai(g,s) :i=1,2 .... ,} are polynomials in

g which are irreducible over the field of rational functions

in s. Let the irreducible factors Ai(g,s) have the form

ti ti-i
Ai(g's) = gi +ail(s)gi + ...... +ait. (s)=O
1 (3.3.3)

where t i is the degree of the ith irreducible polynomial

and the coefficients {aij(s) :i=l,2,...,;j=l,2,...,t i} are

rational functions in s. Then if b (s) is the least

lO
common denominator of the coefficients {aij(s) :j=l,2,...,t i}

equation (3.3.3) can be put in the form

t. t.-i
bio(S)g i l+bil(s)g i i + ...... +biti(s) = O (3.3.4)

i=i,2,...,

where the coefficients {bij(s) li=l,2,...,;j=l,2, .... ,t i}

are polynomials in s. The function of a complex variable

gi(s) defined by equation (3.3.4) is called an algebraic

function [i; appendix i]. Thus associated with an open-

loop gain matrix G(s) is a set of algebraic functions

{gi(s): i=l,2,...,Z} which are directly related to the

eigenvalues of G(s). The characteristic gain functions

of G(s) are defined to be the set of algebraic functions

{gi(s):i=l,2,...,~}.

The problem of finding the irreducible polynomials

{A i(g,s):i=l,2,...,} from which the characteristic gain

functions are defined is closely linked to the problem of

finding an appropriate canonical form of G(s). If A(g,s)

was reducible to factors linear in g then G(s) could be

put into Jordan form [2]. In general this will not be the

case and a suitable canonical form is defined as follows.

Let

O O . . O -aiti(s)

10 . . . O -ai,ti_l(S
(3.3.5)
C(Ai)~ O 1 . . . O -ai,ti_2(s)

O 1 -ail (s)

for t.>l with

c (A i) =A - a l l (s) if ti=l (3.3.6)

then a transformation matrix E(s) exists such that

-i
G(s) = E(s) Q(s) E(s) (3.3.7)

where Q(s) is a unique block diagonal matrix, which is

called the irreducible rational canonical form of G(s)

and is given by

Q(s) ~ diag[C(A I),C(A 2) ...... ,C(Az) ] (3.3.8)

It is clear that given Q(s) the irreducible factors Ai(g,s)

can easily be obtained. A proposed method for finding Q(s)

for any given G(s) is presented in appendix 2.

3.3-1 Poles and zeros of a characteristic vain function

Consider the defining equation for a characteristic gain

function g(s) :

~(g,s)~bo(s)g t + bl(S)g t-I + ... + bt(s) = O

(3.3.9)
We will take both

bo(S) ~ O and bt(s) ~ 0

since, if either or both of these polynomial coefficients

were to vanish, we could

find a reduced-order equation such
and
that both the coefficients of the highest/zeroth powers of

g(s) were non-zero; this reduced-order equation would then

be taken as defining an appropriate new algebraic function

for whose defining equation the supposition would be true.

It m a y happen however that bo(S) and bt(s ) share a

common factor and thus both vanish together at some specific

set of values of s . Before looking at the effect of this,

consider the situation when bo(S) and bt(s) do not share

a common factor. The algebraic function will obviously be

zero when

bt(s) = O (3.3.10)
30

and w i l l t e n d to i n f i n i t y as

bo(S) + O (3.3.11)

For this r e a s o n those v a l u e s of s which satisfy equation

(3.3.10) are d e f i n e d to be the zeros of the a l g e b r a i c function

g(s), and those v a l u e s of s which satisfy the e q u a t i o n

bo(S) = O (3.3.12)

are d e f i n e d to be the p o l e s of the a l g e b r a i c function g(s).

Unless stated otherwise the t e r m i n o l o g y 'poles and zeros'

should be t a k e n as r e f e r r i n g o n l y to finite poles and zeros.

The p o i n t s =~ r e q u i r e s special attention and is dealt w i t h

at the end of this sub-section.

In o r d e r to be able to take e q u a t i o n s (3.3.1o) and

(3.3.12) as d e f i n i n g the zeros and p o l e s of g(s) in the

general case, we m u s t show that they r e m a i n appropriate when

bo(s) and bt(s) share a common factor. Let us first d i s p o s e

of the t r i v i a l case w h e n all the c o e f f i c i e n t s {hi(s) : i=O,2,...,t}

share a c o m m o n factor by saying that such a common factor

would simply be d i v i d e d out to get a n e w d e f i n i n g equation

for an a p p r o p r i a t e algebraic function. Suppose t h e n that

bo(S) and bt(s) have a c o m m o n factor, but that some non-

empty set of c o e f f i c i e n t s {bu(S),bu+l(S),...,bv(S)} do

not share this c o m m o n factor. Then d i v i d i n g through the

left-hand side of e q u a t i o n (3.3.9) by bo(S) we get

t bl(S) gt-i b (s) bv(S) bt(s)
g + b----~ + "'" + u gt-U+, t-v+
-o" " bo(S ) "'+bo---~ "''+bo (s) = O
(3.3.13)

Then, as s + s w h e r e s is a zero of the c o m m o n factor of bo(S)

and bt(s), the m o d u l i of the c o e f f i c i e n t set

bu(S) bv(S)

all become arbitrarily large, and it is obvious that g(s)

will have a pole at s = s .

Again, suppose that bo(S) and bt(s) have a common

factor but that some non-empty set of coefficients

{bj(s), .... bm(S)} do not. Then as s ~ s where s is

a zero of the common factor, the algebraic equation (3.3.9)

may be replaced by

bj(s)gt-j(s)+ ... + bm(~)gt-m(~) = O (3.3.14)

where

bj(s) ~ O . . . . . bm(S) O

so that we must have

g(s) = 0

showing that s is indeed a zero of the algebraic function g(s).

We thus conclude that equations (3.3.10) and (3.3.12)

may be taken as defining the finite zeros and finite poles

of the algebraic function g(s), and that use of these

definitions enables us to cope with the existence of coincident

poles and zeros. The pole and zero polynomials of g(s),

denoted by p (s) and z (s), are defined as

g g
pg(S) ~ bo/ (s)

and (3.3.15)

Zg(S) ~ btl (s)

where b~(s) and bt[ (s) are the monic polynomials obtained

from bo(S) and bt(s) respectively, by dividing each polynomial

by its leading coefficient.

For the purpose of considering g(s) at the point s=~

we put
-i
s=z (3.3.16)

so that

~ (g,s) =~ (g,z-l)=z -q ~ (g,z) (3.3.17)

where q is the number of finite poles of g(s). In any

neighbourhood of the value z=O (the point z=O itself being

excluded from it) the equation ~(g,s)=O is equivalent to the

equation ~(g,z)=O. Therefore if we consider the equation

&
~(g,z)=Co(Z)gt+cl(z)gt-l+ .... +ct(z)=O (3.3.18)

it follows that:

(i) s =~ is a pole of the characteristic gain function g(s)

if and only if Co(O)=O

(ii) s =~ is a zero of the characteristic gain function g(s)

if and only if ct(o)=O

For an open-loop gain matrix G(s) describing a physically

realizable system, which by definition (see section 2.1) we

are considering here, it is not possible for g(s) to have

poles at infinity. In fact it is easy to show that for s=~

the values of the characteristic gain function g(s) are

simply the eigenvalues of D.

3.3-2 A!~ebraic definition of poles and zeros for a transfer

function matrix

Let T(s) be an mx rational m a t r i x - v a l u e d function of

the complex variable s. Then there exists a canonical

form for T(s), the Smith-McMillan form [3 ] MCs) , such

that

T(s) = H(s)M(s)J(s) (3.3.19)

where the mm matrix H(s) and the matrix J(s) are both
unimodular (that is having a constant value for their determinants,
independent of s ). If r is the normal rank of T(s)
(that is T(s) has rank r for almost all values of s ) then
M(s) has the form
]
M(s) = [ M*(s)rr Or, Z-r I (3.3.20)
I
r,r Om-r,m-rJ
with

Cl(S ) e2(s) er(S) l

M* (S) -- diag ~--~-~ , ~2(s ) , . . . . ~--~-~j(3.3.21)

where:
(i) each ei(s) divides all ei+ j (s) and

(ii) each ~i(s) divides all ~i-j (s).

With an appropriate partitioning of H(s),M(s) and J(s) we

therefore have

= HI (s)M* (S) Jl (s) (3.3.22)

where M*(s) is as defined in equation (3.3.21).
Thus T(S) may be expressed in the form

T(s) = Hi(s)
Idiag{ ei (s) 7
~i-~-s~}j Jl (s)

r ci(s) t
= (3.3.23)
h i(s) ~ 3i(s)
i=l
where :
(i) {hi(s) : i = 1,2 .... ,r} are the columns of the
matrix H l(s) ;
34

(ii) {j~(s) : i = 1,2,...,r} are the rows of the

matrix Jl(S)

We know that

r ~ min(,m)

and that H(s) and J(s) are unimodular matrices of full

rank m and Z respectively for all s. Suppose T(s) is the

transfer function matrix for a system with input transform

vector %(s) and output transform vector ~(s). Then any

A
input vector ~(s) is turned into an output vector y(s)

(3.3.24)

For the single-input single-output case where

y(s) = ke(s) ~(s)

~(s)

with k a constant, the transfer function

ks(s)
g(s) =
~(s)
is defined as having zeros at those values of s where (s)

vanishes and poles at those values of s where ~(s) vanishes.

Thus for a non-zero ~(s) the modulus of ~(s) vanishes

when s is a zero of g(s), and becomes arbitrarily large when s

is a pole of g(s). A natural way therefore to characterize

the zeros and poles of T(s) is in terms of those values of

s for which II ~(s)II becomes zero for non-zero II ~(s)ll '

and arbitrarily large for finite II ~(s) II ' where II "II

denotes the standard vector norm. 9his natural extension

of scalar case ideas leads directly to definitions of zeros

and poles of T(s) in terms of the Smith-McMillan form

quantities

E(s)

because of the following pair of simple results.

Zero lemma: II 9 (s)II vanishes for II ~ (s)II ~ O and s finite

if and only if some e. (s) is zero.

Pole lemma: II 9 (s)II+ ~ for II~(s)II < ~ if and only if

some ~i (s) + O.

These considerations lead naturally to the following

definitions [3].

Poles of T(s): The poIes of T(s) are defined to be the set

of all zeros of the set of polynomials {~i(s) : i = 1,2,...,r}.

In what follows we will usually denote the poles of T(s) by

{pl,P2,...,p n} and put

PT(S) = (s-Pl) (s-P2) ... (S-Pn) (3.3.25)

where PT(S) is conveniently referred to as the pole polynomial

of T(s) and is given by

r
PT(S) = ~ ~i(s) (3.3.26)
i=l

Zeros of T(s): The zeros of T(s) are defined to be the set

of all zeros of the set of polynomials {si(s) : i = 1,2,...,r}.

We will normally denote the zeros of T(s) by {Zl,Z 2 .... ,z }

and put

ZT(S) = (S-Zl) (s-z 2) ... (s-z) (3.3.27)

where ZT(S) is conveniently referred to as the zero polynomial

of T(s) and is given by

r
ZT(S) = ~ g. (s) (3.3.28)
i=l 1

It is important to remember that ZT(S) and PT(S) are not

necessarily relatively prime; for this reason it is wrong

to simply define ZT(S) and PT(S) for a square matrix T(s)

as the numerator and demominator polynomials of det T(s).

Rules for calculatin9 pole polyDomials and z e r o p o l y n o m i a l s

The route via the Smith-McMillan form is not always

convenient for the determination of the poles and zeros of

T(s), particularly if the calculation is being done by hand.

The following rules [4] can be shown to give the same results

as the Smith-McMillan definitions.

Pole polynomial rule: PT(S) is the monic polynomial obtained

from the least common denominator of all non-zero minors of

all orders of T(s).

Zero polynomial rule: ZT(S) is the monic polynomial obtained

from the greatest common divisor of the numerators of all

minors of T(s) of order r (r being the normal rank of T(s))

which minors hav e a l l been adjusted to have PT(S) as thei r

common denominator

3.3-3 Relationship between algebraically defined poles/zeros

of the open-loop 9ain matrix G(s) and the p~es/zer0 s of the

correspondin 9 set of characteristic @ain functions

As a key step in the establishment of a generalized

Nyquist stability criterion, it is crucially important to

relate the poles and zeros defined by algebraic means to

complex variable theory, and thus to the poles and zeros of

the set of characteristic gain functions.

The coefficients ai(s) in the expansion

det [gIm-G(s) ] = gm + al(s)gm-i + a2(s)gm-2 + ... +am(S)

(3.3.29)

are all appropriate sums of minors of Q(s) since it is well

known that:

det [gIm-G (s) ]

= g m - [trace G(s)]g m-1 + [~principal minors of G(s) of order 2]g m-2

- ... + (-l)mdet G(s) (3.3.30)

and thus the pcle polynomial bo;(S) is the monic polynomial

obtained from the least common denominator of all non-zero

principal minors of all orders of G(s).

Now the pole polynomial p~s) of a square matrix G(s)

is the monic polynomial obtained from the least common

denominator of all non-zero minors of all orders of G(s).

Therefore, if eG(s) is the monic polynomial obtained from the

least common denominator of all non-zero non-principa!

minors, with all factors common to bo;(S) removed, we have

that

PG (s) = eG(S)bo/(S) (3.3.31)

Furthermore since

det G(s) = am(S) = bm(S) (3.3.32)

5--(s)
o

and since from the Smith-McMillan form for G(s)

det G(s) = ~. ZG(S ) (3.3.33)

PG(S)

where a is a scalar quantity independent of s, we must have

that
38

ZG(S ) = eG (s) bml(S) (3.3.34)

In many cases the least common denominator of the non-

zero non-principal minors of G(s) will divide bo(S) ,

in which case eG(s) will be unity and the p01e and zero

polynomials for G(s) will be b~(s) and b i(s) respectively.

m
In general a square-matrix-valued function of a complex

variable G(s) will have a set of irreducible characteristic

gain functions in the form specified by equation (3.3.3) and

the general form for thepole and zero polynomials can be

written as

PG(S) = eG(s) H b/lo(s) (3.3.35)

i=l

and

ZG(S) = eG(s) ~ b i'ti(s)

i=l I (3.3.36)

where the pole and zero polynomials for the jth characteristic

gain function gj(s) are b!o(S) and b/ (s) respectively.

3 3,tj

Example demonstrating th 9 p o l e V z e r o r e l a t i o n s h i p s

Let
(s-l) (s+2) O
1
G(s) z

(s+l) (s+2) (s-l)

i s+l
o] (s+l) (s+2) (s-l) (s+l)

-i 1
s-i s+2

The pole polynomial for G(s) is obviously

PG(S) = (s+l)(s+2) (s-l)

and consequently the zero polynomial is

ZG(S) = (s-l)
39

The c h a r a c t e r i s t i c equation for G(s) is

i 1
det LgI-G(s)J = (g s+l ) (g - s - ~ ) = 0

so that the i r r e d u c i b l e characteristic equations are

1
Al(g,s) = g - ~-~ = O

and
1
A 2 (g,s) = g - s+---2 = O

which may be w r i t t e n as

(s+l)g - 1 = O

and

(s+2) g - 1 = 0

Therefore the p o ~ and zero p o l y n o m i a l s for the c h a r a c t e r i s t i c

gain functions gl(S) and g2(s) are

pgl(S) = blo(s) = (s+l) Zg I (s) = bll (s) = 1

-1
Pg2 (s) = b 2 0 (s) = (s+2) Zg 2(s) = b21(s) = 1
-i

Now for G(s) the m o n i c p o ~ n o m i a l obtained from the least

common denominator of all n o n - z e r o non-principal minors with

I I
all factors common to bJ(s)~ (=b~'o(S)b20(s))a removed is given by

e G (s) = (s-l)
which v e r i f i e s the r e l a t i o n s h i p s
2
PG(S) = eG(s) H b.; (s)
i= 1 lO

and
2
ZG(S) = eG(s) i=iH bilJ (s)

3.3-4 Riemann surface of a c h a r a c t e r i s t i c gain function

A characteristic gain function g(s) is d e f i n e d by an

irreducible equation of the form

bo(S)g t + bl(S)g t-I + ... + bt(s) = O (3.3.37)

having in general t distinct finite roots. An exception

occurs only if

(a) bo(S) = O, because the degree of the equation is then

lowered, and as b (s)O one or more of the roots becomes

o
infinite; or if

(b) the equation has multiple roots.

This last situation can occur for finite values of s if, and

only if, an exPression , called the discriminant of the

equation, vanishes. The discriminant [5] is an entire rational

function of the equation coefficients; it will be denoted

by Dg(S) , and is discussed in appendix 3.

Ordinary points of the characteristic gain function

An ordinary point [1;6] of the characteristic gain

function g(s) is any finite point of the complex plane such

that bo(S) ~ O and Dg(S) ~ O.

Critical points of t h e characteristic gain function

A critical point [i; 6] of g(s) is any point of the

complex plane at which either

b (s) = O or D (s) = O,
o g
or both, plus the point s =~.

Branch points of the characteristic function

Solutions of

D (s) = O
g
are called finite branch points of the characteristic gain

function. The point at infinity is a branch point if the

discriminant Dg(Z) of equation (3.3.18) satisfies Dg(O) = O

At every ordinary p o i n t t h e equation (3.3.37) defining the

characteristic gain function has t distinct roots, since

the discriminant does not vanish. The theory of algebraic

functions [i] then shows that in a simply connected region

of the complex plane punctured by the exclusion of the critical

points the values of the characteristic gain function g(s)

form a set of analytic functions; each of these analytic

functions is called a branch of the c h a r a c t e r i s t i c gain

function g(s). Arguments based on standard techniques of

analytic continuation, together w i t h the properties of

algebraic equations, show that the various branches can be

organized into a single entity: the corresponding algebraic

function. This is summarized in the following basic theorem

of algebraic function theory: an irreducible algebraic

equation of the form (3.3.37) defines precisely one t-valued

regular function g(s) in the punctured plane [7 ] .

Functions defined in this w a y are called algebraic

functions, and can be regarded as natural g e n e r a l i z a t i o n s

of the familiar e l e m e n t a r y functions of a complex variable.

An elementary function of a complex variable has the set of

complex numbers C as both its domain and its range. An

algebraic function has the complex number set C as its range

but has a new and a p p r o p r i a t e l y defined domain R which is

called its Riemann Surface [~ Since the Riemann

surface of an algebraic function plays a crucial role in this

work it is important to have an intuitive grasp of the ideas

underlying its definition and formation, which is therefore

now briefly considered.

Figure 6. Anotytic continuation

S u p p o s e we h a v e a r e p r e s e n t a t i o n of p a r t of one b r a n c h of

an a l g e b r a i c function in the f o r m of a p o w e r series; such

a representation is u s u a l l y c a l l e d a f u n c t i o n a l e l e m e n t .

I m a g i n e its c i r c l e of c o n v e r g e n c e to be cut out of p a p e r and

t h a t the i n d i v i d u a l p o i n t s of t h e p a p e r d i s c are m a d e b e a r e r s

of the u n i q u e f u n c t i o n a l v a l u e s of the e l e m e n t s . If n o w this

initital element is a n a l y t i c a l l y c o n t i n u e d by m e a n s of a

second power series, another c i r c l e of c o n v e r g e n c e can be

t h o u g h t of as b e i n g cut out and p a s t e d p a r t l y o v e r the first,

as i l l u s t r a t e d by f i g u r e 6. The p a r t s p a s t e d t o g e t h e r are

m a d e b e a r e r s of the same f u n c t i o n a l v a l u e s and are a c c o r d i n g l y

t r e a t e d as a s i n g l e r e g i o n c o v e r e d o n c e w i t h values. If a

further analytic continuation is c a r r i e d out, a further disc

is s i m i l a r l y p a s t e d on to the p r e c e d i n g one. Now suppose

that, after repeated analytic continuations, one of the d i s c s

lies o v e r a n o t h e r disc, not a s s o c i a t e d w i t h an i m m e d i a t e l y

preceding analytic continuation, as shown in figure 7.

Such an o v e r l a p p i n g disc is pasted together with the one

it overlaps if and only if both are bearers of the same

functional values. If,however,they bear different functional

values they are allowed to overlap but remain disconnected.

Thus two sheets, which are bearers of different functional

values, become superimposed on this part of the complex plane.

1
Figure 7 Repeated anatytic continuation

Continuing this process for as long as possible, a

surface-like configuration is obtained covering t "sheets"

of the complex plane, where t is the degree of the algebraic

function. To form the Riemann surface these sheets can be

joined together in the m o s t varied of ways. This m a y

involve connecting together two sheets which are separated

by several other sheets lying between them. Although such

a construction cannot be carried out in a t h r e e - d i m e n s i o n a l

space it is not difficult to give a p e r f e c t l y satisfactory

topological d e s c r i p t i o n of the process required. This

surface-like configuration is called the Riemann surface

of the m u l t i p l e - v a l u e d algebraic function. On the Riemann

surface the entire domain of values of the algebraic

function is spread out in a completely single-valued manner

so that, on every one of the t copies of the complex plane

involved, every point is the bearer of one and only one

value of the function.

A m e t h o d for building Riemann surfaces is given in

appendix 4. This involves the use of cuts in the complex

plane and it may be helpful to say a w o r d about them at this

point. Let an algebraic function g(s) have r critical

points { a l , a 2 , . . , a r } . Suppose them to be joined to one

another and then to the point at infinity by a line L .

Any line joining critical points will be called a cut. Let

L denote the set of complex numbers defined by the line L.

We then have that the solutions of equation (3.3.37) define

a set of t "distinct" analytic functions {gl(s),~2(s) .... ,gt(s)}

in the cut plane C -i . Each of these functions can be

analytically continued, by standard procedures, across the

cut L . NOW it fellows from the fundamental principles

of analytic c o n t i n u a t i o n that if an analytic function

satisfies an algebraic equation in one part of its domain

of definition, it must satisfy that equation in every region

into w h i c h it is a n a l y t i c a l l y continued. We must therefore

have that:

(i) there are only t "distinct" analytic functions

which satisfy the defining algebraic equation in the cut plane

C-i,
(ii) each analytic continuation of any of these

analytic functions {~i(s) : i = 1,2, .... t) gives rise to an

analytic function which also satisfies the defining algebraic

equation. It follows from this that the set of analytic

functions associated w i t h one side of the cut L must be a

simple p e r m u t a t i o n of the set of analytic functions

associated with the other side of the cut. Therefore by

identifying and suitably matching up c o r r e s p o n d i n g analytic

functions (via their sets of computed values) on opposite

sides of the cut L , one can produce an appropriate domain

on which a single analytic function may be specified which

defines a continuous single-valued m a p p i n g from this domain

into the complex plane. This function is of course the

algebraic function, conceived of as a single entity, and

the domain so constructed is its Riemann surface.

It is sufficient for the purposes of understanding this

book for the reader to know that a Riemann surface can be

constructed for any given algebraic function, on which its

values form a s i n g l e - v a l u e d function of positio n. Many

standard relationships and properties of analytic function

theory generalize, using the Riemann surface concept, to the

algebraic function case and, in p a r t i c u l a r the Principle of

the Argument holds on the Riemann surface; an extension of

the Principle of the Argument is developed in appendix 5.

The Riemann surface which is the domain of the

characteristic gain function g(s) will be called the

frequency surface or s-surface. When the o p e n - l o o p gain

matrix G(s) is m x m and has a corresponding characteristic

equation which is irreducible (i.e. the usual case in practice)

the frequency surface is formed out of m copies of the complex

frequency plane or s-plane.

3.3-5 G e n e r a l i z e d root locus diagrams

The characteristic gain function g(s) is a function of

a complex variable whose poles and zeros are located on the

frequency surface domain. It is convenient to exhibit the

nature of g(s) by drawing constant phase and constant

magnitude contours of g(s) on the frequency surface. If

the computational method outlined in appendix 4 is used to

construct the surface then the superposition of constant

phase and m a g n i t u d e contours is clearly a simple process.

The frequency surface can be thought of as the set of all

possible closed-loop characteristic frequencies associated

with all possible values of the complex gain parameter g.

When the surface is c h a r a c t e r i z e d by constant phase and

magnitude contours of g(s) we have a direct correspondence

between a closed-loop characteristic frequency and an open-

loop gain, and since the surface is constructed from m

copies of the complex frequency plane, for each value of s

there are m corresponding characteristic gains.

From equation (3.1.4) we have

g(s) = -1 (3.3.38)

so that the variation of the c l o s e d - l o o p poles (characteristic

frequencies) with the real control variable k traces out

loci which are equivalent to the 180 phase contours of g(s).

Equation (3.3.38) is a direct generalization of the defining

equation for the single-loop root locus diagram. The 180

phase contours of g(s) are the m u l t i v a r i a b l e root loci i.e.

the variation of the closed-loop poles with the gain control

variable k. The fact that m u l t i v a r i a b l e root loci 'live'

on a Riemann surface explains their c o m p l i c a t e d b e h a v i o u r

[9] as compared with the single-input, single-output case

where the root loci lie on a simple complex plane (a trivial,

i.e. one sheeted, Riemann surface). The m u l t i v a r i a b l e

root loci will sometimes be referred to as the c h a r a c t e r i s t i c

frequency loci.

Recall that in section 3.2 it was pointed out that the

characteristic equations for G(s) and S(g) are in general

different in that the equation for S(g) may contain factors

of s which are independent of g. These factors therefore

correspond to closed-loop poles which are independent of g,

or equivalently independent of the gain control variable k;

and, from the root locus point of view, these factors

correspond to degenerate loci each consisting of a single

point. The degenerate loci are therefore not p i c k e d out by

the 180 phase contours of g(s) on the frequency surface.

In practice the c h a r a c t e r i s t i c frequency loci are g e n e r a t e d

as the set of loci in a single copy of the complex frequency

plane traced out by the eigenvalues of S(g) as g traverses

the negative real axis in the gain plane. This approach

automatically picks out the d e g e n e r a t e loci. In common

with the classical root locus approach of Evans the characteristic

frequency loci are usually calibrated in terms of the gain

-i
control variable k=-g

3.3-6 Example of frg~uency surface and characteristic

frequency loci

As an i l l u s t r a t i v e example consider the general multi-

variable feedback configuration of figure 3 with a corresponding

open-loop gain m a t r i x
48

~,.':':~-- Root Ioc

Cut

Figure 8. Sheet 1 of the frequency surface

.... Root loci

Cut

-I

-3

Figure 9. Sheet 2 of the frequency surface

G(s) = 1 [s-i s]
1.25(s+i) (s+2) -6 s-2

The matrix is of order two and therefore the appropriate

surface will be constructed from two sheets of the complex

s-plane. The two sheets are shown characterized by

constant phase and magnitude contours of g(s) in figures

8 and 9. The cuts,identifiable by discontinuities in the

contours, are represented by thick black lines; and the

characteristic frequency loci, which are the 180 phase

contours of g(s), are identified by a diamond symbol.

The characteristic frequency loci indicate that variation

of the gain control parameter k, upwards from zero, causes

the system to experience stability, instability and stability

again. This phenomenon is clearly linked with the presence

of a branch point in the right half-plane (at s=~4).

Note that since we have completely characterized the

feedback configuration by its open-loop gain matrix there

are no unobservable or uncontrollable modes.

3.4 Characteristic frequency functions

The natural way to define the characteristic frequency

function s(g) is via the characteristic equation

V(s,g) ~ det[SIn-S(g) ] = 0 (3.4.1)

It is an algebraic function and the detailed study of the

characteristic gain function presented in the previous

section can be applied directly to it with the roles of s

and g reversed.

The Riemann surface which is the domain of the characteristic

frequency function will be called the ~ain surface or

@-surface. It is formed out of n copies of the complex

gain plane or g-plane since there are n values of closed-

loop characteristic frequency (closed-loop poles) for every

value of g. The gain surface can be thought of as the

set of all possible open-loop characteristic gains of the

open-loop gain matrix G(s) associated with all possible

closed-loop characteristic frequencies. In a similar

fashion to the gain function g(s) it is convenient to

exhibit the behaviour of s(g) on the gain surface by

superimposing constant phase and magnitude contours of

s(g) onto the surface. Like g(s) the frequency function

s(g) has poles and zeros but their significance is quite

different.

3.4-1 Generalized Nyquist diagram

Each 'sheet' of a gain surface characterized by constant

phase and magnitude contours of s(g)is divided into regions

corresponding to left half-plane and right half-plane closed

-loop characteristic frequencies. Therefore given such a

calibrated surface one can see at a glance which values of

g (or equivalently k) correspond to stable closed-loop

poles. The boundary between stable and unstable regions

is clearly the ~90 phase contours of s(g). The ~90 phase

contours of s(g) are a natural generalization of the single-

loop Nyquist diagram and are called characteristic gain loci.

In practice the characteristic gain loci are generated

as the loci in the complex gain plane traced out by the

eigenvalues of G(s) as s traverses the so called Nyquist

D-contour in the s-plane. Suppose that we consider a

portion of the imaginary axis. We can then compute a

set of loci corresponding to the eigenvalues gl(j~),...,gm(j~)

( where in this context j = /--2i-- )

in the following way:

(i) Select a value of angular frequency, say a

(ii) Compute the complex matrix G(j~ a)

(iii) Use a standard computer algorithm to compute the

eigenvalues of G(j~ a) , which are a set of complex

numbers denoted by {gi(J~a)} .

(iv) Plot the numbers {gi(J~a)} in the complex plane.

(v) Repeat with further values of angular frequency ~b,~c,

... etc., and join the resulting plots up into

continuous loci using a sorting routine based on the

continuity of the various branches of the characteristic

functions involved.

For the purpose of developing a g e n e r a l i z e d Nyquist

stability criterion in chapter 4 the Nyquist D - c o n t o u r is

traversed in the standard clockwise direction.

3.4-2 Example of ~ain surface and characteristic ~ain loci

As an illustrative example consider the open-loop

gain matrix considered in subsection 3.3-5 w h i c h has a

minimal state-space realization

=- 0.6

1 O.5

The system has two states and therefore the appropriate

gain surface will be c o n s t r u c t e d from two sheets of the

complex g-plane. The two sheets are shown c h a r a c t e r i z e d

by constant phase and m a g n i t u d e contours of s(g) in figures

i0 and ii. The characteristic gain loci, w h i c h are the

~9~phase contours of s(g), are denoted by a series of

crosses.
52

Charocteristic
.... gain loci
mmmm Cut

Figure 10. Sheet 1 of the gain surface

Characteristic
gain loci

b C~%

Figure11. Sheet 2 of the gain surface

~ Right. half
plane region
Left. half
plane region
. . . . . . . . Choract,erbtic
..... gain loci
Cut. between
branch points.

"l
-0"8~.533 19-2

Figure12. Sketch of figure 10 emphasizing right

half and left hatf-ptane regions

L
533 1<3,2

Figure13. Sketch of figure 11 emphasizing right

half and left half-ptane regions
54

In figures 12 and 13 sketches of figures iO and ii are

made to emphasize the right and left half-plane regions of

closed-loop poles. From these sketches it is easy to

infer bounds on the gain control parameter k for stability.

Since g=-k -I, as we increase k positively from zero the

critical value of g moves from -~ along the real axis towards

the origin on each sheet. On sheet i, g is in a right half-

plane region for 1.25<k<2.5 while on sheet 2, for positive

k, g never moves into a right half-plane region. Therefore

the closed-loop system is stable for 0~k<1.25 and 2.5<k<~.

If k is increased negatively from zero the critical value

of g moves from ~ along the real axis towards the origin

on each sheet. On both sheets the value of g is in right

half-plane regions for -~<k~-1.875. Therefore for negative

k, which corresponds to positive feedback, the closed-

loop system is stable for -1.875<k~0.

If the calibrated gain surface is projected onto the

complex gain plane ~ , we have the normal representation

of the characteristic gain loci, plus the superposition of

contours representing both right half-plane and left half-

plane regions. Stability can now be predicted by considering

the right half-plane regions in relation to a single critical

1
point (-~+30). However, this presentation will in general

be difficult to comprehend because of the overlapping of

contours. Therefore, although the counting of encirclements

in the generalized Nyquist stability criterion [chapter 4]

is not fundamental to system stability as pointed out by

Saeks [i0~, it does afford the simplest method of predicting

closed-loop stability in the gain plane ~ .

From a gain surface plot it is possible to determine

the closed-loop poles a n d hence the relative stability of

a closed-loop system. This is now i l l u s t r a t e d by finding

the dominant c l o s e d - l o o p poles for the example under

consideration with unity k. It is convenient for this

purpose to have the gain surface characterized by constant

real and imaginary contours as shown in figures 14 and 15.

Let the dominant closed-loop poles be

sd = ezj8
Then ~ is the smallest (in magnitude) negative real contour

that passes through any one of the critical, -I, values of

g, and 8 is the c o r r e s p o n d i n g imaginary contour. From

figures 14 and 15 we have

~- -0.05 and 8 = O

so that

sd ~ -0.05

By hand calculation the dominant c l o s e d - l o o p pole is

sd = -0.0528

Also from a gain surface it is possible to determine the

characteristic frequency loci. Apart from possible single-

point loci, the root loci are simply the values of s at which

the characteristic gain loci has a phase of 180 . Therefore

from a gain surface plot the root loci are determined by

the values of the constant contours as they cross the negative

real axis on each sheet. Similarly, from a calibrated

frequency surface, it is possible to determine the c h a r a c t e r i s t i c

gain loci from the values of constant contours as they cross

the imaginary axes.

U}
v ~
c~
6 E
r~ .Q
cs
O~ c-
O
~ql L
0 m.
QJ E
U~ v
L
LL
57

I ~ a l contours
labll~:l thus :

-0.9 -I'O -I'l -1'2 -#'3 -1'4 -f'5 Irnoginary contours

-oe 1.5 labelled thus : 0.5
Characteristic .......
gain |oci ........

-0.7 I'0- Cut

-06

-0.5
-0,4 0-5-
-0.3

4
-i.s ~ -,.o I -o:s..~f-o' ,!o 1.5

:2/ (lll :
0.9 ,'0 I"11~2 ~:3 ~.415
,o

-I.5-

Figure 15. Sheet 2 of the gain surface

References

Ill G.A. Bliss, "Algebraic Functions",

(reprint of 1933 original).
Dover, New York, 1966

[2] P.M. Cohn, "Algebra", vol. l, Wiley, London 1974.

[3] H.H. Rosenbrock, "State Space and Multivariable Theory",

Nelson, London, 1970.

[4] T. Kontakos, Ph.D. Thesis, University of Manchester, 1973.

[2] S. Barnett, "Matrices in Control Theory", Van Nostrand-

Reinhold, London, 1971.

[0] E. Hille, "Analytic Function Theory", Vol. 2, Ginn and Co.,

U.S.A., 1962.

[7] K. Knopp "Theory of Functions", Part 2, Dover, New York,

1947.

[8] G. Springer, "Introduction to Riemann Surfaces", Addison-

Wesley, Reading, Mass., 1957.

[9] B. Kouvaritakis and U Shaked, "Asymptotic behaviour of

root-loci of multivariable systems", Int. J. Control,
23, 297-340, 1977.

iO] R. Saeks, "On the Encirclement Condition and Its Generalization",

IEEE Trans. on Circuits and Systems, 22, 780-785, 1975.
58

4. A generalized Nyquist stability

criterion

The Nyquist stability criterion [i] is one of the most

fundamental results in the theory of linear feedback systems

and its generalization to the multivariable case is of

great interest. Such a generalization was put forward

by MacFarlane[2] and u s e d as part of a technique called

the Characteristic Locus Method [3] for feedback systems

analysis and design, but no satisfactory proof was supplied.

The proof of a generalized Nyquist stability theorem was

undertaken by Barman and Katzenelson [4;5] but their

approach ignored certain key algebraic properties of the

quantities involved; it also leaned very heavily on the

use of cuts in the complex plane; this made their treatment

technically complicated, and obscured the essential simplicity

of the result. The purpose of this chapter is to give

a rigorous proof of a generalized Nyquist-like stability

criterion for the general feedback configuration based on

a fundamental result in complex variable theory: the Principle

of the Argument applied to an algebraic function defined on

an appropriate Riemann surface [ appendix 5].

Two Nyquist-like stability tests are in fact stated and

proved, the usefulness of each depending on how the subsystems

are characterized. The two statements of the criterion

are given in section 4.1 and proved in section 4.2.

4.1 Generalized Nyquist stability criterion

If the subsystems of the general feedback configuration

of figure 3 are each characterized by a state-space model

then the following statement of the criterion is applicable.

Statement I. The general feedback configuration is closed-

loop stable if and only if:

(i) the net sum of anti-clockwise encirclements of the

critical point (-~+jO) by the set of characteristic gain

loci is equal to the number of right half-plane poles of G(s);

(2) the characteristic gain loci do not pass through the

critical point (-~+jO);

(3) the number of branches of the characteristic gain loci

passing through infinity is equal to the number of poles

of G(s) on the imaginary axis; and

(4) the eigenvalues of the A-matrix, of the open-loop

system S(A,B,C,D), which correspond to modes of the system

which are unobservable and/or uncontrollable from the point

of view of considering the input as that of the first sub-

system and the output as that of the hth subsystem, are

all in the left half-plane.

If the subsystems are completely characterized by

their transfer function matrices, or if it is known that

for each subsystem there are no unobservable and/or

uncontrollable modes in the right half-plane including the

imaginary axis, then the following statement of the criterion

applies.

Statement 2. The general feedback configuration is closed-

loop stable if and only if:

(i) bhe net sum of anti-clockwise encirclements of the

critical point (-~+jO) by the set of characteristic gain

loci is equal to the total number of right half-plane poles

of Gl(S),G2(s) .... , and Gh(S) ;

(2) the c h a r a c t e r i s t i c gain loci do not pass through the

critical point (-kl--+jO); and

(3) t h e number of branches of the characteristic gain

loci passing through infinity is equal to the number of

poles of G(s) on the imaginary axis.

Note that if condition (2) and/or condition (3) do not

hold the closed-loop system has one or more poles on the

imaginary axis and is therefore not input-output stable

although the e q u i l i b r i u m state at the origin may be stable.

4.2 Proof of the ~ e n e r a l i z e d Nyquist stability criterion

In section 2.4 it was shown how the return-difference

operator corresponding to the break point shoWn in figure 4

is related to the open- and c l o s e d - l o o p characteristic

polynomials by the following expression.

detF(s) = CLCP(s) (4.2.1)

detF (~) OLCP (s-sT

This fundamental r e l a t i o n s h i p is the foundation on which

the proof of the g e n e r a l i z e d Nyquist stability criterion

is based.

The first stage in the proof is to consider the

eigenvalue equation of the r e t u r n - d i f f e r e n c e matrix F(s),

that is

det[fI m - F(s)] = O (4.2.2)

which in general, as for the characteristic equation of

G(s), can be expressed as a product of irreducible algebraic

equations of the form

t. t.-i
dio (s)f i i+dil(s)f'i i + .... + dit (s) = O
i
i=i,2,...,~ (4.2.3)

defining a set of algebraic functions {fi(s) : i=l,2,,..,i}

Therefore, as in sub-section 3.3-3 where the p o l ~ and zeros

of G(s) are related to the poles and zeros of the characteristic

gain functions, the pole and zero polynomials of F(s) can

be related to the Ixle and zero polynomials of the algebraic

functions {fi(s):i=l,2,..,Z} as follows

PF(S) = eF(s) ~ d io
! (s)
i=l

and (4.2.4)

ZF(S) = eF(s) K di;tl (s)

i=l

By definition the open-loop characteristic polynomial

of the general feedback configuration is given by

OLCP(s)~det[SIn-A ]

=det[Sin~Al]det[sin2-A2] ... det[sIL n. -Ah ]

(4~2.5)
and it is easily shown [6] that

det[SIn[Ai] = PG. (S)Pd. (s) (4.2.6)

l l
where pG (s) is the pole polynomial for the transfer function
l
matrix Gi(s) and the monic polynomial Pdi(S) has as its
zeros the decoupling zeros of the ith subsystem associated

with that set of characteristic frequencies (eigenvalues)

of A i which correspond to modes of the ith subsystem which

are uncontrollable and/or unobservable. The open-loop

characteristic polynomial can therefore be expressed as

OLCP(s) = pG l(s)pG 2 ~s)...PGh(S)Pdl(S)Pd2(S)...pd (s)~,

(4.2.7)

The pole polynomial PG(S) for the open-loop gain matrix

G(s) is related to the pole polynomials of the subsystem

transfer functions through the relationship
62

PG(s)Px(S) = PGl(S)PG2(s)...p~(s) (4.2.8)

where Px(S) has as its zeros those poles of GI(S),G 2 (s), ....

d Gh(S) which are lost when G(s) is formed [ 7] . The

zeros of Px(S) are in fact a subset of the unobservable and

uncontrollable modes of the system S(A,B,C,D) where the

input is that of the first subsystem and the output is that

of the hth subsystem. The complete set of unobservable

and uncontrollable modes of S(A,B,C,D) is the set of zeros

of the polynomial Pd(S) where

Pd(S) = Px(s)Pdl (S)Pd2 (s)...Pdh (s) (4.2.9)

The open-loop characteristic polynomial can therefore be

rewritten as

OIX2P(s) = PG(S)Pd(S) (4.2. iO)

OLCP(s) = P G ( S ) P x (s)pd I ( s ) p d 2 ( s ) ' ' ' p ~ ( s ) (4.2.11)

and if we combine expression (4.2.10 with the fundamental

relationship (4.2.1) we obtain

CLCP (s) = pd (s) pG (s) detF (s) (4.2.12)
detF (~)

N o w from the Smith-McMillan canonical form for F(s) we

have that
detF(s) = ~. ZF(S) (4.2.13)
PF (s)
where 8 is a scalar quantity independent of s; and from

the structure of F(s), equation (3.1.2), it is clear that

the monic polynomials ZF(S) and PF(S) will be of the same

order and hence that

detF(~) = 8 (4.2.14)

Therefore combining equations (4.2.12-14) the closed-loop

characteristic polynomial is given by

ZF(S)
CLCP (s) = Pd(S)PG(S) (4.2.15)
P F (s)

If we now substitute from equations (4.2.4) and (3.3.35)

this expression can be rewritten as follows

CLCP(s) = Pd(S)eG(s) H b/lo(S) H d / (s)

i=l i=l iti
(4.2.16)

~d~o (s)
i=l

But using the eigenvalue shift theorem [ 8] , on equation

(3.1.2), we have that the characteristic functions

{fi(s) : i=i,2,... ,} of F(s) are related to the characteristic

gain functions {gi(s) : i=1,2...,} by

fi(s) = l+kg i(s) i=1,2 .... , (4.2.17)

and hence that the p o ~ p o l y n o m i a l s for both sets of

algebraic functions are identical that is

dio/(s) = ~ b~o(S) (4.2.18)
i=l i=l

and therefore from (4.2.16) we have

CLCP (s) = Pd(S)eG(s) H dit.l (s) (4.2.19)
i=l

Note that the polynomial H d I (s) is dependent on the

i=l it.I

gain control variable k.

The relationship (4.2.19) implies [ see section 2.3]

that the following conditions are necessary and sufficient

for closed-loop stability:

(a) eG(s) ~ O has only left half-plane roots;

(b) H dilt (s) = O has only left half-plane roots; and

i=l i

(c) Pd(S) = O has only left half-plane roots.

The next step in the proof is to show how condition

(b) can be replaced by an encirclement condition similar

to that in the classical Nyquist criterion.

For the set of irreducible characteristic equations

associated with the return-difference operator F(s) there

is a corresponding set of Riemann surfaces on which the

appropriate characteristic algebraic functions {fi(s) : i=i,2,...

become single-valued, and mappings from these surfaces on

to a corresponding fi - plane are one-to-one and continuous.

Let us consider the jth equation of the set defined by

equations (4.2.3). The degree of the equation is tj and

therefore the corresponding Riemann surface ~ f . is formed

3
by piecing together t. copies of the complex s-plane, C
3
Suppose now that a Nyquist D-contour, as shown in figure 16,

is drawn on each of the tj copies of C before they are

pieced together to f o r m ~ f
. Then when the surface is
3
formed the set of Nyquist D-contours combine to form a set

of closed Jordan contours [ 9] enclosing right half-plane

regions o f ~ f . . The extended Principle of the Argument

3
[appendix 5] can then be applied to each right half-plane

region on ~ f j . Therefore for a particular right half-

plane region, not necessarily simply connected but with a

boundary made up from Nyquist D-contours, we have that the

difference between the number of zeros and poles of the

algebraic function fi(s) in the region, is equal to the

number of clockwise encirclements of the origin in (the

complex f-plane) by the image of the boundary curves, under

fj(s) for that particular region. If we therefore consider

all the right half-plane regions o n ~ f and apply the

3
extended Argument Principle to each, we have that

N(fj , O) = Zf. - Pf. (4.2.20)

3 3
where:
(i) N(f@,O) is the net sum of clockwise encirclements

of the origin in C by the set of image curves, under fi (s) I

of the set of curves formed on ~ f . by piecing together
3
the appropriate set of Nyquist D-contours when f o r m i n g ~ f 3.;
(ii) Zf. is the number of right half-plane zeros of fj (s); and
3
(iii) Pf. is the number of right half-plane poles of fj (s).
3

Trn (s)

Radius r where
S -plane ) ..,......~ r-,-oobut#00
')///4/ / _
f
Re (s)

x Poles
O Zeros
z~ Branch points

Figure 16. Theoretical Nyquist D-contour

Note that the indentatiorson the D-contour are

necessary only from a theoretical viewpoint in the development

of the Extended Principle of the Argument [ appendix 5] and

in practice the D-contour is taken as the imaginary axis.

Condition (b) for closed-loop stability is equivalent

to saying that there are no zeros of

{fi(s) : i=1,2,...,~} (4.2.21)

in the right half-plane or on the imaginary axis, and can

therefore be replaced by

Zf. = 0 i=1,2,...,~
l
or (4.2.22)

N(fi,O) = -Pf. i=1,2,...,

l
plus the condition that the algebraic functions (4.2.21)

have no zeros on the imaginary axis. Conditions (4.2.22)

imply and are implied by

N(fi,O) = - ~ P (4.2.23)
i=l i=l

so that the necessary and sufficient conditions for closed-

loop stability can be rewritten as

(a') eG(S)=O has no right half-plane roots;

(b') e (s)=O has no roots on the imaginary axis;

(c') i=iZN(fi'O) =-i~l P % ;

(d ) {fi(s): i=l,2,..,Z } have no zeros on the imaginary

axis; and

(e') Pd(S) has only left half-plane zeros.

Now,as shown by equations (3.3.35) and (4.2.18),e(s)

together with the pole polynomials for the set of characteristic

functions {fi(s): i=1,2,...,} make up the pole polynomial

for G(s). This leads us to consider combining conditions

(a') and (c r) into the single equivalent condition

N ( f i ' O ) = -PG (4.2.24)

i=l
where PG is the number of right half-plane poles of G(s).

The n e c e s s i t y and s u f f i c i e n c y of c o n d i t i o n (4.2.24) for

closed-loop stability when conditions (b'),(d I) and (e')

are satisfied is p r o v e d as follows.

From e q u a t i o n s (3.3.35) and (4.2.18) we have that

PG = e + E Pfi (4.2.25)
i=l

where e is the n u m b e r of right half-plane zeros of eG(s),

and we also know that

Z N(fi,O) = Z Zf - Z Pf (4.2.26)
i=l i=l ii=l i

so that c o m b i n i n g these two e v p r e s s i o n s we have

Z N(fi,O) = ~iZfi + e -- PG (4.2.27)
i=l i

where i=ZlZfi'e and PG are all p o s i t i v e integers, or zero.

To e s t a b l i s h the n e c e s s i t y of c o n d i t i o n (4.2.24)

suppose that

N(fi'O) ~ -PG
i=l

then from e q u a t i o n (4.2.27) this implies that

~Zf ~O and/or e ~ 0
i=l i

and hence we conclude that condition (4.2.24) is n e c e s s a r y

for closed-loop stability.

For s u f f i c i e n c y suppose that

Z N(fi,O) = -PG
i=l

then from e q u a t i o n (4.2.27) we m u s t have that

iZl= Zf i = e = O
68

and hence the system is closed-loop stable. Thus the

sufficiency of condition (4.2.24) is established.

We have therefore shown that the following conditions

are necessary and sufficient for closed-loop stability:

(a") Z N(fi'O) = - PG ;
i=l
(b") [fi(s) : i=1,2 .... ,} have no zeros on the imaginary

axis;

(c") eG(s) has no zeros on the imaginary axis; and

(d") Pd(S) has only left half-plane zeros.

From equation (4.2.17) the image curve sets in ~ of

the Nyquist D-contour set mapped under fi(s) and kgi(s) are

simply related by a unit shift in C The stability

condition (a") can consequently be replaced by

Z
N (kgi, -i) = -PG (4.2.28)
i=l

where N(kgi,-l) is the net sum of clockwise encirclements

of the point (-l+j~ in C by the characteristic gain loci

scaled by k. As in the classical Nyquist criterion the

scaling by k is avoided by counting the number of encirclements

that the characteristic gain loci make of the critical point

1
(-~ + jO), and hence replacing condition (4.2.28) by

1
N(gi' -k) = -PG (4.2.29)
i=l

From equations, (4.2.17) we also have that condition

(b") is equivalent to
1
gi(Jw) ~-~ , i=i,2, .... i (4.2.30)

and in practice this corresponds to the characteristic

gain loci not passing through the critical point (-~+jO).

Condition (c") can also be replaced by a more practical

test. The p o l e p o l y n o m i a l for G(s) is given, equation

(3.3.35), as

H b/ (s) (4.2.31)
PG(S) = eG(s) i=l Io

where b I (s) is the pole p o l y m o m i a l for the jth c h a r a c t e r i s t i c

]o
gain f u n c t i o n gj(s). Therefore it is clear that eG(s)

will have no zeros on the i m a g i n a r y axis if, and only if,

the number of i n f i n i t e branches of the c h a r a c t e r i s t i c gain

loci is equal to the n u m b e r of poles of G(s) on the i m a g i n a r y

axis. Note that for a b r a n c h of the loci going off to

infinity there will also be a b r a n c h returning from i n f i n i t y

and care should be taken not to count this as two i n f i n i t e

branches.

The n e c e s s a r y and s u f f i c i e n t conditions for c l o s e d -

loop s t a b i l i t y can t h e r e f o r e be r e d u c e d to:

1
(i) Z N(gi' -k ) = - PG;
i=l

(2) the c h a r a c t e r i s t i c gain loci do not pass through the

critical p o i n t (-kl---+jO);
(3) the n u m b e r of b r a n c h e s of the c h a r a c t e r i s t i c gain loci

passing t h r o u g h infinity is equal to the n u m b e r of poles of

G(s) on the i m a g i n a r y axis; and

(4) Pd(S) has only left h a l f - p l a n e zeros.

This c o m p l e t e s the p r o o f of s t a t e m e n t 1 of the g e n e r a l i z e d

Nyquist s t a b i l i t y criterion.
70

If the s u b s y s t e m s are c o m p l e t e l y characterized by

their t r a n s f e r function matrices then t h e i r state-space

descriptions will each be o b s e r v a b l e and c o n t r o l l a b l e so

that

p d I (s) = p d~ (s) = .... = p d h(s) = i (4.2.32)

and h e n c e

Pd(S) = Px(S) (4.2.32)

Condition (4) of s t a t e m e n t 1 of the N y q u i s t criterion is

then e q u i v a l e n t to Px(S) having only left h a l f - p l a n e zeros.

This situation also arises if the subsystems are c h a r a c t e r i z e d

by their state-space descriptions and we have the a d d i t i o n a l

information that each subsystem has no u n o b s e r v a b l e or

uncontrollable modes in the right h a l f - p l a n e or on the

imaginary axis. In these situations conditions (i) and

(4) can be combined to give a c r i t e r i o n which needs no

information about decoupling zeros.

To s h o w this we w i l l first of all p r o v e that when

(i) Pd(S) = Px(S) (4.2.34)

(ii) Pd(S) = Px(S)P~ (S)Pd 2( s ) .... p d h (s)

(4.2.35)

where the zeros of {pd (s) : i=1,2, .... h} are

all in the left h a l f - p l a n e ,

conditions (a'), (c I) and (e I ) are e q u i v a l e n t to

h
N(fi,O) = -Z PG (4.2.36)
i=l i=l
i
where PG~ is the n u m b e r of r i g h t h a l f - p l a n e p o l e s of Gi(s).

The n e c e s s i t y and s u f f i c i e n c y of c o n d i t i o n (4.2.3 6 ) for

closed-loop stability when conditions (b') and (d') are

satisfied is proved as follows.

From equations (4.2.9),(4.2.18) and (4.2.31) we have

that

h
PG = e + Z Pfi + P (4.2.37)
i=l ~ i=l x

where Px is the number of right half-plane zeros of Px(S),

and combining this with equation (4.2.26) we have that

h
i=iZN(fi,O) = i~l Zfi + e + Px - i~iPGi (4.2.38)

Note also that

Pd(S) = Px(S)Pd] (s) .... Pdh(S)

so that if Pd denotes the number of right half-plane

zeros of Pd(S) we have that

Pd = Px (4.2.39)

and equation (4.2.38) becomes

h
Z N ( f i , O ) = ~ l Z f i + e + Pd - 2 PGi (4.2.40)
i=l i i=l

To establish the necessity of condition (4.2.36)

suppose that
h
7 N(fi,O ) ~ -
i=l il~--1PGi
then from equation (4.2.40) this implies that

Z Zfi ~ O, or e ~ O, or Pd ~ 0
i=l

or any combination of these and thus that the system will

be closed-loop unstable. Hence we conclude that condition

(4.2.36) is necessary for closed-loop stability.

For sufficiency suppose that

h
Z N(fi,O ) = - __ZIPGi
i=l i
72

then from e q u a t i o n (4.2.40) we m u s t have that

i~iZfi = e = Pd = O

and hence the s y s t e m is c l o s e d - l o o p stable. Thus the

sufficiency of c o n d i t i o n (4.2.36) is e s t a b l i s h e d .

We have t h e r e f o r e shown that w h e n e i t h e r condition

(4.2.34) or c o n d i t i o n (4.2.35) is s a t i s f i e d the f o l l o w i n g

are n e c e s s a r y and s u f f i c i e n t for c l o s e d - l o o p stability.

h
(a"~ Z N(f i O) = - Z PG ;
i=l ' i=l

(b"') {fi(s) : i = 1 , 2 , . . . , } have no zeros on the i m a g i n a r y

axis; and

(c i'') eG(s) has no zeros on the i m a g i n a r y axis.

But we have already shown, in p r o v i n g statement 1 of the

s t a b i l i t y criterion, that

Z N(fi,O) ~ Z N(gi,-!)
k
i=l i=l

that c o n d i t i o n (b"') is e q u i v a l e n t to the c h a r a c t e r i s t i c

gain loci not p a s s i n g through the c r i t i c a l point (-~+jO);

and that c o n d i t i o n (c"') is e q u i v a l e n t to the n u m b e r of

infinite branches of the c h a r a c t e r i s t i c gain loci b e i n g

equal to the n u m b e r of poles of G(s) on the i m a g i n a r y axis.

This t h e r e f o r e completes the p r o o f of s t a t m e n t 2 of the

generalized Nyquist stability criterion.

4.3 Example

To i l l u s t r a t e the s t a b i l i t y criterion consider the

example already u s e d in s u b - s e c t i o n s 3.3-5 and 3.4-2, where

the g e n e r a l feedback configuration is c h a r a c t e r i z e d by its

open-loop gain m a t r i x
73

0s l
1.25(s+i) (s+2)
i s i-6s s-2
1
By d e f i n i t i o n the s y s t e m has no u n o b s e r v a b l e or u n c o n t r o l l a b l e

modes and by v i r t u e of the p r o b l e m statement we can c o n s i d e r

the c o n f i g u r a t i o n as c o n s i s t i n g of just one subsystem Gl(S)=G(s).

Therefore e i t h e r statement 1 or s t a t e m e n t 2 of the s t a b i l i t y

criterion is d i r e c t l y applicable.

The pole p o l y n o m i a l for G(s) is

PG(S) = (s+l) (s+2)

which has no zeros in the right h a l f - p l a n e or on the i m a g i n a r y

axis and t h e r e f o r e closed-loop stability is e n s u r e d if the

net sum of a n t i - c l o c k w i s e encirclements of the c r i t i c a l

point (-~+jO) by the c h a r a c t e r i s t i c gain loci is zero, and

if the c h a r a c t e r i s t i c gain loci do not pass through the

critical point and have no infinite branches. The c h a r a c t e r i s t i c

gain loci are shown in figure 17 from w h i c h the f o l l o w i n g

stability c o n d i t i o n s are obtained.

1
(i) For -~< -~ < -0.8 there are no e n c i r c l e m e n t s of, or

passage through, the c r i t i c a l point and thus the c l o s e d -

loop system is stable for O ~k<1.25.

1
(ii) For -~ = -0.8 the c h a r a c t e r i s t i c gain loci pass t h r o u g h

the c r i t i c a l p o i n t and t h e r e f o r e the system is c l o s e d -

loop u n s t a b l e for k = 1 . 2 5

(iii) For - O . 8 < - ~ < - O . 4 there is one c l o c k w i s e encirclement

of the c r i t i c a l point and t h e r e f o r e the s y s t e m is c l o s e d -

loop u n s t a b l e for 1 . 2 5 < k < 2 . 5

74
Im
60 __
",=0-5
jl.O0. ~=1-0

]o.5o

~=2.0.
S ---,,-.- 00

~=0.05. f 60
60

-I.00 -0-50 ~= 20'0 IRe

~=-0.05 60 ~=1-0
~ =-2.,

-]o.5o

~ j l O=_o.s
" 0 = -I.0

Figure17. C h a r a c t e r i s t i c gain loci

1
(iv) For -~ = -0.4 the characteristic gain loci pass through

the critical point and therefore the system is closed-loop

unstable for k=2.5.

(v) For -0.4<-~<0 there are no encirclements of, or passage

through, the critical point and thus the closed-loop system

is stable for 2.5<k<~.

1
(vi) For -~=O the characteristic gain loci pass through

the critical point and therefore the system is closed-loop

unstable for k =~.

(vii) For 0<-~<O.533 there are two clockwise encirclements

of the critical point and therefore the system is closed-

loop unstable for -~<k<-1.875.

(viii) For -ki--=O.533 the characteristic gain loci pass through

the critical point and therefore the system is closed-loop

unstable for k=-1.875.

(ix) For O . 5 3 3 < - ~ there are no encirclements of, or

passage through, the critical point and thus the closed-

loop system is stable for -1.875<k~0.

Note that the conditions where k is negative correspond

to positive feedback.

References

i] H. Nyquist, "The Regeneration Theory", Bell System Tech. J.,

ii, 126-147, 1932.

2] A.G.J. MacFarlane, "Return-difference and return-ratio matrices

and their use in analysis and design of multivariable
feedback control systems", Proc. IEE, 117, 2037-2049, 1970.

~3] A.G.J. MacFarlane and J.J. Belletrutti, "The characteristic

Locus Design Method", Automatica, 9, 575-588, 1973.
76

4] J.F. Barman and J. Katzenelson, Memorandum ERL-383, Electronics

Research Laboratory, College of Engineering, Univ. of California r
Berkeley, 1973.

5] J.F. Barman and J. Katzenelson, "A generalized Nyquist-type

stability criterion for multivariable feedback systems",
Int. J. Control, 20, 593-622, 1974.

[6] A.G.J. MacFarlane and N. Karcanias, "Poles and zeros of linear

multivariable systems: a survey of the algebraic, geometric
and complex variable theory", Int. J. Control, 24, 33-74,
1976.

7] C.A. Desoer and W.S. Chan, "The Feedback Interconnection of

Lumped Linear Time-invariant Systems", J. Franklin Inst.,
300, 335-351, 1975.

8] A.G.J. MacFarlane, "Dynamical System Models", Harrap,

London, 1970.

~9] E. Hille, "Analytic Function Theory", Vol. i, Ginn and Co.,

U.S.A., 1959.
77

5. A generalized inverse Nyquis t stability criterion

In this chapter a g e n e r a l i z a t i o n of the inverse Nyquist

stability criterion [i] for single-input single-output

feedback systems is developed for the general feedback

c o n f i g u r a t i o n which is complementary to the exposition

of the g e n e r a l i z e d Nyquist stability criterion p r e s e n t e d

in chapter 4. The development is based on an association

of the o p e n - l o o p gain m a t r i x G(s) with a set of inverse

characteristic gain functions, and a corresponding set of

inverse Nyquist diagrams which will be termed the inverse

characteristic gain loci.

5.1 Inverse c h a r a c t e r i s t i c , ~ain functions

A main feature of the g e n e r a l i z a t i o n of N y q u i s t ' s

stability criterion, given in chapter 4, is the association

of a set of algebraic functions - the characteristic gain

functions - with a square transfer function m a t r i x G(s)

by means of the characteristic equation

A(g,s) = d e t [ gl m - G(s) ] = O (5.1.1)

-i
If G(s) has normal rank m then its inverse function G(s)

exists, and has a corresponding characteristic equation

given by

A(g,s) = det[gI m G(s) -I] = O (5.1.2)

If A(g,s) is regarded as a polynomial in g with coefficients

which are rational functions of s then equation (5.1.2)

defines a set of algebraic functions {gi(s) : i=1,2,...,}

w h i c h will be called the inverse characteristic gain functions.

If A(g,s) is also irreducible over the field of rational

functions in s, then the inverse characteristic gain function

g(s), like g(s),has as its domain an appropriate Riemann

surface formed out of m copies of the complex s-plane

suitably joined together. In fact, because the eigenvalues

of a matrix are reciprocal to the eigenvalues of the matrix

inverse

g(s) = 1 (5.1.3)
g(s)

and therefore g(s) and g(s) have the same branch points,and

hence the same Riemann surface domains.

The inverse characteristic gain function is the foundation

on which the generalized inverse Nyquist stability criterion

is based.

5.2 Pole-zero relationships

In this section a number of pole-zero relationships

are derived which will be used in the proof of the generalized

inverse Nyquist stability criterion, section 5.4.

Because the eigenvalues of a matrix are the reciprocal

of the eigenvalues of the matrix inverse the poles of the

set of inverse characteristic gain functions are simply

the zeros of the characteristic gain function set, and vice

versa. The pole and zero polynomials of gi(s), denoted

by pg(s) and Zg(s), are therefore expressible as

pg.* (s) = z~=.(s) (5.1.4)

1 1

and

~ (s) = pg (s) (5.1.5)

1 1
i=l,2,...,i

From sub-section 3.3-2 it is clear that for an open

-loop gain matrix G(s) there exists a canonical form, the

Smith-McMillan form [2 ] , such that

G(s) = HG(S)MG(S)JG(S) (5.1.6)

where HG(S) and JG(S) are both mxm unimodular matrices and

the Smith-McMillan form MG(S) is given by

MG(S) = diag [ ~l
(s)1(s) ~2
(s)e2(s)-' ~m--~
)em(s) ] (5.1.7)

Consequently the poles and zeros of G(s) are defined as

m
PG(S) = H ~i(s) (5.1.8)
i=l

and
m
ZG(S) = H ei(s) (5.1.9)
i=l

Now if G(s) is of normal rank m equation (5.1.6) can be

inverted to give
G(s) -1 = JG(S)-IMG(S)-IHG(S) -I (5. i. lO)

which using the elementary transformation matrix E, of order

m, and given by
O O ... 0 1

O O ... 1 O

E = : . = E
-i (5.1.11)

O 1 ... O O

1 O ... O 0

can be rewritten as
-i -i (5.1.12)
G(s) = JG (s) -IEEM G (s) -IEEHG (s)

or
G(s) -I = H~(s)M6(s)J~(s) (5.1.13)
80

where
H~(s) = JG(S)-iE (5.1.14)
-i
J~Cs) = EHG(S) (5.1.15)

and
M~(s) = EMG(S)-IE

[ ~m (s) ~m-i (s) . ~I (s) ] (5.1.16)

= diag e~-~) ' em_l(S) ' " "' el(S )

which is the Smith-McMillan form for G(s) -I. The pole-

zero definitions for a transfer function matrix [sub-
section 3.3-2] can now be applied to G-I(s) with the result
that
m
p~(s) -- i=iH s.1(s) = zG(s) (5.1.17)

and
m
z~(s) = H ~i(s) = PG(S) (5.1.18)
i=l
where p*(s)G and z~(s) denote the pole and zero polynomials

of G(s) -i respectively.
In sub-section 3.3-3 it is shown that the pole and
zero polymomials of G(s) are related to the poles and zeros
of the characteristic gain function set {gi(s) :i=1,2,...,} by

PG(S) = eG(s)i=l
K b ti(s) = eG(s)i~iPgi (s) (5.1.19)

and

ZG(S) = eG(s) K b ! (s) = eG(s) ~ z (s) (5.1.20)

i=l iti i=l gi

If the pole and zero polynomials of the complete characteristic

gain function set are denoted by pg(S) and Zg(S) respectively,
and similarly for the inverse characteristic gain function set
81

using pg(s) and z(s),

g then relationships (5.1.19) and

(5.1.20) can be combined with relationships (5.1.17) and

(5.1.18) to give

PG(S) = z~(s) = eG(S)pg(S)=eO(s)z~(s) (5.1.21)

and

ZG(S) = P*(s)=eG(S)Zg(S)G = eG(s)p~(s) (5.1.22)

These pole-zero relationships will be used in the proof of

the inverse Nyquist stability criterion presented later.

5.3 Inverse characteristic ~ain loci - generalized

inverse Nyquist diagrams

The inverse characteristic gain loci are the loci in

the complex plane traced out by the reciprocal of each of

the eigenvalues of the open-loop gain matrix G(s) as s

traverses the Nyquist D-contour in the standard (clockwise)

direction. The algorithm for computing the loci follows

that given for the characteristic gain loci, in sub-section

3.4-1, with the modification that at step (iv) the reciprocal

of the numbers gi(jwa) are plotted in the complex plane.

We are now in a position to state and prove the

generalized inverse Nyquist stability criterion.

5.4 Generalized i n v e r s ~ Nyquist stability criterion

If the subsystems of the general feedback configuration

of figure 3 are each characterized by a state-space model

then the following statement of the criterion is applicable.

Statement i. The general feedback configuration is closed-

loop stable if and only if:

(la) the net sum of anti-clockwise encirclements, of the

critical point (-k+jo), by the set of inverse characteristic

gain loci, minus the net sum of anti-clockwise encirclements,

of the origin, by the set of inverse characteristic gain loci,

is equal to the number of right half-plane poles of G(s);

(2) the inverse c h a r a c t e r i s t i c gain loci do not pass

through the critical point (-k+jo) ; and

(3a) the number of branches of the inverse characteristic

gain loci that pass t h r o u g h the origin is equal to the number

of poles of G ( s ) o n the imaginary axis; and

(4) the eigenvalues of the A- matrix, of the open-loop

system S(A,B,C,D), which correspond to modes of the system

which are unobservable and/or uncontrollable from the point

of v i e w of considering the input as that of the first subsystem

and the output as that of the hth subsystem, are all in the

left half-plane.

Alternatively conditions (la) and (3a) m a y be replaced

by:

(ib) the net sum of anti-clockwise encirclements of the critical

point (-k+jo) by the set of inverse characteristic gain loci

is equal to the n u m b e r of right half-plane zeros of G(s);

(3b) the number of branches of the inverse c h a r a c t e r i s t i c

gain loci passing through infinity is equal to the number of

zeros of G(s) on the imaginary axis.

If the subsystems are completely c h a r a c t e r i z e d by their

transfer function matrices, or if it is known that for each

subsystem there are no unobservable and/or uncontrollable modes

in the right h a l f - p l a n e including the imaginary axis, then

the following statement of the criterion applies.

Statement 2 The general feedback configuration is closed-loop stable

if and only if:

(la) the net sum of anti-clockwise encirclements of the

critical point (-k+jo), by the set of inverse charactezistic

gain loci, minus the net sum of anti-clockwise encirclements,

of the origin, by the set of inverse c h a r a c t e r i s t i c gain loci,

is equal to the total number of right half-plane poles of

Gl(S),G2(s), .... , and Gh(S);

(2) the inverse characteristic gain loci do not pass through

the critical point (-k+jo); and

(3a) the number of branches of the inverse characteristic gain

loci that pass through the origin is equal to the number of

poles of G(s) on the imaginary axis.

Alternatively condition (3a) m a y be replaced by condition

(3b), as in statement 1 of the criterion, and if the subsystems

are square, i.e. have the same number of outputs as inputs,

then condition (la) m a y be replaced by:

(ib) the net sum of anti-clockwise encirclements of the

critical point (-k+jo) by the set of inverse characteristic

gain loci is equal to the total number of right half-plane

zeros of Gl(S), G2(s) . . . . , and Gh(S).

Note that if condition (2) and/or condition (3a)/(3b) do

not hold then the closed-loop system has one or m o r e poles

on the imaginary axis and is therefore not input-output

stable although the e q u i l i b r i u m state at the origin may be

stable,

For strictly proper s y s t e m ~ t h a t is, ones in which the

system D-matrix is zero, the inverse c h a r a c t e r i s t i c gain loci

approach infinity as s approaches infinity, and s=~ is

a pole of the inverse function. Therefore, in practice, it

is n e c e s s a r y to traverse the whole of the D-contour, not

just the imaginary axis, in order to obtain closed curves.

In this way the net sum of encirclements of the critical

point and the origin by the inverse characteristic gain loci

can be obtained. As an example, the inverse characteristic

gain loci for

G(s) = 1 (5.4.1)
3
(s+l)

are shown in figure 18.

The traversal of the D - c o n t o u r off the imaginary axis,

however, is not n e c e s s a r y if we use conditions (la), rather

than (ib), in both statements of the stability criterion.

This is because the extra loci, corresponding to the circular

part of the D-contour, encircle the origin and the critical

point the same number of times, and thereby cancel each other.

A useful rule, when using the (la) conditions, is therefore

to join up the corresponding loose ends of the loci via a

large semi circle, and to forget any extra encirclements that

m a y actually exist. W h e t h e r the ends are joined via the

right half-plane or left half-plane is fixed by the fact that

the mapping under g(s), from the set of Nyquist curves formed

on the Riemann surface of g(s) (by piecing together an appropriate

w
set of Nyquist D-contours) to the g-plane, is conformal. This

means that if we imagine two people, X and Y, X walking along

the domain curve, and Y walking along the corresponding image

curve, such that at each step X defines the corresponding

image point, then if x turns to his right, Y will turn to hi__~

right.
85

*I~h
20~

~j_ plane "~x~=.2. 4 I0

$=0
J
- 3;0 - 2~0

-20.

(o)

I.O

/ -~
,o( o,:: f.,~ = 6 o
-0.5 0.5 l.~l
~ I ~ ,03

(b)

Figure18. Inverse charact.eris!,ic gain

Loci for G(s)= 1 / (s.l) 3
((:1)small area around origin
(b) large a r e a
86

5.5 Proof of gene!alize d inverse Nyquist stabilit Y criterion

For the general feedback configuration of figure 3 the

closed-loop transfer function matrix R(s) is given by the

relationship

R(s) = kG(s) [ Im4kG(s)]-i (5.5.1)

which after inverting becomes

-i -1 (5.5.2)
R(s) = 1 G(s) + I
m

If we now denote the set of algebraic functions defining the

eigenvalues of R(s) -I by {~i(s) :i=l,2,...,Z} we can apply

the eigenvalue shift theorem[3] to equation (5.5.2) with

the result that

~. (s) = 1 gi(s)+ 1 (5.5.3)

i=I,2, . . . ,

Then if the poleand zero polynomials for the complete set of

algebraic functions {ri(s):i=l,2,...,Z} are denoted by the

monic polynomials Pr* (s) and *

Zr(S) respectively, we have that

~r(S) = ~g(s) (5.5.4)

Now post-multiplying equation (5.5.2) by kG(s) we have

MR (s)-lG (s) = Im+kG (s)

(5.5.5)
= r(s)

the return difference matrix [4] , and taking determinants of

this we obtain
det F(s) = k m det[ s -l]
det[G(s) -I]

=y z~(s) * (5.5.6)
g( )
87

where y is a scalar constant independent of s. But

det F(s) = S ~(s) = 8 zf(s) (5.5.7)

PF{S) pf ('s)

where pf(s) and zf(s) are respectively the monic pole and

zero polynor~als for the set of algebraic functions {fi(s):i=1,2,...,}

and therefore

(5.5.8)

and

z f(s) = z*(s)
r @
p*(s)
)
pf(s) (5.5.9)

Now from equations (4.2.18) and (5.1.21)

pf(s) = pg(S) = z*(S)g (5.5.10)

and hence combining this with equations (5.5.4) and (5.5.9)

we obtain

Izf(s) = zr *(s)I (5.5.11)

In chapter 4, equation (4.2.19), it was shown that the

closed-loop characteristic polynomial can be given by

CLCP(s) = Pd(S)e~s) R d ! (s) (5.5.12)
i=l iti

or equivalently, u s i n g the notation presented in this chapter,

CLCP (s) = pd (s) eG(s) zf(s) (5.5.13)

and therefore combining this with equation (5.5.11) we have

CLCP(s) = Pd(s)es(S) Zr(S) ] (5.5.14)

This implies that the following conditions are necessary and

sufficient for closed-loop stability:

(a) eG(s) = 0 has only left half-plane roots;

(b) Zr(S) = 0 has only left half-plane roots; and

(c) Pd(S) = 0 has only left half-plane roots.

Suppose now that a Nyquist D-contour, as shown in figure

16, is drawn on m copies of the complex plane before they are

pieced together t o f o r m t h e Riemann s u r f a c e s {I~.:i=1,2,...,}

1
on which the {ri(s):i=l,2,...,~} are defined. Let us consider

the jth s u r f a c e ~ j corresponding to ~j(s). When the surface

is formed the set of Nyquist D-contours combine to form a set

of closed Jordan contours [5] enclosing right half-plane

regions o n e * . The extended Principle of the Argument

r.]

[appendix 5] can be applied to each right half-plane region

On~r* Therefore for a particular right half-plane region,

3
not necessarily simply connected but with a boundary made up

from Nyquist D-contours, we have that the difference between

the number of zeros and poles of the algebraic function rj(s)

in the region, is equal to the number of clockwise encirclements

of the origin in ~ (the complex r-plane) by t h e image o f t h e

boundary curves, under rj(s), for that particular region.

I f we t h e r e f o r e consider all the right half-plane regions and

apply the extended Argument Principle to each, we have that

N(rj, O) = Z~j - P* (5.5.15)

where:

(i) N(r~,O) is the net sum of clockwise encirclements of the

J
origin in C by the set of image curves, under ~j(s~ of the

set of curves formed on ~ * by p i e c i n g together the appropriate

rj
set of Nyquist D-contours w h e n f o r m i n g ~ . ~
3
w
(ii) Z~. is the number of right half-plane zeros of rj (s);
3
89

and

(iii) p* is the number of right half-plane poles of r.] (s).

r.
3
If we now consider the corresponding inverse characteristic

gain function gj(s) in the same way, we find that

N(gj , O) = Z~j - P~j (5.5.16)

where:
(i) N(gj,O) is the net sum of clockwise encirelements of

the o r i g i n in C by the inverse characteristic gain loci

corresponding to gj(s);

(ii) ~j is the number of right half-plane zeros of gj(s); and

(iii) ~ j is the number of right half-plane poles of gj(s).

Equations (5.5.15) and (5.5.16) can be combined to give

N(rj,O)-N(gj,O) = Z, - P, - Z, + P, (5.5.17)
rj rj gj gj

which using equation (5.5.4) becomes

N(rj,O) - N(gj,O) = Z~j - Z~j (5.5.18)

and if we consider the complete set of algebraic functions

* {*
{ri(s):i=l,2,...,} and the set gi(s) :i=l,2,...,Z}

we have
Z , ,
Z N(ri,O)- Z N(gi,O) = Z Z* - Z Z* (5.5.19)
i=l i=l i=l ri i=l gi

Now condition (b) for closed-loop stability is equivalent

to saying that there are no zeros of {ri(s):i=l,2,...,}

in the right half-plane or on the imaginary axis, and can

therefore be replaced by

N(ri,O) - N(gi,O) = - ~ Z* (5.5.20)

i=l i=l i=l gi
90

plus the c o n d i t i o n t h a t {ri(s):i=l,2,...,} have no zeros

on t h e imaginary axis. The necessary and sufficient

conditions for c l o s e d - l o o p stability can t h e r e f o r e be

rewritten as

(ai ) eG(s) = O has no right h a l f - p l a n e roots;

(b! ) eG(s) = O has no roots on the i m a g i n a r y axis;

. Z ,
(c ! ) Z N(ri,O) - Z N(gi,O) = - Z Z* ;
i=l i=l i=l gi

(d I ) {r* (s):i=i,2,...,} have no zeros on the i m a g i n a r y

i
axis; and

(e I ) Pd(S) has only left h a l f - p l a n e zeros.

The p o l e - z e r o relationship (5.1.21) n o w leads us to c o n s i d e r

combining c o n d i t i o n s (a')and(c') into the s i n g l e e q u i v a l e n t condition

. ,
N(ri,O) - Z N(gi,O) = -PG (5.5.21)
i=l i=l

where PG is the n u m b e r of right h a l f - p l a n e poles of G(s).

The n e c e s s i t y and s u f f i c i e n c y of c o n d i t i o n (5.5.21) for _

closed-loop stability when conditions (bl), (d') and (e I )

are s a t i s f i e d is p r o v e d as follows

From relationship (5.1.21) we have that

PG = e + Z Z* (5.5.22)
i=l gi

where e is the n u m b e r of right h a l f - p l a n e poles of eG(s), and

combining this w i t h e q u a t i o n (5.5.19) gives

. Z .
Z N(ri,O) - Z N(gi,O) = Z Z* + e-P G (5.5.23)
i=l i=l i=l ri

To e s t a b l i s h the n e c e s s i t y of c o n d i t i o n (5.5.21)

s u p p o s e that
, ,
N(ri,O) - 2 N(gi,O) fl -PG
i=l i=l
91

then from e q u a t i o n (5.5.23) this implies that

7. Z* ~ O and/or e ~ 0
i=l r i

and h e n c e we c o n c l u d e that c o n d i t i o n (5.5.21) is n e c e s s a r y

for c l o s e d - l o o p stability.

Fors u f f i c i e n c y suppose that

, ,
N(ri'O) - 7. N(gi'O) =-PG
i=l i=l

then from e q u a t i o n (5.5.23) this implies that

7. Z, = e = 0
i=l r.
l

and h e n c e the s y s t e m is c l o s e d - l o o p stable. Thus the sufficiency

of c o n d i t i o n (5.5.21) is e s t a b l i s h e d .

We have t h e r e f o r e shown that the f o l l o w i n g conditions

are n e c e s s a r y and s u f f i c i e n t for c l o s e d - l o o p stability:

, ~ ,
(a") ~ N(ri'O) - 7. N(gi'O) = -PG ;
i=l i=l
,
(b") {ri(s): i=i,2, .... } have no zeros on the i m a g i n a r y axis;

(c") eG(S)=O has no roots on the i m a g i n a r y axis; and

(d") Pd(S) has only left h a l f - p l a n e zeros.

F r o m e q u a t i o n (5.5.3) it is clear that

, ~ ,
7. N ( r i , O ) = 7 N(gi, -k) (5.5.24)
i=l i=l

and c o n s e q u e n t l y conditon (a") is e q u i v a l e n t to c o n d i t i o n (la)

in s t a t e m e n t 1 of the s t a b i l i t y criterion.

Also from e q u a t i o n (5.5.3) we have that {ri(s): i=i,2,...,}

have zeros on the i m a g i n a r y axis if, and only if the inverse

characteristic gain loci pass t h r o u g h the c r i t i c a l point

(-k+jO); hence condition (b") is e q u i v a l e n t to c o n d i t i o n 2

in s t a t e m e n t 1 of the s t a b i l i t y criterion.

F r o m the p o l e - z e r o relationships, (5.1.21) and (5.1.22),

condition (c") is c l e a r l y equivalent to c o n d i t i o n (3a)

and also c o n d i t i o n (3b) in e i t h e r statement of the s t a b i l i t y

criterion.

Therefore to c o m p l e t e the p r o o f of s t a t e m e n t 1 of the

stability criterion all that is left is to show that c o n d i t i o n

(ib) is e q u i v a l e n t to c o n d i t i o n (a"). To do this we will

s h o w that w h e n c o n d i t i o n s (b"), (c") and (d") are s a t i s f i e d ,

the c o n d i t i o n
, ,
E N(r i, O) = Z N(gi,-k) = - Z G, (5.5.25)
i=l i=l

where Z G is the n u m b e r of r i g h t h a l f - p l a n e zeros of G(s),

is n e c e s s a r y and s u f f i c i e n t for c l o s e d - l o o p stability.

From relationship (5.1.22) we have that

ZG = e + Z P, (5.5.26)
i=l gi

where P*~ is the n u m b e r of right h a l f - p l a n e p o l e s of gj(s),

gj , ,
but from e q u a t i o n (5.5.3) the p o l e s of ri(s) and gi(s) are

the same, and h e n c e

Z
ZG = e + Z P*r. (5.5.27)
i=l l

If we now combine equation (5.5.27) with equation (5.5.15),

which is v a l i d for { j = 1 , 2 , . . . , } , we have

Z N(ri,O)~. = Z Z*r. + e - ZG (5.5.28)

i=l i=l 1

To e s t a b l i s h the n e c e s s i t y of c o n d i t i o n (5.5.25) suppose

that
,
F N(ri,O ) ~ -Z G
i=l

then from e q u a t i o n (5.5.28) this implies that

i
iZ#l~ ~ O and/or e ~ O
i
93

and hence we conclude that c o n d i t i o n (5.5.28) is n e c e s s a r y

for c l o s e d - l o o p stability.

For s u f f i c i e n c y suppose that

,
N(ri,O) = -Z G
i=l

then from e q u a t i o n (5.5.28) we have that

Z* = e = O
i=l ri

and hence that the system is c l o s e d - l o o p stable, providing

conditions (b"), (c") and (d") are satisfied. Thus the

sufficiency of c o n d i t i o n (5.5.25) is established.

This completes the p r o o f of s t a t e m e n t 1 of the g e n e r a l i z e d

inverse Nyquist stability criterion.

Statement 2 of the s t a b i l i t y criterion applies to systems

see chapter 4] in w h i c h either

(i) Pd(S) = Px(S) (5.5.29)

(ii) Pd(S) = Px(S) (s) (s). (s)

Pd I Pd 2 "''Pd h (5.5.30)

where the zeros of {Pdi(S) :i=i,2,...,} are

all in the left half-plane.

In these situations it is possible to combine conditions (la)

and (4), of s t a t e m e n t i, into the single e q u i v a l e n t condition

* i w h
N(g i, -k) - Z N(gi,O) = - Z PG (5.5.31)
i=l i=l i=l i
94

The n e c e s s i t y and s u f f i c i e n c y of c o n d i t i o n (5.5.31) for c l o s e d -

loop s t a b i l i t y when conditions (2) and (3) are s a t i s f i e d

is p r o v e d as follows.

F r o m e q u a t i o n s (4.2.37) and (4.2.39) we have that

h
PG = e + Z P + Pd (5.5.32)
i=l i i=l f
i

But the poles of {fi(s): i~l,2,...,Z} are the same as the

poles of {gi(s): i=1,2,...,} which in turn are the same as

the zeros of {gi(s) : i = 1 , 2 , . . . , } , and t h e r e f o r e equation

(5.5.32) can be r e w r i t t e n as
h
~iPGi = e + Z Z* + Pd (5.5.33)
i i=l gi

Equation (5.5.33) can now be c o m b i n e d w i t h e q u a t i o n (5.5.19)

to give
, , h
Z N(ri,O) - Z N(gi,O) = Z Z* + e + Pd - Z=IPGi
i=l i=l i=l ri i

(5.5.34)
w h i c h u s i n g e q u a t i o n (5.5.24) b e c o m e s
, i , h
Z N(gf-k) - 2 N(gi,O) = Z Z* + e +Pd - __ZIPGi
i=l i=l i=l ri i

(5.5.35)

To e s t a b l i s h the n e c e s s i t y of c o n d i t i o n (5.5.31)

s u p p o s e that
, , h
N(gi,-k) - Z N(g,O) ~ -Z PG.
i=l i=l i=l l

then from e q u a t i o n (5.5.35) this implies that

Z Z* ~ O, or e ~ O, or Pd ~ O
i=l rl

or any c o m b i n a t i o n of these, and thus that the s y s t e m w i l l be

closed-loop unstable. H e n c e we c o n c l u d e that c o n d i t i o n (5.5.31)

is n e c e s s a r y for c l o s e d - l o o p stability.
95

For s u f f i c i e n c y s u p p o s e t h a t
, , h
N(gi,-k) - Z N(gi,O) = - Z PG
i=l i=l i=l i

then f r o m e q u a t i o n (5.5.35) we m u s t h a v e t h a t

Z Z* = e = Pd = 0
i=l r i

and h e n c e the s y s t e m is s t a b l e p r o v i d i n g conditions (2)

and (3) are s a t i s f i e d . T h u s the s u f f i c i e n c y of c o n d i t i o n

(5.5.31) is e s t a b l i s h e d .

Therefore to c o m p l e t e the p r o o f of s t a t e m e n t 2 of the

stability c r i t e r i o n all t h a t is left is to s h o w the e q u i v a l e n c e

between conditions (la) a n d (ib). To do t h i s we w i l l show

that w h e n c o n d i t i o n s (2) and (3) are s a t i s f i e d , condition

(ib) i.e.
i , , h
Z N(ri,O) = ~ N(gi,-k) = - ~ ZG. (5.5.36)
i=l i=l i=l l

where zG is the n u m b e r of r i g h t h a l f - p l a n e p o l e s of G. (s), is

i 1
n e c e s s a r y and s u f f i c i e n t for c l o s e d - l o o p s t a b i l i t y .

From equation (5.5.15), w h i c h is v a l i d for {j=1,2,...,},

Z N(ri,O) = Z N(gi,-k) = Z Z* - Z P*
i=l i=l i=l ri i=l ri

(5.5.37)

and since the p o l e s of {r. (s) : i=1,2,...,} are the same as

1
the p o l e s of {gi(s): i=1,2,...,~} equation (5.5.37) can be

rewritten as

Z N(gi,-k) = Z Z* - Z P* (5.5.38)
i=l i=l ri i=l gi

If the s u b s y s t e m s {Gi(s): i=l,2,...,h} are e a c h s q u a r e

t h e n P x ( = P d ) is the n u m b e r of r i g h t h a l f - p l a n e zeros (or

poles) w h i c h are lost t h r o u g h p o l e - z e r o cancellations when

G(s) is formed. Therefore if the n u m b e r of r i g h t h a l f - p l a n e

zeros of G(s) is g i v e n [ e q u a t i o n (3.3.36)] by

9,
ZG = e + Z Z (5.5.39)
i=l gi

t h e n we m u s t h a v e
h
ZG. = e + ~ Z + Pd (5.5.40)
i=l l i=l 9i

or, since the p o l e s of gi(s) are i d e n t i c a l l y e q u a l to the

zeros, of gi(s),
h
ZG = e + Z P* + Pd (5.5.41)
i=l i i=l g i

Consequently equations (5.5.38) and (5.5.41) can be c o m b i n e d

to g i v e
, Z h
Z N(gi,-k) = ~ Z* + e + Pd - ~iZGi
i=l i=l ri i
(5.5.42)

To e s t a b l i s h the n e c e s s i t y of c o n d i t i o n (5.5.36) suppose

that
Z . h
Z N(gi,-k) ~ - ~ ZG
i=l i=l i

then f r o m e q u a t i o n (5.5.42) t h i s i m p l i e s that

E Z* ~ O, or e ~ O, or Pd ~ O
i=l r i

or any c o m b i n a t i o n of t h e s e and t h u s that the s y s t e m w i l l be

closed-loop unstable. H e n c e we c o n c l u d e t h a t c o n d i t i o n

(5.5.42) is n e c e s s a r y for c l o s e d - l o o p stability.

For s u f f i c i e n c y s u p p o s e that
, h
N(gi,-k) = - Z ZG
i=l i=l l

then f r o m e q u a t i o n (5.5.42) we m u s t h a v e t h a t

Z* = e = Pd = O
i=l r i

and h e n c e the s y s t e m is c l o s e d - l o o p stable, providing

conditions (2) and (3) are s a t i s f i e d . T h u s the s u f f i c i e n c y

of c o n d i t i o n (5.5.36) is e s t a b l i s h e d .

This c o m p l e t e s the p r o o f of s t a t e m e n t 2 of the g e n e r a l i z e d

inverse N y q u i s t stability criterion.

5.6 Example

To i l l u s t r a t e the stability criteria consider the system

example used in c h a p t e r s 3 and 4 w h e r e

G(s) = Gl(S)= 1 [ s-i s ]

1.25(s+i) (s+2) -6 s-2

The p o l e and zero p o l y n o m i a l s for the o p e n - l o o p gain m a t r i x

G(s) are

PG(S) = (s+l) (s+2) and zG(s) = 1

so that

PG = O and Z G = O.

The i n v e r s e characteristic gain loci are shown in figure 19.

There are no u n o b s e r v a b l e and/or uncontrollable modes

and t h e r e f o r e either statement 1 or s t a t e m e n t 2 of the c r i t e r i o n

is d i r e c t l y applicable. In fact, b e c a u s e we have e f f e c t i v e l y

only one subsystem, there is no d i f f e r e n c e between the two

statements.

Testing the c o n d i t i o n s for s t a b i l i t y as o u t l i n e d by the

generalized inverse N y q u i s t stability criterion we find that

the c l o s e d - l o o p system is stable for

-1.875 < k ~ 0

O < k< 1.25

and 2.5 < k <

These bounds on the g a i n c o n t r o l variable k agree w i t h those

obtained in s e c t i o n 4.3 using the g e n e r a l i z e d Nyquist

stability criterion.

It is i n t e r e s t i n g to n o t e that an i n v e r s e N y q u i s t - l i k e
98

~.,, "%.
g-plane
ca.~ 5.o~

\ ..... . /

Figure 19. Inverse characteristic gain loci

stability criterion has recently been used by Mees and Rapp

[6] to establish stability criteria for multiple-loop nonlinear

feedback systems.

References

[1] A.L. Whiteley, "Fundamental Principles of Automatic Regulators

and Servo Mechanisms", J. IEE, 5-22, 1947.

[2] H.H. Rosenbrock, "State Space and Multivariable Theory",

Nelson, London, 1970.

[3] A.G.J. MacFarlane, "Dynamical System Models", Harrap, London,

1970.

[4] A.G.J. MacFarlane, "Return-difference and return-ratio matrices

and their use in analysis and design of multivariable
feedback control systems", Proc. IEE, 117, 2037-2049, 1970.

Is] E. Hille, "Analytic Function Theory", Vol. i, Ginn and Go.,

U.S.A., 1959.

[6] A.I. Mees and P.E. Rapp, "Stability criteria for multiple-
loop nonlinear feedback systems", Proc. IFAC Fourth Multivariable
Technological Systems Symposium, Fredericton, Canada, 1977.
6. Multivariable root loci

The Evans' root locus approach [1;2;3] is a well

established graphical technique used in the analysis and

design of linear time-invariant single-input single-output

feedback systems for estimating the system closed-loop poles

as a function of the gain control variable. Its generaliza%ion

to the multivariable case has caused considerable interest and

steps towards this end have been made with regard to the

asymptotic behaviour of the characteristic frequency loci

[4; 5] , and the angles of departure and approach of the

characteristic frequency loci [6] ; both from a state-space

point of view. In this chapter a method is developed, using

well established results in algebraic function theory [7;8;9],

which allows the asymptotic behaviour and the angles of

departure and approach of the characteristic frequency loci

to be determined from a Laplace transfer function matrix

description of the system.

It is also shown how the method can be utilised to find

the asymptotic behaviour of the optimal closed-loop poles of

a multivariable time-invariant linear regulator as the weight

on the input in the performance criterion approaches zero [ io].

The method seems particularly useful for systems with

small numbers of inputs since the calculations are then simple

enough to be carried out by hand.

6.1 Theoretical background

For the general feedback configuration of figure 3 it

has been shown in sub-section 3.3-4 that the characteristic

I01

frequency loci (multivariable root loci) are the 180 phase

contours of the characteristic gain function g(s), where g(s)

is defined via the equation

~(g,s) ~ det [gI m - G ( s ) ] = O (6.1.1)

or the equation

~(g,s) = bo(S)~(g,s) = O (6.1.2)

where we have multiplied through by the least common denominator

of the rational coefficients in s.

For simplicity of exposition, and because this is in any case

the usual situation for transfer function matrices arising

from practical situations, it is assumed that ~(g,s) is

irreducible over the field of rational functions in s. The

characteristic gain function g(s) is related to the gain

control variable k via the expression

1 (6.1.3)

so that substituting for g in equation (6.1.2) we have

~(-k-l,s) = k -m F(k,s) = O (6.1.4)

and solutions of

F(k,s) = O (6.1.5)

for s in terms of positive real k, determine the dependence

of the closed-loop poles on the gain control variable k.

Therefore, apart from possible single-point loci, the graphical

description of this dependence constitutes the characteristic

frequency loci of the given system. Note that if there are

no single-point loci then equation (6.1.5) is directly

equivalent to

CLCP(s)~det[SIn-Ac]= det[SIn-S(-k-l)] = O
(6.1.6)
I02

and the c h a r a c t e r i s t i c frequency loci have n branches.

E a c h of the n b r a n c h e s of the c h a r a c t e r i s t i c frequency

loci can in t h e o r y be r e p r e s e n t e d in the form

si(k) = ui(k) + Jvi(k) i=l,2, .... n (6.1.7)

where j= ~ and the s u b s c r i p t s i are labels for the v a r i o u s

branches. The t a n g e n t to the rth b r a n c h of the loci at a

point So=Sr(ko) is d e f i n e d as the l i m i t i n g position of the

straight line t h r o u g h s o and a n o t h e r p o i n t Sl=Sr(ko+~k) as

s I approaches s o along the branch, that is as gk+O. NOW

the c o m p l e x n u m b e r Sl-S can be r e p r e s e n t e d by the v e c t o r

from s O to s I (figure 20), and the v e c t o r corresponding to

(Sl-So)./~k, w h e r e 8 k > O , has the same d i r e c t i o n as that vector.

Si= 5 (ko+6k)

i\soo
o
Figure 20. To derive a brmuta for the tangent to
the characteristic frequency loci

It follows therefore that the v e c t o r corresponding to

st(k) = dkdSr(k)Jk = ~kOlim sl~_k-so

o
103

= lim Sr(ko+~k) - Sr(k O) (6.1.8)

6kO Sk

is tangent to the branch at s and the angle e between this

o
vector and the positive real axis is given by

% = argument {Sr(ko) } (6.1.9)

Formula (6.1.9) is fundamental to the determination of

the asymptotic behaviour of the c h a r a c t e r i s t i c frequency

loci, and the angles of departure and approach of the loci.

For k=O, equation (6.1.3) implies that the c h a r a c t e r i s t i c

frequency loci start at poles of the open-loop system, and

therefore the angle of departure of a branch of the loci from

a pole is given by formula (6.1.9) w i t h ko=O. For k=~,

equation (6.1.3) implies that the c h a r a c t e r i s t i c frequency

loci terminate at system zeros. Therefore for a branch

of the loci t e r m i n a t i n g at a finite zero, formula (6.1.9),

with ko=~, gives the angle of approach of the branch to the

zero. If a branch terminates at an infinite zero then formula

(6.1.9), w i t h k =~, gives the angle that the asymptote to

o
the branch makes with the positive real axis.

Formula (6.1.9) does not at first seem to be useful,

because expressions for the separate branches of the characteristic

frequency loci cannot in general be found explicitly. However,

in algebraic function theory [7;8;9], there exists a method

for the p r a c t i c a l construction of series representations

for the branches of an algebraic function in the n e i g h b o u r h o o d

of a given point. The m e t h o d consists of repeatedly using

a "Newton diagram" to find the next most significant terms

in the series. Therefore, by using the Newton diagram just

once, approximations can be obtained for the branches of the

104

characteristic frequency loci in the v i c i n i t y of a p o l e

or zero (finite or i n f i n i t e ) , of the f o r m

s (k) ~ a + bk ~ (6.1.10)
r
where a and b are c o m p l e x n u m b e r s , ~ is a r a t i o n a l r e a l

number, and w h e n e is f r a c t i o n a l the p r i n c i p a l root of k is

understood. For the n e g a t i v e feedback configuration under

c o n s i d e r a t i o n k is real and p o s i t i v e , ~k e-I w i l l a l w a y s be

real, and t h e r e f o r e applying formula (6.1.9) to the a p p r o x i m a t i o n

(6.1.10) we h a v e

0 = argument {b} + ~180 (6.1.11)

where

[ = O when e > O
T
= 1 when ~ < 0

as a r e s u l t of the d i f f e r e n t i a t i o n with respect to k.

In the f o l l o w i n g sections it is s h o w n h o w to o b t a i n

approximations of the forn% shown in e q u a t i o n (6.1.10) in

the v i c i n i t y of a p o l e or zero, and h e n c e to d e t e r m i n e the

asymptotic b e h a v i o u r of the c h a r a c t e r i s t i c frequency loci

and a l s o the a n g l e s of d e p a r t u r e and a p p r o a c h of the loci.

6.2 Asymptoti q behaviour

The N e w t o n d i a g r a m is a g r a p h i c a l construction which

can be u s e d to find e a c h of the m o s t significant t e r m s in the

series representations for the p a r t i c u l a r b r a n c h e s of an

algebraic f u n c t i o n q(v), in the v i c i n i t y of the origin,

which approach zero as v a p p r o a c h e s zero. Therefore, whether

f i n d i n g the a s y m p t o t i c b e h a v i o u r or the a n g l e s of d e p a r t u r e

and a p p r o a c h of the loci, our f i r s t aim is a l w a y s to r e d u c e

the p r o b l e m , by a c h a n g e of v a r i a b l e s in the c h a r a c t e r i s t i c
105

equation, to one of f i n d i n g a p p r o x i m a t i o n s for b r a n c h e s of

an a l g e b r a i c function which approach z e r o as the i n d e p e n d e n t

variable approaches zero.

To d e t e r m i n e the a s y m p t o t i c b e h a v i o u r of the c h a r a c t e r i s t i c

frequency loci we n e e d to o b t a i n an a p p r o x i m a t i o n to the b r a n c h

(or b r a n c h e s ) of the loci a b o u t the p o i n t s =~, as k a p p r o a c h e s

infinity. F o r this p u r p o s e we w i l l put

-i
s=z (6.2.1)

in the c h a r a c t e r i s t i c equation (6.1.2) to o b t a i n

(g,s) = ~ (g,z -I) = z -q ~(g,z) (6.2.2)

where q $ n is the n u m b e r of p o l e s of g(s), so that in any

neighbourhood of the v a l u e z = 0 (the p o i n t z = 0 itself is

excluded f r o m the region) the e q u a t i o n # ( g , s ) = O is e q u i v a l e n t

to the e q u a t i o n

~(g,z) = E~xygXzy = O (6.2.3)

For a s t r i c t l y p r o p e r system, that is one w h e r e D is zero,

or if D is s i n g u l a r

oo = 0 (6.2.4)

o t h e r w i s e C o o is n o n - z e r o which, as we shall see later f r o m

the N e w t o n d i a g r a m , corresponds to t h e r e b e i n g no a s y m p t o t i c

behaviour. N o t e t h a t if D is n o n - s i n g u l a r g(s) has t h e same

n u m b e r of f i n i t e z e r o s as p o l e s a n d t h e r e f o r e we w o u l d not

expect any c l o s e d - l o o p p o l e s to a p p r o a c h i n f i n i t y .

The n e x t step is to c o n s t r u c t the N e w t o n d i a g r a m for

~(g,z) , from which approximations of the f o r m

z ~cg ~ (6.2.5)

can be o b t a i n e d , where c is a c o m p l e x n u m b e r , and ~ a r a t i o n a l

real n u m b e r . The p r o c e d u r e for c o n s t r u c t i n g the a p p r o p r i a t e

N e w t o n d i a g r a m is as f o l l o w s [8].

In a ( g , z ) - p l a n e we p l o t the p o i n t s (x,y) for w h i c h

106

~xy ~ 0 in equation (6.2.3). As an example, the points for

(l-z+2z2-25z3+29z4)g 2 + (z-22z2+199z3-21Oz4)g

-(33z3-594z 4) = O (6.2.6)

are shown in figure 21. A straight line is then made to

coincide with the horizontal axis and rotated clockwise about

the smallest g-axis point P (the point (2,0) on figure 21)

o
until this line passes through another point P1 of our net.

A straight line is then drawn between the points Po and PI"

A horizontal line through P 1 ' and pointing away from P1

towazds the vertical z-axis, is then rotated clockwise about

the point P1 until it passes through another point P2 of

our net. A straight line is then drawn between the points

P1 and P2" The procedure is repeated until the vertical

z-axis is reached. The complete Newton diagram for equation

(6.2.6) is shown in figure 22.

The tangent of the acuteangle which the straight line

PoP1 makes with the vertical z-axis determines the first

possible value of ~ , and the tangent of the acute angle which

the straight line PIP2 makes with the verticle z-axis determines

the second possible value of p , etc. For the Newton diagram

of figure 22 we have U 1 = 1 and ~2 = "

For a particular exponent Pt there may be several approxi-

mations of the form

~t
zit-citg (6.2.7)

Note that if some root of g is implied by ~t i.e. Pt is a

fraction, then it is understood that the principal root is

being considered. To determine the coefficients cit it

107

Zl
4,

C i 2 a
tJ

Figure 21. Points of the Newton diagram

for equation (6.2.6)
z
4

3P,

0
2 a

Figure 22. Complete Newton diagram

for equation (6.2.6)
108

~t
is n e c e s s a r y to s u b s t i t u t e z = cg in the terms of

equation (6.2.3) corresponding to the p o i n t s of our net

lying on the link Pt-i Pt ' to e q u a t e to zero the result of

the s u b s t i t u t i o n and to solve the r e s u l t i n g equation. Let

the sum of the relevant terms of e q u a t i o n (6.2.3) have the

form

x y~ gXlzY 1
x g z + ..... (6.2.8)
oy~ "+ ~xlY 1

w h e r e y > ......... >YI' and ~ is a p o s i t i v e integer less than

the n u m b e r of i n f i n i t e zeros of the system. Then b e c a u s e

the terms correspond to the same link we have

x~-x I Xo_l-Xl _x 2 - x 1
Yc-Yl - YO-I-Yl ........ Y2 - Yl ~t

Therefore all the terms of the e x p r e s s i o n (6.2.8) become

Ut
similar as a r e s u l t of the s u b s t i t u t i o n z = cg , and h e n c e

for the d e t e r m i n a t i o n of the c o e f f i c i e n t cit we o b t a i n an

equation of t h ~ a f o r m
Yl
~xgy c + ........ + ~XlY 1 c = O (6.2.9)

Yl
or d i v i d i n g by c (c ~ O) we o b t a i n
Y~-Yl
~xay c + .... +~XlY 1 = 0 (6.2.10)

w h i c h has y a - y I solutions. Note that Yo-Yl is the

difference between the o r d i n a t e s of the p o i n t s Pt and Pt-l"

Consequently it is equal to the p r o j e c t i o n of the link P t _ i P t

on to the o r d i n a t e axis. This is a u s e f u l fact since it

enables us to see at a g l a n c e how m a n y coefficients there are

corresponding to the e x p o n e n t ~t" A l s o the o r d i n a t e associated

w i t h the p o i n t w h e r e the final link reaches the v e r t i c a l axis

109

must give the total number of coefficients for all the

approximations considering all exponents. This vertical

axis point on the final link is therefore the number of

infinite zeros of the system.

Having obtained the approximations in the form of equation

(6.2.7) it is a simple matter of substitution from equations

(6.1.3) and (6.2.1) to obtain the required form

~t
sit - bit k (6.2.11)

representing approximations to the characteristic frequency

loci about s =~, as k~. If we now apply formula (6.1.9) we

obtain the angles which the asymptotes make w i t h the positive

real axis.

The asymptotes, as will be shown in sub-section 6.2-1,

group into B u t t e r w o r t h configurations. Each pattern has

a common intercept which has been termed the "multivariable

pivot" by Kouvaritakis and Shaked [4] who also gave a m e t h o d

for calculating the pivot given the state space description

of the system. In appendix 6 it is shown how the m u l t i v a r i a b l e

pivots can be derived from the characteristic equation

~(g,s) = O.

6.2-1 B u t t e r w o r t h Patterns

In this sub-section it is proved that the closed-loop

poles that go off to infinity do so along asymptotes which

group into several B u t t e r w o r t h configurations. The proof

is based on well e s t a b l i s h e d results in algebraic function

theory [8]

Let us consider the closed-loop c h a r a c t e r i s t i c polynomial

defined by equation (6.1.5) i.e.

110

F (k,s) = O (6.2.12)

in w h i c h we a l l o w k to be complex. Also suppose that at a

point k c the a l g e b r a i c function s(k) defined by e q u a t i o n

(6.2.12) has a p- fold root So, and let the n branches of s(k)

in a n e i g h b o u r h o o d N of k be {s.(k):i=l,2,...,p,...,n}
c l

where {si(kc)=Sc:i=l,2,...,p}.

If we a n a l y t i c a l l y continue any one of the b r a n c h e s

{si(k) : i=p+l,...,n} around k c in N, then after one e n c i r c l e m e n t

we return to the o r i g i n a l function. If, however, we a n a l y t i c a l l y

continue Sl(k) around the critical point k c then after one

encirclement we obtain one of the functions {si(k) : i=2,3,...,p} ,

s2(k) say, and after another encirclement one of the functions

{si(k ) : i=3,4,...p}, s3(k ) say. After a finite number ~i' of

encirclements we return to the original function Sl(k).

In this case we shall say that the functions

Sl(k) t . . . . , S~l (k) (6.2.13)

constitute a cyclical system of b r a n c h e s of the a l g e b r a i c

function s(k) (in the n e i g h b o u r h o o d N of kc).

If 91<p we take the function s l+l(k) and a n a l y t i c a l l y

continue it around k in N. Repeating the p r e v i o u s arguments

o
we obtain a second cyclical system of branches, namely

S~l+l(k)' .... ' s~2(k) (6.2.14)

Finally all the functions Sl(k), ..... Sp(k) will be sorted into

cyclical systems.

The functions of each cyclical system pass consecutively

into one another according to the cyclical law when k goes

111

round the m u l t i p l e p o i n t k c. Hence every cyclical system

of b r a n c h e s defines in the n e i g h b o u r h o o d N of k an analytic

c
function. The n u m b e r of v a l u e s g i v e n by this a n a l y t i c

function at a p o i n t in the n e i g h b o u r h o o d N is equal to the

n u m b e r of b r a n c h e s comprising the c y c l i c a l s y s t e m being

considered. When a single b r a n c h comprises a cyclical system,

the c o r r e s p o n d i n g analytic f u n c t i o n w i l l be s i n g l e - v a l u e d

and i d e n t i c a l w i t h this branch.

Suppose the n e i g h b o u r h o o d N is r e p l a c e d by a p i l e of

n such regions, one for each of the f u n c t i o n s {si(k) : i=l,2,...,n}

and let a p o i n t on the ith sheet r e p r e s e n t the p a i r [k,si(k) ] .

Also suppose that each n e i g h b o u r h o o d N is c i r c u l a r and has a

radial line cut from the c r i t i c a l point k to its p e r i p h e r y ,

c
as shown in figure 23. Then the n e i g h b o u r h o o d s can be j o i n e d

together along the edges of the cuts so that if we a n a l y t i c a l l y

continue the jth b r a n c h (j<p) around k c on the jth sheet,

we move after one e n c i r c l e m e n t across the cut into the (j+l)th

b r a n c h on the (j+l)th sheet, etc. eventually returning to

the jth branch. The pile of n e i g h b o u r h o o d s obviously

falls into cycles containing one or m o r e sheets as shown in

figure 24, and each cycle is a d o m a i n for one of the a n a l y t i c

functions defined in the n e i g h b o u r h o o d N of k .

To d e n o t e an a n a l y t i c function determined by a c y c l i c a l

system we w i l l use a Roman numerical subscript. For e x a m p l e

the a n a l y t i c function corresponding to the c y c l i c a l system

Sl(k) , .... , s l(k) w i l l be d e n o t e d by the symbol si(k).

When we c o n s i d e r the a s y m p t o t i c behaviour of the

112

L cut

Figure 23. Neighbourhood N of the

critical point k c

Figure 24. Cyctes of neighbourhoods for

the critical point k c

characteristic frequency loci, we are concerned with the

branches of the algebraic function s(k) which approach

infinity, as k approaches infinity. Suppose we have p

infinite branches for k =~, then by the previous arguments

the p branches will be arranged into several cyclical

systems. Each cyclical system is defined by an analytic

function, the form of which is given by the f o l l o w i n g

theorem [8, page 39]

Theorem 4

The infinite branches of the algebraic function s(k),

constituting a cyclical system in the neighbourhood of its

critical point kc=~, in their totality constitute an analytic

113

function given by a series of the form

1 -i -2
s I = k 9 (bO + b I k V +b2k ~ +...) (6.2.15)

where v is the number of branches in the cycle, and I is a

positive integer.

Consequently the infinite branches corresponding to a

particular cyclical system in the n e i g h b o u r h o o d of k =~, can

be a p p r o x i m a t e d by an analytic function of the form

1
m

s I (k) ~bok~ (6.2.16)

The characteristic frequency loci are values of the

algebraic function s(k) for k going from zero to infinity

along the positive real axis. Therefore the infinite

branches of the frequency loci will occur in cyclical

systems which can be a p p r o x i m a t e d by equations of the form

(6.2.16). Taking the ~ roots of k in equation (6.2.16)

we find that the asymptotes corresponding to a cyclical

system arrange themselves into a B u t t e r w o r t h configuration

of order ~ ; that is,we have ~ asymptotes equally spaced

.360.
by angles of (--~-), in the complex plane.

In general, as already discussed, there will be several

cyclical systems of branches in the n e i g h b o u r h o o d of k = ~,

and therefore the asymptotes of the characteristic frequency

loci will arrange themselves into several B u t t e r w o r t h

configurations.

6.3 Angles of d e p a r t u r e and approach

To determine the angles of departure and approach of

the characteristic frequency loci we need to obtain approximations

to the branches of the loci at the poles and finite zeros

114

of the system. We w i l l first consider the angles of

departure f r o m the o p e n - l o o p poles of the system.

Suppose for the characteristic equation

~(g,s ) = O (6.3.1)

we have a pole (or m u l t i p l e pole) at s = ~. We w i l l m a k e

s'=s-8 the n e w independent variable for e q u a t i o n (6.3.1)

and also make the substitution g = d -1, so t h a t equation

(6.3.1) becomes

~(g,s) = ~(d-l,s'+B) = d - m H ( d , s ') = O (6.3.2)

The situation we are e x a m i n i n g now reduces to the case w h e r e

d = s' = O, and in the n e i g h b o u r h o o d of s = 8 (excluding

itself) equation #(g,s) = O is e q u i v a l e n t to the e q u a t i o n

H(d,s') = Z ~ x y d X s 'y = O , to o = O (6.3.3)

If w e construct the N e w t o n diagram for the e q u a t i o n

Z(d,s') = O an a p p r o x i m a t i o n (or a p p r o x i m a t i o n s in the case of

multiple poles) of the form

s' ~ ed ~ (6.3.4)

is o b t a i n e d , where e is a c o m p l e x number and m a r a t i o n a l

real number. From equation (6.1.3), the change of v a r i a b l e ,

a n d the substitution g = d -I , we therefore arrive at an

approximation to the b r a n c h (or b r a n c h e s ) of the c h a r a c t e r i s t i c

frequency loci departing from the p o l e B , in the form

Hd
s - B + bdk (6.3.5)

If w e n o w a p p l y formula (6.1.9) the angle of d e p a r t u r e 8d is

given as

ed = argument {b d} (6.3.6)

We w i l l now consider the angles of approach to the finite

115

zeros of the system. Suppose for the characteristic

equation (6.3.1) we h a v e a zero (or m u l t i p l e zero) at s = y .

We w i l l take s' = s-y as the n e w i n d e p e n d e n t variable so

that the e q u a t i o n becomes

~(g,s) = ~(g,s'+y) = x(g,s') = O (6.3.7)

The s i t u a t i o n reduces to the oase w h e r e g = s' = O, and in

the n e i g h b o u r h o o d of s = 7 (excluding y itself) equation

~(~s) = O is e q u i v a l e n t to the equation

gXs'Y= O , = 0 (6.3.8)
x(g's') = ~xy oo

If we construct the N e w t o n diagram for x(g,s') an

approximation of the form

s' ~ pg~ (6.3.9)

is o b t a i n e d where p is a c o m p l e x number a n d n is a r a t i o n a l

real n u m b e r . From equation (6.1.3) a n d the c h a n g e of v a r i a b l e

we therefore arrive at an a p p r o x i m a t i o n to the b r a n c h (or

branches) af the characteristic frequency loci approaching

the zero s = y , in the form

s - y + b ak~a (6.3.10)

If w e n o w a p p l y formula (6.1.9) the angle of a p p r o a c h 8 a is

given as

ea = argument {b a} ~ 180 (6.3.11)

Note that Ua w i l l always be n e g a t i v e and h e n c e the presence

of the 180 t e r m in f o r m u l a (6.3.11). Also note that this

definition for the angle of a p p r o a c h differs by 180 f r o m the

usual definition which is the d i r e c t i o n you would look, when

positioned at the zero, to see the locus arrive.

6.4 Example 1

Consider an o p e n - l o o p gain matrix

116

0s l [3s3+s21s 8s22s+l 1
s4+5s3-2s2-44s+40 s3+79s2+44s-868 -4s3-4s2+4Os-32

w h i c h has a c h a r a c t e r i s t i c e q u a t i o n

~(g,s) = ( s 4 + 5 s 3 - 2 s 2 - 4 4 s + 4 0 ) g2 + (s3+l16s-432)g

-12 (s2-2s+2) = O

(a) T_o d e t e r m i n e the a s y m p t o t i c b e h a v i o u r

-i
Putting s = z in the c h a r a c t e r i s t i c e q u a t i o n we o b t a i n

~(g,z) = (l+5z-2z2-44z3+4Oz4)g 2 + (z+l16z3-432z4)g

-12 (z2-2z3+2z 4) = O

The N e w t o n d i a g r a m for ~(g,z) is s h o w n in f i g u r e 25 from w h i c h

we o b t a i n ~ = 1 , and h e n c e the a p p r o x i m a t i o n

z - cg

The c o e f f i c i e n t c is c a l c u l a t e d (using the m e t h o d described

in s e c t i o n 6.2) to h a v e the v a l u e s -71 and ~1 . Therefore,

resubstituting for z and g = - k -I , we h a v e the f o l l o w i n g

approximations for the c h a r a c t e r i s t i c frequency loci

s - 4k and s - -3k as k + ~

Two b r a n c h e s of the c h a r a c t e r i s t i c frequency loci t h e r e f o r e

m o v e off to i n f i n i t y at a n g l e s of O and 180 to the p o s i t i v e

real axis.

o ~
\ S

Figure 25. Newton diagram for ~z(g,z)

117

(b) To find the an@les of departure

The system has four open-loop poles

s = i, s = 2 , s = -4+2j , and s = -4-~

Pole at s = 1
-i
Putting s' = s - i and g = d in t h e characteristic

equation we obtain

~l(d,s') = (S,4+gs,3+19S'2-29S ') + ( s ' 3 + 3 s ' 2 + l l g s ' - 3 1 5 ) d

- 12(s'2+l)d 2 = O

The Newton diagram for Hl(d,s') is shown in figure 26 from

which we obtain e I = 1 , and hence the approximation

s' ~ e l d

The coefficient e I is calculated (using the method described

in s e c t i o n 6.2) to h a v e the value -10.86, resulting in t h e

following approximation to t h e characteristic frequency loci

s - 1 + 10.86 k

about the pole s = i. Therefore the angle of departure from

the p o l e s = 1 is 0 .

Pole at s = 2

Putting s' = s-2 and g= d -I in the characteristic

equation we obtain

E2(d,s' ) = (s,4+13s,3+52s,2+4Os ,) + (s'3+6s'2+128s'-192)d

- 12(s'2+2s'+2)d 2 = O

The N e w t o n diagram for E2(d,s') is shown in figure 27 from

which we obtain ~2 = 1 , and hence the approximation

s' ~ e 2 d

The coefficient e 2 is c a l c u l a t e d to have the value 4.8,

resulting in t h e following approximation to t h e characteristic

frequency loci
118

s-2-4.Sk

about the pole s = 2 . Therefore the angle of d e p a r t u r e from

the pole s = 2 is 1 8 0 .

Pole a t s = -4+2j

Putting s' = s+4-2j a n d g = d -I in the c h a r a c t e r i s t i c

equation we obtain

H3(d's') = [s'4+(-ll+j8) s'3+(lO-j60)s'2+(88+j104)s']

+ [s' 3+ (_12+96) s' 2+ (152-j 48) s ' + (-912+9 320)] d

-12 I s ' 2+ (-lO+j 4) s ' + ( 2 2 - 9 2 0 ) ] d 2 = 0

The Newton diagram for 23(d,s') is s h o w n in f i g u r e 28 f r o m

which we o b t a i n ~3 = i, a n d h e n c e the approximation

s' ~- e 3d

The coefficient e 3 is c a l c u l a t e d to h a v e the value

912 - j 3 2 0
88 + j l O 4

resulting in the following approximation to the characteristic

frequency loci
(-912+j320) k
s - -4+2j + (88+9104)

about the pole s = -4+2j. Therefore the angle of d e p a r t u r e

from the pole s = -4+29 is

-912+j32Q~ o
argument { 88+9104 j = 110.9

Pole at s = - 4 - 2 j

By symmetry the angle of d e p a r t u r e from the pole

s = -4-2j is - 1 1 0 . 9 .

(c) To f i n d the angles of a p p r o a c h

The system has two finite zeros

s = l+j and s = l-j

119

o
L
\
2 d ( I 2 =d

Figure 26. Newton Figure 27 Newton

diagram for ~l(d,s') diagram for E2(d,s')
$'
$'
4'

,\
O ---
0 I 2 "d I 2
8
Figure 28. Newton Figure 29. Newton
diagram for E3(d,s') diagram for X(g,s')

3-0

2.0

I I
6'0 7"0

2/
Figure 30. Complete characteristic frequency loci
120

Zero at s = i ~

Putting s' = s-l-j in the c h a r a c t e r i s t i c e q u a t i o n we

obtain

(g,s') = [ S , 4 + ( 9 + j 4 ) s , 3 + ( 1 3 + j 2 9 ) S , 2 + ( _ 5 8 + j 4 6 ) S , + ( _ 1 8 _ j 3 8 ) ] g2

+[s'3+(3+j3)s'2+(l16+j8)s'+(-318+j118)] g

12[s2j2s '] = 0
The N e w t o n d i a g r a m for x(g,s') is shown in f i g u r e 29 f r o m

w h i c h we o b t a i n n = 1 , and h e n c e the a p p r o x i m a t i o n

s' - pg

The c o e f f i c i e n t p is c a l c u l a t e d to h a v e the v a l u e

-318 + jl18
j24

resulting in the f o l l o w i n g a p p r o x i m a t i o n to the c h a r a c t e r i s t i c

f r e q u e n c y loci
(318-jI18) k -i
s - 1 + j + j24

a b o u t the z e r o s = l+j. T h e r e f o r e the a n g l e of a p p r o a c h

to the zero s = l+j is

f318-jl18% _
argument L j24 ~ + 180 E 69.64

Zero at s = l-j

By s y m m e t r y the a n g l e of a p p r o a c h to the zero at

s = l-j is - 6 9 . 6 4 .

To c h e c k the r e s u l t s the c o m p l e t e characteristic

frequency loci are shown, projected onto a single complex

f r e q u e n c y plane, in f i g u r e 30. Branches of the loci c o i n c i d e

a b o u t the p o l e s = 1 m a k i n g the m o v e m e n t of the c l o s e d - l o o p

poles difficult to c o m p r e h e n d . The frequency surface

c h a r a c t e r i z e d by c o n s t a n t p h a s e and c o n s t a n t m a g n i t u d e

c o n t o u r s of g(s) is t h e r e f o r e shown in f i g u r e s 31(a), (b)

121

ChQracterJstic
:" ~: frequency loci

(a) I Cutjoining
branch points

(b)

Figure 31.Frequency surface (a)sheetl (b)sheet2

122

2.1

f 1.95 I ]
4-05 4'0 -3"95 -3"9

Figure 32. Angle of departure from pole s=-42j

J
1.0

0.98

0.9~

0-94

0"9~
0"90

0.90

Figure 33. Angle of approach to zero s=lj

123

to give a clearer description of the characteristic frequency

loci. Small regions in the neighbourhood of the pole

s = -4+2j and the zero s = l+j are shown in figures 32 and

33 to verify the calculated angles of departure and approach.

6.5 Asymptotic behaviour of optimal, closed-loop poles

In this section the "Newton diagram" approach is used

to determine the asymptotic behaviour of the optimal closed-

loop poles of a multivariable time-invariant linear regulator,

as the weight on the input in the performance criterion

approaches zero. The method is based on an association of

the optimal characteristic frequency loci with the branches

of an appropriate algebraic function. Although the procedure

is equivalent to that given by K w a k e r n a ~ [Ii] the essential

simplicity of the approach is emphasized in the setting of

algebraic function theory.

Consider the stabilizable and detectable time-invariant

linear system

dx(t) _ A x(t) + Bu(t) (6.5.1)

y(t) = C x(t) (6.5.2)

and the performance criterion

V(~) = /o~T(t)Qy(t)+puT(t) Ru(t) ] at (6.5.3)

where Q and R are positive definite symmetric matrices,

and the superscript T denotes the transpose of a matrix or

vector. Then it is well known (e.g. [12]) that the optimal

control action is given by

u(t) = - ! R-IB T Px(t) (6.5.4)

P
where P is the unique positive semi-definite solution of
124

the steady state matrix Riccati equation

-PA -ATp + ~PBR-IBTp = cTQc (6.5.5)

P
Kwakernaak [ii, equation (16)] has related the closed-

loop characteristic polynomial (denoted by ~c(S)) for the

optimal regulator to the open-loop characteristic polynomial

(denoted by ~o(S)) as follows

~c(S)~c(-S) = ~o(S)~o(-S) det [I m + IR-IGT

P
(-s) QG (s) ]
(6.5.6)

where I is a unit matrix of order m, the number of system

m
inputs, and

G(s) = C(sI - A)-IB (6.5.7)

is the open-loop transfer function or gain matrix of the system.

It is obvious from (6.5.6) that the poles of the optimal closed-

loop system, dependent on the input weighing, are values of

s in the left half-plane which satisfy

det [ Im + ~H(s) ] = O (6.5.8)

where
H(s) ~ R-IGT (-s) QG (s) (6.5.9)

Consider now the characteristic equation defining the

eigenvalues of H(s), that is

&(~,s) ~ d e t [ q I s -H(s)] = 0 (6.5.10)

or by expanding the determinant

~(D,s) = D m + al(s2)n m-I + . . . . . +am(S2) O
(6.5.11)

where the coefficients {ai(s2) ; i=i,2, .... ,m} are rational

2 2
functions in s . The coefficients are functions in s because
125

~ (~,s) -- det [ nI m - R-IG T(-s) QG(s)] (6.5.12)

= det [ HI m - R-GT(-s)QG(S)R -]

and R-GT(-s)QG(s)R - is para-Hermitian implying [ll,

appendix A] that ~(n,s) =A(q,-s) which can only be the case

2
if the coefficients are rational functions in s

If b (s 2) is the least common denominator of the coefficients

o
{ai(s2) ; i=l,2,...,m} then from (6.5.11)

bo(S2)~(~,s)=bo(S2)~ m + bl(S2)nm-l+ ..... +bm(S 2) = O

(6.5.13)

where the coefficients {bi(s 2) : i =1,2, .... ,m} are polynomials

in s 2. The function of a complex variable n(s), defined by

(6.5.13) is an algebraic function. In general, (6.5.11)

and (6.5.13) will be reducible into several irreducible

equations over the field of rational functions in s, thereby

defining a set of algebraic functions. However, for simplicity

of exposition it will be assumed that equations (6.5.11) and

(6.5.13) are irreducible over the field of rational functions

in s.

If we compare equations (6.5.8) and (6.5.10) we see

that the optimal closed-loop poles can be defined as the left

half-plane solutions of

n (s) = -p (6.5.14)

i.e. the optimal characteristic frequency loci are the 180

phase contours of the algebraic function H(s) in the left

half-plane. If we now consider p as a complex variable and

substitute for ~(s) in (6.5.13) we obtain

bo(S 2) (_p)m + bl(S2 ) (_p)m-l+ .... +bm (s2) = 0

(6.5.15)
126

which is an algebraic equation defining the algebraic

functions s(p) and p(s). The left half-plane branches of

the algebraic function s(p), for p real and positive, are

the optimal characteristic frequency loci. (Note that equation

(6.5.15) can be obtained directly from equation (6.5.8)).

To determine the asymptotic behaviour of the optimal

characteristic frequency loci we need approximations to the

branches of the algebraic function s(p) in the neighbourhood

of s = ~, as p approaches zero. These can be obtained as

the first terms in the series expansions for the branches of

s(p) about the point s =~, as p approaches zero. For this

--1 .
purpose we put s = z in equation (6.5.15) and procede as in

section 6.2.

6.6 Example 2

To demonstrate the procedure the asymptotic behaviour of

the optimal closed-loop poles will be calculated for

s + 2 6],
1
S(s) (s+l) (s+2) (s+3) (s+4)
s + 3 1

Q = I

and

pR = pI

From this data we have

H(s) ~ R-IGT (-s)QG (s)

-2s 2 + 13 -7s + 151

(s2-1) (s2-2) (s2-3) (s2-4)

7s + 15 37

SO that
127

A(~,s) A= det[ni 2 _ H(s)]

2 ( -2s 2 + 50) n + (-25s 2 + 256)

(s2 i) (s2 2) (s2_3) (s2_4) (s2_l) 2(s2_2)2(s2_3)2(s2_4)

= O

If we now substitute ~ = -P, and multiply throughout by the

least common denominator of the coefficients, we obtain

(s2-1) 2(s2-2)2(s2-3)2(s2-4)2p2+(s2-1) (s2-2) (s2-3) (s2-4) (-2s2+50) p

+ (-25s 2 + 256) = O.
-i
The substitution s = z now gives
(l-z 2) 2(l-2z2)2(l-3z~2(l-4z2)2p 2 + z6(l-z2) (I-2z2) (l-3z2)l-4z 2)

(-2+5Oz2)p
+ z14(-25 + 256z 2) = 0

The Newton diagram for this last equation is shown in figure 34,

from which we obtain

1 1 1
~i =~ ' ~2 = 8 ' and hence the approximationsz-elP 6 and
1 1
z=e2P~_ . To find the values of e I we substitute z = e.p 6 i n t o

I i.e. the terms corresponding to the

p2 _ 2z6p = 0 points on the first link of the Newton

diagram equated to zero,

giving e I = 6/~-~ = O.891 exp (j2kn), k=O,1,2,3,4,5.

6
1
If we substitute z = e2P 8 into
i.e. the terms corresponding to the

-25z 14 - 2z6p = 0 t points on the second link of the Newton

diagram equated to zero,

we obtain

e 2 = 8/-0.08 = 0.729 exp j(~+2k~.), k=O,l,2, .... ,7

8
~o
~Q
m~
~D O N
T~
m7

Q.
~o

3
129

Im (s)

. . . . J~ . . . . . . J .....

-40 -20 20 4 0 "Re'(S)

I I
-40 -20 20 40 R (sl

(b)
Figure 35. Asymptotic behaviour of the optimal characteristic
frequency loci (plus right half-plane image)displayed
on the Riemann surface domain of q(s)
(a) sheet 1 of the surface
(b)sheet 2 of the surface
130

If we now substitute back for s, and take only left half-

plane solutions, we obtain the following expressions for the

asymptotic behaviour of the optimal characteristic frequency

loci
-i
s~l.12 [ e x p
-
( j -~
k~ )]j p ~ ,k = 2,3,4

and
-I
s~ 1.37 [ e x p j (w+2kz)] p 8 , k = 2,3,4,5.
8

Note that using Kwakernaak's notation [ii], these

define Butterworth patterns of orders 3 and 4. The optimal

characteristic frequency loci and their right half--plane

image are shown in figure 35. Note that they have been computed

as the 180 phase contours of the algebraic function ~(s)

and displayed on its Riemann surface domain.

References

i] W.R. Evans, "Graphical Analysis of Control Systems", Trans.

AIEE, 67, 547-551, 1948.

2] W.R. Evans, "Control System Synthesis by Root Locus Method",

Trans. AIEE, 69, 1-4, 1950.

[3] W.R. Evans, "Control System Dynamics", McGraw-Hill, New

York, 1954.

[4] B. Kouvaritakis, and U. Shaked, "Asymptotic behaviour of

root-loci of multivariable systems", Int. J. Control, 23,
297-340, 1976.

[5] D.H. Owens, "A note o n series expansions for multivariable

root-loci", Int. J. Control, 26, 549-557, 1977.
6] U. Shaked, "The angles of departure and approach of the root-
loci in linear multivariable systems", Int. J. Control, 23,
445-457, 1976.

[7] G.A. Bliss, "Algebraic Functions", Dover, New York, 1966

(reprint of 1933 original).
131

[8] B.A.Fuchs, and V.I.Levin, "Functions of a Complex Variable",

International Series of Monographs in Pure and Applied
Mathematics, Pergamon Press, 1961 (translation of 1951
Russian original).

[9] E.Hille, "Analytic Function Theory", Vol. 2, Ginn and Co.,

U.S.A., 1962.

[i0] I.Postlethwaite, "A note on the characteristic frequency

loci of multivariable linear optimal regulators", IEEE
Trans. Automatic Control, 23, 757-760, 1978.

[ii] H. Kwakernaak, "Asymptotic Root Loci of Multivariable Linear

Optimal Regulators", IEEE Transo Automatic Control, 21,
378-382, 1976.

[12] A.G.J.MacFarlane, "Dual-system methods in dynamical analysis

Pt. 2 - Optimal regulators and optimal servo-mechanisms",
Proc. IEE, 1458-1462, 1969.
7. On parametric stability and future research

A feedback system is said to be stable if all of its

closed-loop poles are in the left half-plane. The stability

of a control system is therefore dependent on its associated

parameters. Sometimes in a control system the value of a

parameter is uncertain perhaps due to ageing, deterioration,

or damage; in other instances it may be desirable, for

economic reasons, to change a p a r a m e t e r value. In both

these cases a technique which predicts the relative stability

of a system w i t h respect to a given p a r a m e t e r would be extremely

useful.

A dominant theme in the p r e c e d i n g chapters has been the

association of a system with two sets of algebraic functions:

characteristic gain functions and c h a r a c t e r i s t i c frequency

functions. In this Chapter c h a r a c t e r i s t i c parameter functions

are introduced, and used to develop the ideas of 'parametric'

root loci and 'parametric' Nyquist loci from w h i c h the relative

stability of a system, w i t h respect to a single parameter,

can be determined.

In the final section of this chapter a few tentative

proposals and suggestions for future research are made.

7.1 Characteristic freqUency and c h a r a c t e r i s t i c parameter

functions

The feedback c o n f i g u r a t i o n considered is shown in figure

36, where A(k2,k3,...,kq), B(k2,k3,...,kq), C ( k 2 , k 3 , . . . k q)

and D(k2,k3,...,k q) are state-space matrices w h i c h are

dependent on (q-l) real, t i m e - i n v a r i a n t parameters and k I is

133

a scalar, time-invariant gain parameter common to all the loops.

-~ID(k2.....kq),l

Figure 36. Feedback configuration

for parameter anatysis

The closed-loop poles for this configuration are solutions

det [ si n - S(k) ] = O (7.1. i)

where

S (k) ~A (k 2 .... ,kq) -B (k 2 .... kq)~l-llm+D(k2 .... kq) ] -Ic (k2, ,kq)

is the closed-loop frequency matrix [see section 3.1]. If

numerical values for all the parameters except one, kj say,

are substituted into equation (7.1.i), and kj considered as

a complex variable, then the resulting algebraic equation

1:34

(which for simplicity of exposition will be regarded as

irreducible) defines a pair of algebraic functions [ i],

s(kj) and kj(s). The algebraic function s(kj) is called the

characteristic frequency function with respect to kj, and the

algebraic function kj(s) is called the characteristic parameter

function for k . (Note that the characteristic frequency

3
function s(g) and the characteristic gain function g(s),

introduced in chapter 3, are equivalent to s(-kl-l) and

-kl(S)-i respectively).
The branches of s (kj ), for k 3 real, clearly define the

variation of the closed-loop poles with respect to kj, and

as such are termed parametric root loci. Alternatively,

the parametric root loci can be viewed as the O phase

contours of k. (s) on the frequency surface domain for k. (s).

3 ]
Dual to the parametric root loci are the parametric

Nyquist loci or characteristic parameter loci which are the

branches of k.(s) as s traverses the imaginary axes.

]
Alternatively, the characteristic parameter loci can be

viewed as the ~90 phase contours of s(kj) on the Riemann

surface domain for s(kj) which will be called the parameter

surface for k..

3
If, for a particular system, we have a set of nominal

values for the system parameters we can determine which,

if any, are sensitive with respect to stability by looking

at the set of parameter surfaces. To help in such an assessment

the following generalizations of gain and phase margin are

introduced.
135

?.2 G a i n and phase margins

The ~ 90 phase contours of s(kj) on the p a r a m e t e r surface

for k. trace out the boundary between stable and unstable

3
closed-loop poles and therefore we can define parameter gain
o
and phase margins for k. about a stable operating point k.
3 3
which give a measure of the relative stability of the system

with respect to ko.

3
Parameter gain margi n . Parameter
gain margin is defined with
o
respect to a stable operating point k. as the smallest change
3
o
in parameter gain about kj needed to drive the system into

instability. Let d i be the shortest distahce along the real

o
axis from a stable operating point k to the stability boundary
3
(characteristic p a r a m e t e r loci) on the ith sheet of the parameter

surface for kj. Then the p a r a m e t e r gain margin is defined as

min{d.: i=1,2 ..... n}.

i l

Parameter phase margin. On each of the n sheets of the

parameter surface
for k. imagine that an arc is drawn,
3
o
centre the origin, from a stable operating point kj until

it reaches the stability boundary (characteristic parameter

loci). Let ~i be the angle subtended at the origin by

the corresponding arc on the ith sheet. Then the p a r a m e t e r

phase margin is defined as min{~i: i=1,2 .... ,n}

7.3 Example

In this section an inverted p e n d u l u m p o s i t i o n i n g system

(see figure 37) is considered and its stability analysed

with respect to one of its parameters, namely the m a s s of the

carriage.
136

ndutum

PlV~ll iage

kJ k2
/I/I////
Figure 37. Inverted pendulum
positioning system

This s y s t e m has also been used by K w a k e r n a a k and S i v a n

[2], Cannon[3], and E l g e r d [ 4 ] . The s y s t e m can be m o d e l l e d

by the f o l l o w i n g linearized state d i f f e r e n t i a l equation[2]

x(t) = 0 1 o o x(t)+ ro]u(t )

0 -~F o o I~ I (7.3.1)
I |
o o o l:I
-~ o ~ o
where u(t) is a force e x e r t e d on the carriage by a s m a l l motor;

M is the mass of the carriage; F is the f r i c t i o n coefficient

associated with the m o v e m e n t of the carriage; and L' is given

L' = J + mL 2 (7.3.2)
mL
137

w h e r e m is the m a s s of the p e n d u l u m ; L is the d i s t a n c e from

the p i v o t to the c e n t r e of g r a v i t y of the p e n d u l u m ; and J

is the m o m e n t of i n e r t i a of the p e n d u l u m w i t h r e s p e c t to

the c e n t e of g r a v i t y .

The s y s t e m is s t a b i l i z a b l e using state f e e d b a c k of the

form

u(t) = -Kx(t) (7.3.3)

and u s n g the n u m e r % c a l v a l u e s

F -i
- 1 s
M

1 - 1 kg -I
M
(7.3.4)
-2
= 11.65 s
L'

L' = O.842m

it can be f o u n d [2] t h a t

= [86.81, n.21, -118.4, - 3 3 4 4 ] (7.3.s)

stabilizes the l i n e a r i z e d s y s t e m p l a c i n g the c l o s e d - l o o p

poles at - 4 . 7 0 6 ~ j 1.382 and - 1 . 9 0 2 ~ j 3 . 4 2 0 .

We w i l l now look at the p a r a m e t e r s u r f a c e for M to see

how v a r i a t i o n s in the carriag~ mass, a b o u t an o p e r a t i n g p o i n t

of ikg, affect the s t a b i l i t y of the system. The four s h e e t s

of the m a s s surface, c h a r a c t e r i z e d by c o n s t a n t p h a s e and

magnitude c o n t o u r s of s(M), are s h o w n in f i g u r e s 38-41,

from w h i c h the f o l l o w i n g s t a b i l i t y m a r g i n s are o b t a i n e d :

parameter (mass) g a i n m a r g i n = 1 kg

parameter (mass) p h a s e m a r g i n = 60

The g a i n m a r g i n of ikg c o r r e s p o n d s to r e d u c i n g the c a r r i a g e

mass to zero b e f o r e i n s t a b i l i t y occurs. The p a r a m e t e r

138

~. :~(M) ,.,~

Figure 38. Sheet 1 of parameter (mass) surface

-II)O u ! "l'r~(l~l)l ~L - - ] 0 ~ -~0

-ifD

--70
,z ReC~

t I0

"-~C

5"d~

-4

Figure 39. Sheet 2 of parameter (mass) surface

139

J..~cr43

Figure 40. Sheet 3 of parameter (mass) surface

~cM)

~.C

IOaB

] 18o"
-4. o ~ o

1 1"70-

...~c

-,4o~

Figure Z~l.Sheet 4 of parameter (mass) surface

140

surface also shows that there is a m a x i m u m limit to the carriage

mass, for stability, of 2.125 kg. The phase margin of 60

indicates adequate damping of the closed-loop system.

7.4 Future research

It is thought that parameter surfaces may prove to be

useful in the design of p a r a m e t e r dependent controllers for

systems in which a p a r t i c u l a r p a r a m e t e r suffers large variations

during normal operation. For example, the controller of an

aircraft engine needs to operate satisfactorily over a wide

range of altitudes. A possible design scheme could be

(i) to design real constant controllers at a number of altitudes,

(ii) to obtain an altitude dependent controller by "matrix

interpolation", and finally

(iii) to analyse the stability of the system over the whole

w o r k i n g range using the "altitude surface".

As indicated in this chapter the methods put forward in

this work are not only applicable to gain and frequency, but

any single system p a r a m e t e r and frequency. To obtain stability

tests in terms of more than one p a r a m e t e r variation is a c o m p l i c a t e d

problem, but one with great practical significance. It is felt

that v a l u a b l e insight into this p r o b l e m may stem from a study of

functions of several complex variables.

In recent years stability tests have been developed for

two-dimensional and m u l t i - d i m e n s i o n a l digital filters [5]. Two-

dimensional digital filters are used w i d e l y in many fields, such

as image processing, and geophysics for the p r o c e s s i n g of seismic,

gravity and magnetic data. It is expected that higher

dimensional filters will also find applications; for example,

three-dimensional filters in holography. The stability tests

which have emerged are of a N y q u i s t - t y p e and it is thought that

141

a deeper understanding of t h e s e d e v e l o p m e n t s by c o n t r o l e n g i n e e r s

may lead to s u i t a b l e adaption for u s e in the c o n t r o l field;

possibly in the s t u d y of s t a b i l i t y u n d e r m u l t i - p a r a m e t e r

variations.

A constant theme throughout this w o r k has b e e n the

a s s o c i a t i o n of a m u l t i v a r i a b l e s y s t e m w i t h one or p o s s i b l y

more algebraic f u n c t i o n s e a c h of w h i c h is d e f i n e d on an

appropriate Riemann surface. A Riemann surface for an

algebraic function is t o p o l o g i c a l l y equivalent to a s p h e r e

with "handles", and the n u m b e r of h a n d l e s is k n o w n as the

g e n u s n u m b e r of the surface [6] . The g e n u s n u m b e r m a y p r o v e

to be an i m p o r t a n t c h a r a c t e r i s t i c of a s y s t e m and it is felt

that it m a y be r e l a t e d to " d e c o u p l i n g " , that is, the t r a n s f o r m a t i o n

of a m u l t i v a r i a b l e s y s t e m to a s y s t e m w h i c h is e f f e c t i v e l y a set

of s i n g l e - i n p u t , single-output systems. A single-input, single-

output s y s t e m has a c o r r e s p o n d i n g trivial (one sheeted) frequency

s u r f a c e of genus zero, so that to d e c o u p l e a multivariahle

s y s t e m w o u l d be e q u i v a l e n t to r e d u c i n g an m - s h e e t e d Riemann

surface, w i t h a g e n u s n u m b e r g r e a t e r t h a n or e q u a l to zero,

to a set of m trivial Riemann surfaces e a c h of zero genus.

Consequently the e s t a b l i s h m e n t of r e l a t i o n s h i p s between branch-

point singularities (whose n a t u r e a n d l o c a t i o n d e t e r m i n e such

topological properties as the g e n u s number) and decoupling

c o u l d be a f r u i t f u l line of r e s e a r c h . In a p p e n d i x 7 an

interesting relationship is d e v e l o p e d w h i c h shows that a b r a n c h

p o i n t on a g a i n s u r f a c e c o r r e s p o n d s to a b r a n c h p o i n t or

s t a t i o n a r y p o i n t on the c o r r e s p o n d i n g frequency surface and

v i c e versa.
142

References

[i] G.A.Bliss, "Algebraic Functions", Dover, New York,1966

(reprint of 1933 original).

[2] H. Kwakernaak, and R. Sivan, "Linear Optimal Control

Systems", Wiley, New York, 1972.

[3] R.H. Cannon, Jr, "Dynamics of Physical Systems",

McGraw-Hill, New York, 1967.

[4] O.I.Elgerd, "Control Systems Theory", McGraw-Hill,

New York, 1967.

[ 53 E. I. Jury, "Inners and Stability of Dynamical Systems",

Wiley, New York, 1974.

[ 6] G. Springer, "Introduction to Riemann Surfaces",

Addison-Wesley, Reading, Mass., 1957.
142

References

[i] G.A.Bliss, "Algebraic Functions", Dover, New York,1966

(reprint of 1933 original).

[2] H. Kwakernaak, and R. Sivan, "Linear Optimal Control

Systems", Wiley, New York, 1972.

[3] R.H. Cannon, Jr, "Dynamics of Physical Systems",

McGraw-Hill, New York, 1967.

[4] O.I.Elgerd, "Control Systems Theory", McGraw-Hill,

New York, 1967.

[ 53 E. I. Jury, "Inners and Stability of Dynamical Systems",

Wiley, New York, 1974.

[ 6] G. Springer, "Introduction to Riemann Surfaces",

Addison-Wesley, Reading, Mass., 1957.
143

Appendix i. Definition of an a l g e b r a i c function

Let A(q,v) be a p o l y n o m i a l in q of the form

A(q,v) = fo(V)q m + f l ( v ) q m - l + .... +fm(V) (Al.1)

where each c o e f f i c i e n t {fi(v): i=l,2,...,m} is itself a

polynomial in v w i t h coefficients in the d o m a i n of c o m p l e x

numbers. T h e n an a l g e b r a i c function is a f u n c t i o n q(v)

defined for v a l u e s of v in the c o m p l e x v- p l a n e by an e q u a t i o n

of the form

A(q,v) = O (AI.2)

The p o l y n o m i a l A(q,v) can be r e w r i t t e n as a p o l y n o m i a l in v

with coefficients which are t h e m s e l v e s polynomials in q, and

when considered in this w a y e q u a t i o n (Ai.2) defines an a l g e b r a i c

function v(q).

For a fixed v a l u e of v, v O say, equation (AI.2) has

m solutions which are called branches of q(v), and in the

neighbourhood of v the b r a n c h e s are r e p r e s e n t a b l e by p o w e r

o
series expansions [i].

It is a s s u m e d in the above d e f i n i t i o n that A(q,v) is

an i r r e d u c i b l e polynomial in (q,v), that is, that A(q,v) is

not the p r o d u c t of two or more p o l y n o m i a l s in (q,v). If

A(q,v) was e x p r e s s i b l e as a p r o d u c t of t p o l y n o m i a l s in (q,v),

equation (AI.2) would then d e f i n e t algebraic functions

of the form q(v), or t a l g e b r a i c functions of the form v(q).

A p p e n d i x 2. A reduction to the i r r e d u c i b l e rational canonical form

In this a p p e n d i x a proposed method is g i v e n for

reducing any m - s q u a r e m a t r i x to its i r r e d u c i b l e rational

canonical form. The m e t h o d u s e d is a v a r i a t i o n on that g i v e n

144

P ~
by A y r e s [2J for finding the r a t i o n a l canonical form of a

square matrix. The r a t i o n a l canonical form of a s q u a r e

matrix G is s i m i l a r to the i r r e d u c i b l e form e x c e p t that its

diagonal blocks correspond to the i n v a r i a n t factors of gI-G,

rather than the i r r e d u c i b l e factors. The p r o c e d u r e given

b y A y r e s [2] for f i n d i n g the r a t i o n a l canonical form is

outlined b e l o w along w i t h some n e c e s s a r y definitions.

Definitions: If the v e c t o r s

X,GX,G2X,...,Gt-Ix (A2.1)
are l i n e a r l y independent but
2
X,GX,G X,...,Gt-Ix,GtX (A2.2)
are not then (A2.1) is c a l l e d a c h a i n of length t having X

as its leader.

Procedure: For a g i v e n m - s q u a r e matrix G over any f i e l d F :

(i) let X m be the leader of a chain C m of m a x i m u m length

for all m - v e c t o r s over F ;

(ii) let Xm_ 1 be the leader of a c h a i n Cm_ 1 of m a x i m u m

length (any m e m b e r of w h i c h is l i n e a r l y independent of the

preceding members and those of Cm) for all m - v e c t o r s over F

which are l i n e a r l y independent of the v e c t o r s

; of C
m
(iii) let Xm_ 2 be the l e a d e r of a chain Cm_ 2 of m a x i m u m

length (any m e m b e r of w h i c h is l i n e a r l y independent of the

preceding members and those of C m and Cm_ I) for all m-

vectors over F which are l i n e a r l y independent of the v e c t o r s

of C m and Cm_ 1 ;
and so on. Then, for
t.-I tj+l-iXj+l ;
E = [ Xj,GXj .... G 3 ~ j + l , G X j + 1 .... ,G

t -i
... ; X m , G X m ..... G m Xm ]
145

we have that E-IGE is the rational canonical form of G.

In this approach the chains of m a x i m u m length are used

to pick out the invariant factors. Now the invariant factors

are made up from products of the irreducible characteristic

equations which are required in the irreducible rational

canonical form. Therefore, if instead of using chains of

maximum length those of m i n i m u m length are found, the

t r a n s f o r m a t i o n m a t r i x E so formed is that required to give

the irreducible rational canonical form Q.

The problem with both of these methods is that no

indication is given w h i c h enables one to know when a chain of

m a x i m u m or m i n i m u m length has been obtained, except that in

the contrary case there will appear a chain of longer or shorter

length.

It is interesting to note that the chains of m a x i m u m

and m i n i m u m length form bases for the G - i n v a r i a n t subspaces

of m a x i m u m and minimum dimensions respectively. Therefore

the p r o b l e m of finding the chains of m i n i m u m length is

equivalent to that of finding the invariant suhspaces of

minimum dimension. It also happens that just as the l-

dimensional invariant subspace (eigenvector) picks out an

eigenvalue, a t-dimensional invariant subspace (from the set

of those of m i n i m u m dimensions) picks out an irreducible

equation of degree t, defining t eigenvalues.

Example: To find the irreducible rational canonical form of

S(s) = (s+3) (s+2) 2 O

(s+l) 2 (s+l)

(s-3) -2 0
2 2
(s+l) (s+l)

-i - (s+2) 1
2 - - 2 s+l
(s+l) 2 (s+l)
146

Let X = ( O 0 1 )t, where "t" denotes the transpose,

1
then GX = (s+l) X

and X is a chain of minimum length.

t
Let Y = ( O 1 0 )

then Gy = [ s+2 -2 -(s+2) I t

(s+l) 2 (s+l) 2 2 (s+l) 2

and G2y = [ s+2 s-2 -(s+2) 1 t 1 s

(s+l)3 (s+]93 2(s+i)3 - (s-~ GY + --(s+l)3 Y

No chain of smaller length can be found and therefore Y,GY

completes the set of minimum length chains. The transformation

matrix E(S) is therefore given by

s+2
-[x Y ='o o 2
(s+l)
-2
O 1 2
(s+l)
-(s+2)
1 O 2
2 (s+l)

and

Q(s) = E-I(s)G(s)E (s)

= s+l O O

0 0 s
(s+l) 3
1
O 1 (s+l)

which implies that the irreducible characteristic equations

are
1
(g,s) = g s+l
2 1
~(g,s) = g (s+l) g (s+l)

Note that these are also obvious from the dependence relations

obtained when finding the chains of minimum length.

'r47

A~pendix 3. The d i s c r i m i n a n t

In this a p p e n d i x two m e t h o d s are g i v e n for f i n d i n g

the d i s c r i m i n a n t of an e q u a t i o n of the f o r m

~(g,s) = bo(S)gt + bl(s)gt-l+ ... +bt(s) = O

Method 1 (Barnett [ 3 ] )

The r e s u l t a n t R[ a(g),c(g)] of two p o l y n o m i a l s

a(g) and c ( g ) , g i v e n by
n-l+
a(g) = a o g n + alg ... + a n
m m-i
c(g) = Cog + clg + ... + c m

where ai,c i e C ,is the d e t e r m i n a n t

aO aI a2 ... an 0

0 a aI ... an_ 1 an

m
rows
a -i a
n n

R[a(g),c(g)] = .oo
co c I .. Cm_ 1 cm

. , o C
o cI c 2 0 . cm O

ooo n
rows
co C1 c2 ..-

The p o l y n o m i a l s a(g) a n d c(g) have a common factor (of

d e g r e e g r e a t e r t h a n zero) if and o n l y if the (n+m) -order

determinant R [ a(g),c(g) ] is zero, provided that a

and c o are not b o t h zero.

L e t the d e r i v a t i v e w i t h r e s p e c t to g of the p o l y n o m i a l

a(g) be d e n o t e d by a' (g). T h e n the d i s c r i m i n a n t of the

polymonial a(g) is the d e t e r m i n a n t D g ( a o , a l , . . . , a n) d e f i n e d

148

by
_ 1 R [ a(g) a' (g) ]
Dg(ao'''''an) a
0

The p o l y n o m i a l a(g) has a r e p e a t e d factor if and only if

the d i s c r i m i n a n t D g ( a o , . . . , a n) is zero.

Now consider a polynomial of the type

@(g,s) = b o ( S ) g t + b l ( S ) g t-I +...+bt(s) , t > O

where the coefficients {bi(s) :i=l,2,...,t} are all

polynomials in s. The d i s c r i m i n a n t of this p o l y n o m i a l in

g is found from D g ( b o , b l , . . . , b t) as d e f i n e d above by r e p l a c i n g

b o , . . . , b t by bo(S),...,bt(s) respectively. Thus there is a

function D (s) again called the d i s c r i m i n a n t and defined by

g
_ 1
Dg(S) b (s) R[~(g,s),~'(g,s)]
O

where #'(g,s) is the d e r i v a t i v e with respect to g

of ~(g,s)

Method 2 (Sansone and G e r r e t s e n [4])

Consider the p o l y n o m i a l
n algn-i n > O
a(g) = dog + + ... + a n ,

where ai e ~ ; then the d i s c r i m i n a n t of a(g) is given by

the e x p r e s s i o n

Dg(ao'''''an ) = ao2n-2p

where P is a d e t e r m i n a n t given by

P = So ~I "" On-1

... (~
G1 o2 n

an-i gn --- a2n_ 2

and the elements {~ : i = 1,2,...,2n-2} are functions of

i
149

the coefficients {a. : i = O,l,...,n} , and

= n
o

The elements o1,...,an_ 1 can be found from

aI + ao~ 1 = 0

2a 2 + al(l 1 + ao~ 2 = 0

(n-l)an_ 1 + an_2~ 1 ... + aoan_ 1 = 0

and On,...,a2n_2 from

anO o + an_l$1 + ... + aoa n = O

ana I + an_lO 2 + ... + aoOn+ 1 = 0

+ + = 0
anO m a n_ i m + l + .. aoOn+ m

Consider now the polynomial ~(g,s) given by

~(g,s) = bo(s)g t + bl(S)g t-I + ... + bt(s)

The discriminant of this polynomial in g is found from

D g ( b o , . . . , b t) as defined above by replacing bo,...,b t

by bo(S),...,bt(s) respectively. The polynomial ~(g,s)

therefore has repeated factors if a n d only if

Dg(S) = . 2t-2
DO (S) P

is zero, where P is the determinant of a matrix whose

elements are functions of the coefficients {bi(s) : i = O,l,2,...,t}

150

A p p e n d i x 4. A m e t h o d for c o n s t r u c t i n g the R i e m a n n surface

domains of the algebraic functions c o r r e s p o n d i n 9

to an o p e n - l o o p gain m a t r i x G(s)

In s u b - s e c t i o n 3.3-4 it was shown that for a c h a r a c t e r i s t i c

gain f u n c t i o n of degree t the c o r r e s p o n d i n g R i e m a n n surface

is made up from t sheets of the complex s-plane stitched

together along cuts made b e t w e e n b r a n c h points and infinity.

A l t h o u g h the cuts are in some sense a r b i t r a r y (i.e. there is

no unique set of cuts), it is still a p r o b l e m to choose a

set that is consistent, and then to be able to identify in w h a t

order the sheets are connected together. In this a p p e n d i x

a systematic m e t h o d is given for solving this problem. The

m e t h o d is quite elegant in that the r e s u l t i n g cuts are symmetrical

about the real axis and are always p a r a l l e l to the i m a g i n a r y

axis except for possible cuts along the real axis. Also,

the a p p r o a c h uses the o p e n - l o o p gain m a t r i x directly, so

that there is no need to find the c h a r a c t e r i s t i c equations,

and the resulting n o n - c o n n e c t e d sets of c o n n e c t e d sheets

r e p r e s e n t the R i e m a n n surfaces for the irreducible c h a r a c t e r i s t i c

equations.

The m e t h o d is based on finding the e i g e n v a l u e s of the

m a t r i x for a grid of values covering the s-plane and then

sorting the e i g e n v a l u e s in a continuous form along certain

lines of the grid. If the transfer function is of order mxm,

m arrays which will e f f e c t i v e l y r e p r e s e n t the m sheets of

the R i e m a n n surfaces are n e e d e d to store the eigenvalues.

The p ~ o c e s s is analogous to the analytic c o n t i n u a t i o n

p r o c e d u r e d e s c r i b e d in s u b - s e c t i o n 3.3-4 where i n d i v i d u a l
151

points of a circular disc are made bearers of unique functional

values. Here i n d i v i d u a l points of the s-plane sheets

(arrays) are being made the bearers of functional values.

The first line along w h i c h the eigenvalues are c a l c u l a t e d

and sorted is the real axis, and then the c a l c u l a t i o n and

sorting is carried out along lines p a r a l l e l to the imaginary

axis and e m a n a t i n g from the real axis, as shown in figure 42.

The c a l c u l a t i o n is only necessary in the upper h a l f - p l a n e

since the e i g e n v a l u e s in the lower h a l f - p l a n e are the complex

c o n j u g a t e of those in the upper half. C o n t i n u i n g this process

the s-plane is covered, and m arrays of e i g e n v a l u e s are

o b t a i n e d which are continuous along the real axis and along

lines parallel to the i m a g i n a r y axis but not n e c e s s a r i l y

crossing the real axis.

I m (s)
5- plane

indicates continuity
between eigenvalues

Re (s)

Figure 42. Lines atong which eigenvatues

ore catcutated and sorted

By o b s e r v a t i o n of each array or sheet the n e c e s s a r y cuts

are obvious. If p a r a l l e l to the imaginary axis a line of

e i g e n v a l u e s is not continuous w i t h an adjacent line then

these m u s t be separated by a cut starting at a b r a n c h point

and e n d i n g at a branch point or infinity as shown in

152

figures 43 (a) and (b). Eigenvalues corresponding to v a l u e s

of s in the upper h a l f - p l a n e are c o m p l e x conjugate to those

in the lower h a l f - p l a n e and t h e r e f o r e continuity of e i g e n v a l u e s

across the real axis is i m p o s s i b l e if the e i g e n v a l u e s situated

on the real axis are complex. Therefore a cut a l o n g the

real axis is n e c e s s a r y w h e n e v e r the real axis e i g e n v a l u e s

are complex. Again all the cuts start at a b r a n c h point

and end at a b r a n c h p o i n t or infinity, as s h o w n in f i g u r e 44.

X x indicates continuity

between eigenvalues |to00

i
I
i Cut
~--~ Cut

BrQnch points
/- i

~T--~-
i
< I--" Branch point
t-- --~--

(a) (b)
Figure 43. Cuts parallel to the imaginary axis
(a) finite (b) infinite

Cut. Im (s) Cut

R = Real invalue

Figure 44. Cuts along the real axis

153

The stitching together of the sheets also becomes

obvious by matching eigenvalues on one side of a cut on one

sheet to those on another. When the matching process is

complete, in general, there will be sets of c o n n e c t e d sheets

each set defining a Riemann surface.

To facilitate the i d e n t i f i c a t i o n of cuts and the

matching of sheets it is very useful to draw the constant

phase and constant m a g n i t u d e contours on each sheet; this

is d e m o n s t r a t e d in the examples given in the main text.

Computationally sorting the e i g e n v a l u e s along the real

axis can be difficult because of the likelihood of real

axis poles. This p r o b l e m is overcome if the first line of

c a l c u l a t i o n and sorting is changed to be a line parallel

to, and a "large" distance away from, the real axis. The

rest of the calculations and sorting are then carried out

along lines parallel to %he imaginary axis starting at this

new line and finishing at the real axis; as shown in figure

45. With this new approach the only possible cuts are

either along the real axis, as before (see figure 44) or b e t w e e n

complex conjugate branch points as shown in figure 46.

A Riemann surface c o n s t r u c t i o n for

G(s) = 1 5s-2 2s-l]

1.25(s+i) (s+2) !
3s-18 s-8J

is shown using the original m e t h o d in figures 47 and 48, and

using the m o d i f i e d m e t h o d in figures 49 and 50, and illustrates

the arbitrariness of the cuts.

154

Ira(s}

,Y, X X X X

~K X
S -plane

) :K
I

z z :_
Re is)

Figure 45. Lines orang which eigenvaiues are calculated

and sorted (modified method)

Figure 46. Cut parattet to the imaginary axis

(modified method)
155

..... Root loci

i Cut

Figure 47. Sheet 1 of the Riemann surface

Cut

3_o

Figure 48. Sheet 2 of the Riemann surface

156

,,*~.~ Root loci

Cut
e

Figure 49. Sheet 1 of the Riemann surface

(modified method of construction)

;~;w---.~'~': R o o t loci

Cut

Figure 50. Sheet 2 of the Riemann surface

(modified method of construction)
157

Appendix 5. Extended Principle of the Argument

The required extension of the Argument Principle does

not seem to be readily available in the literature, although

a suitable statement of the Principle for a general m u l t i p l e -

valued analytic function has recently appeared in a text by

Evgrafov [5, page 98]. Therefore, to justify its use in

chapters 4 and 5, an appropriately extended Principle of the

Argument is developed here.

A5.1 Introduction

The extension required is non-trivial, with two main

problems to be overcome. The first of these arises from

the fact that, in general, the Riemann surface of an algebraic

function will be multiply connected. For an example of this

source of difficulty consider a c h a r a c t e r i s t i c gain function

g(s) associated with a 2 2 open-loop gain m a t r i x G(s).

Suppose that g(s) has four branch points, and that each

branch point is associated with a cycle of two sheets [6];

then the genus number [6; 7] of the a s s o c i a t e d Riemann surface

is one. Further suppose that these branch points are disposed

in the frequency plane (s-plane) in the way shown in figure 51.

Then, taking two copies of the complex number sphere (Riemann

number sphere), making cuts between the branch points, and

forming the topological equivalent of the Riemann surface

in the usual way [7] one obtains a torus, as illustrated in

figures 52 and 53. The region ~, shown shaded on this torus

and having a boundary ~, corresponds to the interiors of the

pair of Nyquist D-contours (shown in figure 52) for the

original complex number spheres out of w h i c h the torus was

constructed. To cope with such a situation we must ensure

that the extended version of the Argument Principle holds for

158

[s)~ ~-=Bronchpoint

Figure 51. Frequency ptane

~ branch
_

-
Cut between
points

Figure 52. Two copies of the comp{ex number sphere

Figure 53. Riemann surface shown topologically

equivalent to a torus
159

a region of a Riemann surface which is n o n - s i m p l y connected

and whose boundary ~ consists of several distinct closed

Jordan contours on the surface. This basic extension of the

principle is achieved via a suitable g e n e r a l i z a t i o n of the Cauchy

Residue Theorem and is covered in section A5.2.

The second obstacle to be overcome in a derivation of a

suitably extended Argument Principle is more directly

associated with the branch points of an algebraic function; in

this case with their effect on the calculation of residues

via contour integrals taken round the b o u n d a r y ~ of a region

having branch points in its interior. This p r o b l e m is

overcome by a change of variable and is treated in section A5.3.

Finally, in section A5.4 the results of sections A5.2 and

.3 are combined to give the required extended Principle of the

Argument.

A5.2 Generalized form of Cauchy's Residue Theorem

The basic requirement is a g e n e r a l i z a t i o n of Cauchy's

Residue Theorem to complex functions defined on the Riemann

surface of some known algebraic function, where the region

of integration may be non-simply connected and have a boundary

consisting of one or more closed Jordan contours. Such a

generalization may be found in Bliss [ 6 ] ; the main theorem

is given below with slight rephrasing and change of notation

for the context of this work Two observations may be useful

in helping the reader u n f a m i l i a r with Riemann surface theory to

understand this theorem.

(i) Each n o n - s i n g u l a r place (or point) on the Riemann

surface of an algebraic function f(s) is uniquely defined by a

160

pair of values (f,s) which satisfy a known algebraic equation

A(f,s) = 0

Thus at each point on the Riemann surface the value of a

complex function T(f,s) of the pair of complex variables s

and f is defined.

(ii) The Riemann surface for an algebraic function is

an orientable surface. Thus one can u n a m b i g u o u s l y define a

positive sense in which a boundary ~ can be traversed so as

to "enclose" some region ~ . The simplest way in which to

visualize this procedure is to imagine someone walking round

the boundary ~ on a t w o - d i m e n s i o n a l manifold in which

lies (embedded in a familiar space of three dimensions).

The positive sense of traversal may then be taken as that in

which someone walking forward round any p o r t i o n o f ~ will

always have ~ on his right-hand side. In figure 53 the boundary

is shown with a positive orientation.

G e n e r a l i z e d Residue T h e o r e m

Let R be the Riemann surface of an algebraic function

f(s) and ~ a portion o f ~ not n e c e s s a r i l y simply connected but

having a boundary ~ consisting of one or m o r e closed Jordan

contours not passing through any singular place o n ~ .

Then if a function ~(f,s) of the places o n ~ is analytic on

and ~ except possibly for a finite number of singular points

in ~, we have

2~j" ~(f,s)ds = Z residues of ~(f,s) in a

~ (A5.2. i)

where the boundary ~ is traversed in the positive sense with

respect to ~.
161

To derive the e x t e n d e d A r g u m e n t Principle we w i l l first

apply the G e n e r a l i z e d Residue Theorem to the f u n c t i o n f' (s)/f(s),

where f' (s) d e n o t e s the d e r i v a t i v e of f(s) with respect to s,

to give

1 /~ f' (s) ds = Z residues of f'(s)

f(s) in
2~j ~ f(s)

Note that f(s) m u s t have no poles or b r a n c h points (either of

which are r e f e r r e d to as s i n g u l a r places) on the b o u n d a r y ~.

For the p u r p o s e s of d e r i v i n g an A r g U m e n t Principle we w i l l n o w

also e x c l u d e any zeros of f(s) from ~. The next step is to

calculate the r e s i d u e s of f' (s)/f(s) in ~.

A5.3 Calculation of R e s i d u e s

The poles, zeros and b r a n c h p o i n t s of f(s) are s i n g u l a r i t i e s

f'(s)
of f(s) , a fact w h i c h w i l l b e c o m e clearer as this section

progresses. We w i l l c o n s i d e r all the b r a n c h p o i n t s of f(s)

but it turns out that only those w h i c h are also poles or

zeros of f(s) are relevant.

In the n e i g h b o u r h o o d of a pole or a zero (not i n c l u d i n g

those w h i c h are a s s o c i a t e d with a b r a n c h point) f(s) can be

represented by a series w h i c h defines a single-valued function.

For a pole Pi of o r d e r t p i t h e algebraic function can be

represented by

f(s) = E a n ( S - P i )n, a_t ~ O (A5.3.1)

n=-t Pi
Pi

~(s) t (A5.3.2)
i.e. f(s) = (s-Pi) Pi

where ~(pi ) ~ ~ and ~(s) is a n a l y t i c in the n e i g h b o u r h o o d

162

of Pi" This gives

-t
f'(s) Pi
+ ~'(s) (A5.3.3)
f(s) ~pi ) ~(s)
!
<s)
and since is a n a l y t i c in the n e i g h b o u r h o o d of D.
"l
~(s)

f' (s) has a simple pole of r e s i d u e -t at

the f u n c t i o n
f(s) Pi

s = Pi"
For a zero s. of o r d e r t the a l g e b r a i c function can
i zi
be r e p r e s e n t e d by the series
co

f(s) = Z Cn (s-zi)n, ct O (A5.3.4)

n=t z.
Z. l
1
t
i.e. f(s) = (s - z i) zi e(s) (A5.3.5)

where e(z i) ~ 0 and 8(s) is a n a l y t i c in the n e i g h b o u r h o o d

of z.. This gives

t
f'(s) = zi S' (s)
+ (A5.3.6)
f(s) (s - z i) e(s)
f'(s)
and we see that f(s) has a s i m p l e ~ l e of r e s i d u e

t at s = z..
z. 1
1

To d e t e r m i n e the r e s i d u e s at a b r a n c h p o i n t it is

necessary to m a k e a change of v a r i a b l e [6, page 81]. The

procedure is as follows.

In the n e i g h b o u r h o o d of a finite b r a n c h p o i n t b.
l

the a l g e b r a i c function can be r e p r e s e n t e d by a series of the

form [8],
163

f(s) = Z dn(r - s/ ~ i )n (A5.3.7)

n___-o~

where r is the number of sheets which form a cycle at the

branch point. If the branch point is a pole of order

tb and consists of a cycle of r sheets we have the series
Pi

expansion

f(s) = n~_t~id n(r s - ~ i ) n , d_tbpi ~ O (A5.3.8)

Similarly if the branch point is a zero of order tb we

z,
1

have that
o0

f(s) = Z b dn(r s - ~ i ) n , dtb ~ O (A5.3.9)

n=t z. z.l
1
otherwise
0o

f(s) =
dn(r s _ ~ i ) n , do ~ O (AS. 3. iO)
n=O

These expansions are multivalued and therefore not

suitable for determining residues. If, however, we make

the substitution
s = b. + x r (A5.3.11)
l

we find that the series expansions (A5.3.8, 9 and i0)

become
f(x) = Z dnxn , d_tb ~ 0 (A5.3.12)
n=-t~i Pi

f(x) = n~tb dnxn' dtb O (A5.3.13)

Z,
Zi 1

and
164

f(x) = Z darn, d O ~ O (A5.3.14)

n=O

respectively, where it is to be understood that

f(x) ~ f(b i + x r) (A5.3.15)

The substitution has therefore mapped the algebraic

function onto the x-plane where it can be represented by a

"single-valued" series expansion.

By definition [6] the residue at b I of (____{s)

f' analytic
f(s)

in a n e i g h b o u r h o o d of b. except possibly at b. itself is

1 1

A 1 Sf'(s) ds
(residue)bl = 249 - cf(~ (A5.3.16)

where C is a closed positively oriented Jordan contour o n e

bounding a n e i g h b o u r h o o d N of b i in which f'(s)

f(s) is

analytic except possibly at b i. If we substitute for s

from (A5.3.11) we find that

A 1 / f ' (s)
(residue)b. = 2~j I f ( s ) ds
i C

_ 1 / f' (x) dx ds
27 f(x) as
c
X

1 f f' (x) (A5.3.17)

dx
2~j , / f(x)
C
x

where C is a closed p o s i t i v e l y o~ented Jordan contour on the

x-plane bounding a n e i g h b o u r h o o d of the origin. The residue

f'( (s)
of s----~
f at a branch point b.1 is therefore equal to the

residue of f' (x) at the origin.

f(x) The residue of the function
165

f'(x)
f(x) is e a s i l y o b t a i n e d f r o m its s e r i e s e x p a n s i o n . F o l l o w i n g

the same p r o c e d u r e as b e f o r e we n o t e t h a t f' (x) in the

f(x----[

neighbourhood of the o r i g i n has:

(i) a simple pole with residue -t b , if the b r a n c h

Pi
p o i n t is a p o l e of o r d e r t b ; or
Pi
(ii) a s i m p l e p o l e w i t h r e s i d u e t; , if the b r a n c h

p o i n t is a zero of o r d e r t b l;
zi

otherwise f'(x) is a n a l y t i c .
f(x)

A5.4 Extended Principle of the A r g u m e n t .

Combining the r e s u l t s of s e c t i o n A 5 . 3 w i t h that of

equation (A5.2.2) we h a v e t h a t

2~j f(s) i i Z Pi i Pi
~n
= WZ -~P (A5.4.1)

where # Z and ~ P are r e s p e c t i v e l y the total n u m b e r of p o l e s

and zeros of f(s) in ~ e n c l o s e d by the b o u n d a r y %~ t r a v e r s e d

in the p o s i t i v e sense w i t h r e s p e c t to ~, and w h e r e ~ lies

in the R i e m a n n s u r f a c e of the a l g e b r a i c function f(s). Note

that multiple poles and zeros m u s t be c o u n t e d as m a n y times

as t h e i r o r d e r s indicate.

The l e f t - h a n d side of e q u a t i o n (4.1) is e q u i v a l e n t to

the net sum of the c l o c k w i s e e n c i r c l e m e n t s of a c u r v e set F

about the o r i g i n of the f-plane w h e r e F is the image of ~ under

f(s). D e n o t i n g this n e t sum of c l o c k w i s e e n c i r c l e m e n t s by

N(F,O), we f i n a l l y o b t a i n the r e q u i r e d e x t e n d e d P r i n c i p l e of

the A r g u m e n t in the f o r m

N(F,O) = # Z - ~ p (A5.4.2)
166

Appendix 6. Multivariable pivots from the c h a r a c t e r i s t i c

equation ~(g,s)=O

C o n s i d e r the c h a r a c t e r i s t i c equation

~(g,s) = O (A6. i)

and a s s u m e we h a v e f o u n d an a p p r o x i m a t i o n for a b r a n c h of the

characteristic frequency loci a b o u t s =m, as k ~m, of the f o r m

s - bk ~ (A6.2)

If p is the p i v o t corresponding to this a s y m p t o t e then for k =~

s = p + bk ~ (A6.3)
or 1
k : (A64)

The d e p e n d e n c e of the c l o s e d - l o o p p o l e s on k is g i v e n by

equation (6.1.5), i.e.

F(k,s) = O (A6.5)

Therefore substituting for k or s in e q u a t i o n (A6.5) we

obtain a relationship between s and p or a r e l a t i o n s h i p b e t w e e n

k and p. S i n c e the o r d e r of s w i l l in g e n e r a l be m u c h

g r e a t e r t h a n the o r d e r of k in e q u a t i o n (A6.5), we w i l l c o n s i d e r

the case of s u b s t i t u t i n g for k. Substituting for k from

equation (A6.4) into equation (A6.5) we o b t a i n the e q u a t i o n

Z(p,s) = 0 (A6.6)
-i
we r e q u i r e p for s =~, and so w e put s=z in e q u a t i o n (A6.6)

to g i v e

Z(p,s) = Z ( p , z -I) = z-tn(p,z) = o (A6.8)

w h e r e t is the m a x i m u m o r d e r of s in (A6.6). The e q u a t i o n

~(p,O) = O (A6.8)
167

then gives the multivariable pivot, p.

Example
Consider the open-loop gain matrix

G(s)= 1 ~-s3-11s2-29s+92 -2Os2+35s+70

s4-s3+2s2-25s+29
[ 41s2-s-91 33s2-170s+l18
which has a characteristic equation
A(g,s)=(s4-s3+2s2-25s+29)g2-(-s3+22s2-199s+210)g +(-33s+594)=O

Using the method described in section 6.2 we find that

as k+~ the infinite branches of the characteristic frequency loci
are given by the following approximations
s~k and s~+/(-33k)
To find the pivot corresponding to the second-order

Butterworth pattern j/(33k) we will put

s=p+/(-33k)
giving
2
k=(S-P)
-33
From the characteristic equation ~(g,s)=O we obtain
F(s,k)=~4-s3+2s2-25s+29)+(-s3+22s2-199s+210)k+(-33s+594)k2=O
and substituting for k in this we obtain
Z(p,s)=332(s4_s3+2s2_25s+29)_33(_s3+22s2_199s+210) (_2ps +p2)
-33s2(22s2-199s+210)+(-33s+594) (-4s3p+6s2p2-4Sp3+p4)
+594s4=0
Putting s=z -1 we find
~(p,z)=332(l-z+2z2-25z3+29z 4)

-33(-l+22z-199z2+210z3) (-2p+p2z)-33(22-199z+210z 2)
+(-33+594z) (-40+6zp2-4z2p3+z3p4)+594=O
188

Therefore

(p, O) = 3 3 2 - 6 6 p - 3 3 x 2 2 + 3 3 4 p + 5 9 4 = O

f r o m w h i c h we find

p=-14.5

Appendix 7. Association between branch points an d stationary

points on the v a i n and f r e q u e n c y s u r f a c e s

Let us s u p p o s e t h a t f r o m an o p e n - l o o p g a i n m a t r i x G(s),

we h a v e o b t a i n e d an a l g e b r a i c e q u a t i o n in t e r m s of the c o m p l e x

gain variable g, and the c o m p l e x frequency variable s, g i v e n b y

F ~ f(g,s) = O (A7.1)

Let us also c o n s i d e r the following equations i n v o l v i n g the

derivatives of F:

and

t ~F ~
<T J -- o CA7.3
g

The v a l u e s of s s i m u l a t a n e o u s l y satisfying equation

(A7.1 and .2) are the b r a n c h p o i n t s on the f r e q u e n c y surface;

the v a l u e s of g s i m u l t a n e o u s l y satisfying equations (A7.1and .3)

b e i n g the b r a n c h p o i n t s on the g a i n surface.

If we c o n s i d e r g as a f u n c t i o n of s, the t o t a l d e r i v a t i v e

of F w i t h r e s p e c t to s is

dF ~F ~F dg (A7.4)
--ds + ds
g s

Alternatively we can c o n s i d e r s as a f u n c t i o n of g, a n d o b t a i n

dF ~F SF ds
s g

N o w from e q u a t i o n (A7.1), F is i d e n t i c a l l y zero and therefore

169

its total d e r i v a t i v e s m u s t also be zero, so that from e q u a t i o n s

(A7.4 and .5) we have the following:

8F DE dg (A7.6)
= ds
g s

and

-- ds
d-~ (A7.7)
s g

If we have a b r a n c h p o i n t on the gain surface equation

(A7.3) is s a t i s f i e d and h e n c e from e q u a t i o n (A7.6) we have

that

(~_FF~ = O or dg = 0
%~gJ s ds

which imply that on the f r e q u e n c y s u r f a c e we have e i t h e r a

b r a n c h p o i n t or a s t a t i o n a r y point corresponding to the gain

surface branch point.

Alternatively if we have a b r a n c h p o i n t on the f r e q u e n c y

surface equation (A7.2) is s a t i s f i e d and h e n c e from equation

(A7.7) we have that

(~~F
)s g = O or ds
dg o

which i m p l y that on the gain surface we have e i t h e r a b r a n c h

p o i n t or a s t a t i o n a r y point corresponding to the f r e q u e n c y

surface b r a n c h point.

References

i] B.A. Fuchs, and V.I. Levin, " F u n c t i o n s of a C o m p l e x V a r i a b l e " ,

I n t e r n a t i o n a l S e r i e s of M o n o g r a p h s in Pure and A p p l i e d M a t h e m a t i c s ,
P e r g a m o n Press, 1961 (translation of 1951 R u s s i a n original).

2] F.M. Ayres, " T h e o r y and P r o b l e m s of Matrices", Schaum,

N e w York, 1962.
170

[3] S.Barnett, "Matrices in Control Theory", Van Nostrand-

Reinhold, London, 1971.

[4] G. Sansone, and J. Gerretsen, "Lectures on the Theory of

Functions of a Complex Variable", Vol. 2, Wolters-
Nordhoff, Groningen, 1969.

[53 M.A.Evgrafov, "Analytic Functions", Dover, New York, 1978.

[6~ G.A. Bliss, "Algebraic Functions", Dover, New York, 1966

(reprint of 1933 original).

[7] G. Springer, "Introduction to Riemann Surfaces" Addison-

Wesley, Reading, Mass., 1957.

[8] K. Knopp, "Theory of Functions", Part 2, Dover, New York,

1947.

Bibliography

AYRES, F.M., "Theory and Problems of Matrices", Schaum,

New York, 1962.

BARMAN, J.F., and KATZENELSON, J., Memorandum ERL-383,

Electronics Research Laboratory, College of Engineering,
Univ. of California, Berkeley, 1973.

BARMAN, J.F., and KATZENELSON, J., "A qeneralized Nyquist-

type stability criterion for multivariable feedback systems",
Int. J. Control, 20, 593-622, 1974.

BARNETT, S., "Matrices in Control Theory", Van Nostrand-

Reinhold, London, 1971.

BLISS, G.A., "Algebraic Functions", Dover, New York, 1966,

(reprint of 1933 original).

BODE, H.W., "Network analysis and feedback amplifier design",

Van Nostrand, Princeton, N.J., 1945.

BOHN, E.V. "Design and synthesis methods for a class of

multivariable feedback control systems based on single
variable methods", Trans.AIEE, 81, Part 2, 109-115, 1962.

BOHN, E.V. and KASVAND, T, " Use of matrix transformations

and system eigenvalues in the design of linear multivariable
control systems", Proc. IEE, llO, 989-997, 1963.

BOKSENBOM A.S. and HOOD, R, "General algebraic method applied

to control analysis of complex engine types", National Advisory
Committee for Aeronautics, Report NCA-TR-980, Washington D.C.,
1949.

BRACEWELL, R., "The Fourier Transform and Its Applications",

McGraw-Hill, New York, 1965.
171

CANNON, R.H., Jr., "Dynamics of Physical Systems", McGraw-

Hill, New York, 1967.

COHN, P.M., "Algebra", Vol. i, Wiley, London, 1974.

DESOER, C.A., and CHAN, W.S., "The Feedback Interconnection

of Lumped Linear Time-invariant Systems", J. Franklin Inst.,
300, 335-351, 1975.

ELGERD, O.I., "Control System Theory", McGraw-Hill, New York,

1967.

EVANS, W.R., "Graphical Analysis of Control Systems", Trans.

AIEE, 67, 547-551, 1948.

EVANS, W.R., "Control System Synthesis by Root Locus Method",

Trans. AIEE, 69, 1-4, 1950.

EVANS, W.R., "Control System Dynamics", McGraw-Hill, New York,

1954.

FREEMAN, H., "A synthesis method for multipole control systems",

Trans. AIEE, 76, 28-31, 1957.

FREEMAN, H., "Stability and physical realizability considerations

in the synthesis of multipole control systems", Trans.AIEE, Part 2,
77, 1-15, 1958.

FUCHS, B.A., and LEVIN, V.I., "Functions of a Complex Variable",

International Series of Monographs in Pure and Applied Mathematics,
Pergamon Press, 1961 (translation of 1951 Russian original).

GANTMACHER, F.R., "Theory of Matrices", Vol. l, Chelsea,

New York, 1959.

GOLOMB, M.,and USDIN, E., "A theory of multidimensional servo

systems", J. Franklin Inst., 253(i) i 28-57, 1952.

HILLE, E., "Analytic Function Theory", Vol. i, Ginn and Co.,

U.S.A., 1959.

HILLE, E., "Analytic Function Theory", Vol. 2, Ginn and Co.,

U.S.A. 1962.

JURY, E.I., "Inners and Stability of Dynamical Systems",

Wiley, New York, 1974.

KALMAN, R.E., "When is a linear control system optimal?",

Trans.ASME J.Basic Eng., Series D., 86, 51-60, 1964.

KAVANAGH, R.J., "Noninteraction in linear multivariable

systems", Trans. AIEE, 76, 95-1OO, 1957.

KAVANAGH, R.J., "The application of matrix methods to multi-

variable control systems", J.Franklin Inst., 262, 349-367, 1957.

KAVANAGH, R.J., "Multivariable control system synthesis",

Trans. AIEE, Part 2, 77, 425-429, 1958.
172

KONTAKOS, T., Ph.D. Thesis, University of Manchester, 1973.

KNOPP, K., "Theory of Functions", Part 2, Dover, New York, 1947.

KOUVARITAKIS, B., and SHAKED, U., "Asymptotic behaviour of root-

loci of multivariable systems", Int. J. Control, 23, 297-340, 1977.

KWAKERNAAK, H., "Asymptotic Root LOCi of Multivariable Linear

Optimal Regulators", IEEE Trans. Automatic Control, 21, 378-382,
1976.

KWAKERNAAK, H., and SIVAN,R., "Linear Optimal Control Systems",

Wiley, New York, 1972.

MACFARLANE, A.G.J., "Dual system methods in dynamical analysis

Pt. 2 - Optimal regulators and optimal servo-mechanisms",
Proc. IEE, 116, 1458-1462, 1969.

MACFARLANE, A.G.J., "Dynamical System Models", Harrap, London, 1970.

MACFARLANE, A.G.J., "Return-difference and return-ratio matrices

and their use in analysis and design of multivariable feedback
control systems", Proc. IEE, 117, 2037-2049, 1970.

MACFARLANE A.G.J., and BELLETRUTTI, J.J., "The Characteristic

Locus Design Method", Automatica, 9, 575-588, 1973.

MACFARLANE, A.G.J., and KARCANIAS, N., "Poles and zeros of

linear multivariable systems: a survey of the algebraic, geometric
and complex variable theory", Int. J. Control, 24, 33-74, 1976.

MACFARLANE, A.G.J. and KOUVARITAKIS, B., "A design technique for

linear multivariable feedback systems", Int. J.Control, 25,
837-874, 1977.

MACFARLANE, A.G.J., KOUVARITAKIS, B., and EDMUNDS, J.M.,"Complex

variable methods for multivariable feedback systems analysis and
design", Alternatives for Linear Multivariable Control, National
Engineering Consortium, Chicago, 189-228, 1977.

MACFARLANE, A.G.J., and POSTLETHWAITE, I., "The generalized

Nyquist stability criterion and multivariable root loci",
Int. J. Control, 25, 81-127, 1977.
MACFARLANE, A.G.J.,and POSTLETHWAITE, I., "Characteristic
frequency functions and characteristic gain functions", Int.
J. Control, 26, 265-278, 1977.