Fuzzy Systems, Modeling and Identification: Robert Babu Ska
Fuzzy Systems, Modeling and Identification: Robert Babu Ska
Robert Babu ka s
Delft University of Technology, Department of Electrical Engineering Control Laboratory, Mekelweg 4, P.O. Box 5031, 2600 GA Delft, The Netherlands tel: +31 15 785117, fax: +31 15 2786679, e-mail: [email protected]
Summary
This text provides an introduction to the use of fuzzy sets and fuzzy logic for the approximation of functions and modeling of static and dynamic systems. The concept of a fuzzy system is rst explained. Afterwards, the motivation and practical relevance of fuzzy modeling are highlighted. Two types of rule-based fuzzy models are described: the linguistic (Mamdani) model and the TakagiSugeno model. For each model, the structure of the rules, the inference and defuzzication methods are presented. Fuzzy modeling of dynamic systems is addressed, as well as the methods to construct fuzzy models from knowledge and data (measurements). Illustrative examples are given throughout the text. At the end, homework problems are included. MATLAB programs implementing some of the examples are available from the author. The reader is encouraged to study and possibly modify these examples in order to get a better insight in the methods presented.
Preface
Prerequisites: This text provides an introduction to the use of fuzzy sets and fuzzy logic for the approximation of functions and modeling of static and dynamic systems. It is assumed that the reader has basic knowledge of set and fuzzy set theory (membership functions, operations on fuzzy sets union, intersection and complement, fuzzy relations, max-min composition, extension principle), mathematical analysis (univariate and multivariate functions, composition of functions), and linear algebra (system of linear equations, least-square solution). Organization. The material is organized in ve sections: In the Introduction, different modeling paradigms are rst presented. Then, the concept of a fuzzy system is rst explained and the motivation and practical relevance of fuzzy modeling are highlighted. Section 2 describes two types of rule-based fuzzy models: the linguistic (Mamdani) model and the TakagiSugeno model. For each model, the structure of the rules, the inference and defuzzication methods are presented. At the end of this section, fuzzy modeling of dynamic systems is addressed. In Section 3, methods to construct fuzzy models from knowledge and numerical data are presented. Section 4 reviews some engineering applications of fuzzy modeling, and the concluding Section 5 gives a short summary. Illustrative examples are provided throughout the text, and at the end, homework problems are included. Some of the numerical examples given have been implemented in MATLAB. The code is available from the author on request. The reader is encouraged to study and possibly modify these examples in order to get a better insight in the methods presented. A subject index is provided for a quick reference. Aims: After studying the material, the reader should be able to: Characterize a fuzzy system and give some examples of fuzzy systems. Dene the linguistic (Mamdani) and the Takagi-Sugeno fuzzy model in terms of their structure, inference and defuzzication mechanisms. Explain how dynamic systems are represented by fuzzy models, give examples. List the steps and choices in the knowledge-based design of fuzzy models. Name and briey characterize the presented techniques for data-driven acquisition and tuning of fuzzy models. Further reading. Readers interested in a detailed and fundamental treatment of fuzzy set theory and fuzzy logic can consult research monographs by Dubois and Prade (1980) or Klir and Yuan (1995). Basic, as well as more advanced concepts of fuzzy modeling and control, are presented, for instance, by Pedrycz (1993), Driankov, et al. (1993) or Yager and Filev (1994). Mathematical notation. Throughout the text, the following conventions are used. Lower case characters in italics, such as x or yi , denote scalar variables and elements of vectors. Vectors are printed in bold, i.e., x denotes a column vector. A row vector is denoted by the transpose operator, e.g., xT . Upper case bold characters denote matrices, for instance, X is a matrix. Upper case italic characters such as A denote crisp and fuzzy sets. A linguistic variable (a variable whose values are ~ fuzzy sets) is denoted by x. The term crisp is used as an opposite to fuzzy. For instance, a fuzzy number is a normal convex fuzzy set, while a crisp number may by a real or an integer number. A list of the used mathematical symbold is included in Appendix A. i
Acknowledgement. I am grateful to Piet Bruijn and Govert Monsees who read drafts of this text and contributed by their useful comments.
ii
Contents
1 Introduction 1.1 1.2 Fuzzy systems 1 2 4 5
2.2
2.3
::::::::::::::::: 2.1.1 Relational representation of a linguistic model : 2.1.2 Max-min (Mamdani) inference : : : : : : : : : 2.1.3 Multivariable systems : : : : : : : : : : : : : 2.1.4 Defuzzication : : : : : : : : : : : : : : : : : 2.1.5 Singleton model : : : : : : : : : : : : : : : : TakagiSugeno model : : : : : : : : : : : : : : : : : 2.2.1 Inference mechanism : : : : : : : : : : : : : 2.2.2 TS model as a quasi-linear systems : : : : : : Modeling dynamic systems : : : : : : : : : : : : : :
Linguistic fuzzy model
5 6 8 10 12 12 14 14 14 16 16
::::::::::::: Knowledge-based design : : : : : : : : : : : : : Data-driven acquisition/tuning of fuzzy models : 3.3.1 Least-squares estimation of consequents : 3.3.2 Template-based modeling : : : : : : : : 3.3.3 Neuro-fuzzy modeling : : : : : : : : : : 3.3.4 Fuzzy clustering : : : : : : : : : : : : :
Structure and parameters
: : : : : : :
: : : : : : :
: : : : : : :
17 19 19 19 20 22 23 24 25 27 30
4 Overview of Applications 5 Summary and Concluding Remarks A List of Symbols Subject index
iii
iv
1 Introduction
Developing mathematical models of real systems is a central topic in many disciplines of engineering and science. Models can be used for simulations, analysis of the systems behavior, better understanding of the underlying mechanisms in the system, design of new processes, or design of controllers. Traditionally, modeling is seen as a conjunction of a thorough understanding of the systems nature and behavior, and of a suitable mathematical treatment that leads to a usable model. This approach is usually termed white-box (physical, mechanistic, rst-principle) modeling. However, the requirement for a good understanding of the physical background of the problem at hand proves to be a severe limiting factor in practice, when complex and poorly understood systems are considered. Difculties encountered in conventional white-box modeling can arise, for instance, from poor understanding of the underlying phenomena, inaccurate values of various process parameters, or from the complexity of the resulting model. A complete understanding of the underlying mechanisms is virtually impossible for a majority of real systems. However, gathering an acceptable degree of knowledge needed for physical modeling may be a very difcult, time-consuming and expensive or even impossible task. Even if the structure of the model is determined, a major problem of obtaining accurate values for the parameters remains. It is the task of system identication to estimate the parameters from data measured on the system. Identication methods are currently developed to a mature level for linear systems only. Most real processes are, however, nonlinear and can be approximated by linear models only locally. A different approach assumes that the process under study can be approximated by using some sufciently general black-box structure used as a general function approximator. The modeling problem then reduces to postulating an appropriate structure of the approximator, in order to correctly capture the dynamics and nonlinearity of the system. In black-box modeling, the structure of the model is hardly related to the structure of the real system. The identication problem consists of estimating the parameters of the model. If representative process data are available, black-box models usually can be developed quite easily, without requiring process-specic knowledge. A severe drawback of this approach is that the structure and parameters of these models usually do not have any physical signicance. Such models cannot be used for analyzing the systems behavior otherwise than by numerical simulation, cannot be scaled up or down when moving from one process scale to another, and therefore are less useful for industrial practice. There is a range of modeling techniques that attempt to combine the advantages of the white-box and black-box approaches, such that the known parts of the system are modeled using physical knowledge, and the unknown or less certain parts are approximated in a black-box manner, using process data and black-box modeling structures with suitable approximation properties. These methods are often denoted as hybrid, semi-mechanistic or gray-box modeling. A common drawback of most standard modeling approaches is that they cannot make effective use of extra information, such as the knowledge and experience of engineers and operators, which is often imprecise and qualitative in its nature. The fact that humans are often able to manage complex tasks under signicant uncertainty has stimulated the search for alternative modeling and control paradigms. So-called intelligent modeling and control methodologies, which employ techniques motivated by biological systems and human intelligence to develop models and controllers for dynamic systems, have been introduced. These techniques explore alternative representation schemes, using, for instance, natural language, rules, semantic networks 1
or qualitative models, and possess formal methods to incorporate extra relevant information. Fuzzy modeling and control are typical examples of techniques that make use of human knowledge and deductive processes. Articial neural networks, on the other hand, realize learning and adaptation capabilities by imitating the functioning of biological neural systems on a simplied level. The different modeling paradigms are summarized in Tab. 1. Table 1. Different modeling paradigms. modeling approach mechanistic (white-box) black-box fuzzy source of information formal knowledge and data data various knowledge and data method of acquisition mathematical (Lagrange eq.) optimization (learning) knowledgebased + learning example differential equations regression, neural network rule-based model deciency cannot use soft knowledge cannot at all use knowledge curse of dimensionality
By means of the extension principle a crisp function can be evaluated for a fuzzy argument (Zadeh, 1975).
Table 2. Crisp and fuzzy information in systems. system description crisp crisp fuzzy input data crisp fuzzy crisp/ fuzzy resulting output data crisp fuzzy fuzzy mathematical framework functional analysis, linear algebra, etc. extension principle1 fuzzy relational calculus, fuzzy inference
Fuzzy systems can be regarded as a generalization of interval-valued systems, which are in turn a generalization of crisp systems. This is depicted in Fig. 1 which gives an example of a function and its interval and fuzzy forms. The evaluation of the function for crisp, interval and fuzzy data Y can be regarded as a subset is schematically depicted as well. Note that a function f : X of the Cartesian product X Y , i.e., as a relation. The evaluation of the function for a given input proceeds in three steps: 1) extend the given input into the product space X Y (vertical dashed lines in Fig. 1), 2) nd the intersection of this extension with the relation, 3) project this intersection onto Y (horizontal dashed lines in Fig. 1). This view is independent of the nature of both the function and the data (crisp, interval, fuzzy). Remember this view of function evaluation, as will help you to understand the use of fuzzy relations for inference in fuzzy modeling.
crisp argument
y
crisp function
x
y y
interval function
x
y y
fuzzy function
Figure 1. Evaluation of a crisp, interval and fuzzy function for crisp, interval and fuzzy arguments.
Most common are fuzzy systems dened by means of if-then rules: rule-based fuzzy systems. In the rest of this text we will focus on these systems only. Fuzzy systems can serve different purposes, such as modeling, data analysis, prediction or control. In this text a fuzzy rule-based system is for simplicity called a fuzzy model, regardless of its eventual purpose. 3
i = 1; 2 ; : : : ; K : (1) Here x is the input (antecedent) linguistic variable, and Ai are the antecedent linguistic terms ~ ~ (constants). Similarly, y is the output (consequent) linguistic variable and Bi are the consequent ~ ~ linguistic terms. The values of x (y ) and the linguistic terms Ai (Bi ) are fuzzy sets dened in the R p and y 2 Y R q . The membership domains of their respective base variables: x 2 X functions of the antecedent (consequent) fuzzy sets are then the mappings: x: X ! 0; 1 , y: Y ! 0; 1 . Fuzzy sets Ai dene fuzzy regions in the antecedent space, for which the respective consequent propositions hold. The linguistic terms Ai and Bi are usually selected
2
Ri : If x is Ai then y is Bi ; ~ ~
from sets of predened terms, such as Small, Medium, etc. By denoting these sets by and respectively, we have Ai and Bi . The rule base = i i = 1; 2; : : : ; K and the sets and constitute the knowledge base of the linguistic model.
2A
2B
R fR j
Example 2.1 Consider a simple fuzzy model which qualitatively describes how the heating power of a gas burner depends on the oxygen supply (assuming a constant gas supply). We have a scalar input, the oxygen ow rate (x), and a scalar output, the heating power (y ). Dene the = Low; OK; High , and the set of consequent linguistic set of antecedent linguistic terms: terms: = Low; High . The qualitative relationship between the model input and output can be expressed by the following rules:
B f
A f
R : If R : If R : If
1 2 3
O2 ow rate is Low then heating power is Low: O2 ow rate is OK then heating power is High: O2 ow rate is High then heating power is Low:
Base variable is the domain variable in which fuzzy sets are dened.
The meaning of the linguistic terms is dened by their membership functions, depicted in Fig. 2. The numerical values along the base variables are selected somewhat arbitrarily. Note that no universal meaning of the linguistic terms can be dened. For this example, it will depend on the type and ow rate of the fuel gas, type of burner, etc. Nevertheless, the qualitative relationship expressed by the rules remains valid.
Low 1 OK High 1 Low High
25
100
In order to be able to use the linguistic model, we need an algorithm which allows us to compute the output value, given some input value. This algorithm is called the fuzzy inference algorithm (or mechanism). For the linguistic model, the inference mechanism can be derived by using fuzzy relational calculus, as shown in the following section.
2.1.1
Each rule in (1) can be regarded as a fuzzy relation (fuzzy restriction on the simultaneous occur0; 1 . This relation can be computed in two basic rences of values x and y): Ri : X Y ways: by using fuzzy conjunctions (Mamdani method) and by using fuzzy implications (fuzzy logic method), see for instance (Driankov, et al., 1993). Fuzzy implications are used when the Bi, i.e., A implies B . In classical if-then rule (1) is strictly regarded as an implication Ai logic this means that if A holds, B must hold as well for the implication to be true. Nothing can, however, be said about B when A does not hold, and the relationship also cannot be inverted.
R x; y = A x ^ B y : (2) Note that the minimum is computed on the Cartesian product space of X and Y , i.e., for all possible pairs of x and y. The fuzzy relation R representing the entire model (1) is given by the disjunction (union) of the K individual rules relations Ri :
that is,
i i i
When using a conjunction, A B , the interpretation of the if-then rules is it is true that A and B simultaneously hold. This relationship is symmetric and can be inverted. For simplicity, in this text we restrict ourselves to the Mamdani (conjunction) method. The relation R is computed by the minimum ( ) operator:
Ri = Ai Bi ;
R=
Now the entire rule base is encoded in the fuzzy relation R and the output of the linguistic model can be computed by the relational max-min composition ( ):
i=1
Ri ;
that is,
(3)
y = x R: ~ ~
6
(4)
Example 2.2 Let us compute the fuzzy relation for the linguistic model of Example 2.1. First we discretize the input and output domains, for instance: X = 0; 1; 2; 3 and Y = 0; 25; 50; 75; 100 . The (discrete) membership functions are given in Tab. 3 for the antecedent linguistic terms, and in Tab. 4 for the consequent terms.
Table 3. Antecedent membership functions. linguistic term Low OK High 0 1.0 0.0 0.0 domain element 1 2 0.6 0.0 0.4 1.0 0.0 0.1 3 0.0 0.4 1.0
Table 4. Consequent membership functions. linguistic term Low High 0 1.0 0.0 domain element 25 50 75 1.0 0.6 0.0 0.0 0.3 0.9 100 0.0 1.0
The fuzzy relations Ri corresponding to the individual rule, can now be computed by using eq. (2). For rule 1 , we have R1 = Low Low, for rule 2 , we obtain R2 = OK High, and nally for rule 3 , R3 = High Low. The fuzzy relation R, which represents the entire rule base, is the union (element-wise maximum) of the relations Ri :
0 0 0 0
3 7 7 7 5 3 7 7 7 5 3 7 7 7 5
3 7 7 7 5
(5)
0 0 0 0
0 0 0 0
Graphical visualization of these steps is given in Fig. 3. In this gure, the relations are computed on a ner discretization by using the membership functions of Fig. 2. This example can be run under MATLAB by calling the script ling. See the le ling.m for details of the implementation. Now consider an input fuzzy set to the model, A0 = 1; 0:6; 0:3; 0 , which can be denoted as Somewhat Low ow rate, as it is close to Low but does not equal Low. The result of max-min composition is the fuzzy set B 0 = 1; 1; 0:6; 0:4; 0:4 , which gives the expected approximately Low heating power. For A0 = 0; 0:2; 1; 0:2 (approximately OK), we obtain B 0 = 0:2; 0:2; 0:3; 0:9; 1 , i.e., approximately High heating power. Verify these results as an exercise. 7
R2 = OK and High
1 0.5 0 100 50 y 0 0 1 x 2 3
1 0.5 0 100 50 y 0 0 1 x 2 3
R = R1 or R2 or R3
1 0.5 0 100 50 y 0 0 1 x 2 3
1 0.5 0 100 50 y 0 0 1 x 2 3
Figure 3. Fuzzy relations R1 , R2 , R3 corresponding to the individual rules, and the aggregated relation R corresponding to the entire rule base. Because of the relational representation, the linguistic fuzzy model is sometimes called a fuzzy graph. Figure 4 shows the fuzzy graph for our example (contours of R, where the shading corresponds to the membership degree). The relational composition (4) can be regarded as a function evaluation on the fuzzy graph, see also Fig. 1.
2.1.2
In the previous section, we have seen that a rule base can be represented as a fuzzy relation. The output of a rule-based fuzzy model is then computed by the max-min relational composition. In this section, it will be shown that the relational calculus can be by-passed. This is advantageous, as the discretization of domains and storing of the relation R can be avoided. To show this, suppose ~ an input fuzzy value x = A0 , for which the output value B 0 is given by the relational composition:
(6)
After substituting for R x; y from (3), the following expression is obtained:
(7)
Since the max and min operation are taken over different domains, their order can be changed as follows:
(8)
100 80 60 40
20 0
0.5
1.5
2.5
Figure 4. A fuzzy graph for the linguistic model of Example 2.2. Darker shading corresponds to higher membership degree. The solid line is a possible crisp function representing a similar relationship as the fuzzy model.
Denote i = maxX A x Ai x the degree of fulllment of the ith rules antecedent. The output fuzzy set of the linguistic model is thus:
0
(9)
The entire algorithm, called the max-min or Mamdani inference, is summarized in Algorithm 2.1 and visualized in Fig. 5.
Step 1 A1 A A2 A3 1 2 B1
Step 2 B2 B3
B 1 B 2
3 x
B 3 y
model:
Step 3
data:
Algorithm 2.1 Mamdani (max-min) inference 1. Compute the degree of fulllment by: i = maxX A x Ai x ; 1 i K : Note that for a singleton fuzzy set (A x = 1 for x = x0 and A x = 0 otherwise) the equation for i simplies to i = Ai x0 .
0 0 0
3.
^ B y; y 2 Y; 1 i K : Aggregate the output fuzzy sets Bi0 : B y = max iK B y; y 2 Y :
i
0
Example 2.3 Let us take the input fuzzy set A0 = 1; 0:6; 0:3; 0 from Example 2.2 and compute the corresponding ouput fuzzy set by the Mamdani inference method. Step 1 yields the following degrees of fulllment:
= max A x ^ A1 x = max 1; 0:6; 0:3; 0 ^ 1; 0:6; 0; 0 = 1; X 2 = max A x ^ A2 x = max 1; 0:6; 0:3; 0 ^ 0; 0:4; 1; 0:4 = 0:4; X 3 = max A x ^ A3 x = max 1; 0:6; 0:3; 0 ^ 0; 0; 0:1; 1 = 0:1 : X
1
0 0 0
B0 = B0 = B0 =
1 2 3
1 2 3
^ B = 1 ^ 1; 1; 0:6; 0; 0 = 1; 1; 0:6; 0; 0 ; ^ B = 0:4 ^ 0; 0; 0:3; 0:9; 1 = 0; 0; 0:3; 0:4; 0:4 ; ^ B = 0:1 ^ 1; 1; 0:6; 0; 0 = 0:1; 0:1; 0:1; 0; 0 :
1 2 3
which is identical to the result from Example 2.2. Verify the result for the second input fuzzy set as an exercise. From a comparison of the number of operations in examples 2.2 and 2.3, it may seem that the saving with the Mamdani inference method with regard to relational composition is not signicant. This is, however, only true for a rough discretization (such as the one used in Example 2.2) and for a small number of inputs (one in this case). Note that the Mamdani inference method does not require any discretization and thus can work with analytically dened membership functions. It also can make use of learning algorithms, as discussed in Section 3.3.3.
2.1.3
Multivariable systems
So far, the linguistic model was presented in a general manner covering both the SISO and MIMO cases. In the MIMO case, all fuzzy sets in the model are dened on vector domains by multivariate membership functions. It is, however, usually, more convenient to write the antecedent and consequent propositions as logical combinations of fuzzy propositions with univariate membership functions. Fuzzy logic operators, such as the conjunction, disjunction and negation (complement), can be used to combine the propositions. Furthermore, a MIMO model can be written as a set 10
of MISO models. Therefore, for the ease of notation, we will write the rules for MISO systems. Most common is the conjunctive form of the antecedent, which is given by:
: : : and xp is Aip then y is Bi ; i = 1; 2; : : : ; K : (16) Note that the above model is a special case of (1), as the fuzzy set Ai in (1) is obtained as the Cartesian product of fuzzy sets Aij : Ai = Ai Ai Aip . Hence, the degree of fulllment
1 1 1 2
Ri : If x is Ai
(17)
K = p Ni ; i where p is the dimension of the input space and Ni is the number of linguistic terms of the ith
=1
Other conjunction operators, such as the product, can be used. A set of rules in the conjunctive antecedent form divides the input domain into a lattice of fuzzy hyperboxes, parallel with the axes. Each of the hyperboxes is an Cartesian product-space intersection of the corresponding univariate fuzzy sets. This is shown in Fig. 6a. The number of rules in the conjunctive form, needed to cover the entire domain, is given by:
antecedent variable.
(a)
A23 A23
(b) x2 x2
(c)
A3 A1 A2 A4
x2
A22
A21
A21
A22
x1
A11 A12 A13 A11 A12
x1
A13
x1
Figure 6. Different partitions of the antecedent space. Gray areas denote the overlapping regions of the fuzzy sets.
By combining conjunctions, disjunctions and negations, various partitions of the antecedent space can be obtained, the boundaries are, however, restricted to the rectangular grid dened by the fuzzy sets of the individual variables, see Fig. 6b. As an example consider the rule antecedent covering the lower left corner of the antecedent space in this gure: If x1 is not A13 and x2 is A21 then
:::
(18)
The degree of fulllment of this rule is computed using the complement and intersection operators:
The antecedent form with multivariate membership functions (1) is the most general one, as there is no restriction on the shape of the fuzzy regions. The boundaries between these regions can be arbitrarily curved and opaque to the axes, as depicted in Fig. 6c. Also the number of fuzzy sets needed to cover the antecedent space may be much smaller than in the previous cases. Hence, for complex multivariable systems, this partition may provide the most effective representation. Note 11
that the fuzzy sets A1 to A4 in Fig. 6c still can be projected onto x1 and x2 to obtain an approximate linguistic interpretation of the regions described. Another way to reducing the complexity of multivariable fuzzy systems is the decomposition into subsystems with fewer inputs per rule base. The subsystems can be inter-connected in a at or hierarchical (multi-layer) structure. In such a case, an output of one rule base becomes an input to another rule base, as depicted in Fig. 7. This cascade connection will lead to the reduction of the total number of rules. As an example, suppose ve linguistic terms for each input. Using the conjunctive form, each of the two sub-rule bases will have 52 = 25 rules. This is a signicant saving compared to a single rule base with three inputs which would have 53 = 125 rules.
x1 x2 x3
rule base A
rule base B
2.1.4
Defuzzication
In many applications, a crisp output y is desired. To obtain a crisp value, the output fuzzy set must be defuzzied. With the Mamdani inference scheme, the center of gravity (COG) defuzzication method is used. This methods computes the y coordinate of the center of gravity of the area under the fuzzy set B 0 :
y0 = cogB 0 =
PF
PF
where F is the number of elements yj in Y . Continuous domain Y thus must be discretized to be able to compute the center of gravity. Example 2.4 Consider the output fuzzy set B 0 = 0:2; 0:2; 0:3; 0:9; 1 from Example 2.2, where the output domain is Y = 0; 25; 50; 75; 100 . The defuzzied output obtained by applying formula (19) is:
(19)
The heating power of the burner, computed by the fuzzy model, is thus 72.12 W.
2.1.5
Singleton model
A special case of the linguistic fuzzy model is obtained when the consequent fuzzy sets Bi are singleton fuzzy sets. These sets can be represented simply as real numbers bi , yielding the following rules: ~ Ri : If x is Ai then y = bi ; i = 1; 2; : : : ; K : (20) 12
This model is called the singleton model. A simplied inference/defuzzication method is usually used with this model: P
y=
PK
K i=1 i bi i=1 i
(21)
This defuzzication method is called the fuzzy mean. The singleton fuzzy model belongs to a general class of general function approximators, called the basis functions expansion (Friedman, 1991) taking the form:
y=
K X i=1
i xbi :
(22)
Most structures used in nonlinear system identication, such as articial neural networks, radial basis function networks, or splines, belong to this class of systems. Connections between these types of models have been investigated (Jang and Sun, 1993; Brown and Harris, 1994). In the singleton model, the basis functions i x are given by the (normalized) degrees of fulllment of the rule antecedents, and the constants bi are the consequents. Multilinear interpolation between the rule consequents is obtained if the antecedent membership functions are trapezoidal, pairwise overlapping and the membership degrees sum up to one for each domain element, the product operator is used to represent the logical and connective in the rule antecedents. The input-output mapping of the singleton model is then piecewise (multi-)linear, as shown in Fig. 8a.
b2
y
b4 b4 b3 b3
y
y = kx + q
b2 b1
b1 A1
1
y = f (x)
A2 A3
x A4
A1
1
A2
A3
x A4
a1
a2
a3
(a)
(b)
a4
Figure 8. Singleton model with triangular or trapezoidal membership functions results in a piecewise linear input-output maping (a), of which a linear mapping is a special case (b). Clearly, a singleton model can also represent any linear mapping of the form:
y = pT x + q =
p X i=1
pi xi + q :
(23)
In this case, the antecedent membership functions must be triangular. The consequent singletons can be computed by evaluating the desired mapping (23) for the cores aij of the antecedent fuzzy 13
sets Aij :
bi =
p X j =1
pj aij + q :
(24)
This situation is depicted in Fig. 8b. This property is useful, as the (singleton) fuzzy model can always be initialized such that it mimics a given (perhaps inaccurate) linear model and can later be optimized.
Ri : If x is Ai then yi = fi x;
i = 1; 2; : : : ; K :
(25)
Ri : If x is Ai then yi = aT x + bi; i
i = 1; 2; : : : ; K;
(26)
The inference formula of the TS model is a straightforward extension of the singleton model inference (21): P P
y=
K i=1 i yi PK i=1 i
(27)
When the antecedent fuzzy sets dene distinct but overlapping regions in the antecedent space and the parameters ai and bi correspond to a local linearization of a nonlinear function, the TS model can be regarded as a smoothed piece-wise approximation of that function, see Fig. 9.
2.2.2
The afne TS model can be regarded as a quasi-linear system (i.e., a linear system with inputdependent parameters). To see this, denote the normalized degree of fulllment by
i x = i x=
14
K X j =1
j x :
(28)
b1
y=
x+
a2
x+
y=
b2
y = a3 x + b3
a1
Small
Medium
Large
Figure 9. TakagiSugeno fuzzy model as a smoothed piece-wise linear approximation of a nonlinear function. Here we write i x explicitely as a function x to stress that the TS model is a quasi-linear model of the following form:
y=
The parameters bi , i.e.:
K X i=1
T i xai
x+
K X i=1
i xbi
= aT xx + bx :
K X i=1
(29)
ax, bx are convex linear combinations of the consequent parameters ai and ax =
K X i=1 i xai ;
bx =
i xbi :
(30)
In this sense, a TS model can be regarded as a mapping from the antecedent (input) space to a convex region (polytope) in the space of the parameters of a quasi-linear system, as schematically depicted in Fig. 10.
Parameter space Antecedent space
Rules
Medium
Big
Polytope a2 a1
x2 x1
Small
Small
Medium
Big
Figure 10. A TS model with afne consequents can be regarded as a mapping from the antecedent space to the space of the consequent parameters. This property facilitates the analysis of TS models in a framework similar to that of linear systems. Methods have been developed to design controllers with desired closed loop characteristics (Filev, 1996) and to analyze their stability (Tanaka and Sugeno, 1992; Zhao, 1995; Tanaka, et al., 1996). 15
where xk and uk are the state and the input at time k , respectively, and f is a static function, called the state-transition function. Fuzzy models of different types can be used to approximate the state-transition function. As the state of a process is often not measured, input-output modeling is usually applied. The most common is the NARX (Nonlinear AutoRegessive with eXogenous input) model:
yk +1 = f yk; yk , 1; : : : ; yk , ny +1; uk; uk , 1; : : : ; uk , nu +1 : (32) Here y k ; : : : ; y k , ny + 1 and uk ; : : : ; uk , nu + 1 denote the past model outputs and inputs respectively and ny , nu are integers related to the model order (usually selected by the user).
For example, a linguistic fuzzy model of a dynamic system may consist of rules of the following form:
Ri : If yk is Ai and yk , 1 is Ai and; : : : yk , n + 1 is Ain and uk is Bi and uk , 1 is Bi and; : : : ; uk , m + 1 is Bim then y k + 1 is Ci :
1 2 2
(33)
In this sense, we can say that the dynamic behavior is taken care of by external dynamic lters added to the fuzzy system Fig. 11. In (33), the input dynamic lter is a simple generator of the lagged inputs and outputs, and no output lter is used.
Input
Dynamic filter
Numerical data
Knowledge Base
Output
Dynamic filter
Numerical data
Rule Base
Data Base
Fuzzifier
Fuzzy Set
Fuzzy Set
Defuzzifier
Figure 11. A generic fuzzy system with fuzzication and defuzzication units and external dynamic lters. Since the fuzzy models can approximate any smooth function to any degree of accuracy (Wang, 1992), models of type (33) can approximate any observable and controllable modes of a large class of discrete-time nonlinear systems (Leonaritis and Billings, 1985).
knowledge, heuristics), which usually originates from experts, i.e., process designers, operators, etc. In this sense, fuzzy models can be regarded as simple fuzzy expert systems (Zimmermann, 1987). For many processes, data are available as records of the process operation or special identication experiments can be designed to obtain the relevant data. Building fuzzy models from data involves methods based on fuzzy logic and approximate reasoning, but also ideas originating from the eld of neural networks, data analysis and conventional systems identication. The acquisition or tuning of fuzzy models by means of data is usually termed fuzzy identication. Two main approaches to the integration of knowledge and data in a fuzzy model can be distinguished: 1. The expert knowledge expressed in a verbal form is translated into a collection of if then rules. In this way, a certain model structure is created. Parameters in this structure (membership functions, consequent singletons or parameters) can be ne-tuned using inputoutput data. The particular tuning algorithms exploit the fact that at the computational level, a fuzzy model can be seen as a layered structure (network), similar to articial neural networks, to which standard learning algorithms can be applied. This approach is usually termed neuro-fuzzy modeling (Jang, 1993; Jang and Sun, 1993; Pedrycz, 1995). 2. No prior knowledge about the system under study is initially used to formulate the rules, and a fuzzy model is constructed from data. It is expected that the extracted rules and membership functions can provide an a posteriori interpretation of the systems behavior. An expert can confront this information with his own knowledge, can modify the rules, or supply new ones, and can design additional experiments in order to obtain more informative data. These techniques, of course, can be combined, depending on the particular application. In the sequel, we describe the main steps and choices in the knowledge-based construction of fuzzy models, and the main techniques to extract or ne-tune fuzzy models by means of data.
17
depicted in Fig. 12a. A less complex model, such as a second-order polynomial, will do much better in this case, see Fig. 12b. See the function polynom.m.
Figure 12. Approximation of a sinusoidal function (dashed-dotted line) by two models of a different complexity (solid line).
In fuzzy models, structure selection involves the following choices: Input and output variables. With complex systems, it is not always clear which variables should be used as inputs to the model. In the case of dynamic systems, one also must estimate the order of the system. For the input-output NARX model (32) this means to dene the number of input and output lags ny and nu , respectively. Prior knowledge, insight in the process behavior and the purpose of modeling are the typical sources of information for this choice. Sometimes, automatic data-driven selection can be used to compare different choices in terms of some performance criteria. Structure of the rules. This choice involves the model type (linguistic, singleton, TakagiSugeno) and the antecedent form (refer to Section 2.1.3). Important aspects are the purpose of modeling and the type available knowledge. Number and type of membership functions for each variable. This choice determines the level of detail (granularity) of the model. Again, the purpose of modeling and the detail of available knowledge, will inuence this choice. Automated, data-driven methods can be used to add or remove membership functions from the model. Type of the inference mechanism, connective operators, defuzzication method. These choices are restricted by the type of fuzzy model (Mamdani, TS). Within these restrictions, however, some freedom remains, e.g., as to the choice of the conjunction operators, etc. To facilitate data-driven optimization of fuzzy models (learning), differentiable operators (product, sum) are often preferred to the standard min and max operators. After the structure is xed, the performance of a fuzzy model can be ne-tuned by adjusting its parameters. Tunable parameters of linguistic models are the parameters of antecedent and consequent membership functions (determine their shape and position) and the rules (determine 18
the mapping between the antecedent and consequent fuzzy regions). Takagi-Sugeno models have parameters in antecedent membership functions and in the consequent functions (a and b for the afne TS model).
X = x1 ; : : : ; xN T ; y = y1 ; : : : ; yN T :
3.3.1 Least-squares estimation of consequents
(34)
Note that the defuzzication formulas of the singleton and TS models, equations (21) and (27), respectively, are linear in the consequent parameters, ai , bi . Hence, these parameters can be estimated from the available data by least-squares techniques. Denote ,i R N N the diagonal matrix having the normalized membership degree i xk of (28) as its k th diagonal element. By appending a unitary column to X, the extended matrix Xe = X; 1 is created. Further, denote X0 the matrix in R N KN composed of the products of matrices ,i and Xe
X0 = ,1 Xe ; ,2 Xe ; : : : ; ,K Xe :
19
(35)
The consequent parameters ai and bi are lumped into a single parameter vector
Given the data X, y, eq. (27) now can be written in a matrix form, y = X0 + : From linear algebra (Strang, 1976) we know that this set of equations can be solved for the parameter by:
= aT ; b ; aT ; b ; : : : ; aT ; bK T : K
1 1 2 2
2 RK p
+1
: (36)
= X0 T X0 , X0 T y :
1
(37)
This is an optimal least-squares solution which gives the minimal prediction error, and as such is suitable for prediction models. At the same time, however, it may bias the estimates of the consequent parameters as parameters of local models. If an accurate estimate of local model parameters is desired, a weighted least-squares approach applied per rule may be used:
aT ; bi T = XT ,i Xe ,1 XT ,i y : i e e
(38)
In this case, the consequrent parameters of individual rules are estimated independently of each other, and therefore are not biased by the interactions of the rules. By omitting ai for all 1 i K , equations (37) and (38) directly apply to the singleton model (20).
3.3.2
Template-based modeling
With this approach, the domains of the antecedent variables are simply partitioned into a specied number of equally spaced and shaped membership functions. The rule base is then established to cover all the combinations of the antecedent terms. The consequent parameters are estimated by the least-squares method. Example 3.2 Consider a nonlinear dynamic system described by a rst-order difference equation:
(39)
We use a stepwise inputs signal to generate with this equation a set of 300 inputoutput data pairs (see Fig. 14a). Suppose that it is known that the system is rst order and that the nonlinearity of the system is only caused by y , the following TS rule structure can be chosen: If y k is Ai then y k + 1 = ai y k + bi uk ; (40)
Assuming that no further prior knowledge is available, seven equally spaced triangular membership functions, A1 to A7 , are dened in the domain of y k , as shown in Fig. 13a. The consequent parameters were estimated by the least-squares method as described in Section 3.3.1. Figure 13b gives a plot of the parameters ai , bi against the cores of the antecedent fuzzy sets Ai . Also plotted is the linear interpolation between the parameters (dashed line) and the true system nonlinearity (solid line). The interpolation between ai and bi is linear, since the membership functions are piece-wise linear (triangular). One can observe that the dependence of the consequent parameters on the antecedent variable approximates quite accurately the systems nonlinearity, which gives the model a certain transparency. The values of the parameters aT = 1:00; 1:00; 1:00; 0:97; 1:01; 1:00; 1:00 and bT = 0:01; 0:05; 0:20; 0:81; 0:20; 0:05; 0:01 T indicate the strong input nonlinearity and the linear dynamics as in (39). Validation of the model in simulation using a different data set is given in Fig. 14b. This example is implemented in the MATLAB function phdemo.m. 20
Parameters ai and bi
A1
A2
A3
A4
A5
A6
A7
a1
1
a2
a3
a4
a5
a6
a7
b4
0 -1.5
b1
-1.5
b2
-1
b3
-0.5 0
b5
0.5
b6
1
y(k)
b7
1.5
-1
-0.5
0.5
1
y(k)
1.5
Figure 13. (a) Equidistant triangular membership functions designed for the output yk; (b) comparison of the true system nonlinearity (solid line) and its approximation in terms of the estimated consequent parameters (dashed line).
y(k)
y(k)
50 100 150 200 250 300
1 0 1
u(k)
u(k)
50 100 150 200 250 300
1 0
1 0
50
100
150
200
250
300
(b) Validation.
Figure 14. Identication data set (a), and performance of the model on a validation data set (b). Solid line: process, dashed-dotted line: model.
The transparent local structure of the TS model facilitates the combination of local models obtained by parameter estimation and linearization of known mechanistic (white-box) models. If measurements are available only in certain regions of the process operating domain, parameters for the remaining regions can be obtained by linearizing a (locally valid) mechanistic model of the process. Suppose that this model is given by y = f x. Linearization around the center ci of the ith rules antecedent membership function yields the following parameters of the afne TS model (26):
df ai = dx
x=ci
; bi = f ci :
(41)
A drawback of the template-based approach is that the number of rules in the model may grow very fast. If no knowledge is available as to which variables cause the nonlinearity of the system, all the antecedent variables are usually partitioned uniformly, which leads to an exponential increase of the number of rules. 21
The complexity of the systems behavior is typically not uniform, which means that certain regions can be well approximated by a single model, while other regions require rather ne partitioning. In order to obtain an efcient representation with as few rules as possible, the membership functions must be placed such that they capture the non-uniform behavior of the system. This often requires that system measurements are also used to form the membership functions, as discussed in the following sections.
3.3.3
Neuro-fuzzy modeling
In Section 3.3.1 we have seen that parameters that are linearly related to the output can be (optimally) estimated by least-squares methods. In order to optimize also the parameters which are related to the output in a nonlinear way, training algorithms known from the area of neural networks can be employed. These techniques exploit the fact that, at the computational level, a fuzzy model can be seen as a layered structure (network), similar to articial neural networks. Hence, this approach is usually referred to as neuro-fuzzy modeling (Jang, 1993; Jang and Sun, 1993; Brown and Harris, 1994). Figure 15 gives an example of a singleton fuzzy model with two rules represented as a network. The rules are: If x1 is A11 and x2 is A21 then y
The nodes in the rst layer compute the membership degree of the inputs in the antecedent fuzzy sets. The product nodes in the second layer represent the antecedent conjunction operator. The normalization node N and the summation node realize the fuzzy-mean operator (27).
b1
b2
ij
(42)
the cij and ij parameters can be adjusted by gradient-descent learning algorithms, such as backpropagation (Wang, 1992). This allows for a ne-tuning of the fuzzy model to the available data in order to optimize its prediction accuracy. 22
3.3.4
Fuzzy clustering
Identication methods based on fuzzy clustering originate from data analysis and pattern recognition, where the concept of graded membership is used to represent the degree to which a given object, represented as a vector of features, is similar to some prototypical object. The degree of similarity can be calculated using a suitable distance measure. Based on the similarity, feature vectors can be clustered such that the vectors within a cluster are as similar (close) as possible, and vectors from different clusters are as dissimilar as possible. This idea of fuzzy clustering is depicted in Fig. 16a, where the data is clustered into two groups with prototypes v1 and v2 , using the Euclidean distance measure. The partitioning of the data is expressed in the fuzzy partition matrix whose elements ij are degrees of membership of the data points xi ; yi in a fuzzy cluster with prototypes vj .
B2
y
projection
curves of equidistance
v2
data
v2 v1
data
cluster centers
B1
v1
x
If x is A1 then y is B1 If x is A2 then y is B2
A1
Projected clusters
A1
A2
x
A2
x
Figure 16. Identication by fuzzy clustering. Fuzzy if-then rules can be extracted by projecting the clusters onto the axes. Figure 16a shows a data set with two apparent clusters and two associated fuzzy rules. The concept of similarity of data to a given prototype leaves enough space for the choice of an appropriate distance measure and of the character of the prototype itself. For example, the prototypes can be dened as linear subspaces (Bezdek, 1981), or the clusters can be ellipsoids with adaptively determined shape (Gustafson and Kessel, 1979), see Fig. 16b. From such clusters, the antecedent membership functions and the consequent parameters of the TakagiSugeno model can be extracted (Babuka and Verbruggen, s 1995):
= a1 x + b1 ; If x is A2 then y = a2 x + b2 :
Each obtained cluster is represented by one rule in the TakagiSugeno model. The membership functions for fuzzy sets A1 and A2 are generated by point-wise projection of the partition matrix onto the antecedent variables. These point-wise dened fuzzy sets are then approximated by a suitable parametric function. The consequent parameters for each rule are obtained as least-squares estimates (37) or (38). 23
If x is A1 then y
= f x dened piece-wise by: y = 0:25x; for x 3 y = x , 32 + 0:75; for 3 x 6 y = 0:25x + 8:25; for x 6
(43)
Figure 17a shows a plot of this function evaluated in 50 samples uniformly distributed over
y = 0.25x + 8.25
8 6 4 2 0 0
membership grade
y = (x3)^2 + 0.75
1 C1 C2 C3 C4
0.5
y = 0.25x 2 4 x 6 8 10
0 0
10
(b) Cluster prototypes (top) and the corresponding fuzzy sets (bottom).
Figure 17. Approximation of a static nonlinear function using a SugenoTakagi fuzzy model. The data xi ; yi i = 1; 2; : : : ; 50 was clustered into four hyperellipsoidal clusters. The upper plot of Fig. 17b shows the local linear models obtained through clustering, the bottom plot shows the corresponding fuzzy partition. In terms of the TS rules, the fuzzy model is expressed as:
R : If x is in C R : If x is in C
1 2
1 2
then y then y
R : If x is in C R : If x is in C
3 4
3 4
then y then y
Note that the consequents of 1 and 4 correspond almost exactly to the rst and third equation (43). Consequents of 2 and 3 are approximate tangents to the parabola dened by the second equation of (43) in the respective cluster centers. See the function clustdem.m.
R R
4 Overview of Applications
Fuzzy modeling and identication methodologies have been successfully used in a number of real-world applications. The TakagiSugeno model has often been employed in the modeling and identication of nonlinear technical processes from data. Examples are the modeling of a multilayer incinerator (Sugeno and Kang, 1986), a converter in a steel-making process (Takagi and Sugeno, 1985), or a glass-melting furnace (Zhao, et al., 1994). Biotechnology and ecology are typical examples of areas where conventional modeling techniques do not give satisfactory results. Fuzzy modeling has been used in a number of applications, such 24
as PenicillinG conversion (Babuka, et al., 1996), prediction of river water ow (Sugeno and s Tanaka, 1991), enzymatic soil removal in washing processes (Kaymak, 1994), or modeling of algae growth in lakes (Setnes, et al., 1997). Fuzzy models can be used in the design of automatic controllers, for instance in train operation (Terano, et al., 1994), combustion control (Sugeno and Kang, 1986), or pressure control (Babuka, s et al., 1996). Fuzzy models can also serve as decision support systems to assist operators (den Hartog, et al., 1997), or can be used to clone the operators based on traces of their behavior (Sugeno and Yasukawa, 1993).
Exercises
1. Explain the terms white-box (mechanistic) modeling and black-box modeling. 2. What is a fuzzy system? Give two examples of fuzzy systems. 3. Give an example of a linguistic if-then rule. What are the linguistic variables and linguistic terms (constants) in your example? 4. What do you understand under relational representation of a rule base? How is the fuzzy relation constructed? How is this relation used to derive an output fuzzy set, given the input fuzzy set? 5. Carry out the relational composition of MATLAB function.
Implement it in a
6. What is fuzzy inference? Give the formulas for the Mamdani (max-min) inference algorithm. Give the graphical representation for an example with two antecedent variables and one consequent variable, using the conjunctive form of rules. 7. Given are two rules: If x is A1 then y is B1 and If x is A2 then y is B2 , where A1 ~ ~ ~ ~ ~ ~ A2 = . Suppose a fuzzy input x = A1 . Is the output y = B1? Explain your answer.
6 ;
8. Give the formula for the center-of-gravity defuzzication. 9. Explain what the singleton fuzzy model is. Give the inference/defuzzication formula for this model. 10. Dene the afne TakagiSugeno fuzzy model. 11. Give an example of a SISO (single-input, single-output) rst-order NARX (Nonlinear AutoRegessive with eXogenous input) model with linguistic fuzzy rules. 12. What are the main steps of knowledge-based design of fuzzy models? 13. Which mathematical method can be used to estimate optimal consequent parameters of a TakagiSugeno fuzzy model? 14. What is a neuro-fuzzy network? Give a simple example.
26
A List of Symbols
A, B , K R
q p x; y; : : : ~~
X 0 1 a, b
x y uk, yk y
, X, Y
R
fuzzy sets number of rules in a rule base fuzzy relation fuzzy if-then rule matrix containing input data (regressors) matrix of appropriate dimensions with all entries equal to zero matrix of appropriate dimensions with all entries equal to one consequent parameters in a TS model number of outputs of a (static) fuzzy model number of inputs of a (static) fuzzy model linguistic variables (have fuzzy sets as their values) input vector output vector input and output of a dynamic system at time k , respectively vector containing regressand data degree of fulllment of a rule normalized degree of fulllment membership degree, membership function domains (universes) of variables x and y set of real numbers
Operators:
A
^ _
transpose of matrix X center of gravity defuzzication of fuzzy set A core of fuzzy set A support of fuzzy set A (fuzzy) set intersection (conjunction) (fuzzy) set union (disjunction) minimum, (fuzzy) conjunction, logical AND maximum, (fuzzy) disjunction, logical OR complement (negation) of A max-min composition
Abbreviations: AI COG MIMO MISO (N)ARX SISO articial intelligence center of gravity multipleinput, multipleoutput multipleinput, singleoutput (nonlinear) autoregressive with exogenous inputs singleinput, singleoutput
27
References
Babuka, R., H.A.B. te Braake, A.J. Krijgsman and H.B. Verbruggen (1996). Comparison of ins telligent control schemes for real-time pressure control. Control Engineering Practice 4(11), 15851592. Babuka, R., H.J.L. van Can and H.B. Verbruggen (1996). Fuzzy modeling of enzymatic s PenicillinG conversion. In Preprints 13th IFAC World Congress, Volume N, San Francisco, USA, pp. 479484. Babuka, R. and H.B. Verbruggen (1995). Identication of composite linear models via fuzzy s clustering. In Proceedings European Control Conference, Rome, Italy, pp. 12071212. Bezdek, J.C. (1981). Pattern Recognition with Fuzzy Objective Function. Plenum Press, New York. Brown, M. and C. Harris (1994). Neurofuzzy Adaptive Modelling and Control. New York: Prentice Hall. Buchanan, B.G. and E.H. Shortliffe (1984). Rule-Based Expert Systems. Readings (MA), USA: AddisonWesley. Driankov, D., H. Hellendoorn and M. Reinfrank (1993). An Introduction to Fuzzy Control. Springer, Berlin. Dubois, D. and H. Prade (1980). Fuzzy sets and systems: theory and applications, Volume 144 of Mathematics in science and engineering. Academic Press. Filev, D.P. (1996). Model based fuzzy control. In Proceedings Fourth European Congress on Intelligent Techniques and Soft Computing EUFIT96, Aachen, Germany. Friedman, J.H. (1991). Multivariate adaptive regression splines. The Annals of Statistics 19(1), 1141. Gustafson, D.E. and W.C. Kessel (1979). Fuzzy clustering with a fuzzy covariance matrix. In Proc. IEEE CDC, San Diego, CA, USA, pp. 761766. Hartog, den, M.H., R. Babuka, H.J.R. Deketh, M. Alvarez Grima, P.N.W. Verhoef and H.B. s Verbruggen (1997). Knowledge-based fuzzy model for performance prediction of a rockcutting trencher. International Journal of Approximate Reasoning 16(1), 4366. Jang, J.-S.R. (1993). ANFIS: Adaptive-network-based fuzzy inference systems. IEEE Transactions on Systems, Man & Cybernetics 23(3), 665685. Jang, J.-S.R. and C.-T. Sun (1993). Functional equivalence between radial basis function networks and fuzzy inference systems. IEEE Transactions on Neural Networks 4(1), 156159. Kaymak, U. (1994). Application of fuzzy methodologies to a washing process. Chartered designer thesis, Delft University of Technology, Control Lab., Faculty of El. Eng., Delft. Klir, G.J. and B. Yuan (1995). Fuzzy sets and fuzzy logic; theory and applications. Prentice Hall. Kosko, B. (1994). Fuzzy systems as universal approximators. IEEE Trans. Computers 43, 13291333. Leonaritis, I.J. and S.A. Billings (1985). Input-output parametric models for non-linear systems. International Journal of Control 41, 303344. Ljung, L. (1987). System Identication, Theory for the User. New Jersey: Prentice-Hall. Mamdani, E.H. (1977). Application of fuzzy logic to approximate reasoning using linguistic systems. Fuzzy Sets and Systems 26, 11821191. Patterson, D.W. (1990). Introduction to Articial Intelligence and Expert Systems. Englewoord Cliffs (NJ), USA: Prentice Hall. 28
Pedrycz, W. (1990). Relevancy of fuzzy models. Information Sciences 52, 285302. Pedrycz, W. (1993). Fuzzy Control and Fuzzy Systems (second, extended,edition). John Willey and Sons, New York. Pedrycz, W. (1995). Fuzzy Sets Engineering. Boca Raton, Fl.: CRC Press. Setnes, M., R. Babuka, H.B. Verbruggen, M.D. S nchez and H.F.P. van den Boogaard (1997). s a Fuzzy modeling and similarity analysis applied to ecological data. In Proceedings FUZZIEEE97, Barcelona, Spain, pp. 415420. Smets, P., E.H. Mamdani, D. Dubois and H. Prade (Eds.) (1988). Non-Standard Logics for Automated Reasoning. Academic Press, London. Strang, G. (1976). Linear Algebra and Its Applications. New York, U.S.A.: Academic Press. Sugeno, M. and G.T. Kang (1986). Fuzzy modelling and control of multilayer incinerator. Fuzzy Sets and Systems 18, 329. Sugeno, M. and K. Tanaka (1991). Successive identication of a fuzzy model and its application to prediction of a complex system. Fuzzy Sets and Systems 42, 315334. Sugeno, M. and T. Yasukawa (1993). A fuzzy-logic-based approach to qualitative modeling. IEEE Trans. Fuzzy Systems 1, 731. Takagi, T. and M. Sugeno (1985). Fuzzy identication of systems and its application to modeling and control. IEEE Trans. Systems, Man and Cybernetics 15(1), 116132. Tanaka, K., T. Ikeda and H.O. Wang (1996). Robust stabilization of a class of uncertain nonlinear systems via fuzzy control: Quadratic stability, H 1 control theory and linear matrix inequalities. IEEE Transactions on Fuzzy Systems 4(1), 113. Tanaka, K. and M. Sugeno (1992). Stability analysis and design of fuzzy control systems. Fuzzy Sets and Systems 45(2), 135156. Terano, T., K. Asai and M. Sugeno (1994). Applied Fuzzy Systems. Boston: Academic Press, Inc. Wang, L.-X. (1992). Fuzzy systems are universal approximators. In Proc. IEEE Int. Conf. on Fuzzy Systems 1992, San Diego, USA, pp. 11631170. Wang, L.-X. (1994). Adaptive Fuzzy Systems and Control, Design and Stability Analysis. New Jersey: Prentice Hall. Yager, R.R. and D.P. Filev (1994). Essentials of Fuzzy Modeling and Control. New York: John Wiley. Zadeh, L.A. (1973). Outline of a new approach to the analysis of complex systems and decision processes. IEEE Trans. Systems, Man, and Cybernetics 1, 2844. Zadeh, L.A. (1975). Calculus of fuzzy restrictions. In L.A. Zadeh, K.-S. Fu, K. Tanaka and M. Shimura (Eds.), Fuzzy Sets and Their Applications to Cognitive and Decision Processes, pp. 139. New York, USA: Academic Press. Zeng, X.J. and M.G. Singh (1995). Approximation theory of fuzzy systems MIMO case. IEEE Trans. Fuzzy Systems 3(2), 219235. Zhao, J. (1995). Fuzzy logic in modeling and control. PhD dissertation, CESAME, Louvain la Neuve, Belgium. Zhao, J., V. Wertz and R. Gorez (1994). A fuzzy clustering method for the identication of fuzzy models for dynamical systems. In 9th IEEE International Symposium on Intelligent Control, Columbus, Ohio, USA. Zimmermann, H.-J. (1987). Fuzzy Sets, Decision Making and Expert Systems. Boston: Kluwer Academic Publishers. 29
Subject index
A
antecedent, 5 applications, 24
L
least-squares method, 19 linguistic model, 5 term, 5 variable, 5 linguistic model, 5
B
black-box model, 1
C
conjunctive form, 11 consequent, 5
M
Mamdani inference, 8, 9 model, see linguistic model max-min inference, see Mamdani inference mechanistic model, 1 multivariable fuzzy system, 10
D
decomposition, 12 defuzzication, 12 center-of-gravity method, 12 fuzzy-mean method, 13 degree of fulllment, 9, 11 dynamic fuzzy system, 16, 20
N
NARX model, 16 neural network, 2, 4 neuro-fuzzy modeling, 17, 22 network, 22
F
rst-principle model, 1 fuzzy clustering, 23 expert system, 17 graph, 8 identication, 17 mean, 13 modeling, 2 number, 2 relation, 6 system, 2
P
physical model, 1
R
rule-based model, 5
S
semi-mechanistic model, 1 singleton model, 12
G
general function approximation, 4 generalization, 17 granularity, 18 gray-box model, 1
T
TakagiSugeno inference, 14 model, 5, 14, 20 template-based modeling, 20
H
hybrid model, 1
W
white-box model, 1
I
identication, 4 inference Mamdani, 8 TakagiSugeno, 14 intelligent modeling, control, 1 30
31