Two-Stepped Evolutionary Algorithm and Its Application To Stability Analysis of Slopes
Two-Stepped Evolutionary Algorithm and Its Application To Stability Analysis of Slopes
Abstract: Based on genetic algorithm and genetic programming, a new evolutionary algorithm is developed to evolve mathematical
models for predicting the behavior of complex systems. The input variables of the models are the property parameters of the systems,
Downloaded from ascelibrary.org by Northeastern University on 02/21/13. Copyright ASCE. For personal use only; all rights reserved.
which include the geometry, the deformation, the strength parameters, etc. On the other hand, the output variables are the system
responses, such as displacement, stress, factor of safety, etc. To improve the efficiency of the evolution process, a two-stepped approach
is adopted; the two steps are the structure evolution and parameter optimization steps. In the structure evolution step, a family of model
structures is generated by genetic programming. Each model structure is a polynomial function of the input variables. An interpreter is
then used to construct the mathematical expression for the model through simplification, regularization, and rationalization. Furthermore,
necessary internal model parameters are added to the model structures automatically. For each model structure, a genetic algorithm is then
used to search for the best values of the internal model parameters in the parameter optimization step. The two steps are repeated until the
best model is evolved. The slope stability problem is used to demonstrate that the present method can efficiently generate mathematical
models for predicting the behavior of complex engineering systems.
DOI: 10.1061/共ASCE兲0887-3801共2004兲18:2共145兲
CE Database subject headings: Algorithms; Slope stability; Stability analysis.
Introduction havior of the systems is available. Most of the methods are based
on either statistical or neural network approaches.
The enormous social and economic losses associated with hazards Various statistical and regression analysis techniques can be
such as slope failures and landslides have attracted the attention used to perform systematic data analysis and to establish relations
of both engineers and researchers. Efforts have been made to for predicting the behavior of complex systems. For example, Sah
develop analytical and numerical models for predicting their oc- et al. 共1994兲 applied the maximum likelihood method to estimate
currences and scales so that appropriate actions can be taken to the factor of safety of slopes. Though numerous methods are
minimize the losses. Because of the variability of geomaterial, available 共Weiss and Kulikowski 1991兲, the results depend on the
insufficient information regarding the site, and incomplete under- users who have to make a decision on the mathematical expres-
standing of the failure mechanism, it is almost impossible to sions of the relationships to be considered as candidates in the
model the problems exactly. Assumptions as well as simplifica- search for the best solutions. In addition, these statistical methods
tions are made in the modeling so that the solutions are tractable. are often based on the assumptions that the data are normally
However, the accuracy of the models in predicting the behavior of distributed and the variables are independent. Unfortunately, geo-
the problems depends very much on the assumptions and simpli- technical problems are usually extremely complex, and these cri-
fications 共Hoek and Bray 1981兲. teria are not necessarily satisfied.
As an alternative, data-based modeling methods can be used to A computation intelligence approach 共Mavrovouniotis 1990兲,
develop input-output relationships for the analysis of complex such as the neural network and evolutionary computation tech-
systems if a sufficiently large amount of information on the be- niques, can overcome such limitations. It has been shown math-
ematically that a neural network is capable of learning continuous
1 nonlinear input-output mapping 共Cybenko 1989兲. Feng et al.
College of Resources and Civil Engineering, Northeastern Univ.,
Shenyang 110004, People’s Republic of China. 共1996兲 proved that neural networks can adequately predict the
2
Dept. of Civil Engineering, Univ. of Hong Kong, Pokfulam Rd., stability level of slopes. Though this technique can serve as a
Hong Kong. strong predictive model for finding complex relationships, it has
3 inherent limitations in handling the uncertainties of the model
College of Resources and Civil Engineering, Northeastern Univ.,
Shenyang 110004, People’s Republic of China. inputs and deciding the size as well as topology of the required
4
College of Resources and Civil Engineering, Northeastern Univ., network. Furthermore, such an approach will not produce explicit
Shenyang 110004, People’s Republic of China. relations; therefore, the model obtained gives little insight into the
5
Dept. of Civil Engineering, Univ. of Hong Kong, Pokfulam Rd., underlying mechanisms of the problems.
Hong Kong. The evolutionary computation techniques 共Goldberg 1989;
Note. Discussion open until September 1, 2004. Separate discussions
Baäck 1995兲 provide robust alternatives. Based on the Darwinian
must be submitted for individual papers. To extend the closing date by
one month, a written request must be filed with the ASCE Managing theory of natural selection, they attempt to obtain the best solution
Editor. The manuscript for this paper was submitted for review and pos- by carrying out global optimization. They use suitable coding to
sible publication on July 18, 2002; approved on January 2, 2003. This represent possible solutions for a problem and guide the search by
paper is part of the Journal of Computing in Civil Engineering, Vol. 18, using genetic operators and the principle of ‘‘survival of the fit-
No. 2, April 1, 2004. ©ASCE, ISSN 0887-3801/2004/2-145–153/$18.00. test.’’ Of these algorithms, genetic algorithms 共Holland 1975;
Goldberg 1989兲 have established themselves as powerful search eling, ecological modeling, pattern recognition, and data mining
and optimization tools in problem solving and function optimiza- 共Sharman et al. 1995; Sen and Stoffa 1996; Shao and Yoshisada
tion 共Goldberg 1989, 1994兲, and they have been widely applied in 1996; McKay et al. 1997; Gray et al. 1998; Babovic 2000; Cao
civil and geotechnical engineering 共Feng et al. 1999, 2000; Javadi et al. 2000; Coello 2000; Yang and Soh 2000; Brameier and Ban-
et al. 1999; Nath et al. 1999; Simoes et al. 1999; Fishwick et al. zhaf 2001; Khu et al. 2001; Ross et al. 2001; Wigham and Reck-
2000; Deng and Lee 2001兲. Goh 共1999, 2000兲 used genetic algo- nagel 2001兲.
rithms to search for critical slip surfaces of slopes. However, ge- When standard genetic programming is used for model iden-
netic algorithms cannot be used to evolve complex models such tification, the models generated may include a number of internal
as equations. Therefore, genetic programming 共Koza 1992兲 has model parameters. These randomly generated parameters can
been developed to overcome this limitation. Genetic program- have significant effects on the performance of a model, because
ming can perform symbolic regression 共Koza 1992兲. It is advan-
their values affect the output of the model directly. As a poten-
tageous in that a priori assumptions and prior knowledge about
tially good model with favorable structure may be removed dur-
the structure and size of the solution are not necessary. Moreover,
ing the search, these internal model parameters have to be opti-
this technique can automatically select the system inputs to yield
a model structure that accurately fits the input-output response of mized in the model identification process. In many cases, one has
the system and provides a descriptive solution. Due to its advan- difficulties in the optimization process because the parameters
tages, genetic programming has been successfully used in the change dynamically with the evolution. Various methods have
study of engineering problems such as in civil engineering, pro- been proposed 共Sharman et al. 1995; McKay et al. 1997; Gray
cess modeling, autocontrol, signal processing, hydrological mod- et al. 1998; Cao et al. 2000兲 to overcome the difficulties by add-
therefore, no new model structure is produced. This operator is There are two mutation operators, namely, the shrink and re-
necessary, as it helps to preserve ‘‘good’’ model structures. place mutations. In shrink mutation, a subtree is deleted. By de-
On the other hand, crossover produces new model structures leting ‘‘⫹x 2 x 3 ’’ of Model 3, Model 7 is generated:
by swapping two randomly selected subtrees of two parents. For • Model 7: ⫹x 21 ⫺x 2 /x 1 x 3 .
example, by swapping the trees 共‘‘⫹x 1 x 2 /x 3 ’’ and ‘‘⫹x 1 x 3 ’’兲 be- On the other hand, replace mutation is conducted by replacing
tween Models 1 and 2, we can generate two new models 共Models a randomly selected node with a randomly selected node in the
5 and 6兲, namely building set. For example, Model 8 is created by replacing the ‘‘/’’
• Model 5: ⫹x 22 ⫺x 21 ⫹x 1 x 3 and of Model 4 with ‘‘⫻’’:
• Model 6: ⫹x 21 ⫹x 2 x 1 /x 3 ⫺x 2 /x 1 x 3 . • Model 8: ⫹x 21 ⫹x 23 ⫺x 2 x 3 /x 1 .
Control Parameters
Parameters have to be prechosen to control the search process,
E⫽
1
n
冑兺 n
1
共 z est⫺z target兲 2 (2)
and they include the population size for each generation, the num- where E⫽fitness; z est and z target⫽estimated and target value of the
ber of generations, the probabilities of genetic operators 共repro- output variable; and n⫽total number of cases in the fitness group.
duction, crossover, and mutation兲, the selection method, the ter- As the two-stepped approach is adopted in the proposed
mination conditions, etc. The population size and number of method, the optimal set of internal model parameters for each
generations are the primary parameters, whereas the others are the model structure is determined first. Comparing the various model
One can then search for factor of safety, F̂, in terms of the build-
ing set; that is
Implementation
Twenty-three dry 共13 failed and 10 stable兲 and 23 wet 共16 failed
Downloaded from ascelibrary.org by Northeastern University on 02/21/13. Copyright ASCE. For personal use only; all rights reserved.
and seven stable兲 cases of circular failure were collected from Sah
et al. 共1994兲. These data are presented in Table 1. Six cases,
which are indicated by ‘‘a’’ in the table, are randomly selected to
serve as the testing cases. The others are used as the fitness cases.
The fitness function is defined as the error between the factors
of safety obtained by limit equilibrium analysis and those calcu-
lated by the present method. Mathematically, it can be expressed
as
再 冑兺 n
1
Fig. 6. Correlation between factors of safety obtained by limit 共 F LEM⫺F̂ 兲 2
fitness⫽ n i⫽1 (8)
equilibrium and present approaches
M
where F LEM⫽safety factor obtained by the limit equilibrium ap-
popular one for computing such a factor 共Hoek and Bray 1981兲. It proach; n⫽total number of fitness cases; and M⫽prescribed rela-
makes use of the plastic limit theorem of solid mechanics to ana- tively large value, and it is necessary to allow for overflow re-
lyze the stability of any mass of soil or rock under the assumption sults. Based on the parameter settings given in Table 2, we can
that mass will fail along a potential slip surface. Although differ- perform the genetic search to obtain the best expression for com-
ent assumptions have been made in limit equilibrium analyses, puting the factor of safety.
they give similar results 共Nash 1987兲. A vast database of factors
of safety for slopes can be assimilated from either open published
literature or reports. Therefore, one can make use of such infor- Results
mation and apply the present method to evolve a mathematical
expression for obtaining the factor of safety of a slope without For the fitness cases shown in Table 1, 10 runs were performed
actually carrying out limit equilibrium analyses. with different randomly generated initial models so that each run
would take a different genetic path to evolve models. Fig. 4
shows the change of fitness values with evolution of a typical run.
Problem Statement Each point represents the fitness value of a model structure with
In this section, the proposed method is applied to slope stability optimized parameters. For clarity, only models with a fitness in
analysis, and an optimal mathematical expression for computing the range of 关0.12,0.6兴 are plotted. As the model structures have
the safety factor of slopes will be generated. Though the exact been generated randomly, the distribution of the fitness values in
relation is not yet known, it is believed that there must be some the initial generation varies significantly, and only a small number
mapping relationships between the safety factor and the geotech- of models have fitness within the chosen range 关Fig. 4共a兲兴.
nical and geometry properties of the slope. The factor of safety is As evolution proceeds, more and more models have a fitness
the output, whereas these geotechnical and geometrical properties within the range 共see results of the 20th, 40th, 60th, and 80th
are the input variables. Based on the knowledge of the behavior generations兲, and the fitness values converge. This indicated that
of slopes, the factor of safety can be expressed as more and more models with better fitness were generated and the
models with poor fitness were gradually eliminated through selec-
F⫽ f 共 ␥,c,,,H,r 兲 (3) tion and contest.
where F⫽factor of safety; ␥⫽unit weight; c⫽cohesive strength; The best fitness value and the average fitness value for all
⫽internal friction angle; ⫽slope angle; H⫽slope height; and models 共Fig. 5兲 show that the performance of the models improve
r⫽pore pressure ratio. gradually. Moreover, we can note that the changes are more sig-
These property variables constitute the terminal set. As the nificant in the early generations and become more gradual in the
terminal set includes angle variables, it is also necessary, in addi- latter generations. From the 40th generation to the 80th genera-
tion to the basic arithmetical functions 共i.e., ⫹, ⫺, ⫻, /兲, to in- tion, the process almost reaches the ‘‘steady state’’; that is, the
clude trigonometric functions 共sin, cos, tan, cot, sec, and cosec兲. average fitness oscillates slightly and the best fitness almost re-
Thus, the building set of the problem can be written as mains unchanged. Therefore, one can terminate the evolution at
the 80th generation.
B⫽ 兵 ⫹,⫺,⫻,/,sin,cos,tan,cot,sec,csc;␥,c,,,H,r 其 (4)
Translating the ‘‘best’’ tree-structure into a mathematical ex-
where the function set is pression, one can show that the factor of safety is given by
is close to one. The results show that the prediction is fairly rock mass parameters relating to the Three Gorges Project permanent
accurate. As a comparison, the results obtained from maximum shiplock using an intelligent displacement back analysis method.’’ Int.
likelihood estimation are also listed in Table 3. One can note that J. Rock Mech. Min. Sci., 37共7兲, 1039–1054.
the present method can attain a better approximation. Feng, X. T., Zhang, Z. Q., and Xu, P. 共1999兲. ‘‘Adaptive and intelligent
prediction of deformation time series of high rock excavation slope.’’
Trans. Nonferrous Met. Soc. China, 9共4兲, 842– 846.
Conclusions Fishwick, R. J., Liu, X. L., and Begg, D. W. 共2000兲. ‘‘Adaptive search in
discrete limit analysis problems.’’ Comput. Methods Appl. Mech.
Eng., 189共3兲, 931–942.
For highly nonlinear and extremely complex engineering prob-
Goh, A. T. C. 共1999兲. ‘‘Genetic algorithm search for critical slip surface
lems, a two-stepped evolutionary algorithm by combining genetic
in multiple-wedge stability analysis.’’ Can. Geotech. J., 36共2兲, 382–
programming and genetic algorithms has been developed to 391.
search for the best model structure and its internal model param- Goh, A. T. C. 共2000兲. ‘‘Search for critical slip circle using genetic algo-
eters. This method has improved the search process by simplify- rithms.’’ Civ. Eng. Environ. Syst., 17共3兲, 181–211.
ing the complex problem of evolving model structure and its in- Goldberg, D. E. 共1989兲. Genetic algorithms in search, optimization, and
ternal model parameters into two steps, that is, evolution of the machine learning, Addison-Wesley, Reading, Mass.
model structure and optimization of the internal model param- Goldberg, D. E. 共1994兲. ‘‘Genetic and evolutionary algorithms come of
eters. Using this method, an empirical equation was established age.’’ Commun. ACM, 37共3兲, 113–119.
for estimating the factor of safety of slopes from the input data Gray, G. J., Murray-Smith, D. J., Li, Y., Sharman, K. C., and Weinbren-
共geotechnical properties and the geometries of slopes兲. The results ner, T. 共1998兲. ‘‘Nonlinear model structure identification using genetic
show that there is good agreement between the safety factors programming.’’ Control Eng. Pract., 6, 1341–1352.
Hoek, E., and Bray, J. 共1981兲. Rock slope engineering, Institution of
obtained by the present method and the limit equilibrium method.
Mining and Metallurgy, London.
The present method automatically constructed models directly Holland, J. H. 共1975兲. Adaptation in natural and artificial system, Uni-
from the fitness case sets. Though it is an empirical approach, the versity of Michigan, Ann Arbor, Mich.
functions obtained can still give insight into the physical nature of Javadi, A. A., Farmani, R., Toropov, V. V., and Snee, C. P. M. 共1999兲.
complex systems. The present method can offer a useful alterna- ‘‘Identification of parameters for air permeability of shotcrete tunnel
tive to other data-based modeling methodologies and can perform lining using a genetic algorithm.’’ Comput. Geotech., 25共1兲, 1–24.
model input selection, model structure selection, and parameter Khu, S. T., Liong, S. Y., Babovic, V., Madsen, H., and Muttil, N. 共2001兲.
identification. It can be widely used in engineering analysis as a ‘‘Genetic programming and its application in real-time runoff fore-
useful tool for predicting the behavior of engineering systems. casting.’’ J. Am. Water Resour. Assoc., 37共2兲, 439– 451.
Koza, J. 共1992兲. Genetic programming: On the programming of comput-
ers by natural selection, MIT Press, Cambridge, Mass.
Mavrovouniotis, M. L. 共1990兲. Artificial intelligence in process engineer-
Acknowledgments ing, Academic, New York.
McKay, B., Willis, M., and Barton, G. 共1997兲. ‘‘Steady-state modelling of
This research is supported by the Fok Ying Tung Education Foun- chemical process systems using genetic programming.’’ Comput.
dation 共Grant No. 71408兲, the Teaching and Research Award Pro- Chem. Eng., 21共9兲, 981–996.
gram for Outstanding Young Teachers in Higher Education Insti- Nash, D. 共1987兲. ‘‘A comparative review of limit equilibrium methods of
tiutions of Ministry of Education, China, and the Research Grants stability analysis.’’ Slope stability, M. G. Anderson and K. S. Rich-
Council of HKSAR. The writers would like to thank Professor ards, eds., Wiley, New York.
Tingjie Li for his generous support. Nath, S. K., Chakraborty, S., Singh, S. K., and Ganguly, N. 共1999兲. ‘‘Ve-
locity inversion in cross-hole seismic tomography by counter-
propagation neural network, genetic algorithm, and evolutionary pro-
gramming techniques.’’ Geophys. J. Int., 138共1兲, 108 –124.
References Ross, B. J., Fueten, F., and Yashkir, D. Y. 共2001兲. ‘‘Automatic mineral
identification using genetic programming.’’ Mach. Vision Appl., 13共2兲,
Baäck, T. 共1995兲. Evolutionary algorithms in theory and practice, Oxford 61– 69.
University Press, New York. Sah, N. K., Sheorey, P. R., and Upadhyaya, L. N. 共1994兲. ‘‘Maximum
Babovic, V. 共2000兲. ‘‘Data mining and knowledge discovery in sediment likelihood estimation of slope stability.’’ Int. J. Rock Mech. Min. Sci.,
transport.’’ Comput. Aided Civ. Infrastruct. Eng., 15共5兲, 383–389. 31共1兲, 47–53.
Brameier, M., and Banzhaf, W. 共2001兲. ‘‘A comparison of linear genetic Sen, M. K., and Stoffa, P. L. 共1996兲. ‘‘Bayesian inference, Gibbs’ sam-
programming and neural networks in medical data mining.’’ IEEE pler, and uncertainty estimation in geophysical inversion.’’ Geophys.
Trans. Evol. Comput., 5共1兲, 17–26. Prospect., 44共2兲, 313–350.