TEC-SS2025-02-foundations-annotations
Dirk Sudholt
Aims for today
Source: http://fcampelo.github.io/EC-Bestiary/
Which Metaheuristic Paradigm is the Best?
Past views of metaheuristics (see sketch):
Metaheuristics are good across all problems.
Only beaten by specialised algorithms on a few problems.
No Free Lunch Theorem [Wolpert and Macready, 1997, Droste et al., 2002]
Consider search algorithms for functions f ∈ F where F is closed under permutations.
Let T(A) be the average number of different search points sampled by A before an optimum is found (under the uniform distribution on F). Then for any two search algorithms A, B we have T(A) = T(B).
Building-block hypothesis
Crossover is effective because it combines good ‘building blocks’.
Mitchell, Forrest, and Holland [1992]: “We designed a problem with building blocks on which schema theory predicts: GAs outperform hill climbers.”
Forrest and Mitchell [1993]: “We ran experiments and found out: hill climbers outperform GAs.”
Conclusion
Need mathematical rigour – theorems and proofs!
Brief History of Runtime Analysis
From 1997: Ingo Wegener and members of his Chair (Thomas Jansen, Stefan Droste) in Dortmund
Collaborative Research Centre “Computational Intelligence” (12 years)
Treasure trove for foundations and methods: the book chapter by Doerr [2020].
Probabilities
Prob(A) is the probability of event A; Prob(Ā) = 1 − Prob(A)
Prob(A ∪ B) = Prob(A) + Prob(B) − Prob(A ∩ B) ≤ Prob(A) + Prob(B)
If A and B are independent, Prob(A ∩ B) = Prob(A) · Prob(B)
Conditional probabilities: Prob(A | B) = Prob(A ∩ B) / Prob(B)
Expectations
E(X) = Σ_x Prob(X = x) · x
If X only assumes values in N, E(X) = Σ_{x=1}^{∞} Prob(X ≥ x)
Linearity of expectation: E(X + Y) = E(X) + E(Y)
Law of total expectation: E(X) = E(X | A) · Prob(A) + E(X | Ā) · Prob(Ā)
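As a quick numerical check of the tail-sum formula above (this example is not part of the original slides), the following Python sketch estimates E(X) for a geometric waiting time both directly and via Σ_{x≥1} Prob(X ≥ x):

import random

# Monte-Carlo sanity check of E(X) = sum_{x>=1} Prob(X >= x) for an N-valued
# random variable, here a geometric waiting time with success probability p,
# so that the exact value is E(X) = 1/p.

def geometric(p, rng=random):
    """Number of independent trials with success probability p until the first success."""
    t = 1
    while rng.random() >= p:
        t += 1
    return t

p = 0.25
n_samples = 100_000
samples = [geometric(p) for _ in range(n_samples)]

# Direct estimate: E(X) ≈ average of the samples.
direct = sum(samples) / n_samples

# Tail-sum estimate: sum over x of the empirical Prob(X >= x).
tail_sum = sum(sum(1 for s in samples if s >= x) / n_samples
               for x in range(1, max(samples) + 1))

print(f"direct estimate of E(X): {direct:.3f}")
print(f"tail-sum estimate:       {tail_sum:.3f}   (exact value 1/p = {1/p})")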
Inequalities
For all x ∈ R, 1 + x ≤ e^x
For all n ∈ N, (1 − 1/n)^n ≤ 1/e ≤ (1 − 1/n)^{n−1}
Harmonic numbers H(n) := Σ_{i=1}^{n} 1/i satisfy ln(n) ≤ H(n) ≤ ln(n) + 1.
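The following Python snippet (added here, not part of the slides) spot-checks all three facts numerically:

import math

# Numerical spot-check of the inequalities listed above.

# 1 + x <= e^x for all real x (a few sample points)
for x in [-5.0, -0.5, 0.0, 0.5, 5.0]:
    assert 1 + x <= math.exp(x)

# (1 - 1/n)^n <= 1/e <= (1 - 1/n)^(n-1), checked for n = 2, ..., 999
for n in range(2, 1000):
    assert (1 - 1/n) ** n <= 1 / math.e <= (1 - 1/n) ** (n - 1)

# Harmonic numbers: ln(n) <= H(n) <= ln(n) + 1
H = 0.0
for n in range(1, 1000):
    H += 1 / n
    assert math.log(n) <= H <= math.log(n) + 1

print("all inequality checks passed")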
Task
Find a hidden target string.
target   ? ? ? ? ? ? ? ?
solution 1 1 1 1 0 0 1 0

Task
Find the all-ones string.
target   1 1 1 1 1 1 1 1
solution 1 0 1 0 1 0 0 1
Theorem
The expected running time of RLS on OneMax is at most n ln(n) + n.
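To make the statement concrete, here is a minimal Python sketch of RLS on OneMax; it assumes the textbook definition of RLS (flip exactly one uniformly chosen bit per step and accept the offspring if it is at least as good), since the algorithm's pseudocode is not part of this extract:

import math
import random

def one_max(x):
    """OneMax(x) = number of 1-bits in the bit string x."""
    return sum(x)

def rls_on_onemax(n, rng=random):
    """Run RLS on OneMax from a uniformly random start; return the number of
    iterations until the all-ones string is found."""
    x = [rng.randint(0, 1) for _ in range(n)]
    steps = 0
    while one_max(x) < n:
        steps += 1
        i = rng.randrange(n)            # pick one bit position uniformly at random
        y = x[:]
        y[i] = 1 - y[i]                 # flip exactly that bit
        if one_max(y) >= one_max(x):    # accept if not worse
            x = y
    return steps

n = 100
runs = [rls_on_onemax(n) for _ in range(50)]
print(f"average over 50 runs: {sum(runs) / len(runs):.1f}")
print(f"upper bound n ln(n) + n ≈ {n * math.log(n) + n:.1f}")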
Properties of the (1+1) EA with standard bit mutation (see the sketch after this list):
reflects basic principle of mutation and selection
stochastic hill climber
flips one bit in expectation
can mimic one step of RLS
can escape from local optima by flipping many bits
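A minimal Python sketch of the (1+1) EA with standard bit mutation, matching the properties listed above. The precise formulation on the original slides is not in this extract, so the details below (initialisation, acceptance of equally good offspring) follow the usual textbook variant, with OneMax as a placeholder fitness function:

import random

# (1+1) EA with standard bit mutation: each bit is flipped independently with
# probability 1/n, so one bit flips in expectation, and flipping k specific bits
# has probability (1/n)^k (1 - 1/n)^(n-k).

def standard_bit_mutation(x, rng=random):
    """Flip each bit of x independently with probability 1/len(x)."""
    n = len(x)
    return [1 - b if rng.random() < 1 / n else b for b in x]

def one_plus_one_ea(fitness, n, optimum_value, max_steps=10**6, rng=random):
    """(1+1) EA: one parent, one offspring per step, elitist replacement.
    Returns the number of steps until a point with fitness optimum_value is found."""
    x = [rng.randint(0, 1) for _ in range(n)]
    for step in range(1, max_steps + 1):
        y = standard_bit_mutation(x, rng)
        if fitness(y) >= fitness(x):    # accept if not worse (can mimic one RLS step)
            x = y
        if fitness(x) >= optimum_value:
            return step
    return max_steps

one_max = sum                            # OneMax(x) = number of 1-bits
print(one_plus_one_ea(one_max, n=100, optimum_value=100))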
[Figure: fitness-level partition of the search space into sets A1, A2, A3, A4, ordered by increasing fitness.]
OneMax(x) := Σ_{i=1}^{n} x_i counts the number of 1-bits in x.
LO(x) := Σ_{i=1}^{n} Π_{j=1}^{i} x_j counts the number of leading ones (e.g. LO(11101100) = 3).
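For concreteness, direct Python implementations of both benchmark functions (added here, not from the slides), checked on the example string 11101100:

def one_max(x):
    """OneMax(x): number of 1-bits in x."""
    return sum(x)

def leading_ones(x):
    """LO(x): length of the longest prefix of x consisting only of 1-bits."""
    lo = 0
    for bit in x:
        if bit != 1:
            break
        lo += 1
    return lo

x = [1, 1, 1, 0, 1, 1, 0, 0]   # the bit string 11101100
print(one_max(x))              # 5
print(leading_ones(x))         # 3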
[Figure: the Jump function with gap parameter k. Fitness is plotted against the number of 1-bits: fitness increases up to n − k 1-bits, then drops, so the optimum 1^n can only be reached by flipping the remaining k 0-bits in one mutation.]
Take s_0, ..., s_{n−1} as for OneMax and s_n = (1/n)^k · (1 − 1/n)^{n−k} ≥ 1/(e·n^k).
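Plugging these success probabilities into the fitness-level method yields the familiar upper bound. The calculation below is a sketch: it assumes, as in the usual OneMax fitness-level argument (not spelled out in this extract), that s_i ≥ (n − i)/(en) for 0 ≤ i ≤ n − 1.

E(T) ≤ Σ_{i=0}^{n} 1/s_i ≤ Σ_{i=0}^{n−1} en/(n − i) + e·n^k = e·n·H(n) + e·n^k = O(n log n + n^k).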
(1+1) EA Always Finds an Optimum
Theorem
(1+1) EA optimises every function in expected time at most n^n.
Fitness-level partition:
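The partition used on the original slide is not included in this extract; one standard way to obtain the bound is the trivial two-level partition sketched below (an assumption, not necessarily the slide's exact argument).

Let A_1 = {x : f(x) < f_max} and A_2 = {x : f(x) = f_max}. For any x ∈ A_1, fix a global optimum x* at Hamming distance d from x. Standard bit mutation turns x into x* with probability (1/n)^d · (1 − 1/n)^{n−d} ≥ n^{−n} for n ≥ 2, since every bit receives its correct value with probability at least min(1/n, 1 − 1/n) = 1/n. Hence s_1 ≥ n^{−n}, and the fitness-level method gives E(T) ≤ 1/s_1 ≤ n^n.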
Outlook
Methods for the analysis of RSHs (randomised search heuristics)
Fitness-level method (and extensions)
Drift analysis
Tail bounds, typical runs
Random walks
Design aspects
How useful are populations?
How to ensure diversity within the population?
How important is recombination?
Parallel variants of evolutionary algorithms
Parameter control: how to learn good parameters
Focus will be on
Single-objective, discrete, fixed-size problems
Runtime analysis (other theories are available)
Illustrative, easy to describe problems that we can understand
Conclusions
No Free Lunch theorems state that any two search heuristics have the same performance
▶ But the No Free Lunch scenario is neither realistic nor interesting.
▶ Need to consider specific problem classes for meaningful results.
Early approaches (like schema theory) lacked rigour and led to false claims.
Reviewed foundations and tools from probability theory
Runtime analysis
Seen a first runtime analysis: RLS cracks codes (optimises OneMax) on n bits in expected time O(n log n).
The fitness-level method is a simple method for obtaining upper bounds for the (1+1) EA.
Runtime bounds for the (1+1) EA on OneMax, LeadingOnes and Jump
The expected runtime of EAs with standard bit mutation is bounded by n^n.