0% found this document useful (0 votes)

22 views

Adversarial Search

This document discusses adversarial search and game playing. It begins by defining games as multi-agent environments that can be either competitive or cooperative. For competitive games, adversarial search is required to consider the actions of other agents. It then discusses perfect play using minimax decisions and alpha-beta pruning to evaluate game trees. It notes that with resource limits, approximate evaluation functions must be used instead of exhaustive search to evaluate game states.

Uploaded by

aryatel26

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views

Adversarial Search

Uploaded by

aryatel26

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 42

Adversarial Search

Game Playing

Chapter 6
Outline
• Games
• Perfect Play
– Minimax decisions
– α-β pruning
• Resource Limits and Approximate
Evaluation
• Games of chance
Games

• Multi agent environments : any given agent will need

to consider the actions of other agents and how they
affect its own welfare.
• The unpredictability of these other agents can
introduce many possible contingencies

• There could be competitive or cooperative

environments

• Competitive environments, in which the agent’s goals

are in conflict require adversarial search – these
problems are called as games
What kind of games?
• Abstraction: To describe a game we must capture
every relevant aspect of the game. Such as:
– Chess
– Tic-tac-toe
– …
• Accessible environments: Such games are
characterized by perfect information
• Search: game-playing then consists of a search through
possible game positions
• Unpredictable opponent: introduces uncertainty thus
game-playing must deal with contingency problems

Slide adapted from Macskassy

Type of Games
Games
• In game theory (economics), any multi-agent environment (either
cooperative or competitive) is a game provided that the impact of
each agent on the other is significant*

• AI games are a specialized kind - deterministic, turn taking, two-

player, zero sum games of perfect information

– a zero-sum game is a mathematical representation of a situation in

which a participant's gain (or loss) of utility is exactly balanced by the
losses (or gains) of the utility of other participant(s)

• In our terminology – deterministic, fully observable environments

with two agents whose actions alternate and the utility values at the
end of the game are always equal and opposite (+1 and –1)
– If a player wins a game of chess (+1), the other player necessarily loses
(-1)

• * Environments with very many agents are best viewed as economies rather than
games
Deterministic Games
• Many possible formalizations, one is:
– States: S (start at s0)
– Players: P={1...N} (usually take turns)
– Actions: A (may depend on player / state)
– Transition Function: SxA →S
– Terminal Test: S → {t,f}
– Terminal Utilities: SxP → R

• Solution for a player is a policy: S → A

Games vs. search problems
• “Unpredictable" opponent  solution is a strategy
specifying a move for every possible opponent reply

• Time limits  unlikely to find goal, must approximate

• Plan of attack:
– Computer considers possible lines of play (Babbage, 1846)
– Algorithm for perfect play (Zermelo, 1912; Von Neumann, 1944)
– Finite horizon, approximate evaluation (Zuse, 1945; Wiener,
1948; Shannon, 1950)
– First chess program (Turing, 1951)
– Machine learning to improve evaluation accuracy (Samuel,
1952-57)
– Pruning to allow deeper search (McCarthy, 1956)
Deterministic Single-Player?
• Deterministic, single player,
perfect information:
– Know the rules
– Know what actions do
– Know when you win
– E.g. Freecell, 8-Puzzle, Rubik’s
cube
• … it’s just search!
• Slight reinterpretation:
– Each node stores a value: the
best outcome it can reach
– This is the maximal outcome of
its children (the max value)
– Note that we don’t have path
sums as before (utilities at end)
• After search, can pick move that
leads to best node

Slide adapted from Macskassy

Deterministic Two-Player
• E.g. tic-tac-toe, chess,
checkers
• Zero-sum games
– One player maximizes result
– The other minimizes result
• Minimax search
– A state-space search tree
– Players alternate
– Each layer, or ply, consists of a
round of moves
– Choose move to position with
highest minimax value = best
achievable utility against best play

Slide adapted from Macskassy

Searching for the next move
• Complexity: many games have a huge search space
– Chess: b = 35, m=100 nodes = 35 100
– if each node takes about 1 ns to explore then each move will
take about 1050 millennia to calculate.

• Resource (e.g., time, memory) limit: optimal solution

not feasible/possible, thus must approximate

• 1. Pruning: makes the search more efficient by

discarding portions of the search tree that cannot
improve quality result.

• 2. Evaluation functions: heuristics to evaluate utility of

a state without exhaustive search.

Slide adapted from Macskassy

Two-player Games
• A game formulated as a search problem:

Slide adapted from Macskassy

Example: Tic-Tac-Toe
The minimax algorithm
• Perfect play for deterministic environments with perfect
information

• Basic idea: choose move with highest minimax value

= best achievable payoff against best play

• Algorithm:
1. Generate game tree completely
2. Determine utility of each terminal state
3. Propagate the utility values upward in the three by applying
MIN and MAX operators on the nodes in the current level
4. At the root node use minimax decision to select the move with
the max (of the min) utility value

• Steps 2 and 3 in the algorithm assume that the opponent will

play perfectly.
Generate Game Tree
Minimax Example
Minimax value
• Given a game tree, the optimal strategy can be
determined by examining the minimax value of
each node (MINIMAX-VALUE(n))

• The minimax value of a node is the utility of

being in the corresponding state, assuming that
both players play optimally from there to the end
of the game

• Given a choice, MAX prefer to move to a state of

maximum value, whereas MIN prefers a state of
minimum value
Minimax: Recursive implementation
The Minimax Algorithm Properties
• Performs a complete depth-first exploration of the game
tree
• Optimal against a perfect player.
• Time complexity?
– O(bm)
• Space complexity?
– O(bm)
• For chess, b ~ 35, m ~ 100
– Exact solution is completely infeasible
– But, do we need to explore the whole tree?
• Minimax serves as the basis for the mathematical
analysis of games and for more practical algorithms
Resource Limits
• Cannot search to leaves
• Depth-limited search
– Instead, search a limited depth of tree
– Replace terminal utilities with an eval
function for non-terminal positions

• Guarantee of optimal play is gone

• More plies makes a BIG difference
• Example:
– Suppose we have 100 seconds, can
explore 10K nodes / sec
– So can check 1M nodes per move
– α-β reaches about depth 8 – decent
chess program

Slide adapted from Macskassy

α-β pruning
α-β pruning: example
α-β pruning: example
α-β pruning: example
α-β pruning: example
α-β pruning: example
α-β pruning: example
α-β pruning: example
α-β pruning: example
α-β pruning: General Principle
Why is it called α-β?
• α is the value of the
best (i.e., highest-
value) choice found
so far at any choice
point along the path
for max
• If v is worse than α,
max will avoid it
 prune that branch
• Define β similarly for
min
•
–
α-β pruning
• Alpha-beta search updates the values of α and β
as it goes along and prunes the remaining
branches at a node as soon as the value of the
current node is known to be worse than the
current α or β value for MAX or MIN,
respectively.

• The effectiveness of alpha-beta pruning is highly

dependent on the order in which the successors
are examined.
Properties of α-β
• Pruning does not affect final result

• Good move ordering improves effectiveness of pruning

• With "perfect ordering," time complexity = O(bm/2)

 doubles depth of search

• A simple example of the value of reasoning about which

computations are relevant (a form of metareasoning)
•
•

•
The α-β algorithm
The α-β algorithm
Imperfect Real-Time Decisions
Suppose we have 100 secs, explore 104
nodes/sec
 106 nodes per move
Standard approach:
• cutoff test:
e.g., depth limit (perhaps add quiescence search)
• evaluation function
= estimated desirability of position
* Replace the utility function by a heuristic evaluation
function EVAL, which gives an estimate of the
position’s utility
–
Evaluation Functions
• First proposed by Shannon in 1950
• The evaluation function should order the
terminal states in the same way as the true utility
function
• The computation must not take too long
• For non-terminal states, the evaluation function
should be strongly correlated with the actual
chances of winning
– Uncertainty introduced by computational limits
Evaluation Functions
Evaluation Functions
• Material value for each piece in chess
– Pawn: 1
– Knight: 3
– Bishop: 3
– Rook: 5
– Queen: 9
This can be used as weights and the number of each kind can be used as
features
• Other features
– Good pawn structure
– King safety

• These features and weights are not part of the rules of chess, they
come from playing experience
Cutting off search
MinimaxCutoff is identical to MinimaxValue except
1. Terminal? is replaced by Cutoff?
2. Utility is replaced by Eval

Does it work in practice?

bm = 106, b=35  m=4

4-ply lookahead is a hopeless chess player!

– 4-ply ≈ human novice
– 8-ply ≈ typical PC, human master
– 12-ply ≈ Deep Blue, Kasparov
–
•
Expectimax Search Trees
• What if we don’t know what the
result of an action will be? E.g.,
– In solitaire, next card is unknown
– In minesweeper, mine locations
– In pacman, the ghosts act randomly
– Games that include chance
• Can do expectimax search
– Chance nodes, like min nodes,
except the outcome is uncertain
– Calculate expected utilities
– Max nodes as in minimax search
– Chance nodes take average
(expectation) of value of children
Games : State-of-the-Art
• Checkers: Chinook ended 40-year-reign of human world champion
Marion Tinsley in 1994. Used an endgame database defining
perfect play for all positions involving 8 or fewer pieces on the
board, a total of 443,748,401,247 positions. Checkers is now
solved!
• Chess: Deep Blue defeated human world champion Gary Kasparov
in a six-game match in 1997. Deep Blue examined 200 million
positions per second, used very sophisticated evaluation and
undisclosed methods for extending some lines of search up to 40
ply. Current programs are even better, if less historic.
• Othello: In 1997, Logistello defeated human champion by six
games to none. Human champions refuse to compete against
computers, which are too good.
• Go: Human champions are beginning to be challenged by
machines, though the best humans still beat the best machines.
In Go, b > 300, so most programs use pattern knowledge bases to
suggest plausible moves, along with aggressive pruning.
• Backgammon: Neural-net learning program TDGammon one of
world’s top 3 players.

Numerical Reasoning Test Samples
100% (5)
Numerical Reasoning Test Samples
11 pages
Charm Casting Sheets
33% (3)
Charm Casting Sheets
3 pages
Impersonal Passive
100% (1)
Impersonal Passive
10 pages
Mock Exam For Cenni
No ratings yet
Mock Exam For Cenni
7 pages
Games
No ratings yet
Games
41 pages
AI Unit-3
No ratings yet
AI Unit-3
109 pages
06. Chapter. 06 - Adversarial Search and Games - No Embedded Videos
No ratings yet
06. Chapter. 06 - Adversarial Search and Games - No Embedded Videos
51 pages
SET394 - AI - Lecture 06 - Adversarial Search
No ratings yet
SET394 - AI - Lecture 06 - Adversarial Search
27 pages
AAI Lecture 7 Sp 25
No ratings yet
AAI Lecture 7 Sp 25
51 pages
Game Playing. Updated (3)
No ratings yet
Game Playing. Updated (3)
44 pages
Adversarial Search Two - Persons Game: Russel Norvig (Text) Book and Patrick Henry Winston (Reference Book)
No ratings yet
Adversarial Search Two - Persons Game: Russel Norvig (Text) Book and Patrick Henry Winston (Reference Book)
71 pages
AI-Lecture 6 (Adversarial Search)
No ratings yet
AI-Lecture 6 (Adversarial Search)
68 pages
CSC-411-AI-lec6-Adversarial Search
No ratings yet
CSC-411-AI-lec6-Adversarial Search
38 pages
Adversarial Search
No ratings yet
Adversarial Search
37 pages
Game Playing
No ratings yet
Game Playing
53 pages
ITSC6121 Lecture 4 -- Game Trees I
No ratings yet
ITSC6121 Lecture 4 -- Game Trees I
34 pages
GamePlaying_Minimax_Unit-2_SPS
No ratings yet
GamePlaying_Minimax_Unit-2_SPS
72 pages
4 Adversel Search Game Tree
No ratings yet
4 Adversel Search Game Tree
51 pages
Adversarial Search
No ratings yet
Adversarial Search
78 pages
Lecture 6 - minmax alpha beta
No ratings yet
Lecture 6 - minmax alpha beta
41 pages
Oradea: Bucharest Arad Craiova
No ratings yet
Oradea: Bucharest Arad Craiova
53 pages
Adversarial Search
No ratings yet
Adversarial Search
20 pages
Chapter3 - Search4
No ratings yet
Chapter3 - Search4
37 pages
Lecture11_AdversarialSearch
No ratings yet
Lecture11_AdversarialSearch
74 pages
Yapay Zeka - 8
No ratings yet
Yapay Zeka - 8
48 pages
Lecture 7
No ratings yet
Lecture 7
62 pages
Adversarial Search and Game Playing: Games
No ratings yet
Adversarial Search and Game Playing: Games
8 pages
ai_lect_05
No ratings yet
ai_lect_05
39 pages
6 Game
No ratings yet
6 Game
42 pages
Artificial Inteligence
No ratings yet
Artificial Inteligence
4 pages
06 Minimax
No ratings yet
06 Minimax
53 pages
Artificial Intelligence: Adversarial Search
No ratings yet
Artificial Intelligence: Adversarial Search
62 pages
3 GamePlaying - Minimax
No ratings yet
3 GamePlaying - Minimax
75 pages
Unit 2c Game Playing (Compatibility Mode)
No ratings yet
Unit 2c Game Playing (Compatibility Mode)
36 pages
Lecture05 AdversarialSearch
No ratings yet
Lecture05 AdversarialSearch
51 pages
Adversarial Search
No ratings yet
Adversarial Search
36 pages
AI Lec07 Adversarial Search
No ratings yet
AI Lec07 Adversarial Search
29 pages
Adversial Search
No ratings yet
Adversial Search
101 pages
2025-Lecture03-AdversarialSearch
No ratings yet
2025-Lecture03-AdversarialSearch
51 pages
Lecture 6 - Adversarial Search
No ratings yet
Lecture 6 - Adversarial Search
45 pages
Game Playing
No ratings yet
Game Playing
60 pages
6-GAME
No ratings yet
6-GAME
53 pages
Institute of Southern Punjab Multan: Syed Zohair Quain Haider Lecturer ISP Multan
No ratings yet
Institute of Southern Punjab Multan: Syed Zohair Quain Haider Lecturer ISP Multan
41 pages
CC511 Week 4
No ratings yet
CC511 Week 4
57 pages
Game Playing: Adversarial Search
No ratings yet
Game Playing: Adversarial Search
66 pages
Game Playing MINMAX Search, Alpha-Beta Pruning,.pdf
No ratings yet
Game Playing MINMAX Search, Alpha-Beta Pruning,.pdf
4 pages
Adversarial Search
No ratings yet
Adversarial Search
91 pages
Adveserial Search
No ratings yet
Adveserial Search
29 pages
SP14 CS188 Lecture 6 Adversarial Search
No ratings yet
SP14 CS188 Lecture 6 Adversarial Search
29 pages
cs188 sp23 Lec09
No ratings yet
cs188 sp23 Lec09
47 pages
AI All Units
No ratings yet
AI All Units
93 pages
06 Adversarialsearch
No ratings yet
06 Adversarialsearch
36 pages
L06 (Adversarial Search) Ori
No ratings yet
L06 (Adversarial Search) Ori
46 pages
Game Playing
No ratings yet
Game Playing
24 pages
Optimal Decision in Games
No ratings yet
Optimal Decision in Games
68 pages
Cs188 Lecture 6 - Adversarial Search - Print (Edx) (2PP)
No ratings yet
Cs188 Lecture 6 - Adversarial Search - Print (Edx) (2PP)
35 pages
AI Chapter05
No ratings yet
AI Chapter05
38 pages
Basic 05 Games
No ratings yet
Basic 05 Games
74 pages
Lec 04
No ratings yet
Lec 04
79 pages
Adversial Search
No ratings yet
Adversial Search
39 pages
Adversarial Search (Minimax, Alfa-Beta Algorithm)
No ratings yet
Adversarial Search (Minimax, Alfa-Beta Algorithm)
15 pages
Why Do AI Researchers Study Game Playing?
No ratings yet
Why Do AI Researchers Study Game Playing?
42 pages
Mathematical Chess
From Everand
Mathematical Chess
Dr George Ho
No ratings yet
Fun Online Games For Teens with Tips and Tricks: Ages 13 And Up: Games for Kids and Teens
From Everand
Fun Online Games For Teens with Tips and Tricks: Ages 13 And Up: Games for Kids and Teens
Baby Professor
No ratings yet
Push Relabel
No ratings yet
Push Relabel
70 pages
Advance Algorithm Introduction
No ratings yet
Advance Algorithm Introduction
71 pages
Introduction To GAs
No ratings yet
Introduction To GAs
28 pages
Introduction
No ratings yet
Introduction
20 pages
AI-SMPS-Game-Playing
No ratings yet
AI-SMPS-Game-Playing
113 pages
Programmer's Model of 8086
100% (3)
Programmer's Model of 8086
4 pages
AI-SMPS-Stochastic-Local-Search
No ratings yet
AI-SMPS-Stochastic-Local-Search
115 pages
AI SMPS AStar Variations
No ratings yet
AI SMPS AStar Variations
54 pages
AICE English Language_SummerAssignment
No ratings yet
AICE English Language_SummerAssignment
9 pages
Betegség, Baleset - A1-C2
No ratings yet
Betegség, Baleset - A1-C2
24 pages
How To Make Your Own Spell
No ratings yet
How To Make Your Own Spell
3 pages
Prefix Meaning Examples: Here Is A List of The Most Common Prefixes
No ratings yet
Prefix Meaning Examples: Here Is A List of The Most Common Prefixes
6 pages
TXN Date Value Date Description Ref No./Cheque No. Branch Code Debit Credit Balance
No ratings yet
TXN Date Value Date Description Ref No./Cheque No. Branch Code Debit Credit Balance
9 pages
(SCM2025) General Schedule
No ratings yet
(SCM2025) General Schedule
1 page
Reading: Follow These Steps As You Skim A Reading
No ratings yet
Reading: Follow These Steps As You Skim A Reading
4 pages
Mathematical Association of America
No ratings yet
Mathematical Association of America
6 pages
BIRKAT HaLevana
No ratings yet
BIRKAT HaLevana
12 pages
CLASS X ENG PB1 SET B
No ratings yet
CLASS X ENG PB1 SET B
6 pages
Large Open Pit Slope Stability
No ratings yet
Large Open Pit Slope Stability
53 pages
Conlang Exhibit Master File Text Rev
No ratings yet
Conlang Exhibit Master File Text Rev
73 pages
Theme Options
No ratings yet
Theme Options
12 pages
Misrepresentation-English Law: Terms of The Contract, and What Is The Effect of Such False Representations
No ratings yet
Misrepresentation-English Law: Terms of The Contract, and What Is The Effect of Such False Representations
40 pages
OceanofPDF - Com The War of The Worlds - HG Wells
100% (1)
OceanofPDF - Com The War of The Worlds - HG Wells
229 pages
How To Present Bhagavad-Gita As It Is
100% (1)
How To Present Bhagavad-Gita As It Is
4 pages
Prof Ed-5
No ratings yet
Prof Ed-5
3 pages
Taking Fear Out of Schools
No ratings yet
Taking Fear Out of Schools
133 pages
SUN ILP Plan Term - 2( Biweek - 1)
No ratings yet
SUN ILP Plan Term - 2( Biweek - 1)
13 pages
Chapter 10
No ratings yet
Chapter 10
19 pages
Ate 2634 Final Exam Question With Explanations of Answers Latest Update 2024 - 2025
No ratings yet
Ate 2634 Final Exam Question With Explanations of Answers Latest Update 2024 - 2025
6 pages
02 Question Paper English X
No ratings yet
02 Question Paper English X
8 pages
Chapter 12 Sec 3
No ratings yet
Chapter 12 Sec 3
12 pages
Indonesia Is One of The Countries That Is Located in Southeast Asia
No ratings yet
Indonesia Is One of The Countries That Is Located in Southeast Asia
2 pages
Gujarat Technological University: W.E.F. AY 2018-19
No ratings yet
Gujarat Technological University: W.E.F. AY 2018-19
4 pages
Rev. Rul. 81-277, 1981 C.B 14
No ratings yet
Rev. Rul. 81-277, 1981 C.B 14
3 pages

Adversarial Search

Uploaded by

Adversarial Search

Uploaded by

Adversarial Search

• Multi agent environments : any given agent will need

• There could be competitive or cooperative

• Competitive environments, in which the agent’s goals

Slide adapted from Macskassy

• AI games are a specialized kind - deterministic, turn taking, two-

– a zero-sum game is a mathematical representation of a situation in

• In our terminology – deterministic, fully observable environments

• Solution for a player is a policy: S → A

• Time limits  unlikely to find goal, must approximate

Slide adapted from Macskassy

Slide adapted from Macskassy

• Resource (e.g., time, memory) limit: optimal solution

• 1. Pruning: makes the search more efficient by

• 2. Evaluation functions: heuristics to evaluate utility of

Slide adapted from Macskassy

Slide adapted from Macskassy

• Basic idea: choose move with highest minimax value

• Steps 2 and 3 in the algorithm assume that the opponent will

• The minimax value of a node is the utility of

• Given a choice, MAX prefer to move to a state of

• Guarantee of optimal play is gone

Slide adapted from Macskassy

• The effectiveness of alpha-beta pruning is highly

• Good move ordering improves effectiveness of pruning

• With "perfect ordering," time complexity = O(bm/2)

• A simple example of the value of reasoning about which

Does it work in practice?

4-ply lookahead is a hopeless chess player!

You might also like