ARTIFICIAL INTELLIGENCE
VI SEMESTER CSE
UNIT I
1.1 INTRODUCTION
1.1 Introduction to AI
1.1.1 What is artificial intelligence?
Artificial Intelligence is the branch of computer science concerned with making computers behave like humans.

(a) Acting humanly: The Turing Test approach
The computer passes the test if a human interrogator, after posing some written questions, cannot tell whether the written responses come from a person or not. To program a computer to pass the test, the computer needs to possess the following capabilities:
o Natural language processing to enable it to communicate successfully in English
o Knowledge representation to store what it knows or hears
o Automated reasoning to use the stored information to answer questions and to draw new conclusions
o Machine learning to adapt to new circumstances and to detect and extrapolate patterns
To pass the complete Turing Test, the computer will also need
o Computer vision to perceive objects, and
o Robotics to manipulate objects and move about.
(b) Thinking humanly: The cognitive modeling approach
We need to get inside the actual workings of the human mind:
(a) through introspection – trying to capture our own thoughts as they go by;
(b) through psychological experiments.
Allen Newell and Herbert Simon, who developed GPS, the "General Problem Solver", tried to compare the trace of its reasoning steps to traces of human subjects solving the same problems.
The interdisciplinary field of cognitive science brings together computer models from AI and experimental techniques from psychology to try to construct precise and testable theories of the workings of the human mind.
Neuroscience (1861–present)
Brains and digital computers perform quite different tasks and have different properties. Table 1.1 shows that there are 10,000 times more neurons in the typical human brain than there are gates in the CPU of a typical high-end computer. Moore's Law predicts that the CPU's gate count will equal the brain's neuron count around 2020.
Psychology (1879–present)
The origins of scientific psychology are traced back to the work of the German physiologist Hermann von Helmholtz (1821–1894) and his student Wilhelm Wundt (1832–1920).
In 1879, Wundt opened the first laboratory of experimental psychology at the University of Leipzig.
In the US, the development of computer modeling led to the creation of the field of cognitive science.
The field can be said to have started at a workshop held in September 1956 at MIT.
Computer engineering (1940–present)
For artificial intelligence to succeed, we need two things: intelligence and an artifact. The
computer has been the artifact of choice.
AI also owes a debt to the software side of computer science, which has supplied the operating systems, programming languages, and tools needed to write modern programs.
Linguistics (1957–present)
Modern linguistics and AI, then, were "born" at about the same time and grew up together, intersecting in a hybrid field called computational linguistics or natural language processing.
Herbert Gelernter's Geometry Theorem Prover (1959) was able to prove theorems that many students of mathematics would find quite tricky.
Lisp was invented by John McCarthy in 1958 while he was at the Massachusetts Institute of Technology (MIT). In 1963, McCarthy started the AI lab at Stanford.
Tom Evans's ANALOGY program (1968) solved geometric analogy problems that appear in IQ tests, such as the one in Figure 1.1.
Figure 1.1 Tom Evans's ANALOGY program could solve geometric analogy problems as shown.
AI becomes a science (1987–present)
In recent years, approaches based on hidden Markov models (HMMs) have come to dominate the area.
Speech technology and the related field of handwritten character recognition are already making the
transition to widespread industrial and consumer applications.
The Bayesian network formalism was invented to allow efficient representation of, and rigorous reasoning
with, uncertain knowledge.
The emergence of intelligent agents (1995–present)
One of the most important environments for intelligent agents is the Internet.

What can AI do today?
Autonomous planning and scheduling: NASA's Remote Agent program became the first on-board autonomous planning program to control the scheduling of operations for a spacecraft. It generated plans from high-level goals specified from the ground, and it monitored the operation of the spacecraft as the plans were executed, detecting, diagnosing, and recovering from problems as they occurred.
Game playing: IBM's Deep Blue became the first computer program to defeat the
world champion in a chess match when it bested Garry Kasparov by a score of 3.5 to 2.5 in
an exhibition match (Goodman and Keene, 1997).
Autonomous control: The ALVINN computer vision system was trained to steer a car
to keep it following a lane. It was placed in CMU's NAVLAB computer-controlled minivan
and used to navigate across the United States; for 2850 miles it was in control of steering the
vehicle 98% of the time.
Diagnosis: Medical diagnosis programs based on probabilistic analysis have been able
to perform at the level of an expert physician in several areas of medicine.
Logistics Planning: During the Persian Gulf crisis of 1991, U.S. forces deployed a
Dynamic Analysis and Replanning Tool, DART (Cross and Walker, 1994), to do automated
logistics planning and scheduling for transportation. This involved up to 50,000 vehicles,
cargo, and people at a time, and had to account for starting points, destinations, routes, and
conflict resolution among all parameters. The AI planning techniques allowed a plan to be
generated in hours that would have taken weeks with older methods. The Defense Advanced
Research Projects Agency (DARPA) stated that this single application more than paid back
DARPA's 30-year investment in AI.
Robotics: Many surgeons now use robot assistants in microsurgery. HipNav (DiGioia
et al., 1996) is a system that uses computer vision techniques to create a three-dimensional
model of a patient's internal anatomy and then uses robotic control to guide the insertion of a
hip replacement prosthesis.
Language understanding and problem solving: PROVERB (Littman et al., 1999) is a
computer program that solves crossword puzzles better than most humans, using constraints
on possible word fillers, a large database of past puzzles, and a variety of information sources
including dictionaries and online databases such as a list of movies and the actors that appear
in them.
1.2 Agents and Environments
An agent is anything that can be viewed as perceiving its environment through sensors and acting upon that environment through actuators.
o A human agent has eyes, ears, and other organs for sensors and hands, legs, mouth, and other body
parts for actuators.
o A robotic agent might have cameras and infrared range finders for sensors and various motors for
actuators.
o A software agent receives keystrokes, file contents, and network packets as sensory inputs and acts
on the environment by displaying on the screen, writing files, and sending network packets.
Figure 1.2 Agents interact with environments through sensors and actuators.
Percept
We use the term percept to refer to the agent's perceptual inputs at any given instant.
Percept Sequence
An agent's percept sequence is the complete history of everything the agent has ever perceived.
Agent function
Mathematically speaking, we say that an agent's behavior is described by the agent function
that maps any given percept sequence to an action.
Agent program
Internally, the agent function for an artificial agent will be implemented by an agent program. It is
important to keep these two ideas distinct. The agent function is an abstract mathematical
description; the agent program is a concrete implementation, running on the agent architecture.
To illustrate these ideas, we will use a very simple example: the vacuum-cleaner world
shown in Figure 1.3. This particular world has just two locations: squares A and B. The vacuum
agent perceives which square it is in and whether there is dirt in the square. It can choose to move
left, move right, suck up the dirt, or do nothing. One very simple agent function is the following: if
the current square is dirty, then suck, otherwise move to the other square. A partial tabulation of this
agent function is shown in Figure 1.4.
Figure 1.3 A vacuum-cleaner world with just two locations.
Agent function (partial tabulation):

Percept Sequence        Action
[A, Clean]              Right
[A, Dirty]              Suck

Figure 1.4 Partial tabulation of a simple agent function for the vacuum-cleaner world.
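As a small illustration (not taken from the text), the partial tabulation above can be written down directly as a lookup table in Python; the tuple encoding of percept sequences and the action strings are assumptions made for the sketch.

    # Illustrative sketch: the vacuum world's agent function as a lookup table
    # from percept sequences to actions.
    VACUUM_TABLE = {
        (('A', 'Clean'),): 'Right',
        (('A', 'Dirty'),): 'Suck',
        (('B', 'Clean'),): 'Left',
        (('B', 'Dirty'),): 'Suck',
        (('A', 'Clean'), ('A', 'Clean')): 'Right',
        (('A', 'Clean'), ('A', 'Dirty')): 'Suck',
        # ... the full table would continue for every longer percept sequence
    }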
Rational Agent
A rational agent is one that does the right thing: conceptually speaking, every entry in
the table for the agent function is filled out correctly. Obviously, doing the right thing is
better than doing the wrong thing. The right action is the one that will cause the agent to be
most successful.
Performance measures
A performance measure embodies the criterion for success of an agent's behavior. When
an agent is plunked down in an environment, it generates a sequence of actions according
to the percepts it receives. This sequence of actions causes the environment to go through a
sequence of states. If the sequence is desirable, then the agent has performed well.
Rationality
What is rational at any given time depends on four things:
o The performance measure that defines the criterion of success.
o The agent's prior knowledge of the environment.
o The actions that the agent can perform.
o The agent's percept sequence to date.
This leads to a definition of a rational agent:
For each possible percept sequence, a rational agent should select an action that is expected to maximize its performance measure, given the evidence provided by the percept sequence and whatever built-in knowledge the agent has.
An omniscient agent knows the actual outcome of its actions and can act accordingly; but
omniscience is impossible in reality.
Doing actions in order to modify future percepts, sometimes called information gathering, is an important part of rationality.
Our definition requires a rational agent not only to gather information, but also to learn as much as possible from what it perceives.
To the extent that an agent relies on the prior knowledge of its designer rather than on its own percepts, we say that the agent lacks autonomy. A rational agent should be autonomous: it should learn what it can to compensate for partial or incorrect prior knowledge.
Task environments
We must think about task environments, which are essentially the "problems" to which rational agents are
the "solutions."
Specifying the task environment
The rationality of the simple vacuum-cleaner agent needs specification of
o the performance measure
o the environment
o the agent's actuators and sensors.
PEAS
All these are grouped together under the heading of the task environment.
We call this the PEAS (Performance, Environment, Actuators, Sensors) description.
In designing an agent, the first step must always be to specify the task environment as fully
as possible.
Agent Type: Taxi driver
Performance Measure: Safe, fast, legal, comfortable trip, maximize profits
Environment: Roads, other traffic, pedestrians, customers
Actuators: Steering, accelerator, brake, signal, horn, display
Sensors: Cameras, sonar, speedometer, GPS, odometer, engine sensors, keyboard, accelerometer
Figure 1.5 PEAS description of the task environment for an automated taxi.
Episodic vs. sequential.
In an episodic task environment, the agent's experience is divided into atomic episodes, and the next episode does not depend on the actions taken in previous episodes. In sequential environments, on the other hand, the current decision could affect all future decisions. Chess and taxi driving are sequential.
Discrete vs. continuous.
The discrete/continuous distinction can be applied to the state of the environment, to the way time is handled, and to the percepts and actions of the agent. For example, a discrete-state environment such as a chess game has a finite number of distinct states. Chess also has a discrete set of percepts and actions. Taxi driving is a continuous-state and continuous-time problem: the speed and location of the taxi and of the other vehicles sweep through a range of continuous values and do so smoothly over time. Taxi-driving actions are also continuous (steering angles, etc.).
Single agent vs. multiagent.
An agent solving a crossword puzzle by itself is clearly in a single-agent environment, whereas an agent playing chess is in a two-agent environment.
As one might expect, the hardest case is partially observable, stochastic, sequential, dynamic, continuous, and multiagent.
Agent programs
The agent programs all have the same skeleton: they take the current percept as input from the
sensors and return an action to the actuators. Notice the difference between the agent program,
which takes the current percept as input, and the agent function, which takes the entire percept
history. The agent program takes just the current percept as input because nothing more is available
from the environment; if the agent's actions depend on the entire percept sequence, the agent will
have to remember the percepts.
function TABLE-DRIVEN-AGENT(percept) returns an action
    static: percepts, a sequence, initially empty
            table, a table of actions, indexed by percept sequences, initially fully specified
    append percept to the end of percepts
    action ← LOOKUP(percepts, table)
    return action

Figure 1.8 The TABLE-DRIVEN-AGENT program is invoked for each new percept and returns an action each time.
Drawbacks:
• Table lookup of percept-action pairs defining all possible condition-action rules necessary to interact in an environment
• Problems
– Too big to generate and to store (chess has about 10^120 states, for example)
– No knowledge of non-perceptual parts of the current state
– Not adaptive to changes in the environment; requires the entire table to be updated if changes occur
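A minimal Python sketch of the same idea (an illustration, not the text's own code): the agent remembers the percept sequence and looks it up in a table such as the VACUUM_TABLE sketch given earlier; the table format is an assumption.

    # Illustrative table-driven agent: 'table' maps complete percept sequences
    # (tuples of percepts) to actions.
    def make_table_driven_agent(table):
        percepts = []                              # percept sequence seen so far
        def agent(percept):
            percepts.append(percept)
            return table.get(tuple(percepts))      # LOOKUP(percepts, table)
        return agent

    # Example use: agent = make_table_driven_agent(VACUUM_TABLE)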
The simplest kind of agent is the simple reflex agent. These agents select actions on the basis of the current percept, ignoring the rest of the percept history. For example, the vacuum agent whose agent function is tabulated in Figure 1.4 is a simple reflex agent, because its decision is based only on the current location and on whether that location contains dirt.
o Select action on the basis of only the current percept, e.g. the vacuum agent.
o Large reduction in possible percept/action situations.
o Implemented through condition-action rules, e.g. if dirty then suck.
A Simple Reflex Agent: Schema
Figure 1.10 A simple reflex agent. It acts according to a rule whose condition matches
the current state, as defined by the percept.
Figure 1.11 The agent program for a simple reflex agent in the two-state vacuum environment. This program implements the agent function tabulated in Figure 1.4.
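Since the program of Figure 1.11 is not reproduced here, the following Python sketch shows the same behaviour; the percept encoding as a (location, status) pair is an assumption of the sketch.

    # Illustrative simple reflex vacuum agent: decides using only the current percept.
    def reflex_vacuum_agent(percept):
        location, status = percept        # e.g. ('A', 'Dirty')
        if status == 'Dirty':
            return 'Suck'
        elif location == 'A':
            return 'Right'
        else:
            return 'Left'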
Characteristics
o Only works if the environment is fully observable.
o Lacking history, they can easily get stuck in infinite loops.
o One solution is to randomize actions.
Model-based reflex agents
The most effective way to handle partial observability is for the agent to keep track of the part of the world it can't see now. That is, the agent should maintain some sort of internal state that depends on the percept history and thereby reflects at least some of the unobserved aspects of the current state.
Updating this internal state information as time goes by requires two kinds of knowledge to be encoded in the agent program. First, we need some information about how the world evolves independently of the agent; for example, that an overtaking car generally will be closer behind than it was a moment ago. Second, we need some information about how the agent's own actions affect the world; for example, that when the agent turns the steering wheel clockwise, the car turns to the right, or that after driving for five minutes northbound on the freeway one is usually about five miles north of where one was five minutes ago. This knowledge about "how the world works", whether implemented in simple Boolean circuits or in complete scientific theories, is called a model of the world. An agent that uses such a model is called a model-based agent.
Figure 1.13 A model-based reflex agent. It keeps track of the current state of the world using an internal model. It then chooses an action in the same way as the reflex agent.
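A minimal Python skeleton of this structure, offered only as an illustration: the update_state function stands in for the world model, and the rules argument (a dictionary from state descriptions to actions) stands in for the condition-action rules; both are hypothetical names, not the text's code.

    # Illustrative model-based reflex agent skeleton.
    def make_model_based_agent(rules, update_state):
        state, last_action = None, None
        def agent(percept):
            nonlocal state, last_action
            # use the model of the world to refresh the internal state
            state = update_state(state, last_action, percept)
            # pick the action whose condition matches the current state
            last_action = rules.get(state, 'NoOp')
            return last_action
        return agent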
Goal-based agents
Knowing about the current state of the environment is not always enough to decide what to do. For example, at a road junction, the taxi can turn left, turn right, or go straight on. The correct decision depends on where the taxi is trying to get to. In other words, as well as a current state description, the agent needs some sort of goal information that describes situations that are desirable, for example, being at the passenger's destination. The agent program can combine this with information about the results of possible actions (the same information as was used to update internal state in the reflex agent) in order to choose actions that achieve the goal. Figure 1.14 shows the goal-based agent's structure.
Figure 1.14 The goal-based agent's structure.
Utility-based agents
Goals alone are not really enough to generate high-quality behavior in most environments. For
example, there are many action sequences that will get the taxi to its destination (thereby achieving
the goal) but some are quicker, safer, more reliable, or cheaper than others. Goals just provide a
crude binary distinction between "happy" and "unhappy" states, whereas a more general
performance measure should allow a comparison of different world states according to exactly
how happy they would make the agent if they could be achieved. Because "happy" does not sound
very scientific, the customary terminology is to say that if one world state is preferred to another,
then it has higher utility for the agent.
Figure 1.15 A model-based, utility-based agent. It uses a model of the world, along with
a utility function that measures its preferences among states of the world. Then it chooses the
action that leads to the best expected utility, where expected utility is computed by averaging
over all possible outcome states, weighted by the probability of the outcome.
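The expected-utility computation described in the caption can be sketched in a few lines of Python (an illustration only): outcomes(state, action) is assumed to return (probability, next_state) pairs, and utility(state) a real number; both names are hypothetical.

    # Illustrative choice of the action with the highest expected utility.
    def best_action(state, actions, outcomes, utility):
        def expected_utility(action):
            # average the utility of each outcome, weighted by its probability
            return sum(p * utility(s) for p, s in outcomes(state, action))
        return max(actions, key=expected_utility)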
• Certain goals can be reached in different ways.
– Some are better, have a higher utility.
• Utility function maps a (sequence of) state(s) onto a real number.
• Improves on goals:
– Selecting between conflicting goals
– Selecting appropriately between several goals based on likelihood of success
– Goal-based agents act in order to achieve their goal(s).
– Utility-based agents maximize their own utility function.
• All agents can improve their performance through learning.
1.3.1 Problem Solving by Search
An important aspect of intelligence is goal-based problem solving.
The solution of many problems can be described by finding a sequence of actions that lead to a
desirable goal. Each action changes the state and the aim is to find the sequence of actions and
states that lead from the initial (start) state to a final (goal) state.
A well-defined problem can be described by:
Initial state
Operator or successor function: for any state x, returns s(x), the set of states reachable from x with any single action
Goal test
Path cost (additive)
Goal formulation, based on the current situation and the agent's performance measure, is the first step in problem solving.
The agent’s task is to find out which sequence of actions will get to a goal state.
Problem formulation is the process of deciding what actions and states to consider given a goal.
Example: route finding in Romania
On holiday in Romania, currently in Arad; the flight leaves tomorrow from Bucharest.
Formulate goal: be in Bucharest
Formulate problem:
states: various cities
actions: drive between cities
Find solution:
sequence of cities, e.g., Arad, Sibiu, Fagaras, Bucharest
Problem formulation
A problem is defined by four items:
initial state, e.g., "at Arad"
successor function S(x) = set of action–state pairs, e.g., S(Arad) = {[Arad -> Zerind; Zerind], ...}
goal test, which can be
  explicit, e.g., x = "at Bucharest"
  implicit, e.g., NoDirt(x)
path cost (additive), e.g., sum of distances, number of actions executed, etc.
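As a rough sketch of how such a formulation might look in Python (not part of the text), the class below bundles the four items; the RouteProblem name is invented here, and the small road map covers only a fragment of the Romania map used in the examples.

    # Illustrative problem formulation for Romania route finding.
    ROADS = {
        'Arad': {'Zerind': 75, 'Sibiu': 140, 'Timisoara': 118},
        'Sibiu': {'Arad': 140, 'Fagaras': 99, 'Rimnicu Vilcea': 80},
        'Fagaras': {'Sibiu': 99, 'Bucharest': 211},
    }

    class RouteProblem:
        def __init__(self, initial, goal, roads):
            self.initial, self.goal, self.roads = initial, goal, roads
        def successors(self, state):                 # S(x): action-state pairs
            return [(f'Go({city})', city) for city in self.roads.get(state, {})]
        def goal_test(self, state):                  # explicit goal test
            return state == self.goal
        def step_cost(self, state, action, result):  # c(x, a, y): road distance in km
            return self.roads[state][result]

    # Example use: problem = RouteProblem('Arad', 'Bucharest', ROADS)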
Search
An agent with several immediate options of unknown value can decide what to do by first examining different possible sequences of actions that lead to states of known value, and then choosing the best sequence. The process of looking for a sequence of actions from the current state that reaches a goal state is called search.
A search algorithm takes a problem as input and returns a solution in the form of an action sequence. Once a solution is found, the execution phase consists of carrying out the recommended actions.
Figure 1.18 shows a simple "formulate, search, execute" design for the agent. Once the solution has been executed, the agent will formulate a new goal.
• Static: The entire process is carried out without paying attention to changes that might be occurring in the environment.
• Observable: The initial state is known and the agent's sensors detect all aspects that are relevant to the choice of action.
• Discrete: With respect to the state of the environment and percepts and actions, so that alternate courses of action can be taken.
• Deterministic: The next state of the environment is completely determined by the current state and the actions executed by the agent. Solutions to the problem are single sequences of actions.
An agent carries out its plan with its eyes closed. This is called an open-loop system, because ignoring the percepts breaks the loop between the agent and the environment.
A path cost function assigns a numeric cost to each path. For the Romania problem, the cost of a path might be its length in kilometers.
The step cost of taking action a to go from state x to state y is denoted by c(x,a,y). The step costs for Romania are shown in Figure 1.18. It is assumed that the step costs are nonnegative.
A solution to the problem is a path from the initial state to a goal state.
An optimal solution has the lowest path cost among all solutions.
The vacuum world
o States: The agent is in one of two locations, each of which might or might not contain dirt. Thus there are 2 × 2² = 8 possible world states.
o Initial state: Any state can be designated as the initial state.
o Successor function: This generates the legal states that result from trying the three actions (Left, Right, Suck). The complete state space is shown in Figure 2.3.
o Goal test: This tests whether all the squares are clean.
o Path cost: Each step costs one, so the path cost is the number of steps in the path.
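A brief Python sketch of this state space (illustrative only; the state encoding as a (location, dirt_in_A, dirt_in_B) triple is an assumption, giving the 2 × 2² = 8 states).

    # Illustrative vacuum-world successor function and goal test.
    def vacuum_successors(state):
        loc, dirt_a, dirt_b = state
        yield 'Left',  ('A', dirt_a, dirt_b)      # move (or stay) at square A
        yield 'Right', ('B', dirt_a, dirt_b)      # move (or stay) at square B
        if loc == 'A':
            yield 'Suck', ('A', False, dirt_b)    # cleaning removes dirt here
        else:
            yield 'Suck', ('B', dirt_a, False)

    def vacuum_goal_test(state):
        return not state[1] and not state[2]      # all squares clean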
The 8-puzzle
An 8-puzzle consists of a 3×3 board with eight numbered tiles and a blank space. A tile adjacent to the blank space can slide into the space. The object is to reach the goal state, as shown in Figure 2.4.
The 8-puzzle has 9!/2 = 181,440 reachable states and is easily solved.
The 15-puzzle (4×4 board) has around 1.3 trillion states, and random instances can be solved optimally in a few milliseconds by the best search algorithms.
The 24-puzzle (on a 5×5 board) has around 10^25 states, and random instances are still quite difficult to solve optimally with current machines and algorithms.
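A possible successor function for the 8-puzzle, again only as an illustrative sketch: the state is assumed to be a tuple of nine entries in row-major order with 0 standing for the blank, and each action names the direction in which the blank moves.

    # Illustrative 8-puzzle formulation.
    GOAL = (0, 1, 2, 3, 4, 5, 6, 7, 8)

    def puzzle_successors(state):
        blank = state.index(0)
        row, col = divmod(blank, 3)
        candidates = [('Up', -3, row > 0), ('Down', 3, row < 2),
                      ('Left', -1, col > 0), ('Right', 1, col < 2)]
        for action, delta, legal in candidates:
            if legal:
                tiles = list(state)
                # slide the adjacent tile into the blank space
                tiles[blank], tiles[blank + delta] = tiles[blank + delta], tiles[blank]
                yield action, tuple(tiles)

    def puzzle_goal_test(state):
        return state == GOAL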
8-queens problem
The goal of the 8-queens problem is to place 8 queens on a chessboard such that no queen attacks any other. (A queen attacks any piece in the same row, column or diagonal.)
Figure 2.5 shows an attempted solution that fails: the queen in the rightmost column is attacked by the queen at the top left.
An incremental formulation involves operators that augment the state description, starting with an empty state; for the 8-queens problem, this means each action adds a queen to the state.
A complete-state formulation starts with all 8 queens on the board and moves them around.
In either case the path cost is of no interest because only the final state counts.
ROUTE-FINDING PROBLEM
The route-finding problem is defined in terms of specified locations and transitions along links between them. Route-finding algorithms are used in a variety of applications, such as routing in computer networks, military operations planning, and airline travel planning systems.
The airline travel problem is specified as follows:
o States: Each state is represented by a location (e.g., an airport) and the current time.
o Initial state: This is specified by the problem.
o Successor function: This returns the states resulting from taking any scheduled flight (further specified by seat class and location), leaving later than the current time plus the within-airport transit time, from the current airport to another.
o Goal test: Are we at the destination by some prespecified time?
o Path cost: This depends upon the monetary cost, waiting time, flight time, customs and immigration procedures, seat quality, time of day, type of airplane, frequent-flyer mileage awards, and so on.
TOURING PROBLEMS
Touring problems are closely related to route-finding problems, but with an important difference. Consider, for example, the problem "Visit every city at least once", as shown on the Romania map. As with route finding, the actions correspond to trips between adjacent cities. The state space, however, is quite different.
The initial state would be "In Bucharest; visited {Bucharest}".
A typical intermediate state would be "In Vaslui; visited {Bucharest, Urziceni, Vaslui}".
The goal test would check whether the agent is in Bucharest and all 20 cities have been visited.
VLSI layout
A VLSI layout problem requires positioning millions of components and connections on a chip to minimize area, minimize circuit delays, minimize stray capacitances, and maximize manufacturing yield. The layout problem is split into two parts: cell layout and channel routing.
ROBOT navigation
Robot navigation is a generalization of the route-finding problem. Rather than a discrete set of routes, a robot can move in a continuous space with an infinite set of possible actions and states. For a circular robot moving on a flat surface, the space is essentially two-dimensional. When the robot has arms and legs or wheels that also must be controlled, the search space becomes multi-dimensional. Advanced techniques are required to make the search space finite.
AUTOMATIC ASSEMBLY SEQUENCING
Examples include the assembly of intricate objects such as electric motors. The aim in assembly problems is to find an order in which to assemble the parts of some object. If the wrong order is chosen, there will be no way to add some part later without undoing work already done.
Another important assembly problem is protein design, in which the goal is to find a sequence of amino acids that will fold into a three-dimensional protein with the right properties to cure some disease.
INTERNET SEARCHING
In recent years there has been increased demand for software robots that perform Internet searching, looking for answers to questions, for related information, or for shopping deals. The searching techniques consider the Internet as a graph of nodes (pages) connected by links.
1.3.3 SEARCHING FOR SOLUTIONS
SEARCH TREE
Having formulated some problems, we now need to solve them. This is done by a search through the state space. A search tree is generated by the initial state and the successor function that together define the state space. In general, we may have a search graph rather than a search tree, when the same state can be reached from multiple paths.
Figure 1.23 shows some of the expansions in the search tree for finding a route from Arad to Bucharest.
Figure 1.23 Partial search trees for finding a route from Arad to Bucharest. Nodes that have been expanded are shaded; nodes that have been generated but not yet expanded are outlined in bold; nodes that have not yet been generated are shown in faint dashed lines.
The root of the search tree is a search node corresponding to the initial state, In(Arad). The first step is to test whether this is a goal state. The current state is expanded by applying the successor function to the current state, thereby generating a new set of states. In this case, we get three new states: In(Sibiu), In(Timisoara), and In(Zerind). Now we must choose which of these three possibilities to consider further. This is the essence of search: following up one option now and putting the others aside for later, in case the first choice does not lead to a solution.
Search strategy: the general tree-search algorithm is described informally in Figure 1.24.
Tree Search

Figure 1.24 An informal description of the general tree-search algorithm.
The choice of which state to expand is determined by the search strategy. There are an infinite number of paths in this state space, so the search tree has an infinite number of nodes.
Figure 1.25 Nodes are the data structures from which the search tree is constructed. Each has a parent, a state, and various bookkeeping fields. Arrows point from child to parent.
Fringe
The fringe is the collection of nodes that have been generated but not yet expanded. Each element of the fringe is a leaf node, that is, a node with no successors in the tree. The fringe of each tree consists of those nodes with bold outlines.
The collection of these nodes is implemented as a queue.
The general tree-search algorithm is shown in Figure 2.9.
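Since Figure 2.9 is not reproduced here, the following Python sketch illustrates the node data structure and the general tree-search loop described above; the problem interface (initial, successors, goal_test, step_cost) mirrors the RouteProblem sketch given earlier and is an assumption, not the book's code.

    # Illustrative node data structure and general tree search.
    class Node:
        def __init__(self, state, parent=None, action=None, path_cost=0):
            self.state, self.parent = state, parent      # arrows point from child to parent
            self.action, self.path_cost = action, path_cost

    def tree_search(problem, fringe, pop):
        # fringe: list of generated-but-unexpanded nodes; pop: the search strategy
        fringe.append(Node(problem.initial))
        while fringe:
            node = pop(fringe)                           # choose which node to expand next
            if problem.goal_test(node.state):
                return node                              # follow parent links to recover the path
            for action, result in problem.successors(node.state):
                cost = node.path_cost + problem.step_cost(node.state, action, result)
                fringe.append(Node(result, node, action, cost))
        return None                                      # failure

Passing pop=lambda f: f.pop(0) gives FIFO (breadth-first) behaviour, while pop=lambda f: f.pop() gives LIFO (depth-first) behaviour.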
1.3.4 UNINFORMED SEARCH STRATEGIES
Uninformed (blind) search strategies use no information beyond that provided in the problem definition. Strategies that know whether one non-goal state is "more promising" than another are called informed search or heuristic search strategies.
There are five uninformed search strategies, as given below.
o Breadth-first search
o Uniform-cost search
o Depth-first search
o Depth-limited search
o Iterative deepening search
1.3.4.1 Breadth-first search
Breadth-first search is a simple strategy in which the root node is expanded first, then all successors of the root node are expanded next, then their successors, and so on. In general, all the nodes at a given depth in the search tree are expanded before any nodes at the next level are expanded.
Breadth-first search is implemented by calling TREE-SEARCH with an empty fringe that is a first-in-first-out (FIFO) queue, assuring that the nodes that are visited first will be expanded first. In other words, calling TREE-SEARCH(problem, FIFO-QUEUE()) results in breadth-first search. The FIFO queue puts all newly generated successors at the end of the queue, which means that shallow nodes are expanded before deeper nodes.
Figure 1.27 Breadth-first search on a simple binary tree. At each stage, the node to be expanded next is indicated by a marker.
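A self-contained Python sketch of breadth-first search with a FIFO queue, mirroring TREE-SEARCH(problem, FIFO-QUEUE()); the successors and goal_test callables are assumed interfaces, not the book's code.

    # Illustrative breadth-first (tree) search.
    from collections import deque

    def breadth_first_search(initial, successors, goal_test):
        fringe = deque([(initial, [])])            # (state, list of actions so far)
        while fringe:
            state, path = fringe.popleft()         # FIFO: shallow nodes expanded first
            if goal_test(state):
                return path
            for action, result in successors(state):
                fringe.append((result, path + [action]))
        return None                                # failure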
Properties of breadth-first search
Breadth-first search is complete (provided the branching factor b is finite) and is optimal when all step costs are equal, since it always expands the shallowest unexpanded node. Its time and space requirements grow exponentially with the depth of the shallowest solution.

1.3.4.2 Uniform-cost search
Instead of expanding the shallowest node, uniform-cost search expands the node with the lowest path cost. Uniform-cost search does not care about the number of steps a path has, but only about their total cost.
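A sketch of uniform-cost search using a priority queue ordered by path cost (illustrative only); here the successors callable is assumed to yield (action, result, step_cost) triples.

    # Illustrative uniform-cost search with a priority queue keyed on path cost.
    import heapq

    def uniform_cost_search(initial, successors, goal_test):
        fringe = [(0, 0, initial, [])]             # (path_cost, tie_breaker, state, actions)
        counter = 1
        while fringe:
            cost, _, state, path = heapq.heappop(fringe)   # cheapest path so far
            if goal_test(state):
                return path, cost
            for action, result, step in successors(state):
                heapq.heappush(fringe, (cost + step, counter, result, path + [action]))
                counter += 1
        return None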
1.3.4.3 Depth-first search
Depth-first search always expands the deepest node in the current fringe of the search tree. The progress of the search is illustrated in Figure 1.31. The search proceeds immediately to the deepest level of the search tree, where the nodes have no successors. As those nodes are expanded, they are dropped from the fringe, so the search "backs up" to the next shallowest node that still has unexplored successors.
Figure 1.31 Depth-first search on a binary tree. Nodes that have been expanded and have no descendants in the fringe can be removed from memory; these are shown in black. Nodes at depth 3 are assumed to have no successors and M is the only goal node.
This strategy can be implemented by TREE-SEARCH with a last-in-first-out (LIFO) queue, also known as a stack.
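A self-contained Python sketch of depth-first search using a list as a LIFO stack, mirroring TREE-SEARCH with a LIFO queue; the successors and goal_test callables are assumed interfaces.

    # Illustrative depth-first (tree) search.
    def depth_first_search(initial, successors, goal_test):
        fringe = [(initial, [])]                   # a Python list used as a stack
        while fringe:
            state, path = fringe.pop()             # LIFO: the deepest node is expanded first
            if goal_test(state):
                return path
            for action, result in successors(state):
                fringe.append((result, path + [action]))
        return None                                # failure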
Depth-first search has very modest memory requirements. It needs to store only a single path from the root to a leaf node, along with the remaining unexpanded sibling nodes for each node on the path. Once a node has been expanded, it can be removed from memory as soon as its descendants have been fully explored (refer Figure 2.12).
For a state space with branching factor b and maximum depth m, depth-first search requires storage of only bm + 1 nodes.
Using the same assumptions as Figure 2.11, and assuming that nodes at the same depth as the goal node have no successors, we find that depth-first search would require 118 kilobytes instead of 10 petabytes, a factor of 10 billion times less space.
Drawback of depth-first search
The drawback of depth-first search is that it can make a wrong choice and get stuck going down a very long (or even infinite) path when a different choice would lead to a solution near the root of the search tree. For example, depth-first search will explore the entire left subtree even if node C is a goal node.
BACKTRACKING SEARCH
A variant of depth-first search called backtracking search uses less memory: only one successor is generated at a time rather than all successors, so only O(m) memory is needed rather than O(bm).
1.3.4.4 Depth-limited search
The problem of unbounded trees can be alleviated by supplying depth-first search with a predetermined depth limit l. That is, nodes at depth l are treated as if they have no successors. This approach is called depth-limited search. The depth limit solves the infinite-path problem.
Depth-limited search will be incomplete if we choose l < d and nonoptimal if we choose l > d. Its time complexity is O(b^l) and its space complexity is O(bl). Depth-first search can be viewed as a special case of depth-limited search with l = ∞.
Sometimes, depth limits can be based on knowledge of the problem. For example, on the map of Romania there are 20 cities. Therefore, we know that if there is a solution, it must be of length 19 at the longest, so l = 19 is a possible choice. However, it can be shown that any city can be reached from any other city in at most 9 steps. This number, known as the diameter of the state space, gives us a better depth limit, which leads to a more efficient depth-limited search.
1.3.4.5 Iterative deepening search
Iterative deepening search sidesteps the issue of choosing the best depth limit by trying all possible depth limits in turn: first depth 0, then depth 1, then depth 2, and so on, applying depth-limited search at each limit.
Figure 2.15 shows the four iterations of ITERATIVE-DEEPENING-SEARCH on a binary search tree, where the solution is found on the fourth iteration.
Figure 1.33 The iterative deepening search algorithm, which repeatedly applies depth-limited search with increasing limits. It terminates when a solution is found or if the depth-limited search returns failure, meaning that no solution exists.
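The following Python sketch (an illustration, not the book's code) implements depth-limited search and iterative deepening in the style just described; the 'cutoff' marker signals that the depth limit was reached, and the successors/goal_test callables are assumed interfaces.

    # Illustrative depth-limited and iterative deepening search.
    def depth_limited_search(state, successors, goal_test, limit):
        if goal_test(state):
            return []                              # goal reached: no further actions needed
        if limit == 0:
            return 'cutoff'                        # depth limit reached
        cutoff_occurred = False
        for action, result in successors(state):
            outcome = depth_limited_search(result, successors, goal_test, limit - 1)
            if outcome == 'cutoff':
                cutoff_occurred = True
            elif outcome is not None:
                return [action] + outcome          # prepend this action to the solution
        return 'cutoff' if cutoff_occurred else None

    def iterative_deepening_search(initial, successors, goal_test, max_depth=50):
        for limit in range(max_depth + 1):         # limits 0, 1, 2, ...
            outcome = depth_limited_search(initial, successors, goal_test, limit)
            if outcome != 'cutoff':
                return outcome                     # a list of actions, or None if no solution
        return None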
Figure 1.35 Iterative deepening search on a small search tree rooted at S, shown for depth limits 0, 1, and 2.
Figure 1.36
In general, iterative deepening is the preferred uninformed search method when there is a large search space and the depth of the solution is not known.
1.3.4.6 Bidirectional Search
The idea behind bidirectional search is to run two simultaneous searches:
one forward from the initial state, and
the other backward from the goal,
stopping when the two searches meet in the middle (Figure 1.37).
The motivation is that b^(d/2) + b^(d/2) is much less than b^d; or, in the figure, the area of the two small circles is less than the area of one big circle centered on the start and reaching to the goal.
Figure 1.38 Evaluation of search strategies. b is the branching factor; d is the depth of the shallowest solution; m is the maximum depth of the search tree; l is the depth limit. Superscript caveats are as follows: (a) complete if b is finite; (b) complete if step costs >= ε for positive ε; (c) optimal if step costs are all identical; (d) if both directions use breadth-first search.
Figure 1.39
Figure 1.40
Figure 1.41 The general graph-search algorithm. The set closed can be implemented with a hash table to allow efficient checking for repeated states.
• Do not return to the previous state.
• Do not create paths with cycles.
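A minimal Python sketch of graph search (illustrative only): it is tree search plus a closed set, here a Python set standing in for the hash table mentioned above, which records every expanded state so repeated states are not expanded again; a FIFO fringe is used for concreteness.

    # Illustrative graph search with a closed set of expanded states.
    from collections import deque

    def graph_search(initial, successors, goal_test):
        fringe = deque([(initial, [])])
        closed = set()                             # states already expanded
        while fringe:
            state, path = fringe.popleft()
            if goal_test(state):
                return path
            if state in closed:
                continue                           # skip repeated states
            closed.add(state)
            for action, result in successors(state):
                fringe.append((result, path + [action]))
        return None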
Searching with partial information: the sensorless vacuum world
o No sensor
o Initial (belief) state: {1,2,3,4,5,6,7,8}
o After action [Right] the belief state is {2,4,6,8}
o After action [Suck] the belief state is {4,8}
o After action [Left] the belief state is {3,7}
o After action [Suck] the belief state is {7}
o Answer: [Right, Suck, Left, Suck] coerces the world into state 7 without any sensor
o Belief state: the set of states the agent believes it might be in
Partial knowledge of states and actions:
– Sensorless or conformant problem: the agent may have no idea where it is; the solution (if any) is a sequence.
– Contingency problem: percepts provide new information about the current state; the solution is a tree or policy; search and execution are often interleaved. If the uncertainty is caused by the actions of another agent, it is an adversarial problem.
– Exploration problem: the states and actions of the environment are unknown.
Time and space complexity are always considered with respect to some measure of the problem difficulty. In theoretical computer science, the typical measure is the size of the state space. In AI, where the graph is represented implicitly by the initial state and the successor function, the complexity is expressed in terms of three quantities:
b, the branching factor or maximum number of successors of any node;
d, the depth of the shallowest goal node; and
m, the maximum length of any path in the state space.
Search cost typically depends upon the time complexity, but can also include a term for memory usage.
Total cost combines the search cost and the path cost of the solution found.