Markov Chains
Additional Reading 1 and Homework problems
1 Markov Chains
The first model we discussed, the DFA, is deterministic—its transition function must be
total, and it allows a transition from each state to exactly one next state at each processing
step. Our second model, the NFA, relaxed this deterministic constraint in a “magical” sort
of way, and provided us with some interesting new insights into regular languages, and the
potential for additional efficiency of computation (though this is something we have ignored
for now). Even though NFAs are “magical” in the sense that we can’t actually implement
them, we know we can simulate them with a DFA that has (in the worst case) an exponential
number of states. What if we relaxed our determinism in another way? What if, instead of
having to track an exponential number of states, we allowed the transition function to be
probabilistic—at each computational step, the machine still transitions to a single next state,
but that state is chosen with some probability from among many possible states? In essence, we
avoid the exponential state explosion at the cost of introducing uncertainty into our model—
we can no longer be certain about what path the computation follows or what state the
machine is in at any point in the computation. Such a model exists, and it is called a Markov
chain.
Figure 1 shows a Markov chain that models the flipping of a fair coin. Probabilities on
the edges show that from the start state, there is a 50% chance of transitioning to the heads
state and a 50% chance of transitioning to the tails state. Transitioning from the heads state
to the tails state is as likely as remaining in the heads state, and transitioning from the tails
state to the heads state is as likely as remaining in the tails state.
Using this model, we can simulate a series of (fair) coin flips and calculate their probabilities.
We will use the notation P(x|y) to represent the conditional probability of event x
happening, given that event y has happened. So, given the model of Figure 1, we can write

• P(heads|start) = 0.5
• P(tails|start) = 0.5
• P(heads|heads) = 0.5
• P(tails|heads) = 0.5
• P(heads|tails) = 0.5
• P(tails|tails) = 0.5
What is the probability P(H) of flipping the coin a single time and having it land heads
up? It is P(heads|start) = 0.5, which is represented by beginning in the start state and
transitioning to the heads state in our diagram [1]. What about the probability of flipping
three heads in a row, P(HHH)? The probability of both event x and event y happening is
computed by multiplying their individual probabilities together, P(xy) = P(x)P(y) [2]. Thus
the probability of flipping three heads in a row is

P(HHH) = P(heads|start) × P(heads|heads) × P(heads|heads) = 0.5 × 0.5 × 0.5 = 0.125

[1] Because the first event in a sequence cannot be conditioned on any prior event, this initial
probability P(heads|start) is sometimes written as the unconditional probability P(heads);
here, we write all probabilities as conditional and use a special start state that does not have
an associated event and whose unconditional probability is P(start) = 1.

[2] Note that this is actually only true when x and y are independent events. In general,
P(xy) = P(x)P(y|x), but for our discussion, we will only be concerned with the independent
case, as, in particular, it is the defining characteristic of Markov chains.
Figure 2 shows a computation tree for all possible sequences of three or fewer coin flips.
Note that each unique path in the tree corresponds to a unique sequence of events (coin
flip results), and the probability of a particular sequence is computed by multiplying the
probabilities on the edges of the path.
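To make this concrete, here is a minimal Python sketch of the edge-multiplication idea. It is
not part of the formal model; the dictionary representation and the names delta and
sequence_probability are just choices of this example.

    # Transition probabilities of the fair-coin Markov chain of Figure 1.
    delta = {
        "start": {"heads": 0.5, "tails": 0.5},
        "heads": {"heads": 0.5, "tails": 0.5},
        "tails": {"heads": 0.5, "tails": 0.5},
    }

    def sequence_probability(path):
        """Multiply the edge probabilities along a path beginning at start."""
        prob, current = 1.0, "start"
        for state in path:
            prob *= delta[current][state]
            current = state
        return prob

    print(sequence_probability(["heads", "heads", "heads"]))  # 0.125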
What about the probability of flipping the coin three times and having exactly two of
them be heads? This is a bit trickier, as there are several possible ways this can happen
(HHT, HTH, THH). The probability of either event x or event y happening is computed
by adding their individual probabilities together, P(x or y) = P(x) + P(y). So, to compute
the probability of seeing exactly two heads in three coin flips, we want to compute the
probability of each path in the tree that has exactly two heads and one tail and then add
them together. In this case, because the coin is fair, each path will have the same probability,
and the total probability of seeing exactly two heads is

P(HHT) + P(HTH) + P(THH) = 0.125 + 0.125 + 0.125 = 0.375

Figure 2: A partial tree showing computation paths for the Markov chain C of Figure 1.
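We can check this count-the-paths argument with a short sketch that enumerates all eight
paths of three flips and keeps only those with exactly two heads (again, the encoding is
illustrative, not part of the model):

    from itertools import product

    delta = {
        "start": {"heads": 0.5, "tails": 0.5},
        "heads": {"heads": 0.5, "tails": 0.5},
        "tails": {"heads": 0.5, "tails": 0.5},
    }

    def path_probability(path):
        # Multiply the edge probabilities along one path of the tree.
        prob, current = 1.0, "start"
        for state in path:
            prob *= delta[current][state]
            current = state
        return prob

    # Keep only the paths with exactly two heads (HHT, HTH, THH) and
    # add their probabilities together.
    total = sum(
        path_probability(path)
        for path in product(["heads", "tails"], repeat=3)
        if path.count("heads") == 2
    )
    print(total)  # 0.375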
1.2 Formal definition

Formally, a Markov chain is a 5-tuple (Q, Σ, δ, γ, q0), where

1. Q is a finite set of states

2. Σ is a finite alphabet

3. δ : (Q ∪ {q0}) × Q → [0, 1] is the transition function

4. γ : Q → Σ is the emission function

5. q0 ∉ Q is the start state
Note the similarities to our definitions for DFAs and NFAs, but also note some important
differences. Like previous models, Markov chains have a set of states, a set of alphabet
symbols, a transition function and a start state. However, unlike earlier models, the start
state is a special kind of state that is not included in the set of states Q. Also, the transition
function is (as you might expect) again different from what we’ve seen in the past. This
time, it maps a pair of states to a real number between 0 and 1, which we will interpret as
a probability. That is, the transition function δ is now interpreted as the probability that
the Markov chain will transition from one state to another. Note that this function takes as
input only two states: the state from which the transition occurs and the state to which it
goes. Another way to think about this is to realize that the probability of transition does not
depend on any states occupied prior to the current state—history beyond the current state
does not matter in determining the transition probability. This property is called Markovian,
after Andrey Markov, a Russian mathematician interested in stochastic processes, and it is
this property that gives this model its name. In addition to the now familiar transition
function δ, we now have an entirely new function γ, called the emission function. Because
we now associate a probability with transitions, we will associate alphabet symbols with
states instead. The emission function tells us how symbols are associated with states—it
maps a symbol from Σ to each state in Q. Each time the Markov chain visits a state, it
consumes (or emits) its associated alphabet symbol. Finally, note that Markov chains do
not have a set of accept states; every state in the model is a valid ending state.
Revisiting the Markov chain C of Figure 1, the formal definition of C is (Q, Σ, δ, γ, q0 ),
where
1. Q = {heads, tails}
2. Σ = {H, T }
3. δ is given as (rows give the current state, columns the next state)
heads tails
start 0.5 0.5
heads 0.5 0.5
tails 0.5 0.5
4. γ is given as
q γ(q)
heads H
tails T
5. q0 = start
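To tie the formal definition to something executable, here is one possible Python encoding
of C. This is a sketch; the dictionary representation and the helper emit are choices of this
example, not part of the definition.

    import random

    # The five components of C = (Q, Sigma, delta, gamma, q0).
    Q = {"heads", "tails"}
    Sigma = {"H", "T"}
    delta = {
        ("start", "heads"): 0.5, ("start", "tails"): 0.5,
        ("heads", "heads"): 0.5, ("heads", "tails"): 0.5,
        ("tails", "heads"): 0.5, ("tails", "tails"): 0.5,
    }
    gamma = {"heads": "H", "tails": "T"}
    q0 = "start"

    def emit(n):
        """Visit n states chosen according to delta, emitting each state's symbol."""
        states = sorted(Q)
        current, symbols = q0, []
        for _ in range(n):
            weights = [delta[(current, s)] for s in states]
            current = random.choices(states, weights=weights)[0]
            symbols.append(gamma[current])
        return "".join(symbols)

    print(emit(10))  # e.g. "HTTHHTHTTH" -- one simulated run of ten flips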
1.2.1 Discriminative models
When used discriminatively, Markov chains have a formal definition of computation that
is similar to those we’ve seen for DFAs and NFAs. Let C = (Q, Σ, δ, γ, q0 ) be a Markov
chain and w be a string over the alphabet Σ. Then we say that C accepts w if we can
write w = y1 y2 · · · ym, where each yi ∈ Σ, and one or more sequences of states
r_0^1, r_1^1, . . . , r_m^1, r_0^2, r_1^2, . . . , r_m^2, . . . , r_0^n, r_1^n, . . . , r_m^n
exist in Q ∪ {q0} with three conditions:

1. r_0^i = q0 for each i = 1, . . . , n

2. Σ_{i=1}^{n} Π_{j=0}^{m−1} δ(r_j^i, r_{j+1}^i) ≥ θ ≥ 0, where θ is an acceptance threshold

3. γ(r_j^i) = y_j for each i = 1, . . . , n and each j = 1, . . . , m
The total probability is computed by summing over all generating sequences, and, for each
sequence, computing the probability of the entire sequence as a product of conditional prob-
abilities, each transition to a new state conditioned on the previous state.
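As a sanity check on this definition, here is a brute-force sketch that sums this probability
over all state sequences that emit w, using the dictionary encoding from the previous sketch.
The helper names are illustrative, and the enumeration is exponential in the length of w, so
this is only for small examples.

    from itertools import product

    def accept_probability(w, Q, delta, gamma, q0):
        """Total probability that the chain emits w, summed over all sequences."""
        total = 0.0
        for seq in product(sorted(Q), repeat=len(w)):
            # Condition 3: the sequence must emit exactly the string w.
            if any(gamma[r] != y for r, y in zip(seq, w)):
                continue
            # Condition 1 (start in q0) and the inner product of condition 2:
            # multiply the transition probabilities along the sequence.
            prob, current = 1.0, q0
            for r in seq:
                prob *= delta[(current, r)]
                current = r
            total += prob
        return total

    # With the fair-coin chain, only one sequence emits "HHH", so
    # accept_probability("HHH", Q, delta, gamma, q0) == 0.125, and C
    # accepts "HHH" for any threshold theta <= 0.125.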
1.3 Exercises
Exercise 1.1. Consider the transition diagram for C in Figure 1.
a. Draw a modified transition diagram for C to represent a coin that is biased to land
on heads 75% of the time.
b. Compute the probability of flipping the coin three times and seeing the sequence tails,
heads, tails.
c. Compute the probability of flipping the coin three times and seeing an odd number of
tails.
Exercise 1.2. Given the following formal description of a Markov model, C = (Q, Σ, δ, γ, q0 )
1. Q = {0, 1}
2. Σ = {0, 1}
3. δ is given as
0 1
start 0.25 0.75
0 0.25 0.75
1 1.0 0.0
4. γ is given as
q γ(q)
0 0
1 1
5. q0 = start