A Guide To Markov Chain and Its Applications in Machine Learning
BY YUGESH VERMA
Markov chains are simple yet very useful tools for modelling time-dependent,
space-dependent stochastic processes. Many domains, such as finance (stock
price movement), sales (sales quantity information), NLP (finite-state
transducers, Hidden Markov Models for POS tagging), and weather forecasting,
use Markov chains to make predictions easily and accurately. In this article, we
will discuss Markov chains in detail and try to understand how they work, along
with their advantages and applications. The major points to be covered in this
article are listed below.
Table of Contents
1. Definitions
1.1. State-space
1.2. Trajectory
1.3. Transition probability
2. Prediction Using Markov Chain
2.1. Initial State and One-Step Prediction
2.2. Long Run Prediction
3. Advantages of Markov Chain
4. Applications of the Markov Chain
5. Final Words
1. Definitions
The Markov chain represents a class of stochastic processes in which the future does
not depend on the past; it depends only on the present. A stochastic process can be
considered a Markov chain if it has the Markov property: to predict the future of the
process, we only require information about its present state, and the process is
independent of its past.
Consider a situation where Xn is the state recorded at time stamp n. The future
state at time n+1 depends on the state at time n. Take the example of corona
cases, where the number of cases at time n is Xn and the number of cases at time
n+1 is Xn+1. Following the Markov chain definition, the number of cases
at time n+1 will depend on the number of cases at time n (Xn+1 will depend on Xn),
not on the past values {Xn−1, Xn−2, . . . , X0}.
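Formally, the Markov property described above can be written as

P(Xn+1 = s | Xn, Xn−1, . . . , X0) = P(Xn+1 = s | Xn)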
To understand the Markov chain, we first need to understand some of the terms the
concept relies on. These terms are explained below.
1.1. State-space
If the state space of the Markov chain is denoted by S, where S = {1, 2, 3, ….., n},
then the state of the process is given by the value of Xn. For example, if Xn =
8 then the state of the process is 8. Hence we can say that at any time n, the state
the process is in is given by the value of Xn.
For example, in a class of students, the students with a record of previous failures are
more likely to fail the final exam, while the students who merely had lower marks in
the previous exam have a smaller probability of failing. So in this situation, we can
say that the chances of failing the exam are higher for students with old fail records
and lower for students with low marks. In this scenario, we have two states: lower
chances and higher chances, and S = {1, 2}.
1.2. Trajectory
The trajectory of the Markov chain is the sequence of states the stochastic process
has visited from the start.
In other words, if we denote the trajectory values as s0, s1, s2, ……., sn, then the
states take the values X0 = s0, X1 = s1, ……., Xn = sn. For example, the trajectory
1, 1, 2, 2 means X0 = 1, X1 = 1, X2 = 2, X3 = 2.
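As a minimal sketch, the following Python snippet samples a short trajectory from a
hypothetical two-state chain. The transition probabilities it uses (the matrix T) are
the topic of the next subsection, and all values here are illustrative assumptions.

    import numpy as np

    rng = np.random.default_rng(0)

    # Hypothetical transition matrix for a two-state chain; row i holds the
    # probabilities of moving from state i to each state (explained in 1.3).
    T = np.array([[0.70, 0.30],
                  [0.15, 0.85]])

    state = 0                 # X0 = state 1 (0-based indices internally)
    trajectory = [state]
    for _ in range(10):
        state = rng.choice(2, p=T[state])  # draw the next state from row `state` of T
        trajectory.append(state)

    print([s + 1 for s in trajectory])     # the trajectory s0, s1, ..., s10 as labels 1/2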
1.3. Transition probability
A Markov chain cannot be in different states at one particular time, but it can change
its state over time. This change of state is called a transition. In the above example,
the Markov chain can be in either the lower-chances or the higher-chances state.
The transition of states can be shown with a transition diagram. In state 1, the
chain is in the higher-chances state: the exam currently going on is at a point where
the chances of failure are higher. The probability that the next exam also ends up in
the higher-chances state is 0.7, and the probability of transitioning to the
lower-chances state is 0.3.
Suppose instead the system is in the lower-chances state and a similar transition
diagram is drawn; here the transition probabilities are 0.85 and 0.15. Using both
diagrams, we can draw the complete process.
The combined state-transition diagram represents the complete process across
State 1 and State 2. Within a single time instance, the process cannot move
backwards, but it can return to a previous state at the next time instance.
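Putting the numbers from the two diagrams together, we can write the complete
process as a transition matrix. Below is a minimal sketch in Python; it assumes the
chain stays in the lower-chances state with probability 0.85, since the text above
gives 0.85 and 0.15 without pinning down which is which.

    import numpy as np

    # Transition matrix for the two-state example: state 1 = higher chances
    # of failure, state 2 = lower chances. Row i gives the probabilities of
    # moving from state i to states 1 and 2.
    T = np.array([
        [0.70, 0.30],   # from "higher chances": stay with 0.7, move to "lower" with 0.3
        [0.15, 0.85],   # from "lower chances": assumed to stay with 0.85, move with 0.15
    ])

    # Every row of a valid transition matrix must sum to 1.
    assert np.allclose(T.sum(axis=1), 1.0)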
2. Prediction Using Markov Chain
The Markov chain is a very powerful tool for making predictions about future values.
Since it gives various useful insights, it is necessary to understand the transition
probabilities, the transition matrix, the state space, and the trajectory in order to
interpret those insights.
2.1. Initial State and One-Step Prediction
One more basic thing we need prior knowledge of is the initial state of the process.
To explain the prediction process, let us return to the student failure chances
example above. Starting from an initial state probability vector, we can predict the
probability of each state after the next exam by simply multiplying the initial state
vector by the transition matrix.
This intuition gives a general formula for predicting the first state after the initial
state. Let V0 be the initial state probability vector and T the transition matrix;
then the one-time-step prediction is
V1 = V0 . T
One notable and mathematically simple point is that the dot product of a vector with a
matrix is again a vector. So in the process of predicting one time step ahead, we
again end up with a vector, which can in turn be treated as a new initial state. Put
more formally, each predicted step in the future determines only the step that
follows it.
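As a concrete sketch, here is the one-step prediction V1 = V0 . T in Python. The
initial state vector V0 = [1, 0] (the process starts in the higher-chances state with
certainty) is an assumption made for illustration.

    import numpy as np

    T = np.array([[0.70, 0.30],    # transition matrix from the example above
                  [0.15, 0.85]])
    V0 = np.array([1.0, 0.0])      # assumed initial state: surely in state 1

    V1 = V0 @ T                    # one-step prediction: V1 = V0 . T
    print(V1)                      # [0.7 0.3] -- again a probability vector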
So if we want to predict the second step the formula for prediction will be
V2 = V1 . T
We already know the value of V1 from the one-step prediction; substituting it gives
V2 = (V0 . T) . T
V2 = V0 . T^2
V3 = V2 . T = (V0 . T^2) . T
V3 = V0 . T^3
Therefore, the prediction for the nth time step can be calculated with the following
formula:
Vn = Vn-1 . T = V0 . T^n
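The n-step formula maps directly onto a matrix power. A minimal sketch, reusing the
assumed T and V0 from above:

    import numpy as np
    from numpy.linalg import matrix_power

    T = np.array([[0.70, 0.30],
                  [0.15, 0.85]])
    V0 = np.array([1.0, 0.0])      # assumed initial state

    # Vn = V0 . T^n for a few values of n
    for n in (1, 2, 3, 10):
        print(n, V0 @ matrix_power(T, n))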
2.2. Long Run Prediction
This iterative process is how we predict the future state probabilities of
long-running processes. The long-run probability vector can be written as
V∞ = V0 . T^∞
This formula for the long-run probabilities tells us that once the chain has reached
its long-run probability vector, no amount of further multiplication by the transition
matrix changes it; in other words, V∞ . T = V∞.
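We can see this numerically: raising the assumed T to a large power gives a vector
that one more multiplication by T leaves unchanged. A minimal sketch:

    import numpy as np
    from numpy.linalg import matrix_power

    T = np.array([[0.70, 0.30],
                  [0.15, 0.85]])
    V0 = np.array([1.0, 0.0])             # assumed initial state

    V_inf = V0 @ matrix_power(T, 1000)    # approximation of the long-run vector
    print(V_inf)                          # ~[0.3333 0.6667]
    print(np.allclose(V_inf @ T, V_inf))  # True: multiplying by T changes nothing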
3. Advantages of Markov Chain
• As we have seen above, a Markov chain is very easy to derive from sequential data.
• We do not need to dive deep into the mechanism of the dynamic change.
• A Markov chain is very insightful: it can tell us where a process is lacking, and we
can then make changes to improve it.
• Its computational requirements are very low to modest, so systems of any size can
easily be calculated.
4. Applications of the Markov Chain
• Markov chains can be used for forecasting of many kinds, such as weather,
temperature, or sales forecasting.
• They can be used for predicting customer behaviour.
• Since they work well with sequential data, they can be merged into many NLP
problem solutions, such as POS tagging.
• Brand loyalty and consumer behaviour can be analyzed.
• In the gaming field, various models of games of chance can be developed.
5. Final Words
Here in this article, we have seen an explanation of the Markov chain concept. As we
have seen, it is not difficult to understand or to compute, so it scales easily to
systems of any size. There can be many more uses of the Markov chain; because of its
simplicity and accuracy, it has become a very popular subject of research. I
encourage readers to apply this concept in their real-life projects to make those
projects more easily calculable and interpretable.