5.the Knuth Morris Pratt Algorithm

The document describes the Knuth-Morris-Pratt (KMP) string matching algorithm. It discusses how the KMP algorithm uses the prefix function to compute the overlap between the pattern and text to efficiently determine matches. It provides examples to demonstrate how the prefix function and matching process works on a sample pattern and text. It also includes two lemmas about properties of the prefix function used to prove correctness of the KMP algorithm.

Uploaded by

Shubham Taneja

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

113 views

5.the Knuth Morris Pratt Algorithm

Uploaded by

Shubham Taneja

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 16

The Knuth-Morris-Pratt

algorithm
KMP Matcher Algorithm
Prefix Function Algorithm
Alternative Prefix function algorithm
Input: pattern P of length m
Overlap[1] = 0
For k:=1 to m-1 // Consider P[1..k+1]
c:=P[k+1] // current character of P
v:=Overlap[k]
while P[v+1] ≠ c and v ≠ 0 // until overlap can be extended
v:=Overlap[v] // find next largest precomputed overlap
if P[v+1] = c then
Overlap[k+1]:=v+1 // extend the current overlap
else
Overlap[k+1]:=0 // no overlap exists return overlap
Matching Algorithm
i=1,j=1,k=1
While (n-k) ≥ m do
while j ≤ m and T[i] = P[j] do
i++, j++
if j > m then output k
if Overlap (j-1) > 0 then
k=i-Overlap (j-1)
else
if i==k then i++
k=i;
if j>1 then j=Overlap(j-1) + 1
Computation of Prefix and Matching
j 1 2 3 4 5 6 7 8
P A T T A T A C A
Overlap (j) 0 0 0 1 2 1 0 1

i 1 2 3 4 5 6 7 8 9 1 1 1 1 1 1 1 1 1 1 2 2 2 2 2 2 2
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6

P A T C G C A C A T T A T A C A T T A T T A T A C A T
j
Example
• i=1, j=1, k=1 – match
• i=2, j=2 – match
• i=3, j=3 – no match
• Since Overlap(j-1) = 0, j = overlap(j-1) + 1 => j = 1, k = i=3
• i=3, j=1 – no match
• Since i = k, i++ => i=4, j =1 – no match
• Since i = k, i++ => i=5, j =1 – no match
• Since i = k, i++ => i=6, j =1 – match
• i=7, j=2 – no match
• Since Overlap(j-1) = 0, j = overlap(j-1) + 1 => j = 1, k = i=7
• i=7, j=1 – no match
Example
• i=8, j=1 – match
• i=9, j=2 - match
• i=10, j=3 - match
• i=11, j=4 - match
• i=12, j=5 - match
• i=13, j=6 – match
• i=14, j=7 - match
• i=15, j=8 - match
• i=16, j=9 – j > m => output k = 8 (position from where the pattern is
found)
• Overlap(8) = 1 > 0 => k = i – overlap(j-1) = 16-1 = 15
• Start matching at j = overlap(j-1) + 1 = 1+1 = 2
Example
• i=16, j=2 – match
• i=17, j=3 – match
• i=18, j=4 – match
• i=19, j=5 – match
• i=20, j=6 – no match
• Overlap(j-1) = 2 > 0 => k=i-overlap(j-1) = 20-2=18; j = overlap(j-1)+1 =
2+1 = 3
• i=20, j = 3 – match
• i=21, j = 4 – match
• i=22, j = 5 – match
• i=23, j = 6 – match
• i=24, j = 7 – match
• i=25, j = 8 – match
• i=26, j = 9 – match => j > m => output k – k =18.
Example 2
Running time
• For prefix computation – Θ(m)
• For matching - Θ(n)
Lemma 32.5 (Prefix function iteration lemma)
• *[q] is the list of all possible values obtained by
repeatedly applying the prefix function  to q.
• Lemma: Let P be a pattern of length m with prefix
function π. Then, for q = 1, 2, …, m, we have *[q]
= {k : k < q and Pk ] Pq}.
• Proof: We first prove that i ϵ π*[q] implies Pi ] Pq.
• If i ϵ π*[q], then i = π(u)[q] for some u > 0. we prove
the above equation by induction on u.
• For u = 1, we have i = π[q], and the claim follows
since i < q and Pπ[q] ] Pq.
Lemma 32.5
• Using the relations π[i] < i and Pπ[i] ] Pi and the
transitivity of < and ] establishes the claim for
all i in π*[q].
• Therefore, π*[q]  {k : k < q and Pk ] Pq}.
• We prove that {k : k < q and Pk ] Pq}  π*[q] by
contradiction.
• Suppose to the contrary that there is an
integer in the set {k : k < q and Pk ] Pq} - π*[q],
and let j be the largest such value.
Lemma 32.5
• Because π[q] is the largest value in {k : k < q
and Pk ] Pq} and π[q] ϵ π*[q], we must have j <
π[q], and so we let j’ denote the smallest
integer in π*[q] that is greater than j.
• We can choose j’ = π[q] if there is no other
number in π*[q] that is greater than j.
• We have Pj ] Pq because j ϵ {k : k < q and Pk ]
Pq}, and we have Pj’ ] Pq because j’ ϵ π*[q].
Lemma 32.5
• Thus, Pj ] Pj’ by lemma 32.1 and j is the largest
value less than j’ with this property.
• Therefore, we must have π[j’] = j and, since j’ ϵ
π*[q], we must have j ϵ π*[q] as well.
• This contradiction proves the lemma.
Lemma 32.6
• Let P be a pattern of length m, and let π be the
prefix function for P. for q = 1, 2, …, m, if π[q] >
0, then π[q] – 1 ϵ π*[q – 1].
• Proof: if r = π[q] > 0, then r < q and Pr ] Pq; thus
r – 1 < q – 1 and Pr-1 ] Pq-1 (by dropping the last
character from Pr and Pq).
• By lemma 32.5, therefore, π[q] – 1 = r – 1 ϵ
π*[q-1].

BUSC2112 Basic Calculus WEEK 1 10 Wewoo
75% (4)
BUSC2112 Basic Calculus WEEK 1 10 Wewoo
120 pages
hw10 Solution PDF
No ratings yet
hw10 Solution PDF
5 pages
Permutation and Combinations
From Everand
Permutation and Combinations
Ramesh Chandra
4/5 (36)
String Matching Problem
No ratings yet
String Matching Problem
16 pages
KMP Algorithm
No ratings yet
KMP Algorithm
21 pages
W9 Presentation
No ratings yet
W9 Presentation
20 pages
w 9 Presentation
No ratings yet
w 9 Presentation
20 pages
Today's Lecture: String Matching Algorithm Naïve / Brute Force RK
No ratings yet
Today's Lecture: String Matching Algorithm Naïve / Brute Force RK
20 pages
String Matching
No ratings yet
String Matching
27 pages
KMP Algo
No ratings yet
KMP Algo
16 pages
Knuth Moris 2797348
No ratings yet
Knuth Moris 2797348
21 pages
A357460420 - 22393 - 2 - 2018 - String Matching
No ratings yet
A357460420 - 22393 - 2 - 2018 - String Matching
27 pages
Week4 PPT SM
No ratings yet
Week4 PPT SM
35 pages
18 String Matching - KMP Algorithm
No ratings yet
18 String Matching - KMP Algorithm
30 pages
How A Search Engine Works
No ratings yet
How A Search Engine Works
28 pages
Unit 3
No ratings yet
Unit 3
34 pages
AAD Lec11
No ratings yet
AAD Lec11
5 pages
String Matching - RYS - Lect - 1 - 2 - 3 - Update
No ratings yet
String Matching - RYS - Lect - 1 - 2 - 3 - Update
61 pages
Algorithms in Bioinformatics
No ratings yet
Algorithms in Bioinformatics
7 pages
KMP 2
No ratings yet
KMP 2
7 pages
Advanced String Lecture
No ratings yet
Advanced String Lecture
50 pages
KMP Algorithm
No ratings yet
KMP Algorithm
20 pages
32.4 The Knuth-Morris-Pratt Algorithm: Either
No ratings yet
32.4 The Knuth-Morris-Pratt Algorithm: Either
10 pages
Lecture 56string Matching
No ratings yet
Lecture 56string Matching
43 pages
12 StringMatching
No ratings yet
12 StringMatching
23 pages
The Knuth Morris Pratt Algorithm
No ratings yet
The Knuth Morris Pratt Algorithm
7 pages
BNP Unit-5 Lecture 20 KMP 5.2
No ratings yet
BNP Unit-5 Lecture 20 KMP 5.2
14 pages
AOA Module 6 - String of Algorithms - Aeraxia - in
No ratings yet
AOA Module 6 - String of Algorithms - Aeraxia - in
26 pages
Theory of Automata & Formal Languages
No ratings yet
Theory of Automata & Formal Languages
60 pages
Lecture Notes On Pattern Matching Algorithms
No ratings yet
Lecture Notes On Pattern Matching Algorithms
16 pages
Lecture Notes On Pattern Matching Algorithms
No ratings yet
Lecture Notes On Pattern Matching Algorithms
16 pages
Mathematical Model For String Pattern Matching Algorithm (Boyer-Moore's Algorithm)
No ratings yet
Mathematical Model For String Pattern Matching Algorithm (Boyer-Moore's Algorithm)
5 pages
String Matching
No ratings yet
String Matching
35 pages
Short Notes on Knuth
No ratings yet
Short Notes on Knuth
2 pages
DAA Assignment - 3.pdf (Nomi)
No ratings yet
DAA Assignment - 3.pdf (Nomi)
4 pages
CLRS Solution Chapter 32
No ratings yet
CLRS Solution Chapter 32
10 pages
4string Matching Kmprabin Karp and Naive
No ratings yet
4string Matching Kmprabin Karp and Naive
57 pages
CS 240 Tutorial 11 Notes: C A A B A
No ratings yet
CS 240 Tutorial 11 Notes: C A A B A
2 pages
Application of A Modified Convolution Method To Exact String Matching
No ratings yet
Application of A Modified Convolution Method To Exact String Matching
6 pages
5CS4-AOA-Unit-3 @zammers
No ratings yet
5CS4-AOA-Unit-3 @zammers
7 pages
String Matching
No ratings yet
String Matching
34 pages
CH-8
No ratings yet
CH-8
26 pages
String Matching
No ratings yet
String Matching
63 pages
ADA UNIT 3 Complete Notes
No ratings yet
ADA UNIT 3 Complete Notes
59 pages
Sandeep Singh (Iii B.Tech I.T)
No ratings yet
Sandeep Singh (Iii B.Tech I.T)
179 pages
Ch9
No ratings yet
Ch9
33 pages
String Matching Algorithms
No ratings yet
String Matching Algorithms
46 pages
Module III Problem Solving
No ratings yet
Module III Problem Solving
16 pages
String Matching Introduction To NP-Completeness
No ratings yet
String Matching Introduction To NP-Completeness
37 pages
Boyer Moore Algorithm: Idan Szpektor
100% (1)
Boyer Moore Algorithm: Idan Szpektor
48 pages
Naïve Method. Code:: Naive, Rabin-Karp, and Knuth-Morris-Pratt Algorithms For String Matching
No ratings yet
Naïve Method. Code:: Naive, Rabin-Karp, and Knuth-Morris-Pratt Algorithms For String Matching
5 pages
DAA_unit_5
No ratings yet
DAA_unit_5
22 pages
Unit-5
No ratings yet
Unit-5
52 pages
DAA (Algorithms Knowledge Capsule 4 by Dr. Choudhary Ravi Singh)
No ratings yet
DAA (Algorithms Knowledge Capsule 4 by Dr. Choudhary Ravi Singh)
20 pages
Abstract
No ratings yet
Abstract
12 pages
Lecture 18 - String Matching-KMP
No ratings yet
Lecture 18 - String Matching-KMP
40 pages
Knuth-Morris-Pratt Algorithm KENT
No ratings yet
Knuth-Morris-Pratt Algorithm KENT
4 pages
publication_11_23912_388
No ratings yet
publication_11_23912_388
11 pages
Preliminary
No ratings yet
Preliminary
65 pages
Basic Mathematics. Explained Easy | For Beginners
From Everand
Basic Mathematics. Explained Easy | For Beginners
ExaGrecation
No ratings yet
De Moiver's Theorem (Trigonometry) Mathematics Question Bank
From Everand
De Moiver's Theorem (Trigonometry) Mathematics Question Bank
Mohmmad Khaja Shareef
No ratings yet
4.the Rabin Karp Algorithm
No ratings yet
4.the Rabin Karp Algorithm
16 pages
6 Suffix-Tree
No ratings yet
6 Suffix-Tree
20 pages
WINSEM2014 15 - CP2324 - 27 Jan 2015 - RM03 - 7 8088 IO System
No ratings yet
WINSEM2014 15 - CP2324 - 27 Jan 2015 - RM03 - 7 8088 IO System
18 pages
WINSEM2014 15 - CP2658 - 21 Jan 2015 - RM01 - 7 Addressing Modes PDF
No ratings yet
WINSEM2014 15 - CP2658 - 21 Jan 2015 - RM01 - 7 Addressing Modes PDF
8 pages
Cse221 Microprocessors
No ratings yet
Cse221 Microprocessors
2 pages
IGMO Round 1 Paper Final Typos Corrected Time Updated 2 France 2
No ratings yet
IGMO Round 1 Paper Final Typos Corrected Time Updated 2 France 2
5 pages
PDE Assignment
No ratings yet
PDE Assignment
2 pages
All About Matrices
No ratings yet
All About Matrices
14 pages
4026Q1 Specimen Additional Maths
No ratings yet
4026Q1 Specimen Additional Maths
16 pages
03 - Projection of Planes
No ratings yet
03 - Projection of Planes
22 pages
Hölder and locally Hölder Continuous Functions and Open Sets of Class C k C k lambda 1st ed. Edition Renato Fiorenza download
100% (1)
Hölder and locally Hölder Continuous Functions and Open Sets of Class C k C k lambda 1st ed. Edition Renato Fiorenza download
65 pages
65 4 1 Mathematics
No ratings yet
65 4 1 Mathematics
7 pages
MATH 119 Calculus With Analytic Geometry (2011-1)
No ratings yet
MATH 119 Calculus With Analytic Geometry (2011-1)
2 pages
AP Calculus AB Full Mock Exam 2020
No ratings yet
AP Calculus AB Full Mock Exam 2020
26 pages
Finding The General Rule of The Sequence
No ratings yet
Finding The General Rule of The Sequence
20 pages
Course Name: Discrete Mathematics For IT: Annexure CD - 01'
No ratings yet
Course Name: Discrete Mathematics For IT: Annexure CD - 01'
5 pages
2008 Galois Solution
No ratings yet
2008 Galois Solution
6 pages
Number Theory - Form 1
No ratings yet
Number Theory - Form 1
19 pages
Ch Complex
No ratings yet
Ch Complex
23 pages
Download ebooks file An introduction to integral transforms Patra all chapters
100% (7)
Download ebooks file An introduction to integral transforms Patra all chapters
55 pages
2020-21 First Term Exam F.4 Math 2
No ratings yet
2020-21 First Term Exam F.4 Math 2
15 pages
PMO 2019 Qualifying Stage
No ratings yet
PMO 2019 Qualifying Stage
19 pages
WS-02-Gr7-MATH-Sem1-24-25
No ratings yet
WS-02-Gr7-MATH-Sem1-24-25
8 pages
Hyperspectral Image Classification Based On KNN Sparse Representation
No ratings yet
Hyperspectral Image Classification Based On KNN Sparse Representation
5 pages
JEE Mathematics 2
No ratings yet
JEE Mathematics 2
180 pages
Local Media4801755508831326590 PDF
No ratings yet
Local Media4801755508831326590 PDF
266 pages
Pro Finite
No ratings yet
Pro Finite
6 pages
Prova Selecao PG-EIA 2022-1osem
No ratings yet
Prova Selecao PG-EIA 2022-1osem
6 pages
Kreatryx Control System
No ratings yet
Kreatryx Control System
33 pages
Geometric Programming Lecture
100% (2)
Geometric Programming Lecture
17 pages
SELECTED STORIES IN MATHEMATICS AND PHYSICS/book Lambert Academic Publishing
100% (1)
SELECTED STORIES IN MATHEMATICS AND PHYSICS/book Lambert Academic Publishing
91 pages
Euler Angles
No ratings yet
Euler Angles
18 pages
Analytic Geometry and Trigonometry
No ratings yet
Analytic Geometry and Trigonometry
7 pages
Rounding Decimals
No ratings yet
Rounding Decimals
28 pages

5.the Knuth Morris Pratt Algorithm

Uploaded by

5.the Knuth Morris Pratt Algorithm

Uploaded by

The Knuth-Morris-Pratt

You might also like