0% found this document useful (0 votes)

25 views

3-5 Simplified Note

The document discusses regular expressions and finite automata. It defines regular expressions recursively and provides examples. It also covers properties of regular sets, construction of regular expressions from finite automata using Arden's theorem and Thompson's construction, and finite automata with epsilon transitions.

Uploaded by

hammoudi.yousuf

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views

3-5 Simplified Note

Uploaded by

hammoudi.yousuf

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

Regular Expressions

A Regular Expression can be recursively defined as follows −

• ε is a Regular Expression indicates the language containing an empty string. (L (ε) = {ε})
• φ is a Regular Expression denoting an empty language. (L (φ) = { })
• x is a Regular Expression where L = {x}
• If X is a Regular Expression denoting the language L(X) and Y is a Regular Expression
denoting the language L(Y), then
• X + Y is a Regular Expression corresponding to the language L(X) ∪ L(Y) where
L(X+Y) = L(X) ∪ L(Y).
• X . Y is a Regular Expression corresponding to the language L(X) . L(Y) where
L(X.Y) = L(X) . L(Y)
• R* is a Regular Expression corresponding to the language L(R*)where L(R*) =
(L(R))*
• If we apply any of the rules several times from 1 to 5, they are Regular Expressions.

Some RE Examples
Regular
Regular Set
Expressions
(0 + 10*) L = { 0, 1, 10, 100, 1000, 10000, … }
(0*10*) L = {1, 01, 10, 010, 0010, …}
(0 + ε)(1 + ε) L = {ε, 0, 1, 01}
Set of strings of a’s and b’s of any length including the null string. So L = { ε, a,
(a+b)*
b, aa , ab , bb , ba, aaa…….}
Set of strings of a’s and b’s ending with the string abb. So L = {abb, aabb, babb,
(a+b)*abb
aaabb, ababb, …………..}
Set consisting of even number of 1’s including empty string, So L= {ε, 11, 1111,
(11)*
111111, ……….}
Set of strings consisting of even number of a’s followed by odd number of b’s , so
(aa)*(bb)*b
L = {b, aab, aabbb, aabbbbb, aaaab, aaaabbb, …………..}
String of a’s and b’s of even length can be obtained by concatenating any
(aa + ab + ba +
combination of the strings aa, ab, ba and bb including null, so L = {aa, ab, ba, bb,
bb)*
aaab, aaba, …………..}

Regular Sets
Any set that represents the value of the Regular Expression is called a Regular Set.
Properties of Regular Sets
Property 1. The union of two regular set is regular.
Proof −
Let us take two regular expressions
RE1 = a(aa)* and RE2 = (aa)*

So, L1 = {a, aaa, aaaaa,.....} (Strings of odd length excluding Null)

and L2 ={ ε, aa, aaaa, aaaaaa,.......} (Strings of even length including Null)

L1 ∪ L2 = { ε, a, aa, aaa, aaaa, aaaaa, aaaaaa,.......}

(Strings of all possible lengths including Null)

RE (L1 ∪ L2) = a* (which is a regular expression itself)

Hence, proved.
Property 2. The intersection of two regular set is regular.
Proof −
Let us take two regular expressions
RE1 = a(a*) and RE2 = (aa)*

So, L1 = { a,aa, aaa, aaaa, ....} (Strings of all possible lengths excluding Null)

L2 = { ε, aa, aaaa, aaaaaa,.......} (Strings of even length including Null)

L1 ∩ L2 = { aa, aaaa, aaaaaa,.......} (Strings of even length excluding Null)

RE (L1 ∩ L2) = aa(aa)* which is a regular expression itself.

Hence, proved.
Property 3. The complement of a regular set is regular.
Proof −
Let us take a regular expression −
RE = (aa)*
So, L = {ε, aa, aaaa, aaaaaa, .......} (Strings of even length including Null)
Complement of L is all the strings that is not in L.
So, L’ = {a, aaa, aaaaa, .....} (Strings of odd length excluding Null)
RE (L’) = a(aa)* which is a regular expression itself.
Hence, proved.
Property 4. The difference of two regular set is regular.
Proof −
Let us take two regular expressions −
RE1 = a (a*) and RE2 = (aa)*

So, L1 = {a, aa, aaa, aaaa, ....} (Strings of all possible lengths excluding Null)

L2 = { ε, aa, aaaa, aaaaaa,.......} (Strings of even length including Null)

L1 – L2 = {a, aaa, aaaaa, aaaaaaa, ....}

(Strings of all odd lengths excluding Null)

RE (L1 – L2) = a (aa)* which is a regular expression.

Hence, proved.
Property 5. The reversal of a regular set is regular.
Proof −

We have to prove LR is also regular if L is a regular set.

Let, L = {01, 10, 11, 10}
RE (L) = 01 + 10 + 11 + 10

LR = {10, 01, 11, 01}

RE (LR) = 01 + 10 + 11 + 10 which is regular

Hence, proved.
Property 6. The closure of a regular set is regular.
Proof −
If L = {a, aaa, aaaaa, .......} (Strings of odd length excluding Null)
i.e., RE (L) = a (aa)*
L* = {a, aa, aaa, aaaa , aaaaa,……………} (Strings of all lengths excluding Null)
RE (L*) = a (a)*
Hence, proved.
Property 7. The concatenation of two regular sets is regular.
Proof −
Let RE1 = (0+1)*0 and RE2 = 01(0+1)*

Here, L1 = {0, 00, 10, 000, 010, ......} (Set of strings ending in 0)

and L2 = {01, 010,011,.....} (Set of strings beginning with 01)

Then, L1 L2 = {001,0010,0011,0001,00010,00011,1001,10010,.............}
Set of strings containing 001 as a substring which can be represented by an RE − (0 + 1)*001(0 +
1)*
Hence, proved.

Identities Related to Regular Expressions

Given R, P, L, Q as regular expressions, the following identities hold −
• ∅* = ε
• ε* = ε
• RR* = R*R
• R*R* = R*
• (R*)* = R*
• RR* = R*R
• (PQ)*P =P(QP)*
• (a+b)* = (a*b*)* = (a*+b*)* = (a+b*)* = a*(ba*)*
• R + ∅ = ∅ + R = R (The identity for union)
• R ε = ε R = R (The identity for concatenation)
• ∅ L = L ∅ = ∅ (The annihilator for concatenation)
• R + R = R (Idempotent law)
• L (M + N) = LM + LN (Left distributive law)
• (M + N) L = ML + NL (Right distributive law)
• ε + RR* = ε + R*R = R*

Arden's Theorem
In order to find out a regular expression of a Finite Automaton, we use Arden’s Theorem along with
the properties of regular expressions.
Statement −
Let P and Q be two regular expressions.
If P does not contain null string, then R = Q + RP has a unique solution that is R = QP*
Proof −
R = Q + (Q + RP)P [After putting the value R = Q + RP]
= Q + QP + RPP
When we put the value of R recursively again and again, we get the following equation −

R = Q + QP + QP2 + QP3…..

R = Q (ε + P + P2 + P3 + …. )
R = QP* [As P* represents (ε + P + P2 + P3 + ….) ]
Hence, proved.
Assumptions for Applying Arden’s Theorem
• The transition diagram must not have NULL transitions
• It must have only one initial state

Method
Step 1 − Create equations as the following form for all the states of the DFA having n states with
initial state q1.

q1 = q1R11 + q2R21 + … + qnRn1 + ε

q2 = q1R12 + q2R22 + … + qnRn2

…………………………
…………………………
…………………………
…………………………
qn = q1R1n + q2R2n + … + qnRnn

Rij represents the set of labels of edges from qi to qj, if no such edge exists, then Rij = ∅

Step 2 − Solve these equations to get the equation for the final state in terms of Rij

Problem
Construct a regular expression corresponding to the automata given below −

Solution −
Here the
initial state
and final
state is q1.

The
equations
for the three
states q1,
q2, and q3
are as
follows −
q1 = q1a + q3a + ε (ε move is because q1 is the initial state0

q2 = q1b + q2b + q3b

q3 = q2a
Now, we will solve these three equations −
q2 = q1b + q2b + q3b

= q1b + q2b + (q2a)b (Substituting value of q3)

= q1b + q2(b + ab)

= q1b (b + ab)* (Applying Arden’s Theorem)

q1 = q1a + q3a + ε

= q1a + q2aa + ε (Substituting value of q3)

= q1a + q1b(b + ab*)aa + ε (Substituting value of q2)

= q1(a + b(b + ab)*aa) + ε

= ε (a+ b(b + ab)aa)

= (a + b(b + ab)*aa)*
Hence, the regular expression is (a + b(b + ab)*aa)*.
Problem
Construct a regular expression corresponding to the automata given below −

Solution −
Here the initial state is q1 and the final state is q2

Now we write down the equations −

q1 = q10 + ε

q2 = q11 + q20
q3 = q21 + q30 + q31

Now, we will solve these three equations −

q1 = ε0* [As, εR = R]

So, q1 = 0*

q2 = 0*1 + q20

So, q2 = 01(0) [By Arden’s theorem]

Hence, the regular expression is 010.

Construction of an FA from an RE
We can use Thompson's Construction to find out a Finite Automaton from a Regular Expression.
We will reduce the regular expression into smallest regular expressions and converting these to NFA
and finally to DFA.
Some basic RA expressions are the following −
Case 1 − For a regular expression ‘a’, we can construct the following FA −

Case 2 − For a regular expression ‘ab’, we can construct the following FA −

Case 3 − For a regular expression (a+b), we can construct the following FA −

Case 4 − For a regular expression (a+b)*, we can construct the following FA −

Method
Step 1 Construct an NFA with Null moves from the given regular expression.
Step 2 Remove Null transition from the NFA and convert it into its equivalent DFA.
Problem
Convert the following RA into its equivalent DFA − 1 (0 + 1)* 0
Solution
We will concatenate three expressions "1", "(0 + 1)*" and "0"

Now we will remove the ε transitions. After we remove the ε transitions from the NDFA, we get the
following −
It is an NDFA corresponding to the RE − 1 (0 + 1)* 0. If you want to convert it into a DFA, simply
apply the method of converting NDFA to DFA discussed in Chapter 1.

Finite Automata with Null Moves (NFA-ε)

A Finite Automaton with null moves (FA-ε) does transit not only after giving input from the
alphabet set but also without any input symbol. This transition without input is called a null move.
An NFA-ε is represented formally by a 5-tuple (Q, ∑, δ, q0, F), consisting of

• Q − a finite set of states

• ∑ − a finite set of input symbols

• δ − a transition function δ : Q × (∑ ∪ {ε}) → 2Q

• q0 − an initial state q0 ∈ Q

• F − a set of final state/states of Q (F⊆Q).

The above (FA-ε) accepts a string set − {0, 1, 01}

Removal of Null Moves from Finite Automata

If in an NDFA, there is ϵ-move between vertex X to vertex Y, we can remove it using the following
steps −
• Find all the outgoing edges from Y.
• Copy all these edges starting from X without changing the edge labels.
• If X is an initial state, make Y also an initial state.
• If Y is a final state, make X also a final state.
Problem
Convert the following NFA-ε to NFA without Null move.

Solution
Step 1 −
Here the ε transition is between q1 and q2, so let q1 is X and qf is Y.

Here the outgoing edges from qf is to qf for inputs 0 and 1.

Step 2 −
Now we will Copy all these edges from q1 without changing the edges from qf and get the
following FA −
Step 3 −
Here q1 is an initial state, so we make qf also an initial state.

So the FA becomes −

Step 4 −
Here qf is a final state, so we make q1 also a final state.

So the FA becomes −
Ambiguity in Context-Free Grammars
If a context free grammar G has more than one derivation tree for some string w ∈ L(G), it is
called an ambiguous grammar. There exist multiple right-most or left-most derivations for some
string generated from that grammar.

Problem
Check whether the grammar G with production rules −
X → X+X | X*X |X| a
is ambiguous or not.

Solution
Let’s find out the derivation tree for the string "a+a*a". It has two leftmost derivations.
Derivation 1 − X → X+X → a +X → a+ X*X → a+a*X → a+a*a
Parse tree 1 −

Derivation 2 − X → XX → X+XX → a+ XX → a+aX → a+a*a

Parse tree 2 −
Since there are two parse trees for a single string "a+a*a", the grammar G is ambiguous.

In a CFG, it may happen that all the production rules and symbols are not needed for the derivation
of strings. Besides, there may be some null productions and unit productions. Elimination of these
productions and symbols is called simplification of CFGs. Simplification essentially comprises of
the following steps −

• Reduction of CFG
• Removal of Unit Productions
• Removal of Null Productions

Reduction of CFG
CFGs are reduced in two phases −
Phase 1 − Derivation of an equivalent grammar, G’, from the CFG, G, such that each variable
derives some terminal string.
Derivation Procedure −
Step 1 − Include all symbols, W1, that derive some terminal and initialize i=1.

Step 2 − Include all symbols, Wi+1, that derive Wi.

Step 3 − Increment i and repeat Step 2, until Wi+1 = Wi.

Step 4 − Include all production rules that have Wi in it.

Phase 2 − Derivation of an equivalent grammar, G”, from the CFG, G’, such that each symbol
appears in a sentential form.
Derivation Procedure −
Step 1 − Include the start symbol in Y1 and initialize i = 1.
Step 2 − Include all symbols, Yi+1, that can be derived from Yi and include all production rules that
have been applied.
Step 3 − Increment i and repeat Step 2, until Yi+1 = Yi.

Problem
Find a reduced grammar equivalent to the grammar G, having production rules, P: S → AC | B, A
→ a, C → c | BC, E → aA | e

Solution
Phase 1 −
T = { a, c, e }
W1 = { A, C, E } from rules A → a, C → c and E → aA

W2 = { A, C, E } U { S } from rule S → AC

W3 = { A, C, E, S } U ∅

Since W2 = W3, we can derive G’ as −

G’ = { { A, C, E, S }, { a, c, e }, P, {S}}
where P: S → AC, A → a, C → c , E → aA | e
Phase 2 −
Y1 = { S }

Y2 = { S, A, C } from rule S → AC

Y3 = { S, A, C, a, c } from rules A → a and C → c

Y4 = { S, A, C, a, c }

Since Y3 = Y4, we can derive G” as −

G” = { { A, C, S }, { a, c }, P, {S}}
where P: S → AC, A → a, C → c

Removal of Unit Productions

Any production rule in the form A → B where A, B ∈ Non-terminal is called unit production..

Removal Procedure −
Step 1 − To remove A → B, add production A → x to the grammar rule whenever B → x occurs in
the grammar. [x ∈ Terminal, x can be Null]
Step 2 − Delete A → B from the grammar.
Step 3 − Repeat from step 1 until all unit productions are removed.
Problem
Remove unit production from the following −
S → XY, X → a, Y → Z | b, Z → M, M → N, N → a
Solution −
There are 3 unit productions in the grammar −
Y → Z, Z → M, and M → N
At first, we will remove M → N.
As N → a, we add M → a, and M → N is removed.
The production set becomes
S → XY, X → a, Y → Z | b, Z → M, M → a, N → a
Now we will remove Z → M.
As M → a, we add Z→ a, and Z → M is removed.
The production set becomes
S → XY, X → a, Y → Z | b, Z → a, M → a, N → a
Now we will remove Y → Z.
As Z → a, we add Y→ a, and Y → Z is removed.
The production set becomes
S → XY, X → a, Y → a | b, Z → a, M → a, N → a
Now Z, M, and N are unreachable, hence we can remove those.
The final CFG is unit production free −
S → XY, X → a, Y → a | b

Removal of Null Productions

In a CFG, a non-terminal symbol ‘A’ is a nullable variable if there is a production A → ε or there is
a derivation that starts at A and finally ends up with
ε: A → .......… → ε

Removal Procedure
Step 1 − Find out nullable non-terminal variables which derive ε.
Step 2 − For each production A → a, construct all productions A → x where x is obtained from ‘a’
by removing one or multiple non-terminals from Step 1.
Step 3 − Combine the original productions with the result of step 2 and remove ε - productions.
Problem
Remove null production from the following −
S → ASA | aB | b, A → B, B → b | ∈
Solution −
There are two nullable variables − A and B
At first, we will remove B → ε.
After removing B → ε, the production set becomes −
S→ASA | aB | b | a, A ε B| b | &epsilon, B → b
Now we will remove A → ε.
After removing A → ε, the production set becomes −
S→ASA | aB | b | a | SA | AS | S, A → B| b, B → b
This is the final production set without null transition.

Chomsky Normal Form

A CFG is in Chomsky Normal Form if the Productions are in the following forms −
• A→a
• A → BC
• S→ε
where A, B, and C are non-terminals and a is terminal.

Algorithm to Convert into Chomsky Normal Form −

Step 1 − If the start symbol S occurs on some right side, create a new start symbol S’ and a new
production S’→ S.
Step 2 − Remove Null productions. (Using the Null production removal algorithm discussed
earlier)
Step 3 − Remove unit productions. (Using the Unit production removal algorithm discussed earlier)
Step 4 − Replace each production A → B1…Bn where n > 2 with A → B1C where C → B2 …Bn.
Repeat this step for all productions having two or more symbols in the right side.
Step 5 − If the right side of any production is in the form A → aB where a is a terminal and A, B
are non-terminal, then the production is replaced by A → XB and X → a. Repeat this step for every
production which is in the form A → aB.

Problem
Convert the following CFG into CNF
S → ASA | aB, A → B | S, B → b | ε
Solution
(1) Since S appears in R.H.S, we add a new state S0 and S0→S is added to the production set and it
becomes −
S0→S, S→ ASA | aB, A → B | S, B → b | ∈

(2) Now we will remove the null productions −

B → ∈ and A → ∈
After removing B → ε, the production set becomes −
S0→S, S→ ASA | aB | a, A → B | S | ∈, B → b

After removing A → ∈, the production set becomes −

S0→S, S→ ASA | aB | a | AS | SA | S, A → B | S, B → b

(3) Now we will remove the unit productions.

After removing S → S, the production set becomes −
S0→S, S→ ASA | aB | a | AS | SA, A → B | S, B → b

After removing S0→ S, the production set becomes −

S0→ ASA | aB | a | AS | SA, S→ ASA | aB | a | AS | SA

A → B | S, B → b
After removing A→ B, the production set becomes −
S0 → ASA | aB | a | AS | SA, S→ ASA | aB | a | AS | SA

A→S|b
B→b
After removing A→ S, the production set becomes −
S0 → ASA | aB | a | AS | SA, S→ ASA | aB | a | AS | SA

A → b |ASA | aB | a | AS | SA, B → b
(4) Now we will find out more than two variables in the R.H.S
Here, S0→ ASA, S → ASA, A→ ASA violates two Non-terminals in R.H.S.

Hence we will apply step 4 and step 5 to get the following final production set which is in CNF −
S0→ AX | aB | a | AS | SA

S→ AX | aB | a | AS | SA
A → b |AX | aB | a | AS | SA
B→b
X → SA
(5) We have to change the productions S0→ aB, S→ aB, A→ aB

And the final production set becomes −

S0→ AX | YB | a | AS | SA

S→ AX | YB | a | AS | SA
A → b A → b |AX | YB | a | AS | SA
B→b
X → SA
Y→a

Greibach Normal Form

A CFG is in Greibach Normal Form if the Productions are in the following forms −
A→b
A → bD1…Dn

S→ε
where A, D1,....,Dn are non-terminals and b is a terminal.

Algorithm to Convert a CFG into Greibach Normal Form

Problem
Convert the following CFG into CNF
S → XY | Xn | p
X → mX | m
Y → Xn | o
Solution
Here, S does not appear on the right side of any production and there are no unit or null productions
in the production rule set. So, we can skip Step 1 to Step 3.
Step 4
Now after replacing
X in S → XY | Xo | p
with
mX | m
we obtain
S → mXY | mY | mXo | mo | p.
And after replacing
X in Y → Xn | o

with the right side of

X → mX | m
we obtain
Y → mXn | mn | o.
Two new productions O → o and P → p are added to the production set and then we came to the
final GNF as the following −
S → mXY | mY | mXC | mC | p
X → mX | m
Y → mXD | mD | o
O→o
P→p

CFL Closure Property

Context-free languages are closed under −
• Union
• Concatenation
• Kleene Star operation

Union
Let L1 and L2 be two context free languages. Then L1 ∪ L2 is also context free.
Example
Let L1 = { anbn , n > 0}. Corresponding grammar G1 will have P: S1 → aAb|ab

Let L2 = { cmdm , m ≥ 0}. Corresponding grammar G2 will have P: S2 → cBb| ε

Union of L1 and L2, L = L1 ∪ L2 = { anbn } ∪ { cmdm }

The corresponding grammar G will have the additional production S → S1 | S2

Concatenation
If L1 and L2 are context free languages, then L1L2 is also context free.

Example
Union of the languages L1 and L2, L = L1L2 = { anbncmdm }

The corresponding grammar G will have the additional production S → S1 S2

Kleene Star
If L is a context free language, then L* is also context free.

Example
Let L = { anbn , n ≥ 0}. Corresponding grammar G will have P: S → aAb| ε

Kleene Star L1 = { anbn }*

The corresponding grammar G1 will have additional productions S1 → SS1 | ε

Context-free languages are not closed under −

• Intersection − If L1 and L2 are context free languages, then L1 ∩ L2 is not necessarily
context free.
• Intersection with Regular Language − If L1 is a regular language and L2 is a context free
language, then L1 ∩ L2 is a context free language.
• Complement − If L1 is a context free language, then L1’ may not be context free.
Pushdown Automata Introduction
Basic Structure of PDA
A pushdown automaton is a way to implement a context-free grammar in a similar way we design
DFA for a regular grammar. A DFA can remember a finite amount of information, but a PDA can
remember an infinite amount of information.
Basically a pushdown automaton is −
"Finite state machine" + "a stack"
A pushdown automaton has three components −
• an input tape,
• a control unit, and
• a stack with infinite size.
The stack head scans the top symbol of the stack.
A stack does two operations −
• Push − a new symbol is added at the top.
• Pop − the top symbol is read and removed.
A PDA may or may not read an input symbol, but it has to read the top of the stack in every
transition.

A PDA can be formally described as a 7-tuple (Q, ∑, S, δ, q0, I, F) −

• Q is the finite number of states

• ∑ is input alphabet
• S is stack symbols
• δ is the transition function: Q × (∑ ∪ {ε}) × S × Q × S*
• q0 is the initial state (q0 ∈ Q)

• I is the initial stack top symbol (I ∈ S)

• F is a set of accepting states (F ∈ Q)
The following diagram shows a transition in a PDA from a state q1 to state q2, labeled as a,b → c −

This means at state q1, if we encounter an input string ‘a’ and top symbol of the stack is ‘b’, then
we pop ‘b’, push ‘c’ on top of the stack and move to state q2.

Terminologies Related to PDA

Instantaneous Description
The instantaneous description (ID) of a PDA is represented by a triplet (q, w, s) where
• q is the state
• w is unconsumed input
• s is the stack contents

Turnstile Notation
The "turnstile" notation is used for connecting pairs of ID's that represent one or many moves of a
PDA. The process of transition is denoted by the turnstile symbol "⊢".
Consider a PDA (Q, ∑, S, δ, q 0, I, F). A transition can be mathematically represented by the
following turnstile notation −
(p, aw, Tβ) ⊢ (q, w, αb)

This implies that while taking a transition from state p to state q, the input symbol ‘a’ is consumed,
and the top of the stack ‘T’ is replaced by a new string ‘α’.
Note − If we want zero or more moves of a PDA, we have to use the symbol (⊢*) for it.

There are two different ways to define PDA acceptability.

Final State Acceptability
In final state acceptability, a PDA accepts a string when, after reading the entire string, the PDA is
in a final state. From the starting state, we can make moves that end up in a final state with any
stack values. The stack values are irrelevant as long as we end up in a final state.
For a PDA (Q, ∑, S, δ, q0, I, F), the language accepted by the set of final states F is −

L(PDA) = {w | (q0, w, I) ⊢* (q, ε, x), q ∈ F}

for any input stack string x.

Empty Stack Acceptability

Here a PDA accepts a string when, after reading the entire string, the PDA has emptied its stack.
For a PDA (Q, ∑, S, δ, q0, I, F), the language accepted by the empty stack is −

L(PDA) = {w | (q0, w, I) ⊢* (q, ε, ε), q ∈ Q}

Example
Construct a PDA that accepts L = {0n 1n | n ≥ 0}

Solution

This language accepts L = {ε, 01, 0011, 000111, ............................. }

Here, in this example, the number of ‘a’ and ‘b’ have to be same.
• Initially we put a special symbol ‘$’ into the empty stack.
• Then at state q2, if we encounter input 0 and top is Null, we push 0 into stack. This may
iterate. And if we encounter input 1 and top is 0, we pop this 0.
• Then at state q3, if we encounter input 1 and top is 0, we pop this 0. This may also iterate.
And if we encounter input 1 and top is 0, we pop the top element.
• If the special symbol ‘$’ is encountered at top of the stack, it is popped out and it finally
goes to the accepting state q4.

Example
Construct a PDA that accepts L = { wwR | w = (a+b)* }
Solution

Initially we put a special symbol ‘$’ into the empty stack. At state q2, the w is being read. In state

q3, each 0 or 1 is popped when it matches the input. If any other input is given, the PDA will go to a

dead state. When we reach that special symbol ‘$’, we go to the accepting state q4.

PDA & Context-Free Grammar

If a grammar G is context-free, we can build an equivalent nondeterministic PDA which accepts the
language that is produced by the context-free grammar G. A parser can be built for the grammar G.

Also, if P is a pushdown automaton, an equivalent context-free grammar G can be constructed

where
L(G) = L(P)
In the next two topics, we will discuss how to convert from PDA to CFG and vice versa.

Algorithm to find PDA corresponding to a given CFG

Input − A CFG, G = (V, T, P, S)
Output − Equivalent PDA, P = (Q, ∑, S, δ, q0, I, F)

Step 1 − Convert the productions of the CFG into GNF.

Step 2 − The PDA will have only one state {q}.
Step 3 − The start symbol of CFG will be the start symbol in the PDA.
Step 4 − All non-terminals of the CFG will be the stack symbols of the PDA and all the terminals of
the CFG will be the input symbols of the PDA.
Step 5 − For each production in the form A → aX where a is terminal and A, X are combination of
terminal and non-terminals, make a transition δ (q, a, A).
Problem
Construct a PDA from the following CFG.
G = ({S, X}, {a, b}, P, S)
where the productions are −
S → XS | ε , A → aXb | Ab | ab

Solution
Let the equivalent PDA,
P = ({q}, {a, b}, {a, b, X, S}, δ, q, S)
where δ −
δ(q, ε , S) = {(q, XS), (q, ε )}
δ(q, ε , X) = {(q, aXb), (q, Xb), (q, ab)}
δ(q, a, a) = {(q, ε )}
δ(q, 1, 1) = {(q, ε )}

Algorithm to find CFG corresponding to a given PDA

Input − A CFG, G = (V, T, P, S)
Output − Equivalent PDA, P = (Q, ∑, S, δ, q0, I, F) such that the non- terminals of the grammar G
will be {Xwx | w,x ∈ Q} and the start state will be Aq0,F.

Step 1 − For every w, x, y, z ∈ Q, m ∈ S and a, b ∈ ∑, if δ (w, a, ε) contains (y, m) and (z, b, m)

contains (x, ε), add the production rule Xwx → a Xyzb in grammar G.

Step 2 − For every w, x, y, z ∈ Q, add the production rule Xwx → XwyXyx in grammar G.

Step 3 − For w ∈ Q, add the production rule Xww → ε in grammar G.

Pushdown Automata & Parsing

Parsing is used to derive a string using the production rules of a grammar. It is used to check the
acceptability of a string. Compiler is used to check whether or not a string is syntactically correct. A
parser takes the inputs and builds a parse tree.

A parser can be of two types −

• Top-Down Parser − Top-down parsing starts from the top with the start-symbol and derives
a string using a parse tree.
• Bottom-Up Parser − Bottom-up parsing starts from the bottom with the string and comes to
the start symbol using a parse tree.

Design of Top-Down Parser

For top-down parsing, a PDA has the following four types of transitions −
• Pop the non-terminal on the left hand side of the production at the top of the stack and push
its right-hand side string.
• If the top symbol of the stack matches with the input symbol being read, pop it.
• Push the start symbol ‘S’ into the stack.
• If the input string is fully read and the stack is empty, go to the final state ‘F’.

(x+yz, I) ⊢(x +yz, SI) ⊢ (x+yz, S+XI) ⊢(x+yz, X+XI)

⊢(x+y*z, Y+X I) ⊢(x+y*z, x+XI) ⊢(+y*z, +XI) ⊢ (y*z, XI)
⊢(y*z, X*YI) ⊢(y*z, y*YI) ⊢(*z,*YI) ⊢(z, YI) ⊢(z, zI) ⊢(ε, I)

Design of a Bottom-Up Parser

For bottom-up parsing, a PDA has the following four types of transitions −
• Push the current input symbol into the stack.
• Replace the right-hand side of a production at the top of the stack with its left-hand side.
• If the top of the stack element matches with the current input symbol, pop it.
• If the input string is fully read and only if the start symbol ‘S’ remains in the stack, pop it
and go to the final state ‘F’.

Example
Design a top-down parser for the expression "x+y*z" for the grammar G with the following
production rules −
P: S → S+X | X, X → X*Y | Y, Y → (S) | id
Solution
If the PDA is (Q, ∑, S, δ, q0, I, F), then the bottom-up parsing is −
(x+y*z, I) ⊢ (+y*z, xI) ⊢ (+y*z, YI) ⊢ (+y*z, XI) ⊢ (+y*z, SI)
⊢(y*z, +SI) ⊢ (*z, y+SI) ⊢ (*z, Y+SI) ⊢ (*z, X+SI) ⊢ (z, *X+SI)
⊢ (ε, z*X+SI) ⊢ (ε, Y*X+SI) ⊢ (ε, X+SI) ⊢ (ε, SI)

Turing Machine Introduction

A Turing Machine is an accepting device which accepts the languages (recursively enumerable set)
generated by type 0 grammars. It was invented in 1936 by Alan Turing.

Definition
A Turing Machine (TM) is a mathematical model which consists of an infinite length tape divided
into cells on which input is given. It consists of a head which reads the input tape. A state register
stores the state of the Turing machine. After reading an input symbol, it is replaced with another
symbol, its internal state is changed, and it moves from one cell to the right or left. If the TM
reaches the final state, the input string is accepted, otherwise rejected.
A TM can be formally described as a 7-tuple (Q, X, ∑, δ, q0, B, F) where −

• Q is a finite set of states

• X is the tape alphabet
• ∑ is the input alphabet
• δ is a transition function; δ : Q × X → Q × X × {Left_shift, Right_shift}.
• q0 is the initial state

• B is the blank symbol

• F is the set of final states

Comparison with the previous automaton

The following table shows a comparison of how a Turing machine differs from Finite Automaton
and Pushdown Automaton.

Machine Stack Data Structure Deterministic?

Finite Automaton N.A Yes
Pushdown Automaton Last In First Out(LIFO) No
Turing Machine Infinite tape Yes

Example of Turing machine

Turing machine M = (Q, X, ∑, δ, q0, B, F) with

• Q = {q0, q1, q2, qf}

• X = {a, b}
• ∑ = {1}
• q0 = {q0}
• B = blank symbol
• F = {qf }

δ is given by −

Tape alphabet symbol Present State ‘q0’ Present State ‘q1’ Present State ‘q2’
a 1Rq1 1Lq0 1Lqf
b 1Lq2 1Rq1 1Rqf
Here the transition 1Rq1 implies that the write symbol is 1, the tape moves right, and the next state
is q1. Similarly, the transition 1Lq2 implies that the write symbol is 1, the tape moves left, and the
next state is q2.

Time and Space Complexity of a Turing Machine

For a Turing machine, the time complexity refers to the measure of the number of times the tape
moves when the machine is initialized for some input symbols and the space complexity is the
number of cells of the tape written.
Time complexity all reasonable functions −
T(n) = O(n log n)
TM's space complexity −
S(n) = O(n)

Accepted Language & Decided Language

A TM accepts a language if it enters into a final state for any input string w. A language is
recursively enumerable (generated by Type-0 grammar) if it is accepted by a Turing machine.

A TM decides a language if it accepts it and enters into a rejecting state for any input not in the
language. A language is recursive if it is decided by a Turing machine.
There may be some cases where a TM does not stop. Such TM accepts the language, but it does not
decide it.

Designing a Turing Machine

The basic guidelines of designing a Turing machine have been explained below with the help of a
couple of examples.
Example 1
Design a TM to recognize all strings consisting of an odd number of α’s.
Solution
The Turing machine M can be constructed by the following moves −
• Let q1 be the initial state.

• If M is in q1; on scanning α, it enters the state q2 and writes B (blank).

• If M is in q2; on scanning α, it enters the state q1 and writes B (blank).

• From the above moves, we can see that M enters the state q1 if it scans an even number of
α’s, and it enters the state q2 if it scans an odd number of α’s. Hence q2 is the only accepting
state.
Hence,
M = {{q1, q2}, {1}, {1, B}, δ, q1, B, {q2}}

where δ is given by −

Tape alphabet symbol Present State ‘q1’ Present State ‘q2’

α BRq2 BRq1

Example 2
Design a Turing Machine that reads a string representing a binary number and erases all leading 0’s
in the string. However, if the string comprises of only 0’s, it keeps one 0.
Solution
Let us assume that the input string is terminated by a blank symbol, B, at each end of the string.
The Turing Machine, M, can be constructed by the following moves −
• Let q0 be the initial state.

• If M is in q0, on reading 0, it moves right, enters the state q1 and erases 0. On reading 1, it
enters the state q2 and moves right.

• If M is in q1, on reading 0, it moves right and erases 0, i.e., it replaces 0’s by B’s. On
reaching the leftmost 1, it enters q2 and moves right. If it reaches B, i.e., the string comprises
of only 0’s, it moves left and enters the state q3.

• If M is in q2, on reading either 0 or 1, it moves right. On reaching B, it moves left and enters
the state q4. This validates that the string comprises only of 0’s and 1’s.

• If M is in q3, it replaces B by 0, moves left and reaches the final state qf.
• If M is in q4, on reading either 0 or 1, it moves left. On reaching the beginning of the string,
i.e., when it reads B, it reaches the final state qf.

Hence,
M = {{q0, q1, q2, q3, q4, qf}, {0,1, B}, {1, B}, δ, q0, B, {qf}}

where δ is given by −

Tape alphabet Present State Present State Present State Present State Present State
symbol ‘q0’ ‘q1’ ‘q2’ ‘q3’ ‘q4’
0 BRq1 BRq1 ORq2 - OLq4
1 1Rq2 1Rq2 1Rq2 - 1Lq4
B BRq1 BLq3 BLq4 OLqf BRqf

Multi-tape Turing Machine

Multi-tape Turing Machines have multiple tapes where each tape is accessed with a separate head.
Each head can move independently of the other heads. Initially the input is on tape 1 and others are
blank. At first, the first tape is occupied by the input and the other tapes are kept blank. Next, the
machine reads consecutive symbols under its heads and the TM prints a symbol on each tape and
moves its heads.

A Multi-tape Turing machine can be formally described as a 6-tuple (Q, X, B, δ, q0, F) where −

• Q is a finite set of states

• X is the tape alphabet
• B is the blank symbol
• δ is a relation on states and symbols where

δ: Q × Xk → Q × (X × {Left_shift, Right_shift, No_shift })k

where there is k number of tapes
• q0 is the initial state

• F is the set of final states

Note − Every Multi-tape Turing machine has an equivalent single-tape Turing machine.

Multi-track Turing Machine

Multi-track Turing machines, a specific type of Multi-tape Turing machine, contain multiple tracks
but just one tape head reads and writes on all tracks. Here, a single tape head reads n symbols from
n tracks at one step. It accepts recursively enumerable languages like a normal single-track single-
tape Turing Machine accepts.

A Multi-track Turing machine can be formally described as a 6-tuple (Q, X, ∑, δ, q0, F) where −

• Q is a finite set of states

• X is the tape alphabet
• ∑ is the input alphabet
• δ is a relation on states and symbols where
δ(Qi, [a1, a2, a3,....]) = (Qj, [b1, b2, b3,....], Left_shift or Right_shift)

• q0 is the initial state

• F is the set of final states

Note − For every single-track Turing Machine S, there is an equivalent multi-track Turing Machine
M such that L(S) = L(M).

Non-Deterministic Turing Machine

In a Non-Deterministic Turing Machine, for every state and symbol, there are a group of actions the
TM can have. So, here the transitions are not deterministic. The computation of a non-deterministic
Turing Machine is a tree of configurations that can be reached from the start configuration.
An input is accepted if there is at least one node of the tree which is an accept configuration,
otherwise it is not accepted. If all branches of the computational tree halt on all inputs, the non-
deterministic Turing Machine is called a Decider and if for some input, all branches are rejected,
the input is also rejected.
A non-deterministic Turing machine can be formally defined as a 6-tuple (Q, X, ∑, δ, q0, B, F)
where −
• Q is a finite set of states
• X is the tape alphabet
• ∑ is the input alphabet
• δ is a transition function;
δ : Q × X → P(Q × X × {Left_shift, Right_shift}).
• q0 is the initial state

• B is the blank symbol

• F is the set of final states

Regular-Expressions : LECT-2
No ratings yet
Regular-Expressions : LECT-2
12 pages
Unit 2: Can Be Recursively Defined As Follows
No ratings yet
Unit 2: Can Be Recursively Defined As Follows
28 pages
WINSEM2024-25_ISWE203L_TH_VL2024250503413_2025-01-20_Reference-Material-I
No ratings yet
WINSEM2024-25_ISWE203L_TH_VL2024250503413_2025-01-20_Reference-Material-I
30 pages
Automata Chapter 2
No ratings yet
Automata Chapter 2
15 pages
toc unit2
No ratings yet
toc unit2
24 pages
Unit II Regular Expression
No ratings yet
Unit II Regular Expression
176 pages
TOC (UNIT-II)
No ratings yet
TOC (UNIT-II)
69 pages
Module 2 Part 1
No ratings yet
Module 2 Part 1
7 pages
lecture 7
No ratings yet
lecture 7
49 pages
Regular Expression
No ratings yet
Regular Expression
21 pages
Regular Expressions (RE) 3.1
100% (3)
Regular Expressions (RE) 3.1
53 pages
Chapter 2 RegularExpressions
No ratings yet
Chapter 2 RegularExpressions
95 pages
Chapter Two Regular Expression and Regular Language
No ratings yet
Chapter Two Regular Expression and Regular Language
30 pages
Unit 2 TOC 3
No ratings yet
Unit 2 TOC 3
49 pages
Lecture 7
No ratings yet
Lecture 7
70 pages
Chapter 2 RegularExpressions (3)
No ratings yet
Chapter 2 RegularExpressions (3)
95 pages
Class 10 Regular Expression
No ratings yet
Class 10 Regular Expression
26 pages
Regular Expression
No ratings yet
Regular Expression
42 pages
05 Handout 1
No ratings yet
05 Handout 1
3 pages
Regular Expression: Operations On Regular Language
No ratings yet
Regular Expression: Operations On Regular Language
33 pages
Toc Unit 2
No ratings yet
Toc Unit 2
29 pages
Unit 2-Theory of Computation
No ratings yet
Unit 2-Theory of Computation
44 pages
TOC Module-2 Notes
No ratings yet
TOC Module-2 Notes
24 pages
Regular Expressions
No ratings yet
Regular Expressions
34 pages
Ch2 Automata & Complexity Theory2016
No ratings yet
Ch2 Automata & Complexity Theory2016
44 pages
regular expression
No ratings yet
regular expression
87 pages
Toc U2
No ratings yet
Toc U2
31 pages
CS351 Regular Expressions
No ratings yet
CS351 Regular Expressions
14 pages
Regular Expressions and Languages
No ratings yet
Regular Expressions and Languages
16 pages
Unit-Ii Regular Expressions and Languages Definition
No ratings yet
Unit-Ii Regular Expressions and Languages Definition
34 pages
REGULAR EXPRESSIONS
No ratings yet
REGULAR EXPRESSIONS
22 pages
FLAT - UNIT-2
No ratings yet
FLAT - UNIT-2
32 pages
Index UNIT - II: Regular Expressions No: Khit, Cse
No ratings yet
Index UNIT - II: Regular Expressions No: Khit, Cse
11 pages
Flat-Unit-2 Notes
No ratings yet
Flat-Unit-2 Notes
23 pages
Regular Expressions
No ratings yet
Regular Expressions
30 pages
MITWPU - Unit 2-Theory of Computation
No ratings yet
MITWPU - Unit 2-Theory of Computation
50 pages
Chapter_2_Finite Automata and Regular Language_Part_2
No ratings yet
Chapter_2_Finite Automata and Regular Language_Part_2
80 pages
Flat Unit-Iii
No ratings yet
Flat Unit-Iii
11 pages
WINSEM2024-25_BCSE304L_TH_VL2024250501632_2025-01-25_Reference-Material-I (1)
No ratings yet
WINSEM2024-25_BCSE304L_TH_VL2024250501632_2025-01-25_Reference-Material-I (1)
15 pages
Regular Expressiontzzz
No ratings yet
Regular Expressiontzzz
46 pages
Unit Iii Flat LM Cse
No ratings yet
Unit Iii Flat LM Cse
15 pages
re-nfa-220110090941
No ratings yet
re-nfa-220110090941
15 pages
unit2
No ratings yet
unit2
135 pages
Toc Unit 2 Regular Expression
No ratings yet
Toc Unit 2 Regular Expression
109 pages
Unit Ii Regular Expressions and Languages: 2.1.1. Definition
No ratings yet
Unit Ii Regular Expressions and Languages: 2.1.1. Definition
31 pages
Unit2 Part2 - Applications of Regular Expressions & Arden's Theorem
No ratings yet
Unit2 Part2 - Applications of Regular Expressions & Arden's Theorem
20 pages
Unit 3 - Regular Expression
No ratings yet
Unit 3 - Regular Expression
45 pages
Unit 2
No ratings yet
Unit 2
53 pages
2.0+Regular Expression Part 1 MKN
No ratings yet
2.0+Regular Expression Part 1 MKN
33 pages
Practice Questions Modules 2 Answer
No ratings yet
Practice Questions Modules 2 Answer
33 pages
CS372 Formal Languages & The Theory of Computation
No ratings yet
CS372 Formal Languages & The Theory of Computation
29 pages
Notation To Specify A Language: Theory of Computation - Regular Expressions
No ratings yet
Notation To Specify A Language: Theory of Computation - Regular Expressions
23 pages
FLAT Lec - 3
No ratings yet
FLAT Lec - 3
34 pages
ACD UNIT 2 RegularExpression
No ratings yet
ACD UNIT 2 RegularExpression
10 pages
Chapter 3 Regular Expression
No ratings yet
Chapter 3 Regular Expression
25 pages
re-nfa-220110090941
No ratings yet
re-nfa-220110090941
20 pages
Theory of Computation - Regular Expressions
No ratings yet
Theory of Computation - Regular Expressions
23 pages
Theory of Approximation
From Everand
Theory of Approximation
N. I. Achieser
No ratings yet
A Short Course in Automorphic Functions
From Everand
A Short Course in Automorphic Functions
Joseph Lehner
No ratings yet
A Short Course in Discrete Mathematics
From Everand
A Short Course in Discrete Mathematics
Edward A. Bender
3/5 (1)
Dark Green & Orange Travel Bag Store Presentation Template
No ratings yet
Dark Green & Orange Travel Bag Store Presentation Template
14 pages
Two-Pointers and Sliding Window Recap Session Div-2
No ratings yet
Two-Pointers and Sliding Window Recap Session Div-2
39 pages
(SRE) (Software Requirement Engineering) Final Exam
No ratings yet
(SRE) (Software Requirement Engineering) Final Exam
8 pages
Coffee Machine Model
No ratings yet
Coffee Machine Model
7 pages
Advanced Algorithms and Complexity: The Complexity Class N P
No ratings yet
Advanced Algorithms and Complexity: The Complexity Class N P
4 pages
10.4 The Classes P and NP: Next: Up: Previous
No ratings yet
10.4 The Classes P and NP: Next: Up: Previous
3 pages
Unit 4
No ratings yet
Unit 4
100 pages
NP Hard and NP Complete Problems Basic Concepts
No ratings yet
NP Hard and NP Complete Problems Basic Concepts
67 pages
Unit-3 TOC
No ratings yet
Unit-3 TOC
42 pages
Complexity Theory Chapter 1
100% (1)
Complexity Theory Chapter 1
53 pages
Definitions of The Turing Machine
No ratings yet
Definitions of The Turing Machine
10 pages
Week8-Turing Machine
No ratings yet
Week8-Turing Machine
63 pages
CIT237 PROGRAMMING AND ALGORITHMS SUMMARY
No ratings yet
CIT237 PROGRAMMING AND ALGORITHMS SUMMARY
38 pages
Turing's Thesis: Fall 2005 Costas Busch - RPI 1
100% (1)
Turing's Thesis: Fall 2005 Costas Busch - RPI 1
69 pages
Turing Machine Variations
No ratings yet
Turing Machine Variations
60 pages
Toc 5 TH Unit Notes
No ratings yet
Toc 5 TH Unit Notes
43 pages
Full Download Automata Computability and Complexity Theory and Applications Elaine A. Rich PDF DOCX
100% (2)
Full Download Automata Computability and Complexity Theory and Applications Elaine A. Rich PDF DOCX
81 pages
Parallel and Distributed Computing
No ratings yet
Parallel and Distributed Computing
253 pages
What Is Complexity Theory?
No ratings yet
What Is Complexity Theory?
2 pages
Lecture 3: Algorithms On Turing Machine: CSE 431: Theory of Computation Instructor: Thomas Rothvoß
No ratings yet
Lecture 3: Algorithms On Turing Machine: CSE 431: Theory of Computation Instructor: Thomas Rothvoß
4 pages
ATLC CIE 2
No ratings yet
ATLC CIE 2
8 pages
Unit 5 - Theory of Computation - WWW - Rgpvnotes.in
No ratings yet
Unit 5 - Theory of Computation - WWW - Rgpvnotes.in
17 pages
Computer Science notes ⇒ Theory of Computation
No ratings yet
Computer Science notes ⇒ Theory of Computation
21 pages
Lecture Notes Cryptographic Protocols: Berry Schoenmakers
No ratings yet
Lecture Notes Cryptographic Protocols: Berry Schoenmakers
138 pages
TOC Unit4
No ratings yet
TOC Unit4
38 pages
FLAT Unit 5 Question Bank Solutions
No ratings yet
FLAT Unit 5 Question Bank Solutions
24 pages
Automata Theory Answers
No ratings yet
Automata Theory Answers
33 pages
Lecture11 Comparison of State Machine-1
No ratings yet
Lecture11 Comparison of State Machine-1
33 pages
Fla Unit 4 and 5
No ratings yet
Fla Unit 4 and 5
198 pages
Theoretical Computer Science
No ratings yet
Theoretical Computer Science
24 pages
Chapter 1 Turing Machines
No ratings yet
Chapter 1 Turing Machines
92 pages
Lecture 03
No ratings yet
Lecture 03
6 pages
Types of TM
No ratings yet
Types of TM
3 pages
P vs NP
No ratings yet
P vs NP
39 pages

3-5 Simplified Note

Uploaded by

3-5 Simplified Note

Uploaded by

Regular Expressions

A Regular Expression can be recursively defined as follows −

So, L1 = {a, aaa, aaaaa,.....} (Strings of odd length excluding Null)

and L2 ={ ε, aa, aaaa, aaaaaa,.......} (Strings of even length including Null)

L1 ∪ L2 = { ε, a, aa, aaa, aaaa, aaaaa, aaaaaa,.......}

(Strings of all possible lengths including Null)

L2 = { ε, aa, aaaa, aaaaaa,.......} (Strings of even length including Null)

L1 ∩ L2 = { aa, aaaa, aaaaaa,.......} (Strings of even length excluding Null)

RE (L1 ∩ L2) = aa(aa)* which is a regular expression itself.

L2 = { ε, aa, aaaa, aaaaaa,.......} (Strings of even length including Null)

L1 – L2 = {a, aaa, aaaaa, aaaaaaa, ....}

(Strings of all odd lengths excluding Null)

We have to prove LR is also regular if L is a regular set.

LR = {10, 01, 11, 01}

RE (LR) = 01 + 10 + 11 + 10 which is regular

and L2 = {01, 010,011,.....} (Set of strings beginning with 01)

Identities Related to Regular Expressions

q1 = q1R11 + q2R21 + … + qnRn1 + ε

q2 = q1R12 + q2R22 + … + qnRn2

q2 = q1b + q2b + q3b

= q1b + q2b + (q2a)b (Substituting value of q3)

= q1b + q2(b + ab)

= q1b (b + ab)* (Applying Arden’s Theorem)

= q1a + q2aa + ε (Substituting value of q3)

= q1a + q1b(b + ab*)aa + ε (Substituting value of q2)

= q1(a + b(b + ab)*aa) + ε

= ε (a+ b(b + ab)*aa)*

Now we write down the equations −

Now, we will solve these three equations −

So, q2 = 0*1(0)* [By Arden’s theorem]

Hence, the regular expression is 0*10*.

Case 2 − For a regular expression ‘ab’, we can construct the following FA −

Case 3 − For a regular expression (a+b), we can construct the following FA −

Finite Automata with Null Moves (NFA-ε)

• Q − a finite set of states

• δ − a transition function δ : Q × (∑ ∪ {ε}) → 2Q

• F − a set of final state/states of Q (F⊆Q).

The above (FA-ε) accepts a string set − {0, 1, 01}

Removal of Null Moves from Finite Automata

Here the outgoing edges from qf is to qf for inputs 0 and 1.

Derivation 2 − X → X*X → X+X*X → a+ X*X → a+a*X → a+a*a

Step 2 − Include all symbols, Wi+1, that derive Wi.

Step 3 − Increment i and repeat Step 2, until Wi+1 = Wi.

Step 4 − Include all production rules that have Wi in it.

Since W2 = W3, we can derive G’ as −

Y3 = { S, A, C, a, c } from rules A → a and C → c

Since Y3 = Y4, we can derive G” as −

Removal of Unit Productions

Removal of Null Productions

Chomsky Normal Form

Algorithm to Convert into Chomsky Normal Form −

(2) Now we will remove the null productions −

After removing A → ∈, the production set becomes −

(3) Now we will remove the unit productions.

After removing S0→ S, the production set becomes −

S0→ ASA | aB | a | AS | SA, S→ ASA | aB | a | AS | SA

And the final production set becomes −

Greibach Normal Form

Algorithm to Convert a CFG into Greibach Normal Form

with the right side of

CFL Closure Property

Let L2 = { cmdm , m ≥ 0}. Corresponding grammar G2 will have P: S2 → cBb| ε

Union of L1 and L2, L = L1 ∪ L2 = { anbn } ∪ { cmdm }

The corresponding grammar G will have the additional production S → S1 | S2

The corresponding grammar G will have the additional production S → S1 S2

Kleene Star L1 = { anbn }*

The corresponding grammar G1 will have additional productions S1 → SS1 | ε

Context-free languages are not closed under −

A PDA can be formally described as a 7-tuple (Q, ∑, S, δ, q0, I, F) −

• Q is the finite number of states

• I is the initial stack top symbol (I ∈ S)

Terminologies Related to PDA

There are two different ways to define PDA acceptability.

L(PDA) = {w | (q0, w, I) ⊢* (q, ε, x), q ∈ F}

for any input stack string x.

Empty Stack Acceptability

= ε (a+ b(b + ab)aa)

So, q2 = 01(0) [By Arden’s theorem]

Hence, the regular expression is 010.

Derivation 2 − X → XX → X+XX → a+ XX → a+aX → a+a*a

(x+yz, I) ⊢(x +yz, SI) ⊢ (x+yz, S+XI) ⊢(x+yz, X+XI)