Normal Forms For CFG'S: Eliminating Useless Variables Removing Epsilon Removing Unit Productions Chomsky Normal Form

The document discusses several algorithms for transforming context-free grammars (CFGs) into different normal forms. It describes discovering and eliminating useless symbols like variables that derive nothing and unreachable symbols. It also explains eliminating epsilon productions by finding nullable symbols and removing unit productions by collapsing derivations using unit productions. Finally, it defines Chomsky normal form as having productions that are either a single terminal or two variables, and provides a proof that any CFG can be transformed into this normal form.

Uploaded by

Lalalalalala

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

51 views

Normal Forms For CFG'S: Eliminating Useless Variables Removing Epsilon Removing Unit Productions Chomsky Normal Form

Uploaded by

Lalalalalala

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 35

1

Normal Forms for CFGs

Eliminating Useless Variables
Removing Epsilon
Removing Unit Productions
Chomsky Normal Form
2
Variables That Derive Nothing
Consider: S -> AB, A -> aA | a, B -> AB
Although A derives all strings of as, B
derives no terminal strings.
Why? The only production for B leaves a B
in the sentential form.
Thus, S derives nothing, and the
language is empty.
3
Discovery Algorithms
There is a family of algorithms that work
inductively.
They start discovering some facts that
are obvious (the basis).
They discover more facts from what they
already have discovered (induction).
Eventually, nothing more can be
discovered, and we are done.
4
Picture of Discovery
Start with
the basis
facts
Round 1:
Add facts
that follow
from the
basis
Round 2:
Add facts
that follow
from round 1
and the
basis
And so on
5
Testing Whether a Variable
Derives Some Terminal String
Basis: If there is a production A -> w,
where w has no variables, then A
derives a terminal string.
Induction: If there is a production
A -> , where consists only of
terminals and variables known to derive
a terminal string, then A derives a
terminal string.
6
Testing (2)
Eventually, we can find no more
variables.
An easy induction on the order in which
variables are discovered shows that
each one truly derives a terminal string.
Conversely, any variable that derives a
terminal string will be discovered by this
algorithm.
7
Proof of Converse
The proof is an induction on the height
of the least-height parse tree by which
a variable A derives a terminal string.
Basis: Height = 1. Tree looks like:
Then the basis of the algorithm
tells us that A will be discovered.
A
a
1
a
n
. . .
8
Induction for Converse
Assume IH for parse trees of height <
h, and suppose A derives a terminal
string via a parse tree of height h:
By IH, those X
i
s that are
variables are discovered.
Thus, A will also be discovered,
because it has a right side of terminals
and/or discovered variables.
A
X
1
X
n
. . .
w
1
w
n
9
Algorithm to Eliminate
Variables That Derive Nothing
1. Discover all variables that derive
terminal strings.
2. For all other variables, remove all
productions in which they appear in
either the head or body.
10
Example: Eliminate Variables
S -> AB | C, A -> aA | a, B -> bB, C -> c
Basis: A and C are discovered because
of A -> a and C -> c.
Induction: S is discovered because of
S -> C.
Nothing else can be discovered.
Result: S -> C, A ->aA | a, C -> c
11
Unreachable Symbols
Another way a terminal or variable
deserves to be eliminated is if it cannot
appear in any derivation from the start
symbol.
Basis: We can reach S (the start symbol).
Induction: if we can reach A, and there is
a production A -> , then we can reach all
symbols of .
12
Unreachable Symbols (2)
Easy inductions in both directions show
that when we can discover no more
symbols, then we have all and only the
symbols that appear in derivations from S.
Algorithm: Remove from the grammar all
symbols not discovered reachable from S
and all productions that involve these
symbols.
13
Eliminating Useless Symbols
A symbol is useful if it appears in
some derivation of some terminal
string from the start symbol.
Otherwise, it is useless.
Eliminate all useless symbols by:
1. Eliminate symbols that derive no terminal
string.
2. Eliminate unreachable symbols.
14
Example: Useless Symbols (2)
S -> AB, A -> C, C -> c, B -> bB
If we eliminated unreachable symbols
first, we would find everything is
reachable.
A, C, and c would never get eliminated.
15
Why It Works
After step (1), every symbol remaining
derives some terminal string.
After step (2) the only symbols
remaining are all derivable from S.
In addition, they still derive a terminal
string, because such a derivation can
only involve symbols reachable from S.
16
Epsilon Productions
We can almost avoid using productions of
the form A -> (called -productions ).
The problem is that cannot be in the
language of any grammar that has no
productions.
Theorem: If L is a CFL, then L-{} has a
CFG with no -productions.
17
Nullable Symbols
To eliminate -productions, we first
need to discover the nullable symbols
= variables A such that A =>* .
Basis: If there is a production A -> ,
then A is nullable.
Induction: If there is a production
A -> , and all symbols of are
nullable, then A is nullable.
18
Example: Nullable Symbols
S -> AB, A -> aA | , B -> bB | A
Basis: A is nullable because of A -> .
Induction: B is nullable because of
B -> A.
Then, S is nullable because of S -> AB.
19
Eliminating -Productions
Key idea: turn each production
A -> X
1
X
n
into a family of productions.
For each subset of nullable Xs, there is
one production with those eliminated
from the right side in advance.
Except, if all Xs are nullable (or the body
was empty to begin with), do not make a
production with as the right side.
20
Example: Eliminating -
Productions
S -> ABC, A -> aA | , B -> bB | , C ->
A, B, C, and S are all nullable.
New grammar:
S -> ABC | AB | AC | BC | A | B | C
A ->aA | a
B ->bB | b
Note: C is now useless.
Eliminate its productions.
21
Why it Works
Prove that for all variables A:
1. If w and A =>*
old
w, then A =>*
new
w.
2. If A =>*
new
w then w and A =>*
old
w.
Then, letting A be the start symbol
proves that L(new) = L(old) {}.
(1) is an induction on the number of
steps by which A derives w in the old
grammar.
22
Proof of 1 Basis
If the old derivation is one step, then
A -> w must be a production.
Since w , this production also
appears in the new grammar.
Thus, A =>
new
w.
23
Proof of 1 Induction
Let A =>*
old
w be a k-step derivation,
and assume the IH for derivations of
fewer than k steps.
Let the first step be A =>
old
X
1
X
n
.
Then w can be broken into w = w
1
w
n
,
where X
i
=>*
old
w
i
, for all i, in fewer
than k steps.
24
Induction Continued
By the IH, if w
i
, then X
i
=>*
new
w
i
.
Also, the new grammar has a
production with A on the left, and just
those X
i
s on the right such that w
i
.
Note: they all cant be , because w .
Follow a use of this production by the
derivations X
i
=>*
new
w
i
to show that A
derives w in the new grammar.
25
Unit Productions
A unit production is one whose body
consists of exactly one variable.
These productions can be eliminated.
Key idea: If A =>* B by a series of unit
productions, and B -> is a non-unit-
production, then add production A -> .
Then, drop all unit productions.
26
Unit Productions (2)
Find all pairs (A, B) such that A =>* B
by a sequence of unit productions only.
Basis: Surely (A, A).
Induction: If we have found (A, B), and
B -> C is a unit production, then add
(A, C).
27
Proof That We Find Exactly
the Right Pairs
By induction on the order in which pairs
(A, B) are found, we can show A =>* B
by unit productions.
Conversely, by induction on the number
of steps in the derivation by unit
productions of A =>* B, we can show
that the pair (A, B) is discovered.
28
Proof The the Unit-Production-
Elimination Algorithm Works
Basic idea: there is a leftmost
derivation A =>*
lm
w in the new
grammar if and only if there is such a
derivation in the old.
A sequence of unit productions and a
non-unit production is collapsed into a
single production of the new grammar.
29
Cleaning Up a Grammar
Theorem: if L is a CFL, then there is a
CFG for L {} that has:
1. No useless symbols.
2. No -productions.
3. No unit productions.
I.e., every body is either a single
terminal or has length >2.
30
Cleaning Up (2)
Proof: Start with a CFG for L.
Perform the following steps in order:
1. Eliminate -productions.
2. Eliminate unit productions.
3. Eliminate variables that derive no
terminal string.
4. Eliminate variables not reached from the
start symbol.
Must be first. Can create
unit productions or useless
variables.
31
Chomsky Normal Form
A CFG is said to be in Chomsky
Normal Form if every production is of
one of these two forms:
1. A -> BC (body is two variables).
2. A -> a (body is a single terminal).
Theorem: If L is a CFL, then L {}
has a CFG in CNF.
32
Proof of CNF Theorem
Step 1: Clean the grammar, so every
body is either a single terminal or of
length at least 2.
Step 2: For each body a single terminal,
make the right side all variables.
For each terminal a create new variable A
a
and production A
a
-> a.
Replace a by A
a
in bodies of length >2.
33
Example: Step 2
Consider production A -> BcDe.
We need variables A
c
and A
e
. with
productions A
c
-> c and A
e
-> e.
Note: you create at most one variable for
each terminal, and use it everywhere it is
needed.
Replace A ->BcDe by A -> BA
c
DA
e
.
34
CNF Proof Continued
Step 3: Break right sides longer than 2
into a chain of productions with right
sides of two variables.
Example: A -> BCDE is replaced by
A -> BF, F -> CG, and G -> DE.
F and G must be used nowhere else.
35
Example of Step 3 Continued
Recall A -> BCDE is replaced by
A -> BF, F -> CG, and G -> DE.
In the new grammar, A => BF => BCG
=> BCDE.
More importantly: Once we choose to
replace A by BF, we must continue to
BCG and BCDE.
Because F and G have only one production.

Past Life Oracle Cards Guidebook
68% (76)
Past Life Oracle Cards Guidebook
56 pages
Mystical Shaman Oracle Cards
100% (11)
Mystical Shaman Oracle Cards
161 pages
CHAKRA Wisdom Oracle Cards
55% (11)
CHAKRA Wisdom Oracle Cards
98 pages
Keepers of The Light Oracle Cards
89% (27)
Keepers of The Light Oracle Cards
131 pages
Earth Warriors Oracle
100% (3)
Earth Warriors Oracle
217 pages
Apartment Rental Management System Class Diagram: Reports
100% (1)
Apartment Rental Management System Class Diagram: Reports
1 page
Network Topologies
No ratings yet
Network Topologies
22 pages
UiPath Exam Questions
0% (1)
UiPath Exam Questions
15 pages
Clean Desk Policy
100% (3)
Clean Desk Policy
3 pages
Normal Forms For CFG'S: Eliminating Useless Variables Removing Epsilon Removing Unit Productions Chomsky Normal Form
No ratings yet
Normal Forms For CFG'S: Eliminating Useless Variables Removing Epsilon Removing Unit Productions Chomsky Normal Form
36 pages
6_Simplification of CFG
No ratings yet
6_Simplification of CFG
68 pages
CFL Properties
No ratings yet
CFL Properties
62 pages
09 CFLProperties
No ratings yet
09 CFLProperties
62 pages
CS242 Module 7
No ratings yet
CS242 Module 7
71 pages
P Ti FCTTF Properties of Context-Free Languages GG: Reading: Chapter 7
No ratings yet
P Ti FCTTF Properties of Context-Free Languages GG: Reading: Chapter 7
62 pages
Module-4 Normal Forms
No ratings yet
Module-4 Normal Forms
63 pages
Properties of Context-Free Languages: Reading: Chapter 7
No ratings yet
Properties of Context-Free Languages: Reading: Chapter 7
61 pages
Properties of Context-Free Languages: Reading: Chapter 7
No ratings yet
Properties of Context-Free Languages: Reading: Chapter 7
61 pages
CFLProperties
No ratings yet
CFLProperties
43 pages
Properties of Context-Free Languages
No ratings yet
Properties of Context-Free Languages
77 pages
9 - CFG Simplification
100% (1)
9 - CFG Simplification
7 pages
3 CFLProperties
No ratings yet
3 CFLProperties
49 pages
Chapter 3
No ratings yet
Chapter 3
32 pages
Theory of Automata
No ratings yet
Theory of Automata
202 pages
Unit 3 CFG
No ratings yet
Unit 3 CFG
65 pages
PPT3_Simpliying_CFG_PS
No ratings yet
PPT3_Simpliying_CFG_PS
26 pages
Grammar
No ratings yet
Grammar
44 pages
note
No ratings yet
note
3 pages
BCS503 TOC Second IA Test Question Bank
No ratings yet
BCS503 TOC Second IA Test Question Bank
8 pages
WINSEM2024-25_BCSE304L_TH_VL2024250501632_2025-02-17_Reference-Material-I
No ratings yet
WINSEM2024-25_BCSE304L_TH_VL2024250501632_2025-02-17_Reference-Material-I
17 pages
4.2 Context - Free - Grammars - Normal-MKN
No ratings yet
4.2 Context - Free - Grammars - Normal-MKN
65 pages
Parsing ME Modified
No ratings yet
Parsing ME Modified
168 pages
NORMAL FORM
No ratings yet
NORMAL FORM
18 pages
Session 07 - Context Free Grammar
No ratings yet
Session 07 - Context Free Grammar
34 pages
Normal Forms and Parsing: CSC 3130: Automata Theory and Formal Languages
No ratings yet
Normal Forms and Parsing: CSC 3130: Automata Theory and Formal Languages
22 pages
TOC Lecture 11
No ratings yet
TOC Lecture 11
22 pages
TOC 3IS(cs)
No ratings yet
TOC 3IS(cs)
24 pages
Unit-2 Context Free Grammer (TOC)
No ratings yet
Unit-2 Context Free Grammer (TOC)
100 pages
Chomsky - Greibach Hector Chavez
No ratings yet
Chomsky - Greibach Hector Chavez
44 pages
CS351 Context Free Grammars
No ratings yet
CS351 Context Free Grammars
9 pages
Unit 3 (Part II)
No ratings yet
Unit 3 (Part II)
18 pages
04 Parsing
No ratings yet
04 Parsing
330 pages
Unit Iv Properties of Context-Free Languages
No ratings yet
Unit Iv Properties of Context-Free Languages
37 pages
ATC Module 3
No ratings yet
ATC Module 3
43 pages
Lecture 7 - 8 & 9 - Chapter 4
No ratings yet
Lecture 7 - 8 & 9 - Chapter 4
50 pages
Unit 3 - Theory of Computation - WWW - Rgpvnotes.in
No ratings yet
Unit 3 - Theory of Computation - WWW - Rgpvnotes.in
14 pages
Flat Unit-5 LM
No ratings yet
Flat Unit-5 LM
21 pages
Unit 3 TOC
No ratings yet
Unit 3 TOC
80 pages
CFG Normal Forms [Autosaved]
No ratings yet
CFG Normal Forms [Autosaved]
35 pages
chapter 3
No ratings yet
chapter 3
57 pages
Flat Module 3
No ratings yet
Flat Module 3
18 pages
Chomsky_Greibach
No ratings yet
Chomsky_Greibach
43 pages
Grammar
No ratings yet
Grammar
31 pages
Chapter 4 and 5
No ratings yet
Chapter 4 and 5
71 pages
Chapter 4 and 5
100% (1)
Chapter 4 and 5
71 pages
Notes CFG
No ratings yet
Notes CFG
25 pages
10 Grammar Simplification
No ratings yet
10 Grammar Simplification
33 pages
Chapter 4 - Context-Free Grammars and Languages
No ratings yet
Chapter 4 - Context-Free Grammars and Languages
60 pages
FLAT 2
No ratings yet
FLAT 2
15 pages
CD Unit-3
No ratings yet
CD Unit-3
146 pages
Normal Forms: CS154 Chris Pollett Mar 12, 2007
No ratings yet
Normal Forms: CS154 Chris Pollett Mar 12, 2007
8 pages
Chapter 3 - CFG
No ratings yet
Chapter 3 - CFG
26 pages
CH 4 - Context Free Languages Amd Grammars
No ratings yet
CH 4 - Context Free Languages Amd Grammars
86 pages
CFG Removal of Null and Unit Production
No ratings yet
CFG Removal of Null and Unit Production
31 pages
Theory of Computation: Automata Theory (CFG, CFL, CNF)
No ratings yet
Theory of Computation: Automata Theory (CFG, CFL, CNF)
39 pages
FAFL Final Lecture 26.2 CMH
No ratings yet
FAFL Final Lecture 26.2 CMH
24 pages
CS6503 Theory of Computations Unit 2
67% (3)
CS6503 Theory of Computations Unit 2
47 pages
Square Summable Power Series
From Everand
Square Summable Power Series
Louis de Branges
5/5 (1)
Exercises of Line, Surface and Volume Integrals
From Everand
Exercises of Line, Surface and Volume Integrals
Simone Malacrida
No ratings yet
Healing Many Modalities PDF
100% (2)
Healing Many Modalities PDF
50 pages
Excerpt-from-UVS-Part-2-Powers-of-the-Vibrational-Bands Vesica Piscis
No ratings yet
Excerpt-from-UVS-Part-2-Powers-of-the-Vibrational-Bands Vesica Piscis
10 pages
More About Turing Machines: "Programming Tricks" Restrictions Extensions Closure Properties
No ratings yet
More About Turing Machines: "Programming Tricks" Restrictions Extensions Closure Properties
54 pages
The Satisfiability Problem: Cook's Theorem: An NP-Complete Problem Restricted SAT: CSAT, 3SAT
No ratings yet
The Satisfiability Problem: Cook's Theorem: An NP-Complete Problem Restricted SAT: CSAT, 3SAT
44 pages
Properties of Context-Free Languages: Decision Properties Closure Properties
No ratings yet
Properties of Context-Free Languages: Decision Properties Closure Properties
35 pages
More Undecidable Problems: Rice's Theorem Post's Correspondence Problem Some Real Problems
No ratings yet
More Undecidable Problems: Rice's Theorem Post's Correspondence Problem Some Real Problems
60 pages
The Pumping Lemma For CFL'S: Statement Applications
No ratings yet
The Pumping Lemma For CFL'S: Statement Applications
14 pages
Conversion of CFG To PDA Conversion of PDA To CFG
No ratings yet
Conversion of CFG To PDA Conversion of PDA To CFG
23 pages
4 - 3 - 15. Decision and Closure Properties For CFL's (35 Min.)
No ratings yet
4 - 3 - 15. Decision and Closure Properties For CFL's (35 Min.)
12 pages
Pushdown Automata: Moves of The PDA Languages of The PDA Deterministic PDA's
No ratings yet
Pushdown Automata: Moves of The PDA Languages of The PDA Deterministic PDA's
32 pages
Mblock Extension Guide
No ratings yet
Mblock Extension Guide
10 pages
Question Bank: Java Programming (9113) Class: Tyif (IF/V/C) Chapter No. 04 Multithreaded Programming and Exception Handling
No ratings yet
Question Bank: Java Programming (9113) Class: Tyif (IF/V/C) Chapter No. 04 Multithreaded Programming and Exception Handling
5 pages
WinCC Web Navigator V1.1
No ratings yet
WinCC Web Navigator V1.1
178 pages
IUB Congestion
No ratings yet
IUB Congestion
4 pages
Vue Essentials Cheat Sheet
No ratings yet
Vue Essentials Cheat Sheet
2 pages
Filter Design Methods For Fpgas: Accelchip, Inc. 1900 Mccarthy Blvd. Suite 204 Milpitas, Ca 95035 (408) 943-0700
No ratings yet
Filter Design Methods For Fpgas: Accelchip, Inc. 1900 Mccarthy Blvd. Suite 204 Milpitas, Ca 95035 (408) 943-0700
10 pages
Slide MySql
No ratings yet
Slide MySql
22 pages
STM32F429 Seminar
No ratings yet
STM32F429 Seminar
259 pages
Openldap Replication Strategies: Gavin Henry
No ratings yet
Openldap Replication Strategies: Gavin Henry
33 pages
Unit 7 Notes
No ratings yet
Unit 7 Notes
15 pages
IJTAG B Ravi Kishore 2017HT80522
No ratings yet
IJTAG B Ravi Kishore 2017HT80522
10 pages
Delta V
No ratings yet
Delta V
22 pages
3 Networking in AWS
100% (1)
3 Networking in AWS
34 pages
Data Mitigation - Literature Review & Proposal
No ratings yet
Data Mitigation - Literature Review & Proposal
11 pages
Big Data Processing With Apache Spark - Infoqdotcom
No ratings yet
Big Data Processing With Apache Spark - Infoqdotcom
16 pages
Linux Interview Questions & Answers
No ratings yet
Linux Interview Questions & Answers
87 pages
Faculty of Engineering and Technology Ramaiah University of Applied Sciences
No ratings yet
Faculty of Engineering and Technology Ramaiah University of Applied Sciences
4 pages
Introduction To Microcontrollers: Dr. Konstantinos Tatas
No ratings yet
Introduction To Microcontrollers: Dr. Konstantinos Tatas
9 pages
Prototyping Tools and Techniques
No ratings yet
Prototyping Tools and Techniques
26 pages
BEx Analyzer Elemental Training
No ratings yet
BEx Analyzer Elemental Training
24 pages
Step-By-step Guide To Implement Modeling Scenarios in SAP BW 7.4 On HANA
100% (1)
Step-By-step Guide To Implement Modeling Scenarios in SAP BW 7.4 On HANA
25 pages
Admin Kayako Staff Control Panel
No ratings yet
Admin Kayako Staff Control Panel
106 pages
5 NPM Modules Every Node Developer Needs PDF
No ratings yet
5 NPM Modules Every Node Developer Needs PDF
5 pages
CPP
No ratings yet
CPP
81 pages
Bill Book Systerm
No ratings yet
Bill Book Systerm
10 pages
1532748281262
No ratings yet
1532748281262
31 pages

Normal Forms For CFG'S: Eliminating Useless Variables Removing Epsilon Removing Unit Productions Chomsky Normal Form

Uploaded by

Normal Forms For CFG'S: Eliminating Useless Variables Removing Epsilon Removing Unit Productions Chomsky Normal Form

Uploaded by

1

Normal Forms for CFGs

You might also like