0% found this document useful (0 votes)

52 views

Extended RE's: Character Classes

Regular expressions (REs) define regular languages. Certain operations preserve regularity, such as union, concatenation, and substitution of regular languages. The pumping lemma can be used to prove a language is not regular by finding a string that cannot be pumped. Closure properties ensure operations like reversal, homomorphism, and inverse homomorphism of regular languages produce regular languages.

Uploaded by

atul

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

52 views

Extended RE's: Character Classes

Uploaded by

atul

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Extended RE's

UNIX pioneered the use of additional operators

and notation for RE's:
E? = 0 or 1 occurrences of E = + E.
E + = 1 or more occurrences of E = EE .
Character classes [a , zGX] = the union of
all (ASCII) characters from a to z, plus the
characters G and X, for example.
Algebraic Laws for RE's
If two expressions E and F have no variables, then
E = F means that L(E) = L(F) (not that E and
F are identical expressions).
Example: 1+ = 11 .
If E and F are RE's with variables, then E =
F (E is equivalent to F) means that whatever
languages we substitute for the variables (provided
we substitute the same language everywhere the
same variable appears), the resulting expressions
denote the same language.
Example: R+ = RR.
With two notable exceptions, we can think of
union (+) as if it were addition with ; in place of
the identity 0, and concatenation, with in place
of the identity 1, as multiplication.
+ and concatenation are both associative.
+ is commutative.
Laws of the identities hold for both.
; is the annihilator for concatenation.
The exceptions:
1. Concatenation is not commutative: ab 6=
ba.
2. + is idempotent : E + E = E for any
expression E.
Checking a Law
Suppose we are told that the law (R + S) =
(R S ) holds for RE's. How would we check that
this claim is true?
Think of R and S as if they were single
symbols, rather than placeholders for
languages, i.e., R = f0g and S = f1g.
✦ Then the left side is clearly \any sequence
of 0's and 1's.
1
✦ The right side also denotes any string
of 0's and 1's, since 0 and 1 are each in
L(0 1 ).
That test is necessary (i.e., if the test fails,
then the law does not hold.
✦ We have particular languages that serve
as a counterexample.
But is it sucient (if the test succeeds, the
law holds)?
Proof of Suciency
The book has a fairly simple argument for why,
when the \concretized" expressions denote the
same language, then the languages we get by
substituting any languages for the variables are
also the same.
But if you think that's obvious, the book also
has an example of \RE's with intersection"
where the same statement is false.
Also | is it clear that we can tell whether
two RE's without variables denote the same
language?
✦ Algorithm to do so will be covered.

Closure Properties
Not every language is a regular language.
However, there are some rules that say \if
these languages are regular, so is this one
derived from them.
There is also a powerful technique | the
pumping lemma | that helps us prove a
language not to be regular.
Key tool: Since we know RE's, DFA's,
NFA's, -NFA's all dene exactly the
regular languages, we can use whichever
representation suits us when proving
something about a regular language.
Pumping Lemma
If L is a regular language, then there exists a
constant n such that every string w in L, of length
n or more, can we written as w = xyz, where:
1. 0 < jyj.
2. jxyj n.

2
3. For all i 0, wyi z is also in L.
✦ Note yi = y repeated i times; y0 = .
The alternating quantiers in the logical
statement of the PL makes it very complex:
(8L)(9n)(8w)(9x; y; z)(8i).
Proof of Pumping Lemma
Since we claim L is regular, there must be a
DFA A such that L = L(A).
Let A have n states; choose this n for the
pumping lemma.
Let w be a string of length n in L, say w =
a1 a2 am , where m n.
Let qi be the state A is in after reading the
rst i symbols of w.
✦ q0 = start state, q1 = (q0; a1), q2 =
^(q0; a1a2 ), etc.
Since there are only n dierent states, two of
q0; q1; : : :; qn must be the same; say qi = qj ,
where 0 i < j n.
Let x = a1 ai ; y = ai+1 aj ; z =
aj +1 am .
Then by repeating the loop from qi to qi with
label ai+1 aj zero times once, or more, we
can show that xyi z is accepted by A.
PL Use
We use the PL to show a language L is not
regular.
Start by assuming L is regular.
Then there must be some n that serves as the
PL constant.
✦ We may no know what n is, but we can
work the rest of the \game" with n as a
parameter.
We choose some w that is known to be in L.
✦ Typically, w depends on n.
Applying the PL, we know w can be broken
into xyz, satisfying the PL properties.
✦ Again, we may not know how to break w,
so we use x; y; z as parameters.
We derive a contradiction by picking i (which
might depend on n, x, y, and/or z) such that
xyi z is not in L.
3
Example
Consider the set of strings of 0's whose length is a
perfect square; formally L = f0i j i is a squareg.
We claim L is not regular.
Suppose L is regular. Then there is a constant
n satisfying the PL conditions.
Consider w = 0n2 , which is surely in L.
Then w = xyz, where jxyj n and y 6= .
By PL, xyyz is in L. But the length of xyyz
is greater than n2 and no greater than n2 + n.
However, the next perfect square after n2 is
(n + 1)2 = n2 + 2n + 1.
Thus, xyyz is not of square length and is not
in L.
Since we have derived a contradiction, the
only unproved assumption | that L is
regular | must be at fault, and we have a
\proof by contradiction" that L is not regular.
Closure Properties
Certain operations on regular languages are
guaranteed to produce regular languages.
Example: the union of regular languages is
regular; start with RE's, and apply + to get
an RE for the union.
Substitution
Take a regular language L over some alphabet
.
For each a in , let La be a regular language.
Let s be the substitution dened by s(a) = La
for each a.
✦ Extend s to strings by s(a1 a2 an) =
s(a1 )s(a2 ) s(an ); i.e., concatenate the
languages La1 La2 Lan .
✦ Extend s to languages by s(M) =[w in M
s(w).
Then s(L) is regular.
Proof That Substitution of Regular
Languages Into a Regular Language is
Regular
Let R be a regular expression for language L.

4
Let Ra be a regular expression for language
s(a) = La , for all symbols a in .
Construct a RE E for s(L) by starting with R
and replacing each symbol a by the RE La .
Proof that L(E) = s(L) is an induction on the
height of (the expression tree for) RE R.
Basis : R is a single symbol, a. Then E = Ra,
L = fag, and s(L) = s(fag) = L(Ra ).
Cases where R is or ; easy.
Induction : There are three cases, depending on
whether R = R1 + R2, R = R1R2, or R = R1 .
We'll do only R = R1R2.
L = L1L2 , where L1 = L(R1) and L2 =
L(R2 ).
Let E1 be R1, with each a replaced by Ra,
and E2 similarly.
By the IH, L(E1 ) = s(L1 ) and L(E2 ) = s(L2 ).
Thus, L(E) = s(L1 )s(L2 ) = s(L).
Applications of the Substitution Theorem
If L1 and L2 are regular, so is L1 L2 .
✦ Let s(a) = L1 and s(b) = L2 . Substitute
into the regular language fabg.
So is L1 [ L2 .
✦ Substitute into fa; bg.
Ditto L1 .
✦ Substitute into L(a ).
Closure under homomorphism = substitution
of one string for each symbol.
✦ Special case of a substitution.

Example: Homomorphism
Let L = L(0 1 ), and let h be a homomorphism
dened by h(0) = aa and h(1) = .
,
Then h(L) = L aa) = all strings of an even
number of a's.
Closure Under Inverse Homomorphism
h,1 (L) = fw j h(w) is in Lg.

5
See argument in course reader. Brie y:
✦ Given homomorphism h and regular
language L, start with a DFA A for L.
✦ Construct DFA B for h,1 (L), by having
B, go from state q to state p on input a if
^ q; h(a) = p.
Closure Under Reversal
The reverse of a string w = a1 a2 an is
an a2a1 .
✦ Denoted wR .
✦ Note R = .
The reverse of a language L is the set
containing the reverse of each string in L.
If L is regular, so is LR .
✦ Proof: use RE's, recursive reversal as in
course reader.

Algebra Driven Design Elegant Software From Simple Building Blocks (Sandy Maguire)
No ratings yet
Algebra Driven Design Elegant Software From Simple Building Blocks (Sandy Maguire)
337 pages
Algebra Driven Design Sample
No ratings yet
Algebra Driven Design Sample
85 pages
Algebraic Laws For Regular Epxressions
0% (3)
Algebraic Laws For Regular Epxressions
14 pages
Specification of Tokens
No ratings yet
Specification of Tokens
21 pages
re-nfa-220110090941
No ratings yet
re-nfa-220110090941
15 pages
Computer Programming C: Operator Basics
No ratings yet
Computer Programming C: Operator Basics
5 pages
Wa0000
No ratings yet
Wa0000
26 pages
Lec 07
No ratings yet
Lec 07
22 pages
A Short Course in Automorphic Functions
From Everand
A Short Course in Automorphic Functions
Joseph Lehner
No ratings yet
chapter3 from UNIT1
No ratings yet
chapter3 from UNIT1
32 pages
Toc Unit-2
No ratings yet
Toc Unit-2
109 pages
4-Reg Exp
No ratings yet
4-Reg Exp
33 pages
EECS150 - Digital Design: Lecture 4 - Boolean Algebra I
No ratings yet
EECS150 - Digital Design: Lecture 4 - Boolean Algebra I
20 pages
CS351 Regular Expressions
No ratings yet
CS351 Regular Expressions
14 pages
Regular Expressions
No ratings yet
Regular Expressions
169 pages
CMP3008 LN4 RegularExpressions
No ratings yet
CMP3008 LN4 RegularExpressions
45 pages
101 Regular Expressions
No ratings yet
101 Regular Expressions
50 pages
Discrete Mathematical Structures Lec - 35
No ratings yet
Discrete Mathematical Structures Lec - 35
28 pages
Lexical Analyzer 2023
No ratings yet
Lexical Analyzer 2023
38 pages
chap2-160308123432
No ratings yet
chap2-160308123432
48 pages
Unit22pdf 2021 03 13 13 38 11
No ratings yet
Unit22pdf 2021 03 13 13 38 11
114 pages
Lecture 03
No ratings yet
Lecture 03
37 pages
6 Lecture 4
No ratings yet
6 Lecture 4
25 pages
Dfa
No ratings yet
Dfa
43 pages
re-nfa-220110090941
No ratings yet
re-nfa-220110090941
20 pages
Lecture 1 String and Language
No ratings yet
Lecture 1 String and Language
36 pages
Lexical Analyzer 1
No ratings yet
Lexical Analyzer 1
37 pages
Digital Fundamentals: Floyd
No ratings yet
Digital Fundamentals: Floyd
21 pages
Unit Ii Regular Expressions and Languages: 2.1.1. Definition
No ratings yet
Unit Ii Regular Expressions and Languages: 2.1.1. Definition
31 pages
Unit - 1
No ratings yet
Unit - 1
139 pages
5 Algebraic Structure More
No ratings yet
5 Algebraic Structure More
18 pages
03 Recursive Definitions
No ratings yet
03 Recursive Definitions
18 pages
DML (UE18CS205) - Unit 5 (Algebraic Structures)
No ratings yet
DML (UE18CS205) - Unit 5 (Algebraic Structures)
61 pages
Mathematical Foundations of Computer Science: Unit-I
No ratings yet
Mathematical Foundations of Computer Science: Unit-I
7 pages
pp04
No ratings yet
pp04
47 pages
Digital Fundamentals: Floyd
No ratings yet
Digital Fundamentals: Floyd
32 pages
Mathematical Preliminaries: Sipser Pages 1-28
No ratings yet
Mathematical Preliminaries: Sipser Pages 1-28
37 pages
ch03 Expressions
No ratings yet
ch03 Expressions
36 pages
Regular Expressions and Languages
No ratings yet
Regular Expressions and Languages
20 pages
Regular Expression
No ratings yet
Regular Expression
23 pages
DLD Week5 Chap4 2D
No ratings yet
DLD Week5 Chap4 2D
29 pages
DM Unit 3
No ratings yet
DM Unit 3
30 pages
Wipro Class 2
No ratings yet
Wipro Class 2
18 pages
Operators and Expressions, Control Structures in C
No ratings yet
Operators and Expressions, Control Structures in C
134 pages
Unit1 C Programming 6 10 2022
No ratings yet
Unit1 C Programming 6 10 2022
38 pages
ln5
No ratings yet
ln5
22 pages
Grfnotes 1011
No ratings yet
Grfnotes 1011
47 pages
Regular Expression: Anab Batool Kazmi
No ratings yet
Regular Expression: Anab Batool Kazmi
32 pages
Algebraic Structures Ass Dbatu
No ratings yet
Algebraic Structures Ass Dbatu
35 pages
12500123196
No ratings yet
12500123196
12 pages
Regular Expression Question Solution
100% (2)
Regular Expression Question Solution
68 pages
DLD Chapter 4
No ratings yet
DLD Chapter 4
28 pages
Unit 2 - Discrete Structures - WWW - Rgpvnotes.in
No ratings yet
Unit 2 - Discrete Structures - WWW - Rgpvnotes.in
10 pages
DM UNIT - 3
No ratings yet
DM UNIT - 3
6 pages
Regular Expressions and Languages
No ratings yet
Regular Expressions and Languages
16 pages
CH 7&8 DataHandling & Module
No ratings yet
CH 7&8 DataHandling & Module
117 pages
C Expression
No ratings yet
C Expression
22 pages
BIL 104E Introduction To Scientific and Engineering Computing
No ratings yet
BIL 104E Introduction To Scientific and Engineering Computing
20 pages
The General Theory of Dirichlet's Series
From Everand
The General Theory of Dirichlet's Series
G. H. Hardy
No ratings yet
Harmonic Analysis and the Theory of Probability
From Everand
Harmonic Analysis and the Theory of Probability
Salomon Bochner
No ratings yet
Independent Set Problem
No ratings yet
Independent Set Problem
5 pages
The Class of Languages: Polynomial-Time TM
No ratings yet
The Class of Languages: Polynomial-Time TM
5 pages
Procedures Versus Algorithms: Recursively Enumerable
No ratings yet
Procedures Versus Algorithms: Recursively Enumerable
6 pages
Stupid Turing Machine Tricks
No ratings yet
Stupid Turing Machine Tricks
6 pages
Outline of Turing Machines and Complexity
No ratings yet
Outline of Turing Machines and Complexity
5 pages
Closure Properties of CFL's - Substitution Proof
No ratings yet
Closure Properties of CFL's - Substitution Proof
4 pages
Cleaning Up Grammars: Useless - Productions
No ratings yet
Cleaning Up Grammars: Useless - Productions
5 pages
Formal Denition of Finite Automaton: States Input Symbols Start/initial Nal/accepting Transition Function
No ratings yet
Formal Denition of Finite Automaton: States Input Symbols Start/initial Nal/accepting Transition Function
5 pages
Decision Properties of Regular Languages
No ratings yet
Decision Properties of Regular Languages
4 pages
Context-Free Grammars: Variables
No ratings yet
Context-Free Grammars: Variables
5 pages
Equivalence of CFG's and PDA's
No ratings yet
Equivalence of CFG's and PDA's
3 pages
Finite Automata With - Transitions
No ratings yet
Finite Automata With - Transitions
5 pages
Slides 7
No ratings yet
Slides 7
4 pages

Extended RE's: Character Classes

Uploaded by

Extended RE's: Character Classes

Uploaded by

Extended RE's

UNIX pioneered the use of additional operators

You might also like