Lab Session 1 - Lexical Analyzer

This document discusses constructing and using a lexical analyzer. It begins by introducing regular expressions that will be used to build a lexical analyzer. Next, it outlines the steps to construct a lexical analyzer from the regular expressions using a tool called LexLab. These steps include loading a grammar, generating an NFA and DFA, and viewing the transition tables. The document then demonstrates how to use the constructed lexical analyzer to scan input strings and view the resulting symbol table. Finally, it discusses constructing a lexical analyzer for a subset of Java called simpleJava and provides the EBNF grammar for simpleJava.

Uploaded by

Nasasira Julius

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

120 views

Lab Session 1 - Lexical Analyzer

Uploaded by

Nasasira Julius

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Lab Session 1: Constructing and Using a

Lexical Analyzer

1 Introduction
In this lab session, we look at how to generate a lexical analyzer using LexLab, view the
generated NFA and DFA, and then use the generated analyzer to scan some input strings and
view the output symbol table.

To do this we shall use the following simple abstract example that contains three regular
expressions: r1 = a, r2 = abb, r3 = a*b+
Next, we shall construct a lexical analyzer for simpleJava --- a small subset of Java language ---
which is a more realistic example, and then use the constructed lexical analyzer to scan some
realistic input strings.

2 Constructing a Lexical Analyzer from Regular Expressions To proceed with

the construction of a lexical analyzer you need to define a lexical grammar, from the three regular
expressions, i.e., r1 = a, r2 = abb, r3 = a*b+. Obtain a copy of the grammar from the file
“regExp.txt” in the LexLab\dist folder. To construct the lexical analyzer:
1) Double click the executable jar file “Scanning.jar” from the LexLab folder. A simple
GUI opens. See Fig 1: for sample GUI.
2) Load the grammar into the grammar text editor. Note that for this example the
grammar is already there.
3) Parse the grammar by using the Commit button.
4) Generate NFA by using Grammar2NFA button.
5) Generate DFA by using NFA2DFA button.
Note: The NFA and the corresponding DFA transition tables have been generated and can be
viewed from the NFA and DFA tab sheets. The lexical analyzer for our regular expressions is
now constructed.

3 Scanning with the Lexical Analyzer

Follow the following steps:
1) Enter a string “aabbba” in the text editor, click ScanText. The lexical analyzer reads the string
and produces a list of symbols in Symbol Table tab sheet. The table contains the following
fields: ∙ SymCode - a unique code of a symbol.
∙ SymName - name of the symbol.
∙ SymValue - value of the symbol
∙ SymStart - start position of a symbol in the input string.
∙ SymLength - length of the value of the symbol.
From string “aabbba" the table shows that the lexical analyzer has recognized two symbols,
r3=”aabbb" and r1=`a', because the rule is to take the longest match. The last symbol
“endmarker" is a special symbol which signals the end of input to the analyzer.

1
2) Edit the text editor value to string “abb". After scanning the string, note that the symbol
list is changed. It shows that the lexical analyzer has recognized one symbol, r2=”abb".
Note that the grammar we defined contains some conflict resolutions which we
discussed in class. String “abb" matches both regular expressions r2 and r3 but it is considered
as a symbol for r2, since expression r2 is listed first in the grammar. 3) Again, edit the text
editor value to string “ababcdb". Note that the symbol list is changed. Note that the lexical
analyzer has recognized an error, i.e. a substring “cd" of length 2, starting at position 4 in the
input string.

4 Constructing a Lexical Analyzer for simpleJava

To proceed with the construction, follow similar steps as presented in Step 2 above. Obtain a
copy of the lexical grammar for simpleJava from a file “simpleJava.txt". After completing the
process in Step 2 above, the grammar, NFA and DFA transition tables will be updated and the
lexical analyzer for simpleJava is ready for use.
Note: The transition tables become extremely large depending on the set of tokens of a
language. As a result, they may not be interesting to look at.

Now, edit the text editor value to the following Java program text:

class pay{
int items;
int pay;

void computePay(){
if(item<10)
pay=1000;
else
pay=items * 10;
}
}

When the Java program is scanned the symbol table is updated

immediately with a long list of symbols that have been recognized.

Note: Practice by editing the above program; try to include right and wrong tokens. Note down
what you observe.

2
Fig 1: GUI of LexLab showing a sample lexical grammar (top-left), input
string (bottom-left) and the generated list of symbols (right).

3
SimpleJava EBNF
ClassDeclaration=”class” Identifier “{“ VarDeclaration* MethodDeclaration*
”}” VarDeclaration= Type Identifier “;”
MethodDeclaration= Type Identifier “(” ”)” “{” Statement* ”}”
Type=int|boolean
Statement=”{“ Statement* “}”
| Identifier “=” Expression”;”
| ”if” “(“ Expression ”)” Statement “else” Statement
Expression= Expression (“<”|” >”|”+”|”-“|”*”) Expression
| “true”
| “false”
| Identifier
| Number

Lexical Aspects
An identifier is a sequence of letters (lower and upper) and digits starting with a
letter. A number is a sequence of digits 0 to 9
A binary operator is any of the following binary operators: <, >, +, -,*

Java Codelab Solutions - Section 2.1 Java Application Structure
0% (2)
Java Codelab Solutions - Section 2.1 Java Application Structure
5 pages
Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
CS3304 9 LanguageSyntax 2 PDF
No ratings yet
CS3304 9 LanguageSyntax 2 PDF
39 pages
Lab
0% (1)
Lab
32 pages
Chapter 2 - Lexical Analyser
No ratings yet
Chapter 2 - Lexical Analyser
38 pages
Chapter 2 - Lexical Analyser
No ratings yet
Chapter 2 - Lexical Analyser
39 pages
Chapter 3 Lexical Analysis
No ratings yet
Chapter 3 Lexical Analysis
5 pages
Chapter 2 - Lexical Analyser
No ratings yet
Chapter 2 - Lexical Analyser
40 pages
Compiler_Construction_Lexical_Analysis
No ratings yet
Compiler_Construction_Lexical_Analysis
63 pages
Chpater 2 Lexical Analysis
No ratings yet
Chpater 2 Lexical Analysis
48 pages
Unit2
No ratings yet
Unit2
61 pages
L2 Lexical Analysis
No ratings yet
L2 Lexical Analysis
59 pages
CC_unit_2
No ratings yet
CC_unit_2
80 pages
Chapter-2[1]
No ratings yet
Chapter-2[1]
77 pages
Lexical Analyzer
No ratings yet
Lexical Analyzer
17 pages
L4 - Lexical Analysis
No ratings yet
L4 - Lexical Analysis
44 pages
Lexical Analysis: Dr. Murali Krishna Enduri Department of CSE
No ratings yet
Lexical Analysis: Dr. Murali Krishna Enduri Department of CSE
88 pages
2 Lexical Analyzer
No ratings yet
2 Lexical Analyzer
21 pages
Lexical Analysis
No ratings yet
Lexical Analysis
88 pages
Chapter 2 Lexical Analysis (Scanning) Edited
No ratings yet
Chapter 2 Lexical Analysis (Scanning) Edited
46 pages
Chapter 2
No ratings yet
Chapter 2
56 pages
CD GTU Study Material Presentations Unit-2 27082020063553AM
No ratings yet
CD GTU Study Material Presentations Unit-2 27082020063553AM
84 pages
Chap 11
No ratings yet
Chap 11
28 pages
CD GTU Study Material Presentations Unit-2 27082020063553AM
No ratings yet
CD GTU Study Material Presentations Unit-2 27082020063553AM
84 pages
CD - CH2 - Lexical Analysis
No ratings yet
CD - CH2 - Lexical Analysis
67 pages
Lexical Analyzer
No ratings yet
Lexical Analyzer
31 pages
Unit 1 (B)
No ratings yet
Unit 1 (B)
69 pages
Unit 2
No ratings yet
Unit 2
14 pages
A Typical Lexical Analyzer Generator Nfa To Dfa DFA Analysis
No ratings yet
A Typical Lexical Analyzer Generator Nfa To Dfa DFA Analysis
64 pages
Compilers: Topic 2: Lexical Analysis
No ratings yet
Compilers: Topic 2: Lexical Analysis
29 pages
Compiler Design Part 2
No ratings yet
Compiler Design Part 2
20 pages
Unit 2 Lexical Analyzer
No ratings yet
Unit 2 Lexical Analyzer
30 pages
Lexical Analysis
No ratings yet
Lexical Analysis
31 pages
The lexical analyzer
No ratings yet
The lexical analyzer
4 pages
CD - CH2 - Lexical Analysis
No ratings yet
CD - CH2 - Lexical Analysis
59 pages
Lecture 02
No ratings yet
Lecture 02
150 pages
cd1
No ratings yet
cd1
92 pages
CS-352 - Spring 2024 - Lec2
No ratings yet
CS-352 - Spring 2024 - Lec2
35 pages
Lexical Analysis Using Jflex: Tokens
No ratings yet
Lexical Analysis Using Jflex: Tokens
39 pages
lexicalanalysis-160516142825
No ratings yet
lexicalanalysis-160516142825
39 pages
HW_31712
No ratings yet
HW_31712
22 pages
Lexical Analysis and Parsing CD
No ratings yet
Lexical Analysis and Parsing CD
107 pages
Lexeme Generator 5th Sem 2009 REPORT
100% (1)
Lexeme Generator 5th Sem 2009 REPORT
78 pages
4 Lexical Analysis
No ratings yet
4 Lexical Analysis
60 pages
Lexical Analysis
No ratings yet
Lexical Analysis
62 pages
Lecture 03
No ratings yet
Lecture 03
42 pages
Chapter 2-Lexical Analysis
No ratings yet
Chapter 2-Lexical Analysis
48 pages
CD_UNIT-2
No ratings yet
CD_UNIT-2
64 pages
CD Unit-2
No ratings yet
CD Unit-2
64 pages
Compiler Design 2
No ratings yet
Compiler Design 2
9 pages
c2 PDF
No ratings yet
c2 PDF
13 pages
Chapter 2 Lexical Analysis
No ratings yet
Chapter 2 Lexical Analysis
33 pages
Day 2 - Lexial Analyzer
No ratings yet
Day 2 - Lexial Analyzer
37 pages
Chapter 2
No ratings yet
Chapter 2
91 pages
Lexical and Syntax Analysis: CSE 325/CSE 425: Concepts of Programming Language
No ratings yet
Lexical and Syntax Analysis: CSE 325/CSE 425: Concepts of Programming Language
41 pages
Lexical and Syntax Analysis: CSE 325/CSE 425: Concepts of Programming Language
No ratings yet
Lexical and Syntax Analysis: CSE 325/CSE 425: Concepts of Programming Language
41 pages
Ch2 - Lexical Analysis
No ratings yet
Ch2 - Lexical Analysis
76 pages
2-Lexical Analysis
No ratings yet
2-Lexical Analysis
52 pages
Compiler Gold
No ratings yet
Compiler Gold
46 pages
Chapter 2 - Lexical Analysis
No ratings yet
Chapter 2 - Lexical Analysis
10 pages
Lexical Analysis
No ratings yet
Lexical Analysis
14 pages
Lesson 2 Homework 5.3
100% (2)
Lesson 2 Homework 5.3
5 pages
ISY150 Beveling Machine CJ2018050301cl
No ratings yet
ISY150 Beveling Machine CJ2018050301cl
1 page
Analisis Geotecnico Costa Fuera Del Problema de Deformacion en Arcilla Blanda
No ratings yet
Analisis Geotecnico Costa Fuera Del Problema de Deformacion en Arcilla Blanda
303 pages
Excel: A Brief Overview
100% (1)
Excel: A Brief Overview
31 pages
Aromatic Chemistry
No ratings yet
Aromatic Chemistry
10 pages
Syntactic Change Summary
No ratings yet
Syntactic Change Summary
5 pages
A Survey On Mapping Semi-Structured Data and Graph Data To Relational Data
No ratings yet
A Survey On Mapping Semi-Structured Data and Graph Data To Relational Data
38 pages
PD ENEC 303 Annex I - December 2010
No ratings yet
PD ENEC 303 Annex I - December 2010
2 pages
ICSD#23808
No ratings yet
ICSD#23808
2 pages
1.dynamic Characteristics Introduction
No ratings yet
1.dynamic Characteristics Introduction
20 pages
Michel Et Al. (2016)
No ratings yet
Michel Et Al. (2016)
12 pages
Intro To Templates
No ratings yet
Intro To Templates
23 pages
CRSI Manual To Design RC Diaphragms - Part9
No ratings yet
CRSI Manual To Design RC Diaphragms - Part9
4 pages
Ns E: Earth Science: There Are Two Types of Igneous Rocks
No ratings yet
Ns E: Earth Science: There Are Two Types of Igneous Rocks
4 pages
Oracle AQs Presentation
No ratings yet
Oracle AQs Presentation
15 pages
Tetrahedral Element: Section 12: VOLUME ELEMENTS
No ratings yet
Tetrahedral Element: Section 12: VOLUME ELEMENTS
48 pages
Analyzing The Impact of Supermarket Promotions
No ratings yet
Analyzing The Impact of Supermarket Promotions
5 pages
3-Bromo-2-Butanol When Treated With HBR Threo DL Pair
100% (1)
3-Bromo-2-Butanol When Treated With HBR Threo DL Pair
54 pages
Cx4 VHF Antenna
No ratings yet
Cx4 VHF Antenna
3 pages
Acetaldehyde Scavengers For Poly (Ethylene Terephthalate) - Chemis
No ratings yet
Acetaldehyde Scavengers For Poly (Ethylene Terephthalate) - Chemis
334 pages
Trial STPM TERENGANU 2011 (Mathematics T Paper 2)
No ratings yet
Trial STPM TERENGANU 2011 (Mathematics T Paper 2)
4 pages
Fmvss 121 Spec
No ratings yet
Fmvss 121 Spec
23 pages
Mini-Task 4 m100 History of Mathematics
No ratings yet
Mini-Task 4 m100 History of Mathematics
11 pages
A Case Study On Socio-Cultural Impacts of Tourism in The City of Jaipur, Rajasthan: India
No ratings yet
A Case Study On Socio-Cultural Impacts of Tourism in The City of Jaipur, Rajasthan: India
14 pages
Page Fault
No ratings yet
Page Fault
10 pages
Geography of Romania - Curs
No ratings yet
Geography of Romania - Curs
13 pages
Natural Gas Measurement Common Standards - 20220803
No ratings yet
Natural Gas Measurement Common Standards - 20220803
7 pages
Dispatcher
No ratings yet
Dispatcher
38 pages
Practical for PHP Programming
No ratings yet
Practical for PHP Programming
20 pages