0% found this document useful (0 votes)

79 views5 pages

LEX Notes

The document provides an overview of LEX, a lexical analyzer generator used to create scanners that break input into tokens. It outlines the structure of a LEX specification, including declarations, transition rules, and auxiliary functions, and provides several code examples demonstrating pattern matching, counting words and numbers, and analyzing C code components. Additionally, it covers basic pattern matching techniques and counting string lengths, vowels, and consonants.

Uploaded by

aatreyeedev05

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

79 views5 pages

LEX Notes

Uploaded by

aatreyeedev05

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

CODING PART

Basics:
 LEX = LEXical analyzer generator
 It is a tool used to create lexical analyzers (scanners)
 Basically, LEX programs that read input and break it into tokens, which are
meaningful sequences of characters (like keywords, numbers, or identifiers)
 Alternatives to LEX – FLEX (Fast LEX) and JLEX (Java LEX)
 Structure of a LEX specification (specification ≈ program):

 Structure of a LEX Specification: There are three sections – Declarations,

Transition rules and Auxiliary functions
%{Declarations%}
%%
Transition rules
%%
Auxiliary functions

 Here, %% acts as a separator between two sections

Some codes
1. Pattern matching:
%{
#include <stdio.h>
%}

[0-9]+ { printf("NUMBER: %s\n", yytext); }

[a-zA-Z]+ { printf("WORD: %s\n", yytext); }
[ \t\n]+ { /* ignore whitespace */ }
. { printf("UNKNOWN: %s\n", yytext); }

int main() {
yylex(); // Call the lexer
return 0;
}
 Here,
a. [0-9]+ matches a sequence of digits from 0 to 9 (it is basically a regular
expression) from the input, and categorises it as a number
b. [a-zA-Z]+ matches a sequence of lowercase or uppercase characters
c. [ \t\n]+ matches a sequence of spaces, tabs and new line characters
d. All other sequences (indicated by ‘.’) are ignored
 yytext is a built in variable that contains the text matched by the current rule
(like for instance, in “April 1st 2025”, the sequence “April” is matched by the
second rule as we scan from left to right ⟶ “April” is stored in the variable
yytext temporarily, and that is printed
 After that, the space is ignored
 And then the sequence “1” is matched by the first rule ⟶ “1” is stored in the
variable yytext temporarily (replaces “April”, and that is printed. This goes on
 yylex() is the function that starts the lexical analysis from left to right and that
must be called in the main function
2. Counting the number of words and numbers:

%{
#include <stdio.h>
int words = 0, numbers = 0;
%}

[0-9]+ { numbers++; }
[a-zA-Z]+ { words++; }
[ \t\n]+ { /* do nothing & skip spaces */ }

int main() {
yylex();
printf("Total Words: %d\n", words);
printf("Total Numbers: %d\n", numbers);
return 0;
}

 We declare variables words = 0 and numbers = 0 initially

 When the lexer scans from left to right and identifies a sequence of digits (0 to
9), that is considered as a number, and the variable ‘numbers’ is incremented
 Similarly, when a sequence of characters is encountered, it is considered as a
word and the variable ‘words’ is incremented
 Here, we don’t really have to make use of yytext
 When we run yylex(), it takes in the input and based on that, performs whatever
has been described amongst the productions
3. Breaking down the components of a C Code:

%{
#include <stdio.h>
%}

"if" { printf("IF keyword\n"); }

"else" { printf("ELSE keyword\n"); }
"while" { printf("WHILE keyword\n"); }
"return" { printf("RETURN keyword\n"); }
[a-zA-Z_][a-zA-Z0-9_]* { printf("ID: %s\n", yytext); }
[0-9]+ { printf("NUMBER: %s\n", yytext); }
. { /* ignore other characters */ }

int main() {
yylex();
return 0;
}

 The ones within “” are counted as strings, and are matched based only if the
length, and the characters match (including lower/upper case)
Basic pattern matches:
Pattern Matches
a Only the character ‘a’
a|b Either the character ‘a’ or ‘b’
Anything except whatever has been
.
declared previously (equivalent to ‘else’)
\n New line characters
\t Tab character
\r Carriage return
\\ A single backslash ⟶ \
\” A single doublequote ⟶ “
[abc] Any one of a, b, or c
[^abc] Any character except a, b, c
[a-z] Any lowercase letter
[A-Z] Any uppercase letter
[0-9] Any digit
[a-zA-Z] Any letter
[a-zA-Z0-9_] Any letter, digit, or underscore
a* Zero or more a
a+ One or more a
a? Zero or one a
a{3} Exactly three as
a{2,4} Between 2 and 4 as
ab a followed by b
“something” The exact word “something”
^ Start of line (outside brackets)
End of line (not supported in old LEX
$
versions)
\ Escape next character
[aA][a-zA-Z0-9_]* Words starting with a or A

4. Counting the length of a string:

%{
#include <stdio.h>
#include <string.h>
%}

[a-zA-Z0-9]+ {
printf("Length of input: %lu\n", strlen(yytext));
}

.|\n { /* ignore everything else */ }

int main() {
yylex();
return 0;
}
5. Counting the number of vowels and consonants:

%{
#include <stdio.h>
#include <ctype.h>

int v_count = 0;
int c_count = 0;
%}

[aAeEiIoOuU] { v_count++; }
[b-df-hj-np-tv-zB-DF-HJ-NP-TV-Z] { c_count++; }
.|\n

int main() {
yylex(); // Start scanning input
printf("Vowels: %d\n", v_count);
printf("Consonants: %d\n", c_count);
return 0;
}

CD Unit-Ii
No ratings yet
CD Unit-Ii
34 pages
Daa Unit-2final
No ratings yet
Daa Unit-2final
33 pages
Concatenating Two Linked Lists in C
No ratings yet
Concatenating Two Linked Lists in C
12 pages
JSP Programs
No ratings yet
JSP Programs
3 pages
CCS355 Neural Network and Deep Learning
No ratings yet
CCS355 Neural Network and Deep Learning
32 pages
ccs355-NNDL Notes
No ratings yet
ccs355-NNDL Notes
158 pages
DAA Unit1
No ratings yet
DAA Unit1
26 pages
Data Structure Introduction
No ratings yet
Data Structure Introduction
88 pages
Purpose of JSP
100% (1)
Purpose of JSP
21 pages
CD Unit-V
No ratings yet
CD Unit-V
10 pages
Object Oriented Programming: Exception Handling in Java
No ratings yet
Object Oriented Programming: Exception Handling in Java
21 pages
Hash Collision Resolution Techniques
No ratings yet
Hash Collision Resolution Techniques
17 pages
JDBC Connection
No ratings yet
JDBC Connection
21 pages
The Phases of Compiler
No ratings yet
The Phases of Compiler
3 pages
Theory of Computation
100% (1)
Theory of Computation
48 pages
Compiler Design UNIT III
No ratings yet
Compiler Design UNIT III
20 pages
CSE B.Tech: Instruction Set Basics
No ratings yet
CSE B.Tech: Instruction Set Basics
11 pages
Compiler Run-Time & Memory Management
No ratings yet
Compiler Run-Time & Memory Management
22 pages
Kuk B.tech Cse Automata Theory
No ratings yet
Kuk B.tech Cse Automata Theory
237 pages
006chapter 6 - Intermediate Code Generation
No ratings yet
006chapter 6 - Intermediate Code Generation
23 pages
Yacc
No ratings yet
Yacc
5 pages
RG, Re-Rg, Fa-Rg, Rg-Fa, RLG-LLG, LLG-RLG
No ratings yet
RG, Re-Rg, Fa-Rg, Rg-Fa, RLG-LLG, LLG-RLG
19 pages
LR Parsing Methods
No ratings yet
LR Parsing Methods
50 pages
DBMS Unit-V
0% (1)
DBMS Unit-V
48 pages
Basic of C Language
100% (1)
Basic of C Language
29 pages
C Program To Implement Evaluation of Postfix Expression Using Stack
0% (1)
C Program To Implement Evaluation of Postfix Expression Using Stack
2 pages
Token Separation & Parsing Guide
82% (11)
Token Separation & Parsing Guide
47 pages
Asymptotic Notations
No ratings yet
Asymptotic Notations
4 pages
TOC in 8 Hours
100% (1)
TOC in 8 Hours
312 pages
Chapter 3 - Lexical Analysis
100% (1)
Chapter 3 - Lexical Analysis
51 pages
8 File Handling in C'
83% (6)
8 File Handling in C'
7 pages
Advance C Programming100-Ques
No ratings yet
Advance C Programming100-Ques
28 pages
User Defined Ordinal Type
No ratings yet
User Defined Ordinal Type
8 pages
Unit 4: Symbol Table
No ratings yet
Unit 4: Symbol Table
38 pages
Coding Questions
No ratings yet
Coding Questions
166 pages
Analysis of Algorithms CS 477/677: Hashing Instructor: George Bebis
No ratings yet
Analysis of Algorithms CS 477/677: Hashing Instructor: George Bebis
53 pages
Moore and Mealy
No ratings yet
Moore and Mealy
39 pages
TOC Unit 5 PDF
No ratings yet
TOC Unit 5 PDF
17 pages
CD Unit - 4
No ratings yet
CD Unit - 4
39 pages
Peephole Optimization in Compilers
No ratings yet
Peephole Optimization in Compilers
14 pages
Hashing and Indexing
No ratings yet
Hashing and Indexing
28 pages
Automata Theory & Compiler Design
No ratings yet
Automata Theory & Compiler Design
69 pages
Lecture-7 Turing Machine As Adder
No ratings yet
Lecture-7 Turing Machine As Adder
15 pages
FLAT
No ratings yet
FLAT
85 pages
Theory of Computation-Lecture 1
No ratings yet
Theory of Computation-Lecture 1
78 pages
CD Unit-Iii
No ratings yet
CD Unit-Iii
20 pages
Bottom Up Parsing - LR Parsers (LR (0), SLR, CLR and LALR Parsers)
0% (1)
Bottom Up Parsing - LR Parsers (LR (0), SLR, CLR and LALR Parsers)
7 pages
Data Structure Complete Notes
No ratings yet
Data Structure Complete Notes
115 pages
Theory of Computation
No ratings yet
Theory of Computation
4 pages
Programming Language Data Types
No ratings yet
Programming Language Data Types
154 pages
Scripting Language Lab Manual
No ratings yet
Scripting Language Lab Manual
27 pages
Unit-1 FiniteAutomata
No ratings yet
Unit-1 FiniteAutomata
89 pages
CD Lab Manual
100% (1)
CD Lab Manual
55 pages
DAA Notes
No ratings yet
DAA Notes
222 pages
Lab Manual - Compiler Lab CSL411 Manual 2022
No ratings yet
Lab Manual - Compiler Lab CSL411 Manual 2022
114 pages
Lex Tool and Program Examples
No ratings yet
Lex Tool and Program Examples
39 pages
Lex and Yacc Programming Guide
No ratings yet
Lex and Yacc Programming Guide
21 pages
System Programming Lab: LEX: Lexical Analyser Generator
No ratings yet
System Programming Lab: LEX: Lexical Analyser Generator
33 pages
Cs6109 - Compiler Design: Lab Assignment
No ratings yet
Cs6109 - Compiler Design: Lab Assignment
8 pages
Software Engg
No ratings yet
Software Engg
72 pages
Different Definitions of AI
No ratings yet
Different Definitions of AI
43 pages
Lab 4 - OPFT and Shift Reduce Parser
No ratings yet
Lab 4 - OPFT and Shift Reduce Parser
69 pages
Lab 6 - LEX
No ratings yet
Lab 6 - LEX
7 pages
Gold Experience 2nd Edition, A2 Key For Schools, Teacher S Book With Digital Tools and Resources
No ratings yet
Gold Experience 2nd Edition, A2 Key For Schools, Teacher S Book With Digital Tools and Resources
7 pages
Shad Log
No ratings yet
Shad Log
28 pages
Ethernet I/O Modules: ADAM-6000 Series
No ratings yet
Ethernet I/O Modules: ADAM-6000 Series
1 page
Peer-to-Peer Network Algorithms
No ratings yet
Peer-to-Peer Network Algorithms
65 pages
Unifi Digital ID (UDID) FAQ Guide
No ratings yet
Unifi Digital ID (UDID) FAQ Guide
7 pages
Unit 1 PPT
No ratings yet
Unit 1 PPT
49 pages
Business Information Systems - Notes
100% (1)
Business Information Systems - Notes
6 pages
Keyboard Shortcuts Bricscad
No ratings yet
Keyboard Shortcuts Bricscad
1 page
RF and Microwave Power Sensors/Meters: Tektronix PSM3000, PSM4000, and PSM5000 Series Data Sheet
No ratings yet
RF and Microwave Power Sensors/Meters: Tektronix PSM3000, PSM4000, and PSM5000 Series Data Sheet
12 pages
Learning To Compare: Relation Network For Few-Shot Learning
No ratings yet
Learning To Compare: Relation Network For Few-Shot Learning
10 pages
CCNA Module Exam Solutions
No ratings yet
CCNA Module Exam Solutions
8 pages
MP Stack 1001260890 2021096
No ratings yet
MP Stack 1001260890 2021096
2 pages
03 Scope Management Plan
No ratings yet
03 Scope Management Plan
30 pages
Evot Vx-Nexg Dco
No ratings yet
Evot Vx-Nexg Dco
39 pages
User Pass Combo by @magic - CKG
No ratings yet
User Pass Combo by @magic - CKG
1,202 pages
Microsoft Azure Presentation
No ratings yet
Microsoft Azure Presentation
32 pages
Answer Evaluation
No ratings yet
Answer Evaluation
7 pages
Glitch v1.3 VST Plugin Guide
No ratings yet
Glitch v1.3 VST Plugin Guide
14 pages
Google 10000 English No Swears
No ratings yet
Google 10000 English No Swears
168 pages
Maths Revision Worksheet 3
No ratings yet
Maths Revision Worksheet 3
4 pages
EPM 4463 Day 5 - Analyze The Initiation and Baselining Phase and The Role of A Project Manager (PM) Part 2
No ratings yet
EPM 4463 Day 5 - Analyze The Initiation and Baselining Phase and The Role of A Project Manager (PM) Part 2
27 pages
DS-2TXS2628-3P Qa GLT CH30S80 V5.5.50 220721
No ratings yet
DS-2TXS2628-3P Qa GLT CH30S80 V5.5.50 220721
8 pages
Co3102 Co7102 CW2 2024
No ratings yet
Co3102 Co7102 CW2 2024
8 pages
EEZG612 Lecture7 Enhancements
No ratings yet
EEZG612 Lecture7 Enhancements
11 pages
Data Mining Tutorial - Javatpoint
No ratings yet
Data Mining Tutorial - Javatpoint
16 pages
Mcdonald Solution
No ratings yet
Mcdonald Solution
7 pages
Midterm OOP Project
No ratings yet
Midterm OOP Project
4 pages
Smartwatch Market Analysis
No ratings yet
Smartwatch Market Analysis
6 pages
Numerical Linear Algebra Guide
No ratings yet
Numerical Linear Algebra Guide
695 pages
Computer Science Practical File XI - Certificate and Index Page
No ratings yet
Computer Science Practical File XI - Certificate and Index Page
5 pages

LEX Notes

Uploaded by

LEX Notes

Uploaded by

CODING PART

 Structure of a LEX Specification: There are three sections – Declarations,

 Here, %% acts as a separator between two sections

[0-9]+ { printf("NUMBER: %s\n", yytext); }

 We declare variables words = 0 and numbers = 0 initially

"if" { printf("IF keyword\n"); }

4. Counting the length of a string:

.|\n { /* ignore everything else */ }

You might also like