0% found this document useful (0 votes)

76 views28 pages

Understanding Compiler Architecture and Phases

Uploaded by

prantostart

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

76 views28 pages

Understanding Compiler Architecture and Phases

Uploaded by

prantostart

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Compiler

Segment 1

Md. Nazmul Arefin

Lecturer
Dept. of Computer Science & Engineering
International Islamic University Chittagong
A language processing system
● We write programs in high-level language, which is
easier for us to understand and remember.
● These programs are then fed into a series of tools and
OS components to get the desired code that can be
used by the machine. This is known as Language
Processing System.
● The high-level language is converted into binary
language in various phases.
● A compiler is a program that converts high-level
language to assembly language.
● Similarly, an assembler is a program that converts the
assembly language to machine-level language.

2
A language processing system

Let us first understand how a program, using C compiler, is executed on a host machine.
● User writes a program in C language (high-level language).
● The C compiler, compiles the program and translates it to assembly program (low-level
language).
● An assembler then translates the assembly program into machine code (object).
● A linker tool is used to link all the parts of the program together for execution (executable
machine code).
● A loader loads all of them into memory and then the program is executed.

3
A language processing system (tools)

Preprocessor
A preprocessor, generally considered as a part of compiler, is a tool that produces input
for compilers. It deals with macro-processing, augmentation, file inclusion, language
extension, etc.
Assembler
An assembler translates assembly language programs into machine code. The output of an
assembler is called an object file, which contains a combination of machine instructions as
well as the data required to place these instructions in memory.
Linker
Linker is a computer program that links and merges various object files together in order
to make an executable file. All these files might have been compiled by separate
assemblers. The major task of a linker is to search and locate referenced module/routines
in a program and to determine the memory location where these codes will be loaded,
making the program instruction to have absolute references.

4
A language processing system (tools)
Loader
Loader is a part of operating system and is responsible for loading executable files into
memory and execute them. It calculates the size of a program (instructions and data) and
creates memory space for it. It initializes various registers to initiate execution.

Cross-compiler
A compiler that runs on platform (A) and is capable of generating executable code for
platform (B) is called a cross-compiler.

5
Compiler

● A program that can read a program in one language (Source language) and translate it
into an equivalent program in another language (target language).
● An important role of the compiler is to report any errors in the source program that it
detects during the translation process.

Source program input

Compiler Target program

Target program output

6
interpreter

● Instead of producing a target program as a translation, an interpreter appears to directly

execute the operations specified in the source program on inputs supplied by the user.

Source program
Interpreter Output
Input

7
Beneﬁts and diﬀerence

● The machine language target program produced by a compiler is usually much faster
● Interpreter can give better error diagnostics than a compiler, because it executes the
source program statement by statement.

Compiler Interpreter

Compiler scans the whole program in one go. Translates program one statement at a time.

As it scans the code in one go, the errors (if any) are Considering it scans code one line at a time, errors
shown at the end together. are shown line by line.

C, C++, C# etc. Python, Ruby, Perl, SNOBOL, MATLAB etc.

8
Example

● Java language processors combine compilation and interpretation.

● Intermediate code: bytecode

Source program

Translator

Intermediate program
Virtual machine Output
Input

9
Compiler Design - Architecture

10
Architecture

A compiler can broadly be divided into two phases based on the way they compile.

1. Analysis phase
2. Synthesis phase

11
Architecture

Analysis Phase
● Known as the front-end of the compiler
● The analysis phase of the compiler reads the source program, divides it into core parts
and then checks for lexical, grammar and syntax errors.
● The analysis phase generates an intermediate representation of the source program and
symbol table, which should be fed to the Synthesis phase as input.

12
Architecture

Synthesis Phase
● Known as the back-end of the compiler, the synthesis phase generates the target program
with the help of intermediate source code representation and symbol table.

13
Phases of Compiler
● The compilation process is a sequence of various
phases.
● Each phase takes input from its previous stage, has its
own representation of source program, and feeds its
output to the next phase of the compiler.

14
Lexical Analysis

The first phase of a compiler is called lexical analysis or scanning.

The lexical analyzer reads the stream of characters making up the source program and groups
the characters into meaningful sequences called lexemes.

For each lexeme, the lexical analyzer produces as output a token of the form that it passes on to
the subsequent phase, syntax analysis.:

In the token, the first component token-name is an abstract symbol that is used during syntax
analysis, and the second component attribute-value points to an entry in the symbol table for
this token.

Information from the symbol-table entry'is needed for semantic analysis and code generation.

15
Lexical Analysis

Att. token name

val.

1 id position

2 id initial

3 id rate

Symbol Table
16
Syntax Analysis

The second phase of the compiler is syntax analysis

or parsing.

The parser uses the first components of the tokens

produced by the lexical analyzer to create a tree-like
intermediate representation that depicts the
grammatical structure of the token stream.

A typical representation is a syntax tree in which each

interior node represents an operation and the children
of the node represent the arguments of the operation.

17
Syntax Analysis

This tree shows the order in which the operations in the assignment
position = initial + rate * 60
are to be performed. The tree has an interior node labeled * with (id, 3) as its left
child and the integer 60 as its right child. The node (id, 3) represents the identifier
rate. The node labeled * makes it explicit that we must first multiply the value of
rate by 60. The node labeled + indicates that we must add the result of this
multiplication to the value of initial. The root of the tree, labeled =, indicates that
we must store the result of this addition into the location for the identifier position.
This ordering of operations is consistent with the usual conventions of arithmetic
which tell us that multiplication has higher precedence than addition, and hence
that the multiplication is to be performed before the addition.

18
Semantic Analysis

The semantic analyzer uses the syntax tree and the information in the symbol table
to check the source program for semantic consistency with the language definition.
It also gathers type information and saves it in either the syntax tree or the symbol
table, for subsequent use during intermediate-code generation.
An important part of semantic analysis is type checking, where the compiler checks
that each operator has matching operands.
For example, many programming language definitions require an array index to be
an integer; the compiler must report an error if a floating-point number is used to
index an array.

19
Semantic Analysis

Suppose that position, initial, and rate

have been declared to be floating-point numbers,
and that the lexeme 60 by itself forms an integer.
The type checker in the semantic analyzer in the
figure discovers that the operator * is applied to a
floating-point number rate and an integer 60. In
this case, the integer may be converted into a
floating-point number.

20
Intermediate Code Generation

We consider an intermediate form called three-address code, which consists of a

sequence of assembly-like instructions with three operands per instruction.

Fig. 1.3

21
Intermediate Code Generation

There are several points worth noting about three-address instructions.

First, each three-address assignment instruction has at most one operator on the
right side. Thus, these instructions fix the order in which operations are to be done;
the multiplication precedes the addition in the source program.
Second, the compiler must generate a temporary name to hold the value computed
by a three-address instruction.
Third, some "three-address instructions" like the first and last in the sequence above,
have fewer than three operands

22
Code Optimization

The machine-independent code-optimization phase attempts to improve the

intermediate code so that better target code will result. Usually better means faster,
but other objectives may be desired, such as shorter code, or target code that
consumes less power.
The optimizer can deduce that the conversion of 60 from integer to floating point can
be done once and for all at compile time, so the inttofloat operation can be
eliminated by replacing the integer 60 by the floating-point number 60.0.
Moreover, t3 is used only once to transmit its value to id1 so the optimizer can
transform (1.3) into the shorter sequence:

23
Code Optimization

Intermediate Code

Optimized Code

Fig. 1.4

24
Code Generation

The code generator takes as input an intermediate representation of the source

program and maps it into the target language.
If the target language is machine code, registers Or memory locations are selected for
each of the variables used by the program.
Then, the intermediate instructions are translated into sequences of machine
instructions that perform the same task.
A crucial aspect of code generation is the judicious assignment of registers to hold
variables.

25
Code Generation

For example, using registers R1 and R2, the intermediate code in (1.4) might get
translated into the machine code

Fig. 1.5

26
Code Generation
The F in each instruction tells us that it deals with floating-point numbers.

The contents of address id3 into register R2. , then multiplies it with floating-point
constant 60.0. The # signifies that 60.0 is to be treated as an immediate constant.
The third instruction moves id2 into register Rl and the fourth adds to it the value
previously computed in register R2.
Finally, the value in register Rl is stored into the address of idl , so the code correctly
implements the assignment statement

27
THANK YOU

Compiler Design and Preprocessor Overview
No ratings yet
Compiler Design and Preprocessor Overview
13 pages
Compiler Design Overview and Phases
No ratings yet
Compiler Design Overview and Phases
18 pages
Compiler and Interpreter Overview
No ratings yet
Compiler and Interpreter Overview
10 pages
Introduction to Compiler Basics
No ratings yet
Introduction to Compiler Basics
29 pages
Compiler Design Overview and Phases
No ratings yet
Compiler Design Overview and Phases
67 pages
Compiler Design Lecture Notes
No ratings yet
Compiler Design Lecture Notes
58 pages
Compiler Design Overview
No ratings yet
Compiler Design Overview
28 pages
Compiler Design Overview and Phases
No ratings yet
Compiler Design Overview and Phases
74 pages
Understanding Compiler Structure and Functions
No ratings yet
Understanding Compiler Structure and Functions
30 pages
Compiler Design Overview and Phases
No ratings yet
Compiler Design Overview and Phases
69 pages
Understanding Compiler Construction Basics
No ratings yet
Understanding Compiler Construction Basics
8 pages
Compiler Design Principles Explained
No ratings yet
Compiler Design Principles Explained
68 pages
Introduction to Compiler Design
No ratings yet
Introduction to Compiler Design
13 pages
Compiler Design Overview and Phases
100% (1)
Compiler Design Overview and Phases
66 pages
Overview of Compiler Design Phases
No ratings yet
Overview of Compiler Design Phases
11 pages
Lect Notes
No ratings yet
Lect Notes
40 pages
Compiler Design Overview and Phases
No ratings yet
Compiler Design Overview and Phases
16 pages
Understanding PPTX Compiler Basics
No ratings yet
Understanding PPTX Compiler Basics
20 pages
Understanding Compiler Phases and Tools
No ratings yet
Understanding Compiler Phases and Tools
10 pages
Introduction to Compiler Design Basics
No ratings yet
Introduction to Compiler Design Basics
115 pages
Module 1
100% (1)
Module 1
135 pages
Overview of Compiler Design Phases
No ratings yet
Overview of Compiler Design Phases
111 pages
Compiler Design: Phases and Tools
100% (1)
Compiler Design: Phases and Tools
36 pages
Interpreter vs Compiler: Key Differences
No ratings yet
Interpreter vs Compiler: Key Differences
179 pages
Compiler Design Unit 1 Overview
No ratings yet
Compiler Design Unit 1 Overview
9 pages
Compiler Design Overview and Phases
No ratings yet
Compiler Design Overview and Phases
98 pages
Overview of Compiler Processes
No ratings yet
Overview of Compiler Processes
6 pages
Principles of Compiler Design Overview
No ratings yet
Principles of Compiler Design Overview
40 pages
Compiler Design Unit 1
No ratings yet
Compiler Design Unit 1
16 pages
Compiler Design Course Overview
No ratings yet
Compiler Design Course Overview
5 pages
Introduction to Compiler Design Concepts
No ratings yet
Introduction to Compiler Design Concepts
7 pages
Compiler Design Quick Guide
No ratings yet
Compiler Design Quick Guide
45 pages
Introduction to Compilers and Their Functions
No ratings yet
Introduction to Compilers and Their Functions
175 pages
Compiler Design: Unit 1 Overview
No ratings yet
Compiler Design: Unit 1 Overview
63 pages
Compiler Construction Overview
No ratings yet
Compiler Construction Overview
61 pages
Compiler Design Notes for GATE CSE
No ratings yet
Compiler Design Notes for GATE CSE
41 pages
Compiler Phases and Types Explained
No ratings yet
Compiler Phases and Types Explained
24 pages
Compiler Basics and Phases Explained
No ratings yet
Compiler Basics and Phases Explained
13 pages
Introduction to Compilers Overview
No ratings yet
Introduction to Compilers Overview
77 pages
Introduction to Compiler Construction
No ratings yet
Introduction to Compiler Construction
63 pages
Compiler Construction Course Overview
No ratings yet
Compiler Construction Course Overview
9 pages
Compiler Design Fundamentals and Phases
No ratings yet
Compiler Design Fundamentals and Phases
21 pages
Compiler Design Overview for GATE
No ratings yet
Compiler Design Overview for GATE
20 pages
Compiler Design Overview - BCS613C
No ratings yet
Compiler Design Overview - BCS613C
136 pages
Phases of Compiler Design Explained
No ratings yet
Phases of Compiler Design Explained
8 pages
Overview of Language Processing Systems
No ratings yet
Overview of Language Processing Systems
6 pages
Compiler Basics: Types and Phases
No ratings yet
Compiler Basics: Types and Phases
78 pages
Compiler Design - Quick Guide
No ratings yet
Compiler Design - Quick Guide
38 pages
Overview of Language Processors and Compilers
No ratings yet
Overview of Language Processors and Compilers
25 pages
Compiler Overview and Structure Guide
No ratings yet
Compiler Overview and Structure Guide
49 pages
Introduction to Compilers and Interpreters
No ratings yet
Introduction to Compilers and Interpreters
146 pages
Comprehensive Guide to Compiler Design
No ratings yet
Comprehensive Guide to Compiler Design
47 pages
Compiler Design and Language Processing
100% (1)
Compiler Design and Language Processing
115 pages
Understanding Compiler Design Basics
No ratings yet
Understanding Compiler Design Basics
42 pages
Compiler Design Overview and Phases
No ratings yet
Compiler Design Overview and Phases
8 pages
Compiler Design Overview and Phases
No ratings yet
Compiler Design Overview and Phases
121 pages
Compiler Design Overview and Phases
No ratings yet
Compiler Design Overview and Phases
37 pages
Legal Aspects of Software Quality Issues
No ratings yet
Legal Aspects of Software Quality Issues
45 pages
VxRail Power Status Dial Home Events
No ratings yet
VxRail Power Status Dial Home Events
2 pages
First 100 Days As A CIO
100% (10)
First 100 Days As A CIO
36 pages
OpenMP Essentials for Parallel Computing
No ratings yet
OpenMP Essentials for Parallel Computing
18 pages
Outdoor Wireless Access Point Datasheet
No ratings yet
Outdoor Wireless Access Point Datasheet
3 pages
Love2D.js Framework Overview
No ratings yet
Love2D.js Framework Overview
20 pages
Web Scraping Yelp Reviews with Python
No ratings yet
Web Scraping Yelp Reviews with Python
7 pages
CAN Protocol and LabVIEW Integration
No ratings yet
CAN Protocol and LabVIEW Integration
31 pages
SAP B1 Database Table and Field Guide
No ratings yet
SAP B1 Database Table and Field Guide
2 pages
Product Photography: 89 Essential Tips
No ratings yet
Product Photography: 89 Essential Tips
84 pages
External Party Risk Assessment Form
No ratings yet
External Party Risk Assessment Form
5 pages
Grade 2 MAPEH Distance Learning Report
No ratings yet
Grade 2 MAPEH Distance Learning Report
10 pages
B.Tech Computer Science II Year Courses
No ratings yet
B.Tech Computer Science II Year Courses
1 page
Final (2019 - 02 - 13 09 - 01 - 45 UTC)
No ratings yet
Final (2019 - 02 - 13 09 - 01 - 45 UTC)
102 pages
2.4 GHz Microstrip Patch Antenna Design
No ratings yet
2.4 GHz Microstrip Patch Antenna Design
5 pages
Likofit
No ratings yet
Likofit
21 pages
PSpice Circuit Simulation Basics
No ratings yet
PSpice Circuit Simulation Basics
10 pages
Software Quality Models and ISO 25010
No ratings yet
Software Quality Models and ISO 25010
11 pages
Decision Analytics Consultant Profile
No ratings yet
Decision Analytics Consultant Profile
1 page
Nonlinear Programming Techniques Overview
No ratings yet
Nonlinear Programming Techniques Overview
21 pages
20.3.4 - Repeated Roots Reduction of Order
No ratings yet
20.3.4 - Repeated Roots Reduction of Order
16 pages
Autodesk IRISVR Download Guide
No ratings yet
Autodesk IRISVR Download Guide
89 pages
Powersoft Duecanali-DSP DS en
No ratings yet
Powersoft Duecanali-DSP DS en
2 pages
Configuring MDG Data Replication Framework
No ratings yet
Configuring MDG Data Replication Framework
20 pages
Probabilistic Shape Modeling in Aging
No ratings yet
Probabilistic Shape Modeling in Aging
73 pages
Media Processing Log for Redmi Device
No ratings yet
Media Processing Log for Redmi Device
2,705 pages
React Budget Allocation Lab Guide
No ratings yet
React Budget Allocation Lab Guide
12 pages
D - OD - 511 - Installation Guide - SMARTSTAIN EVE - Diebold-Nixdorf RM4H
100% (1)
D - OD - 511 - Installation Guide - SMARTSTAIN EVE - Diebold-Nixdorf RM4H
12 pages
MT5004A to MT5016A Rectifiers Overview
No ratings yet
MT5004A to MT5016A Rectifiers Overview
4 pages
SIM Card Registration: Security vs. Privacy
No ratings yet
SIM Card Registration: Security vs. Privacy
65 pages