Syntax Directed Translation in Compiler Design
Last Updated :
07 Feb, 2025
Syntax-Directed Translation (SDT) is a method used in compiler design to convert source code into another form while analyzing its structure. It integrates syntax analysis (parsing) with semantic rules to produce intermediate code, machine code, or optimized instructions.
In SDT, each grammar rule is linked with semantic actions that define how translation should occur. These actions help in tasks like evaluating expressions, checking types, generating code, and handling errors.
SDT ensures a systematic and structured way of translating programs, allowing information to be processed bottom-up or top-down through the parse tree. This makes translation efficient and accurate, ensuring that every part of the input program is correctly transformed into its executable form.
SDT relies on three key elements:
- Lexical values of nodes (such as variable names or numbers).
- Constants used in computations.
- Attributes associated with non-terminals that store intermediate results.
The general process of SDT involves constructing a parse tree or syntax tree, then computing the values of attributes by visiting its nodes in a specific order. However, in many cases, translation can be performed directly during parsing, without explicitly building the tree.
SDD v/s SDT Scheme
Syntax Directed Definition | Syntax Directed Translation |
---|
It is a context-free grammar where attributes and rules are combined and associated with grammar symbols and productions, respectively. | It refers to the translation of a string into an array of actions. This is done by adding an action to a rule of context-free grammar. It is a type of compiler interpretation. |
Attribute Grammar | Translation Schemes |
SDD: Specifies the values of attributes by associating semantic rules with the productions. | SDT: Embeds program fragments (also called semantic actions) within production bodies. |
E → E + T { E.val := E1.val + T.val } | E → E + T { print(‘+’); } |
Always written at the end of the body of production. | The position of the action defines the order in which the action is executed (in the middle of production or at the end). |
More Readable | More Efficient |
Used to specify the non-terminals. | Used to implement S-Attributed SDD and L-Attributed SDD. |
Specifies what calculation is to be done at each production. | Specifies what calculation is to be done at each production and at what time they must be done. |
Left to right evaluation. | Left to right evaluation. |
Used to know the value of non-terminals. | Used to generate Intermediate Code. |
Attributes in Syntax-Directed Translation
An attribute is any quantity associated with a programming construct in a parse tree. Attributes help in carrying semantic information during the compilation process.
Examples of Attributes:
- Data types of variables
- Line numbers for error handling
- Instruction details for code generation
Types of Attributes
1. Synthesized Attributes
- Defined by a semantic rule associated with the production at node N in the parse tree.
- Computed only using the attribute values of the children and the node itself.
- Mostly used in bottom-up evaluation.
2. Inherited Attributes
- Defined by a semantic rule associated with the parent production of node N.
- Computed using the attribute values of the parent, siblings, and the node itself.
- Used in top-down evaluation.
Read about Differences Between Synthesized and Inherited Attributes.
Attribute Grammars
An Attributed Grammar is a special type of grammar used in compiler design to add extra information (attributes) to syntax rules. This helps in semantic analysis, such as type checking, variable classification, and ensuring correctness in programming languages.
Think of it like a regular grammar with extra labels that help check things like variable types, correctness of expressions, and rule enforcement.
Example of an Attribute Grammar
Production Rule | Semantic Rule |
---|
D → T L | L.in := T.type (Passes type information) |
T → int | T.type := integer (Defines type as integer) |
T → real | T.type := real (Defines type as real) |
L → L1 , id | L1.in := L.in
addtype (id.entry, L.in) (Passes type info to child and updates symbol table) |
L → id | addtype (id.entry, L.in) (Adds type info to symbol table) |
- D → T L → The type from
T
is passed to L
. - T → int / real → Assigns type
integer
or real
to T
. - L → id → Assigns type information to identifier (
id
).
Grammar and Translation Rules
SDT Scheme | SDD Scheme |
---|
E → E + T{ print('+') } | E → E + T E.code = E.code || T.code || '+' |
E → E - T{ print('-') } | E → E - T `E.code = E.code || T.code || '-' |
E → T | E → TE.code = T.code |
T → 0{ print('0') } | T → 0T.code = '0' |
T → 1{ print('1') } | T → 1T.code = '1' |
... | ... |
T → 9{ print('9') } | T → 9T.code = '9' |
To evaluate translation rules, we can employ one depth-first search traversal on the parse tree. This is possible only because SDT rules don’t impose any specific order on evaluation until children's attributes are computed before parents for a grammar having all synthesized attributes. Otherwise, we would have to figure out the best-suited plan to traverse through the parse tree and evaluate all the attributes in one or more traversals. For better understanding, we will move bottom-up in the left to right fashion for computing the translation rules .Â
Syntax-Directed Translation (SDT) allows us to evaluate arithmetic expressions while parsing. It uses attributes associated with grammar symbols and rules to compute values as we process the parse tree.
The following context-free grammar (CFG) defines an arithmetic expression with addition (+) and multiplication (*):
E -> E + T { E.val = E.val + T.val } // PR#1
E -> T { E.val = T.val } // PR#2
T -> T * F { T.val = T.val * F.val } // PR#3
T -> F { T.val = F.val } // PR#4
F -> INTLIT { F.val = INTLIT.lexval } // PR#5
Each production rule has a semantic action in {}
that defines how values are computed.
- E, T, and F are non-terminals (expression components).
- INTLIT represents an integer literal (actual number).
- val is an attribute used to store computed values at each step.
Let’s evaluate the expression: S = 2 + 3 * 4
Step 1: Build the Parse Tree
The parse tree for 2 + 3 * 4
is structured like this:
E
/|\
E + T
/ /|\
T T * F
/ / |
F F 4
| |
2 3
Step 2: Apply Translation Rules (Bottom-Up Evaluation)
We evaluate the expression step by step in a bottom-up manner (from leaves to root)
F → 2 → F.val = 2
F → 3 → F.val = 3
F → 4 → F.val = 4
T → F (T gets F's value) → T.val = 3
T → T * F → T.val = 3 * 4 = 12
E → T (E gets T's value) → E.val = 12
E → E + T → E.val = 2 + 12 = 14
Thus, the final computed value of 2 + 3 * 4
is 14.

 Advantages of Syntax Directed Translation:
Ease of implementation: SDT is a simple and easy-to-implement method for translating a programming language. It provides a clear and structured way to specify translation rules using grammar rules.
Separation of concerns: SDT separates the translation process from the parsing process, making it easier to modify and maintain the compiler. It also separates the translation concerns from the parsing concerns, allowing for more modular and extensible compiler designs.
Efficient code generation: SDT enables the generation of efficient code by optimizing the translation process. It allows for the use of techniques such as intermediate code generation and code optimization.
Disadvantages of Syntax Directed Translation:
Limited expressiveness: SDT has limited expressiveness in comparison to other translation methods, such as attribute grammars. This limits the types of translations that can be performed using SDT.
Inflexibility: SDT can be inflexible in situations where the translation rules are complex and cannot be easily expressed using grammar rules.
Limited error recovery: SDT is limited in its ability to recover from errors during the translation process. This can result in poor error messages and may make it difficult to locate and fix errors in the input program.
Similar Reads
Compiler Design Tutorial
A compiler is software that translates or converts a program written in a high-level language (Source Language) into a low-level language (Machine Language or Assembly Language). Compiler design is the process of developing a compiler.It involves many stages like lexical analysis, syntax analysis (p
3 min read
Introduction
Introduction of Compiler Design
A compiler is software that translates or converts a program written in a high-level language (Source Language) into a low-level language (Machine Language or Assembly Language). Compiler design is the process of developing a compiler.The development of compilers is closely tied to the evolution of
9 min read
Compiler construction tools
The compiler writer can use some specialized tools that help in implementing various phases of a compiler. These tools assist in the creation of an entire compiler or its parts. Some commonly used compiler construction tools include: Parser Generator - It produces syntax analyzers (parsers) from the
4 min read
Phases of a Compiler
A compiler is a software tool that converts high-level programming code into machine code that a computer can understand and execute. It acts as a bridge between human-readable code and machine-level instructions, enabling efficient program execution. The process of compilation is divided into six p
10 min read
Symbol Table in Compiler
Every compiler uses a symbol table to track all variables, functions, and identifiers in a program. It stores information such as the name, type, scope, and memory location of each identifier. Built during the early stages of compilation, the symbol table supports error checking, scope management, a
8 min read
Error Detection and Recovery in Compiler
Error detection and recovery are essential functions of a compiler to ensure that a program is correctly processed. Error detection refers to identifying mistakes in the source code, such as syntax, semantic, or logical errors. When an error is found, the compiler generates an error message to help
6 min read
Error Handling in Compiler Design
During the process of language translation, the compiler can encounter errors. While the compiler might not always know the exact cause of the error, it can detect and analyze the visible problems. The main purpose of error handling is to assist the programmer by pointing out issues in their code. E
5 min read
Language Processors: Assembler, Compiler and Interpreter
Computer programs are generally written in high-level languages (like C++, Python, and Java). A language processor, or language translator, is a computer program that convert source code from one programming language to another language or to machine code (also known as object code). They also find
5 min read
Generation of Programming Languages
Programming languages have evolved significantly over time, moving from fundamental machine-specific code to complex languages that are simpler to write and understand. Each new generation of programming languages has improved, allowing developers to create more efficient, human-readable, and adapta
6 min read
Lexical Analysis
Introduction of Lexical Analysis
Lexical analysis, also known as scanning is the first phase of a compiler which involves reading the source program character by character from left to right and organizing them into tokens. Tokens are meaningful sequences of characters. There are usually only a small number of tokens for a programm
6 min read
Flex (Fast Lexical Analyzer Generator)
Flex (Fast Lexical Analyzer Generator), or simply Flex, is a tool for generating lexical analyzers scanners or lexers. Written by Vern Paxson in C, circa 1987, Flex is designed to produce lexical analyzers that is faster than the original Lex program. Today it is often used along with Berkeley Yacc
7 min read
Introduction of Finite Automata
Finite automata are abstract machines used to recognize patterns in input sequences, forming the basis for understanding regular languages in computer science. They consist of states, transitions, and input symbols, processing each symbol step-by-step. If the machine ends in an accepting state after
4 min read
Ambiguous Grammar
Context-Free Grammars (CFGs) is a way to describe the structure of a language, such as the rules for building sentences in a language or programming code. These rules help define how different symbols can be combined to create valid strings (sequences of symbols).CFGs can be divided into two types b
7 min read
Parsers
Parsing - Introduction to Parsers
Parsing, also known as syntactic analysis, is the process of analyzing a sequence of tokens to determine the grammatical structure of a program. It takes the stream of tokens, which are generated by a lexical analyzer or tokenizer, and organizes them into a parse tree or syntax tree.The parse tree v
6 min read
Classification of Top Down Parsers
Top-down parsing is a way of analyzing a sentence or program by starting with the start symbol (the root of the parse tree) and working down to the leaves (the actual input symbols). It tries to match the input string by expanding the start symbol using grammar rules. The process of constructing the
4 min read
Bottom-up Parsers
Bottom-up parsing is a type of syntax analysis method where the parser starts from the input symbols (tokens) and attempts to reduce them to the start symbol of the grammar (usually denoted as S). The process involves applying production rules in reverse, starting from the leaves of the parse tree a
10 min read
Shift Reduce Parser in Compiler
Shift-reduce parsing is a popular bottom-up technique used in syntax analysis, where the goal is to create a parse tree for a given input based on grammar rules. The process works by reading a stream of tokens (the input), and then working backwards through the grammar rules to discover how the inpu
11 min read
SLR Parser (with Examples)
LR parsers is an efficient bottom-up syntax analysis technique that can be used to parse large classes of context-free grammar is called LR(k) parsing. L stands for left-to-right scanningR stands for rightmost derivation in reversek is several input symbols. when k is omitted k is assumed to be 1.Ad
4 min read
CLR Parser (with Examples)
LR parsers :It is an efficient bottom-up syntax analysis technique that can be used to parse large classes of context-free grammar is called LR(k) parsing. L stands for the left to right scanningR stands for rightmost derivation in reversek stands for no. of input symbols of lookahead Advantages of
7 min read
Construction of LL(1) Parsing Table
Parsing is an essential part of computer science, especially in compilers and interpreters. From the various parsing techniques, LL(1) parsing is best. It uses a predictive, top-down approach. This allows efficient parsing without backtracking. This article will explore parsing and LL(1) parsing. It
6 min read
LALR Parser (with Examples)
LALR Parser :LALR Parser is lookahead LR parser. It is the most powerful parser which can handle large classes of grammar. The size of CLR parsing table is quite large as compared to other parsing table. LALR reduces the size of this table.LALR works similar to CLR. The only difference is , it combi
6 min read
Syntax Directed Translation
Code Generation and Optimization
Code Optimization in Compiler Design
Code optimization is a crucial phase in compiler design aimed at enhancing the performance and efficiency of the executable code. By improving the quality of the generated machine code optimizations can reduce execution time, minimize resource usage, and improve overall system performance. This proc
9 min read
Intermediate Code Generation in Compiler Design
In the analysis-synthesis model of a compiler, the front end of a compiler translates a source program into an independent intermediate code, then the back end of the compiler uses this intermediate code to generate the target code (which can be understood by the machine). The benefits of using mach
6 min read
Issues in the design of a code generator
A code generator is a crucial part of a compiler that converts the intermediate representation of source code into machine-readable instructions. Its main task is to produce the correct and efficient code that can be executed by a computer. The design of the code generator should ensure that it is e
7 min read
Three address code in Compiler
TAC is an intermediate representation of three-address code utilized by compilers to ease the process of code generation. Complex expressions are, therefore, decomposed into simple steps comprising, at most, three addresses: two operands and one result using this code. The results from TAC are alway
6 min read
Data flow analysis in Compiler
Data flow is analysis that determines the information regarding the definition and use of data in program. With the help of this analysis, optimization can be done. In general, its process in which values are computed using data flow analysis. The data flow property represents information that can b
6 min read
Compiler Design | Detection of a Loop in Three Address Code
Prerequisite - Three address code in Compiler Loop optimization is the phase after the Intermediate Code Generation. The main intention of this phase is to reduce the number of lines in a program. In any program majority of the time is spent actually inside the loop for an iterative program. In the
3 min read
Introduction of Object Code in Compiler Design
Let assume that, you have a c program, then you give the C program to compiler and compiler will produce the output in assembly code. Now, that assembly language code will give to the assembler and assembler is going to produce you some code. That is known as Object Code. In the context of compiler
7 min read
Data flow analysis in Compiler
Data flow is analysis that determines the information regarding the definition and use of data in program. With the help of this analysis, optimization can be done. In general, its process in which values are computed using data flow analysis. The data flow property represents information that can b
6 min read
Compiler Design GATE PYQ's and MCQs