Decompilation
1 Introduction
In this lecture, we consider the problem of doing compilation “backwards”: that is, transforming a
compiled binary into a reasonable representation of its original source. Solving this problem will involve
significant consideration of our standard dataflow analyses, as well as a discussion of good selection of internal
representations of code.
While the motivation for the existence of compilers is fairly clear, the motivation for the existence of
decompilers is less so. However, in the modern world there exist many legacy systems for which the original
source code has been lost, but which need bugs fixed or need to be ported to a more modern architecture.
Decompilers facilitate this process greatly. In addition, in malware analysis, source is generally not provided.
It is therefore extremely useful to have some way to go from binary to a reasonable approximation of the
original code.
For this lecture, we will focus on decompiling machine code, originally C0 code, that conforms to the C
ABI, into a version of C0 with pointer arithmetic and goto. This comes nowhere near being a treatment
of decompilation of arbitrary binaries (and in fact the algorithms as described here will frequently fail to
work on arbitrary binaries!), though more complex variants of the same ideas will continue to work.
2 Steps of Decompilation
Roughly, decompilation follows a few steps:
1. Disassembly - transformation from machine code to the assembly equivalent. There are a surprising
number of pitfalls here.
2. Lifting and dataflow analysis - transforming the resulting assembly code into a higher-level internal
representation, such as our three-operand assembly. One of the tricky parts here is recognizing distinct
variables, and detaching variables from registers or addresses. We also recover expressions, function
return values and arguments.
3. Control flow analysis - recovering control flow structure information, such as if and while statements,
as well as their nesting level.
4. Type analysis - recovering types of variables, functions, and other pieces of data.
3 Disassembly
The first step of writing a good decompiler is writing a good disassembler. While the details of individual
disassemblers can be extremely complex, the general idea is fairly simple. The mapping between assembly
and machine code is in theory one-to-one, so a straight-line translation should be feasible.
However, disassemblers rapidly run into a problem: it is very difficult to reliably distinguish code from
data.
In order to do so, generally disassemblers will take one of two strategies:
1. Disassemble the sections that are generally filled with code (.plt, .text, some others) and treat the
rest of them as data. One tool that follows this strategy is objdump. While this works decently well
on code produced by most modern compilers, there exist (or existed!) compilers that place data into
these executable sections, which confuses the disassembler. Further, instructions at unexpected
alignments will also defeat these disassemblers.
2. Consider the starting address given by the binary’s header, and recursively disassemble all code reach-
able from that address. This approach is frequently defeated by indirect jumps, though most of the
disassemblers that use it have additional heuristics that allow them to deal with this. An example tool
that follows this strategy is Hex-Rays’ Interactive Disassembler (IDA).
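To make strategy 2 concrete, here is a minimal sketch of recursive-descent disassembly in C. The decode
function and the insn_t fields are hypothetical stand-ins for a real instruction decoder, and the flat is_code
array assumes a small code image; real disassemblers layer many heuristics on top of this skeleton.
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
/* Hypothetical decoder interface; these names are illustrative, not from
   any real library. decode() reads one instruction at addr and reports
   its length and any statically-known branch target. */
typedef struct {
    size_t   length;        /* bytes consumed by this instruction    */
    bool     is_branch;     /* direct jump or call with known target */
    bool     falls_through; /* false for ret and unconditional jmp   */
    uint64_t target;        /* valid only when is_branch is true     */
} insn_t;
bool decode(const uint8_t *code, uint64_t addr, insn_t *out);
#define MAX_WORK 4096
/* Recursive-descent disassembly: start at the entry point, follow all
   statically-known control flow, and mark each decoded address as code;
   whatever remains unmarked is treated as data. Indirect jumps defeat
   this sketch entirely, which is why real tools add heuristics. */
void disassemble(const uint8_t *code, uint64_t entry, bool *is_code)
{
    uint64_t work[MAX_WORK];
    size_t n = 0;
    work[n++] = entry;
    while (n > 0) {
        uint64_t addr = work[--n];
        insn_t insn;
        while (!is_code[addr] && decode(code, addr, &insn)) {
            is_code[addr] = true;
            if (insn.is_branch && n < MAX_WORK)
                work[n++] = insn.target;  /* explore the branch target */
            if (!insn.falls_through)
                break;                    /* ret/jmp ends this path    */
            addr += insn.length;
        }
    }
}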
While disassembly is a difficult problem with many pitfalls, it is not particularly interesting from an
implementation perspective for us. Note, however, that many program “obfuscators” include steps specifically
targeted at fooling disassemblers, since without correct disassembly it is impossible to carry out the later steps.
4 Lifting and Dataflow Analysis
The next step is to lift the disassembled code into a higher-level internal representation, such as our
three-operand assembly, and to run several dataflow passes over it:
1. Dead register elimination. For example, if the translation of an x86 division produces
t <- %edx:%eax
%eax <- t / %ecx
%edx <- t % %ecx
and %eax is not live in the successor, it is permissible to remove the second line, since the
third line will still cause the division by 0 when %ecx is zero, preserving the original behavior.
Dead register elimination is done following effectively the same rules as dead code elimination from the
homeworks, with some special cases like the above.
2. Dead flag elimination. Our translation makes direct use of the condition flags, and keeps track of which
of them are defined and used at which time. We treat flags effectively as registers of their own. In
this case, if a flag f is defined at a line l and is not live-in at line l + 1, then we remove the definition of
f from the line l. This will simplify our later analyses greatly, allowing us to collapse conditions more
effectively.
3. Conditional collapsing. At this stage, we collapse sequences of the form comparison-cjump into a
conditional jump on an expression. For example, after flag elimination, we collapse:
zf <- cmp(%eax,0)
jz label
into
cjump (%eax == 0), label
In C0, generally every conditional will have this form. However, sufficiently clever optimizing compilers
may be able to optimize some conditional chains more efficiently. A discussion of transforming more
optimized conditions can be found in Cristina Cifuentes’ thesis.
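As a sketch of what this collapsing pass might look like, the following C fragment pattern-matches adjacent
cmp/jz pairs over a toy instruction array; the instr_t representation is invented for illustration, and a real
pass would handle all the comparison and jump flavors, not just jz.
typedef enum { I_CMP, I_JZ, I_CJUMP, I_OTHER } op_t;
typedef struct {
    op_t op;
    const char *lhs, *rhs; /* comparison operands                  */
    const char *label;     /* jump target, when applicable         */
    int dead;              /* marked dead rather than removed here */
} instr_t;
/* Collapse cmp followed immediately by jz into a single conditional
   jump on the expression lhs == rhs. This assumes dead flag
   elimination has already run, so the flags written by the cmp have
   no uses other than the jz. */
void collapse_conditionals(instr_t *code, int n)
{
    for (int i = 0; i + 1 < n; i++) {
        if (code[i].op == I_CMP && code[i + 1].op == I_JZ) {
            code[i + 1].op  = I_CJUMP; /* jump if lhs == rhs */
            code[i + 1].lhs = code[i].lhs;
            code[i + 1].rhs = code[i].rhs;
            code[i].dead = 1;          /* drop the bare cmp  */
        }
    }
}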
Having reached this point in the analysis, we would like to lose registers. Hence, we may simply replace
each register with an appropriate temp, taking care to keep argument and result registers pinned. We then do
the function-call-expansion step in reverse, replacing sequences of moves into argument registers followed by
a call with a parametrized call. We note that in order to do so, we must first make a pass over all functions
to determine how many arguments they take, in order to deal with the possibility of certain moves being
optimized out.
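For example, assuming the x86-64 convention in which the first two arguments are passed in %rdi and
%rsi and the result is returned in %rax, a sequence such as
%rdi <- t1
%rsi <- t2
call f
t3 <- %rax
is re-rolled into the parametrized call
t3 <- f(t1, t2)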
At this stage, it is possible to effectively perform a slightly modified SSA analysis on the resulting code.
Hence, for the future we will assume that this SSA analysis has been executed, and define our further analysis
over SSA code. We may now perform an extended copy-propagation pass to collapse expressions.
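For example, since every temp in SSA form has exactly one definition, a temp that is used exactly once
can be replaced by its defining expression. Under that rule, the chain
t1 <- a + b
t2 <- t1 * 4
t3 <- t2 + c
collapses into the single expression
t3 <- (a + b) * 4 + c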
This is sufficient to perform the next stages of the analysis. However, many decompilers apply much
more sophisticated techniques to this stage. Cristina Cifuentes’ thesis contains a description of many such
algorithms.
5 Control Flow Analysis
Having recovered variables and expressions, we now wish to recover structured control flow, such as
conditionals and loops, along with its nesting.
5.1 Structuring Loops
We will consider three primary classes of loops. While other loops may appear in decompiled
code, analysis of these more complex loops is more difficult. Further reading can be found in the paper “A
Structuring Algorithm for Decompilation” by Cristina Cifuentes. Our three primary classes are as follows:
1. While loops: the node at the start of the loop is a conditional, and the latching node is unconditional.
2. Repeat loops: the latching node is conditional.
3. Endless loops: both the latching and the start nodes are unconditional.
The latching node here is the node with the back-edge to the start node. We note that there is at most
one latching node per loop in our language, as break and continue do not exist.
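For instance, written in goto form, the first two classes look roughly as follows:
start: if (!cond) goto done
       body
       goto start
done:  ...
is a while loop (conditional start node, unconditional latching node), while
start: body
       if (cond) goto start
is a repeat loop (conditional latching node).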
In order to find these loops, we will consider intervals on a digraph. If h is a node in G, the interval I(h) is the
maximal subgraph in which h is the only entry node and in which all closed paths contain h. It is a theorem
that there exists a set {h1, ..., hk} of header nodes such that the set {I(h1), ..., I(hk)} is a partition of the
graph, and further there exists an algorithm to find this partition.
We then define the sequence of derived graphs of G as follows:
1. G1 = G.
2. Gn+1 is the graph formed by contracting every interval of Gn into a single node.
This procedure eventually reaches a fixed point, at which point no further contraction is possible; the
original graph is called reducible exactly when this fixed point is a single node.
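A minimal sketch of interval construction in C, assuming the graph is small enough for a fixed-size
predecessor matrix: starting from {h}, we repeatedly absorb any node all of whose predecessors already lie
in the interval, since such a node can only be entered through h.
#define MAX_NODES 128
int nnodes;                     /* number of CFG nodes             */
int pred[MAX_NODES][MAX_NODES]; /* pred[n][m] != 0 iff edge m -> n */
/* Compute the interval I(h) as a membership array. */
void interval(int h, int in_interval[MAX_NODES])
{
    for (int n = 0; n < nnodes; n++)
        in_interval[n] = 0;
    in_interval[h] = 1;
    int changed = 1;
    while (changed) {
        changed = 0;
        for (int n = 0; n < nnodes; n++) {
            if (in_interval[n])
                continue;
            int npreds = 0, inside = 0;
            for (int m = 0; m < nnodes; m++)
                if (pred[n][m]) {
                    npreds++;
                    if (in_interval[m]) inside++;
                }
            /* n is absorbed when every predecessor is already inside */
            if (npreds > 0 && inside == npreds) {
                in_interval[n] = 1;
                changed = 1;
            }
        }
    }
}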
Note that for any interval I(h), there exists a loop rooted at h if there is a back-edge to h from some
node z ∈ I(h). One way to find such a node is to simply perform DFS on the interval. Then, in order to
find the nodes in the loop, we define h as being part of the loop and then proceed by noting that a node k
is in the loop if and only if its immediate dominator is in the loop and h is reachable from k.
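A sketch of that membership test, assuming nodes are numbered in reverse post-order (so a node's
immediate dominator always has a smaller number), with idom[] and a precomputed table reaches_h[]
recording whether h is reachable from each node:
/* Collect the loop rooted at header h. in_loop[] is the output. */
void loop_nodes(int h, int nnodes, const int idom[],
                const int reaches_h[], int in_loop[])
{
    for (int k = 0; k < nnodes; k++)
        in_loop[k] = 0;
    in_loop[h] = 1;
    /* Reverse post-order guarantees idom[k] is decided before k. */
    for (int k = 0; k < nnodes; k++)
        if (k != h && in_loop[idom[k]] && reaches_h[k])
            in_loop[k] = 1;
}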
The algorithm for finding loops in the graph then proceeds as follows. Compute the derived graphs of
G until you reach the fixed point, and find the loops in each derived graph. Note that if any node is found
to be the latching node for two loops, one of these loops will need to be labeled with a goto instead. While
there do exist algorithms that can recover more complex structures, this is not one of them.
5.2 Structuring Conditionals
To structure conditionals, we need to find, for each conditional node a, the node at which its two
branches rejoin. The algorithm proceeds as follows:
1. For every conditional node a, find the set of nodes immediately dominated by a.
2. Produce G′ from G by reversing all the arrows. Filter out nodes from the set above that do not
dominate a in G′ (that is, that do not post-dominate a).
3. Find the closest node to a in the resulting set, by considering the one with the highest post-order
number.
The resulting node is the follow node of a.
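A sketch of this computation, reusing MAX_NODES from the interval sketch above and assuming a
precomputed idom[] array, a postdom[n][a] table (dominance in the reversed graph G′), and post-order
numbers:
/* Return the follow node of conditional a, or -1 if none exists, in
   which case the conditional must be emitted using a goto. */
int follow_node(int a, int nnodes, const int idom[],
                const int postdom[][MAX_NODES], const int postorder[])
{
    int best = -1;
    for (int n = 0; n < nnodes; n++) {
        if (idom[n] != a)      /* step 1: immediately dominated by a */
            continue;
        if (!postdom[n][a])    /* step 2: must dominate a in G'      */
            continue;
        if (best < 0 || postorder[n] > postorder[best])
            best = n;          /* step 3: highest post-order number  */
    }
    return best;
}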
We note that this algorithm does not do a particularly good job of dealing with boolean short-circuiting.
Any control flow that does not match the patterns above will be replaced with an if with a goto.
6 Type Analysis
Given control flow and some idea of which variables are which, it is frequently useful to be able to determine
what the types of various variables are. While it may be correct to produce a result where every variable
is of type void *, no one actually writes programs that way. Therefore, we would like to be able to assign
variables and functions their types, as well as hopefully recover structure layout.
A compiler has significant advantages over a decompiler in this respect. The compiler knows which
sections of a structure are padding, and which are actually useful; it also knows which types a function can
take or return. A compiler notices that the functions below are different, and so compiles them separately;
a decompiler may not be able to notice that these functions accept different types without some more
sophisticated analysis. In particular, on a 32-bit machine, these functions will produce identical assembly.
struct s1 { int a; };
int s1_get(struct s1 *s) { return s->a; }
struct s2 { struct s1 *a; };
struct s1 *s2_get(struct s2 *s) { return s->a; }
Instead, a decompiler must infer types from how values are used. A few representative rules (from a
longer list) include:
5. If two variables are added together and one is a pointer, the other is an integer.
6. If two variables are added together and one is an integer, the other is either a pointer or an integer.
7. If two variables are compared with <, >, >= or <=, they are both integers.
8. If two variables are compared with == or !=, they have the same type.
12. The sum of a pointer of type τ* and an integer is a pointer, but not necessarily of type τ*.
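These rules lend themselves to a simple fixed-point propagation. Below is a toy C version over a small
lattice, implementing only the rules shown above (run over every instruction until no type changes); it is
nothing like a full inference such as TIE's.
typedef enum { T_UNKNOWN, T_INT, T_PTR, T_CONFLICT } ty;
/* Combine two pieces of evidence about one variable's type. */
static ty meet(ty a, ty b)
{
    if (a == T_UNKNOWN) return b;
    if (b == T_UNKNOWN) return a;
    return a == b ? a : T_CONFLICT;
}
/* Rules 5 and 6: in an addition, a pointer operand forces the other
   operand to be an integer and the result to be a pointer; an integer
   operand alone forces nothing about the other operand. */
void constrain_add(ty *res, ty *p, ty *q)
{
    if (*p == T_PTR) { *q = meet(*q, T_INT); *res = meet(*res, T_PTR); }
    if (*q == T_PTR) { *p = meet(*p, T_INT); *res = meet(*res, T_PTR); }
}
/* Rule 7: operands of <, <=, >, >= are both integers. */
void constrain_rel(ty *p, ty *q)
{
    *p = meet(*p, T_INT);
    *q = meet(*q, T_INT);
}
/* Rule 8: operands of == and != share a type. */
void constrain_eq(ty *p, ty *q)
{
    ty m = meet(*p, *q);
    *p = *q = m;
}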
We note that in order to get high-quality types, we will often need to perform analysis across function
boundaries. We also note that this analysis is entirely unable to distinguish between structures and arrays.
A more sophisticated type analysis is described in the TIE paper in the references section. There is plenty
of research being done in this area, however!
7 Other Issues
Other issues that haven’t been discussed here include automatically detecting vulnerabilities,
detecting and possibly collapsing aliases, recovering scoping information, extracting inlined functions, and
dealing with tail call optimizations. Many of these problems (and, in fact, many of the things discussed
above!) do not have satisfactory solutions, and remain open research problems. For example, CMU’s CyLab
has a group actively doing research on these topics. They recently (a few days ago!) released a paper
containing a description of their solutions to many of these problems. Since they decompile arbitrary native
code, rather than caring mostly about a specific language, they encounter some very interesting and difficult
problems.
Decompilation as a whole is very much an open research topic, and there exist very few reasonable
decompilers. One of the better-known ones is the Hex-Rays decompiler, and it is sadly entirely closed-
source. As far as I know, there are no high-quality open-source decompilers for x86 or x86-64.
8 References
The material for this lecture was almost entirely gleaned from the following: