0% found this document useful (0 votes)

2 views20 pages

Lecture 10

The document outlines the processes of forward and reverse engineering in programming, detailing the steps involved in building and deconstructing software. It discusses the tools used in each process, the importance of registers in assembly language, and the methods for managing control flow and function calls. Additionally, it covers memory addressing, data storage, and the significance of calling conventions in function execution.

Uploaded by

jowaowa101

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views20 pages

Lecture 10

Uploaded by

jowaowa101

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 20

Reverse Engineering

Lecture 10
Design

The Forward Engineering Process Code

Compile
"Forward Engineering" is an overloaded term, but
in this context, it is the process of building a Fix Tons of Bugs
program.
Compile

1. Figure out what you want to code. Extensive Cursing

2. Code it.
3. Compile it. Fix More Bugs
4. Run it.
Compile

At every step, information is lost! Assemble

Design

Forward Engineering Tools Code

Compile
Let's look at some tools:
Fix Tons of Bugs
- Visual Studio/ an IDE
- Compile
gcc
- Strings Extensive Cursing

Fix More Bugs

Viola! An ELF is born. Compile

Assemble
The Reverse Engineering Process Understand

Every step in the reverse-engineering process is

imperfect and relies on some amount of human Lots of Thinking
help.

Typically, a reverser use several reverse Decompile

engineering tools to build up a mental model of
the target software.
Disassemble
This art is the focus of this module: how do
we reverse the design from the binary?
Assembly Refresher
#
Registers rax

rdx

Registers are very fast, temporary stores for data. rsi

You get several "general purpose" registers:

- 8085: a, c, d, b, e, h, l
- 8086: ax, cx, dx, bx, sp, bp, si, di
- x86: eax, ecx, edx, ebx, esp, ebp, esi, edi
- amd64: rax, rcx, rdx, rbx, rsp, rbp, rsi, rdi, r8, r9, r10, r11, r12, r13, r14, r15
- arm: r0, r1, r2, r3, r4, r5, r6, r7, r8, r9, r10, r11, r12, r13, r14

The address of the next instruction is in a register:

eip (x86), rip (amd64), r15 (arm)

Various extensions add other registers (x87, MMX, SSE, etc).

#
Setting Registers
You load data into registers with... assembly! "mov" means "move".
mov rax, 0x539
mov rbx, 1337

Data specified directly in the instruction like this is called an

Immediate Value.
You can also load data into partial registers:
mov ah, 0x5
mov al, 0x39
#
Register Arithmetic
Once you have data in registers, you can compute!
For most arithmetic instructions, the first specified register stores the result.
Instruction C / Math equivalent Description
add rax, rbx rax = rax + rbx add rax to rbx
sub ebx, ecx ebx = ebx - ecx subtract ecx from ebx
imul rsi, rdi rsi = rsi * rdi multiple rsi to rdi, truncate to 64-bits
inc rdx rdx = rdx + 1 increment rdx
dec rdx rdx = rdx - 1 decrement rdx
neg rax rax = 0 - rax negate rax in terms of numerical value
not rax rax = ~rax negate each bit of rax
and rax, rbx rax = rax & rbx bitwise AND between the bits of rax and rbx
or rax, rbx rax = rax | rbx bitwise OR between the bits of rax and rbx
xor rcx, rdx rcx = rcx ^ rdx bitwise XOR (don't confuse ^ for exponent!)
shl rax, 10 rax = rax << 10 shift rax's bits left by 10, filling with 10 zeroes on the right

shr rax, 10 rax = rax >> 10 shift rax's bits right by 10, filling with 10 zeroes on the left
shift rax's bits right by 10, with sign-extension to fill the now
sar rax, 10 rax = rax >> 10
"missing" bits!
ror rax, 10 rax = (rax >> 10) | (rax << 54) rotate the bits of rax right by 10

rol rax, 10 rax = (rax << 10) | (rax >> 54) rotate the bits of rax left by 10

Curious how these work? Play around with the rappel tool ( https://github.com/yrp604/rappel)!
#
Memory (stack)
The stack has several uses. For now, we'll talk about temporary data
storage.
Registers and immediates can be pushed onto the stack to save
values:
mov rax, 0xc001ca75
push rax
push 0xb0bacafe # WARNING: even on 64-bit x86, you can only push 32-bit immediates...

c001ca75

c001ca75
b0bacafe
push rax
stack
(Like mov, push leaves the value in the src register intact.)

Values can be popped back off of the stack (to any register!).
pop rbx # sets rbx to 0xc001ca75

c001ca75
stack
pop rcx # sets rcx to 0xb0bacafe
#
Addressing the Stack
The CPU knows where the stack is because its address is stored in
rsp = 0x7f01f3453050
rsp.

0x7f01f345305
0

c001ca75
stack

rsp = 0x7f01f3453048

push 0xb0bacafe

0x7f01f345304
8

c001ca75
b0bacafe
stack

rsp = 0x7f01f3453050
pop rcx

0x7f01f345305
0

c001ca75
stack

Historical oddity: the stack grows backwards toward

smaller memory addresses!
push decreases rsp, pop increases it.
#
Accessing Memory
You can also move data between registers and memory with ... mov!
This will load the 64-bit value stored at memory address 0x12345 into
rbx:
mov rax, 0x12345
mov rbx, [rax]

This will store the 64-bit value in rbx into memory at address
0x133337:
mov rax, 0x133337
mov [rax], rbx

This is equivalent to push rcx:

sub rsp, 8
mov [rsp], rcx

Each addressed memory location contains one byte.

#
Memory Endianess
Data on most modern systems is stored backwards, in little endian.
ah al

mov eax, 0xc001ca75 # sets rax to c0 01 ca 75

mov rcx, 0x10000 0x10000 0x10001 0x10002 0x10003

mov [rcx], eax # stores data as 75 ca 01 c0

mov bh, [rcx] # reads 0x75

https://en.wikipedia.org/wiki/Endianness
Assembly Crash Course
#
Computers Make Decisions
if (authenticated) {
leetness = 1337;
}
else {
leetness = 0;
}
So far, we've just shunted data around.
But how do we make decisions?
#
What to Execute?
First, let's look at how computers execute instructions.
Recall: Assembly instructions are direct translations of binary code.
This binary code lives in memory.
0x10000 0x7fffffffffff

Dynamically Allocated Memory OS Helper

Program Binary Code (managed by libraries) Library Code Process Stack Regions

Example:
0x400800

Program add rax,

pop rax pop rbx push rax
Binary Code rbx

This is (in hex):

0x400800 0x400801 0x400802 0x400805

Program
58 5b 48 01 d8 50
Binary Code
#
Control Flow: Jumps
CPUs execute instructions in sequence until told not to.
One way to interrupt the sequence is with a jmp instruction:
mov cx, 1337
jmp STAY_LEET
mov cx, 0
STAY_LEET:
push rcx
0x400800 STAY_LEET

Program mov rcx, 0x1337 jmp STAY_LEET mov rcx, 0 push rcx
Binary Code

STAY_LEET
0x400800 0x400804 0x400806 0x40080a
eb 04
Program
66 b9 37 13 (skip 4 66 b9 00 00 51
Binary Code bytes)

jmp skips X bytes and then resumes execution!

But that's still not enough for decisions...
je jump if equal
jn jump if not equal
e jump if greater
#
Control Flow: Conditional Jumps! jg
jl
jle
jump if less
jump if less than or equal
jump if greater than or equal
jg jump if above (unsigned)
e jump if below (unsigned)
Jumps can rely on conditions! ja jump if above or equal
mov cx, 1337 jb (unsigned)
jnz STAY_LEET ja jump if below or equal
mov cx, 0 e (unsigned)
jb jump if signed
STAY_LEET: e jump if not signed
push rcx js jump if overflow
jn jump if not overflow
0x400800 STAY_LEET
s jump if zero
Program
Binary Code
mov rcx, 0x1337 jmp STAY_LEET mov rcx, 0 push rcx jo jump if not zero
jn
STAY_LEET o
0x400800 0x400804 0x400806 0x40080a
jz
Program
Binary Code
66 b9 37 13 75 04 66 b9 00 00 51 jn
z

jnz is "jump if not zero", but if what is not zero?

je jump if equal ZF=1
jn jump if not equal ZF=0
e jump if greater ZF=0 and SF=OF
#
Control Flow: Conditions jg
jl
jle
jump if less
jump if less than or equal
jump if greater than or equal
SF!=OF
ZF=1 or SF!=OF
SF=OF
jg jump if above (unsigned) CF=0 and ZF=0
e jump if below (unsigned) CF=1
Conditional jumps check Conditions ja jump if above or equal CF=0
stored in the "flags" register: rflags. jb (unsigned) CF=1 or ZF=1
ja jump if below or equal SF=1
e (unsigned) SF=0
Flags are updated by: jb jump if signed OF=1
Most arithmetic instructions.
e jump if not signed OF=0
Comparison instruction cmp (sub, but discards result).
js jump if overflow ZF=1
Comparison instruction test (and, but discards result).
jn jump if not overflow ZF=0
s jump if zero
Main conditional flags: jo jump if not zero
Carry Flag: was the 65th bit 1? jn
Zero Flag: was the result 0?
o
Overflow Flag: did the result "wrap" between positive to negative?
jz
Signed Flag: was the result's signed bit set (i.e., was it negative)?
jn
z
Common patterns:
cmp rax, rbx; ja STAY_LEET # unsigned rax > rbx. 0xffffffff >= 0
cmp rax, rbx; jle STAY_LEET # signed rax <= rbx. 0xffffffff = -1 < 0
test rax, rax; jnz STAY_LEET # rax != 0
cmp rax, rbx; je STAY_LEET # rax == rbx

Thanks to Two's Complement, only the jumps themselves have to be signedness-aware.

#
Control Flow: Function Calls!
Assembly code is split into functions with call and ret.
call pushes rip (address of the next instruction after the call) and jumps away!
ret pops rip and jumps to it!

Using a function that takes an authenticated value and returns

leetness: int check_leet(int authed)
mov rdi, 0
call FUNC_CHECK_LEET {
mov rdi, 1 if (authed) return 1337;
call FUNC_CHECK_LEET
call EXIT else return 0;
}
FUNC_CHECK_LEET:
test rdi, rdi
jnz LEET int main() {
mov ax, 0
ret
check_leet(0);
LEET: check_leet(1);
mov ax, 1337 exit();
ret
}
FUNC_EXIT:
#
Calling Conventions
Callee and caller functions must agree on argument passing.
Linux x86: push arguments (in reverse order), then call (which pushes return address),
return value in eax
Linux amd64: rdi, rsi, rdx, rcx, r8, r9, return value in rax
Linux arm: r0, r1, r2, r3, return value in r0

Registers are shared between functions, so calling conventions should

agree on what registers are protected.
Linux amd64.
rbx, rbp, r12, r13, r14, r15 are "callee-saved"
(the function you call keeps their values safe on the stack).
Other registers are up for grabs
(within reason; e.g., rsp must be maintained). Save their values (on the stack)!

Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
981RA Installation Manual
No ratings yet
981RA Installation Manual
49 pages
PHP A Visual Blueprint
100% (1)
PHP A Visual Blueprint
132 pages
01 Lecture02
No ratings yet
01 Lecture02
78 pages
Intel I
No ratings yet
Intel I
72 pages
x86_assembly_reversing_cheat_sheet
No ratings yet
x86_assembly_reversing_cheat_sheet
1 page
ch4 Handouts
No ratings yet
ch4 Handouts
72 pages
Fundamentals - Assembly
No ratings yet
Fundamentals - Assembly
18 pages
Review of Assembly Language: Program "Text" Contains Binary Instructions
No ratings yet
Review of Assembly Language: Program "Text" Contains Binary Instructions
27 pages
MP Assignment1
No ratings yet
MP Assignment1
7 pages
Intel Cheat Sheet
No ratings yet
Intel Cheat Sheet
8 pages
Multicore and Assembly Language
No ratings yet
Multicore and Assembly Language
31 pages
8051 Assembly Language
No ratings yet
8051 Assembly Language
39 pages
x86-64 Intel Cheat Sheet Summary
100% (1)
x86-64 Intel Cheat Sheet Summary
10 pages
6 Machine - Intro v2
No ratings yet
6 Machine - Intro v2
29 pages
A Crash Course On x86 Disassembly
No ratings yet
A Crash Course On x86 Disassembly
23 pages
Integers Floating Point: N N S E
No ratings yet
Integers Floating Point: N N S E
4 pages
Assembly #1
No ratings yet
Assembly #1
8 pages
Practical Malware Analysis: CH 4: A Crash Course in x86 Disassembly
No ratings yet
Practical Malware Analysis: CH 4: A Crash Course in x86 Disassembly
50 pages
Week 3 - Lecture
No ratings yet
Week 3 - Lecture
68 pages
CSE331_L3_ARM_ISA
No ratings yet
CSE331_L3_ARM_ISA
103 pages
Arithmetic Instructions
No ratings yet
Arithmetic Instructions
100 pages
Milen_Dimitrov_HW2_Q2
No ratings yet
Milen_Dimitrov_HW2_Q2
28 pages
lecture01-intro
No ratings yet
lecture01-intro
67 pages
Lab 4: Introduction To x86 Assembly
No ratings yet
Lab 4: Introduction To x86 Assembly
14 pages
657668478
No ratings yet
657668478
78 pages
7 Machine - Condition - Codes v2
No ratings yet
7 Machine - Condition - Codes v2
25 pages
EE209A - 24 15 Assembly2
No ratings yet
EE209A - 24 15 Assembly2
45 pages
What S Inside An 8086
No ratings yet
What S Inside An 8086
29 pages
Referral Sheet Format
No ratings yet
Referral Sheet Format
2 pages
Assembly #2
No ratings yet
Assembly #2
5 pages
Assembly
No ratings yet
Assembly
49 pages
x86 Assembly Tutorial: COS 318: Fall 2017
No ratings yet
x86 Assembly Tutorial: COS 318: Fall 2017
23 pages
14 Assembly Instructions
No ratings yet
14 Assembly Instructions
9 pages
Duane Intro X 86
No ratings yet
Duane Intro X 86
6 pages
x86 Instructions - Windows drivers _ Microsoft Learn
No ratings yet
x86 Instructions - Windows drivers _ Microsoft Learn
14 pages
EE209A - 24 14 Assembly1
No ratings yet
EE209A - 24 14 Assembly1
40 pages
Lec3 - RISC-V Assembly
No ratings yet
Lec3 - RISC-V Assembly
56 pages
ARM Assembly Language Guide: Common ARM Instructions (And Psuedo-Instructions)
No ratings yet
ARM Assembly Language Guide: Common ARM Instructions (And Psuedo-Instructions)
7 pages
Chapter 3 Instructions ARM
No ratings yet
Chapter 3 Instructions ARM
35 pages
x86-64 Reference Sheet (GNU Assembler Format) : Arithmetic Operations
No ratings yet
x86-64 Reference Sheet (GNU Assembler Format) : Arithmetic Operations
1 page
osbook-v0.775_removed
No ratings yet
osbook-v0.775_removed
31 pages
Basic Architecture Ia32 x86
No ratings yet
Basic Architecture Ia32 x86
41 pages
Lec08 ARMisa 4up
No ratings yet
Lec08 ARMisa 4up
24 pages
ARM Instruction Set: Computer Organization and Assembly Languages P GZ y GG Yung-Yu Chuang
No ratings yet
ARM Instruction Set: Computer Organization and Assembly Languages P GZ y GG Yung-Yu Chuang
25 pages
Rema Lab 2 Mihirpatel k042
No ratings yet
Rema Lab 2 Mihirpatel k042
6 pages
Mips Instructions
No ratings yet
Mips Instructions
30 pages
2.avr Risc
No ratings yet
2.avr Risc
46 pages
Assembly Notes
No ratings yet
Assembly Notes
18 pages
CS 4740/6740 Network Security: Lecture 7: Memory Corruption (Assembly Review, Basic Exploits)
No ratings yet
CS 4740/6740 Network Security: Lecture 7: Memory Corruption (Assembly Review, Basic Exploits)
189 pages
Offensive Security & Reverse Engineering (OSRE) : Ali Hadi
No ratings yet
Offensive Security & Reverse Engineering (OSRE) : Ali Hadi
110 pages
(Sep 2021) Modern Binary Exploitation
No ratings yet
(Sep 2021) Modern Binary Exploitation
676 pages
Assembly: Arithmetic and Logic: Machine Programming
No ratings yet
Assembly: Arithmetic and Logic: Machine Programming
43 pages
Introduction To Assembly Language
100% (6)
Introduction To Assembly Language
65 pages
CH04-Machine Language I (1)
No ratings yet
CH04-Machine Language I (1)
32 pages
Os Notes Mit
No ratings yet
Os Notes Mit
9 pages
Manual
No ratings yet
Manual
132 pages
Standard Cheat Sheet
No ratings yet
Standard Cheat Sheet
6 pages
Instruction Set 8051 - v1
No ratings yet
Instruction Set 8051 - v1
10 pages
Computer Architecture: Assoc. Prof. Nguyễn Trí Thành, Phd
No ratings yet
Computer Architecture: Assoc. Prof. Nguyễn Trí Thành, Phd
126 pages
Practical Reverse Engineering: x86, x64, ARM, Windows Kernel, Reversing Tools, and Obfuscation
From Everand
Practical Reverse Engineering: x86, x64, ARM, Windows Kernel, Reversing Tools, and Obfuscation
Bruce Dang
No ratings yet
Basic Information About C language PDF
From Everand
Basic Information About C language PDF
Suraj Das
No ratings yet
ATI Imageon 2300
No ratings yet
ATI Imageon 2300
2 pages
Epson EB-500 Series-N PDF
No ratings yet
Epson EB-500 Series-N PDF
8 pages
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
No ratings yet
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
56 pages
Hotel Management System
No ratings yet
Hotel Management System
20 pages
Faro Laser Scanner Focus 150
No ratings yet
Faro Laser Scanner Focus 150
2 pages
12 Animation Reviewer
No ratings yet
12 Animation Reviewer
5 pages
Senzor de Vibratie VS - 125.01
No ratings yet
Senzor de Vibratie VS - 125.01
4 pages
Combien G
No ratings yet
Combien G
16 pages
CCNA1 Chap7 Practice Testanswers
No ratings yet
CCNA1 Chap7 Practice Testanswers
5 pages
25040
No ratings yet
25040
11 pages
MPC Touch Screen Setup
No ratings yet
MPC Touch Screen Setup
4 pages
PISO
No ratings yet
PISO
5 pages
PIPING Isometric Drawings
No ratings yet
PIPING Isometric Drawings
26 pages
74 HC 191
No ratings yet
74 HC 191
10 pages
Bios vs. Uefi
No ratings yet
Bios vs. Uefi
8 pages
E-Guide Data Center Update
No ratings yet
E-Guide Data Center Update
13 pages
C10. Ubuntu and Cloud Computing
No ratings yet
C10. Ubuntu and Cloud Computing
37 pages
BALT443809
No ratings yet
BALT443809
9 pages
Central Superficie Amron Amcom I Manual en
No ratings yet
Central Superficie Amron Amcom I Manual en
45 pages
Fpr2140 NGFW k9 Datasheet
No ratings yet
Fpr2140 NGFW k9 Datasheet
5 pages
Adobe Media Encoder Log-Last
No ratings yet
Adobe Media Encoder Log-Last
8 pages
27 Huawei
No ratings yet
27 Huawei
3 pages
Sony KV 29ls60e Part1
No ratings yet
Sony KV 29ls60e Part1
22 pages
Star Schema B
No ratings yet
Star Schema B
10 pages
UMPT Swap Procedure v1.2
100% (1)
UMPT Swap Procedure v1.2
13 pages
Exynos2200
No ratings yet
Exynos2200
3 pages
Toc QB
No ratings yet
Toc QB
6 pages
First Oculus
No ratings yet
First Oculus
2 pages

Lecture 10

Uploaded by

Lecture 10

Uploaded by

Reverse Engineering

The Forward Engineering Process Code

1. Figure out what you want to code. Extensive Cursing

At every step, information is lost! Assemble

Forward Engineering Tools Code

Fix More Bugs

Viola! An ELF is born. Compile

Every step in the reverse-engineering process is

Typically, a reverser use several reverse Decompile

Registers are very fast, temporary stores for data. rsi

You get several "general purpose" registers:

The address of the next instruction is in a register:

Various extensions add other registers (x87, MMX, SSE, etc).

Data specified directly in the instruction like this is called an

Historical oddity: the stack grows backwards toward

This is equivalent to push rcx:

Each addressed memory location contains one byte.

mov eax, 0xc001ca75 # sets rax to c0 01 ca 75

mov [rcx], eax # stores data as 75 ca 01 c0

Dynamically Allocated Memory OS Helper

Program add rax,

This is (in hex):

jmp skips X bytes and then resumes execution!

jnz is "jump if not zero", but if what is not zero?

Thanks to Two's Complement, only the jumps themselves have to be signedness-aware.

Using a function that takes an authenticated value and returns

Registers are shared between functions, so calling conventions should

You might also like