Lec7_RunAProgram

The document discusses the process of translating a C program into a machine program, highlighting the differences between high-level languages, assembly language, and machine instructions. It explains the architecture of embedded systems, specifically Harvard and Von Neumann architectures, and how they affect memory access and performance. Additionally, it covers the organization of runtime memory images and the importance of register allocation for improving execution speed.

Heaven’s Light Is Our Guide

Rajshahi University of Engineering & Technology

Department of Computer Science & Engineering

Running a Program
CSE 3105 (Computer Interfacing & Embedded Systems)

Md. Nasif Osman Khansur

Lecturer
Dept. of CSE, RUET
Reference
Yifeng Zhu, Embedded Systems with ARM Cortex-M Microcontrollers in Assembly Language and C, 3rd Ed. [Chapter 1]
Translate a C Program into a Machine Program

High-Level Language → Assembly Language → Machine Instructions
Translate a C Program into a Machine Program
❏ Executable files created by compilers are usually platform dependent.
❏ An executable file compiled for one type of microprocessor, such as the ARM
Cortex-M3, cannot run directly on a platform whose microprocessor supports a
different set of machine instructions, such as a PIC or Atmel AVR
microcontroller.
❏ When we migrate a program written in a high-level language to a processor
with a different instruction set, we usually have to modify and recompile the
source program for the new target platform.
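
As a small illustration of why source code is recompiled for each target, the sketch below selects target-specific code at compile time. It assumes a GCC-family compiler, which predefines `__arm__` for 32-bit ARM targets and `__AVR__` for AVR targets; the function name `target_name` is hypothetical and does not come from the lecture.

```c
/* Report which instruction-set family this source file was compiled for.
   The same C source produces a different machine program on each target. */
const char *target_name(void) {
#if defined(__arm__)
    return "ARM";    /* e.g. a Cortex-M build */
#elif defined(__AVR__)
    return "AVR";    /* e.g. an Atmel AVR build */
#else
    return "other";  /* a host PC or any other target */
#endif
}
```

The branch is chosen by the compiler, not at runtime, so each executable contains only the code for its own platform.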
Translate a C Program into a Machine Program

❏ One exception is Java executables, which are platform independent.

❏ The Java compiler converts a Java program into bytecode. A Java
virtual machine (JVM) translates the bytecode into machine instructions
at runtime. Because each platform has its own JVM, Java executables
are platform independent.
❏ Java is not yet popular in embedded systems because it needs more
memory and cannot control peripherals flexibly.
Translate a C Program into a Machine Program

Figure 1. Compiling a C program into a binary executable. Labels in the figure: lexical analysis, syntax analysis, semantic analysis, extracting symbols and checking syntax, intermediate representation, optimization, machine program.

Translate a C Program into a Machine Program
❖ An assembly program includes the following five key components:
1. A label represents the memory address of a data variable or an assembly
instruction.
2. An instruction mnemonic names the operation that the processor should perform,
such as "ADD" for adding integers.
3. Operands of a machine operation can be numeric constants or processor
registers.
4. A program comment aims to improve inter-programmer communication and
code readability by explicitly specifying programmers' intentions, assumptions,
and hidden concepts.
5. An assembly directive is not a machine instruction, but it defines the data
content or provides relevant information to assist the assembler.
Translate a C Program into a Machine Program
An ELF file provides a load view and an execution view. The load view specifies how to load data into
memory. The execution view specifies how to initialize data regions at runtime.
Translate a C Program into a Machine Program
A binary machine program includes four critical sections:
● a text segment that consists of binary machine instructions,
● a read-only data segment that defines the values of variables that
cannot be altered at runtime,
● a read-write data segment that sets the initial values of statically
allocated, modifiable variables, and
● a zero-initialized data segment that holds all statically allocated
variables that are uninitialized in the program.
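
The four sections can be related to ordinary C declarations. The sketch below is illustrative only: the section names in the comments (.text, .rodata, .data, .bss) are the conventional ones in GNU/ELF toolchains, the actual placement is decided by the compiler and linker, and every identifier except `capacity` is hypothetical.

```c
#include <stddef.h>

const int lookup[3] = {10, 20, 30}; /* read-only data segment (often .rodata) */
int capacity = 100;                 /* read-write data segment (often .data)  */
int samples[8];                     /* zero-initialized segment (often .bss)  */

/* The machine instructions of this function belong to the text segment. */
int sum_lookup(void) {
    int total = 0;
    for (size_t i = 0; i < sizeof lookup / sizeof lookup[0]; i++) {
        total += lookup[i];
    }
    return total;
}
```

Only `lookup` and `capacity` need their values stored in the executable; `samples` needs only its size recorded, because it is cleared to zero before the program starts.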
Harvard Architecture and Von Neumann Architecture
Von Neumann Architecture

Instructions and data share the same memory device. There is only one set of
data and address buses, which instructions and data both use, so the data
stream and the instruction stream share the memory bandwidth.
Harvard Architecture

Instructions and data are stored in different memory devices. The instruction
memory and the data memory each have a dedicated set of data and address
buses, so the instruction stream and the data stream transfer information on
separate buses.
Harvard Architecture

In many embedded systems that use the Harvard architecture, the data memory
and the instruction memory are in the same memory address space. Accordingly,
the instruction memory and the data memory can share the same address bus.
Harvard Architecture and Von Neumann Architecture
● The Von Neumann architecture is relatively inexpensive and simple.
● The Harvard architecture allows the processor to access the data memory and the
instruction memory concurrently. By contrast, the Von Neumann architecture allows
only one memory access at any instant; the processor either reads an instruction
or accesses data. Accordingly, the Harvard architecture often offers faster
processing at the same clock rate.
● The Harvard architecture tends to be more energy efficient. For the same
performance requirement, it can often run at a lower clock speed, resulting in
lower power consumption.
● In the Harvard architecture, the data bus and the instruction bus may have different
widths. For example, digital signal processing processors can leverage this feature to
make the instruction bus wider to reduce the number of clock cycles required to load
an instruction.
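
The speed argument above can be made concrete with a deliberately simple model. The sketch assumes that every memory access takes exactly one cycle and ignores caches, pipelines, and wait states, so it is not a description of any real processor; the function names are hypothetical.

```c
/* Toy model: each memory access costs one cycle; nothing else is modelled. */

/* Shared bus: instruction fetches and data accesses are serialized. */
unsigned long von_neumann_cycles(unsigned long instructions,
                                 unsigned long data_accesses) {
    return instructions + data_accesses;
}

/* Separate buses: a data access can overlap with an instruction fetch,
   so the busier of the two buses determines the total. */
unsigned long harvard_cycles(unsigned long instructions,
                             unsigned long data_accesses) {
    return instructions > data_accesses ? instructions : data_accesses;
}
```

Under these assumptions, a program of 1000 instructions with 300 data accesses needs 1300 memory cycles on the shared bus and 1000 with separate buses.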
Creating Runtime Memory Image
❏ ARM Cortex-M3/M4/M7 microprocessors use the
Harvard architecture, and the instruction
memory (flash memory) and the data memory
(SRAM) are built into the processor chip.
❏ Separating the instruction and data memories allows
concurrent accesses to instructions and data, thus
improving the memory bandwidth and the
processor performance.
❏ Typically, the instruction memory is slow but
nonvolatile flash memory, and the data memory is
fast but volatile SRAM.
Creating Runtime Memory Image
❏ At runtime, the data memory is divided into four segments: the initialized data segment,
the zero-initialized (uninitialized) data segment, the heap, and the stack. The processor
allocates the first two segments statically, and their size and location remain unchanged
at runtime. The size of the last two segments changes as the program runs.
❏ The initialized data segment contains global and static variables to which the program
gives initial values. For example, if the C declaration "int capacity = 100;"
appears outside any function (i.e., it is a global variable), the processor places the
variable capacity in the initialized data segment with its initial value when the
processor creates the runtime memory image for this C program.
❏ The zero-initialized data segment contains all global or static variables that are
uninitialized or initialized to zero in the program. For example, a globally declared
string "char name[20];" is stored in the zero-initialized data segment.
Creating Runtime Memory Image
❏ The heap holds all data objects that an application creates dynamically at runtime.
For example, all data objects created by dynamic memory allocation library
functions like malloc() or calloc() in C or by the new operator in C++ are placed in
the heap. The free() function in C or the delete operator in C++ removes a data object
from the heap.
❏ The stack stores the local variables of subroutines, including main(), saves the runtime
environment, and passes arguments to a subroutine. A stack is a first-in, last-out
(FILO) memory region, and the processor places it at the top of the data memory.
When a subroutine declares a local variable, the variable is saved in the stack. When
a subroutine returns, it should pop from the stack all variables it has
pushed.
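
The difference between heap and stack storage can be sketched in a few lines of C. This is a minimal example, not code from the lecture; the function name `make_squares` is hypothetical.

```c
#include <stdlib.h>

/* Returns a heap-allocated array of n squares, or NULL if allocation fails.
   The caller owns the block and must release it with free(). */
int *make_squares(size_t n) {
    int *squares = malloc(n * sizeof *squares); /* the array lives in the heap */
    if (squares == NULL) {
        return NULL;
    }
    for (size_t i = 0; i < n; i++) {
        int value = (int)(i * i); /* local variable: on the stack or in a register */
        squares[i] = value;
    }
    return squares; /* the locals vanish on return; the heap block survives */
}
```

A caller would use it as `int *s = make_squares(4);`, read `s[0]` to `s[3]`, and then call `free(s)`. The array outlives the call because it is in the heap, whereas `value` and `i` are gone as soon as the function returns.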
Creating Runtime Memory Image

A processor of Harvard architecture loads a program into the instruction memory and the data memory.
Reusing Registers to Improve Performance
Question: Why are some variables stored in registers rather than in memory?

Answer: Variables stored in registers rather than memory are typically those that are
heavily used and require fast access. Registers are much faster to access than memory
because they are part of the CPU itself, whereas accessing memory involves traversing
buses and interacting with potentially slower components. When a variable is stored in
a register, it means that the CPU can directly manipulate the variable's value without
having to fetch it from memory. This can significantly speed up the execution of code
that heavily uses these variables, particularly in tight loops or performance-critical
sections of code. In C, the register keyword can be used as a hint to the compiler
that a particular variable should be stored in a register if possible (C++ inherited
the keyword, but C++17 removed this use of it). However, modern compilers usually
make these decisions automatically based on their optimization algorithms and the
characteristics of the target hardware.
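
The register hint mentioned in the answer looks like this in C. It is a minimal sketch with a hypothetical function name; whether `total` and `i` really end up in registers is up to the compiler, with or without the keyword.

```c
/* Sum n integers. 'register' is only a hint that the compiler may ignore. */
int sum_array(const int *data, int n) {
    register int total = 0;  /* heavily used accumulator */
    for (register int i = 0; i < n; i++) {
        total += data[i];    /* data[i] is read from memory, total is not */
    }
    return total;            /* taking &total would be a compile error in C */
}
```

The loop reads each array element from memory once, while the accumulator and the loop counter are reused on every iteration, which is exactly the kind of variable that benefits from living in a register.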
Reusing Registers to Improve Performance
Self Study: Temporal, Spatial Locality, Register Allocation, Processor Registers
(Article 1.3.1 & 1.3.2)

Life cycle of an instruction:

● At the first stage, the processor fetches 4 bytes from the instruction memory and
increments the program counter by 4 automatically. After each instruction fetch,
the program counter points to the next instruction(s) to be fetched.
● At the second stage, the processor decodes the instruction and determines which
operations are to be carried out.
● At the last stage, the processor reads the operand registers, carries out the designated
arithmetic or logic operation, accesses data memory (if necessary), and updates
the target registers (if needed).
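
The three stages can be sketched as a loop over a toy machine. Everything about this machine is an assumption made for illustration: the 32-bit encoding, the three opcodes, and the names `Cpu`, `encode`, and `run` are hypothetical and are not the ARM instruction set.

```c
#include <stdint.h>

/* Hypothetical encoding: [31:24] opcode, [23:16] rd, [15:8] rs,
   [7:0] second source register or immediate value. */
enum { OP_HALT = 0, OP_LOADI = 1, OP_ADD = 2 };

typedef struct {
    uint32_t pc;     /* program counter, a byte address */
    int32_t reg[4];  /* four general-purpose registers  */
} Cpu;

uint32_t encode(uint8_t op, uint8_t rd, uint8_t rs, uint8_t low) {
    return ((uint32_t)op << 24) | ((uint32_t)rd << 16) |
           ((uint32_t)rs << 8) | low;
}

/* Runs until HALT or the end of instruction memory.
   Returns the number of instructions executed (HALT is not counted;
   unknown opcodes act as no-ops). */
int run(Cpu *cpu, const uint32_t *imem, uint32_t n_words) {
    int executed = 0;
    while (cpu->pc / 4 < n_words) {
        /* Stage 1: fetch 4 bytes and advance the program counter by 4. */
        uint32_t instr = imem[cpu->pc / 4];
        cpu->pc += 4;

        /* Stage 2: decode the instruction fields. */
        uint8_t op = (uint8_t)(instr >> 24);
        uint8_t rd = (instr >> 16) & 0x3;
        uint8_t rs = (instr >> 8) & 0x3;
        uint8_t low = instr & 0xFF;

        /* Stage 3: execute and update the target register. */
        if (op == OP_HALT) {
            break;
        }
        if (op == OP_LOADI) {
            cpu->reg[rd] = low;
        } else if (op == OP_ADD) {
            cpu->reg[rd] = cpu->reg[rs] + cpu->reg[low & 0x3];
        }
        executed++;
    }
    return executed;
}
```

For a four-word program that loads 5 and 7 into two registers, adds them into a third, and halts, `run` executes three instructions and leaves the program counter at 16, since the counter is advanced on every fetch, including the fetch of HALT.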
Reusing Registers to Improve Performance
Pipelining allows multiple instructions to be in progress at the same time, each in a
different stage. Thus, it increases the utilization of hardware resources and improves
the processor's overall performance.
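
The benefit can be estimated with the usual idealized cycle count. The sketch assumes one cycle per stage and no stalls or branch penalties, which real pipelines do not guarantee; the function names are hypothetical.

```c
/* Cycles to finish n instructions on a k-stage datapath,
   assuming one cycle per stage and no stalls. */

/* Without pipelining, each instruction occupies all k stages in turn. */
unsigned long unpipelined_cycles(unsigned long k, unsigned long n) {
    return k * n;
}

/* With pipelining, the first instruction takes k cycles and each later
   instruction completes one cycle after the previous one. */
unsigned long pipelined_cycles(unsigned long k, unsigned long n) {
    return n == 0 ? 0 : k + (n - 1);
}
```

With the three stages described above and 100 instructions, the model gives 300 cycles without pipelining and 102 with it.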
Executing a Machine Program
Extended Study: Loading the program (1.4.1), Starting the Execution (1.4.2),
Program Completion (1.4.3)
