0% found this document useful (0 votes)

17K views24 pages

5-Stage Pipeline CPU Hardware

Here are the key points about conditional execution in ARM: - ARM instructions can optionally be made conditional by postfixing a condition code. - The condition code checks the status of flags like N, Z, C, and V set by previous instructions. - If the condition is true based on the flag statuses, the instruction executes normally. - If the condition is false, the instruction does not execute and the pipeline progresses to the next instruction. - Conditional execution allows greater pipeline performance by avoiding stalls when conditions are false. - It also improves code density since conditional instructions don't need separate branch instructions. - Overall, conditional execution enables higher instruction throughput in the ARM pipeline.

Uploaded by

Moksha Patel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPSX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17K views24 pages

5-Stage Pipeline CPU Hardware

Uploaded by

Moksha Patel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPSX, PDF, TXT or read online on Scribd

You are on page 1/ 24

5-stage Pipeline CPU hardware

Pipeline CPU hardware

Distribution of control signals in pipeline CPU

Data hazards
Control hazards
 Control hazard occurs whenever there is change in normal sequential flow of
program (caused by branch/jump, calling subroutine, interrupt, return from
interrupt etc.)
Structural hazards
 [1] multiply instruction holds Ex stage for two or more clock cycle.

 [2]Two or more instructions in pipeline try to read/write register file =>

Since there is only one read/write port, only one instruction is allowed to
read/write register file.
ARM Architecture
 ARM core :
 Pipelined RISC CPU reduced number of fixed size instructions
 Offers high code density, small size, low power
 Applications are cell phones, handheld PDA, camera
 But different from pure RISC (to gain some advantages)
 Variable cycle execution for certain instructions to support multiple
load and store
 Inline barrel shifter leading to few complex instructions –
preprocessing one operand enhances computational power
 Thumb state (16-bit instruction set) to improve code density
 Conditional execution of instructions for smooth pipeline operation
 DSP instructions to support signal processing
 Performance: speed=> MIPS@ Clk freq., DMIPS@ Clk freq.
power=> mW @ (Volt, Clk freq., technology)

6
DMIPS
 Dhrystone is a synthetic benchmark program for system programming. So
DMIPS measures not just instructions per second but gives an idea of how
long overall it will take one processor to perform a task versus another,
taking into account the different number and kinds of instructions.
 The industries have adopted the VAX 11/780 as the reference 1 MIPS
machine. The VAX 11/780 achieves 1757 Dhrystones per second.

 The Dhrystone figure of given computing system is calculated by measuring

the number of Dhrystones executed per second and dividing that by 1757.
So if a computing system able to execute 140560 dhrystones per second,
then its DMIPS rating is 140560/1757 = 80 DMIPS
 To compare two computing systems that run at different clock frequency,
DMIPS is normalized to clock frequency.
e.g. 60 DMIPS @ 40 MHz = 1.5 DMIPS/MHz
 New Benchmarking => CoreMark MIPS

7
 Sign Extend -> converts
signed 8/16 bit to 32 bit
value and places in reg.
 Two source registers (Rn
and Rm) and one result
register Rd
 Barrel shifter =>
preprocess Rm before it
enters to ALU
 MAC unit => for multiply
and accumulation
operation

8
On Chip Debug Hardware

9
ARM Architecture
 ARM Core under study is ARM7TDMI
 ARM state => Instructions are 32-bit wide and address is word aligned
 Thumb state => Instructions are 16-bit and address is half-word aligned
ARM Modes:
 Different Modes of ARM processor are defined for specific purpose
 User mode => most application softwares run in this mode

10
ARM Architecture
 Exception modes => Supervisor, IRQ, FIQ, abort, undefined
 Non exception modes=> User, System
 ‘supervisor’ mode => runs embedded operating system routines
 ‘User’ mode => runs Application programs
 IRQ & FIQ modes => handles hardware interrupts
 Abort mode => handles memory access violations
 Undefined mode => handles undefined instruction
ARM Architecture
CPSR:
 32-bit register with condition flags, control bits, status & ext.
 Only privileged modes have full write access to CPSR
 Every processor mode except user mode can change mode by writing
directly to the mode bits of the CPSR.

 N = 1 if MSB of the ALU result is 1

 Z = 1 if Zero result from ALU
 C = 1 if ALU operation results in Carry (if Subtraction result is -ve =>C reset)
 V =1 if ALU operation oVerflowed (useful for signed numbers only)
 Flags are updated only if suffix ‘S’ is added to instruction 12
ARM Architecture
 When the processor is executing in ARM state:
 All instructions are 32 bits wide
 All instructions must be word aligned
 Therefore the pc value is stored in bits [31:2] with bits [1:0]
undefined (as instruction cannot be halfword or byte aligned).

 When the processor is executing in Thumb state:

 All instructions are 16 bits wide
 All instructions must be halfword aligned
 Therefore the pc value is stored in bits [31:1] with bit [0] undefined
(as instruction cannot be byte aligned).

 When the processor is executing in Jazelle state:

 All instructions are 8 bits wide
 Executes java byte codes

13
Banked Registers:

15
ARM Architecture

 Total 37 registers = 30 general purpose + 6 status + 1 PC

 Different set of register in different mode of operation
 User and System mode uses same set of registers
 Shaded registers (banked registers) are hidden from user/system mode and
available only in exception modes.
 R13 = Stack pointer (SP). Each exception mode has its own SP
 R14 = link register (LR) -> Holds return address of subroutine when it is
called with BL instruction.
 Each exception mode has its own SP and LR
BL <cc> subroutine_label (LR automatically stores return add.)
 The return can be in two ways

 MOV PC, LR or
 B LR

16
ARM Family and Cores

ARM Core Features ARM ISA Thumb

family version version

ARM7TDMI 3-state pipeline, thumb state ARMv4T v1

ARM7 ARM 720T as ARM7TDMI, cache
ARM 740T as ARM7TDMI, cache
ARM 920T 5-stage pipeline, thumb, data and inst. ARMv4T
cache, MMU
ARM 922T 5-stage pipeline, thumb, data and inst.
cache, MMU
ARM9 ARM946E 5-stage pipeline, thumb, Enhanced DSP ARMv5TE
instructions, caches, MPU
ARM926EJ 5-stage pipeline, thumb, Jazelle DBX, ARMv5TEJ
Enhanced DSP instructions, caches, MMU

ARM11 ARM1156T2(F) 8-stage pipeline, SIMD, Thumb-2, VFP, ARMv6T2 v2

Enhanced DSP instructions

ARM Cortex Series: Profile A, Profile R, Profile M

ARM Data Processing
 Syntax : <opcode> {<cc>} {S} Rd, Rn, op2
 ‘op2’ normally comes from barrel shifter and can be the following:

 Rm and Rs should not be PC (r15) in shift/rotate by register mode of ‘op2’

 shift and rotate affects N,Z,C flags
 # value for shift and rotate is 5-bit unsigned integer

18
19
ARM - The Barrel Shifter
LSL : Logical Left Shift ASR: Arithmetic Right Shift

CF Destination 0 Destination CF

Multiplication by a power of 2 Division by a power of 2,

preserving the sign bit
LSR : Logical Shift Right
ROR: Rotate Right

...0 Destination CF Destination CF

Bit rotate with wrap around

Division by a power of 2
from LSB to MSB

RRX: Rotate Right Extended

Destination CF

Single bit rotate with wrap around

from CF to MSB

20
ARM Data Processing Instructions

 CMP,CMN,TST & TEQ always update flags (even if ‘S’ is not used as
suffix) and do not alter any register. They use only Rn and OP2.
 MOV & MVN use only two operands i.e. Rd and ‘op2’

21
Data processing:
 ADD R9, R5, R5, LSL #3 ; R9 = R5+(R5*8) = 9*R5
 RSB R9, R5, R5, LSR #3 ; R9 = (R5/8) – R5
 MOV R12, R4, ROR R3 ;R12= R4 rotated right by value of R3
 CMP R7, R5 ; update flags after (R7-R5)

Conditional Execution:
 ARM instructions can be made to execute conditionally by post fixing
them with the appropriate condition code field. (e.g. MOVEQ R0,R1)
 Condition checks the status of appropriate flags
 If condition is true, normal execution otherwise no execution.
 Adv. => Greater pipeline performance and higher code density leading to
higher instructions throughput

22
ARM Conditional Execution

23
ARM Conditional Execution
 Set the flags, and then use various conditional codes
 CMP r0, # 0 if (a==0) x=0; (here r0 = a, r1= x)
 MOVEQ r1, # 0 if (a>0) x=1;
 MOVGT r1, #1
 Set of Conditional compare instruction
 CMP r0, # 4 if (a==4 or a==10)
 CMPNE r0, #10 x=0;
 MOVEQ r1, # 0

 Reduces number of instructions

While (a!=b) {
if (a>b) a=a-b; else b=b-a; } (here r1 = a, r2= b)
------------------------------------------------------------------------------------------
loop: CMP r1,r2 loop1: CMP r1, r2
BEQ finish SUBGT r1, r1, r2
BLT lessthan SUBLT r2, r2, r1
SUB r1, r1, r2 BNE loop1
B loop
lessthan : SUB r2,r2,r1
B loop
finish

ARM Microcontrollers Programming for Embedded Systems
From Everand
ARM Microcontrollers Programming for Embedded Systems
Sever Spanulescu
5/5 (1)
TestExer5 Econometrics
0% (1)
TestExer5 Econometrics
3 pages
Important Reports in SAP FI
100% (1)
Important Reports in SAP FI
42 pages
5-Stage Pipeline CPU Hardware
No ratings yet
5-Stage Pipeline CPU Hardware
43 pages
5-Stage Pipeline CPU Hardware
No ratings yet
5-Stage Pipeline CPU Hardware
33 pages
ARM Founded in November 1990: Advanced RISC Machines
No ratings yet
ARM Founded in November 1990: Advanced RISC Machines
45 pages
Arm PPT
No ratings yet
Arm PPT
25 pages
ARm Chinmayi PPT Lecture1 Upld 1 ND 2
No ratings yet
ARm Chinmayi PPT Lecture1 Upld 1 ND 2
43 pages
ARM Teaching Material
100% (1)
ARM Teaching Material
33 pages
ARM Teaching Material
No ratings yet
ARM Teaching Material
33 pages
3 ARM Processor
No ratings yet
3 ARM Processor
33 pages
Day2 Arm
No ratings yet
Day2 Arm
29 pages
Unit 4 - ARM Processors
No ratings yet
Unit 4 - ARM Processors
68 pages
ARM Teaching Material
No ratings yet
ARM Teaching Material
33 pages
ARM Register Organization
No ratings yet
ARM Register Organization
33 pages
ARM
No ratings yet
ARM
44 pages
Arm PPT Full
No ratings yet
Arm PPT Full
84 pages
Arm Brief
No ratings yet
Arm Brief
29 pages
ESSAY MICRO Full
No ratings yet
ESSAY MICRO Full
6 pages
Architecture Programmers Model Instruction Set
No ratings yet
Architecture Programmers Model Instruction Set
33 pages
Embedded Lecture 4 ARM
No ratings yet
Embedded Lecture 4 ARM
47 pages
04 - The ARM Architecture and ISA
No ratings yet
04 - The ARM Architecture and ISA
73 pages
Risc Processor - Arm 9
No ratings yet
Risc Processor - Arm 9
84 pages
ARM K
No ratings yet
ARM K
32 pages
Arm7 Architecture
No ratings yet
Arm7 Architecture
20 pages
Arm
No ratings yet
Arm
44 pages
Chapter 1
No ratings yet
Chapter 1
26 pages
The First Encounter: Authors: Nemanja Perovic, Prof. Dr. Veljko Milutinovic
No ratings yet
The First Encounter: Authors: Nemanja Perovic, Prof. Dr. Veljko Milutinovic
44 pages
Unit 2 Es
No ratings yet
Unit 2 Es
78 pages
ARM Architecture
No ratings yet
ARM Architecture
26 pages
Arm Microprocessor
No ratings yet
Arm Microprocessor
22 pages
MC 5
No ratings yet
MC 5
23 pages
The ARM Architecture The ARM Architecture
No ratings yet
The ARM Architecture The ARM Architecture
26 pages
MKC ES Units 3&4 ARM 1
No ratings yet
MKC ES Units 3&4 ARM 1
105 pages
Fat MPMC
No ratings yet
Fat MPMC
97 pages
ARM Processor Instruction Set: Open Access - Preliminary
No ratings yet
ARM Processor Instruction Set: Open Access - Preliminary
50 pages
ARM Architecture Overview
100% (1)
ARM Architecture Overview
19 pages
MCES Unit 1 2 ARM-Instruction-set 2023
No ratings yet
MCES Unit 1 2 ARM-Instruction-set 2023
41 pages
ASM Session1
No ratings yet
ASM Session1
32 pages
ARM2
No ratings yet
ARM2
49 pages
Arm
100% (2)
Arm
44 pages
ARM Overview
No ratings yet
ARM Overview
43 pages
ARM Instruction Set Architecture
No ratings yet
ARM Instruction Set Architecture
8 pages
Arm Inst
No ratings yet
Arm Inst
75 pages
l18 Arm
No ratings yet
l18 Arm
71 pages
ARM Basic Help
No ratings yet
ARM Basic Help
48 pages
ARM: An Advanced Microcontroller
No ratings yet
ARM: An Advanced Microcontroller
54 pages
MS Unit2
No ratings yet
MS Unit2
94 pages
Arm Instruction
No ratings yet
Arm Instruction
102 pages
Embedded Systems Design - 2: Dr. N. Mathivanan
No ratings yet
Embedded Systems Design - 2: Dr. N. Mathivanan
10 pages
ARM
No ratings yet
ARM
40 pages
MPMC Unit - 4
No ratings yet
MPMC Unit - 4
15 pages
ARM Introduction & Instruction Set Architecture: Aleksandar Milenkovic
No ratings yet
ARM Introduction & Instruction Set Architecture: Aleksandar Milenkovic
31 pages
ARM Presentation
No ratings yet
ARM Presentation
51 pages
Practical Reverse Engineering: x86, x64, ARM, Windows Kernel, Reversing Tools, and Obfuscation
From Everand
Practical Reverse Engineering: x86, x64, ARM, Windows Kernel, Reversing Tools, and Obfuscation
Bruce Dang
No ratings yet
LEARN MPLS FROM SCRATCH PART-B: A Beginners guide to next level of networking
From Everand
LEARN MPLS FROM SCRATCH PART-B: A Beginners guide to next level of networking
POONAM DEVI
No ratings yet
ROUTING INFORMATION PROTOCOL: RIP DYNAMIC ROUTING LAB CONFIGURATION
From Everand
ROUTING INFORMATION PROTOCOL: RIP DYNAMIC ROUTING LAB CONFIGURATION
Mulayam Singh
No ratings yet
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
From Everand
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
Franco Mario
No ratings yet
CISCO PACKET TRACER LABS: Best practice of configuring or troubleshooting Network
From Everand
CISCO PACKET TRACER LABS: Best practice of configuring or troubleshooting Network
Mulayam Singh
No ratings yet
Preliminary Specifications: Programmed Data Processor Model Three (PDP-3) October, 1960
From Everand
Preliminary Specifications: Programmed Data Processor Model Three (PDP-3) October, 1960
Digital Equipment Corporation
No ratings yet
First Hop Redundancy Protocol: Network Redundancy Protocol
From Everand
First Hop Redundancy Protocol: Network Redundancy Protocol
Mulayam Singh
No ratings yet
The Book of I²C: A Guide for Adventurers
From Everand
The Book of I²C: A Guide for Adventurers
Randall Hyde
No ratings yet
Date: 13-08-18 Correlation and Autocorrelation Coefficient: Xperiment O
No ratings yet
Date: 13-08-18 Correlation and Autocorrelation Coefficient: Xperiment O
3 pages
Experiment 6: Date:14/07/2018
No ratings yet
Experiment 6: Date:14/07/2018
2 pages
Payment Details Report: Orders Under This Transaction
No ratings yet
Payment Details Report: Orders Under This Transaction
1 page
Go Alpha Curricullum Sample PDF
No ratings yet
Go Alpha Curricullum Sample PDF
2 pages
Digital Signature
No ratings yet
Digital Signature
48 pages
N Naannoom Maakkeerr: Lab$#10:$Organic$Photovoltaics$ (PV) $$
No ratings yet
N Naannoom Maakkeerr: Lab$#10:$Organic$Photovoltaics$ (PV) $$
21 pages
Organic Solar Cells Research Work PDF
No ratings yet
Organic Solar Cells Research Work PDF
1 page
Switching
No ratings yet
Switching
58 pages
Hivemq Ebook MQTT Essentials
No ratings yet
Hivemq Ebook MQTT Essentials
90 pages
CSE 111 - Team Activity - pdf4
No ratings yet
CSE 111 - Team Activity - pdf4
5 pages
Alarm - Shortlist Categorization
No ratings yet
Alarm - Shortlist Categorization
35 pages
Lecture 01 - Fundamentals of Digital Forensics
No ratings yet
Lecture 01 - Fundamentals of Digital Forensics
19 pages
ICT-CSS 11 - Q1 - W4 - Mod4
No ratings yet
ICT-CSS 11 - Q1 - W4 - Mod4
17 pages
Single Phasing Monitoring and Prevention System For 3-Phase Industrial Loads PDF
No ratings yet
Single Phasing Monitoring and Prevention System For 3-Phase Industrial Loads PDF
3 pages
L-6 (DK) (Pe) ( (Ee) Nptel) 7
No ratings yet
L-6 (DK) (Pe) ( (Ee) Nptel) 7
3 pages
DIA - NE XT4 CTR 4.08 HMI 2.9 R5 - en 14
No ratings yet
DIA - NE XT4 CTR 4.08 HMI 2.9 R5 - en 14
10 pages
The KEY - Numerical Solutions of The Modified Burger's
No ratings yet
The KEY - Numerical Solutions of The Modified Burger's
9 pages
Assignment On Application of GIS in Social Problem Analysis
No ratings yet
Assignment On Application of GIS in Social Problem Analysis
11 pages
Bi 1
No ratings yet
Bi 1
69 pages
g6q1 Week 5 Math
No ratings yet
g6q1 Week 5 Math
65 pages
PostgreSQL Version Upgrade From 13 To 14 Using PG - Upgrade
No ratings yet
PostgreSQL Version Upgrade From 13 To 14 Using PG - Upgrade
15 pages
Abiram Shankar Ias
No ratings yet
Abiram Shankar Ias
6 pages
Floyd's Algorithm ADA
No ratings yet
Floyd's Algorithm ADA
19 pages
DM104 - Evaluation of Business Performance
No ratings yet
DM104 - Evaluation of Business Performance
13 pages
Slide 1 - Authentication: Adobe Captivate Thursday, April 15, 2021
No ratings yet
Slide 1 - Authentication: Adobe Captivate Thursday, April 15, 2021
55 pages
Lista de Pedidos (4) Ifood
No ratings yet
Lista de Pedidos (4) Ifood
33 pages
The Incomplete Political Economy of Social Media: Siva Vaidhyanathan
No ratings yet
The Incomplete Political Economy of Social Media: Siva Vaidhyanathan
14 pages
Figure 3
No ratings yet
Figure 3
5 pages
Filtro de Silica Gel MTraB - 100115620
No ratings yet
Filtro de Silica Gel MTraB - 100115620
1 page
MGP Paranoia - Red Clearance Reference Sheet
No ratings yet
MGP Paranoia - Red Clearance Reference Sheet
2 pages
Prod PDF 1051492176905
No ratings yet
Prod PDF 1051492176905
3 pages
Nitc - PHD
0% (1)
Nitc - PHD
36 pages
Cloud Agent Slides For QSC
100% (1)
Cloud Agent Slides For QSC
106 pages
Ethernet Train Bus Article
No ratings yet
Ethernet Train Bus Article
6 pages
Knowledge Management
No ratings yet
Knowledge Management
8 pages
Mnnnii
No ratings yet
Mnnnii
32 pages

5-Stage Pipeline CPU Hardware

Uploaded by

5-Stage Pipeline CPU Hardware

Uploaded by

5-stage Pipeline CPU hardware

Pipeline CPU hardware

Distribution of control signals in pipeline CPU

 [2]Two or more instructions in pipeline try to read/write register file =>

 The Dhrystone figure of given computing system is calculated by measuring

 N = 1 if MSB of the ALU result is 1

 When the processor is executing in Thumb state:

 When the processor is executing in Jazelle state:

 Total 37 registers = 30 general purpose + 6 status + 1 PC

ARM Core Features ARM ISA Thumb

ARM7TDMI 3-state pipeline, thumb state ARMv4T v1

ARM11 ARM1156T2(F) 8-stage pipeline, SIMD, Thumb-2, VFP, ARMv6T2 v2

ARM Cortex Series: Profile A, Profile R, Profile M

 Rm and Rs should not be PC (r15) in shift/rotate by register mode of ‘op2’

Multiplication by a power of 2 Division by a power of 2,

...0 Destination CF Destination CF

Bit rotate with wrap around

RRX: Rotate Right Extended

Single bit rotate with wrap around

 Reduces number of instructions

You might also like