Module-2: Memory Systems Basic Processing Unit
Module-2: Memory Systems Basic Processing Unit
Memory Systems
Basic Processing Unit
Syllabus
⚫ Memory System: Chapter 5 – 5.1 to 5.4, 5.5 (5.5.1, 5.5.2), 5.6
⚫ Basic Concepts
⚫ Semiconductor RAM Memories,
⚫ Read Only Memories
⚫ Speed, Size, and Cost
⚫ Cache Memories
⚫ Mapping Functions
⚫ Replacement Algorithms
⚫ Performance Considerations.
⚫ Basic Processing Unit: Chapter7, Chapter 8 – 8.1
⚫ Some Fundamental Concepts
⚫ Execution of a Complete Instruction
⚫ Multiple Bus Organization
⚫ Hard-wired Control
⚫ Micro programmed Control
⚫ Basic concepts of pipelining,
⚫ Carl Hamacher, Zvonko Vranesic, Safwat Zaky, Computer
Organization, 5th Edition, Tata McGraw Hill, 2002
To find Least Recently Used block in cache two
methods can be implemented
• Stack Based
• Counter Based
Block-
1
Block-
2
Block-
3
Cache Block-
4
Main Memory
Block-
5
Block-
6
Stack Based LRU
• Replacement is required.
• Block 1 is used before 1 timeslot
• Block 2 is used before 2 timeslots
Main • Block 4 is used before 3 timeslots
memory • Block 3 is used before 4 timeslots
blocks • Hence Block is least recently used block among-Blocks 1,2, 4 and 3 and
which hence it is replaced
are to
moved to 1 2 3 4 2 1 5 6 2
cache M M M M H H M M H
4 2 1 5 6 2
3 3 4 2 1 5 6
2 2 2 3 4 2 1 5
1 1 1 1 1 3 4 2 1
Cache
• H-Cache Hit:
• When Hit occurs hit block is moved to top of stack indicating it is recent
• M-Cache Miss:
• When this occurs LRU block is replaced (From the bottom of the cache
Counter Based LRU
• H-Cache Hit:
• When Hit occurs, Counter of hit block is set to 1
• Counters below to this are incremented and above to it remains s
• M-Cache Miss:
• When this occurs the counter of corresponding block is changed
Main memory • Remaining non-zero counters are incremented
blocks which 1 2 3 4 2 1 5 6 2
are to moved M M M M H H M M H
to cache
Counter for Block-1 1 2 3 4 4 1 2 3 4
Counter for Block-2 0 1 2 3 1 2 3 4 1
Counter for Block-3 0 0 1 2 3 4 0 0 0
Counter for Block-4 0 0 0 1 2 3 4 0 0
Counter for Block-5 0 0 0 0 0 0 1 2 3
Counter for Block-6 0 0 0 0 0 0 0 1 2
Basic Processing Unit
Fundamental Concepts
⚫ Processor fetches one instruction at a time and
perform the operation specified.
⚫ Instructions are fetched from successive memory
locations until a branch or a jump instruction is
encountered.
⚫ Processor keeps track of the address of the memory
location containing the next instruction to be fetched
using Program Counter (PC).
⚫ Instruction Register (IR)
Executing an Instruction
⚫ Fetch the contents of the memory location pointed
to by the PC. The contents of this location are
loaded into the IR (fetch phase).
IR ← [[PC]]
⚫ Assuming that the memory is byte addressable,
increment the contents of the PC by 4 (fetch phase).
PC ← [PC] + 4
⚫ Carry out the actions specified by the instruction in
the IR (execution phase).
I nt ernal proc es s or
bus
PC
I ns t ruc t ion
Addres s
dec oder and
lines
MAR c ont rol logic
Processor Organization
Mem ory
bus
MD R
D at a
lines IR
C ons t ant 4 R0
Selec t MU X
Add
A B
ALU Sub R( n - 1)
c ont rol ALU
lines
C arry -in
XOR TEMP
Fi gure 7.1. Si ngl e-bus organi zati on of the datapath i nsi de a processor.
Datapath
Internal organization of the
processor
⚫ ALU
⚫ Registers for temporary storage
⚫ Various digital circuits for executing different micro
operations.(gates, MUX,decoders,counters).
⚫ Internal path for movement of data between ALU
and registers.
⚫ Driver circuits for transmitting signals to external
units.
⚫ Receiver circuits for incoming signals from external
units.
⚫ PC:
❖ Keeps track of execution of a program
❖ Contains the memory address of the next instruction to be
fetched and executed.
MAR:
❖ Holds the address of the location to be accessed.
❖ I/P of MAR is connected to Internal bus and an O/p to external
bus.
MDR:
❖ Contains data to be written into or read out of the addressed
location.
❖ IT has 2 inputs and 2 Outputs.
❖ Data can be loaded into MDR either from memory bus or from
internal processor bus.
The data and address lines are connected to the internal bus via
MDR and MAR
Registers:
❖ The processor registers R0 to Rn-1 vary considerably from one
processor to another.
❖ Registers are provided for general purpose used by
programmer.
❖ Special purpose registers-index & stack registers.
R i in
1.Register Transfers Ri
R i out
Y in
Constant 4
Select MUX
A B
ALU
Z in
Z out
Figure 7.2. Input and output gating for the registers in Figure 7.1.
⚫ The input and output gates for register Ri are
controlled by signals isRin and Riout .
⚫ Rin Is set to1 – data available on common bus
are loaded into Ri.
⚫ Riout Is set to1 – the contents of register are
placed on the bus.
⚫ Riout Is set to 0 – the bus can be used for
transferring data from other registers .
Data transfer between two
registers:
EX:
Transfer the contents of R1 to R4.
1. Enable output of register R1 by setting
R1out=1. This places the contents of R1 on
the processor bus.
2. Enable input of register R4 by setting
R4in=1. This loads the data from the
processor bus into register R4.
Architecture Riin
Internal processor
bus
Ri
Riout
Y in
Constant 4
Select MUX
A B
ALU
Z in
Z out
Figure 7.2. Input and output gating for the registers in Figure 7.1.
2.Performing an Arithmetic or
Logic Operation
⚫ The ALU is a combinational circuit that has no
internal storage.
⚫ ALU gets the two operands from MUX and bus.
The result is temporarily stored in register Z.
⚫ What is the sequence of operations to add the
contents of register R1 to those of R2 and store the
result in R3?
1. R1out, Yin
2. R2out, SelectY, Add, Zin
3. Zout, R3in
Step 1: Output of the register R1 and input of
the register Y are enabled, causing the
contents of R1 to be transferred to Y.
Step 2: The multiplexer’s select signal is set to
select Y causing the multiplexer to gate the
contents of register Y to input A of the ALU.
Step 3: The contents of Z are transferred to the
destination register R3.
Register Transfers 0
Ri in
Bus
Clock
D Q
⚫ All operations and data transfers are controlled by the processor clock.
Figure 7.3. Input and output gating for one register bit.
Fetching a Word from Memory
Memory -bus
data lines MDRoutE
MDR
MDRout
Internal processor
bus
MFC
MDR out
Timing
Assume MAR
is always available
on the address lines
of the memory bus.
⚫ Move (R1), R2
1. R1out, MARin, Read
2. MDRinE, WMFC
3. MDRout, R2in
4.Storing a word in memory
⚫ Address is loaded into MAR
⚫ Data to be written loaded into MDR.
⚫ Write command is issued.
⚫ Example:Move R2,(R1)
R1out,MARin
R2out,MDRin,Write
MDRoutE, WMFC
Execution of a Complete
Instruction
⚫ Add (R3), R1
⚫ Fetch the instruction
⚫ Fetch the first operand (the contents of the
memory location pointed to by R3)
⚫ Perform the addition
⚫ Load the result into R1
Execution of a Complete Memory
bus
Address
lines
PC
MAR
Internal processor
bus
Control signals
Instruction
decoder and
control logic
Instruction
MDR
Data
lines IR
Constant 4 R0
Select MUX
Add
A B
ALU Sub R( n - 1)
control ALU
Step Action lines
Carry -in
XOR TEMP
1 PCout , MAR in , Read, Select4,A dd, Zin
Z
2 Zout , PCin , Y in , WMF C
3 MDR out , IR in
4 R3out , MAR in , Read
Figure 7.1. Single-bus organization of the datapath inside a processor.
5 R1out , Y in , WMF C
6 MDR out , SelectY,Add, Zin
7 Zout , R1 in , End
Add (R3), R1
Execution of Branch
Instructions
⚫ A branch instruction replaces the contents of
PC with the branch target address, which is
usually obtained by adding an offset X given
in the branch instruction.
⚫ The offset X is usually the difference between
the branch target address and the address
immediately following the branch instruction.
⚫ UnConditional branch
Execution of Branch
Instructions
Step Action
Incrementer
Multiple-Bus Organization
PC
Register
f ile
Constant 4
MUX
A
ALU R
Instruction
decoder
IR
MDR
MAR
Memory b us Address
data lines lines
Step Action
Control signals
PC
Instruction
Address
decoder and
lines
MAR control logic
Memory
bus
MDR
Data
lines IR
Exercise
Constant 4 R0
Select MUX
Add
A B
ALU Sub R( n - 1)
control ALU
lines
Carry -in
XOR TEMP
External
inputs
Decoder/
IR
encoder
Condition
codes
Control signals
Step decoder
T 1 T2 Tn
INS1
External
INS2 inputs
Instruction
IR Encoder
decoder
Condition
codes
Run End
Control signals
T4 T6
T1
Figure 7.12. Generation of the Zin control signal for the processor in Figure 7.1.
Generating End
Branch<0
Add Branch
N N
T7 T5 T4 T5
⚫
Figure 7.13. Generation of the End control signal.
Instruction Integer Floating-point
unit unit unit
Instruction Data
cache cache
Bus interface
Processor
A Complete Processor
Sy stem us
b
Main Input/
memory Output
WMFC
MAR in
Select
Read
PCout
R1out
R3out
Micro -
End
PCin
R1in
Add
Z out
IRin
Yin
Zin
instruction
1 0 1 1 1 0 0 0 1 1 1 0 0 0 0 0 0
2 1 0 0 0 0 0 1 0 0 0 1 0 0 0 1 0
3 0 0 0 0 1 1 0 0 0 0 0 0 0 0 0 0
4 0 0 1 1 0 0 0 0 0 0 0 0 0 1 0 0
5 0 0 0 0 0 0 1 0 0 0 0 1 0 0 1 0
6 0 0 0 0 1 0 0 0 1 1 0 0 0 0 0 0
7 0 0 0 0 0 0 0 0 0 0 1 0 1 0 0 1
1
2
3
4
5
6
7
Action
Clock PC
Control
store CW
⚫ Control store
One function
cannot be carried
out by this simple
organization.
Conditional branch
⚫ The previous organization cannot handle the situation when the control
unit is required to check the status of the condition codes or external
inputs to choose between alternative courses of action.
⚫ Use conditional branch microinstruction.
Address Microinstruction
Starting and
branch address Condition
IR codes
generator
Clock PC
Control
store CW
F1
F1 (4 bits)
0100: R0
0101: R1
out
0010: MDRout
0011: Zout
out
out
F2
F2 (3 bits)
010: IRin
011: Zin
100: R0in
101: R1in
F3
F3 (3 bits)
010: MDRin
011: TEMPin
100: Yin
F4
F4 (4 bits)
1111: XOR
F5
F5 (2 bits)
00: No action
01: Read
10: Write
Microinstructions
0110: R2 110: R2in 16 ALU
out f unctions
0111: R3out 111: R3in
1010: TEMPout
1011: Of f set
out
F6 F7 F8
11 10 8 7 4 3 0
Address Microinstruction
(octal)
Condition
codes
Decoding circuits
Address Field
A R
Control store
Next address I R
Microinstruction decoder
Control signals
Microroutine
000 0 0 0 0 0 0 0 1 0 0 1 01 1 0 0 1 0 0 0 0 01 1 0 0 0 0
001 0 0 0 0 0 0 1 0 0 1 1 00 1 1 0 0 0 0 0 0 00 0 1 0 0 0
002 0 0 0 0 0 0 1 1 0 1 0 01 0 0 0 0 0 0 0 0 00 0 0 0 0 0
003 0 0 0 0 0 0 0 0 0 0 0 00 0 0 0 0 0 0 0 0 00 0 0 1 1 0
121 0 1 0 1 0 0 1 0 1 0 0 01 1 0 0 1 0 0 0 0 01 1 0 0 0 0
122 0 1 1 1 1 0 0 0 0 1 1 10 0 0 0 0 0 0 0 0 00 0 1 0 0 1
170 0 1 1 1 1 0 0 1 0 1 0 00 0 0 0 1 0 0 0 0 01 0 1 0 0 0
171 0 1 1 1 1 0 1 0 0 1 0 00 0 1 0 0 0 0 0 0 00 0 0 0 0 0
172 0 1 1 1 1 0 1 1 1 0 1 01 1 0 0 0 0 0 0 0 00 0 0 0 0 0
173 0 0 0 0 0 0 0 0 0 1 1 10 1 0 0 0 0 0 0 0 00 0 0 0 0 0
Control store
Rdstout
Rdstin
Microinstruction
decoder
Rsrcout
Rsrcin