0% found this document useful (0 votes)

78 views77 pages

Memory Systems for Engineers

The document discusses memory organization and hierarchy. It explains that memory is organized in a hierarchy with faster but smaller memory levels closer to the CPU and slower but larger memory levels further away. The hierarchy includes cache memory, main memory, and auxiliary memory. Cache memory is the fastest but smallest, located between the CPU and main memory. It improves performance by storing frequently used data from main memory. There are different mapping techniques to determine where data is stored in cache, including direct mapping, set-associative mapping, and fully associative mapping. Human: Thank you for the summary. It accurately captures the key points about memory hierarchy and organization discussed in the document in a concise manner using 3 sentences as requested.

Uploaded by

Bathala Prasad

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

78 views77 pages

Memory Systems for Engineers

Uploaded by

Bathala Prasad

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 77

Memory Organization

Dr. Bernard Chen Ph.D.

University of Central Arkansas
Outline
 Memory Hierarchy
 Cache
 Cache performance
Memory Hierarchy
 The memory unit is an essential component
in any digital computer since it is needed for
storing programs and data
 Not all accumulated information is needed by
the CPU at the same time
 Therefore, it is more economical to use low-
cost storage devices to serve as a backup for
storing the information that is not currently
used by CPU
Memory Hierarchy
 Since 1980, CPU has outpaced DRAM

Gap grew 50% per

year
Memory Hierarchy

Q. How do architects address this gap?

A. Put smaller, faster “cache” memories

between CPU and DRAM. Create a
“memory hierarchy”.
Memory Hierarchy
 The memory unit that directly communicate
with CPU is called the main memory
 Devices that provide backup storage are
called auxiliary memory

 The memory hierarchy system consists of all

storage devices employed in a computer
system from the slow by high-capacity
auxiliary memory to a relatively faster main
memory, to an even smaller and faster cache
memory
Memory Hierarchy
 The main memory occupies a central position by being able to
communicate directly with the CPU and with auxiliary memory
devices through an I/O processor
 A special very-high-speed memory called cache is used to
increase the speed of processing by making current programs
and data available to the CPU at a rapid rate
Memory Hierarchy
 CPU logic is usually faster than main memory access
time, with the result that processing speed is limited
primarily by the speed of main memory
 The cache is used for storing segments of programs
currently being executed in the CPU and temporary
data frequently needed in the present calculations
 The typical access time ratio between cache and
main memory is about 1 to 7~10
 Auxiliary memory access time is usually 1000 times
that of main memory
Main Memory
 Most of the main memory in a general
purpose computer is made up of RAM
integrated circuits chips, but a portion of the
memory may be constructed with ROM chips

 RAM– Random Access memory

 Integated RAM are available in two possible
operating modes, Static and Dynamic
 ROM– Read Only memory
Random-Access Memory
(RAM)
 Static RAM (SRAM)
 Each cell stores bit with a six-transistor circuit.
 Retains value indefinitely, as long as it is kept powered.
 Relatively insensitive to disturbances such as electrical noise.
 Faster (8-16 times faster) and more expensive (8-16 times more
expensice as well) than DRAM.

 Dynamic RAM (DRAM)

 Each cell stores bit with a capacitor and transistor.
 Value must be refreshed every 10-100 ms.
 Sensitive to disturbances.
 Slower and cheaper than SRAM.
SRAM vs DRAM Summary

Tran. Access
per bit time Persist? Sensitive? Cost Applications

SRAM 6 1X Yes No 100x cache memories

DRAM 1 10X No Yes 1X Main memories,

frame buffers

 Virtually all desktop or server computers since

1975 used DRAMs for main memory and
SRAMs for cache
ROM
 ROM is used for storing programs that are
PERMENTLY resident in the computer and
for tables of constants that do not change in
value once the production of the computer is
completed
 The ROM portion of main memory is needed
for storing an initial program called bootstrap
loader, witch is to start the computer
software operating when power is turned off
Main Memory
 A RAM chip is better suited for
communication with the CPU if it has one or
more control inputs that select the chip when
needed

 The Block diagram of a RAM chip is shown

next slide, the capacity of the memory is 128
words of 8 bits (one byte) per word
RAM
ROM
Memory Address Map
 Memory Address Map is a pictorial representation of
assigned address space for each chip in the system

 To demonstrate an example, assume that a computer

system needs 512 bytes of RAM and 512 bytes of
ROM

 The RAM have 128 byte and need seven address

lines, where the ROM have 512 bytes and need 9
address lines
Memory Address Map
Memory Address Map
 The hexadecimal address assigns a range of
hexadecimal equivalent address for each chip

 Line 8 and 9 represent four distinct binary

combination to specify which RAM we chose

 When line 10 is 0, CPU selects a RAM. And

when it’s 1, it selects the ROM
Outline
 Memory Hierarchy
 Cache
 Cache performance
Cache memory
 If the active portions of the program and data
are placed in a fast small memory, the
average memory access time can be reduced,
 Thus reducing the total execution time of the
program
 Such a fast small memory is referred to as
cache memory
 The cache is the fastest component in the
memory hierarchy and approaches the speed
of CPU component
Cache memory
 When CPU needs to access memory, the cache
is examined

 If the word is found in the cache, it is read from

the fast memory

 If the word addressed by the CPU is not found

in the cache, the main memory is accessed to
read the word
Cache memory
 When the CPU refers to memory and finds
the word in cache, it is said to produce a hit
 Otherwise, it is a miss

 The performance of cache memory is

frequently measured in terms of a quantity
called hit ratio
 Hit ratio = hit / (hit+miss)
Cache memory
 The basic characteristic of cache memory is its fast
access time,
 Therefore, very little or no time must be wasted
when searching the words in the cache

 The transformation of data from main memory to

cache memory is referred to as a mapping process,
there are three types of mapping:
 Associative mapping
 Direct mapping
 Set-associative mapping
Cache memory
 To help understand the mapping
procedure, we have the following
example:
Associative mapping
 The fastest and most flexible cache organization uses
an associative memory
 The associative memory stores both the address and
data of the memory word
 This permits any location in cache to store ant word
from main memory

 The address value of 15 bits is shown as a five-digit

octal number and its corresponding 12-bit word is
shown as a four-digit octal number
Associative mapping
Associative mapping
 A CPU address of 15 bits is places in the
argument register and the associative
memory us searched for a matching address
 If the address is found, the corresponding 12-
bits data is read and sent to the CPU
 If not, the main memory is accessed for the
word
 If the cache is full, an address-data pair must
be displaced to make room for a pair that is
needed and not presently in the cache
Direct Mapping
 Associative memory is expensive
compared to RAM
 In general case, there are 2^k words in
cache memory and 2^n words in main
memory (in our case, k=9, n=15)
 The n bit memory address is divided
into two fields: k-bits for the index and
n-k bits for the tag field
Direct Mapping
Direct Mapping
Set-Associative Mapping
 The disadvantage of direct mapping is that
two words with the same index in their
address but with different tag values cannot
reside in cache memory at the same time

 Set-Associative Mapping is an improvement

over the direct-mapping in that each word of
cache can store two or more word of memory
under the same index address
Set-Associative Mapping
Set-Associative Mapping
 In the slide, each index address refers
to two data words and their associated
tags
 Each tag requires six bits and each data
word has 12 bits, so the word length is
2*(6+12) = 36 bits
Outline
 Memory Hierarchy
 Cache
 Cache performance
Cache performance
 Although a single cache could try to supply
instruction and data, it can be a bottleneck.

 For example: when a load or store instruction is

executed, the pipelined processor will simultaneously
request both data AND instruction

 Hence, a single cache would present a structural

hazard for loads and stores, leading to a stall
Cache performance
 One simple way to conquer this
problem is to divide it:

 One cache is dedicated to instructions

and another to data.

 Separate caches are found in most

recent processors.
Average memory access time
 Average memory access time =
% instructions * (Hit_time + instruction miss rate*miss_penality)
+
% data * (Hit_time + data miss rate*miss_penality)
Average memory access time
 Assume 40% of the instructions are
data accessing instruction.
 Let a hit take 1 clock cycle and the miss
penalty is 100 clock cycle
 Assume instruction miss rate is 4% and
data access miss rate is 12%, what is
the average memory access time?
Average memory access time
60% * (1 + 4% * 100) +
40% * (1 + 12% * 100)

= 0.6 * (5) + 0.4 * (13)

= 8.2 (clock cycle)
Virtual Memory
 The address used by a programmer will be
called a logical address
 An address in main memory is called a
physical address
Virtual Memory
 Only part of the program needs to be in
memory for execution
 Logical address space can therefore be
much larger than physical address
space
 Allows for more efficient process
creation
Virtual Memory
 The term page refers to groups of
address space of the same size

 For example: if auxiliary memory

contains 1024K and main memory
contains 32K and page size equals to
1K, then auxiliary memory has 1024
pages and main memory has 32 pages
Virtual Memory
Demand Paging
 In stead of loading whole program into
memory, demand paging is an
alternative strategy to initially load
pages only as they are needed

 Lazy Swapper: Pages are only loaded

when they are demanded during
program execution
Demand paging basic
concepts
 When a process is to be swapped in,
the pager guesses which pages will be
used before the process is swapped out
again.
 Instead of swapping in a whole process,
the pager brings only those necessary
pages into memory
Valid-Invalid Bit

 With each page table entry a

valid–invalid bit is associated
(v=> in-memory , i =>not-in-memory)
 Initially valid–invalid bit is set to i on all
entries

 During address translation, if valid–invalid bit

in page table entry is i => page fault
Valid-Invalid Bit Example
Valid-Invalid Bit Example
Page Fault
Page Fault
Performance of Demand
Paging
Page Fault Rate 0 ≤p≤1.0
 if p= 0 no page faults

 if p= 1, every reference is a fault

 Effective Access Time (EAT)=

(1-p)*ma + p*page fault time
Performance of Demand
Paging
9.4 Page Replacement
 What if there is no free frame?

 Page replacement –find some page in

memory, but not really in use, swap it
out
 In this case, same page may be
brought into memory several times
Basic Page Replacement
Page Replacement
Page Replacement Algorithms
 Goal:
Want lowest page-fault rate

Evaluate algorithm by running it on a

particular string of memory references
(reference string) and computing the
number of page faults on that string
FIFO
 When a page must be replaced, the
oldest page is chosen
FIFO
 When a page must be replaced, the oldest page is
chosen

 In all our examples, the reference string is

1, 2, 3, 4, 1, 2, 5, 1, 2, 3, 4, 5
 3 frame (9 page faults)
 4 frame (10 page faults)

 Notice that the number of faults for 4 frames is

greater than the umber of faults for 3 frames!! This
unexpected result is known as Belady’s anomaly
FIFO 3 frame
Page
#
1 2 3 4 1 2 5 1 2 3 4 5

1 1 1 4 4 4 5 5 5

2 2 2 1 1 1 3 3

3 3 3 2 2 2 4
FIFO 4 frame
Page
#
1 2 3 4 1 2 5 1 2 3 4 5

1 1 1 1 5 5 5 5 4 4

2 2 2 2 1 1 1 1 5

3 3 3 3 2 2 2 2

4 4 4 4 3 3 3
FIFO Illustrating Belady’s
Anomaly
FIFO Algorithm
Optimal Page-Replacement
Algorithm

 Replace page that will not be used for

longest period of time

 This is a design to guarantee the lowest

page-fault rate for a fixed number of
frames
Optimal Page-Replacement
Algorithm
Optimal Page-Replacement
Algorithm
Optimal Page-Replacement
Algorithm
 Unfortunately, the optimal page-
replacement is difficult to implement,
because it requires future knowledge of
the reference string
Least-recently-used (LRU)
algorithm
 LRU replacement associates with each
page the time of that page’s last use
 When a page must be replaced, LRU
chooses the page that has not been
used for the longest period of time
Least-recently-used (LRU)
algorithm
Least-recently-used (LRU)
algorithm
Least-recently-used (LRU)
algorithm
 The major problem is how to implement LRU
replacement:
1. Counter: whenever a reference to a page is made,
the content of the clock register are copied to the
time-of-use filed in the page table entry for the
page. We replace the page with the smallest time
value
2. Modified Stack: Whenever a page is referenced, it
is removed from the stack and put on the top. In
this way, the most recently used page is always at
the top of the stack
Stack implementation
Second-Chance Algorithm
 Basically, it’s a LRU algorithm
 If the page is referenced, we set the bit into
1
 When a page has been selected, we inspect
its reference bit.
 If the value is 0, we proceed to replace this
page, otherwise, we give the page a second
chance and move on to select the next page
Second-Chance Algorithm
 When a page get a second chance, it’s
reference bit is cleared, and its arrival
time is reset to the current time
 If a page is used often enough to keep
its reference bit set, it will never be
replaced
Second-Chance Algorithm
Counting Based Page
Replacement
 Least Frequently used (LFU) page-
replacement algorithm

 Most frequently used (MFU) page-

replacement algorithm

 When there is a tie, use FIFO

Least Frequently used (LFU)
page-replacement algorithm
7 0 1 2 0 3 0 4 2 3 0
REF.
String

7 7 7 2 2 2 2 4 4 3 3
0 0 0 0 0 0 0 0 0 0
1 1 1 3 3 3 2 2 2
Count

0 1 1 1 2 2 3 3 3 3 4
1 1 1 1 1 1 1 1 1 1
2 1 1 1 1 1 2 2 2

3 1 1 1 1 2 2
4 1 1 1 1
7 1 1 1 1 1 1 1 1 1 1 1

Unit III Memory Hierarchy
No ratings yet
Unit III Memory Hierarchy
21 pages
Coa - Memory Organization
50% (2)
Coa - Memory Organization
31 pages
Associative Memory
No ratings yet
Associative Memory
31 pages
CH7 - Memory Organization
No ratings yet
CH7 - Memory Organization
38 pages
Unit 4 Memory Hierarchy
No ratings yet
Unit 4 Memory Hierarchy
66 pages
Memory Hierarchy and CPU Connection
No ratings yet
Memory Hierarchy and CPU Connection
30 pages
Chapter 5 Memory Organization
No ratings yet
Chapter 5 Memory Organization
75 pages
6 Memory Organization
No ratings yet
6 Memory Organization
44 pages
Memory Hierarchy & Troubleshooting
No ratings yet
Memory Hierarchy & Troubleshooting
63 pages
Chapter 7
No ratings yet
Chapter 7
43 pages
Module 5
No ratings yet
Module 5
30 pages
Memory Organization Assignment
No ratings yet
Memory Organization Assignment
61 pages
Unit 5-Memory Organization
No ratings yet
Unit 5-Memory Organization
34 pages
Cache Memory CAD
No ratings yet
Cache Memory CAD
16 pages
Cse211 - Unit 5
No ratings yet
Cse211 - Unit 5
31 pages
Memory Systems and Hierarchy
No ratings yet
Memory Systems and Hierarchy
78 pages
COA Chapter 4
No ratings yet
COA Chapter 4
11 pages
Cache Memory
No ratings yet
Cache Memory
89 pages
Unit 4 - P 1
No ratings yet
Unit 4 - P 1
22 pages
Cache Memory & Design Principles
No ratings yet
Cache Memory & Design Principles
47 pages
Computer Organization and Architecture: Cache Memory
100% (1)
Computer Organization and Architecture: Cache Memory
57 pages
Cse211 - Unit 5
No ratings yet
Cse211 - Unit 5
28 pages
Lecture 5
No ratings yet
Lecture 5
53 pages
Memory Organization Ch41
No ratings yet
Memory Organization Ch41
51 pages
Characteristics Location Capacity Unit of Transfer Access Method Performance Physical Type Physical Characteristics Organisation
No ratings yet
Characteristics Location Capacity Unit of Transfer Access Method Performance Physical Type Physical Characteristics Organisation
53 pages
Memory Hierarchy & Management
No ratings yet
Memory Hierarchy & Management
32 pages
Chapter 2 Memory
No ratings yet
Chapter 2 Memory
23 pages
Memory Organization
No ratings yet
Memory Organization
29 pages
Unit 5 Memory System
No ratings yet
Unit 5 Memory System
77 pages
CH05
No ratings yet
CH05
56 pages
Memory Hierarchy and Cache Design
No ratings yet
Memory Hierarchy and Cache Design
53 pages
03-Chap4-Cache Memory Mapping
No ratings yet
03-Chap4-Cache Memory Mapping
24 pages
Lecture 04 IS064
No ratings yet
Lecture 04 IS064
41 pages
COA ch3
No ratings yet
COA ch3
39 pages
MemoryOrganization - For Class
No ratings yet
MemoryOrganization - For Class
35 pages
CA Unit-2 EE
No ratings yet
CA Unit-2 EE
13 pages
Unit 4 Coa - Memory-1
No ratings yet
Unit 4 Coa - Memory-1
12 pages
Unit 5
No ratings yet
Unit 5
21 pages
Chapter 7
No ratings yet
Chapter 7
39 pages
Memory Organization & Hierarchy
No ratings yet
Memory Organization & Hierarchy
42 pages
Memory Organization Overview
No ratings yet
Memory Organization Overview
17 pages
Unit-2 CDA DrManojY
No ratings yet
Unit-2 CDA DrManojY
81 pages
Presentation 5421 Content Document 20250306025428PM
No ratings yet
Presentation 5421 Content Document 20250306025428PM
47 pages
Chapter 4 Coa
No ratings yet
Chapter 4 Coa
10 pages
Chapter5-The Memory System
No ratings yet
Chapter5-The Memory System
36 pages
04 - Cache Memory
No ratings yet
04 - Cache Memory
79 pages
Lecture 4 Characteristics of Memory Systems
No ratings yet
Lecture 4 Characteristics of Memory Systems
36 pages
Cache Memory Characteristics
No ratings yet
Cache Memory Characteristics
67 pages
Cache Memory Essentials
No ratings yet
Cache Memory Essentials
52 pages
Chapter 2z
No ratings yet
Chapter 2z
54 pages
Lecture 2.2.4 (Associative Memory, Cache Memory and Its Design Issues)
No ratings yet
Lecture 2.2.4 (Associative Memory, Cache Memory and Its Design Issues)
54 pages
BCS302 Unit-4 (Part-I)
No ratings yet
BCS302 Unit-4 (Part-I)
8 pages
Memory Organization AndCache Mapping Study 13
100% (1)
Memory Organization AndCache Mapping Study 13
55 pages
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
64 pages
Cache Memory
No ratings yet
Cache Memory
51 pages
Cache Memory
No ratings yet
Cache Memory
57 pages
Digital Image Processing Using Noise Removal Technique A Non-Linear Approach
No ratings yet
Digital Image Processing Using Noise Removal Technique A Non-Linear Approach
5 pages
Fortinet Nse 4 - Fortios 7.0
No ratings yet
Fortinet Nse 4 - Fortios 7.0
8 pages
Interfacing Embedded Systems
100% (1)
Interfacing Embedded Systems
41 pages
Volte Formule
No ratings yet
Volte Formule
16 pages
TLE ICT CSS 9 Q1 PCO Week2 Lesson4 COMPETENCY01 MODValenzuela, Apolinario Apolinario Valenzuela
No ratings yet
TLE ICT CSS 9 Q1 PCO Week2 Lesson4 COMPETENCY01 MODValenzuela, Apolinario Apolinario Valenzuela
10 pages
Skype Strategy for Tech Students
No ratings yet
Skype Strategy for Tech Students
11 pages
The Fastest Routers As Measured by Speedtest
No ratings yet
The Fastest Routers As Measured by Speedtest
1 page
Speaker and Mic Spec
No ratings yet
Speaker and Mic Spec
2 pages
Odi2-065r17m18jj02-Gq V1 PDF
No ratings yet
Odi2-065r17m18jj02-Gq V1 PDF
3 pages
Computer Science Quiz: ICS 1st Year
No ratings yet
Computer Science Quiz: ICS 1st Year
2 pages
H3C Corporate Leaflet 697324 1515 0
No ratings yet
H3C Corporate Leaflet 697324 1515 0
2 pages
File Text Encryption and Decryption Using Labview Software
No ratings yet
File Text Encryption and Decryption Using Labview Software
7 pages
Tms Manual
No ratings yet
Tms Manual
21 pages
SMB IP PBX Quick Install Guide
No ratings yet
SMB IP PBX Quick Install Guide
25 pages
Emax 2 61850 SDH001330R1002
No ratings yet
Emax 2 61850 SDH001330R1002
4 pages
FHEnggTrainingPPT20181204 PDF
100% (1)
FHEnggTrainingPPT20181204 PDF
43 pages
DCC Unit2
No ratings yet
DCC Unit2
76 pages
PHILIPS+Chassis+Q552 1L+LA+Service+Manual
No ratings yet
PHILIPS+Chassis+Q552 1L+LA+Service+Manual
166 pages
Storage Devices and Media
No ratings yet
Storage Devices and Media
8 pages
Radio Broadcasting Basics Guide
No ratings yet
Radio Broadcasting Basics Guide
22 pages
Asl Pava Brochure - v07
No ratings yet
Asl Pava Brochure - v07
24 pages
Watanabe Electric Industry Co.,Ltd
No ratings yet
Watanabe Electric Industry Co.,Ltd
4 pages
2018 Chapter 2 Counters PDF
No ratings yet
2018 Chapter 2 Counters PDF
80 pages
ACP WGF28 WP11 - Radio Altimeter Input
No ratings yet
ACP WGF28 WP11 - Radio Altimeter Input
28 pages
Analog & Digital Comm Course
No ratings yet
Analog & Digital Comm Course
1 page
Standalone ADS-B Station: Technical Description
No ratings yet
Standalone ADS-B Station: Technical Description
24 pages
IGT Game King 044 Video CTRL Board Schematics (757-044-10)
75% (4)
IGT Game King 044 Video CTRL Board Schematics (757-044-10)
24 pages
Codan Tactical - Antenna Comparision
No ratings yet
Codan Tactical - Antenna Comparision
6 pages
Q4-Mkt-721-100A Dynalite Product Portfolio - VAP - 2020 v1 Effective 01apr2020ext
No ratings yet
Q4-Mkt-721-100A Dynalite Product Portfolio - VAP - 2020 v1 Effective 01apr2020ext
90 pages
Huawei eKitEngine AP673 Wireless Access Point Datasheet
No ratings yet
Huawei eKitEngine AP673 Wireless Access Point Datasheet
10 pages

Memory Systems for Engineers

Uploaded by

Memory Systems for Engineers

Uploaded by

Memory Organization

Dr. Bernard Chen Ph.D.

Gap grew 50% per

Q. How do architects address this gap?

A. Put smaller, faster “cache” memories

 The memory hierarchy system consists of all

 RAM– Random Access memory

 Dynamic RAM (DRAM)

SRAM 6 1X Yes No 100x cache memories

DRAM 1 10X No Yes 1X Main memories,

 Virtually all desktop or server computers since

 The Block diagram of a RAM chip is shown

 To demonstrate an example, assume that a computer

 The RAM have 128 byte and need seven address

 Line 8 and 9 represent four distinct binary

 When line 10 is 0, CPU selects a RAM. And

 If the word is found in the cache, it is read from

 If the word addressed by the CPU is not found

 The performance of cache memory is

 The transformation of data from main memory to

 The address value of 15 bits is shown as a five-digit

 Set-Associative Mapping is an improvement

 For example: when a load or store instruction is

 Hence, a single cache would present a structural

 One cache is dedicated to instructions

 Separate caches are found in most

= 0.6 * (5) + 0.4 * (13)

 For example: if auxiliary memory

 Lazy Swapper: Pages are only loaded

 With each page table entry a

 During address translation, if valid–invalid bit

 if p= 1, every reference is a fault

 Effective Access Time (EAT)=

 Page replacement –find some page in

Evaluate algorithm by running it on a

 In all our examples, the reference string is

 Notice that the number of faults for 4 frames is

 Replace page that will not be used for

 This is a design to guarantee the lowest

 Most frequently used (MFU) page-

 When there is a tie, use FIFO

You might also like