
Lecture 12: Cache Innovations

• Today: cache access basics and innovations (Sections 5.1-5.2)

Accessing the Cache
[Figure: a direct-mapped cache lookup. The byte address (e.g., 101000) is split into an offset within an 8-byte word and an index into the data array; with 8 words, 3 index bits select the set.]

• Direct-mapped cache: each address maps to a unique location in the cache

The Tag Array
[Figure: the same cache with a tag array alongside the data array. The remaining upper address bits (the tag) are stored per entry and compared against the incoming address to detect a hit.]

• Direct-mapped cache: each address maps to a unique location, so a single tag comparison determines hit or miss

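The lookup can be sketched in software (the bit widths below match the figure's toy cache: 8-byte words, 8 sets; a real design derives them the same way):

    #include <stdint.h>
    #include <stdio.h>

    int main(void) {
        uint32_t addr   = 0x28;               // the slide's example: 101000 in binary
        uint32_t offset = addr & 0x7;         // low 3 bits: byte within an 8-byte word
        uint32_t index  = (addr >> 3) & 0x7;  // next 3 bits: one of 8 sets -> 101 = set 5
        uint32_t tag    = addr >> 6;          // remaining bits: stored in the tag array
        printf("tag=%u index=%u offset=%u\n", tag, index, offset);
        return 0;
    }
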
Increasing Line Size
• A large cache line size → smaller tag array, fewer misses because of spatial locality

[Figure: the byte address (e.g., 10100000) now splits into a tag and an offset into a 32-byte cache line (block size); one tag covers each 32-byte line, so the tag array shrinks.]

Associativity
• Set associativity → fewer conflicts; wasted power because multiple data and tags are read

[Figure: a 2-way set-associative cache. The index selects a set; the tags of Way-1 and Way-2 are both read and compared against the address (10100000), and the matching way supplies the data.]

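A minimal 2-way lookup sketch (the tag-array structure is hypothetical):

    #include <stdint.h>

    #define WAYS 2
    #define SETS 8
    typedef struct { uint32_t tag; int valid; } TagEntry;
    TagEntry tag_array[SETS][WAYS];

    // Hardware reads and compares both ways' tags in parallel (the
    // wasted power); the loop below is the sequential equivalent.
    int lookup(uint32_t tag, uint32_t index) {
        for (int w = 0; w < WAYS; w++)
            if (tag_array[index][w].valid && tag_array[index][w].tag == tag)
                return w;    // hit: this way supplies the data
        return -1;           // miss in all ways
    }
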
Example

• 32 KB 4-way set-associative data cache array with 32-byte line size

• How many sets?

• How many index bits, offset bits, tag bits?

• How large is the tag array?

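A worked solution (assuming 32-bit addresses, which the slide leaves unstated):

    sets      = 32 KB / (4 ways × 32 B per line) = 256
    offset    = log2(32)  = 5 bits
    index     = log2(256) = 8 bits
    tag       = 32 − 8 − 5 = 19 bits
    tag array = 256 sets × 4 ways × 19 bits = 19,456 bits ≈ 2.4 KB
                (plus a valid/dirty bit or two per entry)
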
Cache Misses

• On a write miss, you may either choose to bring the block into the cache (write-allocate) or not (write-no-allocate)

• On a read miss, you always bring the block in (spatial and temporal locality) – but which block do you replace? (see the sketch below)
  - no choice for a direct-mapped cache
  - randomly pick one of the ways to replace
  - replace the way that was least-recently used (LRU)
  - FIFO replacement (round-robin)

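A minimal victim-selection sketch for a 4-way cache (hypothetical bookkeeping; LRU is shown with per-way timestamps, a common approximation):

    #include <stdint.h>
    #include <stdlib.h>

    #define WAYS 4
    uint64_t last_used[WAYS];   // updated with a counter value on every hit
    int fifo_ptr = 0;           // round-robin pointer

    int victim_random(void) { return rand() % WAYS; }

    int victim_lru(void) {      // replace the least-recently used way
        int v = 0;
        for (int w = 1; w < WAYS; w++)
            if (last_used[w] < last_used[v]) v = w;
        return v;
    }

    int victim_fifo(void) {     // round-robin: ignore usage, cycle through ways
        int v = fifo_ptr;
        fifo_ptr = (fifo_ptr + 1) % WAYS;
        return v;
    }
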
Writes

• When you write into a block, do you also update the copy in L2?
  - write-through: every write to L1 → write to L2
  - write-back: mark the block as dirty; when the block gets replaced from L1, write it to L2

• Write-back coalesces multiple writes to an L1 block into one L2 write

• Write-through simplifies coherence protocols in a multiprocessor system, as the L2 always has a current copy of the data

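A sketch of the two policies (the L2 interface functions are hypothetical):

    #include <stdint.h>

    typedef struct { uint32_t tag; int valid, dirty; uint8_t data[32]; } Line;

    void l2_write(Line *line, int off, uint8_t v);   // hypothetical L2 interface
    void l2_writeback(Line *line);                   // hypothetical L2 interface

    // Write-through: every L1 write is propagated to L2 immediately.
    void store_writethrough(Line *line, int off, uint8_t v) {
        line->data[off] = v;
        l2_write(line, off, v);
    }

    // Write-back: mark dirty now; L2 sees one write, at eviction time.
    void store_writeback(Line *line, int off, uint8_t v) {
        line->data[off] = v;
        line->dirty = 1;
    }

    void evict(Line *line) {
        if (line->dirty) l2_writeback(line);   // coalesced write to L2
        line->valid = 0;
    }
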
Reducing Cache Miss Penalty

• Multi-level caches

• Critical word first

• Priority for reads

• Victim caches

Multi-Level Caches

• The L2 and L3 have properties that are different from the L1:
  - access time is not as critical for L2 as it is for L1 (every load/store/instruction accesses the L1)
  - the L2 is much larger and can consume more power per access

• Hence, they can adopt alternative design choices:
  - serial tag and data access
  - high associativity

Read/Write Priority

• For write-back/write-through caches, writes to lower levels are placed in write buffers

• When we have a read miss, we must look up the write buffer before checking the lower level

• When we have a write miss, the write can merge with another entry in the write buffer or it creates a new entry

• Reads are more urgent than writes (the probability of an instruction waiting for the result of a read is 100%, while the probability of an instruction waiting for the result of a write is much smaller) – hence, reads get priority unless the write buffer is full

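A sketch of the read-miss path (the write buffer and l2_read are hypothetical):

    #include <stdint.h>
    #include <string.h>

    #define WBUF_ENTRIES 8
    typedef struct { uint32_t addr; uint8_t data[32]; int valid; } WBufEntry;
    WBufEntry wbuf[WBUF_ENTRIES];

    void l2_read(uint32_t addr, uint8_t *out);   // hypothetical lower level

    // The write buffer may hold a newer copy of the block than L2,
    // so a read miss must search it before going down the hierarchy.
    void read_miss(uint32_t block_addr, uint8_t *out) {
        for (int i = 0; i < WBUF_ENTRIES; i++)
            if (wbuf[i].valid && wbuf[i].addr == block_addr) {
                memcpy(out, wbuf[i].data, 32);   // serviced by the write buffer
                return;
            }
        l2_read(block_addr, out);
    }
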
Victim Caches

• A direct-mapped cache suffers from misses because multiple pieces of data map to the same location

• The processor often tries to access data that it recently discarded – all discards are placed in a small victim cache (4 or 8 entries) – the victim cache is checked before going to L2

• Can be viewed as additional associativity for a few sets that tend to have the most conflicts

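A sketch of the probe order (structures are hypothetical):

    #include <stdint.h>

    #define VC_ENTRIES 8   // victim caches are tiny: 4 or 8 entries
    typedef struct { uint32_t addr; int valid; } VCEntry;
    VCEntry victim_cache[VC_ENTRIES];

    // On an L1 miss, check the victim cache before going to L2; a hit
    // there recovers a recently discarded block at far lower cost.
    int hits_in_victim_cache(uint32_t block_addr) {
        for (int i = 0; i < VC_ENTRIES; i++)
            if (victim_cache[i].valid && victim_cache[i].addr == block_addr)
                return 1;   // swap this block back into L1
        return 0;           // true miss: go to L2
    }
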
Types of Cache Misses

• Compulsory misses: happen the first time a memory word is accessed – the misses for an infinite cache

• Capacity misses: happen because the program touched many other words before re-touching the same word – the misses for a fully-associative cache

• Conflict misses: happen because two words map to the same location in the cache – the misses generated while moving from a fully-associative to a direct-mapped cache

• Sidenote: can a fully-associative cache have more misses than a direct-mapped cache of the same size? (Yes – with LRU, a cyclic access pattern slightly larger than the cache misses every time in a fully-associative cache, while a direct-mapped cache can still get some hits.)

What Influences Cache Misses?

Effect of each change on the three miss types (cf. the following slide):

                              Compulsory    Capacity    Conflict
    Increasing cache capacity     –          reduces     reduces
    Increasing number of sets     –             –        reduces
    Increasing block size      reduces          –        increases
    Increasing associativity      –             –        reduces

Reducing Miss Rate

• Large block size – reduces compulsory misses, reduces miss penalty in case of spatial locality – increases traffic between different levels, space wastage, and conflict misses

• Large caches – reduce capacity/conflict misses – access time penalty

• High associativity – reduces conflict misses – rule of thumb: a 2-way cache of capacity N/2 has the same miss rate as a 1-way cache of capacity N – access time penalty

• Way prediction – by predicting the way, the access time is effectively like a direct-mapped cache – can also reduce power consumption

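A sketch of way prediction (tag_match and the predictor table are hypothetical):

    #include <stdint.h>

    #define SETS 256
    #define WAYS 4
    int predicted_way[SETS];   // trained by previous hits in each set

    int tag_match(uint32_t index, int way, uint32_t tag);   // hypothetical probe

    // Probe the predicted way first: on a correct prediction the access
    // reads one tag and one data way, like a direct-mapped cache.
    int access_way_predicted(uint32_t tag, uint32_t index) {
        int p = predicted_way[index];
        if (tag_match(index, p, tag)) return p;    // fast hit, low power
        for (int w = 0; w < WAYS; w++)             // mispredict: probe the rest
            if (w != p && tag_match(index, w, tag)) {
                predicted_way[index] = w;          // retrain the predictor
                return w;
            }
        return -1;                                 // miss
    }
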
Tolerating Miss Penalty

• Out-of-order execution: can do other useful work while waiting for the miss – can have multiple cache misses – the cache controller has to keep track of multiple outstanding misses (non-blocking cache)

• Hardware and software prefetching into prefetch buffers – aggressive prefetching can increase contention for buses

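Software prefetching can be expressed directly; for example, GCC and Clang provide __builtin_prefetch (the prefetch distance of 16 below is an illustrative guess that would be tuned to the miss latency):

    // Stream over an array, prefetching several elements ahead.
    long sum(const int *a, int n) {
        long s = 0;
        for (int i = 0; i < n; i++) {
            if (i + 16 < n)
                __builtin_prefetch(&a[i + 16]);   // hint only; harmless if useless
            s += a[i];
        }
        return s;
    }
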