CSC 258
Week 12
© Course Director-EECS2021
This Week’s Learning Goals
1. Describe how caches can be used to reduce the effective memory latency.
Week’s Plan
• Associativity
• Memory Hierarchy Performance
• Dependability (Time permitting)
• Memory System Summary
• Virtual Memory and TLB (Time permitting)
Recap:
Last week, we looked at locality and the memory hierarchy.
- Locality is the big idea: it allows us to build high-performance memory systems based on predicting what data we will need.
- We introduced the idea of the memory wall: increases in processor performance were making memory effectively further away (more processor cycles per access).
Finally, we examined a direct-mapped cache, and this week we’re going to build on that design.
Activity: Direct-Mapped Caching
Assume you have a direct-mapped cache that stores 4 blocks, each containing 4 words.
a) For an 8-bit address, indicate which bits indicate the block and which indicate the tag.
b) Given the following sequence of addresses, indicate the number of hits and misses and the final state of the cache:
80, 90, 84, 40, 84, 90, A4, B8, 44, 64, A4, B8
Activity ANS: Direct-Mapped Caching
Assume you have a direct-mapped cache that stores 4 blocks, each containing 4 words.
a) For an 8-bit address, indicate which bits indicate the block and which indicate the tag.
b) Given the following sequence of addresses, indicate the number of hits and misses and the final state of the cache:
80, 90, 84, 40, 84, 90, A4, B8, 44, 64, A4, B8
a) Bits [3:0] are the byte offset within a block (4 words = 16 bytes), bits [5:4] are the cache index, and bits [7:6] are the tag.
b) 3 hits, 9 misses. Each row shows the block address, cache index, hit/miss, and the contents (tag; block) of each cache line after the access:

Block  Index  Hit/miss  Line 0       Line 1       Line 2       Line 3
8      0      M         10; MEM[8]
9      1      M         10; MEM[8]   10; MEM[9]
8      0      H         10; MEM[8]   10; MEM[9]
4      0      M         01; MEM[4]   10; MEM[9]
8      0      M         10; MEM[8]   10; MEM[9]
9      1      H         10; MEM[8]   10; MEM[9]
A      2      M         10; MEM[8]   10; MEM[9]   10; MEM[A]
B      3      M         10; MEM[8]   10; MEM[9]   10; MEM[A]   10; MEM[B]
4      0      M         01; MEM[4]   10; MEM[9]   10; MEM[A]   10; MEM[B]
6      2      M         01; MEM[4]   10; MEM[9]   01; MEM[6]   10; MEM[B]
A      2      M         01; MEM[4]   10; MEM[9]   10; MEM[A]   10; MEM[B]
B      3      H         01; MEM[4]   10; MEM[9]   10; MEM[A]   10; MEM[B]
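A quick way to check traces like this is to simulate them. Below is a minimal Python sketch (not part of the slides; the names are illustrative) of the direct-mapped cache above.

# Minimal simulator for the cache above: 4 blocks of 4 words,
# so 16 bytes per block, 2 index bits, and the rest is the tag.
NUM_BLOCKS = 4
BLOCK_BYTES = 16

def simulate(addresses):
    cache = [None] * NUM_BLOCKS        # one tag (or None) per cache line
    hits = misses = 0
    for addr in addresses:
        block = addr // BLOCK_BYTES    # drop the 4 byte-offset bits
        index = block % NUM_BLOCKS     # low 2 bits of the block address
        tag = block // NUM_BLOCKS      # remaining high bits
        if cache[index] == tag:
            hits += 1
        else:
            misses += 1
            cache[index] = tag         # load the block, evicting the old one
    return hits, misses

trace = [0x80, 0x90, 0x84, 0x40, 0x84, 0x90,
         0xA4, 0xB8, 0x44, 0x64, 0xA4, 0xB8]
print(simulate(trace))  # (3, 9): 3 hits and 9 misses, as in the table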
Follow-up: Performance Evaluation
a) What is the miss rate of the sequence? With a 100-cycle miss penalty, what is the AMAT?
b) Is that miss rate a fair evaluation of the performance of the cache? Why or why not?
c) Using that miss rate and assuming (1) a CPI of 1 without memory stalls, (2) 40% load/stores, and (3) a 100-cycle miss penalty, what is the CPI with memory stalls using this cache?
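One possible answer sketch for a) and c), assuming a 1-cycle hit time and that the miss rate applies to instruction fetches as well as to loads/stores:
Miss rate = 9/12 = 75%, so AMAT = 1 + 0.75 × 100 = 76 cycles.
CPI with stalls = 1 + (1 + 0.4) × 0.75 × 100 = 106.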
Associativity
If two blocks hash to the same value, they can’t both be stored. To reduce the impact of this, caches are often associative.
A direct-mapped cache has associativity 1: a block can be placed in only one place in the cache.
A 2-way set associative cache can store two blocks that hash to the same value: there are two places where a block may be placed in the cache.
In a fully associative cache, a block can be placed in any location in the cache.
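To make the placement rule concrete, here is a minimal Python sketch (names are illustrative) that computes where a block may go for a given associativity; direct mapped is the 1-way case, and fully associative is the case where the number of ways equals the number of blocks.

# For a cache with num_blocks entries split into sets of `assoc` ways,
# a block may go in any way of exactly one set.
def placement(block_addr, num_blocks, assoc):
    num_sets = num_blocks // assoc
    set_index = block_addr % num_sets  # which set the block hashes to
    tag = block_addr // num_sets       # identifies the block within the set
    return set_index, tag

print(placement(12, 8, 1))  # direct mapped: (4, 1) -> only entry 4
print(placement(12, 8, 2))  # 2-way: (0, 3) -> either way of set 0
print(placement(12, 8, 8))  # fully associative: (0, 12) -> any entry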
Associative Caches
Fully associative
• Allow a given block to go in any cache entry
• Requires all entries to be searched at once
• Comparator per entry (expensive)
Associative Cache Example
Spectrum of Associativity
For a cache with 8 entries
Associativity Example
Compare 4-block caches for three configurations: direct mapped, 2-way set associative, and fully associative.
4-way Set Associative Cache Design
Associativity Doesn’t Always Mean “Better”
Designers need to balance the amount of associativity with the expected workload.
Example: SPEC2000
Increased associativity decreases miss rate, with diminishing returns.
Cache Eviction
Every load brings in a block.
• Each cache has a finite size: it can store some maximum number of blocks.
• Based on associativity, it can store a set number of blocks with a specific hash.
• Every time a load is performed from memory, the block must be stored.
• This means that another block might need to be evicted.
Replacement Policy
Direct mapped: there is no choice, so no policy is needed.
Set associative:
• Prefer to evict an invalid (empty) entry, if there is one.
• Otherwise, we must choose among the entries in the set.
Least-recently used (LRU):
• Choose the entry unused for the longest time (see the sketch below).
• True LRU is hard to implement, so approximations are used.
Random:
• Gives approximately the same performance as LRU at high associativity.
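A minimal Python sketch of exact LRU within one set (illustrative; real hardware typically approximates this with a few status bits per entry):

# LRU within one set, modeled as a list ordered from least- to
# most-recently used: refresh on a hit, evict from the front when full.
def access(lru_set, tag, assoc):
    if tag in lru_set:                 # hit: refresh recency
        lru_set.remove(tag)
        lru_set.append(tag)
        return "hit"
    if len(lru_set) == assoc:          # set full: evict LRU (front)
        lru_set.pop(0)
    lru_set.append(tag)                # insert as most recently used
    return "miss"

s = []
for t in [8, 4, 8, 10, 4]:             # one 2-way set
    print(t, access(s, t, 2))
# 8 miss, 4 miss, 8 hit, 10 miss (evicts 4), 4 miss (evicts 8)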
Activity: Associative Caching
Assume you have a 2-way set associative cache that stores a total of 4 blocks, each containing 4 words. LRU is used to evict items.
a) For an 8-bit address, indicate which bits indicate the block and which indicate the tag.
b) Given the following sequence of addresses, indicate the number of hits and misses and the final state of the cache:
80, 90, 84, 40, 84, 90, A4, B8, 44, 64, A4, B8
Activity ANS: Associative Caching
Assume you have a 2-way set associative cache that stores a total of 4 blocks, each containing 4 words. LRU is used to evict items.
a) For an 8-bit address, indicate which bits indicate the block and which indicate the tag.
b) Given the following sequence of addresses, indicate the number of hits and misses and the final state of the cache:
80, 90, 84, 40, 84, 90, A4, B8, 44, 64, A4, B8
a) Bits [3:0] are the byte offset (4 words = 16 bytes), bit [4] is the set index (2 sets of 2 ways), and bits [7:5] are the tag.
b) 4 hits, 8 misses. Each row shows the block address, set index, hit/miss, and the contents (tag; block) of each way after the access:

Block  Set  Hit/miss  Set 0, way 0   Set 0, way 1   Set 1, way 0   Set 1, way 1
8      0    M         100; MEM[8]
9      1    M         100; MEM[8]                   100; MEM[9]
8      0    H         100; MEM[8]                   100; MEM[9]
4      0    M         100; MEM[8]    010; MEM[4]    100; MEM[9]
8      0    H         100; MEM[8]    010; MEM[4]    100; MEM[9]
9      1    H         100; MEM[8]    010; MEM[4]    100; MEM[9]
A      0    M         100; MEM[8]    101; MEM[A]    100; MEM[9]
B      1    M         100; MEM[8]    101; MEM[A]    100; MEM[9]    101; MEM[B]
4      0    M         010; MEM[4]    101; MEM[A]    100; MEM[9]    101; MEM[B]
6      0    M         010; MEM[4]    011; MEM[6]    100; MEM[9]    101; MEM[B]
A      0    M         101; MEM[A]    011; MEM[6]    100; MEM[9]    101; MEM[B]
B      1    H         101; MEM[A]    011; MEM[6]    100; MEM[9]    101; MEM[B]
Follow-up: Performance Evaluation
Based on the hits and misses from the breakout:
a) What is the miss rate of the sequence? With a 100-cycle miss penalty, what is the AMAT?
b) Is that miss rate a fair evaluation of the performance of the cache? Why or why not?
c) Using that miss rate and assuming (1) a CPI of 1 without memory stalls, (2) 40% load/stores, and (3) a 100-cycle miss penalty, what is the CPI with memory stalls using this cache?
How does this compare with the direct mapped cache from earlier?
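A possible worked comparison, under the same assumptions as before: the 2-way cache has 4 hits and 8 misses, so the miss rate falls from 9/12 (75%) to 8/12 (about 67%). AMAT = 1 + (8/12) × 100 ≈ 67.7 cycles (versus 76), and CPI ≈ 1 + 1.4 × (8/12) × 100 ≈ 94.3 (versus 106): the extra associativity helps on this trace.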
Activity: Associative Caching
Earlier, I mentioned that “some workloads actually benefit from less associativity.”
For the direct mapped and associative caches we’ve seen so far …
b) Generate a new sequence that performs much better on the direct-mapped cache.
Multilevel Caches
Modern memory systems are composed of a sequence of caches.
• Level-1: Primary cache attached to CPU is small and fast
• Level-2: Services misses from primary cache and stores more
• Main memory services Level-2 cache misses
Multilevel Cache Considerations
L1 cache: the focus is on minimizing hit time.
L2 cache: the focus is on reducing the miss rate, to avoid the penalty of a main memory access. Hit time matters less here: it is small compared to a main memory access.
Writeback Policy
On a store, when is the lower level updated? Write-through updates it on every store; write-back marks the block dirty and updates the lower level only when the block is evicted.
Multilevel Cache Example
Given …
CPU base CPI = 1, clock rate = 4 GHz
Primary cache miss rate per instruction = 2%
Main memory access time = 100 ns
Now add an L2 cache with a 20-cycle access time and a 0.5% global miss rate to main memory. What is the total CPI?
Multilevel Cache Example, Continued
At 4 GHz, one cycle is 0.25 ns, so the 100 ns main-memory access costs 400 cycles.
Total CPI = Base CPI + Primary stalls per instruction + Secondary stalls per instruction
= 1 + 2% × 20 + 0.5% × 400
= 1 + 0.02 × 20 + 0.005 × 400 = 3.4
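For comparison, with only the primary cache every primary miss goes to main memory: CPI = 1 + 0.02 × 400 = 9. Adding the L2 cache therefore improves performance by a factor of 9 / 3.4 ≈ 2.6.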
Dependability
Performance is only one consideration when designing a memory system.
We also need to consider dependability.
Dependability
If a value is stored at an address, then when that address is loaded, the same value will be returned.
Dependability Measures
Reliability: mean time to failure (MTTF)
Service interruption: mean time to repair (MTTR)
Mean time between failures: MTBF = MTTF + MTTR
Availability = MTTF / (MTTF + MTTR) (a numeric example follows below)
Improving availability:
• Increase MTTF: fault avoidance, fault tolerance, fault forecasting
• Reduce MTTR: improved tools and processes for diagnosis and repair
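A hypothetical numeric example: with MTTF = 1,000,000 hours and MTTR = 24 hours, availability = 1,000,000 / 1,000,024 ≈ 99.998%.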
Increasing Dependability
Most memory systems today use error-correcting codes (ECC) to detect and correct single-bit (and sometimes multi-bit) errors in stored data.
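As a minimal illustration of the idea, here is a sketch of simple even parity, which only detects single-bit errors; real memories use stronger codes such as SECDED Hamming codes, which can also correct them.

# Even parity: store one extra bit so the total number of 1 bits is
# even. Any single flipped bit changes the parity and is detected.
def parity(word):
    return bin(word).count("1") % 2

data = 0b10110010
stored_parity = parity(data)

corrupted = data ^ (1 << 3)                 # flip one stored bit
print(parity(corrupted) != stored_parity)   # True: error detected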
Cache Coherence
We will not discuss coherence in detail in this course. It’s a huge topic. But be aware that it is the major issue that makes caching in parallel systems difficult.
Cache coherence refers to the uniformity of shared data stored across the memory system.
• Example: when a value is stored, how is the stored value propagated back to memory and/or other caches?
Types of Misses: Designing with the Three C’s
Compulsory misses
• First access to a block
Capacity misses
• Occur when a block that was replaced is accessed later
• Caused by limited cache size
Conflict misses (or collision misses)
• Occur when two blocks compete for space in the cache and evict each other
• Would not occur in a fully associative cache of the same total size
Cache Design Trade-offs
• Larger cache: fewer capacity misses, but potentially longer hit time
• Higher associativity: fewer conflict misses, but potentially longer hit time
• Larger blocks: fewer compulsory misses, but higher miss penalty
Activity: Designing a Cache
You are designing a processor and currently have a 2-way, 32-entry set-associative cache that stores 8-word blocks.
The processor design is currently being tested. What is your response when you are told that the following things are occurring?
a) A benchmark never completely fills the cache but has a large number of misses; the same lines appear to be reloaded again and again.
b) A benchmark is reading data sequentially from a file, and miss rates are high: it appears to be loading one line at a time and not reusing data.
c) Tricky: on a very short benchmark, miss rates are very high – near 50%.
Virtual Memory
• Use main memory as a “cache” for secondary (disk) storage
• Managed jointly by CPU hardware and the operating system (OS)
• Programs share main memory
• Each gets a private virtual address space holding its frequently used code and data
• Protected from other programs
• CPU and OS translate virtual addresses to physical addresses
• A VM “block” is called a page
• A VM translation “miss” is called a page fault
Address Translation
Fixed-size pages (e.g., 4K)
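A minimal Python sketch of the translation step (the page-table contents here are a made-up example):

# Translate a virtual address using fixed 4 KiB pages: the low 12 bits
# are the page offset and pass through unchanged.
PAGE_SIZE = 4096
page_table = {0x2: 0x7, 0x3: 0x1}   # virtual page number -> physical page number

def translate(vaddr):
    vpn, offset = divmod(vaddr, PAGE_SIZE)
    if vpn not in page_table:
        raise RuntimeError("page fault: the OS must bring the page in")
    return page_table[vpn] * PAGE_SIZE + offset

print(hex(translate(0x2ABC)))  # 0x7abc: VPN 0x2 maps to PPN 0x7, same offset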
Page Fault Penalty
• On page fault, the page must be fetched from disk
• Takes millions of clock cycles
• Handled by OS code
• Try to minimize page fault rate
• Fully associative placement
• Smart replacement algorithms
Page Tables
• Stores placement information
• Array of page table entries (PTEs), indexed by virtual page number
• Page table register in the CPU points to the page table in physical memory
• If a page is present in memory:
• The PTE stores the physical page number
• Plus other status bits (referenced, dirty, …)
• If a page is not present:
• The PTE can refer to a location in swap space on disk
Replacement and Writes
• To reduce the page fault rate, prefer least-recently used (LRU) replacement
• Reference bit (aka use bit) in PTE set to 1 on access to the page
• Periodically cleared to 0 by the OS
• A page with reference bit = 0 has not been used recently (see the sketch below)
• Disk writes take millions of cycles
• Write a block at a time, not individual locations
• Write-through is impractical
• Use write-back
• Dirty bit in PTE set when page is written
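A sketch of how the reference bit approximates LRU (illustrative; the OS’s actual bookkeeping is more involved):

# The OS periodically clears every reference bit; a page whose bit is
# still 0 at eviction time has not been touched since the last sweep.
page_table = {0x2: {"ref": 0}, 0x3: {"ref": 0}, 0x5: {"ref": 0}}

def touch(vpn):
    page_table[vpn]["ref"] = 1    # hardware sets this on every access

def pick_victim():
    for vpn, pte in page_table.items():
        if pte["ref"] == 0:       # prefer a page not recently used
            return vpn
    return next(iter(page_table)) # all recently used: fall back to any

touch(0x2)
touch(0x5)
print(hex(pick_victim()))         # 0x3: the only page not referenced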
Fast Translation Using a TLB
• Address translation would appear to require extra memory references
• One to access the PTE
• Then the actual memory access
• But access to page tables has good locality
• So, use a fast cache of PTEs within the CPU
• Called a Translation Lookaside Buffer (TLB)
• Typical: 16–512 PTEs, 0.5–1 cycle for a hit, 10–100 cycles for a miss, 0.01%–1% miss rate
• Misses could be handled by hardware or software
TLB Misses
• If page is in memory:
• Load the PTE from memory and retry (see the sketch below)
• Could be handled in hardware
• Can get complex for more complicated page-table structures
• Or in software
• Raise a special exception, with an optimized handler
• If page is not in memory (page fault):
• OS handles fetching the page and updating the page table
• Then restart the faulting instruction
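A minimal sketch of the lookup order (TLB first, then the page table; the structures here are toy stand-ins):

# The TLB caches page-table entries so most translations avoid the
# extra memory access for the PTE.
tlb = {}
page_table = {0x2: 0x7, 0x3: 0x1}  # virtual page -> physical page

def lookup(vpn):
    if vpn in tlb:                  # TLB hit: translate immediately
        return tlb[vpn]
    if vpn in page_table:           # TLB miss, page present:
        tlb[vpn] = page_table[vpn]  # load the PTE into the TLB and retry
        return tlb[vpn]
    raise RuntimeError("page fault: OS fetches the page, then restarts")

print(lookup(0x2))  # miss: filled from the page table (prints 7)
print(lookup(0x2))  # hit: served from the TLB (prints 7)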
Quiz-like Question 2
Simulate the performance of a cache on the following (hex) address loads:
40 48 4c 40 50 58 5c 40 60 48 4c 44 40 60 58 5c
The cache is direct-mapped and stores 2 blocks of 2 words each. It uses a FIFO eviction policy.
Quiz-like Question 3
Simulate the performance of a cache on the following (hex) address loads:
40 48 4c 40 50 58 5c 40 60 48 4c 44 40 60 58 5c
The cache is 2-way set associative and stores 4 words. It uses a FIFO eviction policy.
Quiz-like Question 4
Simulate the performance of a cache on the following (hex) address loads:
40 48 4c 40 50 58 5c 40 60 48 4c 44 40 60 58 5c
The cache is 2-way set associative and stores 4 words. It uses an LRU eviction policy.
Quiz-like Question 5
Simulate the performance of a cache on the following (hex) address loads:
40 48 4c 40 50 58 5c 40 60 48 4c 44 40 60 58 5c
The cache is fully associative and stores 4 words. It uses a FIFO eviction policy.
Coming Up
Don’t Forget!
• Practice problems
• #5.7*, 5.8, 5.10*, 5.11*, 5.12*
• #1.12*, 1.13*, 1.14*, 1.15*
• Final exam review (based on Chapters 1–5) is available.
All the best for exams!
Thank you