CH13 DRAM Controller
Prof. Ren-Shuo Liu
Outline
• DRAM controller model
• Controller's strategies
• Row buffer management
• Address mapping
• Command scheduling
DRAM Controller Model
(figure: DRAM controller model)
Row Buffer Management
• Sense amplifiers can act as buffers (i.e., row buffers)
to provide temporary data storage
• The row buffer significantly affects DRAM system performance
• Accessing an open row is fast (i.e., a row buffer hit)
• Accessing a different row requires a precharge and an activation (see the latency comparison below)
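A first-order latency comparison that the later policy analysis builds on (standard DRAM timing terms; queuing and data transfer time ignored):

\[
t_{\text{row hit}} = t_{CAS}, \qquad
t_{\text{row miss}} = t_{RP} + t_{RCD} + t_{CAS}
\]

i.e., a miss to a bank whose row buffer holds a different row pays a precharge (tRP) and an activation (tRCD) before the column access (tCAS).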
Open-Page Policy
• Leave a row open after it has been accessed
• Anticipate future temporally and spatially adjacent
memory accesses to the same row
• Exploit applications' locality
• Achieve the minimal row hit latency (tCAS)
Close-Page Policy
• Close a page immediately after it has been accessed
• Favor systems with low degrees of access locality
• The precharge is performed as soon as possible
• Reduce the row-miss latency (the precharge has already been done when the next request arrives)
Hybrid (Dynamic) Policy
• Neither a strictly open-page policy nor a strictly close-page policy achieves the best performance
• Modern controllers typically adopt a dynamic combination of the two policies
• Runtime behavior, including access locality and request rate, is taken into account
• Examples
• Hit rate-aware hybrid policy
• Time-aware hybrid policy
Hit Rate-Aware Hybrid Policy
• Controller switches between the two policies
• If the row-hit probability is greater than a threshold, switch to open-page
• Otherwise, switch to close-page
Hit Rate-Aware Hybrid Policy
• Threshold selection based on a simple analysis
(plot: access latency vs. row-hit probability, from 0 to 1; the latency approaches tCAS as the hit probability approaches 1, and the switching threshold is marked at tRP / (tRCD + tRP))
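The threshold in the plot follows from a back-of-the-envelope comparison of expected latencies, assuming a row hit costs tCAS, an open-page miss costs tRP + tRCD + tCAS, and a close-page access always costs tRCD + tCAS (queuing and bus time ignored). Setting the two policies' expected latencies equal at hit probability p:

\[
p\,t_{CAS} + (1-p)\,(t_{RP} + t_{RCD} + t_{CAS}) \;=\; t_{RCD} + t_{CAS}
\quad\Longrightarrow\quad
p \;=\; \frac{t_{RP}}{t_{RCD} + t_{RP}}
\]

Above this hit probability the open-page policy wins on average; below it the close-page policy wins.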
Time-Aware Hybrid Policy
• Concept
• Prevent rows from staying open too long, which wastes power
• Mechanism
• A timer is set to a predetermined value when a row is
activated
• The timer counts down
• When the timer reaches zero, a precharge command is
issued to precharge the bank
• In case of a row buffer hit to an open bank, the timer is reset to a higher value (sketched below)
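A minimal per-bank sketch of this timer mechanism in C; the timeout constants and function names are illustrative, not taken from the slides.

#include <stdbool.h>
#include <stdint.h>

#define INITIAL_TIMEOUT 16u   /* cycles a newly activated row stays open (illustrative) */
#define HIT_TIMEOUT     64u   /* longer timeout granted after a row-buffer hit (illustrative) */

struct bank_state {
    bool     row_open;
    uint32_t timer;           /* counts down; reaching zero triggers a precharge */
};

/* Called when an activate command opens a row in this bank. */
static void on_activate(struct bank_state *b) {
    b->row_open = true;
    b->timer    = INITIAL_TIMEOUT;
}

/* Called when a column read/write hits the open row. */
static void on_row_hit(struct bank_state *b) {
    b->timer = HIT_TIMEOUT;   /* reward locality: keep the row open longer */
}

/* Called every controller cycle; returns true when a precharge should be issued. */
static bool tick(struct bank_state *b) {
    if (!b->row_open)
        return false;
    if (b->timer > 0)
        b->timer--;
    if (b->timer == 0) {
        b->row_open = false;  /* close the row: issue the precharge command */
        return true;
    }
    return false;
}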
Address Mapping
(figure: the controller maps each physical address onto channel, rank, bank, row, and column addresses)
Address Mapping
• Considerations
• Minimize the probability of bank conflicts in temporally
adjacent requests
• Maximize the row hit ratio
• Maximize the parallelism
• Available parallelism in memory systems
• Channel
• Bank
• Rank
Address Mapping
• Channel-level parallelism
• The DRAM memory system imposes no restrictions on requests issued to different logical channels
• Mapping consecutive cache lines to different channels
maximizes the parallelism of sequential accesses
• Mapping nearby cache lines to the same row maximizes
the row hit chances
Address Mapping
• Rank-level and bank-level parallelism
• Consecutive memory accesses can proceed in parallel to
different ranks or different banks
• In general, however, scheduling consecutive accesses to different banks of a given rank is more efficient than scheduling them to different ranks
• This is because switching between ranks incurs the rank-to-rank switching latency, tRTRS
Address Mapping Examples
• Virtual address: 36 bits; virtual page: 4 KB; bytes per cache line: 64
• Physical memory: 4 GB; channels: 2; ranks per channel: 2; banks per rank: 8
• Columns per row: 8 K; bytes per column: 1
• The virtual address (24-bit VPN + 12-bit page offset) is translated by the TLB into a 32-bit physical address (20-bit PPN + 12-bit page offset)
• The DRAM controller then maps the 32-bit physical address onto channel, rank, bank, row, and column addresses
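As a sanity check (not in the original figure), the DRAM address field widths used in the next two mappings follow directly from these parameters:

\[
\begin{aligned}
\text{channel} &: \log_2 2 = 1 \text{ bit} & \text{rank} &: \log_2 2 = 1 \text{ bit} & \text{bank} &: \log_2 8 = 3 \text{ bits} \\
\text{column} &: \log_2 8192 = 13 \text{ bits} & \text{row} &: 32 - (1+1+3+13) = 14 \text{ bits}
\end{aligned}
\]

The low log2(64) = 6 column bits select the byte within a cache line, and the widths total 32 bits, matching the 4 GB physical address space.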
Baseline Close-Page Mapping
• Same system parameters as the previous slide
• The 32-bit physical address is split, from MSB to LSB, into row (14 bits) : column high bits (7) : rank (1) : bank (3) : channel (1) : column low bits / byte within the cache line (6)
• Placing the channel, bank, and rank bits just above the cache-line offset spreads consecutive cache lines across channels, banks, and ranks, maximizing parallelism for sequential accesses
Baseline Open-Page Mapping
• Same system parameters as above
• The 32-bit physical address is split, from MSB to LSB, into row (14 bits) : rank (1) : bank (3) : column high bits (7) : channel (1) : column low bits / byte within the cache line (6)
• Keeping the column bits low in the address maps nearby cache lines into the same row of the same bank, maximizing row-hit chances, while consecutive cache lines still alternate across the two channels (both splits are decoded in the C sketch below)
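A small C sketch that decodes a physical address under the two baseline splits above; the struct and function names are illustrative, not part of the slides.

#include <stdint.h>

struct dram_addr {
    unsigned channel, rank, bank, row, column;   /* column includes the byte-in-line bits */
};

/* Baseline close-page split: row(14) : col_hi(7) : rank(1) : bank(3) : chan(1) : col_lo(6) */
static struct dram_addr decode_close_page(uint32_t pa) {
    struct dram_addr d;
    unsigned col_lo = pa & 0x3F;             /* bits  5..0  */
    d.channel       = (pa >> 6)  & 0x1;      /* bit   6     */
    d.bank          = (pa >> 7)  & 0x7;      /* bits  9..7  */
    d.rank          = (pa >> 10) & 0x1;      /* bit   10    */
    unsigned col_hi = (pa >> 11) & 0x7F;     /* bits 17..11 */
    d.row           = (pa >> 18) & 0x3FFF;   /* bits 31..18 */
    d.column        = (col_hi << 6) | col_lo;
    return d;
}

/* Baseline open-page split: row(14) : rank(1) : bank(3) : col_hi(7) : chan(1) : col_lo(6) */
static struct dram_addr decode_open_page(uint32_t pa) {
    struct dram_addr d;
    unsigned col_lo = pa & 0x3F;             /* bits  5..0  */
    d.channel       = (pa >> 6)  & 0x1;      /* bit   6     */
    unsigned col_hi = (pa >> 7)  & 0x7F;     /* bits 13..7  */
    d.bank          = (pa >> 14) & 0x7;      /* bits 16..14 */
    d.rank          = (pa >> 17) & 0x1;      /* bit   17    */
    d.row           = (pa >> 18) & 0x3FFF;   /* bits 31..18 */
    d.column        = (col_hi << 6) | col_lo;
    return d;
}

With a 64-byte cache-line stride, both splits toggle the channel bit between consecutive lines; under the open-page split the next address bits are column bits (same bank, same row, so likely row hits), whereas under the close-page split they are bank and rank bits (so the lines spread across banks and ranks).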
Optimization Goals
• Performance (delay)
• Sum of execution times for all involved programs
• Energy-delay product
• Sum of EDPs for all involved programs
• Fairness
• Compute the slowdown for each program, relative to its
single-thread execution
• Fairness metric is the ratio of the max slowdown to the
min slowdown
Example Schedulers
• Preliminary schedulers
• First-come, first-serve (FCFS)
• Open-page, first-ready, first-come-first-serve (FR-FCFS)
• Close-page
• Power-down
• First-ready-round-robin (FRRR)
• Credit-fair
• MLP-aware
• PAR-BS
First-Come First-Serve (FCFS)
• Algorithm
• Read queue is ordered by request arrival time
• Every cycle, the scheduler scans the read queue sequentially until it finds a request that can issue in the current cycle (see the sketch below)
• When the write queue size exceeds a high water mark,
writes are drained similarly until a low water mark is
reached
• Writes are also drained if there are no pending reads
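A simplified C-style sketch of this FCFS policy; the queue variables, water-mark constants, and the can_issue()/issue() helpers are hypothetical stand-ins for controller internals.

#include <stdbool.h>
#include <stddef.h>

#define WR_HIGH_WATERMARK 48   /* start draining writes (illustrative) */
#define WR_LOW_WATERMARK  16   /* stop draining writes (illustrative)  */

struct request;                                  /* a queued read or write */
extern struct request *read_q[];  extern size_t read_q_len;
extern struct request *write_q[]; extern size_t write_q_len;
extern bool can_issue(const struct request *r);  /* bank/bus timing allows it this cycle? */
extern void issue(struct request *r);

static bool draining_writes = false;

/* Called once per controller cycle. */
void fcfs_schedule(void) {
    /* Hysteresis on the write queue: drain between high and low water marks,
       or opportunistically when there are no pending reads. */
    if (write_q_len >= WR_HIGH_WATERMARK || read_q_len == 0)
        draining_writes = true;
    if (write_q_len <= WR_LOW_WATERMARK && read_q_len > 0)
        draining_writes = false;

    struct request **q = draining_writes ? write_q : read_q;
    size_t           n = draining_writes ? write_q_len : read_q_len;

    /* Scan in arrival order; issue the oldest request that can go this cycle. */
    for (size_t i = 0; i < n; i++) {
        if (can_issue(q[i])) {
            issue(q[i]);
            return;
        }
    }
}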
Close-Page
• Algorithm
• Mainly based on FR-FCFS
• In every idle cycle, the scheduler issues precharge operations to banks that last serviced a column read/write (see the sketch below)
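A minimal sketch of the idle-cycle precharge step layered on top of an FR-FCFS scheduler; the bank bookkeeping arrays and helper names are assumptions.

#include <stdbool.h>

#define NUM_BANKS 32   /* 2 channels x 2 ranks x 8 banks in the running example */

extern bool fr_fcfs_schedule(void);           /* returns true if it issued a command this cycle */
extern bool bank_row_open[NUM_BANKS];         /* is a row currently open in this bank?          */
extern bool bank_did_column_op[NUM_BANKS];    /* was the bank's last command a column RD/WR?    */
extern bool can_precharge(int bank);          /* tRAS/tWR and other timing satisfied?           */
extern void issue_precharge(int bank);

/* Close-page variant: if FR-FCFS found nothing to do this cycle,
   use the idle cycle to close banks that just finished a column access. */
void close_page_schedule(void) {
    if (fr_fcfs_schedule())
        return;                               /* not an idle cycle */
    for (int b = 0; b < NUM_BANKS; b++) {
        if (bank_row_open[b] && bank_did_column_op[b] && can_precharge(b)) {
            issue_precharge(b);
            bank_row_open[b] = false;
            return;                           /* one command per cycle */
        }
    }
}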
Power-Down
• Algorithm
• Issues power-down commands in every idle cycle
First-Ready-Round-Robin
• Algorithm
• First tries to issue any open row hits with the “correct”
thread-id (as defined by the current round robin flag)
• Then other row hits
• Then row misses with the “correct” thread-id
• Finally, a random request
• Effects
• Combines the latency benefit of open-row hits with the fairness of a round-robin scheduler (priority order sketched below)
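A sketch of the four-level priority search described above; the request/queue types and helper predicates are illustrative.

#include <stdbool.h>
#include <stddef.h>

struct request { int thread_id; /* ... */ };

extern struct request *read_q[]; extern size_t read_q_len;
extern int  rr_thread;                          /* current round-robin flag */
extern bool can_issue(const struct request *r);
extern bool is_row_hit(const struct request *r);
extern void issue(struct request *r);

void frrr_schedule(void) {
    struct request *row_hit_rr = NULL, *row_hit_any = NULL,
                   *miss_rr    = NULL, *any         = NULL;

    for (size_t i = 0; i < read_q_len; i++) {
        struct request *r = read_q[i];
        if (!can_issue(r))
            continue;
        bool hit = is_row_hit(r);
        bool rr  = (r->thread_id == rr_thread);
        if (hit && rr  && !row_hit_rr)  row_hit_rr  = r;  /* 1st: row hit, "correct" thread  */
        if (hit        && !row_hit_any) row_hit_any = r;  /* 2nd: any row hit                */
        if (!hit && rr && !miss_rr)     miss_rr     = r;  /* 3rd: row miss, "correct" thread */
        if (!any)                       any         = r;  /* 4th: fall back to any issuable
                                                             request (the slides say random;
                                                             first found is used here)       */
    }

    struct request *pick = row_hit_rr  ? row_hit_rr  :
                           row_hit_any ? row_hit_any :
                           miss_rr     ? miss_rr     : any;
    if (pick)
        issue(pick);
}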
Credit-Fair
• Algorithm
• For every channel, this algorithm maintains a set of counters
for credits for each thread
• When scheduling reads, the thread with the most credits is
chosen
• Reads that will be open row hits get a 50% bonus to their
number of credits for that round of arbitration
• When a column read command is issued, that thread’s total
number of credits for using that channel is cut in half
• Each cycle all threads gain one credit
• Write queue draining happens in an FR-FCFS manner
• Effects
• Threads with infrequent DRAM reads store up credits over many cycles, so they have priority when they eventually need them (see the sketch below)
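A rough sketch of the per-channel credit bookkeeping; the constants, array sizes, and helpers are assumptions, and the FR-FCFS write-queue draining mentioned in the slides is omitted.

#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define MAX_THREADS 16

struct request { int thread_id; /* ... */ };

extern struct request *read_q[]; extern size_t read_q_len;
extern bool can_issue(const struct request *r);
extern bool is_row_hit(const struct request *r);
extern void issue_column_read(struct request *r);

static uint64_t credits[MAX_THREADS];   /* one counter set per channel in a real design */

void credit_fair_schedule_reads(void) {
    /* Every cycle, each thread earns one credit for this channel. */
    for (int t = 0; t < MAX_THREADS; t++)
        credits[t]++;

    struct request *best = NULL;
    uint64_t best_score = 0;
    for (size_t i = 0; i < read_q_len; i++) {
        struct request *r = read_q[i];
        if (!can_issue(r))
            continue;
        uint64_t score = credits[r->thread_id];
        if (is_row_hit(r))
            score += score / 2;          /* open-row hits get a 50% credit bonus for this round */
        if (score > best_score) {
            best_score = score;
            best = r;
        }
    }

    if (best) {
        issue_column_read(best);
        credits[best->thread_id] /= 2;   /* issuing a column read halves that thread's credits */
    }
}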
MLP-Aware
• Algorithm
• Assumes that threads with many outstanding misses (high memory-level parallelism, MLP) are not as limited by memory access time
• Prioritizes requests from low-MLP threads over those from high-MLP threads (one possible scoring is sketched below)
• To support fairness, a request’s wait time in the queue is
also considered
• Writes are handled as in FCFS, with appropriate high and
low water marks
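One possible way to combine MLP and queue waiting time into a priority score, purely as an illustration; the slides do not specify the exact formula, and the weights below are arbitrary.

#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

struct request { int thread_id; uint64_t arrival_cycle; /* ... */ };

extern struct request *read_q[]; extern size_t read_q_len;
extern bool     can_issue(const struct request *r);
extern unsigned outstanding_misses(int thread_id);   /* current MLP of the thread */
extern uint64_t now_cycle(void);
extern void     issue(struct request *r);

#define WAIT_WEIGHT 1    /* how strongly waiting time offsets the MLP penalty (illustrative) */
#define MLP_WEIGHT  64   /* penalty per outstanding miss (illustrative)                      */

void mlp_aware_schedule(void) {
    struct request *best = NULL;
    int64_t best_score = INT64_MIN;

    for (size_t i = 0; i < read_q_len; i++) {
        struct request *r = read_q[i];
        if (!can_issue(r))
            continue;
        int64_t wait = (int64_t)(now_cycle() - r->arrival_cycle);
        /* Low-MLP threads get higher priority; long-waiting requests catch up for fairness. */
        int64_t score = WAIT_WEIGHT * wait - MLP_WEIGHT * (int64_t)outstanding_misses(r->thread_id);
        if (score > best_score) {
            best_score = score;
            best = r;
        }
    }
    if (best)
        issue(best);
}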
Parallelism-Aware Batch Scheduling (PAR-BS)
• Recall FR-FCFS
• Exploits the latency benefit when successive requests hit the same row buffer
• PAR-BS is a more sophisticated scheduling policy
• Improves average latency, speedup, and fairness among threads
• Major policies
• Batch formation
• Request prioritization
• Thread ranking
• Misc.
Policy: Batch Formation
Policy: Request Prioritization
Policy: Thread Ranking