Spos Unit 6
Spos Unit 6
Source: cse.iitkgp.ac.in/~bivasm/os_notes/memory_v3.pptx
SYLLABUS :
Introduction: Memory Management concepts, Memory Management
requirements.
Memory Partitioning: Fixed Partitioning, Dynamic Partitioning, Buddy
Systems Fragmentation, Paging, Segmentation, Address translation.
Placement Strategies: First Fit, Best Fit, Next Fit and Worst Fit.
Virtual Memory (VM): Concepts, Swapping, VM with Paging, Page
Table Structure, Inverted Page Table, Translation Look aside Buffer,
Page Size, VM with Segmentation, VM with Combined paging and
segmentation.
Page Replacement Policies: First In First Out (FIFO), Last Recently
Used(LRU), Optimal, Thrashing.
Conten
• Memory management:
t
• Review of Programming Model of Intel 80386,
• Contiguous and non-contiguous,
• Swapping,
• Paging,
• Segmentation,
• Segmentation with Paging.
• Virtual Memory:
– Background,
– Demand paging,
– Page replacement scheme-
• FIFO,
• LRU,
• Optimal,
• Thrashing.
• Case Study: Memory Management in multi-cores OS.
Memory management
• We have seen how CPU can be shared by a set
of processes
– Improve system performance
– Process management
• Need to keep several process in memory
– Share memory
• Learn various techniques to manage memory
– Hardware dependent
Memory management
What are we going to learn?
• Basic Memory Management: logical vs.
physical address space, protection, contiguous
memory allocation, paging, segmentation,
segmentation with paging.
1. Protect OS
2. Protect user processes
Base and Limit Registers
• A pair of base and limit registers define
the logical address space
Hardware Address Protection with Base and Limit Registers
14000
Relocatabl
e code
Contiguous Allocation
• Relocation registers used to protect user processes from each other, and from changing
operating-system code and data
• Relocation register contains value of smallest physical address
• Limit register contains range of logical addresses – each logical address must be
less
than the limit register
• Context switch
• MMU maps logical address dynamically
Fragmentation
• Processes loaded and removed from memory
– Memory is broken into little pieces
m-n n
– For given logical address space 2m and page size
2n
Paging Hardware
Paging Example
Logical address
0 (0*4+0)
Logical address = 16 Physical
Page size=4 address:
Physical (5*4+0)=20
memory=32 Logical address 3
(0*4+3)
Physical
address:
(5*4+0)=23
Logical address
User’s 4 (1*4+0)
view Physical
address:
Run time address binding (6*4+0)=24
Logical address 13
(3*4+1)
Physical
address:
n=2 and m=4 32-byte (2*4+1)
memory and 4-byte pages
Paging
• External fragmentation??
• Calculating internal fragmentation
– Page size = 2,048 bytes
– Process size = 72,766 bytes
– 35 pages + 1,086 bytes
– Internal fragmentation of 2,048 - 1,086 = 962 bytes
• So small frame sizes desirable?
– But increases the page table size
– Poor disk I/O
– Page sizes growing over time
• Solaris supports two page sizes – 8 KB and 4 MB
• User’s view and physical memory now very different
– user view=> process contains in single contiguous memory space
• By implementation process can only access its own memory
– protection
• Each page table entry 4 bytes (32 bits) long
• Each entry can point to 232 page frames
• If each frame is 4 KB
• The system can address 244 bytes (16TB) of
physical memory
Use’s
System’s
view
view
RAM RAM
Before allocation After allocation
Implementation of Page Table
• For each process, Page table is kept in main memory
• Page-table base register (PTBR) points to the page table
• Page-table length register (PTLR) indicates size of the
page table
• In this scheme every data/instruction access requires two
memory accesses
– One for the page table and one for the data / instruction
• The two memory access problem can be solved by
the use of a special fast-lookup hardware cache called
associative memory or translation look-aside buffers
(TLBs)
Associative memory
Associative Memory
• Associative memory – parallel search
Page # Frame #
• On a TLB miss, value is loaded into the TLB for faster access next time
– Replacement policies must be considered (LRU)
– Some entries can be wired down for permanent fast access
• Some TLBs store address-space identifiers (ASIDs) in each TLB entry – uniquely
identifies each process (PID) to provide address-space protection for that process
– Otherwise need to flush at every context switch
Paging Hardware With TLB
Effective Access Time
• Associative Lookup = time unit
– Can be < 10% of memory access time
• Hit ratio =
– Hit ratio – percentage of times that a page number is found in the
associative registers; ratio related to size of TLB
ptr
Shared
memory
Structure of the Page Table
• Memory requirement for page table can get huge using straight-
forward methods
– Consider a 32-bit logical address space as on modern computers
– Page size of 4 KB (212)
– Page table would have 1 million entries 220 (232 / 212)
– If each entry is 4 bytes -> 4 MB of physical address space / memory for
page table alone
• That amount of memory used to cost a lot
• Don’t want to allocate that contiguously in main memory
• Hierarchical Paging
10 10 12
• where p1 is an index into the outer page table, and p2 is the
displacement within the page of the inner page table
Two-Level Page-Table Scheme
Each divided page table
size=210 *4bytes=4KB
=Page size
d
p1
p2
Pentium
II
Address-Translation Scheme
Pentium
II
64-bit Logical Address Space
• Even two-level paging scheme not sufficient
• If page size is 4 KB (212)
– Then page table has 252 entries
– If two level scheme, inner page tables could be 210 4-byte entries
– Address would look like
inner page
outer p age page offset
p1 p2 d
42 10 12
SPARC (32 bits), Motorola 68030 support three and four level paging respectively
Hashed Page Tables
• Common in virtual address spaces > 32 bits
• Each element contains (1) the page number (2) the value of the
mapped page frame (3) a pointer to the next element
Address space
ID
Segmentation
• Memory-management scheme that supports user view
of memory
• A program is a collection of segments
– A segment is a logical unit such as:
Compiler generates the
main segments
program Loader assign the seg#
procedure
function
method
object
local
variables,
global
variables
common block
User’s View of a Program
User specifies each address
by two quantities
(a) Segment name
(b) Segment offset
4
1
3 2
4
Logical
address
space user space physical memory space
• Long term scheduler finds and allocates memory for all segments of a program
• Variable size partition scheme
Memory image
Executable file and virtual address
Symbol table
Name address
SQR 0
a.out SUM 4 Virtual
address space
Paging
view
0 Load 0
4 ADD 4
Segmentation view
<CODE, Load <ST,0
0> >
<CODE, ADD <ST,4
2> >
Segmentation Architecture
• Logical address consists of a two tuple:
<segment-number, offset>
• Segment table – maps two-dimensional logical address
to physical address;
• Each table entry has:
– base – contains the starting physical address where the
segments reside in memory
– limit – specifies the length of the segment
• Segment-table base register (STBR) points to the
segment table’s location in memory
• Segment-table length register (STLR) indicates number
of segments used by a program;
segment number s is legal if s < STLR
Example of Segmentation
Segmentation Hardware
Example of Segmentation
Segmentation Architecture
• Protection
• Protection bits associated with segments
– With each entry in segment table associate:
• validation bit = 0 illegal segment
• read/write/execute privileges
• Code sharing occurs at segment level
• Since segments vary in length, memory allocation is
a dynamic storage-allocation problem
– Long term scheduler
– First fit, best fit etc
• Fragmentation
Segmentation with Paging
Key idea:
Segments are splitted into multiple pages
Page table=220
entries
Example: The Intel Pentium
8 Segment
bytes register
Intel Pentium Segmentation
Pentium Paging Architecture
Virtual Memory
Backgroun
• d in memory to execute, but
Code needs to be
entire program rarely used
– Error code, unusual routines, large data structures
• Entire program code not needed at same time
• Consider ability to execute partially-loaded
program
– Program no longer constrained by limits of physical
memory
– programs could be larger than physical memory
– More processes can be accommodated
Virtual Memory That is
Larger Than Physical Memory
Large
virtual
space
Small memory
Classical paging
• Process P1 arrives
• Requires n pages => n frames must be
available
• Allocate n frames to the process P1
• Create page table for P1
• When we want to
execute a process, swap
in
• Instead of swap in
entire
process, load page
• Pager
Page Table When Some Pages
Are Not in Main Memory
v
v
i page table
v
….
Disk
ii
address
• During address translation, if valid–invalid bit in page table entry
is i page fault
Page Fault
• If the page in not in memory, first reference to that page will trap to
operating system:
page fault
• Page fault
– No free frame
– Terminate? swap out? replace the page?
• Page replacement – find some page in memory, not really in use, page it out
– Performance – want an algorithm which will result in minimum number of page faults
P2
Need For Page Replacement
P1
P2
PC
Basic Page Replacement
1. Find the location of the desired page on disk
2. Find a free frame:
- If there is a free frame, use it
- If there is no free frame, use a page replacement algorithm to
select a victim frame (of that process)
- Write victim frame to disk
3. Bring the desired page into the (newly) free frame; update the page
and frame tables
Note now potentially 2 page transfers for page fault – increasing Effective
memory access time
Page Replacement
5
5 6
6
Page Replacement
5 5
6
6
Evaluation
• Evaluate algorithm by running it on a particular string
of memory references (reference string) and
computing the number of page faults on that string
– String is just page numbers, not full addresses
– Repeated access to the same page does not cause a page
fault
• Trace the memory reference of a process
0100, 0432, 0101, 0612, 0102, 0104, 0101, 0611, 0102
• Page size 100B
• Reference string 1, 4, 1, 6, 1, 6
Associates a time with each frame when the page was brought
into memory
Limitation:
A variable is initialized early and constantly used
FIFO Page Replacement
3 frames (3 pages can be in memory at a time)
15 page faults
9 page faults
Least Recently Used (LRU)
Algorithm
• Use past knowledge rather than future
– Past is the proxy of future
• Replace page that has not been used in the most of the
time
• Associate time of last use with each page
0 1 2 3 4 5 0 1 2 3 4 5 0 1 2 3 4 5
LRU Approximation Algorithms
• LRU needs special hardware and still slow
• Reference bit
– With each page associate a bit, initially = 0
– When page is referenced bit set to 1
• Additional reference bit algorithm
• Second-chance algorithm
– Generally FIFO, plus hardware-provided reference bit
– Clock replacement
– If page to be replaced has
• Reference bit = 0 -> replace it
• reference bit = 1 then:
– set reference bit 0, leave page in memory (reset the time)
– replace next page, subject to same rules
Second-Chance (clock) Page-Replacement Algorithm
FIFO Example –
Program - 5 pages,
3 frames of Memory allocated
1 2 3 4 1 2 5 1 2 3
4 5
1 1 1 4 4 4 5 5 5 5 5 5
9 2 2 2 1 1 1 1 1 3 3 3
faults 3 3 3 2 2 2 2 2 4 4
FIFO Example –
Program - 5 pages,
4 frames of Memory allocated
1 2 3 4 1 2 5 1 2 3 4
5
1 1 1 4 4 4 5 5 5 5 5 5
2 2 2 1 1 1 1 1 3 3 3
3 3 3 2 2 2 2 2 4 4
10 1 1 1 1 1 1 5 5 5 5 4 4
faults 2 2 2 2 2 2 1 1 1 1 5
3 3 3 3 3 3 2 2 2 2
4 4 4 4 4 4 3 3 3
Belady's Anomaly
# of Page Faults
Number of Frames
cs431-cotter 118
Belady’s Anomaly
• This most unexpected result is known as
Belady’s anomaly – for some page-
replacement algorithms, the page fault rate
may increase as the number of allocated
frames increases
In memory
3
frames
Outside
memory
3 6 4 3 9
5 3 6 4 3
In memory
4 6
56 5 3 4
frames
5 5 6
3 5
6
Global vs. Local Allocation
• Frames are allocated to various processes
• Local replacement – each process selects from only its own set of
allocated frames
– More consistent per-process performance
– But possibly underutilized memory
Email : [email protected]