0% found this document useful (0 votes)

16 views21 pages

File Organization-Lec5

Uploaded by

Pc Pc

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views21 pages

File Organization-Lec5

Uploaded by

Pc Pc

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

CSW241-File Organization and Processing

Secondary Storage Devices: Magnetic Disks

Dr. Riham Moharam

Faculty of Information Technology & Computer Science
Sinai University
North Sinai, Egypt
Outline
➢ Data Organization
➢ Organizing Tracks by Sector
➢ Organizing Tracks by Block
➢ Disk Layout Strategies
➢ Non Data Overhead
➢ The Cost of a Disk Access
➢ Disk as Bottleneck

2
Data Organization
➢ There are two basic ways to organize data on a disk by:
• Sector and
• User- defined block.

3
Organizing Tracks by Sector
➢ The simplest view, is that sectors are adjacent, fixed-sized segments of a track that
happen to hold a file.

➢ This is often a perfectly adequate way to view a file logically, but it may not be a
good way to store sectors physically.

4
Organizing Tracks by Sector
➢ The file manager is the part of the operating system responsible for managing files.
• The file manager maps the logical parts of the file into their physical location.
• A cluster is a fixed number of contiguous sectors.
• The file manager allocates an integer number of clusters to a file. An example: Sector
size: 512 bytes, Cluster size: 2 sectors.
• If a file contains 10 bytes, a cluster is allocated (1024 bytes).
• There may be unused space in the last cluster of a file. This unused space contributes to
internal fragmentation.
• The clusters are also not usually stored contiguously on the disk, causing external
fragmentation.

5
Organizing Tracks by Sector
➢ Clusters are good since they improve sequential access: reading bytes sequentially
from a cluster can be done in one revolution, seeking only once.
➢ The file manager maintains a file allocation table (FAT) containing for each cluster
in the file and its location in disk.
➢ An extent is a group of contiguous clusters. If file is stored in a single extent then
seeking is done only once.
➢ If there is not enough contiguous clusters to hold a file, the file is divided into 2 or
more extents.

6
Organizing Tracks by Sector

7
Fragmentation
➢ Due to records not fitting exactly in a sector.
• Example: Record size = 200 bytes, sector size = 512 bytes
• To avoid that a record span 2 sectors, we can only store 2 records in this sector (112
bytes go unused per sector)
• The alternative is to let a record span two sectors, but in this case two sectors must be
read when we need to access this record).
➢ Due to the use of clusters.
• If the file size is not multiple of the cluster size, then the last cluster will be partially
used.

8
How to Chose Cluster Size
➢ Some OS allow the system administrator to choose the cluster size.
➢ When to use large cluster size?
• When disks contain large files likely to be processed sequentially.
• Example: Updates in a master file of bank accounts (in batch mode)

➢ What about small cluster size?

• When disks contain small files and/or files likely to be accessed randomly.
• Example : online updates for airline reservation

9
Organizing Tracks by Block
➢ Rather than being divided into sectors, the disk tracks may be divided into user-
defined blocks.

➢ When the data on a track is organized by block, this usually means that the amount
of data transferred in a single I/O operation can vary depending on the needs of the
software designer (not the hardware).

➢ Blocks can normally be either fixed or variable in length, depending on the

requirements of the file designer and the capabilities of the operating system.

➢ A block is usually organized to contain an integral number of logical records.

10
Organizing Tracks by Block
➢ The blocking factor indicates the number of records that are to be stored in each
block in a file.
➢ Blocks don’t have the sector-spanning and fragmentation problem of sectors since
they vary in size to fit the logical organization of the data.
➢ A block typically contains subblocks.
➢ Data subblock: contains the records in this block.
➢ Each block is usually accompanied by subblocks:
• Key-subblock:
• The key for the last record in the data subblock (disk controller can search for key without
loading it in main memory)
• Count-subblock:
• The number of bytes in a block.

11
Non-Data Overhead
➢ Amount of space used for extra stuff other than data.

➢ Sector-Addressable Disks
• At the beginning of each sector some info is stored, such as sector address, track
address, condition (if sector is defective);
• There is some gap between sectors.

➢ Block-Organized Disks
• Subblocks and interblock gaps is part of the extra stuff; more nondata overhead than
with sector-addressing.

12
Non-Data Overhead
➢ Whether using a block or a sector organization, some space on the disk is taken up
by non-data overhead. i.e., information stored on the disk during pre-formatting.

➢ On sector-addressable disks, pre-formatting involves storing, at the beginning of

each sector, sector address, track address and condition (usable or defective).

➢ On block-organized disks, subblock + interblock gaps have to be provided with

every block. The relative amount of non-data space necessary for a block scheme is
higher than for a sector-scheme.

13
Non-Data Overhead
➢ The greater the block-size, the greater potential amount of internal track
fragmentation.

➢ The flexibility introduced by the use of blocks rather than sectors can save time
since it lets the programmer determine, to a large extent, how the data is to be
organized physically on disk.

14
Example
➢ Disk characteristics
• Block-addressable Disk Drive
• Size of track = 20.000 bytes
• Nondata overhead per block = 300 bytes
➢ File Characteristics
• Record size = 100 bytes
➢ How many records can be stored per track for the following blocking factors?
• 1. Block factor = 10
• 2. Block factor = 60

15
Solution
➢ Case 1:
• Blocking factor is 10
𝟐𝟎𝟎𝟎𝟎
• Size of data subblocks = 1000 = 𝟏𝟓. 𝟑𝟖 = 𝟏𝟓
𝟏𝟑𝟎𝟎
• Number of blocks that can fit in a track =
• Number of records per track = 150 records

➢ Case 2:
• Blocking factor is 60 𝟐𝟎𝟎𝟎𝟎
= 𝟑. 𝟏𝟕 = 𝟑
• Size of data subblocks = 6000 𝟔𝟑𝟎𝟎
• Number of blocks that can fit in a track =
• Number of records per track = 180 records

16
The Cost of a Disk Access
➢ Seek Time is the time required to move the access arm to the correct cylinder.
• More costly in a multiuser environment.

➢ Rotational Delay is the time it takes for the disk to rotate so the sector we want is
under the read/write head.

➢ Transfer Time
• =(# 𝒐𝒇 𝒃𝒚𝒕𝒆𝒔 𝒕𝒓𝒂𝒏𝒔𝒇𝒆𝒓𝒓𝒆𝒅) / (# 𝒐𝒇 𝒃𝒚𝒕𝒆𝒔 𝒐𝒏 𝒂 𝒕𝒓𝒂𝒄𝒌) × 𝒓𝒐𝒕𝒂𝒕𝒊𝒐𝒏 𝒕𝒊𝒎𝒆
• 63 sectors per track

17
Disk as Bottleneck
➢ Processes are often Disk-Bound, i.e., the network and the CPU often have to wait
inordinate lengths of time for the disk to transmit data.

➢ When a program reads a byte from the disk, the operating system locates the
surface, track and sector containing that byte, and reads the entire sector into a
special area in main memory called buffer.

18
Various Techniques to Solve this Problem
1. Multiprocessing: (CPU works on other jobs while waiting for the disk), but:
• Multiprocessing is not always available.
• The process cannot afford so much time waiting for the disk.
2. Disk Striping:
• Putting different blocks of the file in different drives, then letting the separate drives
deliver parts of the file to the network simultaneously.
• Independent processes accessing the same file may not interfere with each other
(parallelism)
3. RAID (Redundant Array of Independent Disks).
4. RAM Disk (Memory Disk): Simulate the behavior of the mechanical disk in
memory.

19
Various Techniques to Solve this Problem
5. Disk Cache:
• Large block of memory configured to contain pages of data from a disk.
• When data is requested from disk, first the cache is checked.
• If data is not there (miss) the disk is accessed.
• Differs from the Cache memory which does the same types of performance-enhancing
operations with respect to memory.

20
RAID (Redundant Array of Independent Disks)
➢ Disk Array: Arrangement of several disks that gives abstraction of a single, large
disk. (One Disk Controller)
➢ Goals: Increase performance and reliability.
➢ Two main techniques:
• Data striping: Data is partitioned; size of a partition is called the striping unit.
Partitions are distributed over several disks. For an 8-drive RAID, for example, the
controller receives a single block to write and breaks it into eight pieces, the first piece is
written to a particular track of the first disk, and so on. Reading is done the same way,
all the pieces are reassembled in cache, and cache content is transmitted back through
the I/O channels.
• Redundancy: Same Information is replicated in more disks.
• More disks more failures.
• Redundant information allows reconstruction of data if a disk fails.

BTP Administration Guide
No ratings yet
BTP Administration Guide
270 pages
Chapter 3 Secondary Storage and System Software
No ratings yet
Chapter 3 Secondary Storage and System Software
24 pages
Accessing The Data.: Amogh P K
No ratings yet
Accessing The Data.: Amogh P K
21 pages
Secondary Storage Devices (1) :: Magnetic Disks
No ratings yet
Secondary Storage Devices (1) :: Magnetic Disks
56 pages
Secondary Storage Devices: Magnetic Disks
No ratings yet
Secondary Storage Devices: Magnetic Disks
34 pages
L02
No ratings yet
L02
31 pages
Secondary Storage Introduction
No ratings yet
Secondary Storage Introduction
82 pages
Secondary Storage Devices
100% (1)
Secondary Storage Devices
75 pages
8 DataStorageIndexingStructures Updated
No ratings yet
8 DataStorageIndexingStructures Updated
57 pages
File Organization (1)
No ratings yet
File Organization (1)
93 pages
L2.1 File Organization
No ratings yet
L2.1 File Organization
56 pages
Disk Storage, Basic File Structures, and Hashing: Dr. Hasnaa Raafat Dr. Nora Zakie
No ratings yet
Disk Storage, Basic File Structures, and Hashing: Dr. Hasnaa Raafat Dr. Nora Zakie
31 pages
Accessing The Data.: Dept. of Ise, Gsssietw
No ratings yet
Accessing The Data.: Dept. of Ise, Gsssietw
22 pages
Chapter 17: Disk Storage, Basic File Structures, and Hashing
No ratings yet
Chapter 17: Disk Storage, Basic File Structures, and Hashing
54 pages
7_DataStorageIndexingStructures
No ratings yet
7_DataStorageIndexingStructures
83 pages
Data Storage and Access Methods: Min Song IS698
No ratings yet
Data Storage and Access Methods: Min Song IS698
50 pages
Chapter 10 - External Storage - Part 2
No ratings yet
Chapter 10 - External Storage - Part 2
41 pages
6 Disks 4
No ratings yet
6 Disks 4
3 pages
Lec02 secondaryStorageDevices
No ratings yet
Lec02 secondaryStorageDevices
49 pages
VND - Ms Powerpoint&Rendition 1
No ratings yet
VND - Ms Powerpoint&Rendition 1
118 pages
02 Storage (1)
No ratings yet
02 Storage (1)
104 pages
File Organization-Lec4
No ratings yet
File Organization-Lec4
21 pages
ch1
No ratings yet
ch1
39 pages
Lecture 01 - File Storage - Part 1
No ratings yet
Lecture 01 - File Storage - Part 1
48 pages
01 Introduction To Information Technology1
No ratings yet
01 Introduction To Information Technology1
74 pages
Elmasri 6e Ch17 Week2 HW DiskStorage
No ratings yet
Elmasri 6e Ch17 Week2 HW DiskStorage
96 pages
File Management
No ratings yet
File Management
91 pages
6 Data Storage and Querying
100% (1)
6 Data Storage and Querying
58 pages
m5 Main m6 Main 1 Merged
No ratings yet
m5 Main m6 Main 1 Merged
95 pages
Disk Drives: Storage Storage
No ratings yet
Disk Drives: Storage Storage
5 pages
File Design Alternatives
100% (1)
File Design Alternatives
5 pages
Chapter 6- - Copy
No ratings yet
Chapter 6- - Copy
62 pages
Chapter 13:disk Storage and Basic File Structures
No ratings yet
Chapter 13:disk Storage and Basic File Structures
31 pages
File System and Secondary Storage
No ratings yet
File System and Secondary Storage
40 pages
18CSC205J Operating Systems Unit 5 - New
No ratings yet
18CSC205J Operating Systems Unit 5 - New
140 pages
Storage and File Structure
No ratings yet
Storage and File Structure
55 pages
Lecture 11-File Management
No ratings yet
Lecture 11-File Management
52 pages
Chapter 4 - Storage Final
No ratings yet
Chapter 4 - Storage Final
22 pages
FULL
No ratings yet
FULL
449 pages
The Bare Basics: Storing Data On Disks and Files
No ratings yet
The Bare Basics: Storing Data On Disks and Files
33 pages
Disks, Memories & Buffer Management: "The Two Offices of Memory Are Collection and Distribution." - Samuel Johnson
No ratings yet
Disks, Memories & Buffer Management: "The Two Offices of Memory Are Collection and Distribution." - Samuel Johnson
28 pages
Module 2: Storing Data: Disks and Files 2.1 Memory Hierarchy
No ratings yet
Module 2: Storing Data: Disks and Files 2.1 Memory Hierarchy
16 pages
disk_management
No ratings yet
disk_management
46 pages
05 - Stallings CH6 External Memory
No ratings yet
05 - Stallings CH6 External Memory
37 pages
Chapter 5-Record Storage and Primary File Organization
100% (1)
Chapter 5-Record Storage and Primary File Organization
64 pages
Storage and File Structure
No ratings yet
Storage and File Structure
60 pages
Lecture 15
No ratings yet
Lecture 15
19 pages
Raid Levels
No ratings yet
Raid Levels
47 pages
Chapter 6-
No ratings yet
Chapter 6-
62 pages
File
No ratings yet
File
37 pages
DBMS Storage and Indexing
No ratings yet
DBMS Storage and Indexing
90 pages
Database Management System Chapter 1
No ratings yet
Database Management System Chapter 1
53 pages
CH 6 Disk
No ratings yet
CH 6 Disk
40 pages
Chapter 5
No ratings yet
Chapter 5
53 pages
Introduction To File Structures: CENG 351 1
No ratings yet
Introduction To File Structures: CENG 351 1
78 pages
I/O Management and Disk Scheduling
No ratings yet
I/O Management and Disk Scheduling
27 pages
Ssos - U5
No ratings yet
Ssos - U5
39 pages
OS Unit-IV - File Organization and Disk Scheduling
No ratings yet
OS Unit-IV - File Organization and Disk Scheduling
36 pages
CST 204 Dbms Module - 3 Physical Data Organization
No ratings yet
CST 204 Dbms Module - 3 Physical Data Organization
93 pages
FreeBSD Mastery: Storage Essentials: IT Mastery, #4
From Everand
FreeBSD Mastery: Storage Essentials: IT Mastery, #4
Michael W. Lucas
No ratings yet
LPIC-3 Exam 306-300 Mastery: 500 Practice Questions on High Availability & Storage Clusters
From Everand
LPIC-3 Exam 306-300 Mastery: 500 Practice Questions on High Availability & Storage Clusters
Steve Brown
No ratings yet
HS OperationGuide
No ratings yet
HS OperationGuide
22 pages
UNIT 7 - Atomic Transactions
No ratings yet
UNIT 7 - Atomic Transactions
30 pages
Download Full Visual Studio Code Distilled: Evolved Code Editing for Windows, macOS, and Linux 2nd ed. Alessandro Del Sole PDF All Chapters
100% (3)
Download Full Visual Studio Code Distilled: Evolved Code Editing for Windows, macOS, and Linux 2nd ed. Alessandro Del Sole PDF All Chapters
21 pages
Final Review Worksheet: CMSC 201 Spring 2019 Name
No ratings yet
Final Review Worksheet: CMSC 201 Spring 2019 Name
5 pages
Resumen Cap7b
No ratings yet
Resumen Cap7b
5 pages
IVX_Useful CLI commands for Trellix Intelligent Virtual Execution
No ratings yet
IVX_Useful CLI commands for Trellix Intelligent Virtual Execution
4 pages
Number System 2
No ratings yet
Number System 2
16 pages
Modern Batch Scripting PDF
No ratings yet
Modern Batch Scripting PDF
64 pages
Microsoft Word Shortcut Keys
100% (2)
Microsoft Word Shortcut Keys
9 pages
MT6070iH 8070ih MT607i Installation 101028
No ratings yet
MT6070iH 8070ih MT607i Installation 101028
8 pages
The Joy of Computing Using Python: Assignment 3
No ratings yet
The Joy of Computing Using Python: Assignment 3
5 pages
Mcgraw-Hill Technology Education
No ratings yet
Mcgraw-Hill Technology Education
24 pages
Pulley
No ratings yet
Pulley
7 pages
Keyence User Manual
No ratings yet
Keyence User Manual
240 pages
CYS 506 - Lab5
No ratings yet
CYS 506 - Lab5
113 pages
UsbFix Report
No ratings yet
UsbFix Report
97 pages
HANA Configuration Parameters 1.00.70+
100% (1)
HANA Configuration Parameters 1.00.70+
65 pages
Interactive Briefing QA - Infor LN Report Designer - Design Distribute and Store
No ratings yet
Interactive Briefing QA - Infor LN Report Designer - Design Distribute and Store
6 pages
PHP Tutorial - Learn PHP
No ratings yet
PHP Tutorial - Learn PHP
67 pages
Okok
No ratings yet
Okok
2 pages
SAP PM Overview
No ratings yet
SAP PM Overview
62 pages
Lifecycle and States of a Thread in Java
No ratings yet
Lifecycle and States of a Thread in Java
8 pages
304TX DataSheet
No ratings yet
304TX DataSheet
2 pages
2844.flashing The XDS110 FW
No ratings yet
2844.flashing The XDS110 FW
3 pages
How To Download For Offline Use
No ratings yet
How To Download For Offline Use
4 pages
Tben S2 4iol
No ratings yet
Tben S2 4iol
132 pages
HW1SolSp25
No ratings yet
HW1SolSp25
11 pages
Openedge 10 Availability Guide Jan17
No ratings yet
Openedge 10 Availability Guide Jan17
24 pages
Mag 254-255-256
No ratings yet
Mag 254-255-256
8 pages

File Organization-Lec5

Uploaded by

File Organization-Lec5

Uploaded by

CSW241-File Organization and Processing

Secondary Storage Devices: Magnetic Disks

Dr. Riham Moharam

➢ What about small cluster size?

➢ Blocks can normally be either fixed or variable in length, depending on the

➢ A block is usually organized to contain an integral number of logical records.

➢ On sector-addressable disks, pre-formatting involves storing, at the beginning of

➢ On block-organized disks, subblock + interblock gaps have to be provided with

You might also like