CS2606: Data Structures and Object-Oriented Development Chapter 10: Indexing

These notes cover indexing for efficiently storing and searching large files. They discuss linear indexing and tree indexing, specifically 2-3 Trees and B-Trees. B-Trees generalize 2-3 Trees: they are always height balanced, keep similar-valued records together on a disk page, and guarantee that every node is filled to at least a minimum percentage. B-Trees support efficient insertion, deletion, and key range searches. The most commonly implemented form is the B+-Tree, in which internal nodes store only keys and leaf nodes store records or pointers to records.

Course notes

CS2606: Data Structures and Object-Oriented Development

Chapter 10: Indexing

Department of Computer Science
Virginia Tech
Spring 2008
(The following notes were derived from Cliff Shaffer’s textbook and notes)
Indexing
Goals:
– Store large files
– Support multiple search keys
– Support efficient insert, delete, and
range queries
Terms (1)
Entry-sequenced file: Order records
by time of insertion.
– Search with sequential search
Index file: Organized, stores
pointers to actual records.
– Could be organized with a tree or
other data structure.
Terms (2)
Primary Key: A unique identifier for
records. May be inconvenient for
search.
Secondary Key: An alternate search
key, often not unique for each
record. Often used as the search
key.
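
To make the primary/secondary distinction concrete, here is a minimal Python sketch of a secondary index that maps a non-unique key to the primary keys of matching records. The record layout and the names `build_secondary_index`, `id`, and `city` are assumptions made for illustration; they do not come from the notes.

```python
from collections import defaultdict

def build_secondary_index(records, field):
    """Map each value of a (possibly non-unique) field to a list of primary keys."""
    index = defaultdict(list)
    for rec in records:
        index[rec[field]].append(rec["id"])  # "id" plays the role of the primary key
    return index

# Hypothetical records: "id" is unique, "city" is not.
records = [
    {"id": 101, "name": "Ada", "city": "Blacksburg"},
    {"id": 102, "name": "Ben", "city": "Roanoke"},
    {"id": 103, "name": "Cy", "city": "Blacksburg"},
]
by_city = build_secondary_index(records, "city")
```

Looking up `by_city["Blacksburg"]` yields several primary keys, which is why a secondary key is convenient for search but cannot identify a single record on its own.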
Linear Indexing
Linear index: Index file organized
as a simple sequence of
key/record pointer pairs, with key
values in sorted order.
Linear indexing is good for
searching variable-length records.
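
A linear index of this kind can be sketched in Python as two parallel sorted lists searched with binary search. This is only an illustration under assumptions: the function names are hypothetical, and list positions stand in for the byte offsets that a real index over variable-length records would store.

```python
from bisect import bisect_left

def build_linear_index(records):
    """Build sorted (key, position) pairs from an entry-sequenced list of records."""
    pairs = sorted((key, pos) for pos, (key, _payload) in enumerate(records))
    keys = [k for k, _ in pairs]
    positions = [p for _, p in pairs]
    return keys, positions

def index_search(keys, positions, key):
    """Binary-search the sorted keys; return the record position or None."""
    i = bisect_left(keys, key)
    if i < len(keys) and keys[i] == key:
        return positions[i]
    return None

# Records in insertion order (entry-sequenced); payloads may vary in length.
records = [(42, "carol"), (7, "al"), (19, "bartholomew")]
keys, positions = build_linear_index(records)
```

Because the index entries are fixed-size pairs, binary search works on the index even though the records themselves differ in length.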
Linear Indexing (2)
If the index is too large to fit in
main memory, a second-level
index might be used.
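
One possible shape for a second-level index is sketched below in Python. It assumes the first-level index is stored in fixed-size blocks and that the second level keeps the first key of each block; `block_size` and the function names are hypothetical.

```python
from bisect import bisect_left, bisect_right

def build_second_level(keys, block_size):
    """Keep the first key of each block of the sorted first-level index."""
    return [keys[i] for i in range(0, len(keys), block_size)]

def two_level_search(keys, second_level, block_size, key):
    """Pick the block via the in-memory second level, then search that one block."""
    b = bisect_right(second_level, key) - 1  # last block whose first key <= key
    if b < 0:
        return None
    start = b * block_size
    block = keys[start:start + block_size]   # stands in for a single disk read
    i = bisect_left(block, key)
    if i < len(block) and block[i] == key:
        return start + i                     # position in the first-level index
    return None

keys = [3, 8, 15, 21, 34, 55, 89]
second_level = build_second_level(keys, 3)
```

Only the small second-level list has to stay in main memory; each search then touches one block of the first-level index.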
Tree Indexing (1)
Linear index is poor for
insertion/deletion.
Tree index can efficiently support
all desired operations:
– Insert/delete
– Multiple search keys (multiple
indices)
– Key range search
Tree Indexing (2)
Difficulties when storing
tree index on disk:
– Tree must be balanced.
– Each path from root to
leaf should cover few disk
pages.
2-3 Tree (1)
A 2-3 Tree has the following
properties:
1. A node contains one or two keys
2. Every internal node has either two
children (if it contains one key) or
three children (if it contains two keys).
3. All leaves are at the same level in the
tree, so the tree is always height
balanced.

The 2-3 Tree has a search tree
property analogous to the BST.
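
The search tree property can be illustrated with a short Python sketch of 2-3 Tree search. The class `Node23` and the sample tree are assumptions made for this example; insertion and splitting are not shown.

```python
class Node23:
    """A 2-3 Tree node: one or two keys; two or three children unless it is a leaf."""
    def __init__(self, keys, children=None):
        assert 1 <= len(keys) <= 2
        self.keys = keys
        self.children = children or []  # empty list marks a leaf
        assert not self.children or len(self.children) == len(keys) + 1

def search23(node, key):
    """Return True if key is in the tree, descending one child per level."""
    while True:
        if key in node.keys:
            return True
        if not node.children:
            return False
        if key < node.keys[0]:
            node = node.children[0]      # left subtree: keys below the first key
        elif len(node.keys) == 1 or key < node.keys[1]:
            node = node.children[1]      # center subtree
        else:
            node = node.children[2]      # right subtree: keys above the second key
    
# A small two-level tree with all leaves at the same depth.
root = Node23([18, 33], [Node23([12]), Node23([23, 30]), Node23([48])])
```

As in a BST, each comparison rules out whole subtrees; the difference is that a node with two keys splits its range three ways.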
2-3 Tree (2)
The advantage of the 2-3 Tree over
the BST is that it can be updated
at low cost while staying height
balanced.
2-3 Tree Insertion (1)
2-3 Tree Insertion (2)
2-3 Tree Insertion (3)
B-Trees (1)
The B-Tree is an extension of the
2-3 Tree.
The B-Tree is now the standard file
organization for applications
requiring insertion, deletion, and
key range searches.
B-Trees (2)
1. B-Trees are always balanced.
2. B-Trees keep similar-valued records
together on a disk page, which
takes advantage of locality of
reference.
3. B-Trees guarantee that every node
in the tree will be full at least to a
certain minimum percentage. This
improves space efficiency while
reducing the typical number of disk
fetches necessary during a search
or update operation.
B-Tree Definition
A B-Tree of order m has these properties:
– The root is either a leaf or has at least two
children.
– Each node, except for the root and the
leaves, has between ⌈m/2⌉ and m children.
– All leaves are at the same level in the tree,
so the tree is always height balanced.
The size of a B-Tree node is usually selected to match
the size of a disk block.
– A B-Tree node could have hundreds of
children.
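
As a rough illustration of why a node can have hundreds of children, the Python sketch below estimates the order m that fits in one disk block. The 4096-byte block and 8-byte keys and pointers are assumed values, not figures from the notes, and the lower bound on children is read as ⌈m/2⌉.

```python
def btree_order(block_bytes, key_bytes, pointer_bytes):
    """Largest m such that m child pointers and m - 1 keys fit in one block."""
    # m * pointer_bytes + (m - 1) * key_bytes <= block_bytes
    return (block_bytes + key_bytes) // (key_bytes + pointer_bytes)

def min_children(m):
    """Minimum children of a non-root internal node, taken as ceil(m / 2)."""
    return -(-m // 2)

m = btree_order(block_bytes=4096, key_bytes=8, pointer_bytes=8)
```

Under these assumed sizes, m comes out to 256, so every non-root internal node has between 128 and 256 children.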
B-Tree Search (1)
Search in a B-Tree is a
generalization of search in a 2-3
Tree.
1. Do binary search on keys in current
node. If search key is found, then
return record. If current node is a
leaf node and key is not found, then
report an unsuccessful search.
2. Otherwise, follow the proper branch
and repeat the process.
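
The two search steps above can be sketched in Python as follows. The node layout (`BTreeNode` with parallel `keys` and `records` lists) is an assumption for illustration; on disk, each node visit would correspond to one block fetch.

```python
from bisect import bisect_left

class BTreeNode:
    """B-Tree node: sorted keys, one record per key, children empty for a leaf."""
    def __init__(self, keys, records, children=None):
        self.keys = keys
        self.records = records
        self.children = children or []

def btree_search(node, key):
    """Return the record stored under key, or None if the search fails."""
    while True:
        i = bisect_left(node.keys, key)      # step 1: binary search in this node
        if i < len(node.keys) and node.keys[i] == key:
            return node.records[i]
        if not node.children:                # leaf reached without a match
            return None
        node = node.children[i]              # step 2: follow the proper branch

leaves = [
    BTreeNode([5, 10], ["r5", "r10"]),
    BTreeNode([25, 30], ["r25", "r30"]),
    BTreeNode([45, 50], ["r45", "r50"]),
]
root = BTreeNode([20, 40], ["r20", "r40"], leaves)
```

Here `bisect_left` returns the number of keys smaller than the search key, which is also the index of the child whose range contains it.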
B+-Trees
The most commonly implemented form
of the B-Tree is the B+-Tree.
Internal nodes of the B+-Tree do not
store records -- only key values to
guide the search.
Leaf nodes store records or pointers to
records.
A leaf node may store more or fewer
records than an internal node stores
keys.
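
A minimal Python sketch of this split between internal nodes and leaves is shown below. Two details are assumptions not stated in the notes: leaves are chained through a `next` link to support range searches, and an internal key equal to the search key sends the search to the right-hand child. All class and function names are hypothetical.

```python
from bisect import bisect_left, bisect_right

class Internal:
    """Internal node: keys only, used to guide the search."""
    def __init__(self, keys, children):
        self.keys, self.children = keys, children

class Leaf:
    """Leaf node: keys with their records, plus a link to the next leaf (assumed)."""
    def __init__(self, keys, records):
        self.keys, self.records = keys, records
        self.next = None

def find_leaf(node, key):
    """Descend to the leaf whose key range contains key."""
    while isinstance(node, Internal):
        node = node.children[bisect_right(node.keys, key)]
    return node

def bplus_search(root, key):
    leaf = find_leaf(root, key)
    i = bisect_left(leaf.keys, key)
    return leaf.records[i] if i < len(leaf.keys) and leaf.keys[i] == key else None

def bplus_range(root, lo, hi):
    """Collect records with lo <= key <= hi by walking the leaf chain."""
    leaf, out = find_leaf(root, lo), []
    while leaf is not None:
        for k, r in zip(leaf.keys, leaf.records):
            if k > hi:
                return out
            if k >= lo:
                out.append(r)
        leaf = leaf.next
    return out

leaves = [Leaf(ks, [f"r{k}" for k in ks])
          for ks in ([10, 12, 15], [18, 19, 20], [33, 40, 45])]
for left, right in zip(leaves, leaves[1:]):
    left.next = right
root = Internal([18, 33], leaves)
```

Every successful search ends at a leaf, even when the key also appears in an internal node, because internal keys carry no records.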
B+-Tree Example
B+-Tree Insertion
B+-Tree Deletion (1)
B+-Tree Deletion (2)
B+-Tree Deletion (3)
B-Tree Space Analysis (1)
B+-Tree nodes are always at least half
full.
The B*-Tree splits two pages into three,
and combines three pages into two. In
this way, nodes are always at least 2/3 full.
Asymptotic cost of search, insertion, and
deletion of records from B-Trees is Θ(log
n).
– Base of the log is the (average) branching
factor of the tree.
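
To see how much the base of the logarithm matters, the Python sketch below counts the levels needed to reach n records for a given branching factor. It assumes every node has exactly `branching` children, which is an idealization; `levels_needed` is a hypothetical helper.

```python
def levels_needed(n, branching):
    """Smallest number of levels L with branching**L >= n (integer arithmetic only)."""
    levels, capacity = 1, branching
    while capacity < n:
        capacity *= branching
        levels += 1
    return levels

binary_levels = levels_needed(1_000_000, 2)    # branching factor of a binary tree
btree_levels = levels_needed(1_000_000, 100)   # branching factor of 100
```

For one million records this gives 20 levels at branching factor 2 but only 3 at branching factor 100, and each level on disk can cost a separate fetch.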
B-Tree Space Analysis (2)
Example: Consider a B+-Tree of order
100 with leaf nodes containing 100
records.
1 level B+-tree:
2 level B+-tree:
3 level B+-tree:
4 level B+-tree:
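
The record counts for each tree height can be worked out from the definition. The Python sketch below is one way to do so, assuming that non-root internal nodes have between 50 and 100 children, that a non-leaf root has at least two children, that leaves are at least half full, and that a tree consisting of a single leaf may be empty. The function names are hypothetical.

```python
def max_records(levels, order=100, leaf_capacity=100):
    """Every internal node has `order` children and every leaf is full."""
    return order ** (levels - 1) * leaf_capacity

def min_records(levels, order=100, leaf_capacity=100):
    """Root has 2 children, other internal nodes and leaves are half full."""
    if levels == 1:
        return 0                      # a lone root leaf may hold no records
    half_order = -(-order // 2)       # ceil(order / 2)
    half_leaf = -(-leaf_capacity // 2)
    return 2 * half_order ** (levels - 2) * half_leaf

for levels in range(1, 5):
    print(levels, min_records(levels), max_records(levels))
```

Under these assumptions the ranges are 0 to 100 records for one level, 100 to 10,000 for two, 5,000 to 1,000,000 for three, and 250,000 to 100,000,000 for four.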

Ways to reduce the number of disk
fetches:
– Keep the upper levels in memory.
– Manage B+-Tree pages with a buffer pool.
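
A buffer pool for tree pages might look like the Python sketch below. The least-recently-used replacement policy, the `BufferPool` class, and the `read_page` callback are all assumptions made for illustration; the notes do not specify a policy.

```python
from collections import OrderedDict

class BufferPool:
    """Keep up to `capacity` pages in memory, evicting the least recently used."""
    def __init__(self, capacity, read_page):
        self.capacity = capacity
        self.read_page = read_page    # function that fetches a page from disk
        self.pages = OrderedDict()    # page_id -> page, oldest access first
        self.disk_fetches = 0

    def get(self, page_id):
        if page_id in self.pages:
            self.pages.move_to_end(page_id)   # mark as most recently used
            return self.pages[page_id]
        self.disk_fetches += 1
        page = self.read_page(page_id)
        self.pages[page_id] = page
        if len(self.pages) > self.capacity:
            self.pages.popitem(last=False)    # evict the least recently used page
        return page

# Page 0 plays the role of the root; pages 1 and 2 are leaves.
pool = BufferPool(capacity=2, read_page=lambda pid: f"page-{pid}")
for pid in [0, 1, 0, 2, 0]:
    pool.get(pid)
```

Because every search starts at the root, the root page is touched most often and tends to stay in the pool, which has the same effect as keeping the upper levels in memory.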
