0% found this document useful (0 votes)

46 views21 pages

B+-Trees: Adapted From Mike Franklin

This document describes B+ trees, which are a data structure used to store indexed data in databases. B+ trees allow for efficient searching, insertion, and deletion operations that take logarithmic time. They maintain a balanced tree structure where internal nodes can have a variable number of child nodes between a minimum and maximum threshold. The document provides examples of operations like searching, inserting, deleting on a sample B+ tree to demonstrate how the tree structure is updated. It also discusses properties of B+ trees like order, fill factor, and how they are implemented in practice.

Uploaded by

Rupali Misri

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

46 views21 pages

B+-Trees: Adapted From Mike Franklin

Uploaded by

Rupali Misri

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 21

B+-Trees

Adapted from Mike Franklin

Example Tree Index

Index entries:<search key value, page id> they direct search for data entries in leaves.
Example where each node can hold 2 entries;

Root
40

10*

15*

20*

27*

33*

37*

40*

46*

51*

55*

63*

97*

ISAM
Indexed Sequential Access Method Similar to what we discussed in the last class
Root Index Pages
40

Primary Leaf Pages

10* 15* 20* 27* 33* 37* 40* 46* 51* 55*

63*

97*

Overflow
Pages

23*

48*

41*

42*

Example B+ Tree
Search begins at root, and key comparisons direct it to a leaf. Search for 5*, 15*, all data entries >= 24* ...

Root
13 17 24 30

14* 16*

19* 20* 22*

24* 27* 29*

33* 34* 38* 39*

Based on the search for 15*, we know it is not in the tree!

B+ Tree - Properties
Balanced Every node except root must be at least full. Order: the minimum number of keys/pointers in a non-leaf node Fanout of a node: the number of pointers out of the node

B+ Trees in Practice
Typical order: 100. Typical fill-factor: 67%.
average fanout = 133

Typical capacities:
Height 3: 1333 = 2,352,637 entries Height 4: 1334 = 312,900,700 entries

Can often hold top levels in buffer pool:

Level 1 = 1 page = 8 Kbytes Level 2 = 133 pages = 1 Mbyte Level 3 = 17,689 pages = 133 MBytes

B+ Trees: Summary
Searching:
logd(n) Where d is the order, and n is the number of entries

Insertion:
Find the leaf to insert into If full, split the node, and adjust index accordingly Similar cost as searching

Deletion
Find the leaf node Delete May not remain half-full; must adjust the index accordingly

Insert 23*
Root
13 17 24 30

14* 16*

19* 20* 22*

24* 27* 29*

33* 34* 38* 39*

No splitting required.
Root
13 17 24

14* 16*

19* 20* 22* 23*

24* 27* 29*

33* 34* 38* 39*

Example B+ Tree - Inserting 8*

Root Root
13 17 17 24 30

2* 2*

3* 3*

14* 16* 7* 8*

14* 16*

24* 27* 29* 19* 20* 22* 19* 20* 22* 24* 27* 29*

33* 34* 38* 39* 33* 34* 38* 39*

Notice that root was split, leading to increase in height. In this example, we can avoid split by re-distributing entries; however, this is usually not done in practice.

Data vs. Index Page Split

(from previous example of inserting 8)
Observe how minimum occupancy is guaranteed in both leaf and index pg splits. Note difference between copy-up and push-up; be sure you understand the reasons for this. Data Page Split
2* 3* 5* 7* 8*

Entry to be inserted in parent node. (Note that 5 is copied up and s continues to appear in the leaf.)

13 17

Index Page Split

Entry to be inserted in parent node. (Note that 17 is pushed up and only appears once in the index. Contrast this with a leaf split.)

Delete 19*
Root
17

Root
5 13 13 2* 3* 5* 7* 8* 17 24 30 33* 34* 38* 39* 24 30

14* 16*

19* 20* 22*

24* 27* 29*

14* 16*

19* 20* 22*

24* 27* 29*

33* 34* 38* 39*

Root
17

7* 8*

14* 16*

20* 22*

24* 27* 29*

33* 34* 38* 39*

Delete 20* ...

Root
17

7* 8*

14* 16*

20* 22*

24* 27* 29*

33* 34* 38* 39*

Root
17

7* 8*

14* 16*

22* 24*

27* 29*

33* 34* 38* 39*

Delete 19* and 20* ...

Deleting 19* is easy. Deleting 20* is done with re-distribution. Notice how middle key is copied up. Further deleting 24* results in more drastic changes

Delete 24* ...

Root
17

7* 8*

14* 16*

22* 24*

27* 29*

33* 34* 38* 39*

Root
17

No redistribution from neighbors possible

7* 8*

14* 16*

22*

27* 29*

33* 34* 38* 39*

Deleting 24*
Must merge. Observe `toss of index entry (on right), and `pull down of index entry (below).
22* 27* 29* 30

33*

34*

38*

39*

Root
5 13

14* 16*

22* 27* 29*

33* 34* 38* 39*

Example of Non-leaf Redistribution

Tree is shown below during deletion of 24*. (What could be a possible initial tree?) In contrast to previous example, can re-distribute entry from left child of root to right child.
Root
22

2* 3*

5* 7* 8*

14* 16*

17* 18*

20* 21*

22* 27* 29*

33* 34* 38* 39*

After Re-distribution
Intuitively, entries are re-distributed by `pushing through the splitting entry in the parent node. It suffices to re-distribute index entry with key 20; weve redistributed 17 as well for illustration.
Root
17

2* 3*

5* 7* 8*

14* 16*

17* 18*

20* 21*

22* 27* 29*

33* 34* 38* 39*

Primary vs Secondary Index

Note: We were assuming the data items were in sorted order
This is called primary index

Secondary index:
Built on an attribute that the file is not sorted on.

A Secondary B+-Tree index

Root
17

14 16

17 18

20 21

22 27 29

33 34 38 39

2* 16* 5* 39*

Primary vs Secondary Index

Note: We were assuming the data items were in sorted order
This is called primary index

Secondary index:
Built on an attribute that the file is not sorted on.

Can have many different indexes on the same file.

More
Hash-based Indexes
Static Hashing Extendible Hashing
Read on your own.

Linear Hashing

Grid-files R-Trees etc

B - Trees
No ratings yet
B - Trees
19 pages
Tree-Structured Indexes: R & G Chapter 9
No ratings yet
Tree-Structured Indexes: R & G Chapter 9
34 pages
Multilevel Indexing and B+ Trees
No ratings yet
Multilevel Indexing and B+ Trees
33 pages
Find All Students With Gpa 3.0'': Can Do Binary Search On (Smaller) Index File!
No ratings yet
Find All Students With Gpa 3.0'': Can Do Binary Search On (Smaller) Index File!
42 pages
B Tree
No ratings yet
B Tree
53 pages
Lec 8
No ratings yet
Lec 8
30 pages
Btrees Animated
No ratings yet
Btrees Animated
77 pages
5b Tree Indexes
No ratings yet
5b Tree Indexes
41 pages
B-Trees DS
No ratings yet
B-Trees DS
28 pages
Lec7 - B-Trees
No ratings yet
Lec7 - B-Trees
27 pages
Tree-Structured Indexes: Comp 521 - Files and Databases Fall 2010 1
No ratings yet
Tree-Structured Indexes: Comp 521 - Files and Databases Fall 2010 1
27 pages
Unit-5 B+Trees & Hashing
No ratings yet
Unit-5 B+Trees & Hashing
37 pages
Tutorial 10 Indexing
No ratings yet
Tutorial 10 Indexing
36 pages
Ads 2 Part 3
No ratings yet
Ads 2 Part 3
60 pages
Unit V
No ratings yet
Unit V
55 pages
DM Module-3
No ratings yet
DM Module-3
60 pages
CS143: Index: Basic Problem Random-Order File
No ratings yet
CS143: Index: Basic Problem Random-Order File
12 pages
Blink
No ratings yet
Blink
50 pages
Adb A1 B21it083
No ratings yet
Adb A1 B21it083
15 pages
B Trees
No ratings yet
B Trees
27 pages
B+ Tree Index Structure Guide
No ratings yet
B+ Tree Index Structure Guide
9 pages
B+ Trees and ISAM Indexing
No ratings yet
B+ Trees and ISAM Indexing
18 pages
Chapter 7 - Indexing
No ratings yet
Chapter 7 - Indexing
94 pages
B Trees
No ratings yet
B Trees
51 pages
Database Indexing Techniques
No ratings yet
Database Indexing Techniques
50 pages
CPS216: Data-Intensive Computing Systems
No ratings yet
CPS216: Data-Intensive Computing Systems
70 pages
Unit 5
No ratings yet
Unit 5
99 pages
Indexing
No ratings yet
Indexing
77 pages
B Trees and B Trees
No ratings yet
B Trees and B Trees
24 pages
Data Structure Lecture 7 Tree
No ratings yet
Data Structure Lecture 7 Tree
49 pages
B+ Tree Indexing Explained
No ratings yet
B+ Tree Indexing Explained
46 pages
Tree Structured Indexing: Dr. Hari Om Gupta Professor, Department of Electrical Engineering IIT Roorkee
No ratings yet
Tree Structured Indexing: Dr. Hari Om Gupta Professor, Department of Electrical Engineering IIT Roorkee
27 pages
Indexing: Data Structure and Algorithm Analysis
No ratings yet
Indexing: Data Structure and Algorithm Analysis
22 pages
Hash Tree Index
No ratings yet
Hash Tree Index
44 pages
CH 13
No ratings yet
CH 13
34 pages
Tree-Structured Indexes: Computer Science Department Columbia University
No ratings yet
Tree-Structured Indexes: Computer Science Department Columbia University
13 pages
Lecture 11 - B Trees
No ratings yet
Lecture 11 - B Trees
39 pages
B Tree: Muhammad Haris Department of Computer Science M.haris@nu - Edu.pk
No ratings yet
B Tree: Muhammad Haris Department of Computer Science M.haris@nu - Edu.pk
27 pages
Lecture 13 Btree
No ratings yet
Lecture 13 Btree
50 pages
Indexing
No ratings yet
Indexing
56 pages
n3 BTrees
No ratings yet
n3 BTrees
14 pages
B-Trees: Definition, Properties, and Operations
No ratings yet
B-Trees: Definition, Properties, and Operations
21 pages
Data Structures Using C, 2e Jhalak Dutta
No ratings yet
Data Structures Using C, 2e Jhalak Dutta
16 pages
Prefix B-Trees: Rudolf Bayer and Karl Unterauer Technische Universitiit Miinchen
No ratings yet
Prefix B-Trees: Rudolf Bayer and Karl Unterauer Technische Universitiit Miinchen
16 pages
Ch10 Tree Index-95
No ratings yet
Ch10 Tree Index-95
30 pages
B Trees
No ratings yet
B Trees
31 pages
B-Tree Resume
No ratings yet
B-Tree Resume
4 pages
CSE 301 Lecture-8-Indexing WT
No ratings yet
CSE 301 Lecture-8-Indexing WT
31 pages
B+-Trees: Indexing and Operations
No ratings yet
B+-Trees: Indexing and Operations
46 pages
CNG351 Lecture 12 B
No ratings yet
CNG351 Lecture 12 B
34 pages
Hafta.
No ratings yet
Hafta.
60 pages
CNG351 Lecture 12 B
No ratings yet
CNG351 Lecture 12 B
34 pages
20mca14c U5
No ratings yet
20mca14c U5
26 pages
2-3 Trees Tyutorial
No ratings yet
2-3 Trees Tyutorial
19 pages
Lec 24
No ratings yet
Lec 24
27 pages
CSE 544: Lecture 11 Storing Data, Indexes: Monday, 5/1/2006
No ratings yet
CSE 544: Lecture 11 Storing Data, Indexes: Monday, 5/1/2006
52 pages

B+-Trees: Adapted From Mike Franklin

Uploaded by

B+-Trees: Adapted From Mike Franklin

Uploaded by

B+-Trees

Adapted from Mike Franklin

Example Tree Index

Primary Leaf Pages

19* 20* 22*

24* 27* 29*

33* 34* 38* 39*

Based on the search for 15*, we know it is not in the tree!

Can often hold top levels in buffer pool:

19* 20* 22*

24* 27* 29*

33* 34* 38* 39*

19* 20* 22* 23*

24* 27* 29*

33* 34* 38* 39*

Example B+ Tree - Inserting 8*

33* 34* 38* 39* 33* 34* 38* 39*

Data vs. Index Page Split

Index Page Split

19* 20* 22*

24* 27* 29*

19* 20* 22*

24* 27* 29*

33* 34* 38* 39*

24* 27* 29*

33* 34* 38* 39*

Delete 20* ...

24* 27* 29*

33* 34* 38* 39*

33* 34* 38* 39*

Delete 19* and 20* ...

Delete 24* ...

33* 34* 38* 39*

No redistribution from neighbors possible

33* 34* 38* 39*

22* 27* 29*

33* 34* 38* 39*

Example of Non-leaf Redistribution

22* 27* 29*

33* 34* 38* 39*

22* 27* 29*

33* 34* 38* 39*

Primary vs Secondary Index

A Secondary B+-Tree index

Primary vs Secondary Index

Can have many different indexes on the same file.

Grid-files R-Trees etc

You might also like