0% found this document useful (0 votes)

243 views

Collision Resolution Techniques

This document discusses collision resolution techniques for hash tables using open addressing, including quadratic probing, double hashing, and rehashing. It provides examples and pseudocode for implementing these techniques. Specifically, it covers: 1) Quadratic probing and double hashing algorithms to handle collisions in open addressing hash tables by probing to subsequent index locations using quadratic or secondary hash functions. 2) How double hashing eliminates both primary and secondary clustering through the use of a second hash function. 3) Rehashing the table by doubling the size and reinserting all elements when the table becomes too full. 4) Pseudocode for implementing open addressing hash tables using these collision resolution techniques.

Uploaded by

vvsprasad

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

243 views

Collision Resolution Techniques

Uploaded by

vvsprasad

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

Collision Resolution: Open Addressing

• Quadratic Probing

• Double Hashing

• Rehashing

• Algorithms for:
– insert
– find
– withdraw

1
Open Addressing: Quadratic Probing
• Quadratic probing eliminates primary clusters.
• c(i) is a quadratic function in i of the form c(i) = a*i2 + b*i. Usually c(i) is chosen
as:
c(i) = i2 for i = 0, 1, . . . , tableSize – 1
or
c(i) = ±i2 for i = 0, 1, . . . , (tableSize – 1) / 2

• The probe sequences are then given by:

hi(key) = [h(key) + i2] % tableSize for i = 0, 1, . . . , tableSize – 1
or
hi(key) = [h(key) ± i2] % tableSize for i = 0, 1, . . . , (tableSize – 1) / 2

• Note for Quadratic Probing:

¾ Hashtable size should not be an even number; otherwise Property 2 will not be
satisfied.
¾ Ideally, table size should be a prime of the form 4j+3, where j is an integer. This
choice of table size guarantees Property 2. 2
Quadratic Probing (cont’d)
• Example: Load the keys 23, 13, 21, 14, 7, 8, and 15, in this order,
in a hash table of size 7 using quadratic probing with c(i) = ±i2 and
the hash function: h(key) = key % 7
• The required probe sequences are given by:
hi(key) = (h(key) ± i2) % 7 i = 0, 1, 2, 3

3
Quadratic Probing (cont’d)
h0(23) = (23 % 7) % 7 = 2 hi(key) = (h(key) ± i2) % 7 i = 0, 1, 2, 3
h0(13) = (13 % 7) % 7 = 6
h0(21) = (21 % 7) % 7 = 0
h0(14) = (14 % 7) % 7 = 0 collision
0 O 21
h1(14) = (0 + 12) % 7 = 1
h0(7) = (7 % 7) % 7 = 0 collision
h1(7) = (0 + 12) % 7 = 1 collision 1 O 14
h-1(7) = (0 - 12) % 7 = -1
NORMALIZE: (-1 + 7) % 7 = 6 collision 2 O 23
h2(7) = (0 + 22) % 7 = 4
h0(8) = (8 % 7)%7 = 1 collision 3 O 15
h1(8) = (1 + 12) % 7 = 2 collision
h-1(8) = (1 - 12) % 7 = 0 collision 4 O 7
h2(8) = (1 + 22) % 7 = 5
h0(15) = (15 % 7)%7 = 1 collision 5 O 8
h1(15) = (1 + 12) % 7 = 2 collision
h-1(15) = (1 - 12) % 7 = 0 collision 6 O 13
2
h2(15) = (1 + 2 ) % 7 = 5 collision
h-2(15) = (1 - 22) % 7 = -3
NORMALIZE: (-3 + 7) % 7 = 4 collision 4
h3(15) = (1 + 32)%7 = 3
Secondary Clusters
• Quadratic probing is better than linear probing because it eliminates primary
clustering.
• However, it may result in secondary clustering: if h(k1) = h(k2) the probing
sequences for k1 and k2 are exactly the same. This sequence of locations is called
a secondary cluster.
• Secondary clustering is less harmful than primary clustering because secondary
clusters do not combine to form large clusters.
• Example of Secondary Clustering: Suppose keys k0, k1, k2, k3, and k4 are
inserted in the given order in an originally empty hash table using quadratic
probing with c(i) = i2. Assuming that each of the keys hashes to the same array
index x. A secondary cluster will develop and grow in size:

5
Double Hashing
• To eliminate secondary clustering, synonyms must have different probe sequences.

• Double hashing achieves this by having two hash functions that both depend on the
hash key.

• c(i) = i * hp(key) for i = 0, 1, . . . , tableSize – 1

where hp (or h2) is another hash function.

• The probing sequence is:

hi(key) = [h(key) + i*hp(key)]% tableSize for i = 0, 1, . . . , tableSize – 1

• The function c(i) = i*hp(r) satisfies Property 2 provided hp(r) and tableSize are
relatively prime.

• To guarantee Property 2, tableSize must be a prime number.

• Common definitions for hp are :

¾ hp(key) = 1 + key % (tableSize - 1)
¾ hp(key) = q - (key % q) where q is a prime less than tableSize
¾ hp(key) = q*(key % q) where q is a prime less than tableSize
6
Double Hashing (cont'd)
Performance of Double hashing:
– Much better than linear or quadratic probing because it eliminates both primary
and secondary clustering.
– BUT requires a computation of a second hash function hp.

Example: Load the keys 18, 26, 35, 9, 64, 47, 96, 36, and 70 in this order, in an
empty hash table of size 13
(a) using double hashing with the first hash function: h(key) = key % 13 and the
second hash function: hp(key) = 1 + key % 12
(b) using double hashing with the first hash function: h(key) = key % 13 and
the second hash function: hp(key) = 7 - key % 7
Show all computations.

7
Double Hashing (cont’d)

h0(18) = (18%13)%13 = 5 hi(key) = [h(key) + i*hp(key)]% 13

h0(26) = (26%13)%13 = 0 h(key) = key % 13
h0(35) = (35%13)%13 = 9
h0(9) = (9%13)%13 = 9 collision hp(key) = 1 + key % 12
hp(9) = 1 + 9%12 = 10
h1(9) = (9 + 1*10)%13 = 6
h0(64) = (64%13)%13 = 12
h0(47) = (47%13)%13 = 8
h0(96) = (96%13)%13 = 5 collision
hp(96) = 1 + 96%12 = 1
h1(96) = (5 + 1*1)%13 = 6 collision
h2(96) = (5 + 2*1)%13 = 7
h0(36) = (36%13)%13 = 10
h0(70) = (70%13)%13 = 5 collision
hp(70) = 1 + 70%12 = 11
h1(70) = (5 + 1*11)%13 = 3
8
Double Hashing (cont'd)

h0(18) = (18%13)%13 = 5 hi(key) = [h(key) + i*hp(key)]% 13

h0(26) = (26%13)%13 = 0 h(key) = key % 13
h0(35) = (35%13)%13 = 9
h0(9) = (9%13)%13 = 9 collision hp(key) = 7 - key % 7
hp(9) = 7 - 9%7 = 5
h1(9) = (9 + 1*5)%13 = 1
h0(64) = (64%13)%13 = 12
h0(47) = (47%13)%13 = 8
h0(96) = (96%13)%13 = 5 collision
hp(96) = 7 - 96%7 = 2
h1(96) = (5 + 1*2)%13 = 7
h0(36) = (36%13)%13 = 10
h0(70) = (70%13)%13 = 5 collision
hp(70) = 7 - 70%7 = 7
h1(70) = (5 + 1*7)%13 = 12 collision
h2(70) = (5 + 2*7)%13 = 6
9
Rehashing
• As noted before, with open addressing, if the hash tables become
too full, performance can suffer a lot.
• So, what can we do?
• We can double the hash table size, modify the hash function, and
re-insert the data.
– More specifically, the new size of the table will be the first
prime that is more than twice as large as the old table size.

10
Implementation of Open Addressing
public class OpenScatterTable extends AbstractHashTable {
protected Entry array[];
protected static final int EMPTY = 0;
protected static final int OCCUPIED = 1;
protected static final int DELETED = 2;

protected static final class Entry {

public int state = EMPTY;
public Comparable object;
// …
}

public OpenScatterTable(int size) {

array = new Entry[size];
for(int i = 0; i < size; i++)
array[i] = new Entry();
}
// …
}

11
Implementation of Open Addressing (Con’t.)
/* finds the index of the first unoccupied slot
in the probe sequence of obj */
protected int findIndexUnoccupied(Comparable obj){
int hashValue = h(obj);
int tableSize = getLength();
int indexDeleted = -1;
for(int i = 0; i < tableSize; i++){
int index = (hashValue + c(i)) % tableSize;
if(array[index].state == OCCUPIED
&& obj.equals(array[index].object))
throw new IllegalArgumentException(
"Error: Duplicate key");
else if(array[index].state == EMPTY ||
(array[index].state == DELETED &&
obj.equals(array[index].object)))
return indexDeleted ==-1?index:indexDeleted;
else if(array[index].state == DELETED &&
indexDeleted == -1)
indexDeleted = index;
}
if(indexDeleted != -1) return indexDeleted;
throw new IllegalArgumentException(
"Error: Hash table is full");
}
12
Implementation of Open Addressing (Con’t.)
protected int findObjectIndex(Comparable obj){
int hashValue = h(obj);
int tableSize = getLength();

for(int i = 0; i < tableSize; i++){

int index = (hashValue + c(i)) % tableSize;
if(array[index].state == EMPTY
|| (array[index].state == DELETED
&& obj.equals(array[index].object)))
return -1;
else if(array[index].state == OCCUPIED
&& obj.equals(array[index].object))
return index;
}
return -1;
}

public Comparable find(Comparable obj){

int index = findObjectIndex(obj);
if(index >= 0)return array[index].object;
else return null; 13
}
Implementation of Open Addressing (Con’t.)
public void insert(Comparable obj){
if(count == getLength()) throw new ContainerFullException();
else {
int index = findIndexUnoccupied(obj);
// throws exception if an UNOCCUPIED slot is not found
array[index].state = OCCUPIED;
array[index].object = obj;
count++;
}
}

public void withdraw(Comparable obj){

if(count == 0) throw new ContainerEmptyException();
int index = findObjectIndex(obj);
if(index < 0)
throw new IllegalArgumentException("Object not found");
else {
array[index].state = DELETED;
// lazy deletion: DO NOT SET THE LOCATION TO null
count--;
}
} 14
Exercises
1. If a hash table is 25% full what is its load factor?

2. Given that,
c(i) = i2,
for c(i) in quadratic probing, we discussed that this equation
does not satisfy Property 2, in general. What cells are missed by
this probing formula for a hash table of size 17? Characterize
using a formula, if possible, the cells that are not examined by
using this function for a hash table of size n.

3. It was mentioned in this session that secondary clusters are less

harmful than primary clusters because the former cannot combine
to form larger secondary clusters. Use an appropriate hash table
of records to exemplify this situation.
15

Dakshina Bharat Hindi Prachar Sabha, Madras Rashtrabhasha-Answer Book First Page
No ratings yet
Dakshina Bharat Hindi Prachar Sabha, Madras Rashtrabhasha-Answer Book First Page
1 page
Modification of Simplex Method and Its Implementation in Visual Basic Simplexfol
No ratings yet
Modification of Simplex Method and Its Implementation in Visual Basic Simplexfol
9 pages
Collection & Maps Question 1
100% (1)
Collection & Maps Question 1
11 pages
Algorithms All Sortings
No ratings yet
Algorithms All Sortings
91 pages
241-423 Advanced Data Structures and Algorithms: 9. Queues
No ratings yet
241-423 Advanced Data Structures and Algorithms: 9. Queues
41 pages
Syllabus Python Masterclass
No ratings yet
Syllabus Python Masterclass
8 pages
Algorithm To Become A Good Programmer by Ashish Kedia
No ratings yet
Algorithm To Become A Good Programmer by Ashish Kedia
2 pages
"Doing" Strategy
No ratings yet
"Doing" Strategy
10 pages
DS Lecture - 6 (Hashing)
No ratings yet
DS Lecture - 6 (Hashing)
27 pages
Private Mix
No ratings yet
Private Mix
91 pages
IronFX Case Study
No ratings yet
IronFX Case Study
18 pages
Java Lab Manual SDES
No ratings yet
Java Lab Manual SDES
41 pages
Algorithm Types and Classification
No ratings yet
Algorithm Types and Classification
5 pages
CIB R&a Banking Junior Analyst Academic Intern FinTech
No ratings yet
CIB R&a Banking Junior Analyst Academic Intern FinTech
2 pages
Week4 Assignment Solution
No ratings yet
Week4 Assignment Solution
2 pages
Algorithms 2
No ratings yet
Algorithms 2
49 pages
Greedy Algorithm
No ratings yet
Greedy Algorithm
34 pages
Data Warehousing and Data Mining
No ratings yet
Data Warehousing and Data Mining
7 pages
Unit-4 Greedy Algorithms
No ratings yet
Unit-4 Greedy Algorithms
71 pages
Data Structures Unit 5
No ratings yet
Data Structures Unit 5
20 pages
Red and White Productive Habits Self-Improvement Infographic Poster
No ratings yet
Red and White Productive Habits Self-Improvement Infographic Poster
1 page
MBA in FinTech
No ratings yet
MBA in FinTech
14 pages
Dynamic Programming
No ratings yet
Dynamic Programming
110 pages
Dynamic Programming
No ratings yet
Dynamic Programming
14 pages
Project Time Management Literature Review
100% (2)
Project Time Management Literature Review
7 pages
Priority Queues
No ratings yet
Priority Queues
6 pages
Binary Tree in Java
No ratings yet
Binary Tree in Java
79 pages
Programming & Algorithms
No ratings yet
Programming & Algorithms
5 pages
Greedy Algorithm
100% (1)
Greedy Algorithm
18 pages
Dynamic Programming Vs Greedy MEthod
100% (1)
Dynamic Programming Vs Greedy MEthod
33 pages
OOP Basics - Java Programming Tutorial
No ratings yet
OOP Basics - Java Programming Tutorial
30 pages
(Numpy) - Extended Cheatsheet
No ratings yet
(Numpy) - Extended Cheatsheet
8 pages
Valid AWS-Solution-Architect-Associate Dumps PDF - AWS-Solution-Archi
No ratings yet
Valid AWS-Solution-Architect-Associate Dumps PDF - AWS-Solution-Archi
2 pages
Apache Spark For Beginners
No ratings yet
Apache Spark For Beginners
30 pages
Effective Whispers (1166 +) to Easily Learn New Skills and Subjects, Develop Laser Sharp Focus, and Improve Your Memory Ability
From Everand
Effective Whispers (1166 +) to Easily Learn New Skills and Subjects, Develop Laser Sharp Focus, and Improve Your Memory Ability
Nicholas Mag
No ratings yet
Machine Learning: Data Set
100% (1)
Machine Learning: Data Set
52 pages
7 HospitalMS Mysql CaseStudy
No ratings yet
7 HospitalMS Mysql CaseStudy
2 pages
Data Engineering Interviews
No ratings yet
Data Engineering Interviews
33 pages
Data Science Interview Questions - Statistics: Mohit Kumar Dec 12, 2018 11 Min Read
100% (1)
Data Science Interview Questions - Statistics: Mohit Kumar Dec 12, 2018 11 Min Read
14 pages
Data Science Deep Learning & Artificial Intelligence
No ratings yet
Data Science Deep Learning & Artificial Intelligence
9 pages
Machine Learning C
No ratings yet
Machine Learning C
24 pages
Amazon - Test Inside - Aws Solution Architect Associate - Simulations.2020 Dec 29.by - Howar.146q.vce
No ratings yet
Amazon - Test Inside - Aws Solution Architect Associate - Simulations.2020 Dec 29.by - Howar.146q.vce
29 pages
Bayesforbeginners
No ratings yet
Bayesforbeginners
21 pages
Chapter 2 Kimball Dimensional Modelling Techniques Overview
No ratings yet
Chapter 2 Kimball Dimensional Modelling Techniques Overview
14 pages
Dynamic Time Warping
No ratings yet
Dynamic Time Warping
5 pages
Data Types JavaScript
No ratings yet
Data Types JavaScript
23 pages
Prolog Coding
No ratings yet
Prolog Coding
552 pages
Leetcode Preparation
No ratings yet
Leetcode Preparation
14 pages
Hassan Raza Test
No ratings yet
Hassan Raza Test
4 pages
Getting Started With Hazelcast - Second Edition - Sample Chapter
0% (1)
Getting Started With Hazelcast - Second Edition - Sample Chapter
14 pages
Alfresco High Availability Infrastructure in Real Life
No ratings yet
Alfresco High Availability Infrastructure in Real Life
5 pages
Sub-Conscious Mind: Your Hidden Curator
100% (2)
Sub-Conscious Mind: Your Hidden Curator
25 pages
Types of Digital Data
No ratings yet
Types of Digital Data
22 pages
Model Paper - 2016 Introductory
No ratings yet
Model Paper - 2016 Introductory
9 pages
Python Pandas Series
No ratings yet
Python Pandas Series
37 pages
Apache Mahout Essentials - Sample Chapter
No ratings yet
Apache Mahout Essentials - Sample Chapter
25 pages
Storage Area Network: Lecture Notes
100% (1)
Storage Area Network: Lecture Notes
29 pages
Big Query Interview Q&A
No ratings yet
Big Query Interview Q&A
8 pages
Spark Training - Java
No ratings yet
Spark Training - Java
8 pages
Basic R
No ratings yet
Basic R
3 pages
What Is Azure IoT Hub
No ratings yet
What Is Azure IoT Hub
7 pages
Hash Table
No ratings yet
Hash Table
9 pages
C Exercises - Display The Pattern Like Right Angle Using An Asterisk - W3resource
No ratings yet
C Exercises - Display The Pattern Like Right Angle Using An Asterisk - W3resource
1 page
É Xé ºéæj Éé Eöò É +æeò - Éé (Iééæeò - É Xé ºéæj Éé Eöò É +æeò - Éé (Iééæeò
No ratings yet
É Xé ºéæj Éé Eöò É +æeò - Éé (Iééæeò - É Xé ºéæj Éé Eöò É +æeò - Éé (Iééæeò
1 page
C Exercises - Display The Sum of First 10 Natural Numbers - W3resource
No ratings yet
C Exercises - Display The Sum of First 10 Natural Numbers - W3resource
1 page
1 - 2software: : Software Is (1) Instructions (Computer
No ratings yet
1 - 2software: : Software Is (1) Instructions (Computer
18 pages
Se Unit I
No ratings yet
Se Unit I
38 pages
DBMS - Interview Questions and Answers: Level 1
No ratings yet
DBMS - Interview Questions and Answers: Level 1
30 pages
CSE 326: Data Structures AVL Trees The AVL Balance Condition
No ratings yet
CSE 326: Data Structures AVL Trees The AVL Balance Condition
9 pages
Arithmetic Operators
No ratings yet
Arithmetic Operators
4 pages
CS / IT 425 C (CR) Iv/Iv B.Tech Degree Examination, March/April 2015 Second Semester CS/ It Cloud Computing Answer Question No. 1 Compulsory (14x1 14) Answer ONE Question From Each Unit 4x14 56
No ratings yet
CS / IT 425 C (CR) Iv/Iv B.Tech Degree Examination, March/April 2015 Second Semester CS/ It Cloud Computing Answer Question No. 1 Compulsory (14x1 14) Answer ONE Question From Each Unit 4x14 56
8 pages
JNNNN
No ratings yet
JNNNN
1 page
Efghij Klmno PMQRMS: & - CV - Z - B & U ZD
No ratings yet
Efghij Klmno PMQRMS: & - CV - Z - B & U ZD
1 page
Class Addtwonumbers //class Declaration (
No ratings yet
Class Addtwonumbers //class Declaration (
1 page
Indian Rupee Payment - Individual Option
No ratings yet
Indian Rupee Payment - Individual Option
4 pages
Emergency Exit in Networking
No ratings yet
Emergency Exit in Networking
2 pages
English
No ratings yet
English
1 page
ABEN-3412_Module_3
No ratings yet
ABEN-3412_Module_3
24 pages
HVAC Calculation
No ratings yet
HVAC Calculation
6 pages
BK2461 Beken
No ratings yet
BK2461 Beken
95 pages
Plug and Abandonment Solution For Oilfield Decommissioning in The North Sea
100% (2)
Plug and Abandonment Solution For Oilfield Decommissioning in The North Sea
19 pages
Datasheet_S11R315MX4
No ratings yet
Datasheet_S11R315MX4
2 pages
Statistical and Probability Analysis of Rainfall For Crop Planning PDF
100% (1)
Statistical and Probability Analysis of Rainfall For Crop Planning PDF
8 pages
How LTE Stuff Works?
No ratings yet
How LTE Stuff Works?
1 page
Marcel Felizardo - Portfolio 2010
No ratings yet
Marcel Felizardo - Portfolio 2010
24 pages
Chem Eng 2009
No ratings yet
Chem Eng 2009
10 pages
ch04 Equilibrium of Rigid Bodies
No ratings yet
ch04 Equilibrium of Rigid Bodies
61 pages
Week 11 Surveying
No ratings yet
Week 11 Surveying
54 pages
The Witch's Swampwoods
No ratings yet
The Witch's Swampwoods
8 pages
Design of Machine Elements Project
No ratings yet
Design of Machine Elements Project
43 pages
Data Base Processing Fundamentals, Design, and Implementation 15th Edition David M. Kroenke download
100% (1)
Data Base Processing Fundamentals, Design, and Implementation 15th Edition David M. Kroenke download
41 pages
For PPS
No ratings yet
For PPS
57 pages
Angelus V Overview Brochure 0875
No ratings yet
Angelus V Overview Brochure 0875
2 pages
ECE250 Lab 8 MOSFET Audio PA Amp Demo
No ratings yet
ECE250 Lab 8 MOSFET Audio PA Amp Demo
6 pages
Chapter 8 Solutions (5th Ed.)
No ratings yet
Chapter 8 Solutions (5th Ed.)
4 pages
An Automated Traffic Accident Detection and Alarm Device
No ratings yet
An Automated Traffic Accident Detection and Alarm Device
4 pages
FAANG Puzzle Combine
No ratings yet
FAANG Puzzle Combine
40 pages
How To Convert Si To British Unit PDF
No ratings yet
How To Convert Si To British Unit PDF
1 page
Fuji Electric Contactor FJ Series Full Catalog - V1
No ratings yet
Fuji Electric Contactor FJ Series Full Catalog - V1
36 pages
Management of Alveolar Clefts Using Dento-Osseous Transport Distraction Osteogenesis
No ratings yet
Management of Alveolar Clefts Using Dento-Osseous Transport Distraction Osteogenesis
7 pages
Transmission Line & Feeder Protection
No ratings yet
Transmission Line & Feeder Protection
13 pages
PDF Finite Generalized Quadrangles 2ed Edition Payne S.E. Download
100% (4)
PDF Finite Generalized Quadrangles 2ed Edition Payne S.E. Download
84 pages
PROGRAMMING 2nd Quarter Reviewer
No ratings yet
PROGRAMMING 2nd Quarter Reviewer
4 pages
Tablas para Metal Deck
No ratings yet
Tablas para Metal Deck
34 pages
G11 Chemistry
No ratings yet
G11 Chemistry
3 pages
Saint Venant Torsion Theory PDF
No ratings yet
Saint Venant Torsion Theory PDF
34 pages