3 Hashing

The document discusses different hashing techniques used for storing items in a hash table, including hash functions, open addressing techniques like linear probing and quadratic probing, double hashing, and chaining. It provides examples and explanations of how each technique works.

Uploaded by

Shahnawaz Khan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views

3 Hashing

Uploaded by

Shahnawaz Khan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 20

Hashing

General Idea
• Facilitates search ideally in O(1) time
• The ideal hash table structure is merely an array of some ﬁxed
size, containing the items.
• A stored item needs to have a data member, called key, that
will be used in computing the index value for the item.
• Key could be an integer, a string, etc
▪ e.g. a name or Id that is a part of a large employee structure
• The size of the array is TableSize.
• The items that are stored in the hash table are indexed by
values from 0 to TableSize – 1.
• Each key is mapped into some number in the range 0 to
TableSize – 1.
• The mapping is called a hash function.
Example Hash

0 Table
1
Item
2
s 25000
john
3 john 25000
Hash
phil 31250 ke Functio 4 phil 31250
y n
dave 27500 5
mary 28200 6 dave 27500
7 mary 28200
ke 8
y 9
Hash Function
• Function H from set of K keys to set of L memory
locations
▪ H:K →L
• The hash function:
▪ must be simple to compute.
▪ must distribute the keys evenly among the cells.
• If we know which keys will occur in advance we can
write perfect hash functions, but we don’t
• Problems:
▪ Keys may not be numeric.
▪ Number of possible keys is much larger than the space
available in table.
• Different keys may map into same location
▪ Hash function is not one-to-one => collision.
Some popular hash functions

• Division Method
• Midsquare Mthod
• Folding Method
Division Method

• h(k) = k mod M
• Generally, it is best to choose M to be a prime
number because making M a prime increases
the likelihood that the keys are mapped with
a uniformity in the output range of values.
Midsquare Method
• Step 1: Square the value of the key. That is,
ﬁnd k 2
• Step 2: Extract the middle r bits of the result
obtained in Step 1 where r is the size of the
Example: Calculate the hash value for keys 1234 and 5642 using the mid
address
square of hash
method. The location
table has 100 memory locations.

Note the hash table has 100 memory locations whose indices vary from
0-99. this means, only two digits are needed to map the key to a
location in the hash table, so r = 2.

When k = 1234, k2 = 1522756, h (k) = 27

When k = 5642, k 2 = 31832164, h (k) = 21

Observe that 3rd and 4th digits starting from the right are chosen.
Folding Method
• The folding method works in two steps.
• Step 1: Divide the key value into a number of parts. That is
divide k into parts, k1, k2, …, kn, where each part has the same
number of digits except the last part which may have lesser
digits than the other parts.
• Step 2: Add the individual parts. That is obtain the sum of k1 +
k2 + .. + kn. Hash value is produced by ignoring the last carry,
if any.
• Note that the number of digits in each part of the key will vary
depending upon the size of the hash table. For example, if the
hash table has a size of 1000. Then it means there are 1000
locations in the hash table. To address these 1000 locations, we
will need at least three digits, therefore, each part of the key
must have three digits except the last part which may have
lesser digits.
Collision Resolution

• Collision occurs when the hash

function maps two different keys to
same location
• Methods for handling collision:
• Open addressing
▪ Linear Probing
▪ Quadratic Probing
▪ Double Hashing
• Chaining
Collision Resolution with Open Addressing
• In an open addressing hashing system, all the
data go inside the table.
– Thus, a bigger table is needed.
• Generally the load factor should be below 0.5.
– If a collision occurs, alternative cells are tried until an
empty cell is found.
• More formally:
– Cells h 0(x), h 1(x), h 2(x), …are tried in succession where
h i (x) = (hash(x) + f(i)) mod TableSize, with f(0) = 0.
– The function f is the collision resolution strategy.
• There are three common collision resolution
strategies:
– Linear Probing
– Quadratic probing
– Double hashing
Linear Probing

• In linear probing, collisions are

resolved by sequentially scanning an
array (with wraparound) until an
empty cell is found.
▪ i.e. f is a linear function of i, typically f(i)= i.
• Example:
▪ Insert items with keys: 89, 18, 49, 58, 9
into an empty hash table.
▪ Table size is 10.
▪ Hash function is hash(x) = x mod 10.
o f(i) = i;
Linear probing
hash table after
each insertion
Find and Delete

• The ﬁnd algorithm follows the same

probe sequence as the insert algorithm.
▪ A ﬁnd for 58 would involve 4 probes.
▪ A ﬁnd for 19 would involve 5 probes.
• We must use lazy deletion (i.e. marking
items as deleted)
▪ Standard deletion (i.e. physically
removing the item) cannot be performed.
▪ e.g. remove 89 from hash table
Clustering Problem

• As long as table is big enough, a free

cell can always be found, but the time
to do so can get quite large.
• Worse, even if the table is relatively
empty, blocks of occupied cells start
forming.
• This effect is known as primary
clustering.
• Any key that hashes into the cluster
will require several attempts to resolve
Quadratic Probing

• Quadratic Probing eliminates primary

clustering problem of linear probing.
• Collision function is quadratic.
▪ The popular choice is f(i) = i 2.
• If the hash function evaluates to h and a
search in cell h is inconclusive, we try cells h
+ 1 2, h+22, … h + i 2.
▪ i.e. It examines cells 1,4,9 and so on away from
the original probe.
• Remember that subsequent probe points
are a quadratic number of positions from
the original probe point.
A quadratic
probing hash table
after each insertion
(note that the table
size was poorly
chosen because it is
not a prime
number).
Double Hashing

• A second hash function is used to drive the

collision resolution.
▪ f(i) = i * hash2(x)
• We apply a second hash function to x and
probe at a distance hash2(x), 2*hash2(x), …
and so on.
• The function hash2(x) must never evaluate to
zero.
▪ e.g. Let hash2(x) = x mod 9 and try to insert 99 in
the previous example.
Chaining
• The idea is to keep a list of all elements
that hash to the same value.
– The array elements are pointers to the ﬁrst
nodes of the lists.
– A new item is inserted to the front of the list.
• Advantages:
– Better space utilization for large items.
– Simple collision handling: searching linked
list.
– Overﬂow: we can store more items than the
hash table size.
– Deletion is quick and easy: deletion from the
linked list.
Example
Keys: 0, 1, 4, 9, 16, 25, 36, 49, 64, 81
hash(key) = key % 10.
0 0

1 81 1
2

4 64 4
5 25
6 36 16
7

9 49 9
Operations
• Initialization: all entries are set to NULL
• Find:
– locate the cell using hash function.
– sequential search on the linked list in that cell.
• Insertion:
– Locate the cell using hash function.
– (If the item does not exist) insert it as the ﬁrst
item in the list.
• Deletion:
– Locate the cell using hash function.
– Delete the item from the linked list.

Pet Products in The Philippines
No ratings yet
Pet Products in The Philippines
8 pages
María O Kean J (2015) Economía
No ratings yet
María O Kean J (2015) Economía
18 pages
Put A in The Right Box:: Slow-Fried French Fries
100% (2)
Put A in The Right Box:: Slow-Fried French Fries
1 page
MCQs On Motivation
93% (100)
MCQs On Motivation
12 pages
Hashing
No ratings yet
Hashing
20 pages
unit 1 Hashing
No ratings yet
unit 1 Hashing
61 pages
Cse373 10 Hashing
No ratings yet
Cse373 10 Hashing
36 pages
Hashing PPT For Student
No ratings yet
Hashing PPT For Student
53 pages
Algo Cha 8
No ratings yet
Algo Cha 8
20 pages
DS Lecture - 6 (Hashing)
No ratings yet
DS Lecture - 6 (Hashing)
27 pages
MODULE-5
No ratings yet
MODULE-5
33 pages
Hashing
No ratings yet
Hashing
56 pages
Hashing PDF
No ratings yet
Hashing PDF
56 pages
DS Lecture - 6 (Hashing)
No ratings yet
DS Lecture - 6 (Hashing)
26 pages
ADS M TECH MID 2
No ratings yet
ADS M TECH MID 2
26 pages
Hashing Updated
No ratings yet
Hashing Updated
26 pages
DS Lecture - 6 (Hashing)
No ratings yet
DS Lecture - 6 (Hashing)
32 pages
Hashing new
No ratings yet
Hashing new
48 pages
Hashing
No ratings yet
Hashing
35 pages
Hashing
No ratings yet
Hashing
35 pages
Hashing
No ratings yet
Hashing
30 pages
Hashing Algorithms
No ratings yet
Hashing Algorithms
22 pages
Lect Hashing
No ratings yet
Lect Hashing
36 pages
HASHING
No ratings yet
HASHING
63 pages
Unit-5 2
No ratings yet
Unit-5 2
9 pages
05 Hashing
No ratings yet
05 Hashing
47 pages
Hashing
No ratings yet
Hashing
34 pages
Hash Table: Didih Rizki Chandranegara
No ratings yet
Hash Table: Didih Rizki Chandranegara
33 pages
Hashing Techniques
No ratings yet
Hashing Techniques
13 pages
Lecture 27 - Hashing
No ratings yet
Lecture 27 - Hashing
48 pages
Hashing
No ratings yet
Hashing
23 pages
Hashing
No ratings yet
Hashing
23 pages
11 Hashing
No ratings yet
11 Hashing
60 pages
hashing v2 12032018
No ratings yet
hashing v2 12032018
23 pages
09 Hashtable
No ratings yet
09 Hashtable
53 pages
Hash Table PDF
No ratings yet
Hash Table PDF
25 pages
Collision
No ratings yet
Collision
24 pages
Hash Tables: Dr. Dibakar Saha
No ratings yet
Hash Tables: Dr. Dibakar Saha
26 pages
DSA LABTASK 12
No ratings yet
DSA LABTASK 12
5 pages
Hashing
No ratings yet
Hashing
30 pages
UNIT V - Hashing
No ratings yet
UNIT V - Hashing
20 pages
Study_Material_on_Hashing
No ratings yet
Study_Material_on_Hashing
4 pages
Chapter One - Hashing PDF
No ratings yet
Chapter One - Hashing PDF
30 pages
Hashing
No ratings yet
Hashing
44 pages
Ch7 Hashing
No ratings yet
Ch7 Hashing
12 pages
Hashing and Graphs
No ratings yet
Hashing and Graphs
28 pages
Hashing in Data Structure
No ratings yet
Hashing in Data Structure
43 pages
Hashing PDF
No ratings yet
Hashing PDF
65 pages
Struktur Data: By: Sri Rezeki Candra Nursari
No ratings yet
Struktur Data: By: Sri Rezeki Candra Nursari
34 pages
Lecture 08 - Hash Tables
No ratings yet
Lecture 08 - Hash Tables
21 pages
HASHING
No ratings yet
HASHING
21 pages
Hashing 1
No ratings yet
Hashing 1
26 pages
DSA Unit VI Hashing and File Organization
No ratings yet
DSA Unit VI Hashing and File Organization
56 pages
Week13 1
No ratings yet
Week13 1
16 pages
HASHING
No ratings yet
HASHING
16 pages
Unit-5
No ratings yet
Unit-5
50 pages
Hashing
No ratings yet
Hashing
42 pages
Hashing ClassNotes
No ratings yet
Hashing ClassNotes
8 pages
HAshing (Satish sir)
No ratings yet
HAshing (Satish sir)
52 pages
Hashing Slide
No ratings yet
Hashing Slide
16 pages
Chapter 11 Hashing
No ratings yet
Chapter 11 Hashing
42 pages
Chapter 8 - Searching
No ratings yet
Chapter 8 - Searching
44 pages
Hashing
From Everand
Hashing
Prakash Hegade
No ratings yet
Elementary Functional Analysis
From Everand
Elementary Functional Analysis
Georgi E. Shilov
4/5 (1)
Monthly Leads
No ratings yet
Monthly Leads
15 pages
TORRES de Lima Vs City of Manila
No ratings yet
TORRES de Lima Vs City of Manila
2 pages
Stanley Davidson v. Oilfield Maintenance Company, Inc., Mayronne Drilling Company, and Third Party, 423 F.2d 1115, 3rd Cir. (1970)
No ratings yet
Stanley Davidson v. Oilfield Maintenance Company, Inc., Mayronne Drilling Company, and Third Party, 423 F.2d 1115, 3rd Cir. (1970)
2 pages
Unit 1 - Teaching Aptitude
No ratings yet
Unit 1 - Teaching Aptitude
17 pages
Ebook SIF
No ratings yet
Ebook SIF
9 pages
Ob Group Assignment
No ratings yet
Ob Group Assignment
8 pages
Questionnaire
No ratings yet
Questionnaire
5 pages
Casiotone cts200
No ratings yet
Casiotone cts200
51 pages
173d Airborne Lessons Learned 7 Mar 1968
100% (3)
173d Airborne Lessons Learned 7 Mar 1968
37 pages
FEKO. Script Examples
No ratings yet
FEKO. Script Examples
182 pages
Resume For Teachers in Ontario
100% (2)
Resume For Teachers in Ontario
8 pages
[Ebooks PDF] download Intra Asian Trade and the World Market Routledge Studies in the Modern History of Asia 1st Edition Ajh Latham full chapters
100% (8)
[Ebooks PDF] download Intra Asian Trade and the World Market Routledge Studies in the Modern History of Asia 1st Edition Ajh Latham full chapters
81 pages
Ship Motions in Waves
100% (1)
Ship Motions in Waves
8 pages
Strategic Assessment of Godiva UK
No ratings yet
Strategic Assessment of Godiva UK
3 pages
Srs NISSAN SENTRA B16
No ratings yet
Srs NISSAN SENTRA B16
52 pages
Professional Graduate Programme in Electrical Engineering
No ratings yet
Professional Graduate Programme in Electrical Engineering
19 pages
AFM Dinner
No ratings yet
AFM Dinner
1 page
Camtech Manual (Maintenance Schedule) (1)
No ratings yet
Camtech Manual (Maintenance Schedule) (1)
130 pages
Implementation of Haccp
100% (1)
Implementation of Haccp
21 pages
Sime Darby 2021
No ratings yet
Sime Darby 2021
32 pages
Foxconn 661FX6617mi
No ratings yet
Foxconn 661FX6617mi
83 pages
طلب ترخيص عمل لمن هم على كفالة ذويهم
No ratings yet
طلب ترخيص عمل لمن هم على كفالة ذويهم
2 pages
Unit 1 Introduction To Statistics: Structure
No ratings yet
Unit 1 Introduction To Statistics: Structure
24 pages
The Evolution of Computers and The Internet
No ratings yet
The Evolution of Computers and The Internet
4 pages
In Ac rcub-DGMST-2020A1918024IISEPTEMBER
No ratings yet
In Ac rcub-DGMST-2020A1918024IISEPTEMBER
1 page

3 Hashing

Uploaded by

3 Hashing

Uploaded by

Hashing

When k = 1234, k2 = 1522756, h (k) = 27

• Collision occurs when the hash

• In linear probing, collisions are

• The ﬁnd algorithm follows the same

• As long as table is big enough, a free

• Quadratic Probing eliminates primary

• A second hash function is used to drive the

You might also like