Hashing Methods (1)

The document presents an overview of hashing techniques used in data structures, focusing on hash functions, collision resolution methods, and their characteristics. It covers various hashing methods such as the Division Method, Mid Square Method, Digit Folding Method, Linear Probing, Double Hashing, and Separate Chaining, along with examples and their pros and cons. Additionally, it discusses the estimation of overflows in hash tables based on the density of data insertion.

Uploaded by

aymen Beskri


FILE ORGANIZATION

Hashing
Presented by Pr. Nabil KESKES

year 2023-2024.

PLAN

Introduction
Hash Functions
Collision
Conclusion

1. Introduction
A hash table uses an array as its storage medium and uses a hashing technique to compute the index at which an element is inserted or looked up.

Hashing is a technique that converts a range of key values into a range of array indexes.

2. Types of Hash Functions in Data Structures
A hash function maps a large number or a string to a small integer that can be used as an index into the hash table.
A good hash function should have the following characteristics:

It should be deterministic: a given input always produces the same output.

It should be fast to compute.

It should scatter keys evenly, so that regularities in the input do not show up as regularities in the output.

It should produce as few collisions as possible (ideally none).

1. Division Method:
This is the simplest way to generate a hash value. The hash function divides the key k by M and uses the remainder.

Formula:

h(k) = k mod M

Here,
k is the key value, and
M is the size of the hash table.

Example:

k = 12345
M = 95
h(12345) = 12345 mod 95
         = 90

k = 1276
M = 11
h(1276) = 1276 mod 11
        = 0
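
The two examples above can be checked with a minimal Python sketch. The function name `division_hash` is chosen here for illustration and does not come from the slides.

```python
def division_hash(k: int, m: int) -> int:
    """Division method: the hash value is the remainder of k divided by m."""
    if m <= 0:
        raise ValueError("table size m must be positive")
    return k % m


print(division_hash(12345, 95))  # 90
print(division_hash(1276, 11))   # 0
```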
Pros:
The method works for any table size M.
It is very fast, since it requires only a single division operation.
Cons:
Consecutive keys map to consecutive hash values, which can lead to poor performance when the keys are clustered.
M must be chosen with care; a prime number not close to a power of 2 is a common recommendation.

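The drawback with consecutive keys is easy to see on a small case. The sketch below assumes a hypothetical table size of 11 and an arbitrary run of keys; neither value comes from the slides.

```python
M = 11  # hypothetical table size, chosen only for this illustration

# Six consecutive keys land in six consecutive slots.
keys = range(100, 106)
indexes = [k % M for k in keys]
print(indexes)  # [1, 2, 3, 4, 5, 6]
```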
2. Mid Square Method

Computing the hash value takes two steps:

Square the key k (that is, compute k × k).

Extract the middle r digits of the square as the hash value.

Formula: h(k) = middle r digits of k × k

(where k is the key value)

Example:

Suppose the hash table has 100 memory locations. Then r = 2, because two digits are enough to address every location (00 to 99).

k = 60
k × k = 60 × 60
      = 3600
h(60) = 60 (the middle two digits of 3600)

The hash value obtained is 60.

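A minimal Python sketch of the mid square method follows. The slides do not say which digits count as the "middle" when they cannot be centred exactly, so this sketch assumes the window leans to the left in that case; the function name is also an assumption.

```python
def mid_square_hash(k: int, r: int) -> int:
    """Square k and return the middle r digits of the result."""
    digits = str(k * k)
    if r >= len(digits):
        # The square has no more than r digits: use all of them.
        return int(digits)
    # Assumption: when the window cannot be centred exactly, lean left.
    start = (len(digits) - r) // 2
    return int(digits[start:start + r])


print(mid_square_hash(60, 2))  # 60, the middle two digits of 3600
```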
3. Digit Folding Method:
This method involves two steps:

Divide the key value k into a number of parts k1, k2, k3, ..., kn, where each part has the same number of digits except for the last part, which may have fewer digits than the others.

Add the individual parts. The hash value is obtained by ignoring the final carry, if any.

Formula:
k = k1, k2, k3, k4, ..., kn
s = k1 + k2 + k3 + k4 + ... + kn
h(k) = s
Here,
s is obtained by adding the parts of the key k

Example:
k = 12345
k1 = 12, k2 = 34, k3 = 5
s = k1 + k2 + k3
  = 12 + 34 + 5
  = 51
h(k) = 51

Note:

The number of digits in each part depends on the size of the hash table. For example, if the size of the hash table is 100, then each part must have two digits, except for the last part, which may have fewer.

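The folding steps can be sketched in Python as below. The name `folding_hash` and the `part_len` parameter are assumptions made for this illustration, and the key is assumed to be a non-negative integer. Dropping the final carry is done by keeping only the last `part_len` digits of the sum.

```python
def folding_hash(k: int, part_len: int) -> int:
    """Split the decimal digits of k into parts of part_len digits,
    add the parts, and drop any carry beyond part_len digits."""
    digits = str(k)
    parts = [int(digits[i:i + part_len])
             for i in range(0, len(digits), part_len)]
    return sum(parts) % (10 ** part_len)


print(folding_hash(12345, 2))   # 12 + 34 + 5 = 51
print(folding_hash(987654, 2))  # 98 + 76 + 54 = 228, carry dropped: 28
```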
Hash Collision

A hash collision happens when a hash function produces the same hash value for two different input values. Collisions are not a defect, though: because there are usually far more possible keys than table slots, they are a fundamental aspect of hashing and must be handled by a collision resolution method.

Linear Probing

Linear probing searches the table sequentially, starting from the slot given by the hash function. If that slot is already occupied, the next slot is examined, wrapping around to the start of the table when the end is reached. The interval between probes is fixed (usually 1).

The initial index is: index = key % hashTableSize

In the following, hash(n) is the index computed using a hash function, and T is the table size.

If slot index = hash(n) % T is full, then the next slot index is calculated by adding 1: (hash(n) + 1) % T.
The probe sequence is:
hash(n) % T
(hash(n) + 1) % T
(hash(n) + 2) % T
(hash(n) + 3) % T ... and so on.

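The probe sequence can be written as a small Python generator. This is only a sketch: the name `probe_sequence` and the starting slot 18 are chosen here to show the wrap-around and are not taken from the slides.

```python
from itertools import islice


def probe_sequence(home: int, table_size: int):
    """Yield home, home + 1, home + 2, ... modulo table_size,
    visiting every slot of the table exactly once."""
    for i in range(table_size):
        yield (home + i) % table_size


# Starting near the end of a 20-slot table shows the wrap-around.
print(list(islice(probe_sequence(18, 20), 5)))  # [18, 19, 0, 1, 2]
```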
Example:

For a hash table, Table Size = 20

Keys = 3, 2, 46, 6, 11, 13, 53, 12, 70, 90

SL. NO   KEY   HASH      INDEX   INDEX (AFTER LINEAR PROBING)
1        3     3 % 20    3       3
2        2     2 % 20    2       2
3        46    46 % 20   6       6
4        6     6 % 20    6       7
5        11    11 % 20   11      11
6        13    13 % 20   13      13
7        53    53 % 20   13      14
8        12    12 % 20   12      12
9        70    70 % 20   10      10
10       90    90 % 20   10      15

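The example can be replayed with a short Python sketch of insertion by linear probing. The helper name `insert_linear` and the use of `None` to mark an empty slot are assumptions of this sketch. The last key, 90, hashes to index 10 and ends up at index 15 because slots 10 to 14 are already taken.

```python
def insert_linear(table: list, key: int) -> int:
    """Insert key using linear probing and return the slot it ends up in."""
    size = len(table)
    home = key % size
    for i in range(size):
        idx = (home + i) % size
        if table[idx] is None:  # None marks an empty slot in this sketch
            table[idx] = key
            return idx
    raise RuntimeError("hash table is full")


table = [None] * 20
placed = {k: insert_linear(table, k)
          for k in [3, 2, 46, 6, 11, 13, 53, 12, 70, 90]}
print(placed)
# {3: 3, 2: 2, 46: 6, 6: 7, 11: 11, 13: 13, 53: 14, 12: 12, 70: 10, 90: 15}
```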
Double Hashing

Double hashing is a collision resolution technique used in hash tables. It uses two hash functions to compute two different hash values for a given key. The first hash function gives the initial slot, and the second hash function gives the step size of the probing sequence.

The slot examined at probe number i (i = 0, 1, 2, ...) is:

(hash1(key) + i * hash2(key)) % TABLE_SIZE

Here hash1() and hash2() are hash functions and TABLE_SIZE is the size of the hash table.

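Example:

Insert keys: 4, 9, 14, 1, 19

h(x) = x mod 5
h2(x) = 3 - (x mod 3)

Key   h(x)            h2(x)                 Slot
4     4 mod 5 = 4                           4
9     9 mod 5 = 4     3 - (9 mod 3) = 3     (4 + 3) mod 5 = 2
14    14 mod 5 = 4    3 - (14 mod 3) = 1    (4 + 1) mod 5 = 0
1     1 mod 5 = 1                           1
19    19 mod 5 = 4    3 - (19 mod 3) = 2    (4 + 2) mod 5 = 1 is occupied, so (4 + 2 × 2) mod 5 = 3

Resulting table:

Index   0    1    2    3    4
Key     14   1    9    19   4
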
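The example can be reproduced with a minimal Python sketch. The two hash functions are the ones given in the example; the helper name `insert_double` and the use of `None` for an empty slot are assumptions of this sketch.

```python
TABLE_SIZE = 5


def h1(x: int) -> int:
    return x % TABLE_SIZE


def h2(x: int) -> int:
    # Always between 1 and 3, so the step size is never zero.
    return 3 - (x % 3)


def insert_double(table: list, key: int) -> int:
    """Insert key using double hashing and return the slot it ends up in."""
    for i in range(len(table)):
        idx = (h1(key) + i * h2(key)) % len(table)
        if table[idx] is None:
            table[idx] = key
            return idx
    raise RuntimeError("no free slot found on the probe sequence")


table = [None] * TABLE_SIZE
for key in [4, 9, 14, 1, 19]:
    insert_double(table, key)
print(table)  # [14, 1, 9, 19, 4]
```

Key 19 is the only one that needs a second probe: its first alternative slot, 1, is already held by key 1.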
Separate Chaining:

This method is implemented using linked lists. When several elements hash to the same slot index, they are added to that slot's chain, which is a singly linked list.

Let's use "key mod 7" as our simple hash function with the following key values: 50, 700, 76, 85, 92, 73, 101.
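
The resulting chains can be computed with a short Python sketch. Two simplifications are assumed here: Python lists stand in for the singly linked lists, and each new key is appended at the end of its chain (the slides do not say whether insertion happens at the head or the tail).

```python
def build_chains(keys, table_size=7):
    """Hash each key with key mod table_size and append it to that slot's chain."""
    table = [[] for _ in range(table_size)]
    for key in keys:
        table[key % table_size].append(key)
    return table


chains = build_chains([50, 700, 76, 85, 92, 73, 101])
for index, chain in enumerate(chains):
    print(index, chain)
# 0 [700]
# 1 [50, 85, 92]
# 2 []
# 3 [73, 101]
# 4 []
# 5 []
# 6 [76]
```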

Estimation of overflows

Consider a table of N elements (boxes) into which we would like to insert r data items.

The filling percentage (density) is therefore: d = r / N
Let P(x) be the probability that x of the r data items are hashed to the same element:

$$P(x) = \binom{r}{x} \left(1 - \frac{1}{N}\right)^{r-x} \left(\frac{1}{N}\right)^{x}$$

Assuming a uniform hash function, the Poisson distribution is a good approximation:

$$P(x) = \frac{d^{x} e^{-d}}{x!} \quad (\text{with } d = r/N)$$

N*P(x) is therefore an estimate of the number of boxes chosen x times during the insertion of r data items into the table. A box chosen x times keeps one item at its primary address and overflows the other x - 1, so the total number of overflow data items is estimated at:

$$N P(2) + 2 N P(3) + 3 N P(4) + 4 N P(5) + \dots$$

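To see how close the Poisson approximation is to the binomial probability, both can be evaluated side by side. The sketch below uses only the Python standard library; the function names are chosen here, and r = N = 1000 is picked as a sample case.

```python
import math


def binomial_p(x: int, r: int, n: int) -> float:
    """Exact probability that a given box is chosen x times out of r insertions."""
    return math.comb(r, x) * (1 - 1 / n) ** (r - x) * (1 / n) ** x


def poisson_p(x: int, d: float) -> float:
    """Poisson approximation with density d = r / n."""
    return d ** x * math.exp(-d) / math.factorial(x)


r, n = 1000, 1000
for x in range(6):
    print(x, round(binomial_p(x, r, n), 4), round(poisson_p(x, r / n), 4))
```

For this case the two columns differ by less than 0.001 for every x shown.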
When inserting 1000 data items into a table of 1000 boxes (density = 1), we estimate that:

N*P(0) = 368 boxes will not receive any data
N*P(1) = 368 boxes will have been chosen only once
N*P(2) = 184 boxes will have been chosen twice
N*P(3) = 61 boxes will have been chosen 3 times
N*P(4) = 15 boxes will have been chosen 4 times
N*P(5) = 3 boxes will have been chosen 5 times
N*P(6) < 1, so practically no box will have been chosen 6 times

The number of overflow data items is close to 184 + 2*61 + 3*15 + 4*3 = 363, or 36% of the data, against 631 data items stored at their primary addresses (368 + 184 + 61 + 15 + 3 = 631).

For a density of 0.5 (e.g. r = 500 and N = 1000), about 21% of the data would overflow.
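
These estimates can be recomputed with a short Python sketch based on the Poisson approximation. The function names and the cut-off `x_max` are assumptions of this sketch. Because it sums the series well beyond x = 5 and does not round the box counts, it gives about 36.8% for density 1, slightly above the hand-computed 36%.

```python
import math


def poisson_p(x: int, d: float) -> float:
    return d ** x * math.exp(-d) / math.factorial(x)


def expected_boxes(x: int, r: int, n: int) -> float:
    """Estimated number of boxes chosen exactly x times: N * P(x)."""
    return n * poisson_p(x, r / n)


def overflow_fraction(r: int, n: int, x_max: int = 50) -> float:
    """Estimated share of the r items that overflow: a box chosen
    x times contributes x - 1 overflow items."""
    overflow = sum((x - 1) * expected_boxes(x, r, n)
                   for x in range(2, x_max + 1))
    return overflow / r


print([round(expected_boxes(x, 1000, 1000)) for x in range(6)])
# [368, 368, 184, 61, 15, 3]
print(round(overflow_fraction(1000, 1000), 3))  # density 1
print(round(overflow_fraction(500, 1000), 3))   # density 0.5
```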
