Viden Io Data Analytics Lecture7 Data Stream Filtering PDF
Viden Io Data Analytics Lecture7 Data Stream Filtering PDF
• • x : An element
• S: A set of elements
• Input: x,S
• Output:
• -TRUE if x in S
• -FALSE if x not in S
• A Bloom filter consists of vectors of n boolean values, initially all
set false, as well as k independent hash functions, h1,h2,.....,hk,
each with range {0,1,..., n-1}
0 0 0 0 0 0 0 0 0 0
0 1 2 3 4 5 6 7 8 9 Initial setup n=10
• For each element x in S, the boolean values with positions h1(x),
h2(x),...,hk(x), are set true.
x1 x2
h1 h3 h1 h3
h2 h 2