Understanding Functional Dependencies in Databases
Understanding Functional Dependencies in Databases
Functional Dependencies
Source:
The Theory of Relational Databases
D. Maier, Ed. Computer Science Press
Available at: [Link]
1
Functional Dependencies
• Two primary purposes of databases are to
– attenuate data redundancy and
– enhance data reliability.
2
Example1
• Consider the relation
assign (Pilot, Flight, Date, Departs)
PILOT FLIGHT DATE DEPARTS
Cushing 83 9 Aug 10: 15a
Cushing 116 IO Aug 1:25p
Clark 281 8 Aug 5:50a
Clark 301 12 Aug 6:35p
Clark 83 11 Aug 10: 15a
Chin 83 13 Aug 10: 15a
Chin 116 12 Aug 1:25p
Copely 281 9 Aug 5:50a
Copely 281 13 Aug 5:50a
Copely 412 15 Aug 1:25p
3
Example1 - Observations
• The relation assign tells which pilot flies a given flight on a given day, and what
time the flight leaves.
• Not every combination of pilots, flights, dates, and times is allowable in assign.
• The following restrictions apply, among others:
1. For each flight there is exactly one time.
2. For any given pilot, date, and time, there is only one flight.
3. For a given flight and date, there is only one pilot.
• Informally, a functional dependency occurs when the values of a tuple on one set
of attributes uniquely determine the values on another set of attributes.
4
FD Definition
Def. Let r be a relation on scheme R, with X and Y
subsets of R. Relation r satisfies the functional
dependency (FD) X Y if for every X-value x,
y( X=x(r)) has at most one tuple.
5
FD Satisfies
Algorithm 4.1 SATISFIES
Input: A relation r and an FD X Y.
Output: true if T satisfies X Y, false otherwise.
SATISFIES(r, X Y);
1. Sort the relation r on its X columns to bring tuples with
equal X-values together.
2. If each set of tuples with equal X-values has equal Y-
values, return true. Otherwise, return false.
6
Algorithm: Satisfies
Using algorithm satisfies to test if FLIGHT DEPARTS
PILOT FLIGHT DATE DEPART
Cushing 83 9-Aug 10: 15a
Clark 83 11-Aug 10: 15a
Chin 83 13-Aug 10: 15a
Cushing 116 IO Aug 1:25p
Chin 116 12-Aug 1:25p
Clark 281 8-Aug 5:50a
Copely 281 9-Aug 5:50a
Copely 281 13-Aug 5:50a
Clark 301 12-Aug 6:35p
Copely 412 15-Aug 1:25p
Question:
DEPARTS FLIGHT ???
7
Inference Axioms
• The number of FDs that can apply to a relation
r(R) is finite, since there is only a finite
number of subsets of R.
• Thus it is always possible to find all the FDs
that r satisfies, by trying all possibilities using
the algorithm SATISFIES.
• This brute-force approach is time-consuming.
8
Inference Axioms
• Finding F requires semantic knowledge of the
relation r.
• After knowing some members of F, it is often
possible to infer other members of F.
• A set F of FDs implies the FD X Y, written
F X Y, if every relation that satisfies all
the FDs in F also satisfies X Y.
• An inference axiom is a rule that states if a
relation satisfies certain FDs, it must satisfy
certain other FDs.
9
Inference Axioms
F={ ...}
Set of functional XY
dependencies
Set of all
relations r(R)
satisfying FDs in
F
A set F of FDs implies the FD X Y,
written F X Y, if every relation that
satisfies all the FDs in F also satisfies X Y.
10
Example - Inference Axioms
F = { A B, B C }
Set of functional AC
dependencies
Set of all
relations r(R)
satisfying FDs in
F
A set F of FDs implies the FD X Y,
written F X Y, if every relation that
satisfies all the FDs in F also satisfies X Y.
11
Inference Axioms
The Armstrong-Set of Inference Axioms
• Axioms will implement the “intelligence” needed to
prove (or disprove) a sequence of derivations.
1 William Armstrong: Dependency Structures of Data Base Relationships, page 580-583. IFIP Congress, 1974.
12
A-Axioms
A1. Reflexivity XX
13
Inference Machine
INPUT:
Relation schema R
Set F of FDs on R
YES
NO
Proof:
15
Example1 - Using the A-Axioms
Consider R = (Street, Zip, City) ; and the dependencies
F = { City Street Zip, Zip City }
We want to show: Street Zip Street Zip City
Proof:
1. Zip City – Given
2. Street Zip Street City – Augmentation of (1) by Street
3. City Street Zip – Given
4. City Street City Street Zip– Augmentation of (3) by
City Street
5. Street Zip City Street Zip – Transitivity of (2) and (4)
16
Example2 – Using A-Axioms
Consider the relation schema <R,F> where R = (ABCDEGHI) and dependencies
F = { ABE AGJ BE I E G GI H }
Show that
AB GH If YES give a proof
is derived by F.
If NO provide a counter-example
17
Example2A – Using A-Axioms
Consider the relation schema <R,F> where R = (ABCDEGHI) and dependencies
F = { ABE AGJ BE I E G GI H }
Step Statement Explanation
Show that 1 AB E Given
AB GH 2 E G Given
is derived by F. 3 AB G Transitivity on (1) and (2)
4 AB BE Augmentation (1) by B
5 BE I Given
6 AB I Transitivity on (4) and (5)
7 AB GI Additivity on (6) and (3)
8 GI H Given
9 AB H Transitivity on (7) and (8)
Q.E.D. 10 AB GH Additivity on (3) and (11)
quod erat 18
demonstrandum
Example2B – Using A-Axioms
Consider the relation schema <R,F> where R = (ABCDEGHI) and dependencies
F = { ABE AGJ BE I E G GI H }
Step Statement Explanation
Show that 1 AB E Given
AB GH 2 AB AB Reflexivity
is derived by F. 3 AB B Projectivity on (2)
4 AB BE Additivity on (1) and (3)
again!
5 BE I Given
6 AB I Transitivity on (4) and (5)
7 EG Given
8 AB G Transitivity on (1) and (7)
9 AB GI Additivity on (6) and (8)
10 GI H Given
11 AB H Transitivity on (9) and (10)
quod erat Q.E.D. 12 AB GH Additivity on (8) and (11)19
demonstrandum
Example3 – Using A-Axioms
Consider the relation schema <R,F> where R = (ABCDEGHI) and dependencies
F = { ABE AGJ BE I E G GI H }
Step Statement Explanation
Show that 1
AEI H 2
is derived by F. 3
4
Your turn! 5
6
7
8
9
10
11
12 20
Reducing the A-Axioms
The set of A-Axioms is not minimal, therefore some of its rules could be
eliminated.
Observations
• Rule A5 (transitivity) is a special case of rule A6 (pseudo-transitivity).
• Rules A3 (additivity) and A4 (projectivity) can be derived from A1
(reflexivity), A2 (augmentation), A6 (pseudo-transitivity).
Proof
(a) First observation is trivial (just make Z= Ø)
(b) Axiom A3 (Additivity) states that two rules, say X Y and X Z, can be
combined in one X YZ. Lets use A2 on X Y to produce XZ YZ. Repeat
A2 this time on X Z to produce X XZ. Now apply A5 on X XZ and
XZ YZ; we get X YZ. Therefore, we conclude that X YZ without using
the rule A3 itself (see next page)
21
Reducing the A-Axioms
The set of A-Axioms is not minimal, therefore some of its rules could be
eliminated.
Statement
Axiom A3 is redundant. Rule A3 (Additivity) states that two rules, say X Y
and X Z, can be combined in one X YZ.
Proof
We can prove that this fact is true without using A3
1. X Y Given
2. XZ YZ (A2) Augmenting (1) by Z
3. X Z Given
4. X XZ (A2) Augmenting (3) by Z
5. X YZ (A5) Transitivity on (4) and (2)
22
Characterizing the A-Axioms
• The set of A-Axioms is complete
Therefore every FD that is implied by a set F of FDs can be derived from the
FDs in F and one or more applications of the A-Axioms ( FA XY )
23
Correctness of the A-Axioms
The axioms can not be used to prove a false derivation. In such a case showing a
counter-example is sufficient to establish the falsity of a statement.
Example
Assume schema R(XYZW). Does ( XY ZW ) A X Z ?
The correct answer is NO. To show support for our argument we produce a
counter-example. For instance:
X Y Z W
1 2 3 4
1 5 6 7
On the example table there are no violations to the fact that XY implies a unique ZW
(12 34 and 15 67). However X=1 determines two different Z values, 3 and 6.
Therefore X Z is not a valid dependency as shown in the counter-example.
24
Closure F+
• Let F be a set of FDs for a relation r(R). The closure of F, denoted F+,
is the smallest set containing F such that the A-axioms cannot be
applied to the set to produce a new rule not included in the set
already
26
Closure F+
Example: Consider the relation schema <R,F>
where R = (A B C) and F = { AB C, C B }.
By the use of brute-force we produce all rules out of F.
A A AB AB ABC ABC
B B AC AC ABC A
C C BC BC ABC B
C B AB C ABC C
AB
AB
A
B
ABC
…
AB F+
… ABC BC F = { ABC, CB }
BC C
27
Closure F+
Example: Consider the relation schema <R,F> where R = (ABC)
and F = { AB C, C B }.
Question: Does F B C ?
Answer: F+ is the set of rules listed below and B C is not in
the set; therefore the rule B C is not implied by F.
28
Closure F+
Aside: How many FDs are there in <R,F>
n n n n
An upper bound is (2n 1)2
r 1 r r 1 r
Each sum term represents the possible combinations of r
attributes made out of the total n domains for each of the m
X Y rule in F.
for n=3 there are (23-1)2 = 49 possibilities, however for R
holding 10 attributes there are over a million possible FDs
29
Closure F+
Definition.
An FD X Y is trivial if X Y.
30
Derivations and DDAGs
• If F XY, then either X Y is in F, or a series of applications of the
inference A-axioms to F will yield X Y.
• Definition
Let P be a derivation sequence on F. The use set of P is the collection of
all FDs (originally) in F that appear in P.
31
Derivations and DDAGs
EXAMPLE
Consider schema r(ABCDEG) and functional dependencies
F = { A BC, BD G, C ED }
A derivation sequence for A E is
Step Explanation
1
2
3
4
5
Try…
32
Derivations and DDAGs
EXAMPLE
Consider schema r(ABCDEG) and functional dependencies
F = { A BC, BD G, C ED }
A derivation sequence for A E is
Step Explanation
1 A BC (given)
2 A C (Projectivity [A4] on 1)
3 C ED (given)
4 C E (Projectivity[A4] on 3)
5 A E (Transitivity[A5] on 2 and 4)
The set P for AE (five rules written above) is a derivation sequence
on F. The Use_Set_Of_P is = {A BC, C ED }
33
Derivations and DDAGs
Example. Consider schema <R, F> where R= { A B C D E G H I J }
and F = { ABE, AG J, BE I, E G, GI H}
The following sequence is a derivation sequence for A B G H.
Step Explanation
1. AB E given)
2. AB AB (reflexivity)
3. AB B (projectivity from 2)
4. AB BE (additivity from 1 and 3)
5. BE I (given)
6. AB I (transitivity from 4 and 5)
7. E G (given)
8. AB G (transitivity from 1 and 7)
9. AB GI (additivity from 6 and 8)
10. GI H (given)
11. AB H (transitivity from 9 and 10)
12. GI GI (reflexivity)
13. G I I (projectivity from 12)
14. ABG H (additivity from 8 and 11)
B1. Reflexivity X X
Motivation:
This is another approach to the problem of finding a sequence of derivations using
a smaller set of axioms.
Significance:
Since B-Axioms are complete, we can always find a derivation sequence using only
the three B-axioms to assert whether or not FB XY.
35
B-Axioms
Step Explanation
Example:
Let R = (ABCDEGHI) and 1
F = {ABE 2
AG J
BE I 3
EG 4
GI H}
5
Problem 6
Find a derivation sequence P 7
showing FB AB GH using
only B-axioms. 8
9
Answer
10
See P sequence on the right
11
Comment
12
(a) Use_Set_Of_P contains
rules on lines 2, 5, 9, 11. 13
(b) Too many steps! 14
36
B-Axioms
Step Explanation
Example:
Let R = (ABCDEGHI) and 1 EI EI Reflexivity (B1)
AG J
3 EI EIG Accumulation (B2)
BE I
EG 4 EI GI Projectivity (B3) from (3)
GI H} GI H Given
5
9 AB E Given
Answer
10 AB ABE Accumulation from (8) and (9)
See P sequence on the right
11 BE I Given
Comment 12 AB ABEI Accumulation from (10) and (11)
(a) Use_Set_Of_P contains
rules on lines 2, 5, 9, 11. 13 AB ABEIG Accumulation from (4) and (12)
(b) Too many steps! 14 AB ABEGHI Accumulation from (7) and (13)
37
15 AB GH Projectivity from (14)
RAP-Derivation Sequence
RAP: Stands for: Reflexivity, Augmentation, Projectivity
Definition:
Consider derivation sequences for X Y on a set F of FDs
using the B-axioms that satisfy the following constraints:
1. The first FD is X X
2. The last FD is X Y
3. Every FD other than the first and last is either an FD in F (given) or
and FD of the form X Z that was derived using axiom B2
(Accumulation).
38
RAP-Derivation Sequence
Example:
Let R = (ABCDEGHI) and
F = { ABE AGJ BE I E G GI H }
Find a RAP-sequence for AB GH
Step Explanation
1 AB AB B1
Comments 2 AB E Given
lines 2, 4, 6, 8. 10 AB GH B3
39
RAP-Derivation Sequence
Example:
Let R = (ABCDEGHI) and
F = { ABE AGJ BE I E G GI H }
Find a RAP-sequence for BHE GI
Step Explanation
1
Your turn… 2
10
40
RAP-Derivation Sequence
Example:
Let R = (ABCDEG) and
F = { A BC, BD G, C ED }
Find a RAP-sequence for AD GE
Step Explanation
1
Your turn… 2
10
41
RAP-Derivation Sequence
Example:
Let R = (ABCDEI) and
F = { A D, AB E, BI E, CD I, E C}
Find a RAP-sequence for AE DCI
Step Explanation
1
Your turn… 2
10
42
Derivation DAGs
A directed acyclic graph (DAG) is a directed graph with no
directed paths from any node to itself.
44
Derivation DAGs
EXAMPLE
Consider schema r(ABCDEG) and functional dependencies
F = { A BC, BD G, C ED }. Show a DDAG for AD GE
D G
B
A C E
NOTE:
The Use_Set of the derivation sequence is { A BC, BD G, C E }
45
Derivation DAGs
Rules for Constructing a DDAG
Rule 1. Any set of unconnected nodes with labels from r(R) is an F-based DDAG
A1 A2
Rule 2. Let H be a DDAG including nodes labeled A1 … Ak. Let rule A1…Ak B be
part of F. Form graph H’ by adding a new node labeled “B” and new edges
<A1,B>,…,<AK,B>.
A1 New edges
… B
New node
AK
AN
A E G
B I H
47
Derivation DAGs
Example
Consider the relation schema r(ABCDEGHIJ) subject to the dependencies
in F = { AB E, AG J, BE I, E G, GI H }.
B
J
I H A
NOTE: No path from source to destination is possible, therefore the new rule
BIG AJ is not derivable from F.
48
Derivation DAGs
Example
Consider the relation schema R(ABCDEGHIJ) subject to the dependencies
in F = { AB E, AG J, BE I, E G, GI H }.
A E C
B I H
NOTE: Node C is not reachable from the source. Therefore the rule AB CH cannot
be deduced from F.
49
X+ Closure of a Set of Attributes
• In order to simplify the asserting of whether or not a
rule X Y follows from a set F of FDs, we will compute
X+ the closure of a set of attributes X
50
Computing X +
The following algorithm to compute X+ has poor performance but is easy to
understand
Algorithm: X-Closure
Input: A set of attributes X and a set of FDs F
Output: The closure of X under F denoted X+
51
Computing X +
EXAMPLES
Consider the relation schema r(ABCDEGHIJ) subject to the dependencies in
F = { AB E, AG J, BE I, E G, GI H }.
Compute closure of AB
(AB) + = A B reflexivity
ABE using AB E
ABEI BE I
ABEIG E G
A B E I G HJ GI H
Note: Observe that AB ABEIHG. This rule is a compact notation for the 27 FDs having AB as LHS.
52
Computing X +
EXAMPLES
Consider the relation schema r(ABCDEGHIJ) subject to the dependencies in
F = { AB E, AG J, BE I, E G, GI H }.
(DEC) + = D E C
DECG using E G
53
Member Algorithm
Checking Membership
In order to verify whether or not a functional
dependency X Y could be derived from a set F of FDs
the following simple test could be applied
F B X Y if Y is part of X+
54
Member Algorithm
Member Algorithm
Input: Rule X Y and functional dependencies F
Output: TRUE whenever the rule is derived from F
Method:
begin
if ( Xclosure (X, F) Y ) then
return( True )
else
return( False );
end;
Example
Question: Does rule AB EH follow from F = { AB E, AG J, BE I, E G, GI H }
Answer: YES. Observe that (AB)+ = ABEIGHJ EH.
55
Linear Closure – XF+
Input: A set of attributes X and a set of functional dependencies F
Output: The closure of X under F demote XF+
Procedure LINCLOSURE ( Attribute X, SetOfFDs F)
BEGIN
/* Initialization */
for each FD W Z in F do begin
COUNT[ W Z ] = lenghtOf(w);
for each attribute A in W do
add rule W Z into LIST[ A ];
end;
NEWDEP = X; UPDATE = X;
/* Computation */
while ( UPDATE Ø) do begin
Choose an attribute A in UPDATE;
UPDATE = UPDATE - A;
for each FD W Z in LIST[A] do begin
COUNT[W Z] = COUNT[W Z] - 1;
if ( COUNT[w Z] = 0 ) then
ADD = Z - NEWDEP;
NEWDEP = NEWDEP ADD;
UPDATE = UPDATE ADD;
end if;
end for;
end while;
END 56
Linear Closure – XF+
Example
Consider the schema r(ABCDEI) subject to the dependencies
F = { A D, AB E, BI E, CD I, E C}
Find the closure of AE using the Linear Closure algorithm
57
Linear Closure – XF+
Example (continuation…) Tracing the execution of the linear time closure algorithm
58
Linear Closure – XF+
Homework
59