Normalization
Normalization
Normalization
1
Normalization
There is a sequence to normal forms:
1NF is considered the weakest,
2NF is stronger than 1NF,
3NF is stronger than 2NF, and
BCNF is considered the strongest
Also,
any relation that is in BCNF, is in 3NF;
any relation in 3NF is in 2NF; and
any relation in 2NF is in 1NF.
2
Normalization
3
Normalization
We consider a relation in BCNF to be fully normalized.
A design that has a lower normal form than another design has
more redundancy. Uncontrolled redundancy can lead to data
integrity problems.
4
Functional Dependencies
Functional Dependencies
We say an attribute, B, has a functional dependency on
another attribute, A, if for any two records, which have
the same value for A, then the values for B in these two
records must be the same. We illustrate this as:
AB
Example: Suppose we keep track of employee email
addresses, and we only track one email address for each
employee. Suppose each employee is identified by their
unique employee number. We say there is a functional
dependency of email address on employee number:
6
Functional Dependencies
EmpNum EmpEmail
EmpNum EmpFname 3 different ways
EmpNum EmpLname you might see FDs
depicted
EmpEmail
EmpNum EmpFname
EmpLname
7
Determinant
Functional Dependency
EmpNum EmpEmail
8
Trivial Functional Dependency
10
Partial dependency
A partial dependency exists when an attribute B is
functionally dependent on an attribute A, and A is a
component of a multipart candidate key.
11
First Normal Form
First Normal Form
We say a relation is in 1NF if all values stored in the
relation are single-valued and atomic.
12
First Normal Form
The following in not in 1NF
13
First Normal Form
EmpNum EmpPhone EmpDegrees
123 233-9876
333 233-1231 BA, BSc, PhD
679 233-1231 BSc, MSc
14
First Normal Form
EmployeeDegree
Employee
EmpNum EmpDegree
EmpNum EmpPhone
333 BA
123 233-9876
333 BSc
333 233-1231
333 PhD
679 233-1231
679 BSc
679 MSc
15
Before we learn about the second normal form,
we need to understand the following −
• Key attribute − An attribute, which is a part of
the prime-key, is known as a prime attribute.
• Non-key attribute − An attribute, which is not
a part of the prime-key, is said to be a non-
prime attribute.
16
Second Normal Form
Second Normal Form
A relation is in 2NF if it is in 1NF, and every non-key
attribute is fully dependent on each candidate key. (That is, we
don’t have any partial functional dependency.)
17
Second Normal Form
Consider this InvLine table (in 1NF):
InvNum LineNum ProdNum Qty InvDate
InvNum, LineNum ProdNum, Qty
InvNum InvDate
InvLine is not 2NF since there is a partial
dependency of InvDate on InvNum
InvLine is
only in 1NF
18
Second Normal Form
InvLine
InvNum LineNum ProdNum Qty InvDate
The above relation has redundancies: the invoice date is
repeated on each invoice line.
We can improve the database by decomposing the relation
into two relations:
InvNum LineNum ProdNum Qty
InvNum InvDate
19
Third Normal Form
Third Normal Form
• A relation is in 3NF if the relation is in 1NF and all
determinants of non-key attributes are candidate keys
That is, for any functional dependency: X Y, where Y is
a non-key attribute (or a set of non-key attributes), X is a
candidate key.
• This definition of 3NF differs from BCNF only in the
specification of non-key attributes - 3NF is weaker than
BCNF. (BCNF requires all determinants to be candidate
keys.)
• A relation in 3NF will not have any transitive dependencies
20
Third Normal Form
Consider this Employee relation
21
Third Normal Form
EmpNum EmpName DeptNum DeptName
22
The Boyce-Codd Normal Form
23
24
25
26
Types of keys in DBMS
27
• Alternate Key – Out of all candidate keys, only one
gets selected as primary key, remaining keys are
known as alternate or secondary keys.
• Composite Key – A key that consists of more than
one attribute to uniquely identify rows (also known
as records & tuples) in a table is called composite
key.
• Foreign Key – Foreign keys are the columns of a
table that points to the primary key of another table.
They act as a cross-reference between tables.
28