0% found this document useful (0 votes)

27 views

Normalization Document

Data normalization is a process in which data attributes within a data model are organized to increase the cohesion of entity types. A data schema is considered to be at the level of normalization of its least normalized entity type. Higher levels of data normalization are beyond the scope of this article.

Uploaded by

Yelena Bytenskaya

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views

Normalization Document

Uploaded by

Yelena Bytenskaya

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Dr.

Igwe

Figure 1: An Initial Data Schema for Order (UML Notation).

Order 0NF
OrderId: integer <<PK>>
DateOrdered: Date
DateFulfilled: Date
Payment1Amount: Currency
Payment1Type: Char(4)
Payment1Description: Char(40)
Payment2Amount: Currency
Payment2Type: Char(4)
Payment2Description: Char(40)
Taxdeferal: Currency
TaxState: Currency
TaxLocal: Currency
SubtotalBeforeTax: Currency
ShipToName: char(45)
ShipToStreet: char(40)
ShipToCity: char(20)
ShipToState: char(20)
ShipToCountry: char(20)
ShipToZipCode: char(20)
ShipToPhone: char(20)
BillToName: char(45)
BillToStreet char(40)
BillToCity: char(20)
BillToState: Char(20)
BillToCountry: char(20)
BillToZipcode: char(20)
BillToPhone: char(20)
ItemName1: char (40)
ItemNumber1: integer
NumberOrdered1: integer
InitialItemPrice1: currency
TotalPriceExtended1: currency
ItemName2: char (40)
ItemNumber2: integer
NumberOrdered2: integer
InitialItemPrice2: currency
TotalPriceExtended2: currency
.
ItemName9: char (40)
ItemNumber9: integer
NumberOrdered: integer
InitialItemPrice: currency
TotalPriceExtended: currency

Data normalization is a process in which data attributes within a data model are organized to
increase the cohesion of entity types. In other words, the goal of data normalization is to reduce
and even eliminate data redundancy, an important consideration for application developers
because it is incredibly difficult to stores objects in a relational database that maintains the same
information in several places. Table 1 summarizes the three most common forms of
normalization ( First normal form (1NF), Second normal form (2NF), and Third normal
form (3NF)) describing how to put entity types into a series of increasing levels of
normalization. Higher levels of data normalization are beyond the scope of this article. With
respect to terminology, a data schema is considered to be at the level of normalization of its
least normalized entity type. For example, if all of your entity types are at second normal form
(2NF) or higher then we say that your data schema is at 2NF.

Dr. Igwe
Table 1: Data Normalization Rules.
Level

Rule

First normal form (1NF)

Second normal form (2NF)

An entity type is in 1NF when it contains no repeating groups of data.

An entity type is in 2NF when it is in 1NF and when all of its non-key attributes are fully dependent on its
primary key.
An entity type is in 3NF when it is in 2NF and when all of its attributes are directly dependent on the primary
key.

Third normal form (3NF)

1. First Normal Form (1NF)

Lets consider an example; an entity type is in first normal form (1NF) when it contains no
repeating groups of data. For example, in Figure 1 you see that there are several repeating
attributes in the data Order0NF table the ordered item information repeats nine times and the
contact information is repeated twice, once for shipping information and once for billing
information. Although this initial version of orders could work, what happens when an order has
more than nine order items? Do you create additional order records for them? What about the
vast majority of orders that only have one or two items? Do we really want to waste all that
storage space in the database for the empty fields? Likely not. Furthermore, do you want to
write the code required to process the nine copies of item information, even if it is only to
marshal it back and forth between the appropriate numbers of objects? Once again, likely not.
Figure 2 presents a reworked data schema where the order schema is put in first normal form.
The introduction of the OrderItem1NF table enables us to have as many, or as few, order items
associated with an order, increasing the flexibility of our schema while reducing storage
requirements for small orders (the majority of our business). The ContactInformation1NF table
offers a similar benefit, when an order is shipped and billed to the same person (once again the
majority of cases) we could use the same contact information record in the database to reduce
data redundancy.

OrderPayment1NF was introduced to enable customers to make several payments against an

order Order0NF could accept up to two payments, the type being something like MC and the
description MasterCard Payment, although with the new approach far more than two payments
could be supported. Multiple payments are accepted only when the total of an order is large
enough that a customer must pay via more than one approach, perhaps paying some by check
and some by credit card.

Dr. Igwe

Figure 2: An Order Data Schema in 1NF (UML Notation).

An important thing to notice is the application of primary and foreign keys in the new solution.
Order1NF has kept OrderID, the original key of Order0NF, as its primary key. To maintain
the relationship back to Order1NF, the OrderItem1NF table includes the OrderID column within
its schema, which is why it has the stereotype of FK. When a new table is introduced into a
schema, in this case OrderItem1NF, as the result of first normalization efforts it is common to
use the primary key of the original table (Order0NF) as part of the primary key of the new table.
Because OrderID is not unique for order items, you can have several order items on an order, the
column ItemSequence was added to form a composite primary key for the OrderItem1NF table.
A different approach to keys was taken with the ContactInformation1NF table. The column
ContactID, a surrogate key that has no business meaning, was made the primary key.

2. Second Normal Form (2NF)

Although the solution presented in Figure 2 is improved over that of Figure 1, it can be
normalized further. Figure 3 presents the data schema of Figure 2 in second normal form
(2NF). an entity type is in second normal form (2NF) when it is in 1NF and when every non-key
attribute, any attribute that is not part of the primary key, is fully dependent on the primary key.
This was definitely not the case with the OrderItem1NF table, therefore we need to introduce the
new table Item2NF. The problem with OrderItem1NF is that item information, such as the name
and price of an item, do not depend upon an order for that item. For example, if Hal Jordan
orders three widgets and Oliver Queen orders five widgets, the facts that the item is called a
widget and that the unit price is $19.95 is constant. This information depends on the concept
of an item, not the concept of an order for an item, and therefore should not be stored in the order
items table therefore the Item2NF table was introduced. OrderItem2NF retained the
TotalPriceExtended column, a calculated value that is the number of items ordered multiplied by

Dr. Igwe

the price of the item. The value of the SubtotalBeforeTax column within the Order2NF table is
the total of the values of the total price extended for each of its order items.
Figure 3. An Order in 2NF (UML Notation).

3. Third Normal Form (3NF)

An entity type is in third normal form (3NF) when it is in 2NF and when all of its attributes are
directly dependent on the primary key. A better way to word this rule might be that the attributes
of an entity type must depend on all portions of the primary key. In this case there is a problem
with the OrderPayment2NF table, the payment type description (such as Mastercard or
Check) depends only on the payment type, not on the combination of the order id and the
payment type. To resolve this problem the PaymentType3NF table was introduced in Figure 4,
containing a description of the payment type as well as a unique identifier for each payment type.

Dr. Igwe

Figure 4: An Order in 3NF (UML Notation).

4. Beyond 3NF
The data schema of Figure 4 can still be improved upon, at least from the point of view of data
redundancy, by removing attributes that can be calculated/derived from other ones. In this case
we could remove the SubtotalBeforeTax column within the Order3NF table and the
TotalPriceExtended column of OrderItem3NF, as you see in Figure 5.

Dr. Igwe

Figure 5. An Order Without Calculated Values (UML Notation).

5. Why Data Normalization?

The advantage of having a highly normalized data schema is that information is stored in one
place and one place only, reducing the possibility of inconsistent data. Furthermore, highlynormalized data schemas in general are closer conceptually to object-oriented schemas because
the object-oriented goals of promoting high cohesion and loose coupling between classes results
in similar solutions (at least from a data point of view). This generally makes it easier to map
your objects to your data schema.

6. Denormalization
From a purist point of view you want to normalize your data structures as much as possible, but
from a practical point of view you will find that you need to 'back out" of some of your
normalizations for performance reasons. This is called "denormalization". For example, with
the data schema of Figure 1 all the data for a single order is stored in one row (assuming orders
of up to nine order items), making it very easy to access. With the data schema of Figure 1 you
could quickly determine the total amount of an order by reading the single row from the
Order0NF table. To do so with the data schema of Figure 5 you would need to read data from a
row in the Order table, data from all the rows from the OrderItem table for that order and data
from the corresponding rows in the Item table for each order item. For this query, the data
schema of Figure 1 very likely provides better performance.

Dr. Igwe

Visual Analytics with Tableau
From Everand
Visual Analytics with Tableau
Alexander Loth
No ratings yet
Learn SAP SD in 24 Hours
From Everand
Learn SAP SD in 24 Hours
Alex Nordeen
No ratings yet
(1964) East Africa Law Reports
67% (9)
(1964) East Africa Law Reports
990 pages
James Hall Ais
100% (2)
James Hall Ais
22 pages
Databases 03 - Normalisation
No ratings yet
Databases 03 - Normalisation
7 pages
3 Normal Forms Tutorial
No ratings yet
3 Normal Forms Tutorial
12 pages
Database Normalization
100% (3)
Database Normalization
19 pages
634 Normalize
No ratings yet
634 Normalize
4 pages
Normal Forms
No ratings yet
Normal Forms
19 pages
Week 6 lecture Normalization
No ratings yet
Week 6 lecture Normalization
29 pages
Database Normalization
No ratings yet
Database Normalization
30 pages
DATABASE NOTES Database Normalization
No ratings yet
DATABASE NOTES Database Normalization
13 pages
First Normal Form: Functional Dependency
No ratings yet
First Normal Form: Functional Dependency
8 pages
Normalisation example (1)
No ratings yet
Normalisation example (1)
10 pages
Normalization Extra
No ratings yet
Normalization Extra
5 pages
Erd
No ratings yet
Erd
1 page
Section 6 Notes Database Design
No ratings yet
Section 6 Notes Database Design
6 pages
Chapter6_NormalizationDatabaseTables_Part4 (2)
No ratings yet
Chapter6_NormalizationDatabaseTables_Part4 (2)
38 pages
Database Normalization What Is Normalization?
No ratings yet
Database Normalization What Is Normalization?
5 pages
Lecture 2 Part 1 Normalization PDF
No ratings yet
Lecture 2 Part 1 Normalization PDF
18 pages
Normalization of Database Tables
No ratings yet
Normalization of Database Tables
24 pages
20200722141920D3408 - ISYS6198 Session 15 16 Logical Database Design Normalization
No ratings yet
20200722141920D3408 - ISYS6198 Session 15 16 Logical Database Design Normalization
24 pages
Chapter Nine-Data Normalization
No ratings yet
Chapter Nine-Data Normalization
4 pages
Normalization in DBMS11
No ratings yet
Normalization in DBMS11
12 pages
Normalization
No ratings yet
Normalization
39 pages
Normalisation
No ratings yet
Normalisation
27 pages
Normalization Lecture
No ratings yet
Normalization Lecture
47 pages
Dbms Vimp Micro
No ratings yet
Dbms Vimp Micro
20 pages
The Normal Forms 3NF and BCNF: BY Jasbir Jassu
No ratings yet
The Normal Forms 3NF and BCNF: BY Jasbir Jassu
25 pages
Normalisation: Cust# Name Ord# Date Part# Desc Qty Price Supp# Name
No ratings yet
Normalisation: Cust# Name Ord# Date Part# Desc Qty Price Supp# Name
4 pages
What Is Normalization
No ratings yet
What Is Normalization
2 pages
Normalization Is A Method For Organizing Data Elements in A Database Into Tables
No ratings yet
Normalization Is A Method For Organizing Data Elements in A Database Into Tables
12 pages
Normalization: Normalization Is A Systematic Way of Ensuring That A Database Structure Is Suitable For
No ratings yet
Normalization: Normalization Is A Systematic Way of Ensuring That A Database Structure Is Suitable For
6 pages
Chapter 5
No ratings yet
Chapter 5
7 pages
Chapter 5
No ratings yet
Chapter 5
7 pages
Chapter3 Session2
No ratings yet
Chapter3 Session2
32 pages
bd
No ratings yet
bd
4 pages
Normalisation Database
No ratings yet
Normalisation Database
6 pages
Week 5 Lecture
No ratings yet
Week 5 Lecture
61 pages
DBMSPPT
No ratings yet
DBMSPPT
20 pages
The Normal Forms 3NF and BCNF
No ratings yet
The Normal Forms 3NF and BCNF
25 pages
The Normal Forms 3NF and BCNF
No ratings yet
The Normal Forms 3NF and BCNF
25 pages
Research Activity
No ratings yet
Research Activity
9 pages
The Normal Forms 3NF and BCNF
No ratings yet
The Normal Forms 3NF and BCNF
25 pages
Basic Principles of Database Normalization
No ratings yet
Basic Principles of Database Normalization
7 pages
Normalization Examples
No ratings yet
Normalization Examples
8 pages
Database Modelling: Lecture 7: Data Normalisation Nick Rossiter
No ratings yet
Database Modelling: Lecture 7: Data Normalisation Nick Rossiter
54 pages
Normalization 2
No ratings yet
Normalization 2
33 pages
1741088134 Normalization
No ratings yet
1741088134 Normalization
17 pages
Fundamental of Database CH-5
No ratings yet
Fundamental of Database CH-5
34 pages
Normalization
No ratings yet
Normalization
12 pages
Quiz 8
No ratings yet
Quiz 8
4 pages
The Normal Forms 3NF and BCNF
No ratings yet
The Normal Forms 3NF and BCNF
25 pages
Normalization
No ratings yet
Normalization
3 pages
Normalisation (Tilley)
No ratings yet
Normalisation (Tilley)
5 pages
Relational Database Management System (17332) Experiment No: 1.2
No ratings yet
Relational Database Management System (17332) Experiment No: 1.2
33 pages
Normalization 1
No ratings yet
Normalization 1
26 pages
Normal
No ratings yet
Normal
10 pages
Anu Table Design
No ratings yet
Anu Table Design
15 pages
100 Puzzles to Learn Data Warehousing
From Everand
100 Puzzles to Learn Data Warehousing
Cristian Scutaru
No ratings yet
Pivot Tables: Easy Excel Essentials, #1
From Everand
Pivot Tables: Easy Excel Essentials, #1
M.L. Humphrey
No ratings yet
Pivot Tables In Depth For Microsoft Excel 2016
From Everand
Pivot Tables In Depth For Microsoft Excel 2016
Suljan Qeska
3.5/5 (3)
Oracle Database 11g Oracle Label Security and The Data Masking Pack
No ratings yet
Oracle Database 11g Oracle Label Security and The Data Masking Pack
59 pages
Oracle Database 11g Using DDL Views Sequences Indexes and Synonyms
No ratings yet
Oracle Database 11g Using DDL Views Sequences Indexes and Synonyms
54 pages
Oracle Database 11g SQL and PLSQL New Features
No ratings yet
Oracle Database 11g SQL and PLSQL New Features
232 pages
Oracle Database 11g RAC Performance Tuning
No ratings yet
Oracle Database 11g RAC Performance Tuning
52 pages
Oracle Database 11g Database Architecture and ASM
No ratings yet
Oracle Database 11g Database Architecture and ASM
44 pages
Practice Q's 14-18
No ratings yet
Practice Q's 14-18
11 pages
Practice Q's 1-7
No ratings yet
Practice Q's 1-7
17 pages
Oracle Database 11g Transparent Data Encryption
No ratings yet
Oracle Database 11g Transparent Data Encryption
40 pages
Assign5 Ans
100% (3)
Assign5 Ans
3 pages
Assign3 Ans
100% (1)
Assign3 Ans
1 page
Assign6 Ans
No ratings yet
Assign6 Ans
2 pages
Assign4 Ans
No ratings yet
Assign4 Ans
3 pages
Assign2 Ans
No ratings yet
Assign2 Ans
2 pages
Normalization Example Document
No ratings yet
Normalization Example Document
7 pages
Database Components and Database Concepts, Data Independence, Structural Independence
No ratings yet
Database Components and Database Concepts, Data Independence, Structural Independence
3 pages
Statement of Work Accomplishment: Amount 2.0
No ratings yet
Statement of Work Accomplishment: Amount 2.0
6 pages
01P
No ratings yet
01P
37 pages
Donnell Cydney C 00059681 449089 000 2009
No ratings yet
Donnell Cydney C 00059681 449089 000 2009
30 pages
Project Report: Submitted To: Prof. Sushil Kumar Submitted by
No ratings yet
Project Report: Submitted To: Prof. Sushil Kumar Submitted by
10 pages
JLL Market Pulse q3 2020
No ratings yet
JLL Market Pulse q3 2020
12 pages
DTA - Object List For Customizing Synchronization
No ratings yet
DTA - Object List For Customizing Synchronization
8 pages
Pricing Guide 2021
No ratings yet
Pricing Guide 2021
1 page
Birla Institute of Technology: Bachelor of Business Administrations (Bba) Regular 2020-2021
No ratings yet
Birla Institute of Technology: Bachelor of Business Administrations (Bba) Regular 2020-2021
3 pages
Chapter 5: The Production Process and Costs Answers To Questions and Problems
No ratings yet
Chapter 5: The Production Process and Costs Answers To Questions and Problems
7 pages
Edelweiss Business Cycle Fund - NFO Presentation (1) - 1
No ratings yet
Edelweiss Business Cycle Fund - NFO Presentation (1) - 1
37 pages
ED Answers (Unit 3 & 4)
No ratings yet
ED Answers (Unit 3 & 4)
8 pages
Annex C Editable Template
No ratings yet
Annex C Editable Template
1 page
4.2. Reading 1
No ratings yet
4.2. Reading 1
6 pages
Philippine CPA Review - Summary of The Old Conceptual Framework Issued by The Accounting Standards Council (ASC)
0% (1)
Philippine CPA Review - Summary of The Old Conceptual Framework Issued by The Accounting Standards Council (ASC)
8 pages
Lickiss v. FINRA, 208 Cal. App. 4th 1125, 146 Cal. Rptr. 3d 173 (2012), Review Denied (Nov. 14, 2012)
No ratings yet
Lickiss v. FINRA, 208 Cal. App. 4th 1125, 146 Cal. Rptr. 3d 173 (2012), Review Denied (Nov. 14, 2012)
13 pages
From Cost to Performance Management A Blueprint for Organizational Development 1st Edition Catherine Stenzel - Download the full set of chapters carefully compiled
100% (1)
From Cost to Performance Management A Blueprint for Organizational Development 1st Edition Catherine Stenzel - Download the full set of chapters carefully compiled
59 pages
Responsibility Accounting Practice Problem
No ratings yet
Responsibility Accounting Practice Problem
4 pages
Villeroy & Boch v. Amazon - Complaint
No ratings yet
Villeroy & Boch v. Amazon - Complaint
155 pages
Export-Oriented Development, The State, and Social Capital A Case Study of Maquiladora Production in Yucatan, Mexico
No ratings yet
Export-Oriented Development, The State, and Social Capital A Case Study of Maquiladora Production in Yucatan, Mexico
12 pages
Compensation Management SYNOPSIS
100% (2)
Compensation Management SYNOPSIS
94 pages
KFC in The Philippines Our Story Our Humble Origins
No ratings yet
KFC in The Philippines Our Story Our Humble Origins
6 pages
HAZOP - Overview
No ratings yet
HAZOP - Overview
20 pages
Mahindra Tractors - Offering Tough and Reliable Tractors To The Farmers
No ratings yet
Mahindra Tractors - Offering Tough and Reliable Tractors To The Farmers
3 pages
TOEIC 2 - Final Reading Test
No ratings yet
TOEIC 2 - Final Reading Test
12 pages
Car Rental Agreement
No ratings yet
Car Rental Agreement
2 pages
Marketing
No ratings yet
Marketing
11 pages
Tata Steel 4523
No ratings yet
Tata Steel 4523
132 pages
Chrysler Affidavit
No ratings yet
Chrysler Affidavit
117 pages

Normalization Document

Uploaded by

Normalization Document

Uploaded by

Dr.

Figure 1: An Initial Data Schema for Order (UML Notation).

First normal form (1NF)

An entity type is in 1NF when it contains no repeating groups of data.

Third normal form (3NF)

1. First Normal Form (1NF)

OrderPayment1NF was introduced to enable customers to make several payments against an

Figure 2: An Order Data Schema in 1NF (UML Notation).

2. Second Normal Form (2NF)

3. Third Normal Form (3NF)

Figure 4: An Order in 3NF (UML Notation).

Figure 5. An Order Without Calculated Values (UML Notation).

5. Why Data Normalization?

You might also like