
Tom Breur

September 2005

How Non-Quality Data Can Cost Money

Introduction
When viewed from a high level, the cost of poor quality data affects a
company's bottom line in two ways: first, the cost of scrap and rework;
second, the cost of missed opportunities.

An example of scrap and rework costs might be when an agent errs in
recording a customer's address details, and consequently a marketing
premium is sent to the wrong address. Later, the customer calls to complain.
The complaint needs to be handled (extra call center time), the address
details then need to be entered a second time (rework), and a second
premium needs to be sent. The initial premium is scrapped.

An example of missed opportunity costs might be a credit card that is not
granted because the calculated credit score (erroneously) falls below the
cut-off score, and the customer is rejected. The opportunity to make a sale
is lost, even though marketing costs have already been incurred.

In this whitepaper, I attempt to supply a comprehensive list of potential data
quality costs.

Cost Categories of Information Quality

The costs of data quality can be broken down into three categories:
1. Immediate costs of non-quality data. These arise when the primary
   process breaks down as a result of erroneous data, or when immediately
   apparent errors or omissions in the data need to be circumvented
   (information scrap and rework) to keep the primary business process
   running. For example, entry of an invalid ZIP code requires back-office
   staff to look it up and correct it before sending out a product (a
   minimal validation sketch follows this list).
2. Information quality assessment or inspection costs. These are costs and
   efforts expended to verify that processes work properly. Every time a
   'suspect' data source is handled, the time spent seeking reassurance
   about data quality is an irrecoverable expense.
3. Information quality process improvement and defect prevention costs.
   Broken business processes need to be improved to eliminate unnecessary
   information costs. When a data capture or processing operation
   malfunctions, it requires fixing. This is the long-term investment needed
   to avoid further losses.
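
To illustrate the first category, the ZIP code example could be caught at
entry time with a simple validity check. The sketch below is in Python and
purely illustrative: the five-digit format and the reference set of known
ZIP codes are assumptions for demonstration, not part of any real system.

    import re

    # Hypothetical reference set of known-valid ZIP codes; in practice this
    # would come from a postal reference file or an address validation service.
    KNOWN_ZIPS = {"10001", "60614", "94105"}

    ZIP_PATTERN = re.compile(r"^\d{5}$")  # assumes US-style five-digit codes

    def validate_zip(zip_code: str) -> bool:
        """Reject malformed or unknown ZIP codes at entry time, before
        they can trigger downstream lookup and correction work."""
        zip_code = zip_code.strip()
        return bool(ZIP_PATTERN.match(zip_code)) and zip_code in KNOWN_ZIPS

    if __name__ == "__main__":
        for candidate in ["10001", "1000", "ABCDE", "99999"]:
            status = "accept" if validate_zip(candidate) else "route to correction"
            print(f"{candidate}: {status}")

Catching the error at the point of entry shifts the cost from repeated
back-office rework to a one-time check.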

1. Immediate costs of non-quality data

Process failure
For example, erroneously captured customer data such as addresses, contact
information, or account details can lead to:
- Irrecoverable costs; e.g. premiums sent in vain to non-existent customer
  addresses.
- Liability and exposure costs; for instance, credit risk losses when data
  quality problems lead to credit being erroneously offered to a customer
  who, on the basis of self-supplied information, would not be considered
  creditworthy.
- Recovery costs of unhappy customers; time spent handling complaints.

Information Scrap and Rework

- Redundant data handling; because many processes are 'known' to rely on
  inaccurate data, front-line and back-office staff customarily maintain
  little private "lists" of all sorts, serving as a backup or improved
  version of what is available in the primary database. Apart from the fact
  that such private lists cannot be properly maintained or recovered, the
  activity itself is redundant and adds no value.
- Costs of chasing missing information; a field that has not been filled out
  properly, or not at all, needs to be looked up later in the process. This
  adds time and cost, creates inefficiency, and, not least, aggravates
  staff. Time spent looking up missing information is not spent servicing
  the customer better.
- Business rework costs; e.g. reissuing a credit card that was sent out with a
misspelled customer name.
- Workaround costs; when a primary key is missing or faulty, laborious fuzzy
  matches need to be performed to match records (see the sketch after this
  list). This kind of work is challenging, and eats up precious time of the
  most highly skilled database workers.
- Data verification costs; e.g. the cost of reworking data entry, but also
  the time knowledge workers must spend checking the correctness of
  available data before any analysis can begin.
- Program rewrite costs; rewriting programs that fail to run because of
  invalid entries in the data. For example, pre- or post-conversion scripts
  sometimes need to be written to deal with the content of source systems
  prior to loading into a Data Warehouse environment.
- Data cleansing and correction costs; when feeds are processed for loading
  into the Data Warehouse, the data need to be transformed for reasons that
  stem from quality issues. Any cleansing and scrubbing performed in the
  ETL process is essentially redundant and unnecessary insofar as it is
  caused by faulty initial data entry. For example, when a mailing is based
  on a problematic customer file, dedicated scripts need to be run to deal
  with the (known!) errors in the address fields, and this process must be
  repeated for every mailing. Since such customer files are often shared
  across departments and systems, source changes need to be negotiated with
  all end users of these data.
- Data cleansing software costs; data cleansing software (like Vality,
  Ascential, etc.) is usually very expensive. There is a tradeoff, however:
  doing this work 'by hand' ties up scarce labor, so despite the typically
  high license costs of ETL data quality tools, purchase may prove
  remarkably economical when weighed against the (often unseen) labor costs
  of manually improving data quality.
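
To give a feel for the workaround costs mentioned above, the sketch below
shows the kind of fuzzy name matching that becomes necessary when a
reliable primary key is missing. It is a minimal illustration in Python
using the standard library's SequenceMatcher; the names, threshold, and
similarity measure are assumptions, and real record linkage is considerably
more involved.

    from difflib import SequenceMatcher

    def name_similarity(a: str, b: str) -> float:
        """Crude string similarity in [0, 1]; a real matching process would
        also normalize punctuation and compare addresses or birth dates."""
        return SequenceMatcher(None, a.lower().strip(), b.lower().strip()).ratio()

    def fuzzy_match(name: str, candidates: list[str], threshold: float = 0.85) -> list[str]:
        """Return candidate records that plausibly refer to the same customer."""
        return [c for c in candidates if name_similarity(name, c) >= threshold]

    if __name__ == "__main__":
        crm_names = ["J. Smith", "John Smyth", "Joan Smith", "Peter Jones"]
        # Both a likely true match ("John Smyth") and a likely false one
        # ("Joan Smith") clear the threshold, while the abbreviated
        # "J. Smith" does not: exactly the ambiguity that eats expert time.
        print(fuzzy_match("John Smith", crm_names))

The ambiguous output is the point: without a trustworthy key, every match
is a judgment call that consumes skilled labor.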

Lost and missed opportunity costs

- Lost opportunity costs; when, for example, a misspelled customer name on
  the card causes the customer to stop using their card (instead of calling
  up to complain), the business loses their future revenue.
- Missed opportunity costs; when unhappy customers directly influence their
social environment, they generate negative publicity. This will make it
harder to sell to people in the social network of displeased customers.
- Lost shareholder value; poor information quality drains precious
  resources (scarce database experts), preventing knowledge workers from
  performing value-added work towards market share growth. Scarce human
  resources are often a bottleneck to progress, such as running one more
  marketing campaign or delivering insight into a product portfolio's
  performance.

2. Information quality assessment or inspection costs

- People spend time on assessment whenever they are aware of suspect data
  quality; in any database project, each and every file of questionable
  quality needs to be inspected for data quality problems first. This time
  is never recouped. Merely assessing whether data is of sufficient quality
  is specialist work, and requires access to scarce resources that are
  often a bottleneck to progress.
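
As a minimal illustration of what such an assessment might look like, the
sketch below computes per-field completeness for a batch of records. It is
written in Python; the record layout and field names are hypothetical, and
a real inspection would also check formats, ranges, and cross-field
consistency.

    def profile(records, required_fields):
        """Report per-field completeness so a 'suspect' file can be
        screened before anyone builds on it."""
        total = len(records)
        for field in required_fields:
            filled = sum(1 for r in records if r.get(field) not in (None, ""))
            print(f"{field}: {filled}/{total} filled ({filled / total:.0%})")

    if __name__ == "__main__":
        customers = [
            {"name": "A. Jansen",   "zip": "90210", "email": ""},
            {"name": "B. de Vries", "zip": "",      "email": "b@example.com"},
            {"name": "",            "zip": "10001", "email": "c@example.com"},
        ]
        profile(customers, ["name", "zip", "email"])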

3. Information quality process improvement and defect prevention costs

- Development costs to rework existing front-end applications; data entry
  applications need to enforce data quality by performing validity checks,
  and by minimizing keystrokes and eye-hand movements. Interface
  improvements based on usability findings typically lead to both higher
  efficiency and better data quality.
- Management attention to redefine accountabilities and monitor improved
  information quality; steering the organization towards higher data
  quality requires changing accountabilities and continuously monitoring
  improvement. This topic needs to stay high on management's agenda to
  create lasting improvement.
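
Monitoring improvement presupposes a measurable quality indicator. The
sketch below, again in Python and with hypothetical validity rules,
computes the fraction of records in a load that pass every check; logged
per load, such a figure gives management the trend line this section calls
for.

    from datetime import date

    def quality_score(records, checks):
        """Fraction of records passing all checks; one number per load
        makes progress (or regression) visible over time."""
        if not records:
            return 1.0
        passed = sum(1 for r in records if all(check(r) for check in checks))
        return passed / len(records)

    if __name__ == "__main__":
        checks = [
            lambda r: bool(r.get("name", "").strip()),  # name present
            lambda r: r.get("zip", "").isdigit(),       # hypothetical numeric ZIP rule
        ]
        batch = [{"name": "A. Jansen", "zip": "90210"},
                 {"name": "", "zip": "90210"}]
        print(date.today(), "quality score:", quality_score(batch, checks))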

Conclusion
Problems in data quality often go unnoticed. Poor data quality is both a
source of process inefficiencies (timeliness) and of operational costs
(direct and indirect losses), and in neither case is it immediately apparent
that enhancing data quality would bring improvement.

One of the pernicious consequences of suboptimal data quality is that its
cost is usually hidden. Lack of data quality is not obvious to those not
deliberately looking for it, and quantifying the costs is not always easy.
What makes the indirect costs so damaging is that the relation between data
quality problems and their consequences is non-obvious, and often surfaces
only after a substantial delay. The connection between downstream
consequences and poor quality data is therefore often not made, and the
problems are not attributed to their true cause.

Because the root cause (data quality itself) of many downstream costs can
easily remain hidden, it receives insufficient management attention and
intervention. Moreover, progress after improvement efforts is gradual,
relatively slow, and in large part 'cultural', and therefore difficult to
monitor and track.

Another, and probably the most significant, problem caused by poor-quality
information is that it frustrates the most valuable resource of the company:
its employees. Non-quality information prevents knowledge workers from
performing their job effectively. On top of that, it alienates customers
because of wrong information about them, and to them. Customer data is the
raw material that needs to be managed for what it is: a strategic resource.

Data quality is far more than accurate data entry. It stems from monitoring
downstream data usage, maintaining comprehensive and up-to-date metadata,
and nurturing a corporate culture of doing things right at the first
attempt. Only then will knowledge workers learn to expect data quality, and
enforce it because it is the natural thing to do. Letting data quality slide
will promote a culture of negligence, and disdain for one of the company's
most precious assets: customer information.

The case for accurate source data is further underlined when one realizes
that source data in and of themselves do little more than support primary
processes. The greater value to the organization comes from enhancing these
data and deriving new information from them.
The investment in improving information quality is recouped several times
over, in decreased costs and in the improved value of information for
accomplishing strategic business goals.

Rapid access to high quality data is the decisive factor in an
organization's ability to assess and adapt its business model to changing
market conditions. As corporations become ever more 'digitized', those that
get a grip on their data quality assurance processes can reap great rewards.
In a highly turbulent market this may well be the critical factor in
determining the survivors in a competitive business, and therefore prove
ultimately priceless.
