0% found this document useful (0 votes)
2K views14 pages

Data Management Fundamentals

This document discusses data management principles and defines key concepts. It describes data management as developing plans to control, protect, and enhance the value of data throughout its lifecycle. Data management involves both technical and business skills and requires collaboration between IT and business roles. The objectives of data management are to understand information needs and ensure high quality data is available. Metadata provides important context about data. While data and information are intertwined, distinguishing between them helps communicate requirements to different stakeholders. Data is recognized as a valuable asset that produces value for organizations.

Uploaded by

Aikovin Clerigo
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2K views14 pages

Data Management Fundamentals

This document discusses data management principles and defines key concepts. It describes data management as developing plans to control, protect, and enhance the value of data throughout its lifecycle. Data management involves both technical and business skills and requires collaboration between IT and business roles. The objectives of data management are to understand information needs and ensure high quality data is available. Metadata provides important context about data. While data and information are intertwined, distinguishing between them helps communicate requirements to different stakeholders. Data is recognized as a valuable asset that produces value for organizations.

Uploaded by

Aikovin Clerigo
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 14

Contents

DATA MANAGEMENT PRINCIPLES..............................................................................................................2


Data Management Definition.................................................................................................................2
AAP Frameworks and Career Paths........................................................................................................2
Objectives of Data Management............................................................................................................3
Data, Metadata, and Data Representation............................................................................................3
Data vs. Information...............................................................................................................................4
Data as an Asset......................................................................................................................................6
Data Management Principles.................................................................................................................7
Data Management Challenges................................................................................................................9
Data Management Strategy Defined....................................................................................................12
Summary...............................................................................................................................................14
DATA MANAGEMENT PRINCIPLES
Data Management Definition
As defined by DAMA International:

Data Management is the development, execution, and supervision of plans, policies, programs, and
practices that deliver, control, protect, and enhance the value of data and information assets
throughout their life cycles.

A Data Management Professional is any person who works in any facet of data management
(from technical management of data throughout its lifecycle to ensuring that data is properly
utilized and leveraged) to meet strategic organizational goals.

Data management professionals are called by many names in the industry, fill numerous roles,
from the highly technical (for example: database administrators, network administrators, and
programmers) to strategic and business (such as: Data Stewards, Data Strategists, Chief
Data Officers).

Data management activities are wide-ranging.

They include everything from the ability to make consistent decisions about how to get strategic
value from data to the technical deployment and performance of databases.

Thus, data management requires both technical and non-technical (in other words ‘business’) skills.

Responsibility for managing data must be shared between business and information technology roles,
and people in both areas must be able to collaborate to ensure an organization has high-quality data
that meets its strategic needs.

AAP Frameworks and Career Paths

The framework of the Analytics Association of the Philippines defines five main career paths for a
data professional:

 Data Steward
 Data Engineer
 Data Scientist
 Functional Analyst
 Analytics Managers

Data Management is involved in each one of these career paths, but most especially affects Data
Stewards and Engineers.
Data Stewards develop, enforce, and maintain an organization’s data governance process, data
usage, and data security policies to ensure that data assets provide the organization with high-quality
data.

Data Engineers design, construct, test, and maintain data infrastructures including applications
that extract, clean, transform, and load data from the data sources to centralized data
repositories.

These two roles will benefit tremendously from understanding data management concepts and
they are usually the main practitioners of data management in any organization.

Objectives of Data Management


The primary driver for data management is to enable organizations to get value from their data
assets, just as effective management of financial and physical assets enables organizations to get
value from those assets.
Within an organization, data management goals include:

Understanding and supporting the information needs of the enterprise and its
stakeholders, including customers, employees, and business partners

Capturing, storing, protecting, and ensuring the integrity of data assets

Ensuring the quality of data and information

Ensuring the privacy and confidentiality of stakeholder data

Preventing unauthorized or inappropriate access, manipulation, or use of data and


information

Ensuring data can be used effectively to add value to the enterprise

Data, Metadata, and Data Representation


Long-standing definitions of data emphasize its role in representing facts about the world.

In relation to information technology, data is also understood as information that has been stored
in digital form (though data is not limited to information that has been digitized and data
management principles apply to data captured on paper as well as in databases).
Still, because today we can capture so much information electronically, we call many things ‘data’
that would not have been called ‘data’ in earlier times – things like names, addresses, birthdates,
what one ate for dinner on Saturday, the most recent book one purchased.

Most people assume that, because data represents facts, it is a form of truth about the world and
that the facts will fit together.

But ‘facts’ are not always simple or straightforward.

Data is a means of representation.

It stands for things other than itself.

Data is both an interpretation of the objects it represents and an object that must be interpreted.
This is another way of saying that we need context for data to be meaningful.

Context can be thought of as data’s representational system; such a system includes a common
vocabulary and a set of relationships between components.

If we know the conventions of such a system, then we can interpret the data within it.

These conventions are often documented in a specific kind of data referred to as Metadata– or data
about data.

People often make different choices about how to represent concepts.

From these choices, data takes on different shapes.

Think of the range of ways we have to represent calendar dates, a concept about which there is an
agreed-to definition.

Now consider more complex concepts (such as customer or product), where the granularity and
level of detail of what needs to be represented is not always self-evident.

Within a single organization, there are often multiple ways of representing the same idea.

Hence the need for Data Architecture, modeling, governance, and stewardship, and Metadata and
Data Quality management, all of which help people understand and use data.

Across organizations, the problem of multiplicity multiplies. Hence the need for industry-level
data standards that can bring more consistency to data.

Data vs. Information


Data has been called the “raw material of information” and information has been called “data in
context”.
Often a layered pyramid is used to describe the relationship between data (at the base),
information, knowledge, and wisdom (at the very top).

Wisd
om

Knowledge

Information

Data

While the pyramid can be helpful in describing why data needs to be well-managed, this
representation presents several challenges for data management.

It is based on the assumption that data simply exists.

But data does not simply exist. Data has to be created.

By describing a linear sequence from data through wisdom, it fails to recognize that it takes
knowledge to create data in the first place.

It implies that data and information are separate things, when in reality, the two concepts are
intertwined with and dependent on each other.

Data is a form of information and information is a form of data.

Within an organization, it may be helpful to draw a line between information and data for purposes
of clear communication about the requirements and expectations of different uses by different
stakeholders.

(“Here is a sales report for the last quarter [information]. It is based on data from our data
warehouse [data]. Next quarter these results [data] will be used to generate our quarter-over-
quarter performance measures [information]”).

Recognizing data and information need to be prepared for different purposes drives home a central
tenet of data management: Both data and information need to be managed.
Both will be of higher quality if they are managed together with uses and customer requirements in
mind.

Data as an Asset
An asset is an economic resource, that can be owned or controlled, and that holds or produces
value.

Assets can be converted to money.

Data is widely recognized as an enterprise asset, though understanding of what it means to


manage data as an asset is still evolving.

In the early 1990s, some organizations found it questionable whether the value of goodwill should
be given a monetary value.

Now, the ‘value of goodwill’ commonly shows up as an item on the Profit and Loss Statement (P&L).

Similarly, while not universally adopted, monetization of data is becoming increasingly common.

It will not be too long before we see this as a feature of P&Ls.

Today’s organizations rely on their data assets to make more effective decisions and to operate more
efficiently.

Businesses use data to:

 understand their customers


 create new products and services
 improve operational efficiency by cutting costs and controlling risks

Government agencies, educational institutions, and not-for-profit organizations also need high-
quality data to guide their operational, tactical, and strategic activities.

As organizations increasingly depend on data, the value of data assets can be more clearly
established.

Many organizations identify themselves as ‘data-driven’.

Businesses aiming to stay competitive must stop making decisions based on gut feelings or
instincts, and instead, use event triggers and apply analytics to gain actionable insight.

Being data-driven includes the recognition that data must be managed efficiently and with
professional discipline, through a partnership of business leadership and technical expertise.

Furthermore, the pace of business today means that change is no longer optional; digital
disruption is the norm.
To react to this, business must co-create information solutions with technical data professionals
working alongside line-of-business counterparts.

They must plan for how to obtain and manage data that they know they need to support business
strategy.

They must also position themselves to take advantage of opportunities to leverage data in new
ways.

Data Management Principles


Data management shares characteristics with other forms of asset management, it involves
knowing what data an organization has and what might be accomplished with it, then determining
how best to use data assets to reach organizational goals.

Like other management processes, it must balance strategic and operational needs.

This balance can best be struck by following a set of principles that recognize salient features of
data management and guide data management practice.

Data is an asset with unique properties.

The value of data can and should be expressed in economic terms.

Managing data means managing the quality of data.

It takes Metadata to manage data.

It takes planning to manage data.

Data management is cross-functional.

Data management requires an enterprise perspective.

Data management must account for a range of perspectives.

Data management is lifecycle management.

Different types of data have different lifecycle characteristics.

Managing data includes managing the risks associated with data.

Data management requirements must drive Information Technology decisions.

Effective data management requires leadership commitment.

First, Data is an asset with unique properties: Data is an asset, but it differs from other assets in
important ways that influence how it is managed. The most obvious of these properties is that data
is not consumed when it is used, as are financial and physical assets.

Second, The value of data can and should be expressed in economic terms: Calling data an
asset implies that it has value. While there are techniques for measuring data’s qualitative and
quantitative value, there are not yet standards for doing so. Organizations that want to make better
decisions about their data should develop consistent ways to quantify that value. They should also
measure both the costs of low-quality data and the benefits of high-quality data.

Third, Managing data means managing the quality of data: Ensuring that data is fit for purpose
is a primary goal of data management. To manage quality, organizations must ensure they
understand stakeholders’ requirements for quality and measure data against these requirements.

Fourth, It takes Metadata to manage data: Managing any asset requires having data about that
asset (number of employees, accounting codes, etc.). The data used to manage and use data is called
Metadata. Because data cannot be held or touched, to understand what it is and how to use it
requires definition and knowledge in the form of Metadata.

Metadata originates from a range of processes related to data creation, processing, and use,
including architecture, modeling, stewardship, governance, Data Quality management, systems
development, IT and business operations, and analytics.

Fifth, It takes planning to manage data: Even small organizations can have complex technical and
business process landscapes. Data is created in many places and is moved between places for use.
To coordinate work and keep the end results aligned requires planning from an architectural and
process perspective.

Sixth, Data management is cross-functional; it requires a range of skills and expertise: A data
team cannot manage all of an organization’s data. Data management requires both technical and
non-technical skills and the ability to collaborate.

Seventh, Data management requires an enterprise perspective: Data management has local
applications, but it must be applied across the enterprise to be as effective as possible. This is one
reason why data management and data governance are intertwined.

Eighth, Data management must account for a range of perspectives: Data is fluid. Data
management must constantly evolve to keep up with the ways data is created and used and the data
consumers who use it.

Nineth, Data management is lifecycle management: Data has a lifecycle and managing data
requires managing its lifecycle. Because data begets more data, the data lifecycle itself can be very
complex. Data management practices need to account for the data lifecycle.

Tenth, Different types of data have different lifecycle characteristics: And for this reason, they
have different management requirements. Data management practices have to recognize these
differences and be flexible enough to meet different kinds of data lifecycle requirements.

Eleventh, Managing data includes managing the risks associated with data: In addition to being
an asset, data also represents risk to an organization. Data can be lost, stolen, or misused.
Organizations must consider the ethical implications of their uses of data. Data-related risks must be
managed as part of the data lifecycle.
Twelfth, Data management requirements must drive Information Technology decisions: Data
and data management are deeply intertwined with information technology and information
technology management. Managing data requires an approach that ensures technology serves,
rather than drives, an organization’s strategic data needs .

And Thirteenth, Effective data management requires leadership commitment: Data


management involves a complex set of processes that, to be effective, require coordination,
collaboration, and commitment. Getting there requires not only management skills, but also the
vision and purpose that come from committed leadership.

Data Management Challenges


Because data management has distinct characteristics derived from the properties of data itself, it
also presents challenges in following these principles.

Data Differs Metadata and


from Other Data Valuation Data Quality Data
Assets Management
First, Data Differs From Other Assets

Physical assets can be pointed to, touched, and moved around. They can be in only one place at a
time. Financial assets must be accounted for on a balance sheet.

However, data is different. Data is not tangible. Yet it is durable; it does not wear out, though the
value of data often changes as it ages.

Data is easy to copy and transport. But it is not easy to reproduce if it is lost or destroyed. Because it is
not consumed when used, it can even be stolen without being gone.

Data is dynamic and can be used for multiple purposes. The same data can even be used by multiple
people at the same time – something that is impossible with physical or financial assets.

Many uses of data beget more data. Most organizations must manage increasing volumes of data
and the relation between data sets.

Next is Data Valuation


Since each organization’s data is unique to itself, an approach to data valuation needs to begin by
articulating general cost and benefit categories that can be applied consistently within an
organization.

Sample categories include:

 Cost of obtaining and storing data


 Cost of replacing data if it were lost
 Impact to the organization if data were missing
 Cost of risk mitigation and potential cost of risks associated with data
 Cost of improving data
 Benefits of higher quality data
 What competitors would pay for data
 What the data could be sold for
 Expected revenue from innovative uses of data

Next is Data Quality

Ensuring that data is of high quality is central to data management. Organizations manage their data
because they want to use it. If they cannot rely on it to meet business needs, then the effort to
collect, store, secure, and enable access to it is wasted.

To ensure data meets business needs, they must work with data consumers to define these needs,
including characteristics that make data of high quality.

Largely because data has been associated so closely with information technology, managing Data
Quality has historically been treated as an afterthought.

IT teams are often dismissive of the data that the systems they create are supposed to store.

It was probably a programmer who first observed ‘garbage in, garbage out’ – and who no doubt
wanted to let it go at that.

But the people who want to use the data cannot afford to be dismissive of quality.

They generally assume data is reliable and trustworthy, until they have a reason to doubt these
things.

Once they lose trust, it is difficult to regain it.

Metadata and Data Management

Organizations require reliable Metadata to manage data as an asset. Metadata in this sense should
be understood comprehensively. It includes not only the business, technical, and operational
Metadata but also the Metadata embedded in Data Architecture, data models, data security
requirements, data integration standards, and data operational processes.

Metadata describes what data an organization has, what it represents, how it is classified, where it
came from, how it moves within the organization, how it evolves through use, who can and cannot use
it, and whether it is of high quality.

Data is abstract. Definitions and other descriptions of context enable it to be understood. They
make data, the data lifecycle, and the complex systems that contain data comprehensible.

Cross-functional nature

Data management is a complex process.

Data is managed in different places within an organization by teams that have responsibility for
different phases of the data lifecycle.

Data management requires design skills to plan for systems, highly technical skills to administer
hardware and build software, data analysis skills to understand issues and problems, analytic skills
to interpret data, language skills to bring consensus to definitions and models, as well as strategic
thinking to see opportunities to serve customers and meet goals.

The challenge is getting people with this range of skills and perspectives to recognize how the pieces
fit together so that they collaborate well as they work toward common goals.

Like other assets, Data has a lifecycle.

To effectively manage data assets, organizations need to understand and plan for the data lifecycle.

Well-managed data is managed strategically, with a vision of how the organization will use its data.

A strategic organization will define not only its data content requirements, but also its data
management requirements.

These include policies and expectations for use, quality, controls, and security; an enterprise approach
to architecture and design; and a sustainable approach to both infrastructure and software
development.

Data Risk

Data not only represents value, it also represents risk. Low quality data (inaccurate, incomplete, or
out-of-date) obviously represents risk because its information is not right. But data is also risky
because it can be misunderstood and misused.
Organizations get the most value from the highest quality data – available, relevant, complete,
accurate, consistent, timely, usable, meaningful, and understood. Yet, for many important decisions,
we have information gaps – the difference between what we know and what we need to know to
make an effective decision.

Information gaps represent enterprise liabilities with potentially profound impacts on operational
effectiveness and profitability.

Organizations that recognize the value of high quality data can take concrete, proactive steps to
improve the quality and usability of data and information within regulatory and ethical cultural
frameworks.

Data and Technology

From its inception, the concept of data management has been deeply intertwined with
management of technology.

That legacy continues.

In many organizations, there is ongoing tension between the drive to build new technology and the
desire to have more reliable data – as if the two were opposed to each other instead of necessary to
each other.

Successful data management requires sound decisions about technology, but managing
technology is not the same as managing data.

Organizations need to understand the impact of technology on data, in order to prevent


technological temptation from driving their decisions about data.

Instead, data requirements aligned with business strategy should drive decisions about technology.

Data Management Strategy Defined


A strategy is a set of choices and decisions that together chart a high-level course of action to achieve
high-level goals.

In the game of chess, a strategy is a sequenced set of moves to win by checkmate or to survive by
stalemate.

A strategic plan is a high-level course of action to achieve high-level goals.

A data strategy should include business plans to use information to competitive advantage and
support enterprise goals.
Data strategy must come from an understanding of the data needs inherent in the business
strategy: what data the organization needs, how it will get the data, how it will manage it and ensure
its reliability over time, and how it will utilize it.

In many organizations, the data management strategy is owned and maintained by the Chief Data
Officer (CDO) and enacted through a data governance team, supported by a Data Governance
Council.

Often, the CDO will draft an initial data strategy and data management strategy even before a Data
Governance Council is formed, in order to gain senior management’s commitment to establishing
data stewardship and governance.

The components of a data management strategy should include:

 A compelling vision for data management


 A summary business case for data management, with selected examples
 Guiding principles, values, and management perspectives
 The mission and long-term directional goals of data management
 Proposed measures of data management success
 Short-term (12-24 months)
 Data Management program objectives that are SMART (specific, measurable, actionable,
realistic, time-bound)
 Descriptions of data management roles and organizations, along with a summary of their
responsibilities and decision rights
 Descriptions of Data Management program components and initiatives
 A prioritized program of work with scope boundaries
 A draft implementation roadmap with projects and action items

Deliverables from strategic planning for data management include:

A Data Management Charter: Overall vision, business case, goals, guiding principles, measures of
success, critical success factors, recognized risks, operating model, etc.

A Data Management Scope Statement: Goals and objectives for some planning horizon (usually 3
years) and the roles, organizations, and individual leaders accountable for achieving these
objectives.

A Data Management Implementation Roadmap: Identifying specific programs, projects, task


assignments, and delivery milestones.
Summary
Metadata is data about data. Information is the context and insights from raw data. Data and
information need to be managed together with business and customer requirements in mind. We
learned that data is an asset which has an economic value to a business and if used effectively can
provide competitive advantages.

DATA MANAGEMENT FRAMEWORKS

You might also like