How To Succeed With Data Classification Using Modern Approaches
Overview
Key Findings
Manual data classification approaches can result in misclassification of data due to human
error or a lack of user awareness training.
While users label/tag their data, these labels remain one-dimensional: they serve a single
purpose and do not provide sufficient context for the growing control requirements of data regulations.
Recommendations
To implement an effective data classification program, security and risk management (SRM) leaders
tasked with data security must:
Establish a data classification program by shifting focus from user awareness and training
toward automation and the enrichment tools that generate metadata.
Increase depth and dimensionality in the data classification approach by segmenting into a
discovery phase, data enrichment phase and control phase.
Introduction
Data classification is vital because it supports data security and governance controls
such as data loss prevention (DLP), data access governance and enterprise digital rights
management (EDRM). It also helps organizations understand data in the context of its usage
and risk levels. However, Gartner observes that unstructured data is becoming increasingly
difficult to manage. As a result, the individuals or systems tasked with processing
information rarely classify, label or enforce controls on every piece of data. This inconsistency
makes classification unreliable as a driver of, and means of support for, data security and
compliance efforts. Organizations need a practical data classification approach that gives the
business a foundation for understanding its data and applying the necessary mitigating measures.
Two types of data classification tools are available in the market:
1. User-driven/manual tools: These tools enforce the classification of data at the time of creation
or use. They rely on user education and awareness; without these, data will be classified
inconsistently or misclassified.
2. Automated tools: These tools are based on out-of-the-box policies and templates, provided
by the vendors, that identify sensitive data and classify it. Beyond analyzing content,
leading tools also leverage context such as location, access groups and
adjacent documents. Automated tools get the best results with well-known, standard data
types (such as driving license information, proper names and social security numbers). If your
intellectual property data is consistently well-formatted (such as with an account number or
project coding system), then automated systems will succeed there as well.
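To make the pattern-based approach concrete, the minimal Python sketch below flags well-formatted data types with regular expressions. The patterns, the PRJ-style project code format and the sample text are illustrative assumptions, not any vendor's shipped policies.

```python
import re

# Illustrative patterns only; real products ship far richer policy packs
# and combine patterns with contextual signals (location, access groups).
PATTERNS = {
    "US_SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "EMAIL": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.]+\b"),
    # Well-formatted internal identifiers, such as a hypothetical project
    # coding system, are also good candidates for pattern matching.
    "PROJECT_CODE": re.compile(r"\bPRJ-\d{6}\b"),
}

def detect_sensitive(text: str) -> dict[str, list[str]]:
    """Return every pattern match found in the text, keyed by data type."""
    hits = {name: pat.findall(text) for name, pat in PATTERNS.items()}
    return {name: found for name, found in hits.items() if found}

sample = "Contact jane@example.com about PRJ-004217; SSN on file: 123-45-6789."
print(detect_sensitive(sample))
# {'US_SSN': ['123-45-6789'], 'EMAIL': ['jane@example.com'],
#  'PROJECT_CODE': ['PRJ-004217']}
```

This works well precisely because the targets are well-formatted; free-form intellectual property without a consistent structure is where pure pattern matching breaks down.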
The introduction of machine learning to automated data classification tools has proven
beneficial, especially as some of these tools now support dynamic feedback. These tools
learn from the responses provided by security analysts/administrators, which helps to quickly
address any false positives. But for most tools, the cost of implementing and tuning them to
reliably identify sensitive internal or proprietary data in detail is prohibitive; for
those use cases, user-driven classification should be considered instead (or, preferably, as well).
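A minimal sketch of such a feedback loop follows, using scikit-learn as a stand-in for a vendor's model. The training snippets and the retrain-on-correction strategy are simplifying assumptions; production tools train on far larger corpora and typically use incremental learning rather than full retraining.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Hypothetical training snippets standing in for a labeled corpus.
docs = [
    "salary and bank details for an employee",
    "cafeteria menu for next week",
    "passport number and date of birth",
    "office holiday party schedule",
]
labels = ["sensitive", "public", "sensitive", "public"]

model = make_pipeline(TfidfVectorizer(), LogisticRegression())
model.fit(docs, labels)

# Dynamic feedback: suppose an analyst reviews a prediction, flags it as a
# false positive, and the corrected example is folded back into training.
new_doc = "weekly menu of employee favorites"
print(model.predict([new_doc]))  # may come back 'sensitive' (false positive)

docs.append(new_doc)
labels.append("public")          # the analyst's correction
model.fit(docs, labels)          # retrain with the feedback incorporated
print(model.predict([new_doc]))  # should now reflect the analyst's label
```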
Analysis
Enrich, Don’t Just Classify Data
Traditional data classification approaches have always relied on users: data owners and data
creators were responsible for classifying any file or document they created or owned. This
approach has prerequisites, including user awareness training that explains the importance of
data classification, and preexisting data classification policies.
To accommodate users, sensitivity classification schemes are often simplified into “buckets.” The
four levels of classification that are often used are:
Restricted
Confidential
Internal
Public
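A minimal sketch of how such a bucket scheme might be encoded is shown below; the handling rules attached to each level are illustrative assumptions, not any standard.

```python
from enum import IntEnum

# One hypothetical encoding of a four-level sensitivity scheme.
class Sensitivity(IntEnum):
    PUBLIC = 0
    INTERNAL = 1
    CONFIDENTIAL = 2
    RESTRICTED = 3

# Assumed default controls per level, purely for illustration.
HANDLING = {
    Sensitivity.PUBLIC:       {"encrypt_at_rest": False, "external_sharing": True},
    Sensitivity.INTERNAL:     {"encrypt_at_rest": False, "external_sharing": False},
    Sensitivity.CONFIDENTIAL: {"encrypt_at_rest": True,  "external_sharing": False},
    Sensitivity.RESTRICTED:   {"encrypt_at_rest": True,  "external_sharing": False,
                               "dlp_block_upload": True},
}

def controls_for(level: Sensitivity) -> dict:
    """Look up the default handling controls for a classification level."""
    return HANDLING[level]

print(controls_for(Sensitivity.CONFIDENTIAL))
```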
This bucket approach depends on the understanding (and often the risk appetite) of the users
who are classifying the information. It is prone to human error, and data may end up:
Underclassified (either through error or because users realize that a lower classification will
make their job easier).
Overclassified (a common mistake when users are risk-averse or uncomfortable with the
scheme, leading to overspending and difficulty in accessing and handling the data).
SRM leaders who currently use a traditional classification scheme, and who find that it does not
support the increased detail demanded by modern data governance laws, should take steps to
evolve toward metadata enrichment. Metadata, in general terms, is data about data; this
approach attaches additional information to the data, which can be embedded directly into
the files. This approach is called “descriptive classification.” Here, data is classified not in
accordance with control requirements, but in accordance with the semantic description of the
data. Figure 1 shows an example of descriptive classification.
Here, users set the description of the data (such as customer records, financial data and HR data),
which is mapped to the control requirement so that the description itself yields metadata. The
benefits of this method are a reduced need for awareness training and a reduction in human error
and misclassification. This approach also provides a good transition from control-based
classifiers, as each descriptive classifier maps to a control. The organization also gains the
benefit of inferred metadata associated with the descriptive classifier; for example, “HR data” is
taken to contain both “personal” and “personal sensitive” data. Also, given the high risk of data
exfiltration, this approach helps organizations classify information easily and ensure that
only the right people have access to sensitive data. The one downside is that the list of
descriptive classifiers is far longer.
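The sketch below illustrates how a descriptive classification table might work: the user picks a semantic description, and a centrally maintained mapping infers the metadata and control requirement. The “HR data” entry follows the example above; the other entries and field names are illustrative assumptions.

```python
# Hypothetical mapping from user-chosen descriptions to inferred metadata
# and control requirements. Only "HR data" is taken from the text above;
# the remaining rows are invented for illustration.
DESCRIPTIVE_CLASSIFIERS = {
    "HR data":          {"inferred_metadata": ["personal", "personal sensitive"],
                         "control": "Confidential"},
    "Customer records": {"inferred_metadata": ["personal"],
                         "control": "Confidential"},
    "Financial data":   {"inferred_metadata": ["regulated"],
                         "control": "Restricted"},
    "Marketing brochure": {"inferred_metadata": [],
                           "control": "Public"},
}

def enrich(description: str) -> dict:
    """Resolve a user-chosen description to metadata and a control level."""
    entry = DESCRIPTIVE_CLASSIFIERS[description]
    return {"description": description, **entry}

print(enrich("HR data"))
# {'description': 'HR data',
#  'inferred_metadata': ['personal', 'personal sensitive'],
#  'control': 'Confidential'}
```

Because the mapping is maintained centrally, tightening a control requirement means updating the table rather than retraining every user.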
The metadata enrichment approach can be segmented into three phases: discovery, enrichment
and control. The first phase is a discovery process, which involves locating information. This may
seem trivial, but the nature of our digital world means that information is everywhere, and much of
it is unknown to IT teams. Most automated data classification tools provide the data discovery
capabilities needed for this phase. Next comes enrichment, which takes the result of discovery
and applies tags or labels to data objects. Many tools provide the needed automation for this step
by using content inspection capabilities as well as AI-driven methods including machine learning,
natural language processing (NLP) and computer vision.
For example, some of the tags associated with a résumé document would include aspects like
“Personal,” “Sensitive,” “HR,” “CV,” “DOB: 19760822,” “Last Edit: 20190326” and “Region: India.” The
last step is applying controls where these tags provide the critical metadata needed by control
tools — such as data retention tools, DLP tools or content collaboration platforms — to properly
handle the files in question (see Figure 2).
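To make the three phases concrete, the sketch below wires them together in miniature. It is a toy pipeline under stated assumptions: the /shares/hr root, the filename-based enrichment heuristic and the control actions are hypothetical, and the tags reuse the résumé example above.

```python
import os

def discover(root: str) -> list[str]:
    """Phase 1 (discovery): locate files. Real tools crawl file shares,
    cloud storage and SaaS repositories, much of it unknown to IT."""
    return [os.path.join(d, f) for d, _, files in os.walk(root) for f in files]

def enrich(path: str) -> dict:
    """Phase 2 (enrichment): attach tags. Real tools use content inspection,
    machine learning, NLP and computer vision, not filename hints."""
    tags = {"Region": "India", "Last Edit": "20190326"}
    if "resume" in path.lower() or "cv" in path.lower():
        tags.update({"Personal": True, "Sensitive": True,
                     "HR": True, "CV": True})
    return {"path": path, "tags": tags}

def apply_controls(item: dict) -> str:
    """Phase 3 (control): the tags become the metadata that DLP, retention
    or collaboration tools consume to decide how to handle the file."""
    if item["tags"].get("Sensitive"):
        return f"DLP: block external sharing of {item['path']}"
    return f"No restriction for {item['path']}"

for item in (enrich(path) for path in discover("/shares/hr")):
    print(apply_controls(item))
```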