Lecture-43 program to detect double space

The document discusses data anonymization, which is the process of protecting sensitive information by removing identifiers that link individuals to their data. It outlines various techniques for anonymization, such as data masking, pseudonymization, generalization, and synthetic data, while also highlighting the limitations imposed by regulations like GDPR. Additionally, it addresses the disadvantages of anonymization, including the potential loss of valuable insights from the data.

Uploaded by

MUHAMMAD AHMAD

Information Security

Lecture # 43

Dr. Shafiq Hussain

Associate Professor & Chairperson
Department of Computer Science

1
Objectives
• Introduction to Anonymity of Data.

2
Anonymity of Data
• Data anonymization is the process of protecting
private or sensitive information by erasing or
encrypting identifiers that connect an individual to
stored data.

3
Anonymity of Data (Cont..)
• For example, you can run Personally Identifiable
Information (PII) such as names, social security
numbers, and addresses through a data anonymization
process that retains the data but keeps the source
anonymous.

4
Anonymity of Data (Cont..)
• However, even when you clear data of identifiers,
attackers can use de-anonymization methods to
retrace the data anonymization process.

• Since data usually passes through multiple sources, some
available to the public, de-anonymization techniques can
cross-reference the sources and reveal personal information.
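
The cross-referencing described above can be sketched as a simple linkage attack. This is a minimal illustration, not a real attack tool: the records, the field names (zip, birth_date, sex), and the function link_records are all hypothetical and assumed only for the example.

```python
# Hypothetical "anonymized" records: names removed, quasi-identifiers kept.
anonymized = [
    {"zip": "02138", "birth_date": "1945-07-31", "sex": "F", "diagnosis": "asthma"},
    {"zip": "02139", "birth_date": "1962-01-15", "sex": "M", "diagnosis": "diabetes"},
]

# Hypothetical public source (for example a public register) that includes names.
public = [
    {"name": "Alice Example", "zip": "02138", "birth_date": "1945-07-31", "sex": "F"},
    {"name": "Bob Example", "zip": "02140", "birth_date": "1970-03-02", "sex": "M"},
]

QUASI_IDENTIFIERS = ("zip", "birth_date", "sex")


def link_records(anonymized_rows, public_rows, keys=QUASI_IDENTIFIERS):
    """Return (name, diagnosis) pairs where the quasi-identifiers match one public record."""
    # Index the public rows by their quasi-identifier combination.
    index = {}
    for row in public_rows:
        index.setdefault(tuple(row[k] for k in keys), []).append(row["name"])

    matches = []
    for row in anonymized_rows:
        names = index.get(tuple(row[k] for k in keys), [])
        if len(names) == 1:  # a unique match re-identifies the record
            matches.append((names[0], row["diagnosis"]))
    return matches


reidentified = link_records(anonymized, public)
```

Here one of the two "anonymous" rows is tied back to a named person, even though no name was ever stored in the anonymized dataset.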

5
Anonymity of Data (Cont..)
• The General Data Protection Regulation (GDPR)
outlines a specific set of rules that protect user data
and create transparency.

• While the GDPR is strict, it permits companies to
collect anonymized data without consent, use it for
any purpose, and store it for an indefinite time, as
long as all identifiers are removed so that individuals
can no longer be identified.

6
Anonymity of Data (Cont..)
Data Anonymization Techniques
Data masking
• Hiding data with altered values.

• You can create a mirror version of a database and
apply modification techniques such as character
shuffling, encryption, and word or character
substitution.

7
Anonymity of Data (Cont..)
Data Anonymization Techniques
Data masking
• For example, you can replace a value character with a
symbol such as “*” or “x”.

• Done well, data masking makes reverse engineering or
detection of the original values impractical, though
not strictly impossible.
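
Character substitution with a symbol such as "*" can be sketched as follows. This is a minimal example under stated assumptions: the function name mask_value and its parameters visible and mask_char are hypothetical, and keeping the last few characters visible is one common choice rather than a requirement.

```python
def mask_value(value, visible=4, mask_char="*"):
    """Replace every character except the last `visible` ones with `mask_char`."""
    if visible <= 0:
        return mask_char * len(value)
    hidden = max(len(value) - visible, 0)  # number of characters to hide
    return mask_char * hidden + value[hidden:]


masked_ssn = mask_value("123-45-6789")          # "*******6789"
fully_masked = mask_value("secret", visible=0)  # "******"
```

Note that a value shorter than `visible` is returned unchanged, so a real masking rule would also need a policy for short values.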

8
Anonymity of Data (Cont..)
Data Anonymization Techniques
Pseudonymization
• A data management and de-identification method that
replaces private identifiers with fake identifiers or
pseudonyms, for example replacing the identifier
“John Smith” with “Mark Spencer”.

9
Anonymity of Data (Cont..)
Data Anonymization Techniques
Pseudonymization
• Pseudonymization preserves statistical accuracy and
data integrity, allowing the modified data to be used
for training, development, testing, and analytics while
protecting data privacy.
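
One possible way to generate consistent pseudonyms is a keyed hash, sketched below. This swaps in a token-style pseudonym instead of a fake name like "Mark Spencer"; the function name pseudonymize, the prefix, and the key are assumptions made for illustration. Because the same identifier always maps to the same pseudonym, joins and counts across tables still work, which is what preserves statistical accuracy.

```python
import hashlib
import hmac


def pseudonymize(identifier, secret_key, prefix="user_"):
    """Map an identifier to a stable pseudonym using HMAC-SHA256.

    `secret_key` must be bytes and kept separate from the pseudonymized data;
    anyone holding it can recompute the mapping for a known identifier.
    """
    digest = hmac.new(secret_key, identifier.encode("utf-8"), hashlib.sha256)
    return prefix + digest.hexdigest()[:12]


key = b"example-key-for-illustration-only"  # hypothetical key
alias_a = pseudonymize("John Smith", key)
alias_b = pseudonymize("John Smith", key)
alias_c = pseudonymize("Jane Doe", key)
```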

10
Anonymity of Data (Cont..)
Data Anonymization Techniques
Generalization
• Deliberately removes some of the data to make it less
identifiable.

• Data can be modified into a set of ranges or a broad
area with appropriate boundaries.

11
Anonymity of Data (Cont..)
Data Anonymization Techniques
Generalization
• You can remove the house number in an address, but
make sure you don’t remove the road name.

• The purpose is to eliminate some of the identifiers
while retaining a measure of data accuracy.
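
Both forms of generalization mentioned above, ranges and partial removal, can be sketched in a few lines. The function names generalize_age and drop_house_number and the range width of 10 are assumptions for illustration; the address pattern only handles a leading house number.

```python
import re


def generalize_age(age, width=10):
    """Replace an exact age with the range it falls into, e.g. 37 -> "30-39"."""
    low = (age // width) * width
    return f"{low}-{low + width - 1}"


def drop_house_number(address):
    """Remove a leading house number but keep the road name."""
    return re.sub(r"^\s*\d+\w*\s+", "", address)


age_range = generalize_age(37)                  # "30-39"
road = drop_house_number("221B Baker Street")   # "Baker Street"
```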

12
Anonymity of Data (Cont..)
Data Anonymization Techniques
Data swapping
• Also known as shuffling and permutation, a technique
used to rearrange the dataset attribute values so they
don’t correspond with the original records.

13
Anonymity of Data (Cont..)
Data Anonymization Techniques
Data swapping
• Swapping attributes (columns) that contain identifying
values, such as date of birth, may have more impact on
anonymization than swapping membership type values.
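
Swapping a single column can be sketched as a shuffle of that column's values across records. The function name swap_column, the sample rows, and the field names are hypothetical; a seed is accepted only so that the example is repeatable.

```python
import random


def swap_column(rows, column, seed=None):
    """Return new rows in which the values of `column` are shuffled across records."""
    rng = random.Random(seed)
    values = [row[column] for row in rows]
    rng.shuffle(values)
    # Build new dictionaries so the original rows are left untouched.
    return [{**row, column: value} for row, value in zip(rows, values)]


members = [
    {"id": 1, "dob": "1990-01-01", "plan": "gold"},
    {"id": 2, "dob": "1985-06-15", "plan": "silver"},
    {"id": 3, "dob": "1978-11-30", "plan": "gold"},
    {"id": 4, "dob": "2001-03-22", "plan": "bronze"},
]
swapped = swap_column(members, "dob", seed=42)
```

The set of dates of birth in the dataset is unchanged, so column-level statistics survive, but a date no longer reliably belongs to the record it appears in.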

14
Anonymity of Data (Cont..)
Data Anonymization Techniques
Data perturbation
• Modifies the original dataset slightly by applying
techniques that round numbers and add random noise.

• The range of values needs to be in proportion to the
perturbation.

15
Anonymity of Data (Cont..)
Data Anonymization Techniques
Data perturbation
• A small base may lead to weak anonymization while
a large base can reduce the utility of the dataset.

• For example, you can use a base of 5 for rounding
values like age or house number because it’s
proportional to the original value.
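
Rounding to a base and adding random noise can be sketched as below. The function names round_to_base and add_noise are hypothetical, and uniform noise is just one assumed choice of noise distribution.

```python
import random


def round_to_base(value, base=5):
    """Round a number to the nearest multiple of `base`."""
    return base * round(value / base)


def add_noise(value, scale, rng):
    """Add uniform random noise in the interval [-scale, +scale]."""
    return value + rng.uniform(-scale, scale)


rounded_age = round_to_base(37)      # 35
rounded_house = round_to_base(142)   # 140
noisy_age = add_noise(50, 2, random.Random(0))
```

With a base of 5 an age moves by at most a few years, which illustrates the trade-off: a larger base hides more but distorts the data more.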

16
Anonymity of Data (Cont..)
Data Anonymization Techniques
Synthetic data
• Algorithmically manufactured information that has
no connection to real events.

• Synthetic data is used to create artificial datasets
instead of altering the original dataset or using it as is
and risking privacy and security.

17
Anonymity of Data (Cont..)
Data Anonymization Techniques
Synthetic data
• The process involves creating statistical models based
on patterns found in the original dataset.

• You can use standard deviations, medians, linear
regression or other statistical techniques to generate
the synthetic data.
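
A very small version of this process is to fit a mean and standard deviation to one column and sample new values from a normal distribution. This is a sketch under a strong assumption (that the column is roughly normally distributed); the function names fit_normal and synthesize and the sample ages are hypothetical.

```python
import random
import statistics


def fit_normal(values):
    """Estimate the mean and sample standard deviation of the original column."""
    return statistics.mean(values), statistics.stdev(values)


def synthesize(values, n, seed=None):
    """Draw `n` artificial values from a normal model fitted to `values`."""
    mean, stdev = fit_normal(values)
    rng = random.Random(seed)
    return [rng.gauss(mean, stdev) for _ in range(n)]


original_ages = [23, 35, 41, 29, 52, 47, 38, 31]
synthetic_ages = synthesize(original_ages, n=5000, seed=1)
```

The synthetic values follow the same overall pattern as the originals without copying any individual record; real generators model several columns and their relationships together.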

18
Anonymity of Data (Cont..)
Data Anonymization Techniques
Data aggregation
• Data aggregation, which combines data collected
from many different sources into a single view, is
used to gain insights for enhanced decision-making,
or analysis of trends and patterns.

19
Anonymity of Data (Cont..)
Data Anonymization Techniques
Data aggregation
• Data can be aggregated at different levels of
granularity, from simple summaries to complex
calculations, and can be done on categorical data,
numerical data, and text data.

20
Anonymity of Data (Cont..)
Data Anonymization Techniques
Data aggregation
• Aggregated data can be presented in various forms,
and used for a variety of purposes, including analysis,
reporting, and visualization.
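
A simple summary-level aggregation can be sketched as grouping records and reporting only a count and a mean per group. The function name aggregate_mean, the field names, and the sample figures are hypothetical.

```python
from collections import defaultdict


def aggregate_mean(rows, group_key, value_key):
    """Group rows by `group_key` and return the count and mean of `value_key` per group."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[group_key]].append(row[value_key])
    return {
        key: {"count": len(values), "mean": sum(values) / len(values)}
        for key, values in groups.items()
    }


salaries = [
    {"department": "A", "salary": 30000},
    {"department": "A", "salary": 50000},
    {"department": "B", "salary": 40000},
]
summary = aggregate_mean(salaries, "department", "salary")
```

Note that a group with a single member (department "B" here) still exposes that person's exact value, so small groups usually need to be merged or suppressed before publishing.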

21
Anonymity of Data (Cont..)
Data Anonymization Techniques
Random data generation
• Random data generation, which randomly shuffles
data in order to obscure sensitive information, can be
applied to an entire dataset, or to specific fields or
columns in a database.

22
Anonymity of Data (Cont..)
Data Anonymization Techniques
Random data generation
• Often used together with data masking tools or data
tokenization tools, random data generation is ideal for
clinical trials, to ensure that the subjects are not only
randomly chosen, but also randomly assigned to
different treatment groups.

24
Anonymity of Data (Cont..)
Disadvantages of Data Anonymization
• The GDPR stipulates that websites must obtain
consent from users to collect personal information
such as IP addresses, device ID, and cookies.

25
Anonymity of Data (Cont..)
Disadvantages of Data Anonymization
• Collecting anonymous data and deleting identifiers
from the database limit your ability to derive value
and insight from your data.

• For example, anonymized data cannot be used for
marketing efforts, or to personalize the user
experience.

26
Questions
Any questions, please?

You can contact me at: [email protected]

Your query will be answered within one working day.

27
Further Readings
• Chapter No. 1
Computer Security: Principles and Practice (3rd Edition)
By William Stallings and Lawrie Brown

28
Thanks
