Database Integration
Bottom-Up Design Methodology
• Bottom-up design is the process by which information from participating
databases is (physically or logically) integrated to form a single cohesive
multidatabase.
• There are two alternative approaches. In some cases, the global conceptual (or
mediated) schema is defined first, in which case bottom-up design involves
mapping the local conceptual schemas (LCSs) to this schema.
• This is the case in data warehouses, but the practice is not restricted to these, and
other data integration methodologies may follow the same strategy. In other cases,
the global conceptual schema (GCS) is defined as an integration of parts of the
LCSs. In this case, bottom-up design involves both the generation of the GCS and
the mapping of individual LCSs to this GCS.
Database Integration Process
• The schema generation process consists of the following steps:
1. Schema matching to determine the syntactic and semantic
correspondences among the translated LCS elements or between
individual LCS elements and the pre-defined GCS elements.
2. Integration of the common schema elements into a global conceptual
(mediated) schema if one has not yet been defined.
3. Schema mapping that determines how to map the elements of each
LCS to the elements of the GCS.
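The three steps above can be sketched end to end on toy data. This is a minimal illustration, not a real integration system: the relation and attribute names (EMP, WORKER, EMPLOYEE, and so on) are hypothetical, and the correspondence table that a real matcher would compute is supplied by hand.

```python
# Two translated LCSs, represented as {relation: [attributes]} dicts.
lcs1 = {"EMP": ["eno", "ename", "salary"]}
lcs2 = {"WORKER": ["wnumber", "wname", "pay"]}

# Step 1: schema matching -- here a hand-supplied correspondence table.
matches = {("EMP", "eno"): ("WORKER", "wnumber"),
           ("EMP", "ename"): ("WORKER", "wname"),
           ("EMP", "salary"): ("WORKER", "pay")}

# Step 2: integration -- merge matched elements into one GCS relation.
gcs = {"EMPLOYEE": ["eno", "ename", "salary"]}

# Step 3: schema mapping -- record how every LCS element maps to the GCS.
mapping = {("EMP", a): ("EMPLOYEE", a) for a in lcs1["EMP"]}
mapping.update({matches[k]: ("EMPLOYEE", k[1]) for k in matches})

print(mapping[("WORKER", "pay")])   # -> ('EMPLOYEE', 'salary')
```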
Schema Matching
• Schema matching determines which concepts of one schema match
those of another. If the GCS has already been defined, then one of
these schemas is typically the GCS, and the task is to match each LCS
to the GCS. Otherwise, matching is done on two LCSs. The matches
determined in this phase are then used in schema mapping to
produce a set of directed mappings, which, when applied to the source
schema, map its concepts to the target schema.
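A crude matcher can be sketched using nothing but name similarity. This is an assumption-laden toy: real matchers also exploit data types, instance data, and structural information, while the sketch below relies only on the standard library's difflib string similarity, with made-up attribute names and an arbitrary threshold.

```python
from difflib import SequenceMatcher

def match_schemas(source_attrs, target_attrs, threshold=0.6):
    """Return (source, target, score) triples whose name similarity
    meets the threshold; everything below it is left unmatched."""
    found = []
    for s in source_attrs:
        for t in target_attrs:
            score = SequenceMatcher(None, s.lower(), t.lower()).ratio()
            if score >= threshold:
                found.append((s, t, round(score, 2)))
    return found

lcs = ["EmpNumber", "EmpName", "Salary"]          # hypothetical LCS attributes
gcs = ["employee_number", "employee_name", "pay"]  # hypothetical GCS attributes
print(match_schemas(lcs, gcs))
```

Note that "Salary" and "pay" are a genuine semantic match that pure name similarity misses, which is exactly the subjectivity problem listed above.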
Schema Matching issues
Aside from schema heterogeneity, other issues that complicate the matching process are
the following:
• Insufficient schema and instance information
• Unavailability of schema documentation
• Subjectivity of matching
Schema Integration
Once schema matching is done, the correspondences between the various LCSs
have been identified. The next step is to create the GCS, and this is referred to as
schema integration. As indicated earlier, this step is only necessary if a GCS has
not already been defined and matching was performed on individual LCSs. If the
GSC was defined up-front, then the matching step would determine correspondences
between it and each of the LCSs and there would be no need for the integration step.
If the GCS is created as a result of the integration of LCSs based on correspondences
identified during schema matching, then, as part of integration, it is important to
identify the correspondences between the GCS and the LCSs.
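One simple integration policy can be sketched as follows: union the attributes of two matched LCS relations under canonical names. The relation names and the correspondence table are illustrative assumptions; the table is what schema matching would have produced.

```python
def integrate(lcs_a, lcs_b, correspondences):
    """Merge two attribute lists into one GCS relation.
    correspondences maps lcs_b attribute names to their lcs_a equivalents."""
    gcs = list(lcs_a)
    for attr in lcs_b:
        canonical = correspondences.get(attr, attr)
        if canonical not in gcs:          # unmatched attributes are added as-is
            gcs.append(canonical)
    return gcs

emp = ["eno", "ename", "title"]           # hypothetical LCS 1
worker = ["wnumber", "wname", "dept"]     # hypothetical LCS 2
corr = {"wnumber": "eno", "wname": "ename"}
print(integrate(emp, worker, corr))       # -> ['eno', 'ename', 'title', 'dept']
```

Because the GCS is built this way, the correspondences between it and each LCS fall out of the construction, as the text notes.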
Schema Mapping
Once a GCS (or mediated schema) is defined, it is necessary to identify how the
data from each of the local databases (source) can be mapped to the GCS (target) while
preserving semantic consistency (as defined by both the source and the target).
Although schema matching has identified the correspondences between the LCSs
and the GCS, it may not have identified explicitly how to obtain the global database
from the local ones. This is what schema mapping is about.
In the case of data warehouses, schema mappings are used to explicitly extract data
from the sources, and translate them to the data warehouse schema for populating it.
In the case of data integration systems, these mappings are used in the query processing
phase by both the query processor and the wrappers.
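The distinction between a match and a mapping can be made concrete: a mapping actually transforms source data into the GCS vocabulary. The sketch below reduces a mapping to attribute renaming over dictionaries; real systems express mappings as queries, and all names here are hypothetical.

```python
def apply_mapping(tuples, mapping):
    """Rewrite each source tuple (a dict) into GCS terms,
    dropping attributes that the mapping does not export."""
    return [{mapping[k]: v for k, v in t.items() if k in mapping}
            for t in tuples]

source_rows = [{"wnumber": 7, "wname": "Ada", "dept": "R&D"}]
mapping = {"wnumber": "eno", "wname": "ename"}   # dept is not exported to the GCS
print(apply_mapping(source_rows, mapping))
# -> [{'eno': 7, 'ename': 'Ada'}]
```

In a warehouse, such a transformation runs once at load time; in a data integration system, it runs at query time inside the wrappers.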
Data Cleaning
Errors in source databases are inevitable, requiring cleaning in order to correctly answer user queries.
Data cleaning is a problem that arises in both data warehouses and data integration systems, but in different
contexts.
In data warehouses where data are actually extracted from local operational databases and materialized as a
global database, cleaning is performed as the global database is created.
In the case of data integration systems, data cleaning is a process that needs to be performed during query
processing when data are returned from the source databases.
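A toy cleaning pass of the kind run while populating a warehouse might normalize formats, drop duplicates, and reject rows that fail sanity checks. The column names and rules below are illustrative assumptions, not a standard cleaning procedure.

```python
def clean(rows):
    """Normalize names, reject negative salaries, drop exact duplicates."""
    seen, out = set(), []
    for r in rows:
        name = r["name"].strip().title()      # normalize whitespace and casing
        if r["salary"] < 0:                   # sanity check: reject bad values
            continue
        key = (name, r["salary"])
        if key in seen:                       # duplicate elimination
            continue
        seen.add(key)
        out.append({"name": name, "salary": r["salary"]})
    return out

dirty = [{"name": "  alice ", "salary": 100},
         {"name": "Alice", "salary": 100},    # duplicate after normalization
         {"name": "bob", "salary": -5}]       # fails the sanity check
print(clean(dirty))  # -> [{'name': 'Alice', 'salary': 100}]
```

In a data integration system, the same checks would instead run on tuples as they stream back from the sources during query processing.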
View Management
One of the main advantages of the relational model is that it provides full logical data independence.
External schemas enable user groups to have their particular view of the database. In a relational system, a
view is a virtual relation, defined as the result of a query on base relations (or real relations), but not
materialized like a base relation, which is stored in the database. A view is a dynamic window in the sense
that it reflects all updates to the database.
An external schema can be defined as a set of views and/or base relations. Besides their use in external
schemas, views are useful for ensuring data security in a simple way.
By selecting a subset of the database, views hide some data. If users may only access the database through
views, they cannot see or manipulate the hidden data, which are therefore secure.
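Both properties above, a view as a dynamic window and as a security mechanism, can be demonstrated with the standard library's sqlite3 module. The table and column names are made up for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (eno INT, ename TEXT, salary INT)")
conn.execute("INSERT INTO emp VALUES (1, 'Ada', 100), (2, 'Bob', 90)")

# The view hides the salary column; it is a stored query, not a copy of the data.
conn.execute("CREATE VIEW emp_public AS SELECT eno, ename FROM emp")

# Dynamic window: an update to the base relation is visible through the view.
conn.execute("INSERT INTO emp VALUES (3, 'Eve', 120)")
print(conn.execute("SELECT * FROM emp_public").fetchall())
# -> [(1, 'Ada'), (2, 'Bob'), (3, 'Eve')]
```

A user granted access only to emp_public can never see or manipulate salaries, which is the security use the text describes.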
Data Security
Data security is an important function of a database system that protects data against unauthorized access.
Data security includes two aspects:
• Data protection
• Access control
Data protection
Data protection is required to prevent unauthorized users from understanding the physical content of data.
This function is typically provided by file systems in the context of centralized and distributed operating
systems.
The main data protection approach is data encryption, which is useful both for information stored on disk
and for information exchanged on a network. Encrypted (encoded) data can be decrypted (decoded) only by
authorized users who “know” the code.
The two main schemes are the Data Encryption Standard [NBS, 1977] and the public-key encryption
schemes.
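The core idea, that only holders of the key can recover the plaintext, can be shown with a deliberately toy symmetric cipher. This repeating-key XOR is for illustration only; it is neither DES nor a public-key scheme, and real systems use vetted algorithms such as AES.

```python
from itertools import cycle

def xor_crypt(data: bytes, key: bytes) -> bytes:
    """XOR with a repeating key; applying it twice restores the input.
    NOT secure -- a classroom stand-in for a real cipher."""
    return bytes(b ^ k for b, k in zip(data, cycle(key)))

key = b"secret"
ciphertext = xor_crypt(b"account balance: 1000", key)
assert ciphertext != b"account balance: 1000"   # stored form is unreadable
print(xor_crypt(ciphertext, key))               # -> b'account balance: 1000'
```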
Access control
Access control must guarantee that only authorized users perform operations they are allowed to perform on
the database.
Many different users may have access to a large collection of data under the control of a single centralized
or distributed system.
The centralized or distributed DBMS must thus be able to restrict the access of a subset of the database to a
subset of the users.
Access control has long been provided by operating systems, and more recently, by distributed operating
systems as services of the file system.
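The restriction described above is often modeled as an authorization matrix mapping (user, relation) pairs to permitted operations. The sketch below is a minimal version of such a check; the users, relations, and rights are illustrative.

```python
# Hypothetical authorization matrix: (user, relation) -> allowed operations.
grants = {("alice", "emp"): {"select", "update"},
          ("bob", "emp"): {"select"}}

def authorized(user, relation, op):
    """True iff the user holds the right to perform op on the relation."""
    return op in grants.get((user, relation), set())

print(authorized("bob", "emp", "select"))   # -> True
print(authorized("bob", "emp", "update"))   # -> False
```

In a distributed DBMS the same check must be enforced consistently at every site that stores a fragment of the relation.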
Semantic Integrity Control
Another important and difficult problem for a database system is how to guarantee database consistency. A
database state is said to be consistent if the database satisfies a set of constraints, called semantic integrity
constraints. Maintaining a consistent database requires various mechanisms such as concurrency control,
reliability, protection, and semantic integrity control, which are provided as part of transaction management.
Semantic integrity control ensures database consistency by rejecting update transactions that lead to
inconsistent database states, or by activating specific actions on the database state, which compensate for the
effects of the update transactions.
Note that the updated database must satisfy the set of integrity constraints.
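The rejection behavior described above can be sketched directly: an update is applied to a candidate state, the constraints are checked, and the update is rejected if the result would be inconsistent. The constraint and data below are made-up examples.

```python
def check_constraints(db):
    # Hypothetical semantic integrity constraint: salaries lie in [0, 500].
    return all(0 <= emp["salary"] <= 500 for emp in db.values())

def update_salary(db, eno, new_salary):
    """Apply the update to a candidate state; commit only if consistent."""
    candidate = dict(db)
    candidate[eno] = {**db[eno], "salary": new_salary}
    if not check_constraints(candidate):
        raise ValueError("update rejected: constraint violated")
    return candidate                      # the committed, consistent state

db = {1: {"name": "Ada", "salary": 100}}
db = update_salary(db, 1, 200)            # consistent, so it is accepted
try:
    update_salary(db, 1, 9000)            # would violate the constraint
except ValueError as e:
    print(e)                              # -> update rejected: constraint violated
```

The alternative mentioned in the text, compensating actions, would instead modify the candidate state until the constraints hold rather than rejecting the transaction outright.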
THANK YOU