Data Integration Concepts, Processes, and Techniques
Data Integration Concepts, Processes, and Techniques
Module 4
Data Integration Concepts,
Processes,
and Techniques
Lesson 2: Change Data Concepts
Lesson Objectives
• Explain the types of data sources involved in data
integration
• Provide examples of typical data quality problems
encountered during data integration
• Reflect on the relationship between type of
change data and data quality
Processing Level
Logged
Snapshot
Queryable
Cooperative
Applications
UPDATE … UPDATE
UPDATE trigger
trigger
INSERT …
Table
DELETE … INSERT
INSERT trigger
trigger
DELETE
DELETE trigger
trigger
IP Address 111.111.111.111
Remote user -
Authenticated user -
Timestamp [08/Oct/2014:11:17:55 -0400]
Access request "GET / HTTP/1.1"
Status 200
Bytes 10801
Referrer URL "http://www.google.com/search?q=log+analyzer&ie=utf-8&oe=utf-
8&aq=t&rls=org.mozilla:en- US:official&client=firefox-a"
User agent "Mozilla/5.0 (Windows; U; Windows NT 5.2; en-US; rv:1.8.1.7)
Gecko/20070914 Firefox/2.0.0.7"
SELECT …
FROM <EventTable>
WHERE <event-cond>
Delta
Difference
New rows
Changed rows
Current Source File Deleted rows
• Multiple identifiers
• Different units
• Missing values
• Text data with different components and formats
• Conflicting data
• Different update times
10