Ict550 Final Assessment
Ict550 Final Assessment
INSTRUCTIONS TO CANDIDATES
1. This question paper consists 2 PARTS. Part A consists of 3 questions and Part B consists
of 3 questions.
a) Data quality measures the condition of data elements to identify issues and
assess their overall "level of truth”. List FIVE (5) dimensions to measure data
quality.
(5 marks)
b) Choose any 3 answers from question (b). How those THREE (3) dimensions
enable an organization to judge whether data is fit for its intended purposes.
(6 marks)
c) Keeping an eye out for the kinds of errors in data profile reports will help to
eliminate data quality errors at the root and will allow you to leverage data
directly for its intended purpose. Identify THREE (3) other causes of poor data
quality and discuss solutions to improve data quality for each of the causes.
(9 marks)
PART B
Choose any TWO (2) industries in Malaysia, explain how future applications of big
data and/or data science could benefit the industry.
(10 marks)
a) Explain the following components of Hadoop ecosystem and their role in big data
environment;
a) Susan, who owns a Bank of America visa card, wants to withdraw money from a
Citi Bank ATM machine, as illustrated in Figure 1. What type of data integration is
used by Citibank in order to verify customers who belong to other banks?
(4 marks)
Figure 1
b) A school uses multiple systems to manage student data. The data is scattered
across different databases, spreadsheets, and applications, as illustrated in
Figure 2. Discuss how ETL (Extract-Transform-Load) approach can be employed
to ensure information received by the principle is consistent, accurate, and
available for reporting and analysis.
(15 marks)