0% found this document useful (0 votes)
215 views

Data Warehousing Questions

This document provides instructions for a pre-final exam in data warehousing and data mining (DWDM). It lists 8 questions to choose from that cover topics such as the differences between databases and data warehouses, data mining functionalities, designing a data warehouse for weather data, comparing data modeling concepts, data smoothing techniques, histograms, association rule mining, and comparing the Apriori and FP-growth frequent pattern mining algorithms. Students have 180 minutes to answer 5 of the 8 questions.

Uploaded by

Komo Mo
Copyright
© © All Rights Reserved
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
215 views

Data Warehousing Questions

This document provides instructions for a pre-final exam in data warehousing and data mining (DWDM). It lists 8 questions to choose from that cover topics such as the differences between databases and data warehouses, data mining functionalities, designing a data warehouse for weather data, comparing data modeling concepts, data smoothing techniques, histograms, association rule mining, and comparing the Apriori and FP-growth frequent pattern mining algorithms. Students have 180 minutes to answer 5 of the 8 questions.

Uploaded by

Komo Mo
Copyright
© © All Rights Reserved
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
You are on page 1/ 2

Rajeev Gandhi Memorial College of Engineering and

Technology,

PreFinal
Class: IV year I semester
Branch: CSE
Subject: DWDM
Answer any five questions

Date: 6-10-08
Time: 180 minutes

1. (A). How is a data warehouse different from a database? How are they similar?
(B). Define each of the following data mining functionalities and give one
appropriate example for each: characterization, discrimination, association,
classification, clustering.
2. Design a data warehouse for a regional weather bureau. The weather bureau has
about 1000 probes, which are scattered throughout various land and ocean
locations in the region to collect basic weather data, including air pressure,
temperature, and precipitation at each hour. All data are sent to the central station,
which has collected such data for over 10 years. Your design should facilitate
efficient querying and on-line analytical processing, and derive general weather
patterns in multidimensional space.
3. Briefly compare the following concepts by using appropriate examples.
(A). Snowflake schema, fact constellation, starnet query model
(B). Data cleaning, data transformation, refresh.
4. Suppose that the data for analysis include the attribute age. The age value for the
data tuples are (in increasing order): 13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25,
25, 25, 30, 33, 33, 35, 35, 35, 35, 36, 40, 45, 46, 52, 70.
(A). Use smoothing by bin means to smooth the above data, using a bin depth of
3. Illustrate your steps.
(B). What other methods are there for data smoothing.
5. Using the data for age given in Question 4 above,
(A). Plot an equiwidth histogram of width 10.
(B).Give five number summary of the data.
6. Consider the table, and if birth_country is the concept, find the best attribute
gender

major

birth_country

age_range

gpa

count

Science

Canada

20-25

Very_good

16

Science

Foreign

25-30

Excellent

22

Engineering

Foreign

25-30

Excellent

18

Science

Foreign

25-30

Excellent

25

Science

Canada

20-25

Excellent

21

Engineering

Canada

20-25

Excellent

18

based on Information Gain.


7. A. Define support and confidence of an association rule.
B. Describe with examples how candidate itemsets are generated in the Apriori
method (while frequent itemset finding).
8. Compare FP-tree based FP-growth method with Apriori method. Compare merits
and demerits of them.

You might also like