Data Warehousing Questions
Data Warehousing Questions
Technology,
PreFinal
Class: IV year I semester
Branch: CSE
Subject: DWDM
Answer any five questions
Date: 6-10-08
Time: 180 minutes
1. (A). How is a data warehouse different from a database? How are they similar?
(B). Define each of the following data mining functionalities and give one
appropriate example for each: characterization, discrimination, association,
classification, clustering.
2. Design a data warehouse for a regional weather bureau. The weather bureau has
about 1000 probes, which are scattered throughout various land and ocean
locations in the region to collect basic weather data, including air pressure,
temperature, and precipitation at each hour. All data are sent to the central station,
which has collected such data for over 10 years. Your design should facilitate
efficient querying and on-line analytical processing, and derive general weather
patterns in multidimensional space.
3. Briefly compare the following concepts by using appropriate examples.
(A). Snowflake schema, fact constellation, starnet query model
(B). Data cleaning, data transformation, refresh.
4. Suppose that the data for analysis include the attribute age. The age value for the
data tuples are (in increasing order): 13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25,
25, 25, 30, 33, 33, 35, 35, 35, 35, 36, 40, 45, 46, 52, 70.
(A). Use smoothing by bin means to smooth the above data, using a bin depth of
3. Illustrate your steps.
(B). What other methods are there for data smoothing.
5. Using the data for age given in Question 4 above,
(A). Plot an equiwidth histogram of width 10.
(B).Give five number summary of the data.
6. Consider the table, and if birth_country is the concept, find the best attribute
gender
major
birth_country
age_range
gpa
count
Science
Canada
20-25
Very_good
16
Science
Foreign
25-30
Excellent
22
Engineering
Foreign
25-30
Excellent
18
Science
Foreign
25-30
Excellent
25
Science
Canada
20-25
Excellent
21
Engineering
Canada
20-25
Excellent
18