0% found this document useful (0 votes)
9 views

Data Mining and Data Warehousing 2023

Uploaded by

jit189111
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views

Data Mining and Data Warehousing 2023

Uploaded by

jit189111
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

QP Code: RD21BTECH271 Reg.

AY 21
No

GIET UNIVERSITY, GUNUPUR – 765022


B. Tech (Fifth Semester Regular) Examinations, December – 2023
21BCSPC35002 – Data Mining and Data Warehousing
(CSE)
Time: 3 hrs Maximum: 70 Marks
Answer all questions
(The figures in the right hand margin indicate marks)
PART – A (2 x 5 = 10 Marks)

Q.1. Answer ALL questions CO # Blooms


Level

a. Why data transformation is essential in the process of knowledge discovery? CO1 K1

b. What smoothing techniques are available to remove noise? CO1 K1

c. How are association rules mined from large databases? CO2 K1

d. Distinguish between Coverage and Accuracy of a Rule. CO3 K4

e. What are the application areas of Data Mining? CO4 K1

PART – B (15 x 4 = 60 Marks)

Answer ALL questions Marks CO # Blooms


Level
2. a. Explain the steps of KDD with the help of a diagram. 7 CO1 K1

b. Find the Chi square correlation analysis for the given four entity instances. 8 CO1 K2

Qualification/status Middle High Bachelor Master Ph.D.


school school
Never married 18 36 21 9 6
Married 12 36 45 36 21
Divorced 6 9 9 3 3
Widowed 3 9 9 6 3

(OR)
c. What is normalization? Explain why normalization is performed? 7 CO1 K1

d. Suppose that a hospital tested the age and body fat data for 18 randomly selected 8 CO1 K2
adults with the following results:

(a) Calculate the mean, median, and standard deviation of age and %fat.
(b) Find out the covariance and correlation among these two attributes.
3.a. Define data warehouse. Draw the architecture of data warehouse and explain 8 CO2 K1
the three tiers in detail with a case study.

Page 1 of 2
b. Differentiate between star schema, snowflake schema and fact constellation. 7 CO2 K4

(OR)
c. Distinguish between OLAP and OLTP. List out the various OLAP operations carried 8 CO2 K4
out in Data Warehouse.
d. What is the difference between Virtual data warehouse and enterprise data 7 CO2 K1

warehouse?
4.a. There are five transactions (T1, T2, T3, T4, T5) with items (A, B, C, D) purchased as 8 CO3 K2
T1(B, C), T2(A, C, D), T3(B, C), T4(A, B, C, D), T5(B, D). The min_sup = 2. Show
how FP-growth approach can generate the association rules for the above dataset.
b. Explain Decision tree induction algorithm for classification. Discuss the 7 CO3 K2

usage of information gain in this.


(OR)
c. Discuss about the attribute selection measures in constructing a decision tree 8 CO3 K2

with an example.

d. Describe KNN Algorithm for data classification with appropriate example. 7 CO3 K2

5.a. Cluster the following eight points (with (x, y) representing locations) into 8 CO4 K2

three clusters:
A1(2, 10), A2(2, 5), A3(8, 4), A4(5, 8), A5(7, 5), A6(6, 4), A7(1, 2), A8(4, 9)
b. Elaborate the various partitioning methods in detail. 7 CO4 K2

(OR)
c. Differentiate Agglomerative and Divisive Hierarchical Clustering? 8 CO4 K2

d. How can we use data mining in the field of retail and telecommunication 7 CO4 K2

industry?

--- End of Paper ---

Page 2 of 2

You might also like