Introduction To Data Analytics and Visualization Question Paper
Introduction To Data Analytics and Visualization Question Paper
B.Tech.
(SEM V) THEORY EXAMINATION 2022-23
INTRODUCTION TO DATA ANALYTICS AND VISUALIZATION
SECTION A
2
(i) Define sharding and database shard.
90
13
(j) List any two visualization tools.
_2
2.
P1
24
SECTION B
5.
3D
.5
P2
|1
(c) Explain Alon-Matias-Szegedy Algorithm for second moments on the
given stream {a, b, c, b, d, a, c, d, a, b, d, c, a, a, b}.
7
SECTION C
3. Attempt any one part of the following: 10*1 = 10
3
02
(a) Describe PCA algorithm. Compute the principal component using PCA
algorithm on given data as:
CLASS 1: X=2,3,4
Y=1,5,3
CLASS 2:
X=5,6,7
Y=6,7,8
(b) What do you mean by k-means clustering? How does the k-means
algorithm work? Write k-meansalgorithm for partitioning.
2
7. Attempt any one part of the following:
90 10*1 = 10
13
(a) Define HDFS. Discuss the HDFS architecture and HDFS commands in
_2
2.
brief.
P1
24
(b) Differentiate between Map Reduce and Apache Pig.
5.
3D
.5
P2
17
Q
|1
7
:3
: 44
13
3
02
-2
01
4-
|2