Introduction To Data Analytics and Visualization Question Paper

This document is the B.Tech. Semester V theory examination paper (2022-23) for the course Introduction to Data Analytics and Visualization (KDS-501). It has three sections and seven questions. Section A has ten short-answer questions covering topics such as big data analytics platforms, time series components, regression techniques, and visualization tools. Section B offers five long-answer questions, of which any three are to be attempted, on data types, fuzzy models, stream algorithms, market basket analysis, and visualization techniques. Section C contains questions 3 to 7, each offering a choice of two long-answer parts, covering data analytics tools, the data analytics life cycle, PCA, Bayesian networks, data stream management, clustering, HDFS, and MapReduce versus Apache Pig.

Uploaded by

Abhay Gupta
Copyright: © All Rights Reserved

Printed Pages: 02    Sub Code: KDS-501

Paper Id: 232173    Roll No.

B.Tech.
(SEM V) THEORY EXAMINATION 2022-23
INTRODUCTION TO DATA ANALYTICS AND VISUALIZATION

Time: 3 Hours Total Marks: 100


Note: Attempt all Sections. If you require any missing data, then choose suitably.

SECTION A

1. Attempt all questions in brief. 2*10 = 20


(a) List the features of a big data analytics platform.
(b) Why is there a need for data analytics?
(c) What are the components of time series?
(d) List various types of regression analysis techniques.
(e) What is sentiment analysis?
(f) What are the problems in the Flajolet-Martin (FM) algorithm?
(g) What is Market Basket Analysis?
(h) Explain the hierarchical method of clustering.
(i) Define sharding and database shard.
(j) List any two visualization tools.

SECTION B

2. Attempt any three of the following: 10*3 = 30

(a) Differentiate between structured, semi-structured and unstructured data.


(b) Describe the grid-based fuzzy model.

(c) Explain the Alon-Matias-Szegedy algorithm for second moments on the
given stream {a, b, c, b, d, a, c, d, a, b, d, c, a, a, b}.

(d) Explain market basket analysis with an example.


(e) Discuss the various classifications of visualization techniques.


SECTION C
3. Attempt any one part of the following: 10*1 = 10

(a) Discuss all the modern tools of data analytics.


(b) Describe all the phases of data analytics life cycle.


4. Attempt any one part of the following: 10*1 = 10


(a) Describe the PCA algorithm. Compute the principal component using the PCA
algorithm on the given data:
CLASS 1: X = 2, 3, 4
         Y = 1, 5, 3
CLASS 2: X = 5, 6, 7
         Y = 6, 7, 8

(b) Illustrate a Bayesian network with an example.

QP23DP1_290 | 24-01-2023 13:44:37 | 117.55.242.132


5. Attempt any one part of the following: 10*1 = 10
(a) Describe briefly the architecture of Data Stream Management
System (DSMS).
(b) Explain any one algorithm for counting ones in a window.

6. Attempt any one part of the following: 10*1 = 10


(a) For the given data, find the association rules using the Apriori algorithm.

(b) What do you mean by k-means clustering? How does the k-means
algorithm work? Write the k-means algorithm for partitioning.

7. Attempt any one part of the following: 10*1 = 10
(a) Define HDFS. Discuss the HDFS architecture and HDFS commands in brief.
(b) Differentiate between MapReduce and Apache Pig.
