0% found this document useful (0 votes)
1K views4 pages

Etl Sample Paper

The document contains sample question papers and test papers for the course Big Data Analytics. It includes questions about key concepts like Hadoop, Hive, Spark, and data analytics. The questions cover topics such as defining Big Data, describing Hadoop features, explaining MapReduce and HDFS, writing Hive queries, and more.

Uploaded by

workwithsnehh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
1K views4 pages

Etl Sample Paper

The document contains sample question papers and test papers for the course Big Data Analytics. It includes questions about key concepts like Hadoop, Hive, Spark, and data analytics. The questions cover topics such as defining Big Data, describing Hadoop features, explaining MapReduce and HDFS, writing Hive queries, and more.

Uploaded by

workwithsnehh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

Scheme - I

Sample Question Paper


Program Name : Diploma in Artificial Intelligence and Machine Learning
Program Code : AN
22684
Semester : Sixth
Course Title : Big Data Analytics
Marks : 70 Time: 3 Hrs.

Instructions:
(1) All questions are compulsory.
(2) Illustrate your answers with neat sketches wherever necessary.
(3) Figures to the right indicate full marks.
(4) Assume suitable data if necessary.
(5) Preferably, write the answers in sequential order.

Q.1) Attempt any FIVE of the following. 10 Marks


a) Define Big Data.
b) State the importance of Big Data Analytics.
c) State the various raw data sources.
d) Enlist any two key advantages of Hadoop.
e) State the any two complex data type of Hive.
f) Define RDD.
g) State the use of SPARK SQL.

Q.2) Attempt any THREE of the following. 12 Marks


a) Explain the challenges with Big Data Analytics.
b) State any four importance of HADOOP.
c) Explain any one domain specific example of Big Data.
d) Describe HDFS.

Q.3) Attempt any THREE of the following. 12 Marks


a) Describe classification of Big Data Analytics.
b) State different types of data analytics.
c) Describe data preparation process with an example.
d) State any four data frame operations in SPARK session.
Q.4) Attempt any THREE of the following. 12 Marks
a) Compare RDBMS versus Hadoop.
b) Describe any four Hive data types.
c) Explain Hive file format.
d) Describe data processing in HADOOP.
e) Write and explain the Scala/Python code to create the Spark session.

Q.5) Attempt any TWO of the following. 12 Marks


a) Describe the responsivities of Data Scientist.
b) Describe mapping analysis flow to big data stack.
c) Write syntax and example of Hive Query commands for following.
(i) Create table
(ii) Alter Table
(iii) loading data into table from file

Q.6) Attempt any TWO of the following. 12 Marks


a) Describe Hive architecture.
b) Write a code for building Spark SQL application with SBT.
c) Explain Apache Spark Architecture
Scheme - I
Sample Test Paper - I
Program Name : Diploma in Artificial Intelligence and Machine
Learning
Program Code : AN 22684
Semester : Sixth
Course Title : Big Data Analytics
Marks : 20 Time: 1 Hour

Instructions:
(1) All questions are compulsory.
(2) Illustrate your answers with neat sketches wherever necessary.
(3) Figures to the right indicate full marks.
(4) Assume suitable data if necessary.
(5) Preferably, write the answers in sequential order.

Q.1) Attempt any FOUR. 08 Marks


a) Define Big Data Analytics.
b) State the characteristics of data.
c) State different Big Data Stack.
d) List domain specific examples of Big Data.
e) State the features of Hadoop.

Q.2) Attempt any THREE. 12 Marks


a) Explain Data Science.
b) Explain analytics flow for Big Data.
c) Explain Data Collection process of Big Data with example.
d) Describe HDFS.
Scheme - I
Sample Test Paper - II
Program Name : Diploma in Artificial Intelligence and Machine
Learning
Program Code : AN 22684
Semester : Sixth
Course Title : Big Data Analytics
Marks : 20 Time: 1 Hour

Instructions:
(1) All questions are compulsory.
(2) Illustrate your answers with neat sketches wherever necessary.
(3) Figures to the right indicate full marks.
(4) Assume suitable data if necessary.
(5) Preferably, write the answers in sequential order.

Q.1) Attempt any FOUR. 08 Marks


a) Enlist key advantages of Hadoop.
b) State the use of HIVE.
c) Write syntax for loading data into table from file in HIVE
d) State the Spark Components.
e) Define RDD.

Q.2) Attempt any THREE. 12 Marks


a) Compare RDBMS versus Hadoop.
b) Explain SERDE.
c) Describe Apache Spark Architecture.
d) Describe Data Frame Operations.

You might also like