III-II Big Data Analytics Question Bank

This document contains a question bank for the subject "Big Data Analytics" for the third year, second semester students of SRI VENKATESWARA COLLEGE OF ENGINEERING. It includes 2 marks questions covering 5 units and 10 marks questions covering the same units. The units cover topics like introduction to big data, Hadoop distributed file system, MapReduce framework, Pig and Hive. The question bank aims to assess students' understanding of key concepts related to big data technologies at different cognitive levels based on Bloom's taxonomy.

Uploaded by

UDAY REDDY

SRI VENKATESWARA COLLEGE OF ENGINEERING

(Autonomous)
Karakambadi Road, TIRUPATI – 517507

Question Bank - 10 Marks Questions

Name of the subject III B.Tech II Sem Regular 2023


Subject Big Data Analytics
Subject Code CS20APE602
Year & Sem III Year & II Sem

2 Marks Questions

Unit – 1
1 What is the need of Big Data?
2 What are the major technological challenges in managing Big Data?
3 List the technologies available to manage Big Data.
4 Discuss why Big Data analytics is important.
5 Define Big Data.
Unit – 2
1 Discuss in brief the term HDFS in the Hadoop environment.
2 What is the key-value pair format? How is it different from other data structures?
3 What is the JobTracker program? How does it differ from the TaskTracker program?
4 What is the default replication factor in HDFS?
5 What is YARN?
Unit – 3
1 Discover the steps involved in running a MapReduce application.
2 List the types of MapReduce applications.
3 What is the task of the Mapper?
4 How does the Secondary NameNode differ from the NameNode in HDFS?
5 How much memory does a NameNode need?
Unit – 4
1 Explain the core components of Hadoop.
2 List out the benefits of Pig.
3 Specify the role of Pig in Hadoop.
4 How is security provided in Hadoop?
5 List the data processing operators in Pig.
Unit – 5
1 What are the differences between Pig and Hive?
2 What is Hive? List any four main features of Hive.
3 List the main features of Spark.
4 What are the advantages of HBase?
5 What do you mean by windowing in HiveQL?
10 Marks Questions
No. Question Bloom's Level COs
Unit – 1
1 What is Big Data? Explain the characteristics of Big Data. L1 CO1
2 List various applications of Big Data. What are the challenges in improving business for a superstore? L2 CO1
3 Describe the structure of HDFS in a Hadoop ecosystem using a diagram. L2 CO1
4 Compare Big Data with conventional data and indicate the importance of Big Data analysis. L2 CO1
5 Explain how to analyze data with Hadoop, with suitable diagrams and an example. L1 CO1
6 What are the major sources of Big Data? Describe a source of each type. L2 CO1
7 Describe the architecture of Hadoop technology. L2 CO1
8 List and explain the advantages of Big Data analytics. L1 CO1
Unit – 2
1 Explain the design of HDFS and HDFS concepts. L1 CO3
2 Explain Blocks, NameNodes, DataNodes and Block Caching concepts in HDFS. L2 CO2
3 Discuss the architecture of the Hadoop Distributed File System. L2 CO3
4 Explain how to set up the development environment for MapReduce. L3 CO2
5 Describe in brief the API for the MapReduce framework. L2 CO2
6 Enumerate the steps to create a word count application using MapReduce. L2 CO2
7 Explain the importance of the command-line interface in Hadoop. L1 CO2
8 Describe the working of MapReduce with a relevant example. L2 CO2
9 Briefly explore the features of MapReduce. L1 CO2
Unit – 3
1 Discuss in brief the implementation of the MapReduce concept with a suitable example. L2 CO3
2 Explain in brief MapReduce types, input formats, and output formats. L2 CO3
3 Explain the framework of MapReduce. L1 CO3
4 List and explain the features of the MapReduce programming model. How does a MapReduce program enable parallel processing? L2 CO3
5 Describe the anatomy of a MapReduce job. L2 CO3
6 With a neat sketch, briefly discuss the anatomy of a MapReduce job run. L2 CO3
7 Discuss how security is provided in Hadoop. L2 CO4
8 How is a map task implemented using key-value pairs from an input file? What are the uses of shuffle in processing aggregates? L3 CO4
Unit – 4
1 Derive the core components of the Hadoop cluster. L2 CO5
2 Write in detail about the network topology of Hadoop cluster architecture. L2 CO5
3 Explain in detail the Master and Slave components of the Hadoop cluster. L2 CO5
4 What is a Hadoop cluster? Write the steps to configure a Hadoop cluster. L2 CO5
5 Write in detail about user-defined functions in Pig. L1 CO6
6 Describe Pig data types and operators: Group, Join, Filter, Order by, Sort and Split. L2 CO4
7 Discuss how security is provided in Hadoop. L1 CO5
8 Explain the data processing operators in Pig. L1 CO6
Unit – 5
1 Explain with suitable examples the built-in functions in Hive. L2 CO6
2 Compare Hive with traditional databases. L1 CO6
3 Describe the Hive architecture components. Why is HiveQL used for Big Data? L2 CO6
4 What is HBase? Give a detailed description of the features of HBase. L2 CO6
5 Discuss in detail how Spark runs a job. L2 CO6
6 Explain the concept of Resilient Distributed Datasets in Spark. L1 CO6
7 What is HBase? Differentiate between HBase and Hive. L1 CO6
8 Construct an online query application using HBase. L2 CO6

Signature of the Faculty Signature of the HOD
