0% found this document useful (0 votes)
153 views

OEC - CS801A Big Data Analysis

The document discusses topics related to big data analysis including MapReduce, HDFS, NoSQL databases, and Hadoop. It contains questions that assess understanding of key characteristics, components, and applications of big data systems. Example questions include defining Business Intelligence, explaining the CAP theorem, and describing features of Hadoop and NoSQL databases.

Uploaded by

Binit Karmakar
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
153 views

OEC - CS801A Big Data Analysis

The document discusses topics related to big data analysis including MapReduce, HDFS, NoSQL databases, and Hadoop. It contains questions that assess understanding of key characteristics, components, and applications of big data systems. Example questions include defining Business Intelligence, explaining the CAP theorem, and describing features of Hadoop and NoSQL databases.

Uploaded by

Binit Karmakar
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

CS/B.

TECH(N)/EVEN/SEM-8/8303/2022-2023/I195
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY, WEST BENGAL
Paper Code : OEC- CS801A Big Data Analysis
UPID : 008303

Time Allotted : 3 Hours Full Marks :70


The Figures in the margin indicate full marks.
Candidate are required to give their answers in their own words as far as practicable

Group-A (Very Short Answer Type Question)


1. Answer any ten of the following : [ 1 x 10 = 10 ]
(I) ________ operator executes a shell command from the Hive shell
(II) The data being captured can be any form or structure. Which characteristic of Big data are we talking about?
(III) Which NoSQL database store data in nodes and edges?
(IV) Which term is used to denote the small subsets of a large file created by HDFS?
(V) YARN stands for: _______________
(VI) Is HBase a NoSQL database?
(VII) _____the step is performed by data scientist after acquiring the data.
(VIII) NoSQL databases is used mainly for handling large volumes of ______________ data.
(IX) What is the default file size of an HDFS data block?
(X) A ________ node acts as the Slave and is responsible for executing a Task assigned to it by the JobTracker.
(XI) Most NoSQL databases support automatic __________ meaning that you get high availability and disaster
recovery.
(XII) The ____________ and the EditLog are central data structures of HDFS.

Group-B (Short Answer Type Question)


Answer any three of the following : [ 5 x 3 = 15 ]
2. Explain the difference between parallel and distributed computing system? [5]
3. What is the role of Name Node in an HDFS cluster? [5]
4. Define Business Intelligence(BI). [5]
5. Explain the CAP theorem . [5]
6. As an HR manager of a company providing Big Data solutions to clients, what characteristics would you [5]
look for while recruiting a potential candidate for the position of a data analyst?

Group-C (Long Answer Type Question)


Answer any three of the following : [ 15 x 3 = 45 ]
7. (a) Explain application in BigData in Healthcare. [6]
(b) How do you deploy a Big Data solution? [9]
8. (a) What is NoSQL database? [2]
(b) Why it is used? [3]
(c) What are the advantages of NoSQL? [5]
(d) List the differences between NoSQL and relational databases. [5]
9. (a) Explain some important features of Hadoop. [5]
(b) Explain the different modes in which Hadoop run. [5]
(c) Explain the core components of Hadoop. [5]
10. (a) What do you mean by schema-less databases? [3]
(b) Explain types of NoSQL Databases? [ 12 ]
11. (a) Explain Job Scheduling in MapReduce. How it is done in case of (i)The Fair Scheduler (ii)The [ 10 ]
Capacity Scheduler.
(b) Write a short note on the FileInputFormat class. [3]
(c) Explain the types of MapReduce applications. [2]

1/1

You might also like