University of Petroleum and Energy Studies, Dehradun: Assignment #2, Summer School 2020

This document is a five-question, 15-mark assignment for the Big Data course Disk based Processing. The questions cover key MapReduce concepts: the configuration parameters needed to run a job, the distributed cache, Hadoop daemon processes, combiners, and the differences between Apache Spark and MapReduce.

Uploaded by

Ajay Rawat

UNIVERSITY OF PETROLEUM AND ENERGY STUDIES, DEHRADUN

Assignment #2, Summer School 2020


Programme Name  : B.Tech (CSE - Big Data)        Semester   : V
Course Name     : Disk based Processing
Course Code     : CSBD3001                       Max. Marks : 15
No. of page(s)  : 01
Instructions    : Answer the following questions.

S. No.  Question                                                Marks  CO
Q1      What are the main configuration parameters that a       3      CO1
        user needs to specify to run a MapReduce job?
Q2      Explain what the distributed cache is in the            3      CO2
        MapReduce framework.
Q3      How many daemon processes run on a Hadoop system?       3      CO1
        Explain.
Q4      Explain what a combiner is and when you should use      3      CO3
        a combiner in a MapReduce job.
Q5      Distinguish between Apache Spark and MapReduce.         3      CO4
