University of Petroleum and Energy Studies, Dehradun: Assignment #1, Summer School 2020

The document is a 4-question, 15-mark assignment for a "Disk based Processing" course. It asks students to: [1] explain what happens if more than one client tries to access the same file on HDFS; [2] state why mappers cannot perform aggregation and why reducers are needed; [3] explain the function of the MapReduce partitioner; [4] describe in detail the concrete implementations of the Hadoop FileSystem abstract class other than HDFS. The questions map to course outcomes CO1 to CO3.

Uploaded by: Ajay Rawat
Copyright: © All Rights Reserved

UNIVERSITY OF PETROLEUM AND ENERGY STUDIES, DEHRADUN

Assignment #1, Summer School 2020


Programme Name : B.Tech (CSE - Big Data)    Semester : V
Course Name : Disk based Processing
Course Code : CSBD3001    Max. Marks : 15
No. of page(s) : 01
Instructions : Answer the following questions.

S. No. | Question | Marks | CO
Q1 | What happens if more than one client tries to access the same file on HDFS? | 3 | CO1
Q2 | State the reason why we can't perform "aggregation" (addition) in a mapper. Why do we need the reducer for this? | 3 | CO2
Q3 | Explain the function of the MapReduce Partitioner. | 5 | CO3
Q4 | Hadoop has an abstract notion of filesystems, of which HDFS is just one implementation. The Java abstract class org.apache.hadoop.fs.FileSystem represents the client interface to a filesystem in Hadoop. What are the other concrete implementations? Describe them in detail. | 4 | CO2
