0% found this document useful (0 votes)
211 views5 pages

BDA Assignments

The document outlines an assignment with 15 batches of questions related to big data concepts. The questions cover topics like Hadoop architecture, MapReduce, NoSQL databases, and analytics tools. Students are asked to explain concepts, draw architectures, list components, and describe use cases. The questions are meant to assess students' understanding of big data fundamentals and how related frameworks address storage, processing, and analytics challenges at scale.

Uploaded by

Sana Hosaritti
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
211 views5 pages

BDA Assignments

The document outlines an assignment with 15 batches of questions related to big data concepts. The questions cover topics like Hadoop architecture, MapReduce, NoSQL databases, and analytics tools. Students are asked to explain concepts, draw architectures, list components, and describe use cases. The questions are meant to assess students' understanding of big data fundamentals and how related frameworks address storage, processing, and analytics challenges at scale.

Uploaded by

Sana Hosaritti
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
You are on page 1/ 5

ASSIGNMENT - 1

Module – 1
Q. RBT
Description of Questions Marks
No. Level
BATCH 1
What is big data? Explain classification of data.
1 5 L2
What are characteristics of big data? Explain big data types.
2 5 L2
Explain how big data used in chocolate marketing company.
3 5 L2
Explain how to implement vertical scalability in big data analytics.
4 5 L2
Draw and explain layers and functions in data processing
5 architectures. 5 L2

BATCH 2
Explain how big data used in weather data recording, monitoring and
1 prediction organization. 5 L2

List and explain parameters of good quality data.


2 5 L2
Explain different activities in data pre-processing.
3 5 L2
Draw and explain data store export to cloud.
4 5 L2
Draw and explain Google cloud platform for bigquery cloud service.
5 5 L2
BATCH 3
Explain how data store is used with structured and semi structured
1 data. 5 L2

List and explain different big data storages.


2 5 L2
Write different phases in analytics.
3 5 L2
Explain how big data used in automotive components and predictive
4 maintenance services. 5 L2

Explain how to implement horizontal scalability in big data analytics.


5 5 L2
ASSIGNMENT - 2
Module – 2
Q. RBT
Description of Questions Marks
No. Level
BATCH 1
Draw and explain Hadoop architecture.
1 5 L2
How does Hadoop works.
2 5 L2
Explain different components of Hadoop ecosystem.
3 5 L2
Explain components of Hadoop distributed file system.
4 5 L2
List and explain Hadoop user commands.
5 5 L2
BATCH 2
Draw and explain Hadoop MapReduce architecture.
1 5 L2
Explain following terminologies related to Hadoop MapReduce
architecture.
2 5 L2
1) Payload 2) Mapper 3) Name Node 4) Data Node 5) master
Node

Explain how YARN manages resources in Hadoop architecture.


3 5 L2
Explain the main components of YARN architecture.
4 5 L2
What is HBase? How it store semi structured data?
5 5 L2
BATCH 3
Explain two use cases of HBase.
1 5 L2
Draw Hive architecture and explain how it process query.
2 5 L2
Explain components of Apche pig.
3 5 L2
Explain big data import and export using Sqoop.
4 5 L2
Explain how Apache Flume is used to process web log data.
5 5 L2
ASSIGNMENT - 3
Module – 3
Q. RBT
Description of Questions Marks
No. Level
BATCH 1
Explain features of distributed computing architectures
1 5 L2
Explain CAP theorem.
2 5 L2
Explain characteristics of schema less model.
3 5 L2
Draw and explain flexible NoSQL DB of students.
4 5 L2
Explain key-value store. Also explain advantages of key-value
5 store. 5 L2

BATCH 2
Write features of document store.
1 5 L2
Explain CSV and JSON file format.
2 5 L2
With example explain XML document architecture pattern.
3 5 L2
Explain columnar data store with example.
4 5 L2
Write and explain characteristics of columnar family data store.
5 5 L2
BATCH 3
Explain features of BigDataTable.
1 5 L2
Draw and explain ORC file format.
2 5 L2
Explain characteristics of big data NoSQL solutions.
3 5 L2
Explain any two sharding models.
4 5 L2
Explain features of MongoDB.
5 5 L2
ASSIGNMENT - 4
Module – 4
Q. RBT
Description of Questions Marks
No. Level
BATCH 1
Draw and explain the MapReduce process on client submitting a job.
1 5 L2
With sample code, explain map task.
2 5 L2
Explain following terms, Inputsplit, RecordReader, combiner.
3 5 L2
Draw ad explain MapReduce execution steps.
4 5 L2
Explain MapReduce for ACPAMS data analysis.
5 5 L2
BATCH 2
Explain how node failures are handled.
1 5 L2
Explain various operations of MapReduce.
2 5 L2
Explain composing of MapReduce for different types of calculations.
3 5 L2
Explain cascade steps for multiplication of two matrices.
4 5 L2
Explain main features of Hive.
5 5 L2
BATCH 3
Draw and explain architecture of Hive.
1 5 L2
Draw and explain Hive data flow sequences and workflow steps.
2 5 L2
Explain Hive data definition language.
3 5 L2
Write features of pig.
4 5 L2
Draw an explain pig architecture.
5 5 L2
ASSIGNMENT - 5
Module – 5
Q. RBT
Description of Questions Marks
No. Level
BATCH 1
Explain linear and non linear relationship between data points.
1 5 L2
Explain terms outliers and variance.
2 5 L2
Explain how to calculate standard deviation and standard error.
3 5 L2
Explain simple linear regression.
4 5 L2

5 List and explain examples of modeling using regression. 5 L2


BATCH 2
Explain applications of text mining.
1 5 L2
Draw and explain the text mining process.
2 5 L2
Explain feature generation phase in text mining.
3 5 L2
Explain Naive Bayes analysis.
4 5 L2
Explain how support vector machine can be used for classification.
5 5 L2
BATCH 3
Draw and explain taxonomy of web mining.
1 5 L2
Explain different tasks in web content analysis.
2 5 L2
Draw and explain web usage mining.
3 5 L2
Explain page rank algorithm using in degrees.
4 5 L2
Explain centralities, ranking and anomaly detection in social network
5 graph. 5 L2

You might also like