0% found this document useful (0 votes)
0 views4 pages

QB

The document outlines various topics related to big data, data analytics, and Hadoop technologies, including definitions, characteristics, and comparisons of different systems and tools. It also covers concepts such as HDFS, cloud computing techniques, and NoSQL databases, along with their features and applications. Additionally, it discusses phases of data processing, analytics architecture, and specific tools like Pig, Hive, and MongoDB.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
0 views4 pages

QB

The document outlines various topics related to big data, data analytics, and Hadoop technologies, including definitions, characteristics, and comparisons of different systems and tools. It also covers concepts such as HDFS, cloud computing techniques, and NoSQL databases, along with their features and applications. Additionally, it discusses phases of data processing, analytics architecture, and specific tools like Pig, Hive, and MongoDB.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

M1- M2

1. A. Define big data explain characteristics of big data


B. Explain the following
a) Vertical scalability
b) Horizontal capability
c) Massively parallel platform
2. A. Define data analytics. Explain the types of data analytics
OR
B. Explain the 5 layers of data processing architecture
3. A. Illustrate the various phases involved in Big Data Analytics with neat diagram.
OR
B. Explain the classification of cloud computing techniques
4. A. Explain the Hadoop Distributed File system design features and discuss the various
system roles in an HDFS components or deployment?
OR
B. Explain HDFS safe mode and rack awareness.
5. A. Discuss the HDFS high availability design
OR
B. Explain the HDFS Name Node federation with example and discuss various HDFS user
commands.

6. A. Define Big data, list and explain Big data types.


OR

B. Illustrate the various phases involved in Big Data Analytics with neat diagram.
7. A. Explain the 5 layers of data processing architecture.
OR
B. Define data analytics. Explain the types of data analytics.
8. A. Explain the classification of cloud computing techniques
OR
B. Explain bigdata platform, and list the applications of big data.
9. A. Explain the Hadoop Distributed File system design features and discuss the various
system roles in an HDFS components or deployment?

OR
B. Explain the HDFS Name Node federation with example and discuss various HDFS user
commands.
10. A. Discuss the HDFS high availability design
OR
B. Explain HDFS safe mode and rack awareness.

M2 – M3

Q.1 A. Discuss HBase by explaining Hbase data model overview. Differentiate HBase and
HDFS.
OR
B. Discuss any 3 tools of Hadoop.

Q.2 A. With a neat diagram explain the 2 steps SQOOP data import and export
OR
B. What is NOSQL. List and explain NOSQL data store characteristics

Q.3 A. Distinguish between NOSQL and SQL RDBMS


OR
B. Explain graph database with characteristics

Q.4 A. Explain CAP theorem with a neat diagram


OR
B. Explain the characteristics of schema less model

Q.5 A. Discuss the functions of MangoDB query language and database commands.
OR
B. Illustrate the CQL commands and their functionality

Q.1 A. Discuss the following Hadoop tools a) Pig b) Hive c) YARN


OR
B. Explain Hbase data model overview. Differentiate HBase and HDFS.

Q.2 A. Discuss Apache Hadoop SQOOP by clearly explaining import and Export operations.
OR
B. Distinguish between OLTP and OLAP by taking suitable example.

Q.3 A. Explain the characteristics of schema less model with examples.


OR
B. Briefly explain any 3 NO-SQL data architecture patterns.

Q.4 A. Discuss Shared Nothing (SN) architecture for big data tasks
OR
B. Discuss the functions of MangoDB query language and database commands.

Q.5 A. Explain CAP theorem with a neat diagram


OR
B. Illustrate the CQL commands and their functionality

Q.1 A. Distinguish between RDBMS and HIVE


OR
B. Explain the following
a. HIVE data types
b. Collection data types
c. HIVE data model
d. HIVE built instruction

Q.2 A. Explain the following concepts with respect to PIG


a. Application and features of PIG
OR
B. Compare PIG and Map reduce

Q.3 A. Explain PIG architecture with neat diagram


OR
B. Describe the MapReduce execution steps with neat diagram

Q.4 A. Describe the regression analysis predict the value of the dependent variable in case of
linear regression
OR
B. Discuss Analysis of Variances (ANOVA) and correlation indicators of linear
relationship

Q.5 A. Describe the web content mining and three phases for web usage mining
OR
B. Illustrate the various phases in text mining process pipeline

Q.1 A. Discuss the functions of Group By, partitioning and combining using one example for
each
OR
B. Describe the MapReduce execution steps with neat diagram

Q.2 A. Illustrate main features and Architecture of Hive with neat diagram.
OR
B. Discuss the pig Latin data types and examples

Q.3 A. Explain PIG architecture with neat diagram


OR
B. Explain Matrix Vector multiplication process by Map-Reduce

Q.4 A. Explain the following with reference to Machine Learning


a. Outliers b. Standard deviation Standard Error Estimates c. variance d. Kernel functions
OR
B. Discuss Analysis of Variances (ANOVA) and correlation indicators of linear
relationship

Q.5 A. Describe the web content mining and three phases for web usage mining
OR
B. Describe the regression analysis predict the value of the dependent variable in case of
linear regression

You might also like