Model question paper _Big data_2024-25_kca022

Fjj

Uploaded by

Pranjal chauhan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

201 views3 pages

Model question paper _Big data_2024-25_kca022

Fjj

Uploaded by

Pranjal chauhan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Model Question paper

UNIT -1
1. What is Big Data? why we need to analyze big data?
2. What are the benefits of big data?
3. Discuss challenges under big data .
4. How big data analytics will be useful in the development of smart cities?
5. Discuss big data in terms of volume and velocity.
6. What are the different types of big data technologies?
7. List and discuss various dimensions of Big data. Explain in detail industry
examples of big data?
8. Explain big data and Hadoop open source technology
9. Discuss and differentiate structured, unstructured and semi - structured data.
Give proper examples.
10. Explain 4 ‘V’s of big data with suitable example. Discuss how big data
analytics can be useful in the development of smart transports.
11. Discuss on how cloud and Big data related to each other?
12. How does Hadoop system analyze data? Explain your answer with example.
UNIT 2
13. How does Hadoop work? What are the advantages of Hadoop? What are the
different modes in which Hadoop can be installed and what is use of each
mode from application and developer point of view?
14. List down the tools related with Hadoop
15. What is Map reducing? Explain with neat sketch about the processing of a job
in Hadoop?
16. Explain the stages of map reduce program execution?
17. Explain the anatomy of map reduce job run ?
18. Define the role of combiner and partitioner in a map reduce application?
19. Specify the role of job tracker & task tracker in HDFS.
20. Explain shuffle and sort phase and reducer phase in map reduce?
21. Explain the role of driver code, mapper code and reducer code with in a map
reduce program model by a suitable example,
22. Explain briefly about the input and output format in mapreduce?
23. What are the various operational modes of Hadoop cluster configuration and
explain in detail about configuring/installing Hadoop in fully distributed
mode.
24. Explain about the implementation of map reduce concept with a small
example.
25. Discuss role of JobTracker and TaskTracker in processing data with Hadoop.
26. What is MapReduce? Explain working of various phases of MapReduce with
word count example.
27. Explain Hadoop architecture and its component with proper diagram
28. Discuss the different types and formats of Map Reduce with exam
29.
UNIT -3
30. Draw and explain HDFS architecture. Explain the function of name node and
data node? what is a secondary name node? Is it a substitute of Namenode?
31. How does HDFS ensure data integrity in a Hadoop cluster?
32. State the purpose of Hadoop pipes
33. Show on how a client read and write data in HDFS, Give an example code.
34. Discuss the design of Hadoop Distributed FileSystem(HDFS) and concept in
detail
35. Explain Avro file based data structures in detail?
36. Write the working procedure of HDFS and also explain the features of HDFS.
37. Give commands with appropriate arguments to perform data transfer between
local file system and HDFS.
38. With suitable block diagram explain architecture of HDFS.
39. Discuss role of Data node and Name node in HDFS
UNIT -4
1. Write a short note on NOSQL database. Compare & Contrast NOSQL
relational Database.
2. Describe about the graph database and schemaless database?
3. Explain the aggregate data models?List four advantages and disadvantages of
aggregate oriented database?
4. Explain master slave and peer to peer replication in detail?
5. List down the entities of YARN. What are the limitations of classic map
reduce? Compare classic map reduce with YARN? Discuss Hadoop YARN in
detail
6. Distinguish between the old and new versions of Hadoop ApI for Map
Reduce framework.
7. What is NoSQL database? Discuss key characteristics and advantages of
NoSQL database
8. Write a short note on Hadoop Ecosystem.
9. What is transformation and actions in Spark? Explain with example.
10. Discuss limitations of Hadoop and how it is overcome in Apache Spark.
11. Write a short note on Spark stack. Give brief explanation of each component.
12. What is RDD? Explain role of RDD in Spark.
13. Differentiate SQL and NoSQL databases. What are the applications of
NoSQL database?
14. Discuss Spark Streaming with suitable example such as analyzing tweets from
Twitter
15. What is MongoDB? Discuss important features of MongoDB.
16. Discuss different types of NoSQL databases with proper example.
17. Explain basic CRUD operations with example in MongoDB
18. Explain database, collection, document and fields with respect to MongoDB.
Also give its equivalent term in RDBMS.
19. Explain use of aggregate function in MongoDB with suitable example
20. List down the entity of YARN.

UNIT-5
21. Write a note on the use of Zookeeper?
21. Write in detail about Hbase data model and Pig data model?
22. What is the necessity of PIG Latin?
23. What are the components of pig execution environment?
24. Explain about the various data types supported by pig in its data model with
an example.
25. Explain the storage mechanism in Hbase? write a query to create a table in
hbase
26. Explain the metastore in HIVE.
27. Explain the architecture of HIVE with neat sketch.
28. What are views in hive .
29. what is the difference between internal and external tables in hive .
30. Explain about various data types supported by HiveQL with an example.
31. Define the various file formats supported by HIVE.Discuss the queries
involved in hive data definition?
32. Explain the Cassandra Data Model with examples? How Cassandra integrated
with Hadoop?
33. Explain the operators supported by Pig w.r.t. data access, transformations and
debugging operations.
34. How are Pig programs packaged and explain the modes of running a pig
script with a neat sketch.
35. Write Example Hive Queries for Natural Join and outer-Join .
36. What is HBase? Differentiate HBase and RDBMS.Explain H base and their
data model and implementations.
37. Explain in detail about the Hive data manipulation, queries, data definition
and data types.
38. Write a short note on Apache Pig.
39. What is HiveQL? Explain various statements in HiveQL with example.

1Z0 1105 22 Demo
No ratings yet
1Z0 1105 22 Demo
4 pages
CS6001-C Sharp and .NET Programming
No ratings yet
CS6001-C Sharp and .NET Programming
12 pages
Cryptography and Network Security Overview
No ratings yet
Cryptography and Network Security Overview
16 pages
B. SC Computer Science
100% (1)
B. SC Computer Science
5 pages
Web Programming Manual
No ratings yet
Web Programming Manual
34 pages
Internet Technology and Web Design Viva Questions: 1.what Is DNS?
No ratings yet
Internet Technology and Web Design Viva Questions: 1.what Is DNS?
7 pages
Question Paper Code:: Reg. No.
No ratings yet
Question Paper Code:: Reg. No.
2 pages
CCS 302 Human Computer Interaction - Notes
No ratings yet
CCS 302 Human Computer Interaction - Notes
82 pages
Unit 5 2 Marks
No ratings yet
Unit 5 2 Marks
10 pages
OBJECT ORIENTED SYSTEM DESIGN Question Paper 21 22
No ratings yet
OBJECT ORIENTED SYSTEM DESIGN Question Paper 21 22
3 pages
VTU B.E B.tech 2019 8th Semester July CBCS 15 Scheme 15CS833 Network Management
No ratings yet
VTU B.E B.tech 2019 8th Semester July CBCS 15 Scheme 15CS833 Network Management
2 pages
Input and Output Text and Binary I/O: Introduction To Java Y.Daniel Liang 1
No ratings yet
Input and Output Text and Binary I/O: Introduction To Java Y.Daniel Liang 1
64 pages
Ii Bca Java Notes (3.5 Units)
No ratings yet
Ii Bca Java Notes (3.5 Units)
87 pages
IMP Questions Software Testing
No ratings yet
IMP Questions Software Testing
4 pages
Web Application and Development: Lab Manual
No ratings yet
Web Application and Development: Lab Manual
52 pages
Iv Vsem Bca Blownup and Practical List
No ratings yet
Iv Vsem Bca Blownup and Practical List
28 pages
C#&.Net Programming Question Bank
50% (2)
C#&.Net Programming Question Bank
4 pages
BCA Slybus Purbanchal
0% (1)
BCA Slybus Purbanchal
106 pages
Fsd-Question Bank - Imp (Gtu Papers)
No ratings yet
Fsd-Question Bank - Imp (Gtu Papers)
2 pages
Past Paper Questions (Topic Wise)
100% (1)
Past Paper Questions (Topic Wise)
18 pages
NLP Asgn2
No ratings yet
NLP Asgn2
7 pages
CP7102-Advanced Datastructure and Algorithm Question Bank
No ratings yet
CP7102-Advanced Datastructure and Algorithm Question Bank
4 pages
Module-1 Introduction To File Structures
No ratings yet
Module-1 Introduction To File Structures
50 pages
Question Bank
No ratings yet
Question Bank
16 pages
Web Lab Report 2
No ratings yet
Web Lab Report 2
6 pages
IGNOU MCS-011 Previous Years Questions
No ratings yet
IGNOU MCS-011 Previous Years Questions
64 pages
Hpu Bca 4th Sem Paper of Internet Technology&webpagedesign
No ratings yet
Hpu Bca 4th Sem Paper of Internet Technology&webpagedesign
4 pages
Updated 5th and 6th Sem 2021 Scheme and Syllabus
No ratings yet
Updated 5th and 6th Sem 2021 Scheme and Syllabus
71 pages
C++ Question Bank
No ratings yet
C++ Question Bank
28 pages
Web Lab Ex 1-10
No ratings yet
Web Lab Ex 1-10
26 pages
2020 - Marking - HNDIT2313 Object Oriented Analysis and Design
No ratings yet
2020 - Marking - HNDIT2313 Object Oriented Analysis and Design
7 pages
Mobile Computing Unit I Wireless Communication Fundamentals
No ratings yet
Mobile Computing Unit I Wireless Communication Fundamentals
19 pages
Requirements Modeling
No ratings yet
Requirements Modeling
39 pages
Oops Using C++ Notes
No ratings yet
Oops Using C++ Notes
66 pages
SQT - Question Papers
0% (1)
SQT - Question Papers
7 pages
DBMS (UNIT-6) (Advances in Databases and Big Data)
No ratings yet
DBMS (UNIT-6) (Advances in Databases and Big Data)
103 pages
Advance Computer Architecture: Unit:Ii System Interconnect Architectures
No ratings yet
Advance Computer Architecture: Unit:Ii System Interconnect Architectures
53 pages
BCA Project Report Format
No ratings yet
BCA Project Report Format
3 pages
Digital Logic Design Jan 2023
No ratings yet
Digital Logic Design Jan 2023
8 pages
Fill in The Blanks: Is A Physical or Conceptual Connection Between Objects
No ratings yet
Fill in The Blanks: Is A Physical or Conceptual Connection Between Objects
3 pages
Mc5024-Web Design Model
No ratings yet
Mc5024-Web Design Model
2 pages
Mobile Computing Unit III
No ratings yet
Mobile Computing Unit III
17 pages
AOOP-4340701-Lab Manual (1) Added Page
No ratings yet
AOOP-4340701-Lab Manual (1) Added Page
301 pages
Data Structure MCA Question Bank
100% (1)
Data Structure MCA Question Bank
8 pages
Human Computer Interaction
No ratings yet
Human Computer Interaction
1 page
AI Lab Manual
No ratings yet
AI Lab Manual
37 pages
Mobile Application Dev
No ratings yet
Mobile Application Dev
104 pages
Anna University Questions Department of CSE III Year CS1005 - Advanced Java Programming (Elective) Unit I 2 Marks
No ratings yet
Anna University Questions Department of CSE III Year CS1005 - Advanced Java Programming (Elective) Unit I 2 Marks
5 pages
Te Aids - (Elective-I) Human Computer Interface
No ratings yet
Te Aids - (Elective-I) Human Computer Interface
2 pages
web programming BCA - unit 1 study materials(BHARATHIAR UNIVERSITY)
No ratings yet
web programming BCA - unit 1 study materials(BHARATHIAR UNIVERSITY)
9 pages
V Sem Solution Bank
100% (1)
V Sem Solution Bank
303 pages
Principles of Compiler Design
No ratings yet
Principles of Compiler Design
36 pages
Content Beyond Syllabus
No ratings yet
Content Beyond Syllabus
7 pages
CCA3002 - FOG-AND-EDGE-COMPUTING - LT - 1.0 - 34 - Fog and Edge Computing
No ratings yet
CCA3002 - FOG-AND-EDGE-COMPUTING - LT - 1.0 - 34 - Fog and Edge Computing
3 pages
HCI Unit IV
No ratings yet
HCI Unit IV
34 pages
CS2402 Mobile and Pervasive Computing Syllabus
No ratings yet
CS2402 Mobile and Pervasive Computing Syllabus
1 page
JAVA Sample Questions For Practice (II CSE - A' & II IT - B')
No ratings yet
JAVA Sample Questions For Practice (II CSE - A' & II IT - B')
5 pages
Software Engineering Question Paper
No ratings yet
Software Engineering Question Paper
9 pages
Mastering WebGL: Crafting Advanced 3D Web Experiences: WebGL Wizadry
From Everand
Mastering WebGL: Crafting Advanced 3D Web Experiences: WebGL Wizadry
Kameron Hussain
No ratings yet
BDAV Question Bank
No ratings yet
BDAV Question Bank
2 pages
Question Bank - Big Data Analytics - Final1
100% (1)
Question Bank - Big Data Analytics - Final1
6 pages
LLM Graph RAG From Scratch
No ratings yet
LLM Graph RAG From Scratch
22 pages
Chapter 2 Modeling Data in The Organization
No ratings yet
Chapter 2 Modeling Data in The Organization
48 pages
Unit 16
No ratings yet
Unit 16
24 pages
7 Business Intelligence Lifecycle 03-01-2025
No ratings yet
7 Business Intelligence Lifecycle 03-01-2025
8 pages
Credential Hunting_OSCP
No ratings yet
Credential Hunting_OSCP
9 pages
Walden University RSCH 8210: Quantitative Reasoning and Analysis Dr. Randy Heinrich September 19, 2021
No ratings yet
Walden University RSCH 8210: Quantitative Reasoning and Analysis Dr. Randy Heinrich September 19, 2021
9 pages
Chethan-Advanced Database-Quiz
No ratings yet
Chethan-Advanced Database-Quiz
20 pages
Inf1343 2011W Assignment 1
No ratings yet
Inf1343 2011W Assignment 1
4 pages
Venkatesh - SK
No ratings yet
Venkatesh - SK
4 pages
Exercises chap 9
No ratings yet
Exercises chap 9
4 pages
Back-End Assignment-Fullstack Intern OSUMARE
No ratings yet
Back-End Assignment-Fullstack Intern OSUMARE
2 pages
Aarya AI Report
No ratings yet
Aarya AI Report
22 pages
Power Platform Fundamentals (PL-900)
No ratings yet
Power Platform Fundamentals (PL-900)
66 pages
A Road Map For Data Science. What Is Data Science - by Jared - Towards Data Science PDF
No ratings yet
A Road Map For Data Science. What Is Data Science - by Jared - Towards Data Science PDF
6 pages
Blazor Webassembly Succinctly PDF
No ratings yet
Blazor Webassembly Succinctly PDF
105 pages
Referential Integrity in Databases
No ratings yet
Referential Integrity in Databases
16 pages
Document 2380444.1
No ratings yet
Document 2380444.1
3 pages
AIDA Booklet V2.2
No ratings yet
AIDA Booklet V2.2
37 pages
Azure Training Draft 2
No ratings yet
Azure Training Draft 2
4 pages
eBAY QA 1
No ratings yet
eBAY QA 1
10 pages
DBMSFinal Jan15
No ratings yet
DBMSFinal Jan15
6 pages
1ST ASSIGNMENT Implementation of DDL and DML Queries 1
No ratings yet
1ST ASSIGNMENT Implementation of DDL and DML Queries 1
3 pages
Tandem-EnFORM Users Guide
No ratings yet
Tandem-EnFORM Users Guide
236 pages
DBMS Tutorial - 2 Solutions Final
No ratings yet
DBMS Tutorial - 2 Solutions Final
17 pages
CV RomanWAW
No ratings yet
CV RomanWAW
2 pages
C_ABAPD_2309
No ratings yet
C_ABAPD_2309
55 pages
Forest Inventory Design Principles - Challenges and Solutions
No ratings yet
Forest Inventory Design Principles - Challenges and Solutions
6 pages
8960 - DWM Experiment 2
No ratings yet
8960 - DWM Experiment 2
15 pages
Resume - Muhammad Muzammil Sabir - ETL
No ratings yet
Resume - Muhammad Muzammil Sabir - ETL
3 pages

Model question paper _Big data_2024-25_kca022

Uploaded by

Model question paper _Big data_2024-25_kca022

Uploaded by

Model Question paper

You might also like