Difference Between Big Data and Hadoop

Rashmi Karan
Manager - Content

Updated on Nov 26, 2021 18:01 IST

IDC predicted that global data volume would grow from 4.4 zettabytes to 44 zettabytes
between 2013 and 2020. By 2025, IDC predicts that there will be 163 zettabytes of
data from mobile devices, Internet of Things devices with information sensing,
remote sensing, software logs, cameras, microphones, RFID readers, and wireless
sensor networks. When we talk about big data, Hadoop often comes into the
picture, and people use the two terms interchangeably. However, there is a difference
between big data and Hadoop. Let us check it out.

Disclaimer: This PDF is auto-generated based on the information available on Shiksha as
on 01-Nov-2023.

Big Data

The term Big Data refers to data sets so large that specific techniques and tools
become necessary to deal with them. Because of their size, speed of growth, and
variability, traditional technologies and methods are not enough to manage big data
efficiently.

Among the tools designed to handle large amounts of data is specialized software,
generally distributed and capable of scaling with the volume and speed at which the
data is generated. Current uses of big data include predictive analytics, user
behavior analytics, and other advanced data analytics methods that extract value
from the data. However, no specific data size is defined for a data set to be
called Big Data.

Importance of Big Data

Generating massive data and then storing, processing, and analyzing it have become
critical for many organizations, and big data is now one of the sectors with the
most growth and career prospects. The Big Data sector, including the internet of
things, cloud computing, artificial intelligence, and automation, is expected to
quadruple its market valuation by 2025.

The value that organizations can extract from this data lies in making better
strategic decisions, developing mathematical models, building artificial
intelligence applications, and so on. In many cases, analyzing the data an
organization collects can give clues and ideas about new problems and answer
questions based on objective information, which increases security and confidence.

Hadoop

Hadoop is an open-source framework for storing and processing massive data of any
type. It can run a very large number of tasks with great processing power and
return quick responses to queries about the stored data. The main purpose of the
framework is to store large amounts of data and allow queries on that data with a
low response time. This is achieved through the distributed execution of code
across multiple nodes (machines), each of which is in charge of processing a part
of the work to be done.
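
The idea of each node handling a part of the work can be illustrated with a minimal Python sketch. This is not Hadoop code: the worker threads only stand in for cluster machines, and the names split_into_parts, process_part, and distributed_sum are hypothetical, chosen for this illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_parts(records, num_nodes):
    """Deal records out round-robin so each node gets a roughly equal share."""
    return [records[i::num_nodes] for i in range(num_nodes)]


def process_part(part):
    """Work done by one node: here, simply sum its share of the numbers."""
    return sum(part)


def distributed_sum(records, num_nodes=4):
    """Split the data, process each part in parallel, then combine the results."""
    parts = split_into_parts(records, num_nodes)
    # Each worker thread stands in for one machine in the cluster (an assumption
    # of this sketch; a real cluster runs the work on separate machines).
    with ThreadPoolExecutor(max_workers=num_nodes) as pool:
        partial_results = list(pool.map(process_part, parts))
    # Combine the per-node results into the final answer.
    return sum(partial_results)


if __name__ == "__main__":
    print(distributed_sum(list(range(1, 101))))  # prints 5050
```

In this sketch, adding nodes only changes how the records are dealt out. The final result stays the same because the partial sums are combined at the end.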

Apache Hadoop Components

The basic components of Apache Hadoop are –

Hadoop Distributed File System: The information is not stored on a single machine,
but is distributed among all the machines that make up the cluster.

MapReduce Framework: MapReduce is a programming model that uses the HDFS
distributed file system for the parallel processing of data. The system is structured
as a master-slave architecture in which the master server of each Hadoop
cluster receives and queues user requests and assigns them to the slave servers for
processing.
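
The MapReduce model described above can be sketched in plain Python with the classic word count example. This is a single-process illustration of the map, shuffle, and reduce steps, not the Hadoop API. The function names are hypothetical, and a real job would read its input from HDFS and run the phases on many nodes.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: collapse the values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over the input lines."""
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["big data", "Big Hadoop"]))
    # prints {'big': 2, 'data': 1, 'hadoop': 1}
```

Because each call to map_phase looks at only one line and each call to reduce_phase looks at only one key, both steps could be handed to different machines, which is what makes the model suitable for parallel processing.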

Advantages of using Hadoop

Some notable benefits that Hadoop offers include –

Developers do not have to face the problems of parallel programming

Allows information to be distributed across multiple nodes and processes to be
executed in parallel

It has mechanisms for data monitoring

Allows data queries

Has multiple functionalities to facilitate the treatment, monitoring, and control of the
stored information

Difference between Big Data and Hadoop

Definition
Big Data: Refers to a huge chunk of structured and non-structured data. It is raw data containing mainly user-generated content to be analyzed.
Hadoop: An open-source framework required to manage that data. It is based on a distributed software framework that handles the storage and processing of huge data sets across clustered servers.

Value
Big Data: Has little or no value until processed.
Hadoop: One of the different tools to store, process, and analyze big data.

Accessibility
Big Data: Difficult to access given its size.
Hadoop: Allows big data to be accessed and processed very fast.

Storage
Big Data: Hard to store with traditional systems because of its raw and unstructured form.
Hadoop: Hadoop Distributed File System (HDFS) is the primary data storage system in Hadoop, storing big data.

Nature
Big Data: Big data is considered an asset.
Hadoop: Just a tool to pull out value from the asset.

Type
Big Data: Consists of multiple formats of data.
Hadoop: Clusters different formats of data, which can be stored as structured, semi-structured, and completely unstructured.

Applications
Big Data: Used in fetching information from social networking sites like Facebook, Instagram, and Twitter; public transportation; healthcare and education systems; agriculture.
Hadoop: Used in fraud detection and prevention in finance; detecting and preventing cyber-attacks; understanding user behavior from huge data sets; real-time analysis of customer data; managing content on social media platforms.

Scalability
Big Data: A complex set of data that is open to interpretation and can be unscalable.
Hadoop: Allows the system to scale as the volume of data received grows, since to process more data ...

Conclusion

Through the knowledge extracted from big data analysis using tools like Hadoop,
organizations are able to find new trends. This adds a lot of value and allows them
to come up with viable and effective solutions faster. We hope this article
helped clear up any doubts about the concepts of big data and Hadoop and
the difference between them. Keep reading and learning!
