Big Data Processing Concepts

S.Kavitha
Head & Assistant Professor
Department of Computer Science
Sri Sarada Niketan College of Science for Women, Karur.
Parallel Data Processing
• Parallel data processing involves the simultaneous execution of multiple sub-tasks that collectively comprise a larger task. The goal is to reduce the execution time by dividing a single larger task into multiple smaller tasks that run concurrently.
• Although parallel data processing can be achieved through multiple networked machines, it is more typically achieved within the confines of a single machine with multiple processors.
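A minimal sketch of this idea in Python, using the standard-library multiprocessing.Pool to divide one larger task (summing a list of numbers) into sub-tasks that run concurrently on multiple processor cores. The worker count and chunking scheme are illustrative choices, not part of the slides:

```python
from multiprocessing import Pool

def partial_sum(chunk):
    # Each worker process computes the sum of its own chunk.
    return sum(chunk)

if __name__ == "__main__":
    data = list(range(1_000_000))      # the single larger task: sum all numbers
    n_workers = 4
    chunk_size = len(data) // n_workers
    # Divide the larger task into multiple smaller sub-tasks.
    chunks = [data[i:i + chunk_size] for i in range(0, len(data), chunk_size)]
    with Pool(n_workers) as pool:
        # Execute the sub-tasks concurrently on multiple processors.
        partials = pool.map(partial_sum, chunks)
    total = sum(partials)              # combine the partial results
    print(total)                       # same result as sum(data)
```

Combining the partial results at the end is what makes the sub-tasks "collectively comprise" the larger task.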
Distributed Data Processing
• Distributed data processing is closely related to parallel data processing in that the same principle of “divide-and-conquer” is applied. However, distributed data processing is always achieved through physically separate machines that are networked together as a cluster.
Hadoop
• Hadoop is an open-source framework for large-scale data storage and data processing that is compatible with commodity hardware. The Hadoop framework has established itself as a de facto industry platform for contemporary Big Data solutions.
• It can be used as an ETL engine or as an analytics engine for processing large amounts of structured, semi-structured and unstructured data. From an analysis perspective, Hadoop implements the MapReduce processing framework.
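The MapReduce model itself can be illustrated without Hadoop: a map step emits key-value pairs, a shuffle step groups them by key, and a reduce step aggregates each group. The sketch below is a plain-Python word count, the canonical MapReduce example; it mimics the model only and is not Hadoop's actual API:

```python
from itertools import groupby
from operator import itemgetter

def map_phase(line):
    # Map: emit a (word, 1) pair for every word in an input line.
    return [(word, 1) for word in line.lower().split()]

def reduce_phase(word, counts):
    # Reduce: aggregate all counts emitted for one word.
    return (word, sum(counts))

def mapreduce_wordcount(lines):
    # Map over every input split.
    pairs = [pair for line in lines for pair in map_phase(line)]
    # Shuffle/sort: group the intermediate pairs by key.
    pairs.sort(key=itemgetter(0))
    # Reduce each group to a final (word, total) result.
    return [reduce_phase(word, [c for _, c in group])
            for word, group in groupby(pairs, key=itemgetter(0))]

print(mapreduce_wordcount(["big data big ideas", "data processing"]))
# → [('big', 2), ('data', 2), ('ideas', 1), ('processing', 1)]
```

In Hadoop the map and reduce phases run on different cluster nodes and the shuffle moves data between them; here all three phases run in one process purely to show the data flow.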
Processing Workloads
• A processing workload in Big Data is defined as the amount and nature of data that is processed within a certain amount of time. Workloads are usually divided into two types:
• batch
• transactional
Batch
• Batch processing, also known as offline processing, involves processing data in batches and usually imposes delays, which in turn results in high-latency responses.
• Batch workloads typically involve large quantities of data with sequential reads/writes and comprise groups of read or write queries.
Transactional
• Transactional processing is also known as online processing. Transactional workload processing follows an approach whereby data is processed interactively without delay, resulting in low-latency responses. Transactional workloads involve small amounts of data with random reads and writes.
• OLTP and operational systems, which are generally write-intensive, fall within this category. Although these workloads contain a mix of read/write queries, they are generally more write-intensive than read-intensive.
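As a hedged illustration of a transactional workload, the sketch below uses Python's built-in sqlite3 module: each operation touches a small amount of data (single rows) with random-access reads and writes, and each transaction commits immediately for a low-latency response. SQLite stands in here for a full OLTP system; the table and transfer function are invented for the example:

```python
import sqlite3

# In-memory database standing in for an OLTP store.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE accounts (id INTEGER PRIMARY KEY, balance REAL)")
conn.execute("INSERT INTO accounts VALUES (1, 100.0), (2, 50.0)")

def transfer(conn, src, dst, amount):
    # One transaction: two small random writes, committed atomically.
    with conn:
        conn.execute("UPDATE accounts SET balance = balance - ? WHERE id = ?",
                     (amount, src))
        conn.execute("UPDATE accounts SET balance = balance + ? WHERE id = ?",
                     (amount, dst))

transfer(conn, 1, 2, 25.0)
# Random read of a single row — a low-latency point query.
print(conn.execute("SELECT balance FROM accounts WHERE id = 2").fetchone()[0])
# → 75.0
```

Note the write-heavy mix: each transfer performs two writes for every subsequent read, matching the write-intensive character of OLTP systems described above.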
Cluster
• In the same manner that clusters provide the necessary support to create horizontally scalable storage solutions, clusters also provide the mechanism to enable distributed data processing with linear scalability.
• Since clusters are highly scalable, they provide an ideal environment for Big Data processing, as large datasets can be divided into smaller datasets and then processed in parallel in a distributed manner.
Processing in Batch Mode
• In batch mode, data is processed offline in batches, and the response time can vary from minutes to hours. In addition, data must be persisted to disk before it can be processed.
• Batch mode generally involves processing a range of large datasets, either on their own or joined together, essentially addressing the volume and variety characteristics of Big Data datasets.
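A minimal batch-mode sketch in Python: the data is first persisted to disk, then processed offline in fixed-size batches with sequential reads, and the result is only available once the whole run completes. The file name and batch size are illustrative choices:

```python
import csv
import os
import tempfile

# Persist the dataset to disk first — a precondition of batch processing.
path = os.path.join(tempfile.gettempdir(), "readings.csv")
with open(path, "w", newline="") as f:
    csv.writer(f).writerows([[i, i * 0.5] for i in range(10)])  # (id, value) rows

def process_in_batches(path, batch_size=4):
    # Read the persisted file sequentially and process it batch by batch.
    total = 0.0
    batch = []
    with open(path, newline="") as f:
        for row in csv.reader(f):
            batch.append(float(row[1]))
            if len(batch) == batch_size:
                total += sum(batch)   # process one full batch
                batch = []
        total += sum(batch)           # process the final partial batch
    return total

print(process_in_batches(path))  # → 22.5
```

Because every batch must be read and aggregated before the total exists, the caller experiences the high-latency, offline behaviour described above, in contrast to the row-at-a-time transactional example.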
