Sayan Ghosh 26900123054 Distributed Database System Cse 6th Sem

The presentation provides an overview of parallel database systems, emphasizing their ability to improve performance through simultaneous operations on large datasets. It contrasts parallel databases with distributed databases, detailing architectures, query processing techniques, and data partitioning strategies. The document also discusses real-world implementations and the future of parallel databases, highlighting trends such as cloud adoption and big data integration.

Uploaded by

Sayan Ghosh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views

Sayan Ghosh 26900123054 Distributed Database System Cse 6th Sem

Uploaded by

Sayan Ghosh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 11

PRESENTATION ON - PARALLEL DATABASE SYSTEM

PAPER NAME- DISTRIBUTED DATABASE SYSTEM

FOR- Continuous Assessment 1 (CA1)

NAME – SAYAN GHOSH

ROLL NO -26900123054
DEPERTMENT-CSE REGISTRATION NO -
232690120125 SEMESTER-6 TH

SESSION-2023-2024
Parallel Database Systems:
A n O er iew
Parallel database systems are designed to improve performance
by executing multiple operations simultaneously. These
systems are essential for managing large datasets and complex
queries in distributed environments. This presentation will
explore the key concepts, architectures, techniques, and real-
world implementations of parallel database systems.

We will begin with an introduction to parallel database

systems, comparing them to traditional systems and
highlighting their key benefits. Then, we will delve into the
architectures, query processing techniques, and data
partitioning strategies used in these systems.

by Sayan Ghosh
Distributed s. Parallel Databases: Core
Diff erences
Distributed Databases Parallel Databases
Data is spread across multiple machines, A centralized system with multiple processors,
emphasizing location transparency and autonomy. emphasizing performance and throughput via
The focus is on data distribution, fault tolerance, parallel processing. The focus is on performance,
and geographic dispersion. scalability, and high availability within a single
These databases are loosely coupled and potentially system. These databases are tightly coupled and
heterogeneous, ideal for worldwide banking typically homogeneous, suitable for large data
systems with local data management. warehouses used for complex analytics.
Architectures for Parallel
Databases

Shared Memory Shared D i s k Shared Nothing

Multiple processors Multiple processors Each processor has
access a common share common its own memory
memory space, disks, providing and disks,
facilitating easy high availability communicating via
communication and moderate a network. This
and low latency. scalability. Disk off ers high
However, this contention and scalability and fault
architecture suffers complex tolerance but
from memory concurrency control involves complex
contention and are its drawbacks. communication
limited scalability. IBM DB2 with and higher latency.
Oracle Exadata shared disk cluster Teradata systems
exemplifies this configurations is a and Hadoop
with its tightly notable example. clusters are
integrated representative of
hardware and this architecture.
software.
Parallel Query Processing:
Core Techniques
1 Parallel S ca n 2 Parallel Sort
Distributes table scans Sorts large datasets in
across multiple parallel using algorithms
processors to speed up like parallel merge sort,
data retrieval. enhancing sorting
For example, scanning a performance. For
1TB table using 10 example, sorting a 500GB
processors, each scanning dataset in parallel using
100GB. multiple sorter nodes.

3 Parallel Join
Joins large tables in parallel using techniques like hash join
and sort-merge join to improve join performance. Hash
join involves partitioning tables based on hash values and
joining partitions in parallel.
D ata Parti ti oning Strategies
Horizontal Parti ti oning
Divides rows of a table across multiple nodes. Round
Robin distributes rows evenly, while Hash
1 Partitioning distributes rows based on a hash
function applied to a key column (e.g.,
customer_id). Range Partitioning distributes rows
based on ranges of values in a key column (e.g.,
customer_id 1-1000).

Ro u n d Ro b i n E xa m p l e
2 Node 1 gets rows 1, 4, 7; Node 2 gets rows 2, 5, 8;
Node 3 gets rows 3, 6, 9, ensuring even distribution
across nodes.

H a s h Parti ti oning E xa m p l e
3 Hashing customer_id to distribute customer data
across nodes, ensuring related data can be
processed together.
Parallel Query Opti mizati on
Techniques
Query Decompositi on
Breaks down complex queries into smaller, parallelizable
tasks that can be executed concurrently.

Cost-B as ed Opti mizati on

Chooses the most effi cient execution plan based on
estimated costs, considering factors like CPU, I/O, and
network costs.

Parallel J oin Ordering

Determines the optimal order to perform joins in parallel,
often joining the smallest tables fi rst to reduce
intermediate result sizes.

D ata Localizati on
Moves computation to the data to minimize data transfer,
applying filters on data at the node where the data resides
before transferring it.
Concurrency Control and Transacti on
Management
Two- Phase C o m m i t (2PC)
Ensures that transactions are
2 either fully committed or fully
rolled back across all nodes,
Distributed L o c k i n g
maintaining atomicity.
Manages locks across multiple
1
nodes to ensure data
consistency, using protocols
Distributed Deadlock
like two-phase
Detecti on
locking.
Detects and resolves deadlocks
3 that occur across multiple
nodes, using a global deadlock
detector.
Fault Tolerance and H i g h A ailability
Replicati on D ata Parti ti oning with Automati c Failo er
Redundancy
Creating multiple copies of data Automatically switching to a
on diff erent nodes to ensure Distributing data across nodes backup node in case of a failure,
data is available even if one with redundant copies to using heartbeat mechanisms to
node fails. Can be synchronous ensure data availability. detect node failures.
or asynchronous. Utilizing RAID configurations
and mirroring data across
nodes.
Case Studies: Real-World Implementati ons

Teradata IBM DB2 Oracle E xa d ata

Utilizes a shared-nothing Employs a shared-disk architecture Features a shared-memory
architecture for large-scale data for high availability and scalability, architecture optimized for
warehousing, serving major used by enterprises for Oracle databases, catering to
retailers and financial transactional processing and organizations needing high
institutions. data warehousing. performance and scalability.
Conclusion: The Future of Parallel Databases
Cloud Adopti on 1
Increasing adoption of cloud-based
parallel database solutions like Amazon
Redshift and
2 B i g D ata Integrati on
Google BigQuery Seamless integration with big data
is on the rise.
technologies such as Hadoop and Spark
Algorithm D e elopment 3 continues to evolve.
The development of new parallel query
processing algorithms and optimization
techniques is ongoing
a
nd crucial.
Parallel databases will continue to evolve, playing a critical role in data management and analytics. They
are essential for handling large datasets and complex queries in distributed environments, driving
innovation and effi ciency in various industries.

BMW E39 Integrated Automatic Heating and Air Conditioning
100% (8)
BMW E39 Integrated Automatic Heating and Air Conditioning
24 pages
Yale - GP40-60MX (A986)
100% (1)
Yale - GP40-60MX (A986)
504 pages
DP-50 CE&FDA ServiceManual V17.0 EN
No ratings yet
DP-50 CE&FDA ServiceManual V17.0 EN
183 pages
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
From Everand
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Wei Liu
No ratings yet
Configuring Fleet Management in SAP
67% (3)
Configuring Fleet Management in SAP
12 pages
SAYAN_GHOSH_26900123054_DISTRIBUTED_DATABASE_SYSTEM_CSE_6TH_SEM
No ratings yet
SAYAN_GHOSH_26900123054_DISTRIBUTED_DATABASE_SYSTEM_CSE_6TH_SEM
11 pages
Parallel Database Systems an Overview
No ratings yet
Parallel Database Systems an Overview
10 pages
Parallel Database Systems and Their Architecture
No ratings yet
Parallel Database Systems and Their Architecture
17 pages
Elective-I Advanced Database Management Systems: Unit Ii
100% (1)
Elective-I Advanced Database Management Systems: Unit Ii
141 pages
Second Unit ADBMS
No ratings yet
Second Unit ADBMS
53 pages
Parallel & Distributed Databases: C S 5 6 1 - S P R I N G 2 0 1 2 Wpi, Mohamed Eltabakh
No ratings yet
Parallel & Distributed Databases: C S 5 6 1 - S P R I N G 2 0 1 2 Wpi, Mohamed Eltabakh
23 pages
ParallelDBs PDF
No ratings yet
ParallelDBs PDF
23 pages
Parallel Database: Architecture For Parallel Databases. Parallel Query Evaluation Parallelizing Individual Operations
No ratings yet
Parallel Database: Architecture For Parallel Databases. Parallel Query Evaluation Parallelizing Individual Operations
27 pages
M.C.a. (Sem - IV) Paper - IV - Adavanced Database Techniques
No ratings yet
M.C.a. (Sem - IV) Paper - IV - Adavanced Database Techniques
114 pages
TDD: Topics in Distributed Databases: Parallel Database Management Systems
No ratings yet
TDD: Topics in Distributed Databases: Parallel Database Management Systems
38 pages
Module1 ADBMS
No ratings yet
Module1 ADBMS
99 pages
Parallel-Databases
No ratings yet
Parallel-Databases
10 pages
databace1
No ratings yet
databace1
7 pages
Parallel Database
No ratings yet
Parallel Database
22 pages
9.CSI2004-ADBMS_Module2__part1
No ratings yet
9.CSI2004-ADBMS_Module2__part1
54 pages
Parallel and Distributed Databases in DBMS
No ratings yet
Parallel and Distributed Databases in DBMS
31 pages
Adbms
No ratings yet
Adbms
70 pages
Unit 5 Parallel and Distributed Databases
No ratings yet
Unit 5 Parallel and Distributed Databases
22 pages
Introducing Relational Database Products-2
No ratings yet
Introducing Relational Database Products-2
43 pages
Lecture 1 Parallel Databases
No ratings yet
Lecture 1 Parallel Databases
30 pages
Unit No.4 Parallel Database
No ratings yet
Unit No.4 Parallel Database
32 pages
Data Base Ppt.... Dbms
No ratings yet
Data Base Ppt.... Dbms
8 pages
Parallel and Distributed Databases
No ratings yet
Parallel and Distributed Databases
7 pages
Distributed Databases: Daniel Marcous
No ratings yet
Distributed Databases: Daniel Marcous
41 pages
ADBMS DATA WAREHOUSING CORE
No ratings yet
ADBMS DATA WAREHOUSING CORE
9 pages
Parallel Database System
No ratings yet
Parallel Database System
55 pages
8-Parallel Nhom5
No ratings yet
8-Parallel Nhom5
59 pages
adbms-unit4
No ratings yet
adbms-unit4
24 pages
Adv DBMS-Unit 2
No ratings yet
Adv DBMS-Unit 2
15 pages
DBMS Unit5
No ratings yet
DBMS Unit5
30 pages
subtitle (12)
No ratings yet
subtitle (12)
2 pages
Lecture 2 - Relational Data Processing
No ratings yet
Lecture 2 - Relational Data Processing
10 pages
ADBMS Parallel and Distributed Databases
No ratings yet
ADBMS Parallel and Distributed Databases
98 pages
Parallel Dbms
No ratings yet
Parallel Dbms
5 pages
bda-ia2-bda
No ratings yet
bda-ia2-bda
7 pages
Fundamentals of Database Systems: (Parallel and Distributed Databases)
No ratings yet
Fundamentals of Database Systems: (Parallel and Distributed Databases)
46 pages
Unit_I DBMS
No ratings yet
Unit_I DBMS
74 pages
p64 Stonebraker PDF
No ratings yet
p64 Stonebraker PDF
8 pages
Parallel Databases
No ratings yet
Parallel Databases
11 pages
02 Distdbms Storage
No ratings yet
02 Distdbms Storage
62 pages
P24CDMCA4_unit2[1]
No ratings yet
P24CDMCA4_unit2[1]
15 pages
DBMS CHAP-6
No ratings yet
DBMS CHAP-6
15 pages
Ads unit 3
No ratings yet
Ads unit 3
8 pages
19516_Week 2 Parallel and Distributed Database
No ratings yet
19516_Week 2 Parallel and Distributed Database
7 pages
ADBMS Tutorial
No ratings yet
ADBMS Tutorial
6 pages
Advanced DBMS Viva :: New Edition
No ratings yet
Advanced DBMS Viva :: New Edition
33 pages
Distributed Databases AND Client-Server Architechures
No ratings yet
Distributed Databases AND Client-Server Architechures
73 pages
Parallel DB /D.S.Jagli 1 5/4/2012 1 1. Parallel DB /D.S.Jagli
No ratings yet
Parallel DB /D.S.Jagli 1 5/4/2012 1 1. Parallel DB /D.S.Jagli
70 pages
ADBMS IMP Questions
No ratings yet
ADBMS IMP Questions
41 pages
Basis For Distributed Database Technology
No ratings yet
Basis For Distributed Database Technology
35 pages
Basis For Distributed Database Technology
No ratings yet
Basis For Distributed Database Technology
35 pages
DBMS (1)
No ratings yet
DBMS (1)
74 pages
Parallel Database
No ratings yet
Parallel Database
27 pages
Distributed Databases: CMP-3440 - Database Systems
No ratings yet
Distributed Databases: CMP-3440 - Database Systems
12 pages
rdms 1
No ratings yet
rdms 1
23 pages
26 Distributed Dbms Nosql
No ratings yet
26 Distributed Dbms Nosql
45 pages
Advanced Database Integration Group 52
No ratings yet
Advanced Database Integration Group 52
45 pages
DDIS U1-3
No ratings yet
DDIS U1-3
40 pages
Database And Computer Management: SERIES 1, #3
From Everand
Database And Computer Management: SERIES 1, #3
Elias Mutegi
No ratings yet
Sayan Ghosh 26900123054 Cse Data Mining 6th Sem
No ratings yet
Sayan Ghosh 26900123054 Cse Data Mining 6th Sem
11 pages
Wireless-Networks-Wi-Fi-Bluetooth-and-Mobile-Networks
No ratings yet
Wireless-Networks-Wi-Fi-Bluetooth-and-Mobile-Networks
10 pages
Sayan Ghosh 26900123054 Cse Dbms 6th Sem
No ratings yet
Sayan Ghosh 26900123054 Cse Dbms 6th Sem
11 pages
26900123054 Sayan Ghosh Cse 6th Sem Computer Networks
No ratings yet
26900123054 Sayan Ghosh Cse 6th Sem Computer Networks
11 pages
Sayan Ghosh 26900123054 Cse Dbms 6th Sem
No ratings yet
Sayan Ghosh 26900123054 Cse Dbms 6th Sem
11 pages
SAYAN_GHOSH_26900123054_CSE_DATA_MINING_6TH_SEM
No ratings yet
SAYAN_GHOSH_26900123054_CSE_DATA_MINING_6TH_SEM
11 pages
VDO FMprofessional Datasheet en
No ratings yet
VDO FMprofessional Datasheet en
2 pages
Network Cables ........ 1
No ratings yet
Network Cables ........ 1
18 pages
12 Tools For Growth Marketing
No ratings yet
12 Tools For Growth Marketing
14 pages
Windows 98
No ratings yet
Windows 98
4 pages
Modbus
No ratings yet
Modbus
22 pages
Module 4: Time Response of Discrete Time Systems: Lecture Note 1
100% (1)
Module 4: Time Response of Discrete Time Systems: Lecture Note 1
5 pages
Logic Gates: Engr. Gilbert M. Mendoza, Ece, Me Fic, Me Department
No ratings yet
Logic Gates: Engr. Gilbert M. Mendoza, Ece, Me Fic, Me Department
38 pages
Technical Format FYP 1 2022
No ratings yet
Technical Format FYP 1 2022
11 pages
Goal
No ratings yet
Goal
2 pages
Electrical & Instrumentation Punch List S.No Description
No ratings yet
Electrical & Instrumentation Punch List S.No Description
6 pages
Virtual Assistant: Project Bachelor of Technology CSE
No ratings yet
Virtual Assistant: Project Bachelor of Technology CSE
11 pages
Siguiente Nivel
No ratings yet
Siguiente Nivel
5 pages
Sets With 3 Vertical Multistage Centrifugal Pumps
No ratings yet
Sets With 3 Vertical Multistage Centrifugal Pumps
2 pages
Metallography, Microstructure, and Analysis: Covers The Methods of Evaluation of Metallic Materials For Use in The
No ratings yet
Metallography, Microstructure, and Analysis: Covers The Methods of Evaluation of Metallic Materials For Use in The
1 page
ABB Thermal Overload Relays
No ratings yet
ABB Thermal Overload Relays
6 pages
Hotel Mode New Modells LG
No ratings yet
Hotel Mode New Modells LG
20 pages
099-Q.improvement Plan PDF
No ratings yet
099-Q.improvement Plan PDF
2 pages
Aroma SE200 - 300 - Eng
No ratings yet
Aroma SE200 - 300 - Eng
2 pages
Dilip Resume
No ratings yet
Dilip Resume
4 pages
E700 E800 Replacement
No ratings yet
E700 E800 Replacement
35 pages
Hotel Reservation Android App Report
No ratings yet
Hotel Reservation Android App Report
27 pages
Network Engineer
No ratings yet
Network Engineer
5 pages
Rubric: PC Installation: Poor Fair Good
No ratings yet
Rubric: PC Installation: Poor Fair Good
2 pages
E-Commerce Project Assignment-Group-03
No ratings yet
E-Commerce Project Assignment-Group-03
11 pages
Business Analysis Template-AIM MD050 App Ext Funct Design-V101
No ratings yet
Business Analysis Template-AIM MD050 App Ext Funct Design-V101
13 pages
7.6.3 Exploring IPv6 Addressing On Routers
No ratings yet
7.6.3 Exploring IPv6 Addressing On Routers
3 pages

Sayan Ghosh 26900123054 Distributed Database System Cse 6th Sem

Uploaded by

Sayan Ghosh 26900123054 Distributed Database System Cse 6th Sem

Uploaded by

PRESENTATION ON - PARALLEL DATABASE SYSTEM

PAPER NAME- DISTRIBUTED DATABASE SYSTEM

NAME – SAYAN GHOSH

We will begin with an introduction to parallel database

Shared Memory Shared D i s k Shared Nothing

Cost-B as ed Opti mizati on

Parallel J oin Ordering

Teradata IBM DB2 Oracle E xa d ata

You might also like