Grid Computing
(Center for Computational Mathematics)
Dr. Ashok Mishra
Team: Dr. Banitamani Mallik, Dr. Tumbanath Samantara, Mr. Balaji Padhy, Mrs. Sasmita Jena

Lecture 5: Data-intensive Applications

Data-intensive computing is a class of parallel computing applications which use a data-parallel approach to process large volumes of data, typically terabytes or petabytes in size and commonly referred to as big data. Computing applications which devote most of their execution time to computational requirements are deemed compute-intensive, whereas applications which require large volumes of data and devote most of their processing time to I/O and manipulation of data are deemed data-intensive.
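As a minimal, self-contained sketch of the data-parallel approach (the partition files and record counts below are invented purely for illustration), the same function is applied to independent chunks of the input in parallel:

```python
# A minimal data-parallel sketch: one worker process per data partition,
# each applying the same operation to its own chunk of the input.
from multiprocessing import Pool

def count_records(chunk_path: str) -> int:
    """Process one partition independently of all others."""
    with open(chunk_path) as f:
        return sum(1 for _ in f)

if __name__ == "__main__":
    # Create three hypothetical partitions so the sketch is self-contained.
    chunks = []
    for i in range(3):
        path = f"part-{i:03d}.txt"
        with open(path, "w") as f:
            f.write("record\n" * (i + 1) * 10)
        chunks.append(path)
    with Pool(processes=3) as pool:
        counts = pool.map(count_records, chunks)  # same function, different data
    print("total records:", sum(counts))          # -> 60
```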
ABSTRACT MODEL OF A WORKFLOW MANAGEMENT SYSTEM:
The architecture of a Grid workflow system is based on the workflow reference model proposed by the Workflow Management Coalition (WfMC). The build-time and run-time boundaries separate the functionality of the design into defining tasks and executing tasks, respectively. At the core of the run time are components that actively process both data and tasks equally:
• The scheduler, which forms the core of the engine, handles data-flow schedules on top of task schedules.
• For example, if a workflow is modelled such that the data transfer tasks are separate from the computation tasks, the scheduler may apply a different scheduling policy to the data transfer tasks (see the sketch after this list).
• Similarly, when there is no distinction between these tasks, the scheduler may prioritize data transfers between certain tasks over computation, depending on the structure of the workflow, the scheduling objectives, and so forth.
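A minimal sketch of this scheduling idea, not the API of any particular workflow engine; the task names, kinds, sizes, and the two policies are hypothetical:

```python
# One policy for data transfer tasks (largest payload first, so long copies
# start early), another for computation tasks (plain FIFO order).
from dataclasses import dataclass

@dataclass
class Task:
    name: str
    kind: str           # "transfer" or "compute"
    size_mb: float = 0  # payload size, used only for transfer tasks

def schedule(tasks: list[Task]) -> list[Task]:
    transfers = [t for t in tasks if t.kind == "transfer"]
    computes  = [t for t in tasks if t.kind == "compute"]
    transfers.sort(key=lambda t: t.size_mb, reverse=True)  # transfer policy
    return transfers + computes                            # compute policy: FIFO

tasks = [Task("stage-in", "transfer", 512), Task("analyze", "compute"),
         Task("stage-out", "transfer", 64)]
for t in schedule(tasks):
    print(t.name)   # -> stage-in, stage-out, analyze
```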
SURVEY:
In this section, we characterize and classify the key concepts and techniques used for scheduling and managing data-intensive application workflows. We classify the techniques into seven major categories: (a) data locality, (b) data transfer, (c) data footprint, (d) granularity, (e) model, (f) platform, and (g) miscellaneous technologies.
Data Locality:
• Transferring data between computing nodes takes a significant amount of time, depending on the size of the data and the network capacity between the participating nodes. Hence, most scheduling techniques aim to optimize data transfers by exploiting the locality of data. These techniques can be classified into: (i) spatial clustering, (ii) task clustering, and (iii) worker-centric scheduling (a locality sketch follows).
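A minimal sketch of one locality-aware placement rule, assuming the scheduler knows input file sizes and which node currently holds each file (all names and sizes below are hypothetical): run each task where most of its input already resides, so only the remainder must be moved.

```python
# Map a task to the node holding the largest share of its input bytes.
def pick_node(input_sizes: dict[str, int], location: dict[str, str]) -> str:
    """input_sizes: file -> bytes; location: file -> node holding the file."""
    bytes_on_node: dict[str, int] = {}
    for f, size in input_sizes.items():
        node = location[f]
        bytes_on_node[node] = bytes_on_node.get(node, 0) + size
    # Run where most input already resides; only the rest is transferred.
    return max(bytes_on_node, key=bytes_on_node.get)

inputs   = {"a.dat": 4_000, "b.dat": 120_000, "c.dat": 9_000}
location = {"a.dat": "node1", "b.dat": "node2", "c.dat": "node1"}
print(pick_node(inputs, location))  # -> node2 (already holds 120 KB)
```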
Data Transfer:
• Workflow systems employ several mechanisms for transferring data so that data transfer time is minimized. These techniques are:
• (i) data parallelism,
• (ii) data streaming, and
• (iii) data throttling (see the sketch after this list).
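A minimal sketch of data throttling, with the actual copy replaced by a sleep and a semaphore capping the number of concurrent transfers; the limit and file names are hypothetical:

```python
# Throttling: at most MAX_CONCURRENT_TRANSFERS copies run at once, so
# transfers do not saturate the network or overwhelm a storage server.
import threading, time

MAX_CONCURRENT_TRANSFERS = 2
slots = threading.Semaphore(MAX_CONCURRENT_TRANSFERS)

def transfer(name: str, seconds: float) -> None:
    with slots:                 # block until a transfer slot is free
        print(f"start {name}")
        time.sleep(seconds)     # stand-in for the actual data copy
        print(f"done  {name}")

threads = [threading.Thread(target=transfer, args=(f"file{i}", 0.5))
           for i in range(5)]
for t in threads: t.start()
for t in threads: t.join()
```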
Data Footprint:
Workflow systems adopt several mechanisms to track and manage the data footprint of the application. These mechanisms can be classified into:
• cleanup jobs,
• restructuring of the workflow, and
• data placement and replication (a cleanup-job sketch follows this list).
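A minimal sketch of cleanup-job planning over an already ordered workflow (the tasks and file names are hypothetical): after each task runs, any intermediate file that no later task consumes can be deleted, shrinking the footprint.

```python
# For each task, compute which of its input files are safe to delete
# afterwards because no subsequent task still needs them.
tasks = [  # executed in this (already scheduled) order
    {"name": "t1", "inputs": [],          "outputs": ["raw.dat"]},
    {"name": "t2", "inputs": ["raw.dat"], "outputs": ["mid.dat"]},
    {"name": "t3", "inputs": ["mid.dat"], "outputs": ["out.dat"]},
]

def cleanup_plan(tasks):
    plan = []
    for i, t in enumerate(tasks):
        still_needed = {f for later in tasks[i + 1:] for f in later["inputs"]}
        doomed = [f for f in t["inputs"] if f not in still_needed]
        plan.append((t["name"], doomed))  # files deletable after t finishes
    return plan

for name, doomed in cleanup_plan(tasks):
    print(f"after {name}: delete {doomed or 'nothing'}")
```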
Granularity:
• Workflow schedulers can make scheduling decisions at either: (a) the task level, or (b) the workflow level.
• Task-level schedulers map individual tasks to compute resources.
• The decisions of resource selection and data movement are based on the characteristics of the individual task and its dependencies with other tasks (a task-level sketch follows this list).
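A minimal sketch of task-level mapping (the resource speeds and task runtimes are hypothetical): each task is placed on its own, using an earliest-finish-time rule, without considering the rest of the workflow.

```python
# Task-level scheduling: pick, per task, the resource that finishes it
# earliest given current load; no workflow-wide optimization is attempted.
resources = {"r1": {"speed": 1.0, "free_at": 0.0},
             "r2": {"speed": 2.0, "free_at": 0.0}}

def map_task(runtime_units: float) -> str:
    best = min(resources,
               key=lambda r: resources[r]["free_at"]
                             + runtime_units / resources[r]["speed"])
    resources[best]["free_at"] += runtime_units / resources[best]["speed"]
    return best

for name, work in [("t1", 4), ("t2", 4), ("t3", 2)]:
    print(name, "->", map_task(work))  # -> t1->r2, t2->r1, t3->r2
```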
Model:
The workflow scheduling model depends on the way the tasks and data are composed and handled. Models can be classified into two categories: (i) task-based, and (ii) service-based.
Platform:
Data-intensive application workflows can be executed in different resource configurations and environments (e.g., Clusters, Data Grids, Clouds) depending on the requirements of the application.

Miscellaneous:
In this section, we list some technologies that have been used to enhance the performance of data-intensive application workflows.

Semantic Technology: