What Is Parallel Computing?
In the simplest sense, parallel computing is the simultaneous use of multiple compute
resources to solve a computational problem:
o To be run using multiple CPUs
o A problem is broken into discrete parts that can be solved concurrently
o Each part is further broken down to a series of instructions
o Instructions from each part execute simultaneously on different CPUs (a minimal
decomposition sketch follows this list)
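As a concrete illustration (not part of the original notes; the chunking scheme and worker
count are arbitrary choices for the example), the Python sketch below decomposes one large
sum into discrete parts, sums each part in a separate worker process, and combines the
partial results:

```python
# A minimal sketch of problem decomposition with Python's multiprocessing.
# The problem (summing a large list) is broken into discrete chunks; each
# chunk is summed concurrently by its own worker process, and the partial
# results are combined at the end.
from multiprocessing import Pool

def partial_sum(chunk):
    # Each worker executes this series of instructions on its own part.
    return sum(chunk)

if __name__ == "__main__":
    data = list(range(1_000_000))
    n_workers = 4                                  # illustrative worker count
    size = len(data) // n_workers
    chunks = [data[k * size:(k + 1) * size] for k in range(n_workers)]
    chunks[-1].extend(data[n_workers * size:])     # remainder joins the last chunk
    with Pool(n_workers) as pool:
        print(sum(pool.map(partial_sum, chunks)))  # combine the partial results
```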
Historically, parallel computing has been considered to be "the high end of computing",
and has been used to model difficult scientific and engineering problems found in the real
world. Some examples:
o Atmosphere, Earth, Environment
o Physics - applied, nuclear, particle, condensed matter, high pressure, fusion,
photonics
o Bioscience, Biotechnology, Genetics
o Chemistry, Molecular Sciences
o Geology, Seismology
o Mechanical Engineering - from prosthetics to spacecraft
Why Use Parallel Computing?
Save time and/or money: In theory, throwing more resources at a task will shorten its time
to completion, with potential cost savings. Parallel clusters can be built from cheap,
commodity components.
Solve larger problems: Many problems are so large and/or complex that it is impractical
or impossible to solve them on a single computer, especially given limited computer
memory. For example:
o "Grand Challenge" (en.wikipedia.org/wiki/Grand_Challenge) problems requiring
PetaFLOPS and PetaBytes of computing resources.
o Web search engines/databases processing millions of transactions per second
Provide concurrency: A single compute resource can only do one thing at a time.
Multiple computing resources can be doing many things simultaneously. For example, the
Access Grid (www.accessgrid.org) provides a global collaboration network where people
from around the world can meet and conduct work "virtually".
Use of non-local resources: Using compute resources on a wide area network, or even the
Internet when local compute resources are scarce. For example:
o SETI@home (setiathome.berkeley.edu) uses over 330,000 computers for a compute
power over 528 TeraFLOPS (as of August 04, 2008)
o Folding@home (folding.stanford.edu) uses over 340,000 computers for a compute
power of 4.2 PetaFLOPS (as of November 4, 2008)
Limits to serial computing: Both physical and practical reasons pose significant
constraints to simply building ever faster serial computers:
o Transmission speeds - the speed of a serial computer is directly dependent upon
how fast data can move through hardware. Absolute limits are the speed of light (30
cm/nanosecond) and the transmission limit of copper wire (9 cm/nanosecond); see the
quick check below.
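As a quick back-of-the-envelope check (the 3 GHz clock rate is an arbitrary illustrative
choice, not from the notes), a signal moving at the speed of light covers only about 10 cm
of hardware in one clock cycle:

```python
# Back-of-the-envelope check of the transmission-speed limit.
c_cm_per_ns = 30.0               # speed of light: ~30 cm per nanosecond
clock_ghz = 3.0                  # illustrative clock rate: 3 GHz => 1/3 ns per cycle
cycle_ns = 1.0 / clock_ghz
print(c_cm_per_ns * cycle_ns)    # ~10.0 cm: farthest a signal can travel per cycle
```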
Current computer architectures are increasingly relying upon hardware-level parallelism to
improve performance:
o Multiple execution units
o Pipelined instructions
o Multi-core processors
RAM model
The Random Access Machine (RAM) is the standard model of a sequential computer. Its main
features are listed below; a brief cost-accounting sketch follows the list.
1. Computation unit with a user-defined program.
2. Read-only input tape and write-only output tape.
3. Unbounded number of local memory cells.
4. Each memory cell is capable of holding an integer of unbounded size.
5. Instruction set includes operations for moving data between memory cells, comparisons
and conditional branches, and simple arithmetic operations.
6. Execution starts with the first instruction and ends when a HALT instruction is executed.
7. All operations take unit time regardless of the lengths of operands.
8. Time complexity = the number of instructions executed.
9. Space complexity = the number of memory cells accessed.
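As an illustrative sketch (the instruction-counting scheme here is a simplification assumed
for the example, not part of the model's formal definition), the following Python fragment
sums n numbers while charging unit cost per operation; the count grows linearly, i.e. the
time complexity is Θ(n):

```python
# Rough illustration of RAM cost accounting under the unit-cost assumption:
# every data move, comparison, and arithmetic operation costs one time step.
def ram_sum(items):
    steps = 1                  # charge one step for initializing the accumulator
    acc = 0                    # a memory cell holding the running total
    for x in items:
        acc += x
        steps += 2             # assumed charge: one add + one loop branch per item
    return acc, steps

total, steps = ram_sum(range(100))
print(total, steps)            # steps grows linearly with n: Theta(n) time
```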
PRAM model
The Parallel Random Access Machine (PRAM) is a straightforward and natural generalization of
the RAM. It is an idealized model of a shared-memory SIMD machine. Its main features are
listed below; a small one-step simulation sketch follows the list.
1. Unbounded collection of numbered RAM processors P0, P1, P2,... (without tapes).
2. Unbounded collection of shared memory cells M[0], M[1], M[2],....
3. Each Pi has its own (unbounded) local memory (registers) and knows its index i.
4. Each processor can access any shared memory cell (unless there is an access conflict, see
further) in unit time.
5. Input of a PRAM algorithm consists of n items stored in (usually the first) n shared
memory cells.
6. Output of a PRAM algorithm consists of n' items stored in n' shared memory cells.
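A minimal simulation of one synchronous PRAM step (an illustration constructed for these
notes; the three-phase read/compute/write structure mirrors the lockstep model, and here
each Pi touches only cell M[i], so no access conflicts arise):

```python
# Illustrative simulation of one synchronous PRAM step: n processors P0..Pn-1
# share memory M. All reads complete before any write, mimicking lockstep
# execution; processor Pi reads and writes only M[i], so no conflicts occur.
def pram_step(M, n, local_compute):
    reads = [M[i] for i in range(n)]                  # phase 1: every Pi reads M[i]
    results = [local_compute(i, reads[i]) for i in range(n)]  # phase 2: local work
    for i in range(n):                                # phase 3: every Pi writes M[i]
        M[i] = results[i]

M = [3, 1, 4, 1, 5]                          # input: n items in shared memory cells
pram_step(M, len(M), lambda i, x: 2 * x)     # every Pi doubles its own cell in unit time
print(M)                                     # [6, 2, 8, 2, 10]
```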
PREFIX SUM
The sequential (outer) for loop executes ⌈log₂ n⌉ times, and in each iteration all
processors perform their updates in parallel in constant time. Hence, the overall execution
time is O(log n).
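A minimal sketch of this doubling-step prefix-sum computation, simulated in ordinary Python
(the inner loop stands in for the n processors working in parallel; the snapshot enforces
the reads-before-writes discipline of the synchronous model):

```python
import math

# Simulated PRAM prefix sum: the sequential outer loop runs ceil(log2(n))
# times; within a round, every position i >= 2^j adds the value 2^j places
# to its left. Reads use the previous round's snapshot, as lockstep
# execution requires. On a PRAM with n processors each round costs O(1).
def prefix_sum(a):
    n = len(a)
    x = list(a)
    for j in range(math.ceil(math.log2(n))):
        step = 1 << j                        # distance doubles each round
        prev = list(x)                       # snapshot: all reads before any write
        for i in range(step, n):             # done in parallel by processors Pi
            x[i] = prev[i] + prev[i - step]
    return x

print(prefix_sum([1, 2, 3, 4, 5]))           # [1, 3, 6, 10, 15]
```

A sequential scan needs Θ(n) additions; the parallel version trades n processors for a
logarithmic number of rounds.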