Why Multiprocessors? Motivation and Opportunity

Multiprocessors provide opportunities to go beyond the performance of a single processor by exploiting parallelism without requiring specialized hardware. They take advantage of existing software and can handle both parallel programs and multi-programmed workloads without excessive complexity. The key models are SIMD, MIMD with centralized shared memory, and MIMD with physically distributed memory using either distributed shared memory or message passing approaches. Effective parallel applications exhibit high computation to communication ratios.

Why Multiprocessors?

Motivation:
- Go beyond the performance offered by a single processor
- Without requiring specialized processors
- Without the complexity of too much multiple issue

Opportunity:
- Software available
  - Parallel programs
  - Multi-programmed machines
Multiprocessors: The SIMD Model

- SISD: Single Instruction stream, Single Data stream
  - Uniprocessor
  - This is the view at the ISA level
  - Tomasulo uncovers data-stream parallelism

- SIMD: Single Instruction stream, Multiple Data streams
  - ISA makes data parallelism explicit
  - Special SIMD instructions
  - Same instruction goes to multiple functional units, but acts on different data
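For illustration only (not from the slides), here is a minimal sketch of the SIMD idea using x86 SSE intrinsics. The intrinsics (_mm_loadu_ps, _mm_add_ps, _mm_storeu_ps) are standard SSE; the function and array names are invented for this example, and the SIMD loop assumes n is a multiple of 4.

#include <xmmintrin.h>   /* SSE intrinsics */

/* SISD-style loop: one instruction operates on one data element at a time. */
void add_scalar(float *c, const float *a, const float *b, int n) {
    for (int i = 0; i < n; i++)
        c[i] = a[i] + b[i];
}

/* SIMD-style loop: one instruction operates on four data elements at once.
   Assumes n is a multiple of 4. */
void add_simd(float *c, const float *a, const float *b, int n) {
    for (int i = 0; i < n; i += 4) {
        __m128 va = _mm_loadu_ps(&a[i]);
        __m128 vb = _mm_loadu_ps(&b[i]);
        _mm_storeu_ps(&c[i], _mm_add_ps(va, vb));
    }
}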
SIMD Drawbacks
- SIMD useful for loop-level parallelism
- Model is too inflexible to accommodate parallel programs as well as multi-programmed environments
- Cannot take advantage of uniprocessor performance growth
- SIMD architecture usually used in special-purpose designs
  - Signal or image processing
Multiprocessors: The MIMD Model

- MIMD: Multiple Instruction streams, Multiple Data streams
  - Each processor fetches its own instruction and data

- Advantages:
  - Flexibility: parallel programs, or multi-programmed OS, or both
  - Built using off-the-shelf uniprocessors
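A minimal sketch of the MIMD idea (not from the slides), using POSIX threads: two threads execute different instruction streams on different data. The thread functions and variable names are invented for this illustration.

#include <pthread.h>
#include <stdio.h>

/* First instruction stream: sums an integer array. */
static void *sum_worker(void *arg) {
    int *v = (int *)arg;
    long s = 0;
    for (int i = 0; i < 4; i++) s += v[i];
    printf("sum = %ld\n", s);
    return NULL;
}

/* Second instruction stream: scales a double array in place. */
static void *scale_worker(void *arg) {
    double *v = (double *)arg;
    for (int i = 0; i < 4; i++) v[i] *= 2.0;
    printf("scaled v[0] = %.1f\n", v[0]);
    return NULL;
}

int main(void) {
    int a[4] = {1, 2, 3, 4};
    double b[4] = {0.5, 1.5, 2.5, 3.5};
    pthread_t t1, t2;
    /* Each thread runs its own code on its own data: MIMD in miniature. */
    pthread_create(&t1, NULL, sum_worker, a);
    pthread_create(&t2, NULL, scale_worker, b);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    return 0;
}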
MIMD: The Centralized Shared-Memory Model

[Figure: processors (P), each with a cache ($), on a single bus connecting them to main memory and I/O]

- Single bus connects a shared memory to all processors
- Also called Uniform Memory Access (UMA) machine
- Disadvantage: cannot scale very well, especially with fast processors (more memory bandwidth required)
MIMD: Physically Distributed Memory

[Figure: nodes, each a processor with cache (P+$) plus local memory (M) and I/O, connected by an interconnection network]

- Independent memory for each processor
- High-bandwidth interconnection
- Advantage: cost-effective memory bandwidth scaling
- Advantage: lower latency for local access
- Disadvantage: communication of data between nodes
Communication Models with Physically Distributed Memory

- Distributed Shared Memory (DSM)
  - Memory address space is the same across nodes
  - Also called scalable shared memory
  - Also called NUMA: non-uniform memory access
  - Communication is implicit, via load/store

- Multicomputer, or Message Passing Machine
  - Separate private address spaces for each node
  - Communication is explicit, through messages
  - Synchronous or asynchronous
  - Standard Message Passing Interface (MPI) possible
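As a hedged illustration of the contrast (not part of the slides): in a message-passing machine the exchange below must be written explicitly, whereas on a DSM machine the same exchange would just be a store by one processor and a load by another. The MPI calls shown (MPI_Init, MPI_Comm_rank, MPI_Send, MPI_Recv, MPI_Finalize) are standard; the value being exchanged is arbitrary.

#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv) {
    int rank, value = 0;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {
        value = 42;
        /* Explicit communication: rank 0 sends one int to rank 1. */
        MPI_Send(&value, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
    } else if (rank == 1) {
        /* Rank 1 must explicitly receive; there is no shared address space. */
        MPI_Recv(&value, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
        printf("rank 1 received %d\n", value);
    }

    MPI_Finalize();
    return 0;
}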
Multiprocessing: Classification

Multiprocessing
  SIMD
  MIMD
    Centralized shared memory
    Physically distributed memory
      Distributed shared memory (DSM)
      Message passing machines
DSM vs. Message Passing

Shared Memory:
- Well understood mechanisms for programming
- Program independent of communication pattern
- Low overhead for communicating small items
- Hardware-controlled caching

Message Passing:
- Hardware simplicity
- Communication is explicit – forces programmer to pay attention to what is expensive
Achieving the Desired Communication Model

- Message Passing on top of Shared Memory
  - Considerably easier
  - Difficulty arises in dealing with arbitrary message lengths
- Shared Memory on top of Message Passing
  - Harder, since every load/store has to be emulated
  - Every memory reference may involve the OS
  - One promising direction: use virtual memory to share objects at page level (shared virtual memory)
Challenges in Parallel Processing

- Limited parallelism available in programs
  - 90% parallelizable ==> maximum speedup possible?
  - Exception: super-linear speedup
    - Increased memory/cache available
    - Usually not very great, however
- Large latency of communication
  - 50-10,000 clock cycles
  - 0.5% of instructions access remote memory ==> what is the increase in CPI?
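A hedged worked answer to the two questions above (the 90% and 0.5% figures come from the slide; the remaining numbers are assumed for illustration): by Amdahl's Law, with 90% of the work parallelizable the speedup on p processors is 1 / (0.1 + 0.9/p), so even with unlimited processors the maximum speedup is 1 / 0.1 = 10. For the communication question, if the base CPI is 0.5, a remote access stalls the processor for 400 cycles, and 0.5% of instructions make a remote access, the effective CPI becomes 0.5 + 0.005 × 400 = 2.5, i.e., the machine runs five times slower than if all accesses were local.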
Addressing the Challenges

- Limited parallelism
  - Tackled mainly by redesigning the algorithm or software
- Avoiding large latency
  - Hardware mechanism: caching
  - Software mechanism: restructure to make more accesses local
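As a small single-node analogue of such restructuring (invented for these notes, not from the slides): traversing a row-major array column-by-column touches a new cache line on almost every access, while row-by-row traversal reuses each line; the same placement idea applies to keeping accesses in local rather than remote memory.

#define N 1024
static double a[N][N];

/* Poor locality: strides through memory, defeating the cache. */
double sum_column_major(void) {
    double s = 0.0;
    for (int j = 0; j < N; j++)
        for (int i = 0; i < N; i++)
            s += a[i][j];
    return s;
}

/* Restructured: consecutive accesses hit the same cache line. */
double sum_row_major(void) {
    double s = 0.0;
    for (int i = 0; i < N; i++)
        for (int j = 0; j < N; j++)
            s += a[i][j];
    return s;
}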
Some Example Applications

- Two classes
  - Parallel programs or program kernels
  - Multi-programmed OS
- Spatial and temporal data access patterns are important
- Computation-to-communication ratio is important
Parallel Application Kernels

- The FFT kernel
  - Used in spectral methods
  - Data represented as an array
  - Computation involves:
    - 1D FFT on each row
    - Transpose
    - 1D FFT on each row again
  - Each processor gets a few rows of data
  - Main communication step is the transpose (all-to-all communication)
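A rough sketch (not from the slides) of the transpose's communication pattern using MPI's all-to-all collective. The row-wise FFTs and the packing/unpacking of blocks around the exchange are omitted; the problem size n, the placeholder data, and the assumption that n is divisible by the number of ranks are all invented for this example.

#include <mpi.h>
#include <stdlib.h>

/* Each of p ranks owns n/p rows of an n x n array; during the transpose,
   every rank exchanges one block of its rows with every other rank. */
int main(int argc, char **argv) {
    int rank, p;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &p);

    int n = 1 << 10;                 /* assumed problem size, divisible by p */
    int rows = n / p;                /* rows of the n x n array owned here   */
    int block = rows * (n / p);      /* elements destined for each rank      */

    double *send = malloc((size_t)rows * n * sizeof *send);
    double *recv = malloc((size_t)rows * n * sizeof *recv);
    for (int i = 0; i < rows * n; i++) send[i] = rank;   /* placeholder data */

    /* Block j of 'send' goes to rank j; block i of 'recv' arrives from rank i. */
    MPI_Alltoall(send, block, MPI_DOUBLE, recv, block, MPI_DOUBLE, MPI_COMM_WORLD);

    free(send);
    free(recv);
    MPI_Finalize();
    return 0;
}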
Parallel Application Kernels (continued)

- The LU kernel
  - LU factorization of a matrix
  - Blocking is used
  - Computation (dense matrix multiply) is performed by the processor that owns the destination block
  - Communication happens at regular intervals
Parallel Applications

- Barnes application
  - N-body problem
  - Octree representation
  - Each processor is allocated a subtree
  - Tree expansion as required (communication in this process)
Parallel Applications (continued)

- Ocean application
  - Influence of eddy and boundary currents on ocean flows
  - Involves solving PDEs
  - Ocean divided into a hierarchy of grids (finer grid for more accuracy)
  - Each processor gets a set of grids
  - Communication to exchange boundary conditions at each step of the process
Computation to Communication Ratios

Application   Computation scaling   Communication scaling   Scaling of computation to communication
FFT           (n log n)/p           n/p                     log n
LU            n/p                   sqrt(n/p)               sqrt(n/p)
Barnes        (n log n)/p           (log n)*sqrt(n/p)       sqrt(n/p)
Ocean         n/p                   sqrt(n/p)               sqrt(n/p)
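Reading the FFT row as an example: the ratio is (n log n / p) divided by (n/p), which is log n. In all four rows the computation-to-communication ratio improves as the problem size n grows, while increasing the processor count p alone either leaves it unchanged (FFT) or reduces it (the sqrt(n/p) rows), which suggests why problem size is typically scaled up along with the processor count.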
Multiprogrammed OS workload

- Workload used here:
  - Two independent copies of the compilation of the Andrew benchmark
  - Three steps:
    - Compilation: compute intensive
    - Installing object files in a library: I/O intensive
    - Removing the object files: I/O intensive