
THREAD LEVEL PARALLELISM

NEED FOR MULTIPROCESSORS


 The importance of multiprocessors grew as designers found ways to build servers and supercomputers that achieved higher performance than a single microprocessor, while exploiting the cost-performance advantages of commodity microprocessors
 Uniprocessor performance slowed because of diminishing returns in exploiting instruction-level parallelism (ILP), combined with a growing concern over power
 These trends led to a new era in computer architecture, in which multiprocessors play a major role from the low end to the high end
FACTORS REFLECTING THE IMPORTANCE OF
MULTIPROCESSING
 Finding and exploiting more ILP turned out to be inefficient, since power and silicon costs grew faster than performance
 Other than ILP, the only scalable and general-purpose way to increase performance is through multiprocessing
 A growing interest in high-end servers
 A growth in data-intensive applications
 Increasing performance on the desktop is less important, as highly compute- and data-intensive applications are being done in the cloud
 An improved understanding of how to use multiprocessors effectively
 The advantages of leveraging a design investment by replication rather than by unique design
MULTIPROCESSOR
 Thread-level parallelism (TLP) implies the existence of multiple program counters and is exploited through MIMDs
 Multiprocessors
 Computers consisting of tightly coupled processors
 Coordination and usage controlled by a single operating system
 Share memory through a shared address space
 Multiprocessing exploits TLP in two different software models (a minimal sketch of the first follows this list)
 Parallel processing - execution of a tightly coupled set of threads collaborating on a single task
 Request-level parallelism - execution of multiple, relatively independent processes that originate from one or more users
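A minimal sketch of the parallel-processing model on a shared-memory machine. This is illustrative pthreads C code, not from the slides: each thread has its own program counter, but all threads run under one OS, share one address space, and collaborate on a single task (summing an array).

/* Illustrative sketch: TLP on a shared-memory MIMD machine.
   Compile with: cc tlp.c -lpthread */
#include <pthread.h>
#include <stdio.h>

#define NTHREADS 4
#define N 1000000

static int data[N];
static long partial[NTHREADS];     /* one slot per thread, in shared memory */

static void *worker(void *arg) {   /* each thread: its own program counter */
    long id = (long)arg;
    long lo = id * (N / NTHREADS), hi = lo + (N / NTHREADS);
    long sum = 0;
    for (long i = lo; i < hi; i++)
        sum += data[i];
    partial[id] = sum;             /* communicate through shared memory */
    return NULL;
}

int main(void) {
    for (long i = 0; i < N; i++) data[i] = 1;

    pthread_t t[NTHREADS];
    for (long id = 0; id < NTHREADS; id++)
        pthread_create(&t[id], NULL, worker, (void *)id);

    long total = 0;
    for (long id = 0; id < NTHREADS; id++) {
        pthread_join(t[id], NULL);
        total += partial[id];      /* combine the threads' results */
    }
    printf("sum = %ld (expected %d)\n", total, N);
    return 0;
}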
MULTIPROCESSOR
 Multiprocessors typically contain from two to dozens of processors
 Communicate and coordinate through the sharing of memory
 Such multiprocessors include both
 single-chip systems with multiple cores
 multiple chips, each of which may be a multicore design
MULTIPROCESSOR ARCHITECTURE
 To take advantage of an MIMD multiprocessor with n processors,
we must usually have at least n threads or processes to execute
 Independent threads within a single process are typically
identified by the programmer or created by the OS
 Grain size
 The amount of computation assigned to a thread
 Important in considering how to exploit TLP efficiently
 Threads consist of hundreds to millions of instructions that may
be executed in parallel

THREADS AND DLP
 Threads can also be used to exploit data-level parallelism (DLP)
 The overhead, however, is likely to be higher than with a SIMD processor or a GPU
 The grain size must be sufficiently large to exploit the parallelism efficiently (see the sketch below)
 When the parallelism is split among many threads, the grain size may be so small that the overhead makes exploiting the parallelism prohibitively expensive on an MIMD

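A sketch of how grain size is set in practice (illustrative code and values, not from the slides): each thread receives a contiguous chunk of iterations, and the chunk length is the grain size. If the grain were only a few iterations, the cost of pthread_create/pthread_join would dwarf the useful work, which is why thread-based DLP needs a far larger grain than a SIMD processor or a GPU.

#include <pthread.h>
#include <stdio.h>

typedef struct { long lo, hi; } range_t;

/* Per-thread work: this loop stands in for the hundreds to millions
   of instructions a thread should execute to amortize its overhead. */
static void *worker(void *arg) {
    range_t *r = (range_t *)arg;
    double sum = 0.0;
    for (long i = r->lo; i < r->hi; i++)
        sum += (double)i * 0.5;    /* same operation on different data: DLP */
    printf("thread [%ld, %ld): grain = %ld iterations, partial = %g\n",
           r->lo, r->hi, r->hi - r->lo, sum);
    return NULL;
}

int main(void) {
    const long n = 1000003;                      /* total work items */
    enum { NTHREADS = 4 };                       /* MIMD thread contexts */
    pthread_t t[NTHREADS];
    range_t r[NTHREADS];
    long grain = (n + NTHREADS - 1) / NTHREADS;  /* iterations per thread */

    for (long i = 0; i < NTHREADS; i++) {
        r[i].lo = i * grain;
        r[i].hi = (r[i].lo + grain < n) ? r[i].lo + grain : n;
        pthread_create(&t[i], NULL, worker, &r[i]);
    }
    for (long i = 0; i < NTHREADS; i++)
        pthread_join(t[i], NULL);
    return 0;
}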
CLASSES OF SHARED MEMORY
MULTIPROCESSORS
 Based on the number of processors involved, which in turn dictates the memory organization and interconnect strategy, there are two classes:
 Symmetric (shared-memory) multiprocessors (SMPs), or centralized shared-memory multiprocessors
 Distributed shared memory (DSM)
 SMPs have small numbers of cores, typically eight or fewer, so it is possible for the processors to share a single centralized memory with all processors having equal access
 In multicore chips, the memory is effectively shared in a centralized fashion among the cores, and all existing multicores are SMPs
 SMP architectures are also sometimes called uniform memory access (UMA) multiprocessors
UMA
 Multiple processor–cache subsystems share the same physical memory, typically with one level of shared cache and one or more levels of private per-core cache
 The key architectural property is the uniform access time to all of the memory from all of the processors
DSM

 A multiprocessor with physically distributed memory
 Distributing the memory among the nodes both increases the bandwidth and reduces the latency to local memory
 Called NUMA (nonuniform memory access), since the access time depends on the location of a data word in memory
CHALLENGES OF PARALLEL
PROCESSING
 The application of multiprocessors ranges from running
independent tasks with essentially no communication to running
parallel programs where threads must communicate to complete
the task
 Two important hurdles make parallel processing challenging
 The first is the limited parallelism available in programs
 The second arises from the relatively high cost of communication
 Limitations in available parallelism make it difficult to achieve good speedups in any parallel processor
 Suppose you want to achieve a speedup of 80 with 100 processors
 What fraction of the original computation can be sequential?
 Assume that the program operates in only two modes:
 Parallel, with all processors fully used (enhanced mode), or
 Serial, with only one processor in use


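A short worked solution, reconstructed here with Amdahl's Law (the slides give only the final answer):

\[
\text{Speedup} \;=\; \frac{1}{(1 - F_{\text{parallel}}) + F_{\text{parallel}}/100}
\]
\begin{align*}
80 &= \frac{1}{(1 - F) + F/100}\\
(1 - F) + F/100 &= 1/80 = 0.0125\\
1 - 0.99F &= 0.0125\\
F &= 0.9875/0.99 \approx 0.9975
\end{align*}
The sequential fraction is therefore \(1 - F \approx 0.0025\), i.e., 0.25%.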
 To achieve a speedup of 80 with 100 processors, only 0.25% of the
original computation can be sequential.
