Week_4

The document discusses the evolution of parallel computing, emphasizing the shift from faster processors to wider, multicore architectures that require rethinking algorithms for parallel execution. It outlines the differences between generic multicore and many-core chips, focusing on memory hierarchies and cache designs, including private versus shared caches. The advantages of each cache type are also explored, highlighting their impact on performance and access speed.


Parallel Computing Landscape
(CS 526)

Muhammad Nadeem Nadir
Department of Computer Science,
The University of Lahore

The "New" Moore's Law
• Computers no longer get faster, just wider

• You must re-think your algorithms to be parallel!

• Data-parallel computing is the most scalable solution: 2 cores, 4 cores, 8 cores, 16 cores…
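The scaling claim can be sketched with a small data-parallel reduction: the same independent operation is mapped over every element, so adding cores adds throughput. A minimal sketch, assuming a pool of worker processes (the worker function and worker count below are illustrative, not from the slides):

```python
from multiprocessing import Pool

def square(x):
    # Independent per-element work: no communication between elements,
    # which is what makes the computation data-parallel and scalable.
    return x * x

def parallel_sum_of_squares(data, workers=4):
    # Map the same operation over every element across worker
    # processes, then reduce the partial results into one sum.
    with Pool(workers) as pool:
        return sum(pool.map(square, data))

if __name__ == "__main__":
    print(parallel_sum_of_squares(range(1000)))
```

Doubling `workers` (2, 4, 8, 16, …) leaves the result unchanged; only the wall-clock time shrinks on a wide enough machine.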
Generic Multicore Chip

[Diagram: a few processors, each with its own local on-chip memory, all connected to a shared global memory]

• A handful of processors, each supporting ~1 hardware thread

• On-chip memory near the processors (cache, RAM, or both)

• Shared global memory space (external DRAM)
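A quick way to see how "wide" a particular machine is — how many hardware threads the OS exposes — is available in the standard library; a minimal sketch:

```python
import os

# Number of logical CPUs visible to the OS
# (physical cores x hardware threads per core, on SMT machines).
logical_cpus = os.cpu_count()
print(f"logical CPUs: {logical_cpus}")
```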


Generic Many-core Chip

[Diagram: many processors, each with its own local on-chip memory, all connected to a shared global memory]

• Many processors, each supporting many hardware threads

• On-chip memory near the processors (cache, RAM, or both)

• Shared global memory space (external DRAM)


Emergence of Parallel Architectures

• Multi-core processors: processors having n computing cores

[Chart: transistor counts continue to grow, while clock speeds, power, and per-clock performance (ILP) have leveled off — the trend behind the shift to multi-cores]
The Memory Hierarchy

• With simultaneous multithreading (SMT) only: all caches are shared

• Multi-core chips:
– L1 caches are private
– L2 caches are private in some architectures and shared in others

• Main memory is always shared
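"Memory is always shared" is visible directly from software: threads running on different cores read and write the same address space, so one thread's writes are seen by the others. A minimal sketch (the counter and thread count are illustrative):

```python
import threading

counter = 0              # one location in the shared address space
lock = threading.Lock()  # hardware keeps caches coherent, but a concurrent
                         # read-modify-write still needs synchronization

def work(increments):
    global counter
    for _ in range(increments):
        with lock:
            counter += 1

threads = [threading.Thread(target=work, args=(10_000,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(counter)  # 4 threads x 10,000 increments = 40000
```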


Multi-cores – Memory Hierarchies

• Example: dual-core Intel Xeon processors

• Each core is hyper-threaded (SMT)

• Private L1 caches

• Shared L2 cache

[Diagram: CORE0 and CORE1, each running hyper-threads with a private L1 cache, sharing a single L2 cache in front of memory]
Designs with Private L2 Caches

[Diagram, left: CORE0 and CORE1, each with a private L1 and a private L2 cache, in front of memory — both L1 and L2 are private. Examples: AMD Opteron, AMD Athlon, Intel Pentium D]

[Diagram, right: CORE0 and CORE1, each with private L1 and L2 caches, plus a shared L3 cache in front of memory — a design with L3 caches. Example: Intel Itanium 2]
Private vs Shared Caches?

• Advantages of each?
Private vs Shared Caches

• Advantages of private caches:
– Closer to the core, so faster access
– Reduced contention

• Advantages of shared caches:
– Threads on different cores can share the same cached data
– More cache space is available when only a single (or a few) high-performance thread(s) run on the system