
Multiprocessor Architecture

Why Choose a Multiprocessor?


• A single CPU can only go so fast; use more
than one CPU to improve performance

• Multiple users

• Multiple applications

• Multi-tasking within an application

• Responsiveness and/or throughput

• Share hardware between CPUs


Multiprocessor Symmetry
• In a multiprocessing system, all CPUs may be equal, or
some may be reserved for special purposes.
• A combination of hardware and operating-system
software design considerations determine the symmetry.

• Systems that treat all CPUs equally are called symmetric
multiprocessing (SMP) systems.
• If all CPUs are not equal, system resources may be
divided in a number of ways, including asymmetric
multiprocessing (ASMP), non-uniform memory access
(NUMA) multiprocessing, and clustered multiprocessing.
Instruction and Data Streams
Multiprocessors can be used in different ways:
• Uniprocessors (single-instruction, single-data or SISD);
• Within a single system to execute multiple, independent
sequences of instructions in multiple contexts (multiple-
instruction, multiple-data or MIMD);
• A single sequence of instructions in multiple contexts
(single-instruction, multiple-data or SIMD, often used in
vector processing);
• Multiple sequences of instructions in a single context
(multiple-instruction, single-data or MISD, used for
redundancy in fail-safe systems and sometimes applied
to describe pipelined processors).
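
As a rough illustration of the SIMD item above (not from the original slides; the array names and sizes are arbitrary), the Python sketch below contrasts a scalar, one-element-at-a-time loop with a single vectorized operation applied across a whole data set; NumPy dispatches the latter to optimized code that can use SIMD hardware.

import numpy as np

a = np.arange(1_000_000, dtype=np.float64)
b = np.ones(1_000_000, dtype=np.float64)

# SISD-style: one instruction stream touching one data element per step.
scalar_sum = [a[i] + b[i] for i in range(len(a))]

# SIMD-style: one logical "add" applied to many data elements at once.
vector_sum = a + b

assert np.allclose(scalar_sum, vector_sum)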
Processor Coupling
Tightly-coupled multiprocessor systems:
• Contain multiple CPUs that are connected at the bus level.
• These CPUs may have access to a central shared memory
(Symmetric Multiprocessing, or SMP), or may participate in a
memory hierarchy with both local and shared memory (Non-
Uniform Memory Access, or NUMA).
• Examples: IBM p690 Regatta; chip multiprocessors, also
known as multi-core processors.

Loosely-coupled multiprocessor systems:


• Often referred to as clusters
• Based on multiple standalone single- or dual-processor
commodity computers interconnected via a high-speed
communication system, such as Gigabit Ethernet.
• Example: Linux Beowulf cluster
Multiprocessor Communication
Architectures
Message Passing
• Separate address space for each processor
• Processors communicate via message passing
• Processors have private memories
• Focuses attention on costly non-local operations

Shared Memory
• Processors communicate with shared address space
• Processors communicate by memory read/write
• Easy on small-scale machines
• Lower latency
• SMP or NUMA
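
A minimal sketch of the two communication styles, using Python's multiprocessing module as a stand-in for separate processors (an assumption for illustration; the worker function names are made up): message passing moves data through an explicit channel between private address spaces, while shared memory lets both sides read and write one common location.

from multiprocessing import Process, Queue, Value

def worker_msg(q):
    # Message passing: the result travels through an explicit channel.
    q.put(21 * 2)

def worker_shm(shared):
    # Shared memory: the result is written directly into shared storage.
    with shared.get_lock():
        shared.value = 21 * 2

if __name__ == "__main__":
    q = Queue()
    Process(target=worker_msg, args=(q,)).start()
    print("message passing result:", q.get())

    shared = Value("i", 0)
    p = Process(target=worker_shm, args=(shared,))
    p.start()
    p.join()
    print("shared memory result:", shared.value)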
Shared-Memory Processors
• Single copy of the OS (although some parts might be
parallel)
• Relatively easy to program
• Difficult to scale to large numbers of processors
[Figure: UMA machine block diagram; processors 1..N, each with its own cache, connected through an interconnection network to memories 1..M]


Types of Shared-Memory Architectures
UMA
• Uniform Memory Access
• Access to all memory occurs at the same speed for
all processors.
• We will focus on UMA today.
NUMA
• Non-Uniform Memory Access
• “Distributed Shared Memory”.
• Typically interconnection is grid or hypercube.
• Access to some parts of memory is faster for some
processors than other parts of memory.
• Harder to program, but scales to more processors
Bus Based UMA
(a) Simplest MP: more than one processor on a single bus
connects to memory; bus bandwidth becomes
a bottleneck.
(b) Each processor has a cache to reduce the need
to access memory.
(c) To further scale the number of processors, each
processor is given private local memory.
NUMA
• All memories can be addressed by all processors, but access to a
processor’s own local memory is faster than access to another
processor’s remote memory.
• Looks like a distributed machine, but the interconnection network is
usually custom-designed switches and/or buses.
OS Option 1
Each CPU has its own OS
• Statically allocate physical memory to each CPU
• Each CPU runs its own independent OS
• Share peripherals
• Each CPU handles the system calls of its own processes
• Used in early multiprocessor systems
• Simple to implement
• Avoids concurrency issues by not sharing
• Issues: 1. Each processor has its own scheduling queue.
2. Each processor has its own memory partition.
3. Consistency is an issue with independent disk buffer caches and
potentially shared files.
OS Option 2
Master-Slave Multiprocessors
• OS mostly runs on a single fixed CPU.
• User-level applications run on the other CPUs.
• All system calls are passed to the Master CPU for processing
• Very little synchronisation required
• Simple to implement
• Single centralised scheduler to keep all processors busy
• Memory can be allocated as needed to all CPUs.
• Issues: Master CPU becomes the bottleneck.
OS Option 3
Symmetric Multiprocessors (SMP)
• OS kernel runs on all processors, while load and resources are balanced
between all processors.
• One alternative: A single mutex (mutual exclusion object) that makes the
entire kernel one large critical section; only one CPU can be in the kernel at a
time; only slightly better than master-slave
• Better alternative: Identify independent parts of the kernel and make each of
them its own critical section, which allows parallelism in the kernel (see the
sketch after this list)
• Issues: A difficult task; the code is mostly similar to uniprocessor code; the
hard part is identifying independent parts that don't interfere with each other
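
The sketch below illustrates the two alternatives with ordinary Python locks (an analogy only, not kernel code; the subsystem names are invented): a single big lock serializes every kernel entry, while per-subsystem locks let unrelated kernel paths run in parallel.

import threading

big_kernel_lock = threading.Lock()   # alternative 1: the whole kernel is one critical section

scheduler_lock = threading.Lock()    # alternative 2: independent parts get their own locks
filesystem_lock = threading.Lock()

def syscall_schedule():
    with scheduler_lock:             # excludes only other scheduler work
        pass  # ... manipulate run queues ...

def syscall_read_file():
    with filesystem_lock:            # can run concurrently with syscall_schedule()
        pass  # ... touch the buffer cache ...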
Interconnection Networks
Multiprocessor interconnection networks (INs) can be
classified based on a number of criteria. These include
(1) Mode of operation (synchronous versus asynchronous),
(2) Control strategy (centralized versus decentralized),
(3) Switching techniques (circuit versus packet),
(4) Topology (static versus dynamic).

1- Mode of Operation

• According to the mode of operation, INs are
classified as synchronous versus asynchronous. In
synchronous mode of operation, a single global
clock is used by all components in the system such
that the whole system is operating in a lock–step
manner. Asynchronous mode of operation, on the
other hand, does not require a global clock.
Handshaking signals are used instead in order to
coordinate the operation of asynchronous systems.
While synchronous systems tend to be slower
compared to asynchronous systems, they are race
and hazard-free.
2- Control Strategy

According to the control strategy, INs can be classified as centralized versus
decentralized. In centralized control systems, a single central control unit is used to
oversee and control the operation of the components of the system. In decentralized
control, the control function is distributed among different components in the
system.
3- Switching Techniques
Interconnection networks can be classified according to the switching mechanism
as circuit versus packet switching networks. In the circuit switching mechanism, a
complete path has to be established prior to the start of communication between a
source and a destination.
In a packet switching mechanism, communication between a source and a
destination takes place via messages that are divided into smaller entities, called
packets. On their way to the destination, packets are sent from one node to
another in a store-and-forward manner until they reach their destination. While
packet switching tends to use the network resources more efficiently compared to
circuit switching, it suffers from variable packet delays.
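
A toy Python sketch of the store-and-forward idea (illustration only; the node names, path, and packet size are made up): the message is split into packets, and each packet is buffered at one node before being forwarded to the next until it reaches the destination.

def send_message(message, path, packet_size=4):
    # Split the message into packets.
    packets = [message[i:i + packet_size] for i in range(0, len(message), packet_size)]
    for packet in packets:
        for hop in range(len(path) - 1):
            # The packet is stored at path[hop] and then forwarded to path[hop + 1].
            print(f"packet {packet!r}: {path[hop]} -> {path[hop + 1]}")

send_message("HELLO WORLD", path=["A", "B", "C", "D"])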

4- Topology
An interconnection network topology is a mapping function from the set
of processors and memories onto the same set of processors and
memories. In other words, the topology describes how to connect
processors and memories to other processors and memories. A fully
connected topology, for example, is a mapping in which each processor is
connected to all other processors in the computer. A ring topology is a
mapping that connects processor k to its neighbors, processors (k - 1) and
(k + 1).
In general, interconnection networks can be classified as static versus
dynamic networks. In static networks, direct fixed links are established
among nodes to form a fixed network, while in dynamic networks,
connections are established as needed. Switching elements are used to
establish connections among inputs and outputs.
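
The ring mapping above can be written down directly as a function from node k to its neighbours (k - 1) and (k + 1) taken modulo the number of nodes; the sketch below (an assumed 8-node example, not from the slides) also builds the fully connected mapping for contrast.

N = 8  # assumed number of processors

def ring_neighbours(k, n=N):
    # Ring topology: node k connects to (k - 1) and (k + 1), wrapping around.
    return ((k - 1) % n, (k + 1) % n)

ring = {k: ring_neighbours(k) for k in range(N)}
print(ring[0])  # (7, 1): node 0 connects to nodes 7 and 1

# Fully connected topology: each node connects to every other node.
fully_connected = {k: [j for j in range(N) if j != k] for k in range(N)}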

[Figure: Shared-memory interconnection networks; (a) bus-based and (b) switch-based shared-memory systems]

[Figure: Bus-based systems with a single bus versus with multiple buses]
[Figure: Examples of static topologies]
[Figure: Dynamic INs; (a) single-stage and (b) multistage]

The omega MIN connects eight sources to eight destinations. The connection from source 010 to destination 010 is shown as a bold path.
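
That bold path can be reproduced with destination-tag routing (a standard technique for omega networks; the function below is an illustrative sketch, not from the slides): between stages the line numbers are permuted by a perfect shuffle, a one-bit left rotation, and at stage i each 2x2 switch sends the packet to its upper or lower output according to bit i of the destination address, most significant bit first.

def omega_route(src, dst, n_bits=3):
    # Trace destination-tag routing through an n_bits-stage omega network.
    n = 1 << n_bits
    label = src
    path = [label]
    for stage in range(n_bits):
        # Perfect shuffle between stages: rotate the line number left by one bit.
        label = ((label << 1) | (label >> (n_bits - 1))) & (n - 1)
        # The 2x2 switch picks its upper (0) or lower (1) output from the
        # destination bit for this stage, most significant bit first.
        dst_bit = (dst >> (n_bits - 1 - stage)) & 1
        label = (label & ~1) | dst_bit
        path.append(label)
    return path

print(omega_route(0b010, 0b010))  # [2, 4, 1, 2]: source 010 reaches destination 010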

Conclusion
• Parallel processing is a key technique for future gains in
performance and effectiveness on multiprogrammed
workloads.
• MPs combine the difficulties of building complex
hardware systems and complex software systems.
• Communication, memory, affinity, and throughput have
an important influence on system cost and performance.
• On-chip MP technology appears to be growing.
Thank you
