Unit 5
Unit 5
Vector Processing
By Vishva Gandhi
What is parallel processing?
• Parallel processing in pipelining refers to the concurrent execution of multiple instructions or
tasks within a pipeline. It involves dividing the work among multiple pipeline stages or multiple
independent pipelinesto achieve increased throughput and performance.
• Basic Pipeline Stages:
Instruction Fetch (IF): Fetches the next instruction from memory.
Instruction Decode (ID): Decodes the fetched instruction and reads operands.
Execution (EX): Executes the instruction or computes the result.
Memory Access (MEM): Accesses memory if needed (e.g., load/store operations).
• RISC stands for Reduced Instruction Set Computers. It was introduced to execute as fast as
one instruction per clock cycle. This RISC pipeline helps to simplify the computer
architecture’s design.
• Principles of RISCs Pipeline
Keep the most frequently accessed operands in CPU registers.
It can minimize the register-to-memory operations.
It can use a high number of registers to enhance operand referencing and decrease the
processor memory traffic.
It can optimize the design of instruction pipelines such that minimum compiler code
generation can be achieved.
Cont..
• It can use a simplified instruction set and leave out those complex and unnecessary
instructions.
RISC (Reduced Instruction Set Computer) pipelining is a technique used in RISC-
based processor architectures to improve instruction throughput and overall
performance.
In RISC architectures, instructions are typically designed to execute in a fixed
number of clock cycles, simplifying the instruction decoding and execution process.
Pipelining in RISC architectures involves breaking down the execution of instructions
into multiple stages, each performed by a separate pipeline stage. Here's how RISC
pipelining works:
Cont…
Complex Instruction Set Architecture (CISC)
Uses only Hardwired control unit Uses both hardwired and microprogrammed control unit
Can perform only Register to Register Arithmetic operations Can perform REG to REG or REG to MEM or MEM to MEM
An instruction executed in a single clock cycle Instruction takes more than one clock cycle
Array processors with example.
• An array processor has an architecture mainly designed for processing arrays of numbers.
• This processor architecture contains a number of processors that works simultaneously.
• each handling one array element, so that a single operation is applied to all the array
elements in parallel.
• To get the same effect within a conventional processor, the operation should be applied to
every array element sequentially and much more slowly.
Array processors perform computation on large amounts of data.
It has mainly two types: -
1. Attached Array Processor
2. SIMD Processor
Attached array processors are generally used for handling numerical computations on host
computer which is used to perform operations onauxiliary memory.
Attached array processors have two interfaces: general purpose computerhas main memory
interface and attached array processor has cache.
Cont…
Cache memory interconnects the main memory.
A host computer is general purpose computer, and the attached array processoris a backend
machine which is controlled by a host computer.
The array processor is connected through input/output interface or acontroller to the
computer and a computer treats it as an external interface.
Cache
Memory
Thank you