0020.matrix Multiplication Systolic

1) Systolic arrays replace a single processor with an array of simple processing elements (PEs) to achieve high-throughput computation with fewer memory accesses. 2) Unlike a pipeline, the array can be nonlinear with multidirectional data flow, and unlike SIMD, each PE may perform a different operation while exchanging data with its neighbors. 3) A 3x3 systolic array for matrix multiplication is presented as an example, in which each PE computes and accumulates one element of the product matrix.

Uploaded by

Tejas.S


Systolic Architectures

(Slides from Shaaban)

• Replace single processor with an array of regular processing elements
• Orchestrate data flow for high throughput with less memory access

[Figure: memory (M) connected to a single PE, contrasted with memory (M) connected to a chain of PEs]

• Different from pipelining

– Nonlinear array structure, multidirectional data flow; each PE may have (small) local instruction and data memory
• Different from SIMD: each PE may do something different
• Initial motivation: VLSI enables inexpensive special-purpose chips
• Represent algorithms directly by chips connected in regular pattern

EECC756 - Shaaban
#1 lec # 1 Spring 2003 3-11-2003
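Systolic Array Example:
3x3 Systolic Array Matrix Multiplication
• Processors arranged in a 2-D grid
• Each processor accumulates one element of the product

Alignments in time, T=0 (no data has entered the array yet)

Columns of B, entering the grid from the top (bottom line first):

              b2,2
       b2,1   b1,2
b2,0   b1,1   b0,2
b1,0   b0,1
b0,0

Rows of A, entering the grid from the left (rightmost element first):

              a0,2   a0,1   a0,0
       a1,2   a1,1   a1,0
a2,2   a2,1   a2,0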
Example source: http://www.cs.hmc.edu/courses/2001/spring/cs156/
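The staggered alignment of the inputs can be generated programmatically. The following is a minimal sketch, not taken from the slides: the function name `skewed_feeds` and the use of `None` for "nothing enters on this edge" are assumptions made for illustration. It assumes that row i of A and column i of B are each delayed by i steps, which is what the T=0 alignment suggests.

```python
from typing import List, Optional, Tuple

Matrix = List[List[float]]
Feed = List[List[Optional[float]]]


def skewed_feeds(A: Matrix, B: Matrix) -> Tuple[Feed, Feed]:
    """Build the staggered input schedule for an n x n systolic array.

    a_feed[t][i] is the element of A entering row i from the left at step T = t + 1.
    b_feed[t][j] is the element of B entering column j from the top at step T = t + 1.
    None means nothing enters on that edge at that step.
    """
    n = len(A)
    steps = 2 * n - 1  # the last element (row/column n-1, index n-1) enters at step 2n-1
    a_feed: Feed = [[None] * n for _ in range(steps)]
    b_feed: Feed = [[None] * n for _ in range(steps)]
    for t in range(steps):
        for i in range(n):
            k = t - i  # row i of A and column i of B are delayed by i steps
            if 0 <= k < n:
                a_feed[t][i] = A[i][k]
                b_feed[t][i] = B[k][i]
    return a_feed, b_feed
```

For a 3x3 problem the schedule has five steps, matching the five staggered lines of B in the alignment figure.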
Systolic Array Example:
3x3 Systolic Array Matrix Multiplication
T=1 (partial sum held in each PE; PE(i,j) is the processor in row i, column j)
• PE(0,0) = a0,0*b0,0
• All other PEs are still empty
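The slides do not spell out what a single PE does on each step. A plausible sketch, under the assumption that every PE multiplies the pair of operands it receives, adds the product to a local accumulator, and hands the operands on to its right and lower neighbors one step later, looks like this. The class name `PE` and its `step` method are hypothetical names chosen for illustration.

```python
from typing import Optional, Tuple


class PE:
    """One processing element of a multiply-accumulate systolic array (illustrative)."""

    def __init__(self) -> None:
        self.acc = 0.0                    # running sum for one element of the product
        self.a: Optional[float] = None    # operand from A currently held
        self.b: Optional[float] = None    # operand from B currently held

    def step(
        self, a_in: Optional[float], b_in: Optional[float]
    ) -> Tuple[Optional[float], Optional[float]]:
        """Advance one time step.

        Returns the operands held during the previous step, which the right
        neighbor (a) and lower neighbor (b) receive as their inputs.
        """
        a_out, b_out = self.a, self.b
        self.a, self.b = a_in, b_in
        if a_in is not None and b_in is not None:
            self.acc += a_in * b_in
        return a_out, b_out
```

Feeding the top-left PE the pairs (a0,0, b0,0), (a0,1, b1,0), (a0,2, b2,0) on three consecutive steps leaves the dot product of row 0 of A and column 0 of B in `acc`.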
Systolic Array Example:
3x3 Systolic Array Matrix Multiplication
T=2 (partial sum held in each PE; PE(i,j) is the processor in row i, column j)
• PE(0,0) = a0,0*b0,0 + a0,1*b1,0
• PE(0,1) = a0,0*b0,1
• PE(1,0) = a1,0*b0,0
• All other PEs are still empty
Systolic Array Example:
3x3 Systolic Array Matrix Multiplication
T=3 (partial sum held in each PE; PE(i,j) is the processor in row i, column j)
• PE(0,0) = a0,0*b0,0 + a0,1*b1,0 + a0,2*b2,0 (complete)
• PE(0,1) = a0,0*b0,1 + a0,1*b1,1
• PE(0,2) = a0,0*b0,2
• PE(1,0) = a1,0*b0,0 + a1,1*b1,0
• PE(1,1) = a1,0*b0,1
• PE(2,0) = a2,0*b0,0
• All other PEs are still empty
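The pattern in the snapshots up to this point can be summarized by a timing rule. This is an inference from the T=1 to T=3 states rather than something the slides state explicitly; the notation PE(i,j) for the processor in row i, column j, the symbol c for the product, and the index k are introduced here for illustration.

```latex
% C = A B for 3x3 matrices, indices starting at 0
c_{i,j} = \sum_{k=0}^{2} a_{i,k}\, b_{k,j}

% At step T, PE(i,j) adds the term with index
k = T - 1 - i - j, \qquad \text{provided } 0 \le k \le 2

% so PE(i,j) holds its complete sum after step
T_{\text{done}}(i,j) = i + j + 3
```

For example, at T=3 the PE in row 0, column 1 has k=1 and adds a0,1*b1,1, while the PE in row 2, column 2 does not complete its sum until T=7.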
Systolic Array Example:
3x3 Systolic Array Matrix Multiplication
T=4 (partial sum held in each PE; PE(i,j) is the processor in row i, column j)
• PE(0,0) = a0,0*b0,0 + a0,1*b1,0 + a0,2*b2,0 (complete)
• PE(0,1) = a0,0*b0,1 + a0,1*b1,1 + a0,2*b2,1 (complete)
• PE(0,2) = a0,0*b0,2 + a0,1*b1,2
• PE(1,0) = a1,0*b0,0 + a1,1*b1,0 + a1,2*b2,0 (complete)
• PE(1,1) = a1,0*b0,1 + a1,1*b1,1
• PE(1,2) = a1,0*b0,2
• PE(2,0) = a2,0*b0,0 + a2,1*b1,0
• PE(2,1) = a2,0*b0,1
• PE(2,2) is still empty
Systolic Array Example:
3x3 Systolic Array Matrix Multiplication
T=5 (partial sum held in each PE; PE(i,j) is the processor in row i, column j)
• PE(0,0) = a0,0*b0,0 + a0,1*b1,0 + a0,2*b2,0 (complete)
• PE(0,1) = a0,0*b0,1 + a0,1*b1,1 + a0,2*b2,1 (complete)
• PE(0,2) = a0,0*b0,2 + a0,1*b1,2 + a0,2*b2,2 (complete)
• PE(1,0) = a1,0*b0,0 + a1,1*b1,0 + a1,2*b2,0 (complete)
• PE(1,1) = a1,0*b0,1 + a1,1*b1,1 + a1,2*b2,1 (complete)
• PE(1,2) = a1,0*b0,2 + a1,1*b1,2
• PE(2,0) = a2,0*b0,0 + a2,1*b1,0 + a2,2*b2,0 (complete)
• PE(2,1) = a2,0*b0,1 + a2,1*b1,1
• PE(2,2) = a2,0*b0,2
Systolic Array Example:
3x3 Systolic Array Matrix Multiplication
T=6 (partial sum held in each PE; PE(i,j) is the processor in row i, column j)
• PE(0,0) = a0,0*b0,0 + a0,1*b1,0 + a0,2*b2,0 (complete)
• PE(0,1) = a0,0*b0,1 + a0,1*b1,1 + a0,2*b2,1 (complete)
• PE(0,2) = a0,0*b0,2 + a0,1*b1,2 + a0,2*b2,2 (complete)
• PE(1,0) = a1,0*b0,0 + a1,1*b1,0 + a1,2*b2,0 (complete)
• PE(1,1) = a1,0*b0,1 + a1,1*b1,1 + a1,2*b2,1 (complete)
• PE(1,2) = a1,0*b0,2 + a1,1*b1,2 + a1,2*b2,2 (complete)
• PE(2,0) = a2,0*b0,0 + a2,1*b1,0 + a2,2*b2,0 (complete)
• PE(2,1) = a2,0*b0,1 + a2,1*b1,1 + a2,2*b2,1 (complete)
• PE(2,2) = a2,0*b0,2 + a2,1*b1,2
Systolic Array Example:
3x3 Systolic Array Matrix Multiplication
T=7: Done (final sum held in each PE; PE(i,j) is the processor in row i, column j)
• PE(0,0) = a0,0*b0,0 + a0,1*b1,0 + a0,2*b2,0
• PE(0,1) = a0,0*b0,1 + a0,1*b1,1 + a0,2*b2,1
• PE(0,2) = a0,0*b0,2 + a0,1*b1,2 + a0,2*b2,2
• PE(1,0) = a1,0*b0,0 + a1,1*b1,0 + a1,2*b2,0
• PE(1,1) = a1,0*b0,1 + a1,1*b1,1 + a1,2*b2,1
• PE(1,2) = a1,0*b0,2 + a1,1*b1,2 + a1,2*b2,2
• PE(2,0) = a2,0*b0,0 + a2,1*b1,0 + a2,2*b2,0
• PE(2,1) = a2,0*b0,1 + a2,1*b1,1 + a2,2*b2,1
• PE(2,2) = a2,0*b0,2 + a2,1*b1,2 + a2,2*b2,2
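The whole T=0 to T=7 sequence can be reproduced with a small software simulation. The sketch below is an illustration only and is not code from the slides: the function name `systolic_matmul` is hypothetical, and it assumes that A values move one PE to the right and B values one PE down on every step, with row i and column j of the inputs delayed by i and j steps respectively.

```python
from typing import List, Optional, Tuple

Matrix = List[List[float]]


def systolic_matmul(A: Matrix, B: Matrix) -> Tuple[Matrix, int]:
    """Simulate an n x n output-stationary systolic array step by step.

    Returns the product matrix and the number of steps taken (3n - 2).
    """
    n = len(A)
    acc: Matrix = [[0] * n for _ in range(n)]            # one accumulator per PE
    a_reg: List[List[Optional[float]]] = [[None] * n for _ in range(n)]
    b_reg: List[List[Optional[float]]] = [[None] * n for _ in range(n)]
    total_steps = 3 * n - 2

    for t in range(total_steps):                         # t = T - 1
        new_a = [[None] * n for _ in range(n)]
        new_b = [[None] * n for _ in range(n)]
        for i in range(n):
            for j in range(n):
                if j == 0:
                    k = t - i                            # row i enters i steps late
                    new_a[i][j] = A[i][k] if 0 <= k < n else None
                else:
                    new_a[i][j] = a_reg[i][j - 1]        # take a from the left neighbor
                if i == 0:
                    k = t - j                            # column j enters j steps late
                    new_b[i][j] = B[k][j] if 0 <= k < n else None
                else:
                    new_b[i][j] = b_reg[i - 1][j]        # take b from the upper neighbor
        a_reg, b_reg = new_a, new_b
        for i in range(n):
            for j in range(n):
                if a_reg[i][j] is not None and b_reg[i][j] is not None:
                    acc[i][j] += a_reg[i][j] * b_reg[i][j]
    return acc, total_steps
```

For 3x3 inputs the simulation runs for seven steps, in line with the "Done" state at T=7, and the accumulators then hold the ordinary matrix product.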