0% found this document useful (0 votes)

429 views22 pages

Computer Performance

Uploaded by

Muntasir Sunny

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

429 views22 pages

Computer Performance

Uploaded by

Muntasir Sunny

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

CSE 317 Lecture 2

Computer
performance
The Computer Revolution
1. Progress in computer technology .
2. Makes novel applications feasible.
■ Computer in Automobile.
■ Cell phone
■ Worldwide web
■ Search engine …..etc.
Performance
■ Performance is the key to understanding underlying motivation
for the hardware and its organization
■ Measure, report, and summarize performance to enable users to
■ make intelligent choices
■ see through the marketing hype!

■ Why is some hardware better than others for different programs?

■ What factors of system performance are hardware related?
(e.g., do we need a new machine, or a new operating system?)
■ How does the machine's instruction set affect performance?
The Role of Performance
Response Time
Throughput
Relative performance
Measuring Execution time
CPU time
CPU clocking ,instruction count and CPI
What do we measure?
Define performance….
Airplane Passengers Range (mi) Speed (mph)

Boeing 737-100 101 630 598

Boeing 747 470 4150 610
BAC/Sud Concorde 132 4000 1350
Douglas DC-8-50 146 8720 544

■ How much faster is the Concorde compared to the

747?
■ How much bigger is the Boeing 747 than the Douglas
DC-8?

■ So which of these airplanes has the best performance?!

Computer Performance:
TIME, TIME, TIME!!!
■ Response Time (elapsed time, latency):
■ how long does it take for my job to run?
■ how long does it take to execute (start to Individual user
finish) my job? concerns…
■ how long must I wait for the database query?
■ Throughput:
■ how many jobs can the machine run at once?
■ what is the average execution rate? Systems manager
concerns…
■ how much work is getting done?

■ If we upgrade a machine with a new processor what do we increase?

■ If we add a new machine to the lab what do we increase?
Execution Time
■ Elapsed Time
■ counts everything (disk and memory accesses, waiting for I/O,
running other programs, etc.) from start to finish
■ a useful number, but often not good for comparison purposes
elapsed time = CPU time + wait time (I/O, other programs, etc.)

■ CPU time
■ doesn't count waiting for I/O or time spent running other programs
■ can be divided into user CPU time and system CPU time (OS calls)
CPU time = user CPU time + system CPU time
⇒ elapsed time = user CPU time + system CPU time + wait time

■ Our focus: user CPU time (CPU execution time or, simply,
execution time)
■ time spent executing the lines of code that are in our program
Definition of Performance
■ For some program running on machine X:
PerformanceX = 1 / Execution timeX
This means that for two computers X and Y, if the performance
of X is greater than the performance of Y,
We have
PerformanceX > PerformanceY
1/ execution of time X > 1/ execution time of Y
Execution timeY > Execution timeX

X is n times faster than Y means:

PerformanceX / PerformanceY = n
Example of Relative
performance
■ If Afia’s Computer runs a program in 10
seconds and Tahmid’s computer runs
the same program in 15 seconds,
whose computer is faster and by how
much faster?
Clock Cycles
■ Instead of reporting execution time in seconds, we often use
cycles. In modern computers hardware events progress cycle by
cycle: in other words, each event, e.g., multiplication, addition,
etc., is a sequence of cycles

■ Clock ticks indicate start and end of cycles:

cycle time
tick

tick

■ cycle time = time between ticks = seconds per cycle

■ clock rate (frequency) = cycles per second (1 Hz. = 1
cycle/sec, 1 MHz. = 106 cycles/sec)
■ Example: A 200 Mhz. clock has a
cycle time
Performance Equation I

equivalently

CPU execution time CPU clock cycles × Clock cycle time

=
for a program for a program

■ So, to improve performance one can either:

■ reduce the number of cycles for a program, or
■ reduce the clock cycle time, or, equivalently,
■ increase the clock rate
How many cycles are required
for a program?
■ Could assume that # of cycles = # of instructions
2nd instruction
3rd instruction
1st instruction

4th
5th
6th
...
time

■ This assumption is incorrect! Because:

■ Different instructions take different amounts of time (cycles)
■ Why…?
How many cycles are required
for a program?
time

■ Multiplication takes more time than addition

■ Floating point operations take longer than integer ones
■ Accessing memory takes more time than accessing registers
■ Important point: changing the cycle time often changes the
number of cycles required for various instructions because it
means changing the hardware design. More later…
Example
■ Our favorite program runs in 10 seconds on computer A, which
has a 400Mhz. clock.
■ We are trying to help a computer designer build a new machine
B, that will run this program in 6 seconds. The designer can use
new (or perhaps more expensive) technology to substantially
increase the clock rate, but has informed us that this increase
will affect the rest of the CPU design, causing machine B to
require 1.2 times as many clock cycles as machine A for the
same program.

■ What clock rate should we tell the designer to target?

Terminology
■ A given program will require:
■ some number of instructions (machine instructions)
■ some number of cycles
■ some number of seconds
■ We have a vocabulary that relates these quantities:
■ cycle time (seconds per cycle)
■ clock rate (cycles per second)
■ (average) CPI (cycles per instruction)
■ a floating point intensive application might have a higher average CPI
■ MIPS (millions of instructions per second)
■ this would be higher for a program using simple instructions
Performance Measure
■ Performance is determined by execution time

■ Do any of these other variables equal performance?

■ # of cycles to execute program?
■ # of instructions in program?
■ # of cycles per second?
■ average # of cycles per instruction?
■ average # of instructions per second?

■ Common pitfall : thinking one of the variables is indicative of

performance when it really isn’t
Performance Equation II
CPU execution time Instruction count × average CPI × Clock cycle time
=
for a program for a program

■ Derive the above equation from Performance Equation I

CPI Example I
■ Suppose we have two implementations of the same instruction
set architecture (ISA). For some program:
■ machine A has a clock cycle time of 10 ns. and a CPI of 2.0
■ machine B has a clock cycle time of 20 ns. and a CPI of 1.2

■ Which machine is faster for this program, and by how much?

■ If two machines have the same ISA, which of our quantities (e.g., clock
rate, CPI, execution time, # of instructions, MIPS) will always be
identical?
CPI Example II
■ A compiler designer is trying to decide between two code
sequences for a particular machine.
■ Based on the hardware implementation, there are three
different classes of instructions: Class A, Class B, and Class C,
and they require 1, 2 and 3 cycles (respectively).
■ The first code sequence has 5 instructions:
2 of A, 1 of B, and 2 of C
The second sequence has 6 instructions:
4 of A, 1 of B, and 1 of C.

■ Which sequence will be faster? How much? What is the CPI for each
sequence?
MIPS Example

■ Two different compilers are being tested for a 100 MHz.

machine with three different classes of instructions: Class A,
Class B, and Class C, which require 1, 2 and 3 cycles
(respectively). Both compilers are used to produce code for a
large piece of software.
■ Compiler 1 generates code with 5 million Class A instructions, 1
million Class B instructions, and 1 million Class C instructions.
■ Compiler 2 generates code with 10 million Class A instructions, 1
million Class B instructions, and 1 million Class C instructions.

■ Which sequence has the higher MIPS rating?

■ Which sequence will be faster according to execution time?
Benchmarks
■ Performance best determined by running a real application
■ use programs typical of expected workload
■ or, typical of expected class of applications
e.g., compilers/editors, scientific applications, graphics, etc.

■ Small benchmarks
■ nice for architects and designers
■ easy to standardize
■ can be abused!

■ Benchmark suites
■ Perfect Club: set of application codes
■ Livermore Loops: 24 loop kernels
■ Linpack: linear algebra package
■ SPEC: mix of code from industry organization
Summary
■ Performance is specific to a particular program
■ total execution time is a consistent summary of performance
■ For a given architecture performance increases come from:
■ increases in clock rate (without adverse CPI affects)
■ improvements in processor organization that lower CPI
■ compiler enhancements that lower CPI and/or instruction count
■ Pitfall: expecting improvement in one aspect of a machine’s
performance to affect the total performance

CIE3301 C56i
No ratings yet
CIE3301 C56i
54 pages
C A Lecture-3
No ratings yet
C A Lecture-3
41 pages
Ilovepdf_merged (4) 36 274 Converted
No ratings yet
Ilovepdf_merged (4) 36 274 Converted
120 pages
2_Computer Organization and Architecture
No ratings yet
2_Computer Organization and Architecture
21 pages
Computer Organization & Design The Hardware/Software Interface, 2nd Edition Patterson & Hennessy
80% (5)
Computer Organization & Design The Hardware/Software Interface, 2nd Edition Patterson & Hennessy
118 pages
Unit 2 Performance
No ratings yet
Unit 2 Performance
6 pages
Lecture-4
No ratings yet
Lecture-4
37 pages
M365 Excel Basics Video 09 - VLOOKUP and XLOOKUP Functions
No ratings yet
M365 Excel Basics Video 09 - VLOOKUP and XLOOKUP Functions
6,575 pages
Computer Performance
No ratings yet
Computer Performance
18 pages
Week 10 Part 02 - Processor Performance (Q Only) - Tagged 2
No ratings yet
Week 10 Part 02 - Processor Performance (Q Only) - Tagged 2
23 pages
05 Performance
No ratings yet
05 Performance
16 pages
Lec10 Performance
No ratings yet
Lec10 Performance
22 pages
4 Perfrmance
No ratings yet
4 Perfrmance
30 pages
Computer Performance
No ratings yet
Computer Performance
17 pages
1ACA_L1
No ratings yet
1ACA_L1
35 pages
02 Performance
No ratings yet
02 Performance
23 pages
PreSonus Studio One 4 Reference Manual English 4.0.0.2 Unofficial
67% (3)
PreSonus Studio One 4 Reference Manual English 4.0.0.2 Unofficial
564 pages
Performances of Computer Systems: CSE 675.02: Introduction To Computer Architecture
No ratings yet
Performances of Computer Systems: CSE 675.02: Introduction To Computer Architecture
52 pages
Module 2 [26-10-2024]
No ratings yet
Module 2 [26-10-2024]
50 pages
DA_CI
No ratings yet
DA_CI
13 pages
Lecture # 2
No ratings yet
Lecture # 2
33 pages
Performance Measures
No ratings yet
Performance Measures
25 pages
The Role of Performance: Chapter - 2
No ratings yet
The Role of Performance: Chapter - 2
40 pages
Performance Measures For Computers
No ratings yet
Performance Measures For Computers
53 pages
09 Perf
No ratings yet
09 Perf
22 pages
Lecture 02 CH01 Performance Power
No ratings yet
Lecture 02 CH01 Performance Power
76 pages
Puter Performance
No ratings yet
Puter Performance
15 pages
COMP 303 Computer Architecture
No ratings yet
COMP 303 Computer Architecture
34 pages
Computer Architecture 2
No ratings yet
Computer Architecture 2
17 pages
L-2 (Computer Performance)
No ratings yet
L-2 (Computer Performance)
52 pages
Performance
No ratings yet
Performance
51 pages
DHXD - Chuong 8. Performance
No ratings yet
DHXD - Chuong 8. Performance
27 pages
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
No ratings yet
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
56 pages
CSE 332 L4 - 14 Nov 2020
No ratings yet
CSE 332 L4 - 14 Nov 2020
41 pages
Week 2 - Lecture 2 - Performance Measurement
No ratings yet
Week 2 - Lecture 2 - Performance Measurement
25 pages
Computer Architecture Measurement
No ratings yet
Computer Architecture Measurement
26 pages
Lecture4 Performance Evaluation 2011
No ratings yet
Lecture4 Performance Evaluation 2011
34 pages
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
No ratings yet
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
52 pages
Cse - 321 - 2
No ratings yet
Cse - 321 - 2
37 pages
Assessing and Understanding Performance
No ratings yet
Assessing and Understanding Performance
31 pages
L-2 (Computer Performance)
No ratings yet
L-2 (Computer Performance)
47 pages
Module Pelatihan Lookup 1
No ratings yet
Module Pelatihan Lookup 1
12 pages
Week 13 14 - Performance Evaluation
No ratings yet
Week 13 14 - Performance Evaluation
19 pages
Latitude E5440 Laptop - Users Guide - en Us PDF
No ratings yet
Latitude E5440 Laptop - Users Guide - en Us PDF
84 pages
Chapter 1 Introduction
No ratings yet
Chapter 1 Introduction
17 pages
Measuring Computer Performance
No ratings yet
Measuring Computer Performance
26 pages
M116C 1 M116C 1 Lect02-Performance
No ratings yet
M116C 1 M116C 1 Lect02-Performance
23 pages
SP21 BCS 022 (Class - Assignment 03)
No ratings yet
SP21 BCS 022 (Class - Assignment 03)
5 pages
Co Unit1 Part3
No ratings yet
Co Unit1 Part3
11 pages
Lect 1
No ratings yet
Lect 1
56 pages
Computer Organization The Role of Performance
No ratings yet
Computer Organization The Role of Performance
45 pages
Measuring Performance: Chris Clack B261 Systems Architecture
No ratings yet
Measuring Performance: Chris Clack B261 Systems Architecture
19 pages
Module 3.3 - Problems On Performance
No ratings yet
Module 3.3 - Problems On Performance
54 pages
Week 10 Part 02 - Processor Performance (Answers)
No ratings yet
Week 10 Part 02 - Processor Performance (Answers)
35 pages
3scope & Operators in Arduino
No ratings yet
3scope & Operators in Arduino
12 pages
Lesson 3 - Computing For Performance
No ratings yet
Lesson 3 - Computing For Performance
38 pages
COD Ch. 2 The Role of Performance
No ratings yet
COD Ch. 2 The Role of Performance
28 pages
Cloud Service Management Notes Unit 3
No ratings yet
Cloud Service Management Notes Unit 3
21 pages
CH 02a-Computer Performance
No ratings yet
CH 02a-Computer Performance
22 pages
Complex Computing Problem
No ratings yet
Complex Computing Problem
3 pages
Lect 1
No ratings yet
Lect 1
54 pages
William Stallings Computer Organization and Architecture 8 Edition Computer Evolution and Performance
No ratings yet
William Stallings Computer Organization and Architecture 8 Edition Computer Evolution and Performance
28 pages
ACA Lec2 New
No ratings yet
ACA Lec2 New
44 pages
Chapter 1 Performance
No ratings yet
Chapter 1 Performance
32 pages
COD Ch. 2 The Role of Performance
No ratings yet
COD Ch. 2 The Role of Performance
13 pages
01) Fundamentals of Quantitative Design and Analysis
No ratings yet
01) Fundamentals of Quantitative Design and Analysis
71 pages
Chapter 01 Computer Organization and Design, Fifth Edition: The Hardware/Software Interface (The Morgan Kaufmann Series in Computer Architecture and Design) 5th Edition
83% (6)
Chapter 01 Computer Organization and Design, Fifth Edition: The Hardware/Software Interface (The Morgan Kaufmann Series in Computer Architecture and Design) 5th Edition
49 pages
Computer Organization and Architecture (AT70.01)
No ratings yet
Computer Organization and Architecture (AT70.01)
29 pages
Computer Peripherals: CSE 315: Peripheral & Interfacing
No ratings yet
Computer Peripherals: CSE 315: Peripheral & Interfacing
23 pages
Barcode Readers Product Guide
No ratings yet
Barcode Readers Product Guide
19 pages
EU AI4CSM__D1.5 - Report on requirements and specification definition on communication and connectivity technologies(2022)
No ratings yet
EU AI4CSM__D1.5 - Report on requirements and specification definition on communication and connectivity technologies(2022)
54 pages
Defining Performance
No ratings yet
Defining Performance
6 pages
Lecture Ch4 Performance
No ratings yet
Lecture Ch4 Performance
25 pages
Control Statements and Loops
No ratings yet
Control Statements and Loops
28 pages
Computer Performance
No ratings yet
Computer Performance
27 pages
A Material History of Bits
No ratings yet
A Material History of Bits
32 pages
db2 Plan 11
No ratings yet
db2 Plan 11
4 pages
Introduction To System Analysis and Design (Slides)
100% (4)
Introduction To System Analysis and Design (Slides)
46 pages
Hit-and-Run Tactics Enable Guerrilla Capacity Planning
No ratings yet
Hit-and-Run Tactics Enable Guerrilla Capacity Planning
7 pages
POW03049USEN
No ratings yet
POW03049USEN
74 pages
Big Data Benchmarking 2014
0% (1)
Big Data Benchmarking 2014
164 pages
Studio One 3.5 Reference Manual English Unofficial
No ratings yet
Studio One 3.5 Reference Manual English Unofficial
433 pages
2010 Mechatronics More Questions Than Answers
No ratings yet
2010 Mechatronics More Questions Than Answers
15 pages
Distributed System Models - Workstation Model
No ratings yet
Distributed System Models - Workstation Model
19 pages
EMT1600 Finished
No ratings yet
EMT1600 Finished
67 pages
Mas Osx - Digital Performer 4
No ratings yet
Mas Osx - Digital Performer 4
2 pages
Module2 Intro Google Earth Engine Presentation PDF
No ratings yet
Module2 Intro Google Earth Engine Presentation PDF
31 pages
Foundation Course for Advanced Computer Studies
From Everand
Foundation Course for Advanced Computer Studies
Franck Ismael Djédjé
No ratings yet
Lesson 7: System Performance: Objective
No ratings yet
Lesson 7: System Performance: Objective
2 pages
The Aesthetics of Interactive Music Systems: Robert Rowe
0% (1)
The Aesthetics of Interactive Music Systems: Robert Rowe
5 pages
Competitive Programming in Python 128 Algorithms To Develop Your Coding Skills
100% (8)
Competitive Programming in Python 128 Algorithms To Develop Your Coding Skills
267 pages

Computer Performance

Uploaded by

Computer Performance

Uploaded by

CSE 317 Lecture 2

■ Why is some hardware better than others for different programs?

Boeing 737-100 101 630 598

■ How much faster is the Concorde compared to the

■ So which of these airplanes has the best performance?!

■ If we upgrade a machine with a new processor what do we increase?

X is n times faster than Y means:

■ Clock ticks indicate start and end of cycles:

■ cycle time = time between ticks = seconds per cycle

CPU execution time CPU clock cycles × Clock cycle time

■ So, to improve performance one can either:

■ This assumption is incorrect! Because:

■ Multiplication takes more time than addition

■ What clock rate should we tell the designer to target?

■ Do any of these other variables equal performance?

■ Common pitfall : thinking one of the variables is indicative of

■ Derive the above equation from Performance Equation I

■ Which machine is faster for this program, and by how much?

■ Two different compilers are being tested for a 100 MHz.

■ Which sequence has the higher MIPS rating?

You might also like