Module 3.3 - Problems On Performance
Module 3.3 - Problems On Performance
Clock Rate (CR) (MHz, GHz) is inverse of Clock Cycle (CC) time (clock period)
CC = 1 / CR
The clock cycle time is the amount of time for one clock period to elapse
(e.g. 5 ns).
Question
If a computer has a clock cycle time of 5 ns, What is the clock rate?
CPU execution time is a combination of user CPU time and system CPU time
performanceX = 1 / execution_timeX
Where ICi is the count (percentage) of the number of instructions of class i executed
ICtotal is total number of machine instructions executed
CPIi is the (average) number of clock cycles per instruction for that instruction class
n is the number of instruction classes
F – Frequency
T- Processor Execution Time
Ic- Instruction Count
MIPS : millions of instructions per second
For example, a program that executes 3 million instructions in 2
seconds has a MIPS rating of 1.5
Advantage : Easy to understand and measure
Disadvantages : May not reflect actual performance, since simple
instructions do better.
MFLOPS - millions of floating point operations per
second
MFLOPS : millions of floating point operations per second
Max. Speedup=
Problem
Suppose that we know that 20% of inherently sequential computation in the
problem of interest is made parallel. What is the least number of processors
that we need to use to obtain a speedup of 6.0?
Amdhal’s - Speedup Based Problem
Consider a CPU used in Web servicing. We need to enhance 30% of the processor by increasing
the computation speed 10 times faster on computation process in web service applications.
Speedenhanced =SUf = 10
Speedupoverall = 1/(1-0.3)+(0.3/10)
= 1/0.7+0.03
=1/0.73
͌ 1.369
MODULE 1
Introduction and Overview of Computer Architecture
Problems to be solved
Problem 1
Assume that # of instructions in the program is 1,000,000,000. Suppose
we have two implementations of the same instruction set architecture
(ISA). For some program,
Machine A has a clock cycle time of 10 ns. and a CPI of 2.0
Machine B has a clock cycle time of 20 ns. and a CPI of 1.2
Note: We have the instruction count: 109 instructions. The clock time can be
computed quickly from the clock rate to be 0.5×10-9 seconds. So we only need to to
compute clocks per instruction as an effective value:
Problem 6
Problem 7
We want to compare the computers R1 and R2, which differ that R1 has the machine
instructions for the floating point operations, while R2 has not (FP operations are implemented
in the software using several non-FP instructions). Both computers have a clock frequency of
400 MHz. In both we perform the same program, which has the following mixture of
commands:
a) Calculate the MIPS for the computers R1 and R2.
b) Calculate the CPU program execution time on the computers R1 and R2, if there are 12000
instructions in the program?
Problem 8
The clock of the processor runs at 200 MHz with 4.4 Cycles per
instruction. Compute the MIPS processor speed for the benchmark
in millions of instructions per second?
Problem 9
Suppose that we are considering developing a parallel program to
improve on an existing sequential program and that we determine
that 20% of the execution time of the sequential program is spent
in inherently sequential code. (We have to inspect the code to
determine this.) The remaining code can be parallelized, although
we do not as yet know how many processors would be optimal.
What is the maximum possible speedup that could be obtained if
we were to develop a parallel version that used ten processors?
Problem 10
Suppose that we know that fraction of inherently sequential
computation is 0.12 in the problem of interest. What is the least
number of processors that we need to use to obtain a speedup of
5.0?
Text Book(s) Friday, March 2
4, 2023
Any Queries??
THANK YOU