0% found this document useful (0 votes)

22 views

2_Computer Organization and Architecture

Uploaded by

diptondey

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views

2_Computer Organization and Architecture

Uploaded by

diptondey

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 21

Computer Organization and

Architecture
Assessing and Understanding
Performance
Introduction

 Response time/execution time : The time between the start and

completion of a task. (try to minimize)
 Throughput : The total amount of work done in a given time. (try to
increase)
 =
 = =n
Continue…
 If computer X runs a program in 10 seconds and computer Y
runs the same program in 15 seconds, how much faster is x than
y?
Solution: we know, = = n
Given, execution time (x)=10 seconds
execution time (y) = 15 seconds

So the performance ratio is, = n , that is = 1.5

That means, = 1.5
so, = 1.5 x
So, X is 1.5 times faster than Y.
CPU Performance and its Factors

For a program:
 CPU execution time= CPU clock cycles X Clock cycle time …..(1)
 Clock cycle time= …..(2)
by putting the value from (2) into (1) we get,
 CPU execution time=
 CPU clock cycle= Instructions for a program x Avg clock cycles per instruction
 CPU time= Instruction count x CPI x Clock cycle time
Or , CPU time=

CPI=
 CPI = [ CPU clock cycle= CPU time x clock rate ]
Improving performance

 Our favorite program runs in 10 seconds on computer A, which

has a 4 GHz clock. We are trying to help a computer designer
build a computer B, that will run this program in 6 seconds. The
designer has determined that a substantial increase in the
clockrate is possible, but this increase will affect the rest of the
CPU design, causing computer B to require 1.2 times as many
clock cycles as computer A for this program. What clock rate
should we tell the designer to target?
Solution: given,
CPU time(A)=10 seconds ; Clock rate(A) = 4 GHz = 4 x
CPU time(B)=6 seconds ; Clock rate(B) = ? (have to determine)
Continue…
 For program on computer A:
CPU time(A)=
or, 10 seconds =
So, = 10 seconds x 4 x = 40 x cycles
 For program on computer B:
 CPU time(B)=
CPU time(B)=
or, 6 seconds =
so, = 8 x = 8 GHz
 computer B must therefore have twice the clock rate of A to run the
program in 6 seconds.
Using the performance equation

 Suppose we have two implementations of the same instruction set

architecture. Computer A has a clock cycle time of 250 ps and a CPI
of 2.0 for some program and computer B has a clock cycle time of
500 ps and a CPI of 1.2 for the same program. Which computer is
faster for his program and by how much?
Solution: We know that each computer executes the same number of
instructions for the program. Let’s call this number k .
So, CPU clock cycles(A) = k x 2.0
CPU clock cycles(B) = k x 1.2
So, CPU execution time(A)= CPU clock cycles(A) x Clock cycle time(A)
=k x 2.0 x 250 ps = 500 x
k ps
Continue…

CPU execution time(B)= CPU clock cycles(B) x Clock cycle time(B)

= k x 1.2 x 500 ps =600 x k ps
In between 500 x k and 600 x k , it is clear to us that 600 x k is greater. So,
computer A is faster as its CPU execution time is smaller.

= = = 1.2
That means , Computer A is 1.2 times faster than Computer B.
Comparing code segment
 A compiler designer is trying to decedide between two code sequences
for a particular computer. The hard ware designers have supplied the
following facts:
CPI for this instruction class
A B C
CPI 1 2 3

For a particular high level language statement, the compiler writter Is

considering two code sequences that require the following instruction
counts.
Code Instruction counts for instruction class
Sequence A B C

1 2 1 2
2 4 1 1

i. Whish code sequences executes the most instructions? Ii. Which will be
faster? Iii. What is the CPI for each sequence?
Continue…
 Solution for i. : Sequence 1 executes 2+1+2 = 5 instructions.
Sequence 2 executes 4+1+1 = 6
instructions.
So, sequence 2 executes the most instructions.
 Solution for ii. : we know that, CPU clock cycle =
This yields, CPU clock cycles(1) = (2x1)+(1x2)+(2x3)=2+2+6=10 cycles
CPU clock cycles(2) = (4x1)+(1x2)+(1x3)=4+2+3=9 cycles
So code sequence 2 is faster, even though it actually executes one extra
instruction.
 Solution for iii. : CPI =
CPI (1)= = = 2
CPI (2)= = = 1.5
Evaluating performance

 The set of programs run would form a workload.

 Compare execution time of the same workload.
 A set of benchmarks programs specifically chosen to measure
performance.
 By using benchmarks programs determine and compare response time
or throughput.(not this much straight forward)
 There are some confusion.
 Different program instructions takes different execution time in different
computer(see the performance table in the next slide).
Continue…
Computer A Computer B
Program 1 (seconds) 1 10
Program 2 (seconds) 1000 100
Total time (seconds) 1001 110

 A is 10 times faster than B for program 1.

 B is 10 times faster than A for program 2.
 As the both statements are true, so it is confusing to understand the
better one between computer A and B in this way.
 So, for better understanding we have to use total execution time.
Total execution time: A consistent
summary measure
 The simplest approach:

= = = 9.1
So , = 9.1 x
 That is B is 9.1 time faster than A for programs 1 and 2 together.

 The average of the execution times that is directly proportional

to total execution time is the arithmetic mean(AM).
AM =
Performance, Power and Energy
Efficiency
 Power is increasingly becoming the key limitation and critical factor in
processor performance and also has impact on costing. (Laptop)
 To save power , techniques ranging from putting parts of the computer to
sleep, to reducing the clock rate and voltage have all been used, but not
good enough.
 CMOS technology is used to reduce power by reducing frequency but this
also causes the reduction of performance.

Please for more detail goto to text book :

Computer organization and design (3rd edition) – Patterson, Hennessy
Page no: 263,264,265
Fallacies and Pitfalls

 Pitfall: Expecting the improvement of one aspect of a computer to

increase performance by an amount proportional to the size of
improvement.
 Amdahl’s law: A rule stating that the performance enhancement possible
with a given improvement is limited by the amount that the improved
feature is used.
 The law is :
Execution time after improvement
= ( + Execution time unaffected)
Continue…
 Suppose a program runs in 100 seconds on a computer, with multiply
operations responsible for 80 seconds of this time. How much do you
have to improve the speed of multiplication if you want your program
to run five times faster?
Solution: By putting the values using Amdahl’s law, we get:
Execution time after improvement
= ( + (100 - 80)seconds)…..(i)
As the execution time is 100 seconds and now we want 5 times faster , so it
will become 20 seconds. Put this value in (i),
20 seconds = + 20 seconds
Or, 0 = ….(ii)
(ii) Indicates that there is no amount by which we can enhance multiply to
achieve a fivefold increase in performance., if multiply accounts for only
80% of the workload.
MIPS

 One alternative to time as the metric is MIPS.

 MIPS: million instructions per second.
 For a given program , MIPS is simply:

MIPS =
MIPS as a performance measure
CPI for this instruction class
A B C
CPI 1 2 3

Code from Instruction counts(in billions) for each

instruction class
A B C
Compiler 1 5 1 1
Compiler 2 10 1 1

 Assume that the computer’s clock rate is 4 GHz. Which code sequence
will execute faster according to MIPS? According to execution time?
Continue…

Solution: we know that, Execution time =

CPU clock cycle = …..(ii)
By using (ii)—
CPU clock cycle(1) = (5x1+1x2+1x3)x =10 x
CPU clock cycle(2) = (10x1+1x2+1x3)x =15 x
Now by using (i)---
Execution time(1) = = 2.5 seconds
Execution time(2) = = 3.75 seconds
So, compiler 1 generates the faster program, according to execution time.
Continue…

 Let’s compute the MIPS rate for each version of the program,using the
following equation:

MIPS =

MIPS (1) = = 2800

MIPS (2) = = 3200
So, the code from compiler 2 has a higher MIPS rating, but the code from
the compiler 1 runs faster.
End of slide
Thank you
Any question?

MP Assignment 1
No ratings yet
MP Assignment 1
9 pages
Cse - 321 - 2
No ratings yet
Cse - 321 - 2
37 pages
Lecture # 2
No ratings yet
Lecture # 2
33 pages
2 CPU Performance
No ratings yet
2 CPU Performance
35 pages
Module 3.3 - Problems On Performance
No ratings yet
Module 3.3 - Problems On Performance
54 pages
Lecture Ch4 Performance
No ratings yet
Lecture Ch4 Performance
25 pages
Performance Measures For Computers
No ratings yet
Performance Measures For Computers
53 pages
The Role of Performance: Chapter - 2
No ratings yet
The Role of Performance: Chapter - 2
40 pages
Unit 2 Performance
No ratings yet
Unit 2 Performance
6 pages
Week 10 Part 02 - Processor Performance (Answers)
No ratings yet
Week 10 Part 02 - Processor Performance (Answers)
35 pages
Chapter 1 Performance
No ratings yet
Chapter 1 Performance
32 pages
Chapter 2 A: Performance
No ratings yet
Chapter 2 A: Performance
33 pages
Computer Organization The Role of Performance
No ratings yet
Computer Organization The Role of Performance
45 pages
C A Lecture-3
No ratings yet
C A Lecture-3
41 pages
Week 2 - Lecture 2 - Performance Measurement
No ratings yet
Week 2 - Lecture 2 - Performance Measurement
25 pages
Measuring Computer Performance
No ratings yet
Measuring Computer Performance
26 pages
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
No ratings yet
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
52 pages
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
No ratings yet
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
56 pages
Computer Performance
No ratings yet
Computer Performance
22 pages
Performance
No ratings yet
Performance
51 pages
2. Performance
No ratings yet
2. Performance
23 pages
Module 2 [26-10-2024]
No ratings yet
Module 2 [26-10-2024]
50 pages
Puter Performance
No ratings yet
Puter Performance
15 pages
Computer Performance
No ratings yet
Computer Performance
17 pages
CH 02a-Computer Performance
No ratings yet
CH 02a-Computer Performance
22 pages
Lec 3
No ratings yet
Lec 3
21 pages
William Stallings Computer Organization and Architecture 8 Edition Computer Evolution and Performance
No ratings yet
William Stallings Computer Organization and Architecture 8 Edition Computer Evolution and Performance
28 pages
Lecture 02 CH01 Performance Power
No ratings yet
Lecture 02 CH01 Performance Power
76 pages
4 Perfrmance
No ratings yet
4 Perfrmance
30 pages
Computer Architecture Measurement
No ratings yet
Computer Architecture Measurement
26 pages
Computer Architecture Measuring Performance
No ratings yet
Computer Architecture Measuring Performance
33 pages
Lesson 3 - Computing For Performance
No ratings yet
Lesson 3 - Computing For Performance
38 pages
Comp Org Notes On Measuring Cpu Performance
No ratings yet
Comp Org Notes On Measuring Cpu Performance
4 pages
4 Performance
No ratings yet
4 Performance
27 pages
Performance of Processor1
No ratings yet
Performance of Processor1
9 pages
COD Ch. 2 The Role of Performance
No ratings yet
COD Ch. 2 The Role of Performance
13 pages
Computer Architecture 2
No ratings yet
Computer Architecture 2
17 pages
09 Perf
No ratings yet
09 Perf
22 pages
SEN307 Lecture 8
No ratings yet
SEN307 Lecture 8
16 pages
Performance
No ratings yet
Performance
4 pages
Lecture - 4 - Performance
No ratings yet
Lecture - 4 - Performance
31 pages
SEN307-Lecture-5
No ratings yet
SEN307-Lecture-5
34 pages
Defining Performance
No ratings yet
Defining Performance
6 pages
COD Ch. 2 The Role of Performance
No ratings yet
COD Ch. 2 The Role of Performance
28 pages
02 Performance
No ratings yet
02 Performance
13 pages
Chapter 1 Introduction
No ratings yet
Chapter 1 Introduction
17 pages
M116C 1 M116C 1 Lect02-Performance
No ratings yet
M116C 1 M116C 1 Lect02-Performance
23 pages
Lecture-4
No ratings yet
Lecture-4
37 pages
Computer Organization and Architecture (AT70.01)
No ratings yet
Computer Organization and Architecture (AT70.01)
29 pages
L14 Introduction To Performance Evaluation
No ratings yet
L14 Introduction To Performance Evaluation
48 pages
L-2 (Computer Performance)
No ratings yet
L-2 (Computer Performance)
47 pages
CSE 332 L4 - 14 Nov 2020
No ratings yet
CSE 332 L4 - 14 Nov 2020
41 pages
Assessing and Understanding Performance
No ratings yet
Assessing and Understanding Performance
31 pages
Chapter 01
No ratings yet
Chapter 01
20 pages
02 Performance
No ratings yet
02 Performance
23 pages
Lecture4 Performance Evaluation 2011
No ratings yet
Lecture4 Performance Evaluation 2011
34 pages
COMP 303 Computer Architecture
No ratings yet
COMP 303 Computer Architecture
34 pages
Cs23402- Computer Architecture - Unit - 1 (4)
No ratings yet
Cs23402- Computer Architecture - Unit - 1 (4)
161 pages
2024 Lecture3 Come321
No ratings yet
2024 Lecture3 Come321
23 pages
Week 10 Part 02 - Processor Performance (Q Only) - Tagged 2
No ratings yet
Week 10 Part 02 - Processor Performance (Q Only) - Tagged 2
23 pages
Foundation Course for Advanced Computer Studies
From Everand
Foundation Course for Advanced Computer Studies
Franck Ismael Djédjé
No ratings yet
Parallel Computer Models: CSE7002: Advanced Computer Architecture
No ratings yet
Parallel Computer Models: CSE7002: Advanced Computer Architecture
37 pages
CompOrg 5thed HW1
No ratings yet
CompOrg 5thed HW1
2 pages
hw1 11 12 13 16 31 12 33 PDF
No ratings yet
hw1 11 12 13 16 31 12 33 PDF
7 pages
Comparc Cpo203
No ratings yet
Comparc Cpo203
39 pages
TUT2
No ratings yet
TUT2
3 pages
Model Answers - HW1
No ratings yet
Model Answers - HW1
6 pages
Computer Component Performance-Nguyễn Hoàng Long - BI11-157
100% (1)
Computer Component Performance-Nguyễn Hoàng Long - BI11-157
9 pages
CSCI 8150 Advanced Computer Architecture
No ratings yet
CSCI 8150 Advanced Computer Architecture
26 pages
The Future of The Mainframe: Gary Barnett
No ratings yet
The Future of The Mainframe: Gary Barnett
19 pages
Performances of Computer Systems: CSE 675.02: Introduction To Computer Architecture
No ratings yet
Performances of Computer Systems: CSE 675.02: Introduction To Computer Architecture
52 pages
Computer Abstractions and Technology
No ratings yet
Computer Abstractions and Technology
48 pages
Computer Architecture and Organization Ch#2 Examples
No ratings yet
Computer Architecture and Organization Ch#2 Examples
6 pages
Interfacing A Multiplexed Seven Segment Display With The 8086 Microprocessor
No ratings yet
Interfacing A Multiplexed Seven Segment Display With The 8086 Microprocessor
9 pages
Solution
No ratings yet
Solution
14 pages
12 CPUPerformance
No ratings yet
12 CPUPerformance
26 pages
Performance of Computers: Factors Affecting Computer Performance
No ratings yet
Performance of Computers: Factors Affecting Computer Performance
4 pages
CSC 505 Performance 1
No ratings yet
CSC 505 Performance 1
111 pages
William Stallings Computer Organization and Architecture 8 Edition Computer Evolution and Performance
No ratings yet
William Stallings Computer Organization and Architecture 8 Edition Computer Evolution and Performance
50 pages
Types of Computers
No ratings yet
Types of Computers
10 pages
Computer Architecture Unit 1 - Phase 2 PDF
No ratings yet
Computer Architecture Unit 1 - Phase 2 PDF
26 pages
PDC Week 2 (Performance Metrice, Amdahl's Law)
No ratings yet
PDC Week 2 (Performance Metrice, Amdahl's Law)
18 pages
Key Characteristics of an RTOS
No ratings yet
Key Characteristics of an RTOS
15 pages
Computer Vs Human Brain: An Analytical Approach and Overview
100% (1)
Computer Vs Human Brain: An Analytical Approach and Overview
5 pages
5-Stage Pipeline CPU Hardware
No ratings yet
5-Stage Pipeline CPU Hardware
33 pages
COA Midterm
No ratings yet
COA Midterm
13 pages
CSC 306 22_22 PAST QUESTIONS AND ANSWERS
No ratings yet
CSC 306 22_22 PAST QUESTIONS AND ANSWERS
4 pages
It A Level CHPT 2 Hardware and Software
No ratings yet
It A Level CHPT 2 Hardware and Software
84 pages
Module-1: Metrics and Measures
No ratings yet
Module-1: Metrics and Measures
47 pages

2_Computer Organization and Architecture

Uploaded by

2_Computer Organization and Architecture

Uploaded by

Computer Organization and

 Response time/execution time : The time between the start and

So the performance ratio is, = n , that is = 1.5

 Our favorite program runs in 10 seconds on computer A, which

 Suppose we have two implementations of the same instruction set

CPU execution time(B)= CPU clock cycles(B) x Clock cycle time(B)

For a particular high level language statement, the compiler writter Is

 The set of programs run would form a workload.

 A is 10 times faster than B for program 1.

 The average of the execution times that is directly proportional

Please for more detail goto to text book :

 Pitfall: Expecting the improvement of one aspect of a computer to

 One alternative to time as the metric is MIPS.

Code from Instruction counts(in billions) for each

Solution: we know that, Execution time =

MIPS (1) = = 2800

You might also like