CSE 305
Computer Architecture
Introduction
Prepared by
Madhusudan Basak
Assistant Professor
CSE, BUET
* Some modifications made by Saem Hasan
Why Computer Architecture?
To apply the Architectural sense to a computer
To apply a computer to Architectural design
To know about the basic Architecture of a computer
Why Computer Architecture?
Purpose
How hardware (processors, memories, disk drives, network infrastructure) plus
software (operating systems, compilers, libraries, network protocols) combine to
support the execution of application programs
How you as a programmer can best use these resources
Why Computer Architecture?
What to know?
What the computer does: the logical view, i.e., the Instruction Set Architecture (ISA)
How it does it: the physical view, i.e., the Computer Organization
Computer Architecture = Instruction Set Architecture + Computer Organization
Instruction Set Architecture
Instruction set architecture comprises the attributes of a computing system as seen by
the assembly-language programmer or compiler writer.
Instruction Set (what operations can be performed?)
Instruction Format (how are instructions specified?)
Data storage (where is data located?)
Addressing Modes (how is data accessed?)
Exceptional Conditions (what happens if something goes wrong?)
Machine Organization
Machine organization is the view of the computer that is seen by the logic
designer. This includes
Capabilities & performance characteristics of functional units (e.g., registers,
ALU, shifters, etc.)
Ways in which these components are interconnected
How information flows between components
Logic and means by which such information flow is controlled
Coordination of functional units
Components of a Computer
Control
• Gives directions to the other components
• e.g., bus controller, memory interface unit
Datapath
• Performs arithmetic and logic operations
• e.g., adders, multipliers, shifters
Memory
• Holds data and instructions
• e.g., cache, main memory, disk
Input
• Sends data to the computer
• e.g., keyboard, mouse
Output
• Gets data from the computer
• e.g., screen, sound card
Information in a computer -- Instructions
THREE TYPES OF COMMANDS in Instructions
Instructions specify commands to
Transfer information within a computer
• e.g., from memory to ALU
Transfer of information between the computer and I/O devices
• e.g., from keyboard to computer, or computer to printer
Perform arithmetic and logical operations
• e.g., add two numbers, perform a logical AND
Transfer within the computer:
LOAD R1, 0x1000: Load data from memory address 0x1000 into register R1.
Transfer between the computer and I/O:
INPUT R2: Read a value from the keyboard into register R2.
OUTPUT R3: Send the value in register R3 to the printer.
Arithmetic and logic:
ADD R4, R1, R2: Add the contents of registers R1 and R2, and store the result in R4.
AND R5, R1, R2: Perform a logical AND operation on R1 and R2, storing the result in R5.
Information in a computer -- Instructions
A sequence of instructions to perform a task is called a program, which is
stored in the memory.
The processor fetches instructions from memory and performs the
operations stated in those instructions.
What do the instructions operate upon?
Information in a computer -- Data
Data are the “operands” upon which instructions operate.
Data could be:
Numbers,
Encoded characters.
Data, in a broad sense, means any digital information.
Computers use data that is encoded as a string of binary digits called bits.
Classes of Computers
Desktop / Notebook Computers
Range from low-end systems to high-performance workstations
Subject to cost/performance tradeoff
Server Computers
Network based
High capacity, performance, reliability
Range from small servers to building sized
Embedded Computers
Hidden as components of systems
Minimize memory and power. Often not programmable
Eight Great Ideas in Computer Architecture
Design for Moore’s Law
Use Abstraction to Simplify Design
Make the Common Case Fast
Performance via Parallelism
Performance via Pipelining
Performance via Prediction
Hierarchy of Memories
Dependability via Redundancy
Design for Moore’s Law
Proposed by Gordon Moore (co-founder of Intel) in 1965
Moore's law is the observation that the number of transistors in a dense
integrated circuit (IC) doubles about every two years.
Exponential Growth:
This doubling means that computing power (or performance) and the complexity of electronic systems grow exponentially over time.
Effect on Costs:
Moore's Law also predicted a decline in cost-per-transistor. As manufacturing
scales improve, the cost of producing additional transistors decreases.
Examples:
1971 Intel 4004 Microprocessor: contained 2,300 transistors.
1989 Intel 80486 Microprocessor: contained more than 1 million transistors.
2024 Modern Processors: contain billions of transistors (e.g., Apple's M2 processor has over 20 billion transistors).
Use Abstraction to Simplify Design
A computing system maintains a hierarchical structure
Lower-level details are hidden from the higher levels
A higher level only gets the abstract view
Both Hardware and Software consist of hierarchical layers using abstraction
Make the Common Case Fast
More efficiency in the common case means more impact on the overall design.
Performance via Parallelism
Current multiprocessor systems exploit parallelism.
Parallelism enhances throughput and computational power but introduces challenges like synchronization and complexity.
Often needs special care for coordination.
Independent operations can run in parallel (sketched below):
x = a + b
y = c * d
Dependent operations cannot, since y needs the result x:
x = a + b
y = x * d
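A minimal sketch of this idea in Python (hypothetical values; it assumes only the standard library's concurrent.futures module): the independent pair can be submitted to run concurrently, while the dependent pair must run in order.

from concurrent.futures import ThreadPoolExecutor

a, b, c, d = 1, 2, 3, 4

# Independent operations: x = a + b and y = c * d may execute at the same time.
with ThreadPoolExecutor(max_workers=2) as pool:
    future_x = pool.submit(lambda: a + b)
    future_y = pool.submit(lambda: c * d)
    x, y = future_x.result(), future_y.result()
print(x, y)   # 3 12

# Dependent operations: y = x * d cannot start before x is known,
# so this pair offers no parallelism.
x = a + b
y = x * d
print(x, y)   # 3 12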
Performance via Pipelining
Pipelining
A special case of parallelism
Performing multiple non-dependent operations at the same time
(Figure: operations A, B, and C executed in overlapping pipeline stages)
Performance via Prediction
Perform an operation based on a prediction/assumption rather than waiting for the actual result
Applicable when recovering from a wrong prediction is not too costly
Hierarchy of Memories
We want faster and cheaper memory
Faster memory is costlier
Cheaper memory is slower
Trade-off is memory hierarchy
Dependability via Redundancy
Redundancy means keeping multiple copies
One fails, another exists => Dependable
Hierarchical Structure of Program Execution
Simplified view including Hardware
Application software (e.g., MS Word, PowerPoint)
System software (e.g., Compiler, Operating System)
Hardware (e.g., CPU, HDD, RAM)
Software Abstraction
Hierarchical structure for a program execution
High-level language: A + B
Assembly language: Add A, B
Machine language: 1000110010100000
Hardware Abstraction: Memory
Memory Hierarchy (speed and cost increase toward the top):
CPU Registers (volatile)
Cache Memory (SRAMs, volatile)
Main Memory (DRAMs, volatile)
Magnetic Disks (non-volatile)
Optical Disks (non-volatile)
Magnetic Tapes (non-volatile)
Simplified overall abstraction
Application Software
Compiler, Assembler, Linker, Loader translate it into Executable Programs
Operating System runs executable programs as Processes and provides abstractions such as Virtual Memory and Files
Instruction Set Architecture: the interface between software and hardware
Hardware: Processor, Main Memory, I/O Devices
Performance
What is the metric of the performance of a computing system?
Depends on the purpose
Two commonly used metrics are:
Execution or Response Time
• How long it takes to do a task
Throughput
• Total work done per unit time
– e.g., tasks/transactions/… per hour
Example
Do the following changes to a computer system increase throughput, decrease
response time, or both?
Replacing the processor in a computer with a faster version
• Response time decreases or improves
• Decreasing response time generally increases throughput
Adding additional processors to a system that uses multiple processors for separate
tasks—for example, searching the web
• Throughput increases
• Response time depends on scenario
– Generally no impact on response time
– But in case tasks were waiting in the queue previously, response time will decrease after the change
Relative Performance
We shall focus on Response or Execution time
“X is n times faster than Y” means
Performance_X / Performance_Y = Execution time_Y / Execution time_X = n
Example: Time taken to run a program
A: 10s, B: 15s
Performance_A / Performance_B = Execution time_B / Execution time_A = 15s / 10s = 1.5
So, A is 1.5 times faster than B
Measuring Execution Time
Elapsed time measures the total time taken for the program to run, from start to finish.
Counts everything (disk and memory accesses, I/O, operating system overhead
etc.)
A useful number, but often not good for comparison purposes
• Time sharing among multiple programs
CPU time
Doesn’t count I/O or time spent in running other programs
Can be broken into system CPU time and user CPU time
Our focus: user CPU time
CPU time spent in executing the lines of code that are “in” our program
User CPU Time: Time the CPU spends executing the user code in your program.
System CPU Time: Time the CPU spends on system calls, such as file I/O or process
management, on behalf of your program.
Clock Cycles
From a computer's perspective, time is not continuous but discrete
Activities are performed at discrete clock ticks
(Figure: clock signal showing cycle time along the time axis)
cycle time = time between ticks = seconds per cycle
clock rate (frequency) = cycles per second (1 Hz = 1 cycle/sec)
A 2 GHz clock has a 1 / (2×10^9) s = 0.5 nanosecond (ns) cycle time
So, for a program:
Clock Cycles
For a program:
CPU Time = CPU Clock Cycles × Clock Cycle Time = CPU Clock Cycles / Clock Rate
Performance improvement means
Decreasing the number of clock cycles
Increasing the clock rate
Hardware designers often trade off clock rate against cycle count
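As an illustration, a minimal Python sketch applying the formula above (the cycle count and clock rate are hypothetical, though they match Computer A in the next example):

# CPU time = CPU clock cycles / clock rate
clock_cycles = 20e9   # 20 x 10^9 clock cycles (hypothetical program)
clock_rate = 2e9      # 2 GHz clock
cpu_time = clock_cycles / clock_rate
print(cpu_time)       # 10.0 seconds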
CPU Time Example
Computer A: 2GHz clock, 10s CPU time
Designing Computer B
Aim for 6s CPU time
Can use a faster clock, but it causes 1.2× as many clock cycles
How fast must Computer B clock be?
Clock Rate_B = Clock Cycles_B / CPU Time_B = (1.2 × Clock Cycles_A) / 6s
Clock Cycles_A = CPU Time_A × Clock Rate_A = 10s × 2GHz = 20 × 10^9
Clock Rate_B = (1.2 × 20 × 10^9) / 6s = (24 × 10^9) / 6s = 4GHz
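A quick numeric check of this result in Python, using only the numbers from the example:

clock_rate_a = 2e9                          # Computer A: 2 GHz
cpu_time_a = 10.0                           # Computer A: 10 s
clock_cycles_a = cpu_time_a * clock_rate_a  # 20 x 10^9 cycles
clock_cycles_b = 1.2 * clock_cycles_a       # 24 x 10^9 cycles on Computer B
clock_rate_b = clock_cycles_b / 6.0         # rate required for a 6 s CPU time
print(clock_rate_b / 1e9)                   # 4.0 (GHz)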
Instructions vs Cycles
Is the number of cycles identical to the number of instructions?
No!
Why?
Different operations take different amounts of time (numbers of cycles)
Multiplication takes longer than addition
Floating point operations take longer than integer operations
The access time to a register is much shorter than to a memory location
Instruction Count and CPI
Clock Cycles = Instruction Count × Cycles per Instruction (CPI)
CPU Time = Instruction Count × CPI × Clock Cycle Time = (Instruction Count × CPI) / Clock Rate
Instruction Count for a program
Determined by program, ISA and compiler
CPI is an average since the number of cycles per instruction varies from
instruction to instruction
CPI varies by application, as well as among implementations of the same
instruction set, depending on:
Number of cycles for each instruction
Frequency of instructions (instruction mix)
Memory access time
CPI Example
Computer A: Cycle Time = 250ps, CPI = 2.0
Computer B: Cycle Time = 500ps, CPI = 1.2
Same ISA
Which is faster, and by how much?
CPU Time_A = Instruction Count × CPI_A × Cycle Time_A = I × 2.0 × 250ps = I × 500ps   ← A is faster…
CPU Time_B = Instruction Count × CPI_B × Cycle Time_B = I × 1.2 × 500ps = I × 600ps
CPU Time_B / CPU Time_A = (I × 600ps) / (I × 500ps) = 1.2   ← …by this much
i.e., the program execution time on B is 1.2 times that on A
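A quick check of the same calculation in Python (the instruction count I is arbitrary because it cancels in the ratio):

I = 1_000_000                     # arbitrary instruction count
cpu_time_a = I * 2.0 * 250e-12    # seconds on Computer A
cpu_time_b = I * 1.2 * 500e-12    # seconds on Computer B
print(cpu_time_b / cpu_time_a)    # 1.2 -> A is 1.2 times faster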
CPI Example
Alternative compiled code sequences using instructions in classes A, B, C
Class             A   B   C
CPI for class     1   2   3
IC in sequence 1  2   1   2
IC in sequence 2  4   1   1

Sequence 1: IC = 5
Clock Cycles = 2×1 + 1×2 + 2×3 = 10
Avg. CPI = 10/5 = 2.0

Sequence 2: IC = 6
Clock Cycles = 4×1 + 1×2 + 1×3 = 9
Avg. CPI = 9/6 = 1.5
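The same calculation expressed as a small Python sketch (class CPIs and instruction counts taken from the table above):

cpi = {"A": 1, "B": 2, "C": 3}    # cycles per instruction for each class

def avg_cpi(mix):
    # Average CPI = total clock cycles / total instruction count
    cycles = sum(cpi[c] * n for c, n in mix.items())
    return cycles / sum(mix.values())

print(avg_cpi({"A": 2, "B": 1, "C": 2}))   # Sequence 1 -> 2.0
print(avg_cpi({"A": 4, "B": 1, "C": 1}))   # Sequence 2 -> 1.5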
Performance Summary
The BIG Picture
Performance depends on
Algorithm
Programming language
Compiler
Instruction set architecture
Tradeoffs
Instruction count, CPI, and clock cycle time present tradeoffs
RISC – reduced instruction set computer (MIPS)
• Simple instructions
• Higher instruction counts for an application
• Lower CPI
CISC – complex instruction set computer (IA-32)
• More complex instructions
• Lower instruction counts for an application
• Higher CPI
Comparing Computing Systems
Comparing systems requires comparing the execution time of the same workload on each
Benchmarks can also help to evaluate and measure performance
The 12 benchmarks of SPEC CINT2006 are given in the next slide
The SPECratio can be used to measure performance
The geometric mean of the SPECratios across the benchmarks gives a single summary figure, as sketched below
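A minimal sketch of the summary step in Python (the SPECratio values here are made up purely for illustration):

import math

spec_ratios = [15.2, 9.8, 21.4, 12.1]   # hypothetical SPECratios (reference time / measured time)
# Geometric mean: the n-th root of the product of the n ratios
geo_mean = math.prod(spec_ratios) ** (1 / len(spec_ratios))
print(geo_mean)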
Benchmarks
Fallacies and Pitfalls
Fallacy
Commonly held misconceptions
Pitfall
A hidden or unsuspected danger or difficulty
Easily made mistakes
Fallacies
Fallacy 1
Computers at low utilization use little power
Fallacy 2
Designing for performance and designing for energy efficiency are
unrelated goals
Pitfalls
Pitfall 1
Expecting the improvement of one aspect of a computer to
increase overall performance by an amount proportional to the
size of the improvement.
Pitfall 2
Using a subset of the performance equation as a
performance metric.
Amdahl’s Law
Execution time after improvement = (Execution time affected by improvement / Amount of improvement) + Execution time unaffected
Example
Suppose a program runs in 100 seconds on a computer, with multiply
operations responsible for 80 seconds of this time. How much do I have to
improve the speed of multiplication if I want my program to run five times
faster?
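A worked solution following Amdahl’s Law (the well-known conclusion is that the target cannot be met):
Target time = 100 s / 5 = 20 s. Only the 80 s spent on multiplication can be improved; the remaining 20 s is unaffected.
20 s = 80 s / n + 20 s  =>  80 s / n = 0
No finite improvement factor n satisfies this, so making the program run five times faster by improving multiplication alone is impossible.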
Acknowledgements
These slides contain material developed and copyright by:
Krste Asanovic (UCB), James Hoe (CMU), Li-Shiuan Peh (MIT), Sudhakar
Yalamanchili (GATECH), and Amirali Baniasadi (UVIC) in part of their respective
courses
Lecture slides by Dr. Tanzima Hashem, Professor, CSE, BUET
Lecture slides by Ms. Mehnaz Tabassum Mahin, Assistant Professor, CSE, BUET
Thank You