Computer Architecture and Organization: Dr. Mohd Hanafi Ahmad Hijazi
Computer Architecture and Organization: Dr. Mohd Hanafi Ahmad Hijazi
KT14203
Computer
Architecture
and
Organization
Presented by:
Dr. Mohd Hanafi Ahmad Hijazi
SKTM, UMS
Slides, with minor modifications, taken from
William Stallings Computer Organization and
Architecture, 9th Edition
+
Chapter 2
Computer Evolution and Performance
+
History of Computers
First Generation: Vacuum Tubes
ENIAC
Electronic Numerical Integrator And Computer
Designed and constructed at the University of Pennsylvania
Started in 1943 – completed in 1946
By John Mauchly and John Eckert
Its first task was to perform a series of calculations that were used to
help determine the feasibility of the hydrogen bomb
Major
Memory drawback
consisted
was the need
Occupied of 20
Contained Capable
1500 Decimal accumulators,
more of for manual
Weighed square 140 kW rather each
than 5000 programming
30 feet Power than capable
18,000 additions by setting
tons of consumption binary of
vacuum per switches
floor machine holding
tubes second and
space a
10 digit plugging/
number unplugging
cables
ENIAC
Source: https://www.youtube.com/watch?v=ANRJsigryJw
+
ENIAC
Pictures taken from
http://en.wikipedia.org/wiki/ENIAC
+
John von Neumann
EDVAC (Electronic Discrete Variable Computer)
IAS computer
Princeton Institute for Advanced Studies
Prototype of all subsequent general-purpose computers
Completed in 1952
Structure of von Neumann Machine
+
IAS Memory Formats
Both data and instructions are
The memory of the IAS stored there
consists of 1000 storage
locations (called words) of Numbers are represented in
binary form and each instruction
40 bits each is a binary code
+
Structure
of
IAS
Computer
+ Registers
Memory buffer register • Contains a word to be stored in memory or sent to the I/O unit
(MBR) • Or is used to receive a word from memory or from the I/O unit
Memory address • Specifies the address in memory of the word to be written from
register (MAR) or read into the MBR
Instruction register (IR) • Contains the 8-bit opcode instruction being executed
Accumulator (AC) and • Employed to temporarily hold operands and results of ALU
multiplier quotient (MQ) operations
+
IAS
Operations
+
Table 2.1
The IAS
Instruction
Set
Cheaper
+
Computer Generations
+
Second Generation Computers
Introduced:
Appearance of the Digital
More complex arithmetic
Equipment Corporation (DEC)
and logic units and control
units in 1957
The use of high-level
PDP-1 was DEC’s first
programming languages
computer
Provision of system software
which provided the ability This began the mini-computer
to:
phenomenon that would
load programs become so prominent in the
move data to peripherals third generation
and libraries
perform common
computations
Table 2.3
Example
Members of the
IBM 700/7000 Series
Discrete component
Single, self-contained transistor
Manufactured separately, packaged in their own containers, and
soldered or wired together onto masonite-like circuit boards
Manufacturing process was expensive and cumbersome
Announced in 1964
Increasing
Increasing
number of I/O
speed
ports
Increasing
Increasing cost
memory size
Generations
VLSI
Very Large
Scale
Integration
ULSI
Semiconductor Memory Ultra Large
Microprocessors Scale
Integration
+ Semiconductor Memory
In 1970 Fairchild produced the first relatively capacious semiconductor memory
In 1974 the price per bit of semiconductor memory dropped below the price per bit
of core memory
There has been a continuing and rapid decline in Developments in memory and processor
memory cost accompanied by a corresponding technologies changed the nature of computers in
increase in physical memory density less than a decade
Each generation has provided four times the storage density of the previous generation, accompanied
by declining cost per bit and declining access time
+
Microprocessors
The density of elements on processor chips continued to rise
More and more elements were placed on each chip so that fewer
and fewer chips were needed to construct a single computer
processor
a. 1970s Processors
b. 1980s Processors
Evolution of Intel Microprocessors
c. 1990s Processors
d. Recent Processors
+
The Evolution of the Intel x86
Architecture
Two processor families are the Intel x86 and the ARM
architectures
Pentium Pro
• Continued the move into superscalar organization with aggressive use of register renaming, branch
prediction, data flow analysis, and speculative execution
Pentium II
• Incorporated Intel MMX technology, which is designed specifically to process video, audio, and graphics
data efficiently
Pentium III
•Incorporated additional floating-point instructions
•Streaming SIMD Extensions (SSE)
Pentium 4
• Includes additional floating-point and other enhancements for multimedia
Core
• First Intel x86 micro-core
Core 2
• Extends the Core architecture to 64 bits
• Core 2 Quad provides four cores on a single chip
• More recent Core offerings have up to 10 cores per chip
• An important addition to the architecture was the Advanced Vector Extensions instruction set
Processor Memory
Human Diagnostic
interface port
A/D D/A
conversion Conversion
Actuators/
Sensors
indicators
It is the fourth generation that is usually thought of as the IoT and it is marked
by the use of billions of embedded devices
© 2016 Pearson Education, Inc., Hoboken, NJ. All rights reserved.
+
Embedded Application Processors
Operating versus
Systems Dedicated Processors
Is not programmable once the program logic for the device has been
burned into ROM
Cortex-M
• Cortex-M0
Cortex-R • Cortex-M0+
• Cortex-M3
Cortex- • Cortex-M4
A/Cortex-
A50
Peripheral bus
32-bit bus
Voltage Voltage High fre- High freq Flash SRAM Debug DMA
regula- compar- quency RC crystal memory memory inter- control-
tor ator oscillator oscillator 64 kB 64 kB face ler
Microcontroller Chip
ICode SRAM &
interface peripheral I/F
Bus matrix
Debug logic
Memory
DAP protection unit
ARM
NVIC core ETM
Cortex-M3 Core
NVIC ETM Cortex-M3
interface interface
Processor
32-bit ALU
Hardware 32-bit
divider multiplier
Control Thumb
logic decode
Instruction Data
interface interface
Cloud Storage
Subset of cloud computing
Today’s laptops have the computing power of an IBM mainframe from 10 or 15 years ago
Branch
•Processor looks ahead in the
instruction code fetched from memory
and predicts which branches, or
prediction groups of instructions, are likely to be
processed next
Speculative
• Using branch prediction and data flow analysis,
some processors speculatively execute
instructions ahead of their actual appearance in
execution
the program execution, holding the results in
temporary locations, keeping execution
engines as busy as possible
+
Performance
Balance
Adjust the organization and Increase the number
of bits that are
architecture to compensate retrieved at one time
by making DRAMs
for the mismatch among the “wider” rather than
“deeper” and by
capabilities of the various using wide bus data
paths
components
Reduce the
Architectural examples frequency of memory
access by
include: incorporating
increasingly
complex and
efficient cache
structures between
the processor and
main memory
Increase the
Change the DRAM interconnect
interface to make it bandwidth between
more efficient by processors and
including a cache or memory by using
other buffering higher speed buses
scheme on the DRAM and a hierarchy of
chip buses to buffer and
structure data flow
Typical I/O Device Data Rates
+
Improvements in Chip
Organization and Architecture
Increase hardware speed of processor
Fundamentally due to shrinking logic gate size
More gates, packed more tightly, increasing clock rate
Propagation time for signals reduced
𝐼𝑐 𝑓
𝑀𝐼𝑃𝑆 𝑟𝑎𝑡𝑒 = =
𝑇×106 𝐶𝑃𝐼×106
SPEC
An industry consortium
Defines and maintains the best known collection of benchmark
suites
Performance measurements are widely used for comparison and
research purposes
+ Best known SPEC benchmark suite
Law
in the development of multi-core
machines
Software must be adapted to a highly
parallel execution environment to
exploit the power of parallel
processing
(1 – f)T fT
N
1
1 f 1 T
N