The Micro Architecture of Intel Pentium 4
The Micro Architecture of Intel Pentium 4
Sudipta Mahapatra
Introduction
The Intel Pentium 4 was introduced in November 2000 targeted at a high clock rate of 1.5 GHz. The Netburst microarchitecture formed the basis for a new family of Intel processors starting from the Pentium 4. Developed with an intention of delivering high level of performance for many important applications such as multimedia.
Overview (Contd.)
Out-of-order speculative execution to enable parallelism Superscalar issue to exploit maximal parallelism
Main Features
Hardware register renaming to avoid register name space limitations (WAW hazards) Cache line sizes of 64 bytes Optimization for the common case of frequently executed instructions Improved branch handling techniques.
Branch-history update
Main sections
1. In order front end (FE) 2. Out-of-order Execution logic (OOE) 3. Integer and Floating-point Execution Units (EX) 4. Memory Subsystem (M)
Front end
From L2 Cache
To Allocator/ Register Renamer [Glenn Hinton et. al., Intel Technology Jn. Q1, 2001]
10
13
14
To execution units
[Glenn Hinton et. al., Intel Technology Jn. Q1, 2001]
15
Execution Units
The execution units include several integer and floating point units for result computation. The execution section also includes the L-1 data cache used for most of the load/store operations.
16
Execution Units
From out-of-order execution logic
From/to memory subsystem [Glenn Hinton et. al., Intel Technology Jn. Q1, 2001]
17
Memory Subsystem
The memory section contains the L2 cache and the system bus. Used to access the main memory when the L2 cache has a cache miss. Also used to access the I/O resources.
18
Memory Subsystem
To ITLB/Prefetcher
From execution units [Glenn Hinton et. al., Intel Technology Jn. Q1, 2001]
19
Pentium 4 pipeline
The P6 microarchitecture (P2, P3, Celeron) has twice the pipeline depth of Pentium processor. The Netburst microarchitecture has almost doubled the depth of pipelining of P6. - It allows for a higher frequency of operation. - Different parts of Pentium 4 operate at different clock frequencies.
20