The Micro Architecture of Intel Pentium 4

Netburst Microarchitecture formed the basis for a new family of Intel processors starting from the Pentium 4. Uses a deeply pipelined architecture to ensure a high clock rate. Uses high speed execution engine to reduce the latency of basic integer instructions.

Uploaded by

Rekha Govindaraj

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

287 views20 pages

The Micro Architecture of Intel Pentium 4

Uploaded by

Rekha Govindaraj

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 20

The Microarchitecture of Intel Pentium 4

Sudipta Mahapatra

Introduction
The Intel Pentium 4 was introduced in November 2000 targeted at a high clock rate of 1.5 GHz. The Netburst microarchitecture formed the basis for a new family of Intel processors starting from the Pentium 4. Developed with an intention of delivering high level of performance for many important applications such as multimedia.

Targeted application areas

Internet audio and streaming video. Image processing Video content creation Speech recognition 3D applications and games. Video editing and video conferencing.

Overview of the Netburst Microarchitecture

Uses a deeply pipelined architecture to ensure a high clock rate. Uses a high-performance, quad-pumped bus interface to the 100 MHz system bus to transfer data at a rate of 400 MHz. Uses a high speed execution engine to reduce the latency of basic integer instructions

Overview (Contd.)
Out-of-order speculative execution to enable parallelism Superscalar issue to exploit maximal parallelism

Main Features
Hardware register renaming to avoid register name space limitations (WAW hazards) Cache line sizes of 64 bytes Optimization for the common case of frequently executed instructions Improved branch handling techniques.

Basic Block Diagram

Branch-history update

[Glenn Hinton et. al., Intel Technology Jn. Q1, 2001]

Main sections
1. In order front end (FE) 2. Out-of-order Execution logic (OOE) 3. Integer and Floating-point Execution Units (EX) 4. Memory Subsystem (M)

In order front end

Fetches the instructions to be executed next. Supplies a set of decoded instructions to the execution pipeline. Uses accurate branch prediction logic to determine the branch target. The instructions from the branch target are decoded to generate a set of micro-operations or uops that may be executed in the execution core. Uses the trace cache to store the uops corresponding to the most recently executed 9 instructions.

Front end
From L2 Cache

To Allocator/ Register Renamer [Glenn Hinton et. al., Intel Technology Jn. Q1, 2001]
10

Front end components

Trace cache (TC): Serves as the L1 instruction cache. However, it holds the uops corresponding to the most recently decoded instructions. Delivers up to three uops per clock cycle to the OOE. Capacity=12K uops. Only in case of TC miss, the L2 cache is accessed. The trace cache has its own branch predictor that indicates where to go next in the trace cache. This is smaller than the Front-end BTB as it is concerned only with the subset of instructions that are currently in the trace cache. Also includes a 16-entry return address stack.
11

Front end components (Contd.)

Microcode ROM: Is used for complex IA-32 instructions such as the string move and for fault and interrupt handling. In case of complex instructions, control is transferred to the microcode ROM, which then issues the needed uops. Instruction TLB/Pre-fetcher: Responsible for fetching instructions from L2 cache in case of TC miss. Does the translation of supplied IA-32 linear instruction address into the corresponding physical address needed to access the L2 cache. Front-end BTB: Supplies the IA-32 instruction bytes that are predicted to be executed next from the L2 cache. In case of a miss in the BTB, backward branches are 12 predicted taken and forward branches not taken.

Front end components (Contd.)

Instruction decoder: Receives two IA-32 instructions at a time from the L2 cache and decodes them into uops. Can decode at a maximum rate of one IA-32 instruction at a time. Most of the instructions are converted into single uops. If the instruction needs more than 4 uops, control is transferred into the microcode ROM.

Out-of-order Execution logic

Prepares the instructions for out-of-order execution. Uses aggressive reordering to execute the instructions as soon as they are ready to execute. Maximal utilization of execution resources. Has retirement logic to reorder the instructions so that they commit in order.

Out-of-order Execution logic

From uop Queue

To execution units
[Glenn Hinton et. al., Intel Technology Jn. Q1, 2001]

Execution Units
The execution units include several integer and floating point units for result computation. The execution section also includes the L-1 data cache used for most of the load/store operations.

Execution Units
From out-of-order execution logic

From/to memory subsystem [Glenn Hinton et. al., Intel Technology Jn. Q1, 2001]
17

Memory Subsystem
The memory section contains the L2 cache and the system bus. Used to access the main memory when the L2 cache has a cache miss. Also used to access the I/O resources.

Memory Subsystem
To ITLB/Prefetcher

From execution units [Glenn Hinton et. al., Intel Technology Jn. Q1, 2001]

Pentium 4 pipeline
The P6 microarchitecture (P2, P3, Celeron) has twice the pipeline depth of Pentium processor. The Netburst microarchitecture has almost doubled the depth of pipelining of P6. - It allows for a higher frequency of operation. - Different parts of Pentium 4 operate at different clock frequencies.

Datasheet lm393n
No ratings yet
Datasheet lm393n
15 pages
Algorithms Problems
No ratings yet
Algorithms Problems
2 pages
c3088 Camera Module
No ratings yet
c3088 Camera Module
2 pages
Essential Facts About Fourier Series
No ratings yet
Essential Facts About Fourier Series
3 pages
Unit 2 Omputer Network Aktu
100% (1)
Unit 2 Omputer Network Aktu
30 pages
Digital Logic Design Jan 2023
No ratings yet
Digital Logic Design Jan 2023
8 pages
RF Module Quick Mannual
No ratings yet
RF Module Quick Mannual
2 pages
Features of MapReduce
No ratings yet
Features of MapReduce
4 pages
AVR Project Book PDF
No ratings yet
AVR Project Book PDF
71 pages
1.5 Extreme Programming
No ratings yet
1.5 Extreme Programming
12 pages
Research Article: Image Enhancement Method Based On Deep Learning
No ratings yet
Research Article: Image Enhancement Method Based On Deep Learning
9 pages
Hackathon Brochure
No ratings yet
Hackathon Brochure
6 pages
AVR Instruction Set
No ratings yet
AVR Instruction Set
149 pages
Studocu DAA Unit 5 Notes
No ratings yet
Studocu DAA Unit 5 Notes
23 pages
SE Unit 3
No ratings yet
SE Unit 3
10 pages
Collate Se Unit 4 Notes
No ratings yet
Collate Se Unit 4 Notes
37 pages
Question Bank Unit 1 PDF
No ratings yet
Question Bank Unit 1 PDF
27 pages
Unit-3 Oose
No ratings yet
Unit-3 Oose
81 pages
Angular JS Lab Manual
No ratings yet
Angular JS Lab Manual
43 pages
System Design Activities
No ratings yet
System Design Activities
41 pages
SE Unit 4 - Part 2
No ratings yet
SE Unit 4 - Part 2
9 pages
CCS356 OOSE -NOTES-Final
No ratings yet
CCS356 OOSE -NOTES-Final
114 pages
Android Studio Viva Questions
No ratings yet
Android Studio Viva Questions
23 pages
Distributed Databases: Course Code:13IT1109 L TPC 4 0 0 3
No ratings yet
Distributed Databases: Course Code:13IT1109 L TPC 4 0 0 3
3 pages
Cloud Computing Security Testing
No ratings yet
Cloud Computing Security Testing
12 pages
Notes - Unit 3 - Map Reduce Applications
No ratings yet
Notes - Unit 3 - Map Reduce Applications
11 pages
W5HH Principle
0% (1)
W5HH Principle
28 pages
Exercise 2
No ratings yet
Exercise 2
11 pages
Internship 7th Sem
No ratings yet
Internship 7th Sem
16 pages
Unit V - Security in The Cloud
0% (1)
Unit V - Security in The Cloud
10 pages
Oomd (U1&u2)
100% (1)
Oomd (U1&u2)
83 pages
Internal Product Attribute Measurement: Size
No ratings yet
Internal Product Attribute Measurement: Size
70 pages
SPM Lecture Notes 2023 (R20 III-I)
No ratings yet
SPM Lecture Notes 2023 (R20 III-I)
76 pages
DSA RTU 2022 Paper
No ratings yet
DSA RTU 2022 Paper
15 pages
NLP Asgn2
No ratings yet
NLP Asgn2
7 pages
Software Testing Methodologies
No ratings yet
Software Testing Methodologies
40 pages
Mobile Application Development
No ratings yet
Mobile Application Development
193 pages
PPS Course Material
100% (1)
PPS Course Material
177 pages
Clean Room Software Engineering
No ratings yet
Clean Room Software Engineering
39 pages
Unit I: Software Process Maturity Software Maturity Framework
No ratings yet
Unit I: Software Process Maturity Software Maturity Framework
27 pages
Enterprise Information Architecture Component Model - Chapter 5
100% (1)
Enterprise Information Architecture Component Model - Chapter 5
27 pages
Software Testing Methodologies: Asst - Prof.A.MOHAN
No ratings yet
Software Testing Methodologies: Asst - Prof.A.MOHAN
100 pages
Graphs Assignment
No ratings yet
Graphs Assignment
5 pages
AoA Important Question
100% (1)
AoA Important Question
3 pages
Classical Analysis
No ratings yet
Classical Analysis
6 pages
ACP Question Bank
No ratings yet
ACP Question Bank
5 pages
Se Module 2 PPT
No ratings yet
Se Module 2 PPT
86 pages
Industrial Extreme Programming: Submitted By: Group 3 Submitted To
No ratings yet
Industrial Extreme Programming: Submitted By: Group 3 Submitted To
7 pages
Basis Path Testing
No ratings yet
Basis Path Testing
4 pages
SOLUTIONS That I Can Copy and PASTE Krypton - Fhda.edu - Mmurperfefhy - Cnet-53f - Resources - ISM Book Exercise Solutions
No ratings yet
SOLUTIONS That I Can Copy and PASTE Krypton - Fhda.edu - Mmurperfefhy - Cnet-53f - Resources - ISM Book Exercise Solutions
32 pages
MCA 4th Sem ADA Lab Mannual
No ratings yet
MCA 4th Sem ADA Lab Mannual
26 pages
Lab 2
No ratings yet
Lab 2
6 pages
LP 4 Lab Manual
No ratings yet
LP 4 Lab Manual
52 pages
Lab Manual: Department of Computer Engineering
No ratings yet
Lab Manual: Department of Computer Engineering
66 pages
R20 - II To IV Year Syllabus CSE
No ratings yet
R20 - II To IV Year Syllabus CSE
25 pages
Unit - V Implementation, Testing & Maintenance
No ratings yet
Unit - V Implementation, Testing & Maintenance
60 pages
Ece443 - Wireless Sensor Networks Course Information Sheet: Electronics and Communication Engineering Department
No ratings yet
Ece443 - Wireless Sensor Networks Course Information Sheet: Electronics and Communication Engineering Department
10 pages
OOAD Question Bank
100% (2)
OOAD Question Bank
5 pages
Unit Ii
No ratings yet
Unit Ii
61 pages
ST 2
No ratings yet
ST 2
46 pages
TRB Rejinpaul Question Papets
No ratings yet
TRB Rejinpaul Question Papets
12 pages
Uid-Graphical System Advatages
No ratings yet
Uid-Graphical System Advatages
21 pages
CS 606 Skill Dev Lab - 7TO 10 - 1648109707
No ratings yet
CS 606 Skill Dev Lab - 7TO 10 - 1648109707
12 pages
Question Bank: Subject: Data Structures and Algorithms
No ratings yet
Question Bank: Subject: Data Structures and Algorithms
6 pages
Optimizing Hadoop for MapReduce
From Everand
Optimizing Hadoop for MapReduce
Khaled Tannir
No ratings yet
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
From Everand
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
Robert Johnson
No ratings yet
AppDynamics Third Edition
From Everand
AppDynamics Third Edition
Gerardus Blokdyk
No ratings yet