0% found this document useful (0 votes)

85 views

Comp Arch Proj Report 2

The document analyzes the performance of different branch predictor configurations and branch target buffer (BTB) configurations using three benchmarks: GCC, ANAGRAM, and GO. Simulation results show cycles per instruction (CPI) and hit rates for different branch predictor types (bimodal, two-level, combined) and BTB configurations varying the number of sets and associativity. The combined predictor performed best overall with ANAGRAM showing the highest hit rates and lowest CPI across configurations. GCC generally had the highest CPI and lowest hit rates. Address misses increased for all benchmarks with a smaller 32-set, 2-way BTB configuration.

Uploaded by

Nitu Vlsi

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

85 views

Comp Arch Proj Report 2

Uploaded by

Nitu Vlsi

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 11

Page | 1

The University of Texas at Dallas

Department of Electrical Engineering

EECE/CS 6304: COMPUTER ARCHITECTURE

PROJECT #2

ANALYSIS OF DIFFERENT TYPES OF

BRANCH PREDICTORS

Submitted by,

Chintan Modi (chm130430)

Ujas Patel (unp130030)

Page | 1

INTRODUCTION
In computer architecture, a branch predictor is a digital circuit that
tries to speculate which way a branch will go before this is known for sure (i.e.,
before its execution). The purpose of the branch predictor is to improve the
flow in the instruction pipeline. They play a critical role in achieving high
effective performance in many modern pipelined microprocessor architectures
such as x86.
In this project, we analyze the behavior of different branch predictor
configurations in three well-recognized benchmarks, especially GCC,
ANAGRAM and GO. We used simplescalar sim-outorder, which models all the
execution aspects of Alpha 21264. The simulations provide the CPI
values(sim_CPI), which we used to compare among different benchmarks.
We have used three types of hardware based branch prediction
strategies, they are:
1) Bimodal Predictor: It is a simple predictor, which uses 2-bit saturating
counters to predict if a given branch is likely to be taken or not.
2) Two Level Predictor: A two-level adaptive predictor with an n-bit history
is that it can predict any repetitive sequence with any period if all n-bit subsequences are different. The advantage of the two-level adaptive predictor
is that it can quickly learn to predict an arbitrary repetitive pattern.
3) Combined Predictor: A hybrid predictor also called combined predictor
implements more than one prediction mechanism. The final prediction is
based either on a meta-predictor that remembers which of the predictors
has made the best predictions in the past or a majority vote function based
on an odd number of different predictors.

Page | 2

Part 1: Performance analysis of different types of

branch predictors and different RAS configurations
The simulation is done for different configuration of Return Address
Space (RAS) and types of branch predictions.

Baseline default RAS: Bimodal predictor with the default value for RAS.
-bpred bimod -bpred:bimod 256 -bpred:ras 8 -bpred:btb 64 2

2 Level Predictor: Uses two bit for defining the state for branch predictor.
-bpred 2lev -bpred:2lev 1 256 4 0 -bpred:ras 8 -bpred:btb 64 2

Combining (comb): Combines a two levels and bimodal predictor.

-bpred comb -bpred:comb 256 -bpred:bimod 256 -bpred:2lev 1 256 4 0
-bpred:ras 8 -bpred:btb 64 2

RAS 4: Change the return address stack (RAS) size to 4.

-bpred bimod -bpred:bimod 256 -bpred:ras 4 -bpred:btb 64 2

RAS 16: Change the return address stack (RAS) size to 16.
-bpred bimod -bpred:bimod 256 -bpred:ras 16 -bpred:btb 64 2

Performance Analysis based on CPI:

Sr. No.

Configuration

Benchmarks
GCC

ANAGRAM

Baseline

0.9069

0.466

0.8112

2 Level Predictor

0.9453

0.4578

0.8447

Combining

0.8934

0.4537

0.8052

Bimod:RAS 4

0.9115

0.4663

0.8113

Bimod:RAS 16

0.9066

0.466

0.8112

Graphical Representation with above CPI

1
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0

Benchmarks GCC
Benchmarks ANAGRAM
Benchmarks GO

Page | 3

Above graph clearly displays the performance of different configurations of

branch predictor.
Number of instructions run for GCC= 337326966
Number of instructions run for ANAGRAM= 27022205
Number of instructions run for GO = 692097038
Analysis: Benchmark GCC vs BP Configurations
GCC benchmark has more CPI as compared to the other benchmarks. It
has high CPI for 2 level predictor which uses two bits for defining state of
branch predictor. It can be noted that for combination of two level and bimodal
predictor CPI has decreased. With decrease in Return Stack Address ,CPI
increases.
Analysis: Benchmark ANAGRAM vs BP Configurations
From the above graph, we can infer that ANAGRAM benchmark has a
less CPI than the other two benchmarks. The performance of ANAGRAM
benchmark is fairly constant for all the configurations of branch predictor.
Specifically, CPI is optimal for combination of two level and bimodal predictor
(Comb).
Analysis: Benchmark GO vs BP Configurations
Above graph shows that GO benchmark performs better than the GCC
benchmark. The performance of GO benchmark is almost constant for all the
configurations of branch predictor. Specifically, CPI is optimal for combination
of two level and bimodal predictor (Comb). With respect to bimod size
variation, if we change baseline configuration from the default return address
space from size of 4 to size of 16, CPI performance does not change much.

Page | 4

Performance Analysis based on Address Hit Rates :

Sr. No.
1
2
3
4
5

Configuration
Baseline
2 Level Predictor
Comb
Bimod:RAS 4
Bimod:RAS 16

GCC
0.7102
0.6627
0.7206
0.7058
0.7105

Benchmarks
ANAGRAM
0.9555
0.9579
0.9684
0.9552
0.9555

GO
0.6402
0.5747
0.6409
0.64
0.6402

Graphical Representation with above Address Hit Rates

1.2
1
0.8
0.6
0.4
0.2
0

Benchmarks GCC
Benchmarks ANAGRAM
Benchmarks GO

The above graph clearly shows the performance

configurations of branch predictor for different benchmarks.

different

For ANAGRAM benchmark, for 2 level predictor and combining predictor ,

the hit rates are appreciable.
For GO benchmark, except for 2 level predictor configurations, the
Address Hit Rates are same and appreciable.
For GCC benchmark, except for 2 level predictor configurations, the
Address Hits Rates are appreciable.

Page | 5

Performance Analysis based on Direction Hit Rates

Sr. No.

Configuration

Benchmarks
GCC

ANAGRAM

0.8431

0.9608

0.7525

0.791

0.9629

0.6915

Baseline

2 Level Predictor

Comb

0.8568

0.9736

0.7572

Bimod:RAS 4

0.8431

0.9608

0.7525

Bimod:RAS 16

0.8431

0.9608

0.7525

The graph for the Direction Hit Rates with respect to every benchmark
will provide us more information on the effect of branch prediction
configurations on different benchmarks.
Graphical Representation with above Direction Hit Rates
1.2
1
0.8
0.6
0.4

Benchmarks GCC

0.2

Benchmarks ANAGRAM

Benchmarks GO

The Direction Hit Rates of the branch predictors fairly stays constant for
each benchmark. Specifically, ANAGRAM benchmark has more direction hit
rates than other two benchmarks. In this case, 2 level prediction direction rate
gives worst performance for GCC and GO benchmarks. Combining Predictor
gives best performance for all benchmarks.

Page | 6

Part 2: Modification of the code to accommodate

address misses
We carried out modifications in the following two files in Simplescalar.
1) bpred.h
2) bpred.c
1)

Changes in file bpred.h:

---------------/* branch predictor def */

struct bpred_t {
-----} dirpred;
struct {
-------} retstack;
/* stats */
counter_t addr_hits;
counter_t dir_hits;
counter_t addr_misses;
counter_t used_ras;
counter_t used_bimod;
----------};

/* num correct addr-predictions */

/* num correct dir-predictions (incl addr) */
/* num address misses */
/* num RAS predictions used */
/* num bimodal predictions used (BPredComb) */

2) Changes in file bpred.c:

----------sprintf(buf, "%s.dir_hits", name);
stat_reg_counter(sdb, buf, "total number of direction-predicted hits "
hits)",
&pred->dir_hits, 0, NULL);
sprintf(buf, "%s.addr_misses", name);
stat_reg_counter(sdb, buf, "total number of address misses",
&pred->addr_misses, 0, NULL);
----------if (bpred == NULL)
return;

"(includes addr-

bpred->dir_hits = 0;
bpred->addr_misses = 0;
----------/* Have a branch here */
if (correct)
pred->addr_hits++;
if (!!pred_taken == !!taken)
pred->dir_hits++;
else
pred->misses++;
pred->addr_misses= (pred->misses + pred->dir_hits - pred->addr_hits);
-----------

Page | 7
}

Part 3: Comparison of BTB Performance

The simulation is done for the following configurations of Branch Target
Buffer:
Baseline BTB configuration: 64 sets, 2 way associativity
bpred bimod bpred:bimod 256 -bpred:btb 64 2
Showing the effect of the number of sets in BTB with the following options
bpred bimod bpred:bimod 256 -bpred:btb 32 2
bpred bimod bpred:bimod 256 bpred:btb 128 2
Showing the effect of associativity when the total size of BTB is fixed with the
following options
bpred bimod bpred:bimod 256 -bpred:btb 32 4
bpred bimod bpred:bimod 256 -bpred:btb 128 1
Performance Analysis based on addr_hits
Sr. No.
1
2
3
4
5

Configuration
64 sets/2 way
32 sets/2 way
128 sets/2 way
32 sets/4 way
128 sets/1 way

GCC
1005521
937745
1100970
1018386
995879

Benchmarks
ANAGRAM
2032397
2020880
2034249
2037020
2028135

GO
1051818
1010267
1076578
1054258
1031176

Graphical Representation with above addr_hits

2500000
2000000
1500000
1000000
500000
0

Benchmarks GCC
Benchmarks ANAGRAM
Benchmarks GO

The above graph shows the behavior of various configurations of Branch

Target Buffer (BTB) for different benchmarks. Among all the three benchmarks,
ANAGRAM benchmark has the highest address hits and the performance is

Page | 8

relatively minimum for BTB with 32 sets and 2 way set associative. GO
benchmark has moderate address hits and the performance is relatively
minimum for BTB with 32 sets and 2 way set associative. GCC benchmark has
poor address hits when compared to other benchmark. For this benchmark,
the address hits is again minimum for the configuration of BTB with 32 sets
and 2 way set associative.
Comparison of BTB Performance based on addr_misses
Sr. No.
1
2
3
4
5

Configuration
64 sets/2 way
32 sets/2 way
128 sets/2 way
32 sets/4 way
128 sets/1 way

GCC
563339
631115
467890
550474
572981

Benchmarks
ANAGRAM
76544
88061
74692
71921
80806

GO
345506
387057
320746
343066
366148

Graphical Representation with above addr_misses

700000
600000
500000
400000
300000
200000
100000
0

Benchmarks GCC
Benchmarks ANAGRAM
Benchmarks GO

From the above graph, as expected, address misses is very optimal for
ANAGRAM benchmark. GCC benchmark has maximum address misses among
all the three benchmarks. As we can see from the graph, decreasing the
sets from 64 to 32 increases the address misses and increasing the
number of set from 64 to 128 decreases the address misses. This is
because capacity misses is reduced by increasing the number of sets. In case
of 32 sets/4 way configuration, even though set is decreased from 64 to 32 the
address miss is decreased because the associativity is increased which
reduces the conflict misses. In case of 128 sets/1 way configuration, due to
direct mapping, even the increase in number of set increases the addr_misses.

Page | 9

Comparison of BTB Performance based on CPI

Sr. No.
1
2
3
4
5

Configuration
64 sets/2 way
32 sets/2 way
128 sets/2 way
32 sets/4 way
128 sets/1 way

GCC
0.9741
0.9899
0.9495
0.9737
0.9748

Benchmarks
ANAGRAM
0.4578
0.4601
0.4572
0.457
0.4584

GO
0.7208
0.7265
0.716
0.7206
0.7226

Graphical Representation with above CPI

1.2
1
0.8
0.6
0.4

Benchmarks GCC

0.2

Benchmarks ANAGRAM

Benchmarks GO

From the above graph, CPI remains fairly constant for every benchmark.
Among the benchmarks, ANAGRAM benchmark has the most optimal CPI and
GCC benchmark holds the maximum CPI for execution with various BTB
configurations. The CPI seems to be higher for configuration 32 sets/2 way
compared to the 64 sets/2 way which has much higher sets than this
configuration. In case of 32 sets/4 way and 128 sets/1 way configurations,
associativity and number of sets makes the CPI almost equal to the 64 sets/2
way CPI. For the configuration with set 128 and associativity 2 the CPI remains
much lower than all other configurations.

P a g e | 10

Comparison of BTB Performance based on Branch Predictor Hit Rates

Sr. No.
1
2
3
4
5

Configuration
64 sets/2 way
32 sets/2 way
128 sets/2 way
32 sets/4 way
128 sets/1 way

GCC
0.6409
0.5977
0.7018
0.6491
0.6348

Benchmarks
ANAGRAM
0.9637
0.9582
0.9646
0.9659
0.9615

GO
0.7527
0.723
0.7705
0.7545
0.738

Graphical Representation with above Branch Predictor Hit Rates

1.2
1
0.8
0.6
0.4

Benchmarks GCC

0.2

Benchmarks ANAGRAM
Benchmarks GO

The above graph clearly shows us that the branch predictor hit rate
for all the benchmarks is relatively low when number of set decreases in
a BTB. When we closely observe the variation in the branch predictor hit rates
of different configurations, it is evident that for BTB configuration, 32 sets and
2 way set associative the branch prediction hit rate is lower for all the
benchmarks. If we have change 32 sets with 4 way set associative to 128 sets
with 1 way set associative, branch prediction hit rate decreases.

CONCLUSION
For an optimal branch predictor, it is recommended to have higher sets but at
the same time tradeoff between cost and performance should be taken into
consideration.
To have high address hit rates and direction hit rates, the simulation results
suggests that combination of two level and bimodal predictor configuration is
better.

Assignment 1
No ratings yet
Assignment 1
3 pages
Marketing Strategies of Harley-Davidson: Case Details
No ratings yet
Marketing Strategies of Harley-Davidson: Case Details
3 pages
PSC Checklist
No ratings yet
PSC Checklist
19 pages
Management and Ethics
100% (1)
Management and Ethics
12 pages
United States v. Bradley Carter - Plea Agreement Letter
100% (2)
United States v. Bradley Carter - Plea Agreement Letter
11 pages
Fundamentals of Digital Quadrature Modulation
100% (2)
Fundamentals of Digital Quadrature Modulation
5 pages
Branch Prediction: Prof. Mikko H. Lipasti University of Wisconsin-Madison
No ratings yet
Branch Prediction: Prof. Mikko H. Lipasti University of Wisconsin-Madison
22 pages
07 Branch Prediction
No ratings yet
07 Branch Prediction
35 pages
Branch Predictors
No ratings yet
Branch Predictors
41 pages
Branch Prediction Techniques: Prof. Pimal Khanpara Department of Computer Science & Engineering
No ratings yet
Branch Prediction Techniques: Prof. Pimal Khanpara Department of Computer Science & Engineering
20 pages
Branch Handling
No ratings yet
Branch Handling
23 pages
Dynamic Branch Prediction
No ratings yet
Dynamic Branch Prediction
17 pages
Dynamic Branch Prediction
No ratings yet
Dynamic Branch Prediction
7 pages
Branch Prediction
No ratings yet
Branch Prediction
38 pages
17.L15 BranchPrediction
No ratings yet
17.L15 BranchPrediction
38 pages
Software-Based and Hardware-Based Branch Prediction Strategies and Performance Evaluation
No ratings yet
Software-Based and Hardware-Based Branch Prediction Strategies and Performance Evaluation
19 pages
5.Branch prediction
No ratings yet
5.Branch prediction
25 pages
CA Lecture 4 Module 3
No ratings yet
CA Lecture 4 Module 3
27 pages
Spec Cpu2000
No ratings yet
Spec Cpu2000
3 pages
Branch Prediction
No ratings yet
Branch Prediction
2 pages
branchPred
No ratings yet
branchPred
27 pages
18 740 Fall15 Lecture05 Branch Prediction Afterlecture
No ratings yet
18 740 Fall15 Lecture05 Branch Prediction Afterlecture
93 pages
Branch Prediction Maryamhamza
No ratings yet
Branch Prediction Maryamhamza
12 pages
CA_L15a_BranchPrediction_Intro_And_StaticPredictors
No ratings yet
CA_L15a_BranchPrediction_Intro_And_StaticPredictors
19 pages
Prof. Ajit Pal Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture - 16 Branch Prediction
No ratings yet
Prof. Ajit Pal Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture - 16 Branch Prediction
26 pages
Computer Architecture: Branching
No ratings yet
Computer Architecture: Branching
37 pages
Branch Prediction
No ratings yet
Branch Prediction
6 pages
S - C L - C++ H F T: B B P H: EMI Static Onditions in OW Latency FOR IGH Requency Rading Etter Than Ranch Rediction Ints
No ratings yet
S - C L - C++ H F T: B B P H: EMI Static Onditions in OW Latency FOR IGH Requency Rading Etter Than Ranch Rediction Ints
53 pages
5 4-Pipelining
No ratings yet
5 4-Pipelining
10 pages
9.1.0 Branch Prediction Pentiums IBM PPC
No ratings yet
9.1.0 Branch Prediction Pentiums IBM PPC
163 pages
AmarthyaRidheeshSethPravarProj1
No ratings yet
AmarthyaRidheeshSethPravarProj1
4 pages
Aca Unit-4 Notes
No ratings yet
Aca Unit-4 Notes
23 pages
8 - Branch Prediction
No ratings yet
8 - Branch Prediction
29 pages
Lec4 Supp Branch Prediction
No ratings yet
Lec4 Supp Branch Prediction
45 pages
Ue21ec341b 20240412163937
No ratings yet
Ue21ec341b 20240412163937
22 pages
The Bi-Mode Branch Predictora
No ratings yet
The Bi-Mode Branch Predictora
11 pages
L11 PipelineHazards 4
No ratings yet
L11 PipelineHazards 4
30 pages
Questions That I Encountered
No ratings yet
Questions That I Encountered
9 pages
Branch Prediction ARM
No ratings yet
Branch Prediction ARM
14 pages
10_branchprediction
No ratings yet
10_branchprediction
49 pages
Folien BranchPredictionOptimization
No ratings yet
Folien BranchPredictionOptimization
5 pages
lect09-adv-branch-prediction
No ratings yet
lect09-adv-branch-prediction
55 pages
L10 PipelineHazards 3
No ratings yet
L10 PipelineHazards 3
35 pages
Computer Architecture Solutions_OK
No ratings yet
Computer Architecture Solutions_OK
6 pages
Branch Prediction
No ratings yet
Branch Prediction
41 pages
Implementing a Branch Predictor
No ratings yet
Implementing a Branch Predictor
7 pages
البحث الثاني
No ratings yet
البحث الثاني
10 pages
WRL-TN-36
No ratings yet
WRL-TN-36
29 pages
L12 - Advanced Branch Preiction
No ratings yet
L12 - Advanced Branch Preiction
9 pages
CA - Slides
No ratings yet
CA - Slides
28 pages
Confidence-Based Branch-Mispredict Compensation: David Robinson, Jonathan Taylor
No ratings yet
Confidence-Based Branch-Mispredict Compensation: David Robinson, Jonathan Taylor
2 pages
A Hybrid Branch Prediction Scheme: An Integration of Software and Hardware Techniques
No ratings yet
A Hybrid Branch Prediction Scheme: An Integration of Software and Hardware Techniques
8 pages
Branch Prediction: Jeroen Lichtenauer
No ratings yet
Branch Prediction: Jeroen Lichtenauer
23 pages
Dynamic Branch Prediction With Perceptrons
No ratings yet
Dynamic Branch Prediction With Perceptrons
10 pages
CS252 Graduate Computer Architecture Prediction (Con't) (Dependencies, Load Values, Data Values) February 22, 2010
No ratings yet
CS252 Graduate Computer Architecture Prediction (Con't) (Dependencies, Load Values, Data Values) February 22, 2010
54 pages
$RQ5E5IU
No ratings yet
$RQ5E5IU
9 pages
Pipeline Part 2 and Data Hazards
No ratings yet
Pipeline Part 2 and Data Hazards
11 pages
Finding Difficult Branches
No ratings yet
Finding Difficult Branches
19 pages
CA Classes-155-160
No ratings yet
CA Classes-155-160
6 pages
05 - Pipelining - Branch Prediction
No ratings yet
05 - Pipelining - Branch Prediction
20 pages
The Schemes and Performances of Dynamic Branch Predictors: Chih-Cheng Cheng
No ratings yet
The Schemes and Performances of Dynamic Branch Predictors: Chih-Cheng Cheng
18 pages
What About Branches?: Branch Outcomes Are Not Known Until EXE What Are Our Options?
No ratings yet
What About Branches?: Branch Outcomes Are Not Known Until EXE What Are Our Options?
27 pages
Correct Maintenance - Cognex DataMan 8500
From Everand
Correct Maintenance - Cognex DataMan 8500
Unique Content
No ratings yet
LEARN MPLS FROM SCRATCH PART-B: A Beginners guide to next level of networking
From Everand
LEARN MPLS FROM SCRATCH PART-B: A Beginners guide to next level of networking
POONAM DEVI
No ratings yet
Introduction to Area-Based Anti-Aliasing for CGI
From Everand
Introduction to Area-Based Anti-Aliasing for CGI
Michel A Rohner
No ratings yet
Process Monitor
100% (1)
Process Monitor
25 pages
High Speed Serial Intel
No ratings yet
High Speed Serial Intel
6 pages
Discrete-Time Signal Processing (A Review)
No ratings yet
Discrete-Time Signal Processing (A Review)
8 pages
Project #6 Final Project: Layout & Verification: Due: Wed Dec 10, 2014 (Start of Class)
No ratings yet
Project #6 Final Project: Layout & Verification: Due: Wed Dec 10, 2014 (Start of Class)
2 pages
Contractor Timesheet: Date IN Lunch Out Lunch in OUT Hours
No ratings yet
Contractor Timesheet: Date IN Lunch Out Lunch in OUT Hours
2 pages
Steady-State Analysis and Design of A Switched-Capacitor DC-DC Converter
No ratings yet
Steady-State Analysis and Design of A Switched-Capacitor DC-DC Converter
10 pages
Switched Capacitor DC-DC Converters: Topologies and Applications
No ratings yet
Switched Capacitor DC-DC Converters: Topologies and Applications
25 pages
E1.2 Digital Electronics 1: Problem Sheet 3
No ratings yet
E1.2 Digital Electronics 1: Problem Sheet 3
1 page
Digital
No ratings yet
Digital
3 pages
CummingsSNUG1999SJ FSM Perl
No ratings yet
CummingsSNUG1999SJ FSM Perl
20 pages
Functional Verification of GPIO Core Using OVM
No ratings yet
Functional Verification of GPIO Core Using OVM
4 pages
CummingsICU1997 VerilogCodingEfficiency
No ratings yet
CummingsICU1997 VerilogCodingEfficiency
13 pages
Lect3 Transistors
No ratings yet
Lect3 Transistors
19 pages
Scripting Language Manual
No ratings yet
Scripting Language Manual
25 pages
Hidráulica
No ratings yet
Hidráulica
30 pages
Tea in Vietnam - Analysis: Country Report - Mar 2019
No ratings yet
Tea in Vietnam - Analysis: Country Report - Mar 2019
2 pages
Fee Structure For Academic Year 2022-23 - 12072022
No ratings yet
Fee Structure For Academic Year 2022-23 - 12072022
2 pages
Shopify WL in Pakistan by Sheikh Daniyal
No ratings yet
Shopify WL in Pakistan by Sheikh Daniyal
3 pages
Geographical Information System: Course Description
No ratings yet
Geographical Information System: Course Description
4 pages
18549
No ratings yet
18549
11 pages
Chapter 11
No ratings yet
Chapter 11
13 pages
Appraisal On An Automatic Solar Cleaning Robot
No ratings yet
Appraisal On An Automatic Solar Cleaning Robot
8 pages
Algerian (OpenType)
No ratings yet
Algerian (OpenType)
1 page
BAC GIANG - Đề thi chọn ĐT 2023 (chính thức)
No ratings yet
BAC GIANG - Đề thi chọn ĐT 2023 (chính thức)
19 pages
D
0% (1)
D
4 pages
Coa sm0441
No ratings yet
Coa sm0441
3 pages
IKEA and DELL Case Study
No ratings yet
IKEA and DELL Case Study
2 pages
AVCON CAME I.4-Rev1
100% (1)
AVCON CAME I.4-Rev1
139 pages
PT19 1300 Series
100% (1)
PT19 1300 Series
112 pages
Case Chapter 2
No ratings yet
Case Chapter 2
2 pages
Word Quick Reference 2007
No ratings yet
Word Quick Reference 2007
2 pages
Boeco Catalog 2014
No ratings yet
Boeco Catalog 2014
82 pages
A Rule of Thumb Is That
No ratings yet
A Rule of Thumb Is That
4 pages
Case Study - Age Discrimination in The Workplace
No ratings yet
Case Study - Age Discrimination in The Workplace
3 pages
CA Harshad Tekwani Cost Audit 1649392713
No ratings yet
CA Harshad Tekwani Cost Audit 1649392713
2 pages
Compensation HLA Questions
No ratings yet
Compensation HLA Questions
7 pages
Resume of Zinnsqt
No ratings yet
Resume of Zinnsqt
6 pages
Directory of Accredited Testing Laboratories
No ratings yet
Directory of Accredited Testing Laboratories
645 pages

Comp Arch Proj Report 2

Uploaded by

Comp Arch Proj Report 2

Uploaded by

Page | 1

The University of Texas at Dallas

EECE/CS 6304: COMPUTER ARCHITECTURE

ANALYSIS OF DIFFERENT TYPES OF

Chintan Modi (chm130430)

Part 1: Performance analysis of different types of

Combining (comb): Combines a two levels and bimodal predictor.

RAS 4: Change the return address stack (RAS) size to 4.

Performance Analysis based on CPI:

Graphical Representation with above CPI

Above graph clearly displays the performance of different configurations of

Performance Analysis based on Address Hit Rates :

Graphical Representation with above Address Hit Rates

The above graph clearly shows the performance

For ANAGRAM benchmark, for 2 level predictor and combining predictor ,

Performance Analysis based on Direction Hit Rates

Part 2: Modification of the code to accommodate

Changes in file bpred.h:

---------------/* branch predictor def */

/* num correct addr-predictions */

2) Changes in file bpred.c:

Part 3: Comparison of BTB Performance

Graphical Representation with above addr_hits

The above graph shows the behavior of various configurations of Branch

Graphical Representation with above addr_misses

Comparison of BTB Performance based on CPI

Graphical Representation with above CPI

Comparison of BTB Performance based on Branch Predictor Hit Rates

Graphical Representation with above Branch Predictor Hit Rates

You might also like