0% found this document useful (0 votes)

46 views7 pages

CS398 Exam 3, 2 Chance December 17th, 2012: Circle The Section That Attend (So We Can Hand Back Your Exam)

This document contains instructions for an exam for the CS398 course. It provides the date, time and location for a second chance exam. It lists the sections that students can choose from to take the exam. It also includes instructions for the exam such as the time limit, materials allowed, and a warning to show work for credit. The exam contains 3 questions worth a total of 100 points. Question 1 is on pipelining for 40 points. Question 2 is on cache analysis for 25 points. Question 3 involves rewriting code to optimize cache performance for 20 points. Students are to indicate which questions they want graded on a scantron form.

Uploaded by

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

46 views7 pages

CS398 Exam 3, 2 Chance December 17th, 2012: Circle The Section That Attend (So We Can Hand Back Your Exam)

Uploaded by

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

CS398 Exam 3, 2nd Chance

December 17th, 2012

Name:
NETID:
Circle the section that attend (so we can hand back your exam).
Monday

Tuesday

AYA (1-3pm) Craig

AYE (9-11am) Maria

AYB (2-4pm) Jon

AYF (10am-noon) Ting

AYC (3-5pm) Michael

AYG (11am-1pm) Ting

AYD (4-6pm) Ting

AYH (noon-2pm) Jon

AYI (1-3pm) Ryan
AYJ (2-4pm) Michael
AYK (3-5pm) Michael
AYL (4-6pm) Ryan
AYM (5-7pm) Ryan

This exam has 7 pages; the final sheet is provided as a reference to you.
You have 120 minutes.
No calculators or other electronics are allowed. You may bring one 8.5 x 11 sheet of handwritten notes.
To make sure you receive credit, please write clearly and show your work.
We will not answer questions regarding course material.
The 2nd chance test is done at the granularity of the 3 questions; if you choose to have a
question graded, we will grade ALL PARTS of that question and use that to update your score.

Question

Maximum

Total

100

Your Score

Question 1: Pipelining (40 points)

Consider the 6-stage pipeline shown below; in this pipeline, the MEM stage occupies two pipeline stages
(MEM1, MEM2) with loads completing in the MEM2 stage (arithmetic instructions complete in the EX stage as
normal). Full bypassing is provided. Assume that all branches and jumps are predicted as not-taken.
Conditional branches and indirect branches (e.g., jr) are resolved in the EX stage. Unconditional jumps are
resolved in the ID stage. Assume mul is a single real instruction that is executed by the ALU and completes in
the EX stage.

MEM1/MEM2
WB

EX/MEM1
WB
M
WB
Control
MEM2/WB

EX
M
WB
IF/ID

PC

Read
a
data 1

b
Read
register 1
Mem1Read Mem1Wr
c

Addr
Instr
d
Read
ALU

ALUSrc
register 2
Zero
forwA

Write
Read
Result
Address
a
register
data 2

Instruction
b
MemToReg
Data
0
memory
memory
Write
Registers
c

1
data
d

Write Read
Instr [15 - 0]
1
data
data
RegDst

Extend
forwB
0
Rt

0
Rd

1
EX/MEM1.
Rs
RegisterDst

MEM1/MEM2.

RegisterDst
Forwarding

Unit

MEM2/WB.RegisterDst

For all of this question consider the MIPS assembly code on the following page. Corresponding C code is
!
shown below.
!
Part (a) Annotate the MIPS assembly to indicate all of the true data dependences. (5 points)
!
For the next two parts, use your scantron form. We recommend that you first answer on the next page and then
!
copy your answers to the scantron form. In any case, we wont give credit for any answers not on the scantron.
!
!
Part (b) Indicate which instructions will be stalled: a) no stall b) 1-cycle stall c) 2-cycle stall (5 points)
!
Part (c) Indicate how each of the forwarding muxes (forwA, forwB) will set in the cycles when each
!
instruction is in the EX stage. Answer a-d as labeled in the diagram above. (10 point)
!
!
void !
typedef struct pixel {!
!
map (pixel_t *pixel, int scale) {!
int x, y, z;!
!
while (pixel != NULL) {!
struct pixel *next;!
pixel->z = scale*pixel->x +!
} pixel_t;!
!
pixel->y;!
!
pixel = pixel->next;
!
}!
}!

!
Question 1:! Pipelining, cont.

Mark this box if you want this question graded.

It will replace your score for this question, even if lower.

map:

!beq !$a0, $0, done!

loop:

!lw !$t1, 0($a0)

stall

!forwA

!mul !$t1, $t1, $a1

____1

!____10 !____11!

!lw !$t2, 4($a0)

____2

!____12 !!

!add !$t2, $t2, $t1

____3

!____13 !____14!

!sw !$t2, 8($a0)

____4

!____15 !____16!

!lw !$a0, 12($a0)

____5

!____17 !!

!bne !$a0, $0, loop

____6

!____18 !____19!

done:

!forwB!

!jr !$ra!
Part (e) Re-schedule/re-write the function to
make it faster. Faster code will achieve more
points, but your answer must fit in the space below.
(10 points)

Part (d) Compute how many cycles each loop

iteration takes on average. Explain your answer for
partial credit. (10 points)

Question 2: Cache Analysis (25 points)

For an 8KB 2-way set-associative, write-back cache with 32B blocks on a machine with 32-bit address
spaces (both virtual and physical) and no hardware prefetching, consider the following code:
struct hoof { int has_horseshoe, shoe_size; };
struct unicorn {
int horn_length;
char *name;
struct hoof *hooves[4];
// this is an array of pointers
};
struct unicorn unicorns[1000];
// thats a whole lotta unicorns
int longest_horn = 0, biggest_shoe = 0;
for (int i = 0 ; i < 1000 ; i ++) {
if (unicorns[i].horn_length >= longest_horn) {
longest_horn = unicorns[i].horn_length;
}
for (int j = 0 ; j < 4 ; j ++) {
if (unicorns[i].hooves[j]->has_horseshoe &&
(unicorns[i].hooves[j]->shoe_size >= biggest_shoe)) {
biggest_shoe = unicorns[i].hooves[j]->shoe_size;
}
}
}

Assume that everything is in registers, except the data structure unicorns.

Do not write here. Really.

Question 2: Cache Analysis (25 points)

Mark this box if you want this question graded.

It will replace your score for this question, even if lower.

Part (a) Compute the MINIMUM number of cache misses per outer-loop iteration that is possible for the
code on the previous page. Explain how you computed it and the assumptions you made! (15 points)

Part (b) Compute the MAXIMUM number of cache misses per outer-loop iteration that is possible for the
code on the previous page. Explain how you computed it and the assumptions you made! (10 points)

Question 3: Cache-aware Programming (20 points)

Mark this box if you want this question graded.
It will replace your score for this question, even if lower.
Rewrite the following codes to optimize cache performance on a system with hardware stream prefetching
(i.e., if you are fetching sequentially no software prefetching is necessary) and a single-level cache. The
cache is a 2-way set associative 16KB cache with 32B blocks. Software prefetch syntax, if you choose to
use it, is shown on the right.
void __builtin_prefetch(const void *addr,
! unsigned
rw,
unsigned
locality);!
!

a) (10 points)
!
addr: the address of the memory to prefetch.
#define N 5000!
rw: (optional) a compile-time constant 1 or 0; 1: the
!
program anticipates writing the data soon, 0 (default) in
for (int i = 0 ; i < N ; i ++) {!
the near term, the program expects to only read the data.
for (int j = 0 ; j < N ; j += 2) {! locality: (optional) a compile-time constant from 0 to 3.
0: the data has no temporal locality, so need not be left in
A[i][j] = A[i][j+1];!
the cache after the access, 3: (default) the data has a high
B[j][i] = B[j+1][i];!
degree of temporal locality and should be retained in all
levels of cache if possible. Use 0 or 3 based on the
}!
expected reuse.
}!
!
!
!
!
!
!
!

Question 3: Cache-aware Programming (20 points), cont.

b) (10 points)
!
#define N 5000!
double A[N][N][N], B[N][N], C[N][N];!
!
for (int i = 0 ; i < N ; i ++) {!
for (int j = 0 ; j < N ; j ++) {!
double temp = 0.0;!
for (int k = 0 ; k < N ; k ++) {!
temp += B[i][0] * A[k][j][i];!
}!
C[i][j] = temp;!
}!
}!

Answer:: Remark
No ratings yet
Answer:: Remark
72 pages
Final Exam Solution - Test Paper Final Exam Solution - Test Paper
No ratings yet
Final Exam Solution - Test Paper Final Exam Solution - Test Paper
82 pages
Exam 19 March Questions
No ratings yet
Exam 19 March Questions
13 pages
40 Out
No ratings yet
40 Out
80 pages
Written Asst2
No ratings yet
Written Asst2
27 pages
Computer Architecture Final 1 2022
No ratings yet
Computer Architecture Final 1 2022
2 pages
Final Exam Solution - Test Paper Final Exam Solution - Test Paper
No ratings yet
Final Exam Solution - Test Paper Final Exam Solution - Test Paper
15 pages
Final Exam: 15-213 Introduction To Computer Systems
No ratings yet
Final Exam: 15-213 Introduction To Computer Systems
17 pages
Lab Syllabus
No ratings yet
Lab Syllabus
21 pages
Final Soln 2015 PDF
No ratings yet
Final Soln 2015 PDF
19 pages
Coa Applied
No ratings yet
Coa Applied
13 pages
Final 2014
No ratings yet
Final 2014
12 pages
Practice Final Soln
No ratings yet
Practice Final Soln
17 pages
M116C 1 EE116C-Midterm2-w15 Solution
100% (1)
M116C 1 EE116C-Midterm2-w15 Solution
8 pages
EEX3417 Final - 20202021
No ratings yet
EEX3417 Final - 20202021
9 pages
COE301 Final Solution 162
No ratings yet
COE301 Final Solution 162
10 pages
Cs433 Fa20 Hw3 Solution
No ratings yet
Cs433 Fa20 Hw3 Solution
15 pages
CENG400 Midterm Fall 2015
No ratings yet
CENG400 Midterm Fall 2015
10 pages
End Sem 3rd Semister 2024
No ratings yet
End Sem 3rd Semister 2024
7 pages
Hw5 Solution
No ratings yet
Hw5 Solution
11 pages
CSGC 342
No ratings yet
CSGC 342
7 pages
COMP1411 Final Exam Question Book
No ratings yet
COMP1411 Final Exam Question Book
10 pages
Tech
No ratings yet
Tech
6 pages
Fall16exam Final KAISTans
No ratings yet
Fall16exam Final KAISTans
14 pages
Fall12exam Final KAIST
No ratings yet
Fall12exam Final KAIST
11 pages
Spring16exam Final KAISTans
No ratings yet
Spring16exam Final KAISTans
12 pages
Final 2011
No ratings yet
Final 2011
6 pages
CCEE 213 - 2006 - 2007 - II - Final
No ratings yet
CCEE 213 - 2006 - 2007 - II - Final
10 pages
June 2009 Ugc Net Computer Science Solved
No ratings yet
June 2009 Ugc Net Computer Science Solved
21 pages
2001 Spring Final Sol
No ratings yet
2001 Spring Final Sol
14 pages
ECE391 Final 8-8-2021 Solution-1
No ratings yet
ECE391 Final 8-8-2021 Solution-1
6 pages
350 Exam 2 Spring 2024
No ratings yet
350 Exam 2 Spring 2024
7 pages
Your Name:: Final Exam
No ratings yet
Your Name:: Final Exam
9 pages
Comparch Answers and Questions
No ratings yet
Comparch Answers and Questions
7 pages
Final w11
No ratings yet
Final w11
10 pages
CS4961: Parallel Programming Midterm Exam October 20, 2011
No ratings yet
CS4961: Parallel Programming Midterm Exam October 20, 2011
4 pages
cs146 Fall2017 Midterm1xx
No ratings yet
cs146 Fall2017 Midterm1xx
12 pages
School of Physics, Engineering and Technology: The Statement of Assessment
No ratings yet
School of Physics, Engineering and Technology: The Statement of Assessment
3 pages
Exam2 s09 v2
No ratings yet
Exam2 s09 v2
10 pages
CENG400-Midterm-Fall 2014
No ratings yet
CENG400-Midterm-Fall 2014
9 pages
ECE391 Final Sem202 Solution
No ratings yet
ECE391 Final Sem202 Solution
5 pages
2005 Computer Architecture Solutions
No ratings yet
2005 Computer Architecture Solutions
11 pages
111 Computer Organization - Final
No ratings yet
111 Computer Organization - Final
4 pages
CA Fall 2022 Final Exam
No ratings yet
CA Fall 2022 Final Exam
6 pages
Computer Architecture hw6
No ratings yet
Computer Architecture hw6
3 pages
Quiz 1
100% (1)
Quiz 1
12 pages
Compre 23
No ratings yet
Compre 23
3 pages
Final 18
No ratings yet
Final 18
7 pages
Coss 2
No ratings yet
Coss 2
2 pages
CS211 Exam
No ratings yet
CS211 Exam
10 pages
BFE Final Organization Fall 2014 Answer
No ratings yet
BFE Final Organization Fall 2014 Answer
8 pages
Illinois Exam2 Practice Solfa08
No ratings yet
Illinois Exam2 Practice Solfa08
4 pages
Midtermarch 2
No ratings yet
Midtermarch 2
9 pages
Comparch Comparch-002 Exams Midterm A8Xj46NCRo
No ratings yet
Comparch Comparch-002 Exams Midterm A8Xj46NCRo
9 pages
Instructions: Csce 212: Final Exam Spring 2009
No ratings yet
Instructions: Csce 212: Final Exam Spring 2009
5 pages
Digital Logic Design 5th Edition Chap 1 Notes
No ratings yet
Digital Logic Design 5th Edition Chap 1 Notes
34 pages
CS433 hw1 Fall 07
No ratings yet
CS433 hw1 Fall 07
3 pages
Computer Architecture: Ph.D. Qualifiers Examination - Sample Questions
No ratings yet
Computer Architecture: Ph.D. Qualifiers Examination - Sample Questions
2 pages
Cs433 Sp12 Midterm Sol
No ratings yet
Cs433 Sp12 Midterm Sol
9 pages
General System Architecture
No ratings yet
General System Architecture
28 pages
CP4253 Map Unit I
No ratings yet
CP4253 Map Unit I
31 pages
7 A H-Brigde For DC-Motor Applicattions
No ratings yet
7 A H-Brigde For DC-Motor Applicattions
29 pages
VLSI Interview Questions
No ratings yet
VLSI Interview Questions
3 pages
High and Low Side Driver: Features Product Summary
No ratings yet
High and Low Side Driver: Features Product Summary
14 pages
Lab Practical File: " Embedded System's "
No ratings yet
Lab Practical File: " Embedded System's "
17 pages
How To Understand Xilinx Spartan 6 FPGA Better
No ratings yet
How To Understand Xilinx Spartan 6 FPGA Better
15 pages
AMI BIOS Survival Guide
No ratings yet
AMI BIOS Survival Guide
22 pages
PCG-505F/505FX: Service Manual
No ratings yet
PCG-505F/505FX: Service Manual
20 pages
Design of Fault Tolerant Systems
No ratings yet
Design of Fault Tolerant Systems
7 pages
Singapore p1
No ratings yet
Singapore p1
46 pages
STA - Explanation of Clock Skew Concepts in VLSI - by ANKIT MAHAJAN - Medium
No ratings yet
STA - Explanation of Clock Skew Concepts in VLSI - by ANKIT MAHAJAN - Medium
8 pages
Manual Ga-78lmt-Usb3 v.5.0
No ratings yet
Manual Ga-78lmt-Usb3 v.5.0
36 pages
Nano Scale Silicon Mosfets IJERTCONV2IS03066
No ratings yet
Nano Scale Silicon Mosfets IJERTCONV2IS03066
4 pages
24T Adder PDF
No ratings yet
24T Adder PDF
4 pages
Prasad Pandit Resume
No ratings yet
Prasad Pandit Resume
3 pages
TL 7705 B
No ratings yet
TL 7705 B
25 pages
Ieee Icccsmd Template
No ratings yet
Ieee Icccsmd Template
15 pages
PIC16F84A Data Sheet: 18-Pin Enhanced FLASH/EEPROM 8-Bit Microcontroller
No ratings yet
PIC16F84A Data Sheet: 18-Pin Enhanced FLASH/EEPROM 8-Bit Microcontroller
88 pages
Satyam Electronics
No ratings yet
Satyam Electronics
12 pages
Solution EGCO2220 Quiz
No ratings yet
Solution EGCO2220 Quiz
10 pages
PCI-PCI Bridges: PCI Configuration Cycles and PCI Bus Numbering
No ratings yet
PCI-PCI Bridges: PCI Configuration Cycles and PCI Bus Numbering
2 pages
Lec4 8051 Timers Counters
No ratings yet
Lec4 8051 Timers Counters
10 pages
EET107 Tutorial 3
No ratings yet
EET107 Tutorial 3
7 pages
Pic 16F84
No ratings yet
Pic 16F84
12 pages
Bm11100 Toshiba
No ratings yet
Bm11100 Toshiba
6 pages
Pci-Sig Engineering Change Notice: Title: Date: Affected Document: Sponsor
No ratings yet
Pci-Sig Engineering Change Notice: Title: Date: Affected Document: Sponsor
16 pages
Panasonic Toughbook CF-U1 - Fisa Tehnica
No ratings yet
Panasonic Toughbook CF-U1 - Fisa Tehnica
2 pages
MPMC Imp Questions
No ratings yet
MPMC Imp Questions
1 page
IGNOU PGDCA MCS 202 Computer Organisation Previous Years Unsolved Papers
From Everand
IGNOU PGDCA MCS 202 Computer Organisation Previous Years Unsolved Papers
Manish Soni
No ratings yet
Apache Cassandra Developer Associate - Exam Practice Tests
From Everand
Apache Cassandra Developer Associate - Exam Practice Tests
Cristian Scutaru
No ratings yet

CS398 Exam 3, 2 Chance December 17th, 2012: Circle The Section That Attend (So We Can Hand Back Your Exam)

Uploaded by

CS398 Exam 3, 2 Chance December 17th, 2012: Circle The Section That Attend (So We Can Hand Back Your Exam)

Uploaded by

CS398 Exam 3, 2nd Chance

December 17th, 2012

AYA (1-3pm) Craig

AYE (9-11am) Maria

AYB (2-4pm) Jon

AYF (10am-noon) Ting

AYC (3-5pm) Michael

AYG (11am-1pm) Ting

AYD (4-6pm) Ting

AYH (noon-2pm) Jon

Question 1: Pipelining (40 points)

Mark this box if you want this question graded.

!beq !$a0, $0, done!

!lw !$t1, 0($a0)

!mul !$t1, $t1, $a1

!lw !$t2, 4($a0)

!add !$t2, $t2, $t1

!sw !$t2, 8($a0)

!lw !$a0, 12($a0)

!bne !$a0, $0, loop

Part (d) Compute how many cycles each loop

Question 2: Cache Analysis (25 points)

Assume that everything is in registers, except the data structure unicorns.

Do not write here. Really.

Question 2: Cache Analysis (25 points)

Mark this box if you want this question graded.

Question 3: Cache-aware Programming (20 points)

Question 3: Cache-aware Programming (20 points), cont.

You might also like