HW 4

This document outlines an assignment for a predictive and prescriptive modeling course. It includes 5 problems to practice building predictive models using regression analysis and building prescriptive models to determine optimal prices. It provides data and grading instructions for students to complete the assignment and submit their R code.

Uploaded by

ham.mehran


BTMA 636 (Fall 2022) HW #4: Building Predictive and Prescriptive Models

Purpose of This Assignment

These problems will help solidify some skills that you might use for your projects. Building predictive models
helps you have a better understanding of the world based on the data, and building prescriptive models helps
you make decisions given your understanding of the world.

Building Predictive Models (45 points)

In the Content folder named HW 4, you’ll find btma.431.736.f2018.v2.rda. This contains the raw final scores
of students who took that class a few years ago, without accounting for any of their bonus marks. The
file also contains students’ final project, post-retake midterm, homework average, and textbook quiz average
scores. It also contains a column specifying whether or not the student was in the BANA concentration
(business analytics). Download the data to answer the following questions. Note that this question is not
meant to make you anxious about grades. Rather, it gives you a chance to practice analyzing data that
you’re familiar with in a way that illustrates some core concepts in regression analysis.
1a) (9 points) Suppose that you wanted to predict students’ raw final scores (excluding bonus marks) using
all the other columns as predictors. To two decimals, what is the coefficient estimate for final.project?
1b) (9 points) In 2018, the homework scores were out of 20 and the textbook scores were out of 15. Scale
them so they are both out of 100. In other words, normalize these numeric predictors so that they represent
percentages. Then re-do the regression with these re-scaled predictors. What changes in the regression
output compared to the previous model?
1c) (9 points) Does your regression model suggest that BANA students do statistically significantly better
than non-BANA students when all the other variables are included in the model? What is the p-value of
the line corresponding to that hypothesis test?
1d) (9 points) Is there evidence that the way in which the post.retake.midterm score impacts the final score
is different between BANA students and non-BANA students? Add the appropriate interaction term to your
previous model, and state the p-value of the line corresponding to the hypothesis test to two decimals.
1e) (9 points) Remove BANA as a predictor of your model. Create another model by predicting the log of
your response variable with the log of the numeric predictors. What is the coefficient for log(final.project)?
Compare the coefficients of this model to the coefficients of the same model but without the logs. The
similarity you see in the output will make more sense once we go through the log-log module in the slide
deck and cover how to interpret coefficients in a log-log model. For now, this is just to plant the seed and
illustrate the point with real-world data.
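The mechanics behind 1b) and 1d) can be sketched on simulated data. This is a minimal illustration, not the assignment data: the data frame `scores`, its column names, and the 0/1 coding of `BANA` are assumptions made up for the example, and the real columns in btma.431.736.f2018.v2.rda may differ.

```r
# Simulated stand-in data (hypothetical column names and coefficients).
set.seed(1)
n <- 200
scores <- data.frame(
  homework = runif(n, 10, 20),   # out of 20, as described for the 2018 data
  midterm  = runif(n, 40, 100),  # assumed to be a percentage already
  BANA     = rbinom(n, 1, 0.4)   # assumed coding: 1 = BANA student
)
scores$final <- 20 + 1.5 * scores$homework + 0.4 * scores$midterm +
  2 * scores$BANA + rnorm(n, sd = 5)

# Baseline model on the raw scale.
fit_raw <- lm(final ~ homework + midterm + BANA, data = scores)

# Rescale homework from a score out of 20 to a percentage, then refit.
scores$homework_pct <- scores$homework / 20 * 100
fit_pct <- lm(final ~ homework_pct + midterm + BANA, data = scores)

# Ratio of the two homework coefficients (compare with 100 / 20).
coef(fit_raw)["homework"] / coef(fit_pct)["homework_pct"]

# Check whether the fitted values differ between the two models.
all.equal(fitted(fit_raw), fitted(fit_pct))

# Interaction term: lets the midterm slope differ by BANA status.
fit_int <- lm(final ~ homework_pct + midterm * BANA, data = scores)
summary(fit_int)$coefficients["midterm:BANA", "Pr(>|t|)"]
```

The formula `midterm * BANA` expands to both main effects plus the `midterm:BANA` interaction, so the last line pulls out the p-value of the row that tests whether the two groups share one slope.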

Building Prescriptive Models (55 points)

2a) (7 points) Farmer Jill has been selling fresh-pressed apple juice at the farmer’s market for some time
now. She has noticed that the higher she sets the price, the fewer people buy her apple juice, all else equal.
She wants to know what price to charge for a bottle of fresh-squeezed apple juice. From the data she has collected,
she estimates (using a regression model) that the expected quantity demanded as a function of price is given
by Q(p) = 50 − 5p. This means that if she sets the price at p = 5, then she can expect to sell 25 bottles. If she
sets the price at p = 2, she can expect to sell 40 bottles. Her marginal cost of production (the cost of producing
and packaging a single bottle of apple juice) is $1. To the nearest ten cents, what is the optimal price? On
D2L, leave out the dollar sign and write the answer to two decimals. What is the profit at the optimal price
(to two decimals, leaving out the dollar sign)? Search for the price between p = 1 and p = 9.
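One way to set up the search in 2a) is a grid search over candidate prices. The sketch below assumes profit is (price minus marginal cost) times expected quantity; the function names `demand` and `profit` and the variable names are illustrative choices, not something the assignment prescribes.

```r
# Expected demand from the problem statement: Q(p) = M - k * p,
# with M = 50 and k = 5 as the defaults.
demand <- function(p, M = 50, k = 5) M - k * p

# Assumed profit definition: (price - marginal cost) * expected quantity.
profit <- function(p, M = 50, k = 5, cost = 1) (p - cost) * demand(p, M, k)

# Candidate prices in ten-cent steps over the stated search range.
prices  <- seq(1, 9, by = 0.1)
profits <- profit(prices)

# Index of the best candidate, then the matching price and profit.
best        <- which.max(profits)
best_price  <- prices[best]
best_profit <- profits[best]
```

A grid search fits the "nearest ten cents" requirement directly, because the candidate prices are exactly the prices Jill could post. Keeping M, k, and cost as arguments means the same two functions can be reused when the demand parameters change.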
2b) (7 points) She actually only has a rough sense of the demand function. In particular, she wants to
understand how sensitive her decision is to parameters of her model. What would be the optimal price
if the demand function was really Q(p) = 45 − 5p? What would be the optimal price if the demand function
was really Q(p) = 55 − 5p? For your reference, this is called sensitivity analysis.
2c) (7 points) Let M denote the maximum demand she would see if she set her price to 0. Then Q(p) = M − 5p.
Plot p*(M) (how the optimal price changes as M changes), where M goes from 40 to 60.
2d) (7 points) Let k denote the parameter measuring the marginal impact of price on demand. Then
Q(p) = M − kp. On the same plot, plot p*(k) (how the optimal price changes as a function of k) for M = 45
and M = 55, where k goes from 2 to 8. Color-code your plot so that p*(k) for M = 45 is in some shade
of red and p*(k) for M = 55 is in some shade of blue. Note: For this part, don’t impose an upper bound of
p = 10 on the price when searching for the optimal price. Search for a price p between p = 1 and p = 15. Also, the
following links may be useful if you want to use ggplot() to plot: https://rpubs.com/euclid/343644 and
https://stackoverflow.com/questions/40833809/add-legend-to-geom-line-graph-in-r/40834306.
Using your work above, to the nearest two decimal places, for what k would the optimal price be $5.00 when
M = 45? Using the same value of k as in the above question, what would be the optimal price when M =
55 (to the nearest two decimal places)?
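The sensitivity plots in 2c) and 2d) amount to wrapping the price search in a function of the demand parameters and evaluating it over a range. The following is a sketch using base R graphics rather than ggplot(); the helper name `optimal_price`, the half-unit spacing of `k_values`, and the specific colors are assumptions for illustration. Note that the linear demand here is not truncated at zero, so very high prices simply produce negative profit and are never selected.

```r
# Hypothetical helper: grid-search optimum for Q(p) = M - k * p,
# assuming a marginal cost of $1 and the search range p = 1 to p = 15.
optimal_price <- function(M, k, cost = 1, grid = seq(1, 15, by = 0.1)) {
  profits <- (grid - cost) * (M - k * grid)
  grid[which.max(profits)]
}

# Optimal price as a function of k for the two values of M.
k_values  <- seq(2, 8, by = 0.5)
p_star_45 <- sapply(k_values, optimal_price, M = 45)
p_star_55 <- sapply(k_values, optimal_price, M = 55)

# Both curves on one plot, red for M = 45 and blue for M = 55.
plot(k_values, p_star_45, type = "l", col = "firebrick",
     ylim = range(c(p_star_45, p_star_55)),
     xlab = "k (marginal impact of price on demand)",
     ylab = "Optimal price p*")
lines(k_values, p_star_55, col = "steelblue")
legend("topright", legend = c("M = 45", "M = 55"),
       col = c("firebrick", "steelblue"), lty = 1)
```

Because `M = 45` is passed by name, `sapply()` feeds each element of `k_values` to the next unmatched argument, which is `k`. The same helper covers 2c) by fixing `k = 5` and iterating over values of M instead.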
2e) (27 points) Over the last few years, Jill has been recording the number of bottles of juice sold each
day as well as the price for that day. Assume that the marginal cost of production remained at $1 during
this time period. Based on the data, she wants to figure out what to price bottles at to maximize her
expected profit. She has asked you to build a decision support tool so that, given a data frame with
quantity and price data, she can find the optimal price to the nearest ten cents. Build this function for her.
In particular, given any set of data (with data on price and quantity sold), your function should build a
polynomial regression model, estimate the parameters of that model to predict profit from price (you can
assume a quadratic relationship between price and profit by adding a 2nd order term when predicting profit
from price), and then use those estimates to find the optimal price to the nearest ten cents. Using your
function and the dataset provided by Jill (salesData.rda in the HW 4 folder), what would you recommend
as the price of her apple juice? Round to the nearest ten cents and write to two decimals.
If you want a bit more of a challenge, instead of assuming a quadratic fit, jump ahead to the model selection
lecture module and try to use model selection techniques to decide the polynomial degree based on the data.
Note: Make sure you have built a user-defined function for 2e). For any dataset the retailer provides (you
can assume the dataset has the same format/columns as the one provided on D2L), your function should
tell the retailer what is the best price to set to maximize expected profit. Your answer to 2e) should be the
output of the functions you made. Otherwise, 25 points will be deducted from your score. Comment your
code well so that anyone can understand how to use your functions.
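The general shape of such a function can be sketched as follows, under stated assumptions: the data frame has columns named `price` and `quantity` (the actual column names in salesData.rda may differ), observed profit is (price minus cost) times quantity, and the search runs over ten-cent steps within the observed price range. The simulated data frame `sim` is made up purely to show a call; it is not Jill's data.

```r
# Hypothetical decision-support function. Assumes columns `price` and
# `quantity`; rename to match the real dataset.
find_optimal_price <- function(sales, cost = 1, step = 0.1) {
  # Observed profit for each recorded day.
  sales$profit <- (sales$price - cost) * sales$quantity
  # Quadratic (2nd-order polynomial) regression of profit on price.
  fit <- lm(profit ~ price + I(price^2), data = sales)
  # Candidate prices in ten-cent steps across the observed price range.
  grid <- seq(round(min(sales$price), 1), round(max(sales$price), 1), by = step)
  # Predicted profit at each candidate; return the best one.
  predicted <- predict(fit, newdata = data.frame(price = grid))
  round(grid[which.max(predicted)], 1)
}

# Simulated data for illustration only (not the assignment dataset).
set.seed(42)
sim <- data.frame(price = round(runif(300, 1, 9), 1))
sim$quantity <- pmax(0, round(50 - 5 * sim$price + rnorm(300, sd = 3)))
recommended <- find_optimal_price(sim)
```

Restricting the grid to the observed price range avoids extrapolating the fitted curve beyond the data, which is a design choice rather than a requirement of the assignment. The `I(price^2)` wrapper is needed so that `^` is treated as arithmetic inside the model formula.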

Grading Scheme

R Code

Submit your R code as a single R file to the D2L dropbox folder. In the file name, include the
homework number, your first name, your last name, and your section number. An example would be
‘HW4_firstName_lastName_L02.R’ or ‘HW4_firstName_lastName_L02.Rmd’ (depending on whether or
not you used RMarkdown). Submit your work as a single R file with the stated naming convention. Don’t
create a zip file with multiple R files. Also, make sure you clearly label each problem in your code.
For example, use R code comments to write: #### Problem 2a ####.
As always, you will have unlimited attempts on the D2L quiz for the assignment.
Note: Make sure you have built a user-defined function for 2e). For any set of quantity and price data,
your function should tell the retailer what is the best price to set to maximize expected profit. Your answer
to 2e) should be the output of this function you made. Otherwise, 25 points will be deducted from your
score. Comment your code well so that anyone can understand how to use your function. Feel free
to use or modify your function from 2e) if you want to, though.
Caution: Early on in the semester, students do not follow instructions carefully. If you follow the steps
below, you can almost guarantee that you will not lose points due to your R code not running properly on
my computer. Carefully making sure your code works fine and does not contain irrelevant chunks of code is
good practice, and you want to have formed these habits prior to working together in teams on projects.

1) Save your current homework R file, and close RStudio.
2) Open your homework R file again.
3) Start at Line 1 (the first line in your code). Press Ctrl + Enter (Cmd + Enter for Macs).
4) Line by line, keep pressing Ctrl + Enter (Cmd + Enter) and check the outputs. Make sure that your
outputs match your D2L quiz responses for the assignment.
5) Remove all extraneous bits or chunks of code that were not needed to produce your output. For
example, if you had bits of code that were dead ends and the objects you created were not used in
your solution, then delete those lines of code. Students have things like ‘Attempt 4’ or ‘Attempt 10’ in
their code. Before doing this step, you may want to save a separate copy that contains all your dead
ends (in case you accidentally delete something you needed). It’s fine to keep a separate copy of
your work (saved as a separate R file) containing the dead ends, but in your final submission, only
submit code that is part of your working solution. Otherwise, if you include all prior attempts before
your working solution, then the TA might accidentally mark the wrong approach and flag your work
for academic misconduct (in which case, I would have to investigate).
6) Repeat Steps 1 - 5 again (to make sure you didn’t remove a needed line of code).
7) If you use setwd() in your code, then either use the load() function to load the datasets or write as R
comments what files you are using from your working directory. I don’t have access to your computer,
so your code will not work on my computer if you just use setwd() and load the data in the Console.
I need those datasets in my working directory to run your code.
8) Save your file with your first name and last name (as specified in the instructions), and submit your
work to D2L. Your work for the assignment should all be in a single R file, not multiple R files. Points
will be taken off if the file name is not in the specified format or if you split your homework file into
multiple R files.
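Step 7 can be made concrete with a small loading helper at the top of the script. This is a sketch only: `load_hw_data` is a hypothetical helper name, and it assumes the .rda files sit in the working directory and are referenced by relative path, so the script does not depend on a machine-specific setwd() call.

```r
# Hypothetical helper: load .rda files by relative path and fail early,
# with a readable message, if any are missing from the working directory.
load_hw_data <- function(files, envir = globalenv()) {
  missing <- files[!file.exists(files)]
  if (length(missing) > 0) {
    stop("Missing from ", getwd(), ": ", paste(missing, collapse = ", "))
  }
  # load() returns the names of the objects it restored.
  unlist(lapply(files, load, envir = envir))
}

# Usage (file names as given in the assignment):
# load_hw_data(c("btma.431.736.f2018.v2.rda", "salesData.rda"))
```

Listing the required files in one place also serves as the documentation that step 7 asks for, since anyone running the script can see which datasets must be present.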
