Lecture 12
Lecture 12
Reference for Definitions and Formulas: Bowerman et al. (2009) and Field (2013)
Multiple Linear Regression
Analysis (Part I)
Now let’s extend simple linear regression analysis by adding
more independent variables…
Recall: Fuel Consumption Case
Cities in the USA use natural gas to heat their homes when
temperatures are low. As is obvious for most of you, natural gas
consumption is highest during the winter. Assume that we are
analysts in a management consulting firm hired by a natural gas
company serving a small city to predict weekly natural gas demand
based on average temperature so they can prepare their supply chain
accordingly. They have provided us with 8 week’s worth of data on
the city’s average temperature and corresponding weekly fuel
consumption.
The natural gas firm will tolerate an error rate of at most 10 percent,
meaning that the difference between our prediction and actual usage
should not be more than 10 percent. This is because the natural gas
firm will have to pay gas transmission companies in case it over or
under-predicts demand.
SSE ei2 ( yi yˆ i ) 2
The Standard Error is the point estimate of the residual standard deviation
σ
CONFIDENCE
INTERVAL
[ŷ t /2 s Distance value ]
PREDICTION
INTERVAL [ŷ t /2 s 1 Distance value ]
DM (mall location)
Value of DD Value of DM
Store is in a suburban street location 0 0
Store is in a downtown location 1 0
Store is in a mall location 0 1
Linear
Linear
Non-Linear in beta1
Linear
Non-Linear in beta 2 and 3
Source: http://blog.minitab.com/blog/adventures-in-statistics/what-is-the-difference-between-linear-and-
© Wilson
nonlinear-equations-in-regression-analysis and Gan 2010
http://www.stat.colostate.edu/regression_book/chapter9.pdf
But note that data transformation changes
interpretation of the regression function
For example:
In the standard model, a one-unit change in x results to a
<parameter value of x> unit change in y
In the power (log-log or elasticity) model, a one percent change in
x results to a <parameter value of x> percent change in y
Source: http://stattrek.com/regression/linear-transformation.aspx?Tutorial=AP
© Wilson Gan 2010
The assumptions that govern MRM are
similar to that of SRM
1. Mean of Zero
The mean of the error terms is equal to zero
2. Constant Variance Assumption
Homoscedasticity. The variance of the error terms σ 2 is, the
same for every combination values of x1, x2,…, xk
3. Normality Assumption
The error terms follow a normal distribution for every
combination values of x1, x2,…, xk
4. Independence Assumption
The values of the error terms are statistically independent of
each other
With any real data, assumptions will not hold exactly. Mild
departures do not affect our ability to make statistical
inferences.
In checking assumptions, we are looking for pronounced
departures from the assumptions.