
Lecture 14: Multiple Linear Regression

1 Review of Simple Linear Regression in Matrix Form


We have Y = (Y1, . . . , Yn)^T and an n × 2 matrix X whose first column is all 1's. The model
is Y = Xβ + ε. The mean squared error (MSE) is n^{-1}(Y − Xβ)^T(Y − Xβ). The derivative of the MSE
with respect to β is
\[
\frac{2}{n}\left(-X^T Y + X^T X\beta\right) \tag{1}
\]
Setting this to zero at the optimum coefficient vector β̂ gives the (matrix) estimating equation
\[
-X^T Y + X^T X\hat{\beta} = 0 \tag{2}
\]
whose solution is
\[
\hat{\beta} = (X^T X)^{-1} X^T Y. \tag{3}
\]
The fitted values are
\[
\hat{Y} \equiv \hat{m} = X\hat{\beta} = HY
\]
where H is the hat matrix. Geometrically, this means that we find the fitted values by taking
the vector of observed responses Y and projecting it onto the column space of X.

2 Multiple Linear Regression


We are now ready to go from the simple linear regression model, with one predictor variable,
to multiple linear regression models, with more than one predictor variable.
In the basic form of the multiple linear regression model,
1. There are p quantitative predictor variables, X1 , X2 , . . . Xp . We make no assumptions
about their distribution; in particular, they may or may not be dependent. X without a
subscript will refer to the vector of all of these taken together. Thus, X = (X1 , . . . , Xp ).
2. There is a single response variable Y .
3. Y = β0 + β1 X1 + · · · + βp Xp + ε, for some constants (coefficients) β0, β1, . . . , βp.

4. The noise variable ε has E[ε|X = x] = 0 (mean zero), Var[ε|X = x] = σ² (constant
variance), and is uncorrelated across observations.

In matrix form, when we have n observations,
\[
Y = X\beta + \epsilon \tag{4}
\]
where X is an n × (p + 1) matrix of random variables whose first column is all 1's. We assume
that E[ε|X] = 0 and Var[ε|X] = σ²I.
Sometimes we further assume that ε ∼ MVN(0, σ²I), independent of X. From these
assumptions, it follows that, conditional on X, Y has a multivariate Gaussian distribution,
\[
Y|X \sim MVN(X\beta, \sigma^2 I). \tag{5}
\]
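To make the design matrix concrete, here is a minimal R sketch (the sample size, predictor names, and values below are invented for illustration); model.matrix() builds exactly this n × (p + 1) matrix, with a leading column of 1's and one column per predictor.

# Small simulated data set with p = 2 predictors (illustrative values only)
set.seed(1)
n <- 10
x1 <- runif(n)
x2 <- runif(n)

# The design matrix: a column of 1's, then the predictors, so n x (p + 1)
X <- model.matrix(~ x1 + x2)
dim(X)    # 10 rows, 3 columns
head(X)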

3 Derivation of the Least Squares Estimator
We now wish to estimate the model by least squares. Fortunately, we did essentially all of
the necessary work last time.
The MSE is
\[
\frac{1}{n}(Y - X\beta)^T (Y - X\beta) \tag{6}
\]
with gradient
\[
\nabla_\beta \mathrm{MSE}(\beta) = \frac{2}{n}\left(-X^T Y + X^T X\beta\right). \tag{7}
\]
The estimating equation is
\[
-X^T Y + X^T X\hat{\beta} = 0 \tag{8}
\]
and the solution, the ordinary least squares (OLS) estimator, is
\[
\hat{\beta} = (X^T X)^{-1} X^T Y. \tag{9}
\]
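To connect equation (9) with software, the following sketch (simulated data with made-up coefficients) computes (X^T X)^{-1} X^T Y directly and checks that it agrees with the coefficients reported by lm(). This is only a numerical illustration, not how lm() solves the problem internally (it uses a QR decomposition).

set.seed(42)
n <- 200
x1 <- rnorm(n)
x2 <- rnorm(n)
y <- 5 + 2*x1 + 3*x2 + rnorm(n)

X <- cbind(1, x1, x2)                      # n x (p + 1) design matrix
beta.hat <- solve(t(X) %*% X, t(X) %*% y)  # (X'X)^{-1} X'Y
drop(beta.hat)

coefficients(lm(y ~ x1 + x2))              # should match, up to rounding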

3.1 Why Multiple Regression Isn't Just a Bunch of Simple Regressions
When we do multiple regression, the slopes we get for each variable aren’t the same as the
ones we’d get if we just did p separate simple regressions. Why not?
Suppose the real model is Y = β0 + β1 X1 + β2 X2 + ε. (Nothing turns on p = 2; it just
keeps things short.) What would happen if we did a simple regression of Y on just X1? We
know that the optimal (population) slope on X1 is
\[
\frac{\mathrm{Cov}[X_1, Y]}{\mathrm{Var}[X_1]} \tag{10}
\]
Let's substitute in the model equation for Y:
\[
\begin{aligned}
\frac{\mathrm{Cov}[X_1, Y]}{\mathrm{Var}[X_1]} &= \frac{\mathrm{Cov}[X_1, \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \epsilon]}{\mathrm{Var}[X_1]} && (11)\\
&= \frac{\beta_1 \mathrm{Var}[X_1] + \beta_2 \mathrm{Cov}[X_1, X_2] + \mathrm{Cov}[X_1, \epsilon]}{\mathrm{Var}[X_1]} && (12)\\
&= \beta_1 + \frac{\beta_2 \mathrm{Cov}[X_1, X_2] + 0}{\mathrm{Var}[X_1]} && (13)\\
&= \beta_1 + \beta_2 \frac{\mathrm{Cov}[X_1, X_2]}{\mathrm{Var}[X_1]} && (14)
\end{aligned}
\]

The total covariance between X1 and Y thus includes X1's direct contribution to Y plus an
indirect contribution: X1 is correlated with X2, and X2 contributes to Y in its own right.
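To see equation (14) in action, here is a small simulation sketch (the coefficients and the dependence between X1 and X2 are made up): when X1 and X2 are correlated, the slope from the simple regression of Y on X1 alone is pulled away from β1 by roughly β2 Cov[X1, X2]/Var[X1], while the multiple regression recovers both coefficients.

set.seed(7)
n <- 1e4
x1 <- rnorm(n)
x2 <- 0.8*x1 + rnorm(n)                 # X2 is correlated with X1
y  <- 1 + 2*x1 + 3*x2 + rnorm(n)        # true beta1 = 2, beta2 = 3

coef(lm(y ~ x1))["x1"]                  # simple regression: close to 2 + 3*0.8 = 4.4
coef(lm(y ~ x1 + x2))[c("x1", "x2")]    # multiple regression: close to 2 and 3
2 + 3*cov(x1, x2)/var(x1)               # sample version of the formula in (14)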

3.2 Point Predictions and Fitted Values
Just as with simple regression, the vector of fitted values Ŷ is linear in Y, and given by the
hat matrix:
\[
\hat{Y} = X\hat{\beta} = X(X^T X)^{-1} X^T Y = HY. \tag{15}
\]

All of the interpretations given of the hat matrix in the previous lecture still apply. In
particular, H projects Y onto the column space of X.
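As a quick check of equation (15), this sketch (simulated data again) forms the hat matrix explicitly and confirms that HY matches the fitted values from lm(), and that H is idempotent, as a projection must be. Building H explicitly is only reasonable for small n, since it is an n × n matrix.

set.seed(3)
n <- 50
x1 <- runif(n)
x2 <- runif(n)
y <- 1 + x1 + 2*x2 + rnorm(n)

X <- cbind(1, x1, x2)
H <- X %*% solve(t(X) %*% X) %*% t(X)   # the hat matrix

fit <- lm(y ~ x1 + x2)
max(abs(H %*% y - fitted(fit)))         # essentially 0: HY gives the fitted values
max(abs(H %*% H - H))                   # essentially 0: H is idempotent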

4 Properties of the Estimates


As usual, we will treat X as fixed. Now

\[
\hat{\beta} = (X^T X)^{-1} X^T Y \tag{16}
\]
and
\[
Y = X\beta + \epsilon \tag{17}
\]
and so
\[
\hat{\beta} = (X^T X)^{-1} X^T X\beta + (X^T X)^{-1} X^T \epsilon = \beta + (X^T X)^{-1} X^T \epsilon. \tag{18}
\]

4.1 Bias
This is straight-forward:
\[
\begin{aligned}
E\left[\hat{\beta}\right] &= E\left[\beta + (X^T X)^{-1} X^T \epsilon\right] && (19)\\
&= \beta + (X^T X)^{-1} X^T E[\epsilon] && (20)\\
&= \beta && (21)
\end{aligned}
\]

so the least squares estimate is unbiased.
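A hedged simulation sketch of this fact (the fixed design and true coefficients below are arbitrary choices): holding X fixed and redrawing the noise many times, the average of β̂ across replications should sit close to the true β.

set.seed(10)
n <- 100
X <- cbind(1, runif(n), runif(n))        # fixed design with p = 2
beta <- c(5, 2, 3)

beta.hats <- replicate(2000, {
  y <- X %*% beta + rnorm(n)             # fresh noise on each replication
  drop(solve(t(X) %*% X, t(X) %*% y))
})
rowMeans(beta.hats)                      # close to 5, 2, 3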

4.2 Variance and Standard Errors


This needs a little more work. We have
\[
\begin{aligned}
\mathrm{Var}\left[\hat{\beta}\right] &= \mathrm{Var}\left[\beta + (X^T X)^{-1} X^T \epsilon\right] && (22)\\
&= \mathrm{Var}\left[(X^T X)^{-1} X^T \epsilon\right] && (23)\\
&= (X^T X)^{-1} X^T \,\mathrm{Var}[\epsilon]\, X (X^T X)^{-1} && (24)\\
&= (X^T X)^{-1} X^T \sigma^2 I X (X^T X)^{-1} && (25)\\
&= \sigma^2 (X^T X)^{-1} X^T X (X^T X)^{-1} && (26)\\
&= \sigma^2 (X^T X)^{-1} && (27)
\end{aligned}
\]

To understand this a little better, let’s re-write it slightly:
\[
\mathrm{Var}\left[\hat{\beta}\right] = \frac{\sigma^2}{n}\left(\frac{1}{n} X^T X\right)^{-1}. \tag{28}
\]

The first factor, σ²/n, is what we're familiar with from the simple linear model. As n grows,
we expect the entries of X^T X to grow in magnitude, since they are sums over all n data
points; dividing the matrix by n compensates for this. If the sample covariances between all
the predictor variables were 0, taking the inverse would give 1/s^2_{X_i} down the diagonal
(apart from the first diagonal entry, which corresponds to the intercept), just as we got
1/s^2_X in the simple linear model.
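Equation (28) is, up to replacing σ² with an estimate, what R reports through vcov(). The sketch below (simulated data, arbitrary coefficients) checks that σ̂²(X^T X)^{-1}, with σ̂² = RSS/(n − (p + 1)), reproduces vcov() for a fitted lm object.

set.seed(4)
n <- 100
x1 <- runif(n)
x2 <- runif(n)
y <- 5 + 2*x1 + 3*x2 + rnorm(n)

fit <- lm(y ~ x1 + x2)
X <- model.matrix(fit)
sigma2.hat <- sum(residuals(fit)^2) / (n - ncol(X))   # RSS / (n - (p + 1))

max(abs(sigma2.hat * solve(t(X) %*% X) - vcov(fit)))  # essentially 0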

5 Collinearity
We have been silently assuming that (X^T X)^{-1} exists, in other words, that X^T X is
"invertible" or "non-singular". There are a number of equivalent conditions for a matrix to
be invertible:
1. Its determinant is non-zero.

2. It is of "full column rank", meaning all of its columns are linearly independent¹.

3. It is of “full row rank”, meaning all of its rows are linearly independent.
The equivalence of these conditions is a mathematical fact, proved in linear algebra.
What does this amount to in terms of our data? It means that the columns of the design
matrix must be linearly independent in our sample. That is, there must be no set of constants
a0, a1, . . . , ap, not all zero, such that, for every row i,
\[
a_0 + \sum_{j=1}^{p} a_j x_{ij} = 0 \tag{29}
\]
This, in other words, means that X must be of full column rank.


To understand why linear dependence among variables is a problem, take an easy case,
where two predictors, say X1 and X2, are exactly equal to each other. It's then not surprising
that we don't have any way of estimating their coefficients separately. If we get one set of
predictions with coefficients β1, β2, we'd get exactly the same predictions from β1 + γ, β2 − γ,
no matter what γ might be. If there is some other exact linear relation between two variables,
we can similarly trade off their coefficients against each other, without any change in anything
we can observe. If there are exact linear relationships among more than two variables, all of
their coefficients become ill-defined.
We will come back in a few lectures to what to do when faced with collinearity. For now,
we’ll just mention a few clear situations:
1
Recall that a set of vectors is linearly independent if no nontrivial linear combination of them (one with at least one non-zero coefficient) is exactly zero.

• If n < p + 1, the data are collinear.

• If one of the predictor variables is constant, the data are collinear.

• If two of the predictor variables are proportional to each other, the data are collinear.

• If two of the predictor variables are otherwise linearly related, the data are collinear.
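To see collinearity bite in practice, here is a sketch (made-up data) where one predictor is an exact copy of another, so X^T X is singular: the explicit formula cannot be applied, and lm() responds by dropping the redundant column and reporting an NA coefficient for it.

set.seed(5)
n <- 50
x1 <- runif(n)
x2 <- x1                        # exactly collinear with x1
y <- 1 + 2*x1 + rnorm(n)

X <- cbind(1, x1, x2)
# solve(t(X) %*% X)             # would fail: the matrix is singular
coef(lm(y ~ x1 + x2))           # x2 comes back as NA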

6 R
>
> pdf("plots.pdf")
>
> n = 100
> x1 = runif(n)
> x2 = runif(n)
> x3 = runif(n)
> y = 5 + 2*x1 + 3*x2 + 7*x3 + rnorm(n)
>
> Z = cbind(x1,x2,x3,y)
>
> pairs(Z,pch=20)
>
>
>
> out = lm(y ~ x1 + x2 + x3)
>
> print(out)

Call:
lm(formula = y ~ x1 + x2 + x3)

Coefficients:
(Intercept) x1 x2 x3
4.619 2.840 2.607 7.286

>
> coefficients(out)
(Intercept) x1 x2 x3
4.618816 2.840239 2.607443 7.285716
>
> confint(out)
2.5 % 97.5 %

(Intercept) 4.017390 5.220243
x1 2.257652 3.422826
x2 2.004926 3.209960
x3 6.659326 7.912105
>
> head(fitted(out))
1 2 3 4 5 6
11.984243 11.300009 12.006982 11.556004 9.205792 11.400073
>
> head(residuals(out))
1 2 3 4 5 6
-0.3574136 0.0609240 -1.4612416 -0.3427516 -0.1730116 0.3899873
>
> summary(out)

Call:
lm(formula = y ~ x1 + x2 + x3)

Residuals:
Min 1Q Median 3Q Max
-1.91902 -0.59934 0.00622 0.65931 1.81582

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 4.6188 0.3030 15.244 < 2e-16 ***
x1 2.8402 0.2935 9.677 7.34e-16 ***
x2 2.6074 0.3035 8.590 1.58e-13 ***
x3 7.2857 0.3156 23.088 < 2e-16 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 0.8557 on 96 degrees of freedom


Multiple R-squared: 0.8656, Adjusted R-squared: 0.8614
F-statistic: 206.1 on 3 and 96 DF, p-value: < 2.2e-16

>
> newx = data.frame(x1 = .2, x2 = .3, x3 = .7)
>
> predict(out,newdata = newx)
1
11.0691
>

> dev.off()

[Figure: pairs plot of x1, x2, x3, and y produced by pairs(Z, pch = 20); the predictors range over [0, 1] and y over roughly 6 to 14.]