
MULTIPLE REGRESSION ANALYSIS & APPLICATIONS
Regression Analysis

Regression analysis examines associative relationships between a metric dependent variable and one or more independent variables in the following ways:
• Determine whether the independent variables explain a
significant variation in the dependent variable: whether a
relationship exists.
• Determine how much of the variation in the dependent
variable can be explained by the independent variables:
strength of the relationship.
• Determine the structure or form of the relationship: the
mathematical equation relating the independent and
dependent variables.
• Predict the values of the dependent variable.
• Control for other independent variables when evaluating the
contributions of a specific variable or set of variables.
• Regression analysis is concerned with the nature and degree
of association between variables and does not imply or
assume any causality.
Statistics Associated with Bivariate
Regression Analysis

• Bivariate regression model. The basic regression equation is Yi = β0 + β1Xi + ei, where Y = dependent or criterion variable, X = independent or predictor variable, β0 = intercept of the line, β1 = slope of the line, and ei is the error term associated with the i-th observation.

• Coefficient of determination. The strength of association is measured by the coefficient of determination, r². It varies between 0 and 1 and signifies the proportion of the total variation in Y that is accounted for by the variation in X.

• Estimated or predicted value. The estimated or predicted value of Yi is Ŷi = a + bXi, where Ŷi is the predicted value of Yi, and a and b are estimators of β0 and β1, respectively.
Statistics Associated with Bivariate
Regression Analysis

• Regression coefficient. The estimated


parameter b is usually referred to as the non-
standardized regression coefficient.
• Scattergram. A scatter diagram, or
scattergram, is a plot of the values of two
variables for all the cases or observations.
• Standard error of estimate. This statistic,
SEE, is the standard deviation of the actual Y
values from the predicted Y values.
• Standard error. The standard deviation of b,
SEb, is called the standard error.
Statistics Associated with Bivariate
Regression Analysis

• Standardized regression coefficient. Also termed


the beta coefficient or beta weight, this is the slope
obtained by the regression of Y on X when the data
are standardized.

• Sum of squared errors. The distances of all the points from the regression line are squared and added together to arrive at the sum of squared errors, Σej², which is a measure of total error.

• t statistic. A t statistic with n - 2 degrees of freedom can be used to test the null hypothesis that no linear relationship exists between X and Y, or H0: β1 = 0, where t = b/SEb.
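
As a minimal sketch, the Python snippet below (assuming numpy and statsmodels are available; the data are made up) fits a bivariate regression and extracts the statistics defined above: b, SEb, r², SEE, and the t statistic for H0: β1 = 0.

# Minimal sketch (made-up data): bivariate regression Yi = b0 + b1*Xi + ei
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
x = rng.uniform(1, 10, size=30)              # predictor X
y = 2.0 + 0.8 * x + rng.normal(0, 1, 30)     # criterion Y with random error

fit = sm.OLS(y, sm.add_constant(x)).fit()    # add_constant supplies the intercept column

b = fit.params[1]                            # non-standardized regression coefficient b
se_b = fit.bse[1]                            # standard error of b (SEb)
r2 = fit.rsquared                            # coefficient of determination r^2
see = np.sqrt(fit.mse_resid)                 # standard error of estimate (SEE)
t = fit.tvalues[1]                           # t = b / SEb, with n - 2 degrees of freedom

print(f"b = {b:.3f}, SEb = {se_b:.3f}, t = {t:.2f}, r^2 = {r2:.3f}, SEE = {see:.3f}")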
Examples of Regression Analysis

AW = f(U, A, I)
Where,
AW= Awareness about the product
U = Uniqueness of the product
A = Advertisement
I = Interest in the product category
Examples of Regression Analysis

S = f(AE, P, D)
Where,
S = Sales
AE = Advertisement expenditure
P = Price
D = Level of distribution
Examples of Regression Analysis

MS = f(SSF, AE, SPB)


Where,
MS= Market share
SSF = Size of sales force
AE = Advertisement expenditure
SPB = Sales promotion budgets
Examples of Regression Analysis

CP = f(PP, BI, BA)


Where,
CP = Consumers’ perceptions of quality
PP = Perceptions of prices
BI = Brand image
BA = Brand attributes
Multiple Regression

The general form of the multiple regression model


is as follows:

Y =  0 +  1 X1 +  2 X2 +  3 X3+ . . . +  k X k + e
which is estimated by the following equation:

Y= a + b1X1 + b2X2 + b3X3+ . . . + bkXk


As before, the coefficient a represents the intercept,
but the b's are now the partial regression coefficients.
Statistics Associated with Multiple Regression

• Adjusted R². R², the coefficient of multiple determination, is adjusted for the number of independent variables and the sample size to account for diminishing returns. After the first few variables, additional independent variables do not make much of a contribution.

• Coefficient of multiple determination. The strength of association in multiple regression is measured by the square of the multiple correlation coefficient, R², which is also called the coefficient of multiple determination.

• F test. The F test is used to test the null hypothesis that the coefficient of multiple determination in the population, R²pop, is zero. This is equivalent to testing the null hypothesis H0: β1 = β2 = β3 = ... = βk = 0. The test statistic has an F distribution with k and (n - k - 1) degrees of freedom.
Statistics Associated with Multiple Regression

• Partial F test. The significance of a partial regression coefficient, βi, of Xi may be tested using an incremental F statistic. The incremental F statistic is based on the increment in the explained sum of squares resulting from the addition of the independent variable Xi to the regression equation after all the other independent variables have been included.

• Partial regression coefficient. The partial regression coefficient, b1, denotes the change in the predicted value, Ŷ, per unit change in X1 when the other independent variables, X2 to Xk, are held constant.
Conducting Multiple Regression Analysis
Partial Regression Coefficients

To understand the meaning of a partial regression coefficient,


let us consider a case in which there are two independent
variables, so that:

Ŷ = a + b1X1 + b2X2

 First, note that the relative magnitude of the partial


regression coefficient of an independent variable is, in
general, different from that of its bivariate regression
coefficient.
 The interpretation of the partial regression coefficient, b1,
is that it represents the expected change in Y when X1 is
changed by one unit but X2 is held constant or otherwise
controlled. Likewise, b2 represents the expected change in
Y for a unit change in X2, when X1 is held constant. Thus,
calling b1 and b2 partial regression coefficients is
appropriate.
Conducting Multiple Regression Analysis
Partial Regression Coefficients

• It can also be seen that the combined effects of X1 and X2 on Y


are additive. In other words, if X1 and X2 are each changed by
one unit, the expected change in Y would be (b1+b2).

• Suppose one were to remove the effect of X2 from X1. This could be done by running a regression of X1 on X2. In other words, one would estimate the equation X̂1 = a + bX2 and calculate the residual Xr = (X1 - X̂1). The partial regression coefficient, b1, is then equal to the bivariate regression coefficient, br, obtained from the equation Ŷ = a + brXr.
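
A minimal sketch of this equivalence (assuming Python with numpy and statsmodels; the data are synthetic) is given below: the partial coefficient b1 from the multiple regression matches the slope br obtained by regressing Y on the residual Xr.

# Sketch (synthetic data): the partial regression coefficient b1 equals the
# bivariate coefficient br from regressing Y on the residual of X1 given X2.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
x2 = rng.normal(size=100)
x1 = 0.6 * x2 + rng.normal(size=100)                      # X1 correlated with X2
y = 1.0 + 2.0 * x1 - 1.5 * x2 + rng.normal(size=100)

# Multiple regression of Y on X1 and X2: take the coefficient on X1
b1 = sm.OLS(y, sm.add_constant(np.column_stack([x1, x2]))).fit().params[1]

# Remove the effect of X2 from X1, then regress Y on the residual Xr
x1_hat = sm.OLS(x1, sm.add_constant(x2)).fit().fittedvalues
xr = x1 - x1_hat
br = sm.OLS(y, sm.add_constant(xr)).fit().params[1]

print(np.isclose(b1, br))                                  # True: the coefficients coincide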
Conducting Multiple Regression Analysis
Partial Regression Coefficients

• Extension to the case of k variables is straightforward. The partial


regression coefficient, b1, represents the expected change in Y when
X1 is changed by one unit and X2 through Xk are held constant. It can
also be interpreted as the bivariate regression coefficient, b, for the
regression of Y on the residuals of X1, when the effect of X2 through Xk
has been removed from X1.
• The relationship of the standardized to the non-standardized coefficients remains the same as before:

B1 = b1 (Sx1/Sy)
...
Bk = bk (Sxk/Sy)
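
As a quick check, the sketch below (synthetic data; numpy and statsmodels assumed) computes the beta weights both from the formula Bk = bk(Sxk/Sy) and by refitting the regression on standardized (z-scored) data; the two agree.

# Sketch (synthetic data): standardized (beta) coefficients two ways.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
X = rng.normal(size=(50, 2))
y = 0.5 + 1.2 * X[:, 0] - 0.7 * X[:, 1] + rng.normal(size=50)

b = sm.OLS(y, sm.add_constant(X)).fit().params[1:]            # non-standardized b1, b2
betas_formula = b * X.std(axis=0, ddof=1) / y.std(ddof=1)     # Bk = bk * (Sxk / Sy)

Xz = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)             # z-score the predictors
yz = (y - y.mean()) / y.std(ddof=1)                           # z-score the criterion
betas_refit = sm.OLS(yz, sm.add_constant(Xz)).fit().params[1:]

print(np.allclose(betas_formula, betas_refit))                # True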

The estimated regression equation is:

Ŷ = 0.33732 + 0.48108 X1 + 0.28865 X2

or

Attitude = 0.33732 + 0.48108 (Duration) + 0.28865 (Importance)


Multiple Regression

Table 17.3

Multiple R          0.97210
R²                  0.94498
Adjusted R²         0.93276
Standard Error      0.85974

ANALYSIS OF VARIANCE
              df    Sum of Squares    Mean Square
Regression     2       114.26425        57.13213
Residual       9         6.65241         0.73916

F = 77.29364    Significance of F = 0.0000

VARIABLES IN THE EQUATION
Variable       b         SEb       Beta (β)      T      Significance of T
IMPORTANCE   0.28865   0.08608    0.31382      3.353        0.0085
DURATION     0.48108   0.05895    0.76363      8.160        0.0000
(Constant)   0.33732   0.56736                 0.595        0.5668
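
Output of this kind can be produced by any standard regression routine. The sketch below (Python with pandas and statsmodels assumed; the data are synthetic placeholders, not the 12 observations behind Table 17.3) shows how such a summary might be generated.

# Sketch (synthetic placeholder data): regression output in the style of Table 17.3.
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(3)
df = pd.DataFrame({"duration": rng.uniform(2, 18, 12),
                   "importance": rng.uniform(1, 7, 12)})
df["attitude"] = 0.3 + 0.5 * df["duration"] + 0.3 * df["importance"] + rng.normal(0, 1, 12)

fit = smf.ols("attitude ~ duration + importance", data=df).fit()
print(fit.summary())     # R2, adjusted R2, F and its significance, b, SEb, t, p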
Multicollinearity

• Multicollinearity arises when intercorrelations among the predictors are very high.

• Multicollinearity can result in several problems, including:
  • The partial regression coefficients may not be estimated precisely. The standard errors are likely to be high.
  • The magnitudes, as well as the signs, of the partial regression coefficients may change from sample to sample.
  • It becomes difficult to assess the relative importance of the independent variables in explaining the variation in the dependent variable.
  • Predictor variables may be incorrectly included or removed in stepwise regression.


Multicollinearity

• A simple procedure for adjusting for multicollinearity consists


of using only one of the variables in a highly correlated set of
variables.

• Alternatively, the set of independent variables can be


transformed into a new set of predictors that are mutually
independent by using techniques such as principal
components analysis.

• More specialized techniques, such as ridge regression and


latent root regression, can also be used.
Multicollinearity

• Tolerance: the proportion of variance in an independent variable (IV) that is not accounted for by the other IVs: Tolerance = 1 - R².

• Tolerance values of .10-.20 are problematic, as they mean that only 10-20% of the variance in that IV is not explained by the other IVs.

• Variance inflation factor (VIF): 1/(1 - R²), i.e., the reciprocal of Tolerance.

• So if Tolerance is .10, the VIF is 10. A VIF of 10 for a predictor variable means that the variance of its coefficient is 10 times as large as it would be if that predictor were uncorrelated with the other predictors (its standard error is √10 ≈ 3.2 times as large).

• Hence VIF should ideally be ≤ 3; 3-5 is not good; 5-10 is bad; > 10 is very bad.
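
A sketch of how tolerance and VIF might be computed for each predictor (Python with statsmodels assumed; synthetic, deliberately collinear data):

# Sketch (synthetic, deliberately collinear data): tolerance and VIF per predictor.
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

rng = np.random.default_rng(4)
x1 = rng.normal(size=200)
x2 = 0.9 * x1 + 0.1 * rng.normal(size=200)        # nearly collinear with x1
x3 = rng.normal(size=200)

X = sm.add_constant(np.column_stack([x1, x2, x3]))
for i, name in enumerate(["x1", "x2", "x3"], start=1):   # index 0 is the constant
    vif = variance_inflation_factor(X, i)
    print(f"{name}: VIF = {vif:.1f}, tolerance = {1 / vif:.3f}")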


Dummy Variables

• There are situations where the dependent variable may be influenced by qualitative variables such as gender, marital status, profession, geographical region, and religion.
• To quantify these qualitative variables, dummy variables are used.
• The number of dummy variables in a regression model equals the number of categories of the qualitative variable less one.
• A dummy variable may take any two values, such as zero and one; ten and eleven; or any other such pair, although 0/1 coding is conventional.
• Dummy variables can also be used to examine the moderating effect of one variable on another.
Example of a Dummy Variable Regression

Suppose the starting salary of a college lecturer is influenced not only by years of teaching experience but also by gender. Therefore, the model could be specified as:
Y = f (X, D)
Where,
Y = Starting salary of a college lecturer, in thousands per month
X = Number of years of work experience
D is a dummy variable that takes the values
D = 1 (if the respondent is male)
  = 0 (if the respondent is female)

The model could be written as,


Y=α+βX+γD+U
Example of a Dummy Variable Regression

This can be estimated using ordinary least squares (OLS). Suppose the estimated regression equation is:

Ŷ = α̂ + β̂X + γ̂D

Substituting the two values of D gives one equation for male lecturers (D = 1) and one for female lecturers (D = 0):

Ŷ = (α̂ + γ̂) + β̂X    (males)
Ŷ = α̂ + β̂X           (females)

The above two equations differ by the amount γ̂, which can be positive or negative. If γ̂ is positive, it implies that the average salary of a male lecturer is higher than that of a female lecturer by the amount γ̂, keeping the number of years of experience constant.
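
A minimal sketch of the salary example (Python with pandas and statsmodels assumed; the data are synthetic, not from any actual survey):

# Sketch (synthetic data): starting salary on experience plus a gender dummy D.
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(5)
n = 100
df = pd.DataFrame({"experience": rng.uniform(0, 20, n),
                   "male": rng.integers(0, 2, n)})        # D = 1 if male, 0 if female
df["salary"] = 30 + 1.5 * df["experience"] + 4.0 * df["male"] + rng.normal(0, 3, n)

fit = smf.ols("salary ~ experience + male", data=df).fit()
print(fit.params["male"])    # estimate of gamma-hat: male-female gap at equal experience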
Moderator Variable

Consider Y = a + b1X + b2Z + b3XZ

• If b3 is insignificant and b2 is significant, then Z is not a moderator variable but simply an independent predictor variable.
• If b2 is insignificant and b3 is significant, then Z is a PURE moderator variable.
• If both b2 and b3 are significant, then Z is a QUASI moderator variable.
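
A sketch of how such a moderation test might be run (Python with statsmodels assumed; synthetic data constructed so that Z acts as a pure moderator):

# Sketch (synthetic data): testing whether Z moderates the effect of X on Y.
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(6)
n = 200
df = pd.DataFrame({"x": rng.normal(size=n), "z": rng.normal(size=n)})
df["y"] = 1.0 + 0.8 * df["x"] + 0.5 * df["x"] * df["z"] + rng.normal(size=n)   # Z built in as a pure moderator

fit = smf.ols("y ~ x + z + x:z", data=df).fit()
print(fit.pvalues[["z", "x:z"]])   # b2 insignificant, b3 significant -> Z is a PURE moderator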
