Regression and
Multiple Regression Analysis
Regression
- a technique used for the modeling and analysis of
numerical data consisting of values of a dependent
variable (response variable) and of one or more
independent variables (explanatory variables).
It can be used for prediction (including forecasting of
time-series data), inference, hypothesis testing, and
modeling of causal relationships.
Regression concepts were first published in the early 1800s,
by Legendre in 1805 and by Gauss in 1809.
Applications
Applications of regression are numerous and occur in
almost every field, including:
- engineering,
- physical sciences,
- economics,
- management,
- life and biological sciences, and
- social sciences.
In fact, regression analysis may be the most widely used
statistical technique.
Types of Regression Models
Regression models are classified by the number of independent variables:
- One independent variable: simple regression (linear or non-linear).
- Two or more independent variables: multiple regression (linear or non-linear).
Simple linear regression model:
A regression model that involves only one independent
variable.
The form can be expressed as
Yi = β0 + β1Xi + ei,   i = 1, 2, 3, ..., n
Here, Yi = the response (dependent) variable,
Xi = the independent variable,
ei = error or disturbance term.
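As a minimal illustration of fitting this model, the two parameters can be estimated in closed form; the sketch below uses made-up numbers, not data from this document:

import numpy as np

# Hypothetical data for one independent variable
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])

# Closed-form least-squares estimates of β1 (slope) and β0 (intercept)
Sxx = np.sum((x - x.mean()) ** 2)              # corrected sum of squares of x
Sxy = np.sum((x - x.mean()) * (y - y.mean()))  # corrected cross-product
b1 = Sxy / Sxx
b0 = y.mean() - b1 * x.mean()
print(b0, b1)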
Multiple linear regression model:
A regression model that involves more than one
regressor (independent) variable.
The general form can be expressed as
Yi = β0 + β1Xi1 + β2Xi2 + … + βkXik + ei,   i = 1, 2, 3, ..., n
Here, Yi = the response (dependent) variable,
Xi1, ..., Xik = the independent (regressor) variables,
ei = error or disturbance term.
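For reference, the same model can be written compactly in matrix notation (a standard equivalence, added here for clarity):
Y = Xβ + e,
where Y is the n×1 vector of responses, X is the n×(k+1) design matrix whose first column is all ones, β = (β0, β1, ..., βk)′ is the parameter vector, and e is the n×1 vector of errors.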
Objectives
1. The general purpose of regression (multiple
regression) is to learn about the relationship between
several independent or predictor variables and a
dependent variable.
2. The specific objectives of regression are:
• To estimate the unknown parameters in the
regression model (fitting the model to the data).
• To predict or forecast the response variable; these
predictions are helpful in planning projects.
Underlying Principles
The Gaussian, standard, or classical linear
regression model (CLRM), which is the
foundation/cornerstone of most econometric theory,
rests on several assumptions:
Assumption 1: The regression model is linear in the
parameters
Assumption 2: X values are fixed in repeated sampling
Assumption 3: Zero mean value of the disturbance (error) term
Assumption 4: Homoscedasticity, i.e., equal variance of the disturbances:
Var(ei | Xi) = σ² (a constant)
Assumption 5: No autocorrelation between the
disturbances (error).
Assumption 6: Zero covariance between ei and Xi , or
Cov (ei, Xi) = 0
Assumption 7: There are no perfect linear
relationships among the
independent variables.
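As a rough illustration of how Assumptions 3-5 might be probed in practice, the sketch below inspects a vector of residuals from a fitted model; the numbers are hypothetical, and the Durbin-Watson statistic used for Assumption 5 is a standard diagnostic not named in this document:

import numpy as np

# Hypothetical residuals from a fitted regression model
e = np.array([0.3, -0.5, 0.1, 0.4, -0.2, -0.1])

print(e.mean())        # Assumption 3: sample mean of residuals should be near zero
print(e.var(ddof=1))   # Assumption 4: compare this variance across ranges of X

# Assumption 5: Durbin-Watson statistic; values near 2 suggest no autocorrelation
dw = np.sum(np.diff(e) ** 2) / np.sum(e ** 2)
print(dw)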
Methods of Estimation
Here we just name some well-known methods for
estimating the regression model:
1. The method of moments
2. The method of least squares
3. The method of maximum likelihood
The ordinary least squares (OLS) method of
estimation is the most popular one; it is widely used
because of its flexibility.
The main aim of the least squares method is to estimate the
parameters of the linear regression model by minimizing
the error sum of squares.
The Ordinary Least Squares (OLS)
Consider a multiple linear model of the form
Y = β0 + β1X1 + β2X2 + … + βkXk + e
We may write the sample regression model as follows:
yi = β0 + β1xi1 + β2xi2 + … + βkxik + ei,   i = 1, 2, ..., n
The least-squares function is
S = ∑(i=1 to n) ei² = ∑(i=1 to n) [ yi − β0 − ∑(j=1 to k) βj xij ]²
The least-squares estimators of β0, β1, ..., βk are obtained by
minimizing this function S with respect to β0, β1, ..., βk.
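A minimal sketch of this minimization in Python, using the closed-form solution of the normal equations (X′X)β̂ = X′y; the data values are hypothetical:

import numpy as np

# Hypothetical data: n = 6 observations, k = 2 regressors
x1 = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
x2 = np.array([2.0, 1.0, 4.0, 3.0, 6.0, 5.0])
y = np.array([3.1, 3.9, 7.0, 7.8, 11.2, 11.9])

# Design matrix with a leading column of ones for the intercept β0
X = np.column_stack([np.ones_like(x1), x1, x2])

# Minimizing S leads to the normal equations (X'X) β = X'y
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
print(beta_hat)   # estimates of β0, β1, β2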
Techniques for determining the model's accuracy:
i) Standard error of the coefficients
ii) t-test of the coefficients
iii) Coefficient of determination, R²
iv) Residual standard deviation
v) ANOVA for overall measures
(i) The standard error of a coefficient is represented by
se(β̂i) = √( MSres / Sxx )
MSres: residual mean square
Sxx: corrected sum of squares of the independent variable
(ii) t-test of the coefficients
• Suppose that we wish to test the hypothesis
that the slope equals a constant, say βi0.
The appropriate hypotheses are:
H0: βi = βi0
H1: βi ≠ βi0,
where we have specified a two-sided alternative.
The t statistic is defined as follows:
t0 = (β̂i − βi0) / √( MSres / Sxx ) = (β̂i − βi0) / se(β̂i)
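In multiple regression the same test uses se(β̂i) = √( MSres · Cii ), where Cii is the i-th diagonal element of (X′X)⁻¹; a minimal sketch of this standard construction (not taken verbatim from the document) is:

import numpy as np

def t_stats(X, y):
    """OLS estimates, standard errors, and t statistics for H0: beta_i = 0."""
    n, p = X.shape                           # p = k + 1 parameters incl. intercept
    XtX_inv = np.linalg.inv(X.T @ X)
    beta = XtX_inv @ X.T @ y                 # least-squares estimates
    resid = y - X @ beta
    ms_res = resid @ resid / (n - p)         # residual mean square MSres
    se = np.sqrt(ms_res * np.diag(XtX_inv))  # se(beta_i) = sqrt(MSres * Cii)
    return beta, se, beta / se               # t0 = beta_i / se(beta_i)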
iii) Coefficient of determination:
R² is a PRE (proportional-reduction-in-error) measure of association:
R² = 1 − SSres / SStot,
where SSres is the residual sum of squares and SStot is the total sum of squares.
iv) Residual standard deviation:
the standard deviation of the residuals (residuals = differences
between observed and predicted values). For a model with k
regressors it is calculated as follows:
s = √( ∑(yi − ŷi)² / (n − k − 1) )
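A minimal sketch computing both the residual standard deviation and R² from observed and fitted values (the function and argument names are illustrative):

import numpy as np

def fit_summary(y, y_hat, k):
    """Residual standard deviation and R² for a model with k regressors."""
    n = len(y)
    ss_res = np.sum((y - y_hat) ** 2)         # sum of squared residuals
    ss_tot = np.sum((y - np.mean(y)) ** 2)    # total variation in y
    resid_sd = np.sqrt(ss_res / (n - k - 1))  # residual standard deviation
    r2 = 1.0 - ss_res / ss_tot                # coefficient of determination
    return resid_sd, r2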
(v) ANOVA for overall measures
The analysis of variance table divides the total variation in
the dependent variable into two components:
1st component: variation that can be attributed to the regression
model (labeled Regression);
2nd component: variation that cannot (labeled Residual).
*If the significance level for the F-test is small (less than
0.05), then the hypothesis that there is no (linear)
relationship can be rejected, and the multiple correlation
coefficient can be called statistically significant. The F
statistic can be written as
F0 = MSr / MSres
MSr: regression mean square
MSres: residual mean square
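A minimal sketch of the ANOVA decomposition and the overall F statistic, following the standard formulas above (names are illustrative):

import numpy as np

def anova_f(y, y_hat, k):
    """Overall F statistic: regression mean square over residual mean square."""
    n = len(y)
    ss_tot = np.sum((y - np.mean(y)) ** 2)  # total sum of squares
    ss_res = np.sum((y - y_hat) ** 2)       # residual sum of squares
    ss_reg = ss_tot - ss_res                # attributed to the regression model
    ms_reg = ss_reg / k                     # regression mean square, df = k
    ms_res = ss_res / (n - k - 1)           # residual mean square, df = n - k - 1
    return ms_reg / ms_res                  # compare with F(k, n - k - 1)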
Literature on Applications of the OLS Method:
Here we have considered a multiple linear regression model in seven
variables (one response and six regressors).
The model can be written in the linear form
Y = β0 + β1X1 + β2X2 + … + β6X6 + e
Y = Overall rating of job being done by supervisor
X1 = Handles employee complaints
X2 = Does not allow special privileges
X3 = Opportunity to learn new things
X4 = Raises based on performance
X5 = Too critical of poor performance
X6 = Rate of advancing to better jobs
e = Error term
β0, β1, β2,….,β6 are the unknown parameters.
Our ultimate goal is to estimate the unknown parameters from the model.
Data Source: http://www.ilr.cornell.edu/hadi/rabe4
For estimating the model we used SPSS version 11.5. The outputs
obtained from SPSS 11.5 are given below:
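As an aside, an equivalent fit could be reproduced today in Python with statsmodels; this sketch assumes the data from the URL above have been saved locally as a CSV with columns Y, X1, ..., X6 (the filename is hypothetical):

import pandas as pd
import statsmodels.api as sm

# Hypothetical local copy of the supervisor-rating data set
data = pd.read_csv("supervisor.csv")
X = sm.add_constant(data[["X1", "X2", "X3", "X4", "X5", "X6"]])
model = sm.OLS(data["Y"], X).fit()
print(model.summary())   # coefficient table, t tests, R², and the ANOVA F statistic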
Summary of coefficients

Model        Coefficient   Std. Error   t        Sig.
(Constant)   10.787        11.589       .931     .362
X1           .613          .161         3.809    .001
X2           -.073         .136         -.538    .596
X3           .320          .169         1.901    .040
X4           .082          .221         .369     .715
X5           .038          .147         .261     .796
X6           -.217         .178         -1.218   .236
From the summary of coefficients table we see that the variables
X1 and X3 are significant compared with the other variables.
The R² value = 0.73 and the standard error of the estimate = 7.06.
This high value of R² implies that the fitted model is
appropriate for this data set.
ANOVA

Model        Sum of Squares   df   Mean Square   F        Sig.
Regression   3147.966         6    524.661       10.502   .000
Residual     1149.000         23   49.957
Total        4296.967         29
From the ANOVA table we can also conclude that the overall fit
of the model is appropriate (F = 10.502, significant at α = 0.01).
Conclusion
1. Regression lets us learn about the relationship between several
independent variables and a dependent variable.
2. Regression can estimate the unknown parameters of a
regression model.
3. It can also be used to forecast the response variable, and
these predictions are helpful in planning projects.