Business Research Methods S16
Business Research Methods S16
Where N
SS y (Yi Y 2 )
i 1
c
SS x n(Y j Y ) 2
j 1
c n
SSerror (Yij Y j ) 2
j i
Yi = individual observation
The value of 2
varies between 0 and 1.
CONDUCTING ONE-WAY ANALYSIS OF VARIANCE
TEST SIGNIFICANCE (1 OF 2)
In one-way analysis of variance, the interest lies in testing the null
hypothesis that the category means are equal in the population.
H0: µ1 = µ2 = µ3 = ........... = µc
Under the null hypothesis, SSx and SSerror come from the same source of
variation. In other words, the estimate of the population variance of Y,
S y2 SS x / (c 1)
Mean square due to X
MS x
or
S y2 SSerror / ( N c)
Mean square due to error
MSerror
CONDUCTING ONE-WAY ANALYSIS OF VARIANCE
TEST SIGNIFICANCE (2 OF 2)
The null hypothesis may be tested by the F statistic
based on the ratio between these two estimates:
SS x / (c 1) MS x
F
SSerror / ( N c ) MSerror
as follows:
185.867 = 106.067 +79.80
SS x / (c 1) MS X
F
SSerror / ( N c) MS error
106.067 / (3 1)
F
79.800 / (30 3)
17.944
ILLUSTRATIVE APPLICATIONS OF ONE-WAY
ANALYSIS OF VARIANCE (7 OF 7)
From Table in the Statistical Appendix we see that for 2 and 27 degrees of freedom, the
critical value of F is 3.35 for = 0.05. Because the calculated value of F is greater than the
critical value, we reject the null hypothesis.
We now illustrate the analysis of variance procedure using a computer program. The
results of conducting the same analysis by computer are presented in Table 16.4.
ONE-WAY ANOVA: EFFECT OF IN-STORE PROMOTION ON
STORE SALES
Table 16.4 One-Way ANOVA: Effect of In-Store Promotion on Store
Sales
Where Co-variance of X and Y is given by
Standard error of estimate. This statistic, SEE, is the standard deviation of the actual Y
values from the predicted Ŷ values.
Standard error. The standard deviation of b, SEb, is called the standard error.
CONDUCTING BIVARIATE REGRESSION ANALYSIS
FORMULATE THE BIVARIATE REGRESSION MODEL
In the bivariate regression model, the general form of a straight line is: Y = β0 + β1 X
where
Y = dependent or criterion variable
X = independent or predictor variable
β0 = intercept of the line
β1 = slope of the line
The regression procedure adds an error term to account for the probabilistic or stochastic nature of the
relationship:
Y i = β 0 + β 1 Xi + e i
where ei is the error term associated with the i th observation.
PLOT OF ATTITUDE WITH DURATION
Plot of Attitude with Duration
WHICH STRAIGHT LINE IS BEST?
Which Straight Line Is Best?
BIVARIATE REGRESSION (1 OF 2)
CONDUCTING BIVARIATE REGRESSION ANALYSIS
ESTIMATE THE PARAMETERS (1 OF 3)
𝑌 = 𝛽0 + 𝛽1 𝑋
In most cases,β0 and β1 are unknown and are estimated from the sample
observations using the equation
= a + b xi
where is the estimated or predicted value of Yi, and a and b are estimators of β0
and β1, respectively.
ASSUMPTIONS
Multiple R 0.93608
R2 0.87624
Adjusted R2 0.86387
Standard error 1.22329
As before, the coefficient a represents the intercept, but the b's are now the partial regression
coefficients.
STATISTICS ASSOCIATED WITH MULTIPLE REGRESSION (1 OF 2)
Multiple R 0.97210
R2 0.94498
Adjusted R2 0.93276
Standard error 0.85974
Analysis of
Blank Blank Variance Blank
Blank df Sum of Squares Mean Square
Regression 2 114.26425 57.13213
Residual 9 6.65241 0.73916
F = 77.29364 Significance of F = Blank Blank
blank blank 0.0000
Variables in the blank blank blank
Equation
Variable b SEB Beta (B) t Significance
of t
Importanc 0.2886 0.08608 0.31382 3.353 0.0085
e 5
Duration 0.4810 0.05895 0.76363 8.160 0.000
8
(Constant 0.3373 0.56736 blank 0.595 0.5668
MORE ON EXAMINING RESIDUALS
A residual is the difference between the observed value of Yi and the value predicted by
the regression equation Ŷi.
Scattergrams of the residuals, in which the residuals are plotted against the predicted
values, Ŷi, time, or predictor variables, provide useful insights in examining the
appropriateness of the underlying assumptions and regression model fit.
The assumption of a normally distributed error term can be examined by constructing a
histogram of the residuals.
The assumption of constant variance of the error term can be examined by plotting the
residuals against the predicted values of the dependent variable, Ŷi.
EXAMINING RESIDUALS