SBE11 CH 13
SBE11 CH 13
John
Loucks
St. Edward’s
University
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
1
or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 13, Part A
Experimental Design and Analysis of
Variance
An Introduction to Experimental
Design
and Analysis of Variance
Analysis of Variance and
the Completely Randomized Design
Multiple Comparison
Procedures
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
2
or duplicated, or posted to a publicly accessible website, in whole or in part.
An Introduction to Experimental Design
and Analysis of Variance
Statistical studies can be classified as being
either experimental or observational.
In an experimental study, one or more factors
are controlled so that data can be obtained
about how the factors influence the variables
of interest.
In an observational study, no attempt is made
to control the factors.
Cause-and-effect relationships are easier to
establish in experimental studies than in
observational studies.
Analysis of variance (ANOVA) can be used to
analyze the data obtained from experimental or
observational studies.
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
3
or duplicated, or posted to a publicly accessible website, in whole or in part.
An Introduction to Experimental Design
and Analysis of Variance
In this chapter three types of experimental
designs are introduced.
• a completely randomized
• design
a randomized block design
• a factorial experiment
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
4
or duplicated, or posted to a publicly accessible website, in whole or in part.
An Introduction to Experimental Design
and Analysis of Variance
A factor is a variable that the experimenter
has selected for investigation.
A treatment is a level of a factor.
Experimental units are the objects of interest
in the experiment.
A completely randomized design is an
experimental design in which the treatments
are randomly assigned to the experimental
units.
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
5
or duplicated, or posted to a publicly accessible website, in whole or in part.
Analysis of Variance: A Conceptual
Overview
Analysis
Analysis of
of Variance
Variance (ANOVA)
(ANOVA) can
can be
be used
used to
to test
test
for
for the
the equality
equality of
of three
three or
or more
more population
population means.
means.
Data
Data obtained
obtained from
from observational
observational or
or experimental
experimental
studies
studies can
can be
be used
used for
for the
the analysis.
analysis.
We
We want
want to
to use
use the
the sample
sample results
results to
to test
test the
the
following
following hypotheses:
hypotheses:
H0: 1=2=3=. . . = k
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
6
or duplicated, or posted to a publicly accessible website, in whole or in part.
Analysis of Variance: A Conceptual
Overview
H0: 1=2=3=. . . = k
If
If H
H00 is
is rejected,
rejected, we
we cannot
cannot conclude
conclude that
that all
all
population
population means
means are
are different.
different.
Rejecting
Rejecting H
H00 means
means that
that at
at least
least two
two population
population
means
means have
have different
different values.
values.
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
7
or duplicated, or posted to a publicly accessible website, in whole or in part.
Analysis of Variance: A Conceptual
Overview
Assumptions for Analysis of
Variance
For
For each
each population,
population, the
the response
response (dependent)
(dependent)
variable
variable is
is normally
normally distributed.
distributed.
The
The variance
variance of
of the
the response
response variable, denoted 22,,
variable, denoted
is
is the
the same
same for
for all
all of
of the
the populations.
populations.
The
The observations
observations must
must be
be independent.
independent.
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
8
or duplicated, or posted to a publicly accessible website, in whole or in part.
Analysis of Variance: A Conceptual
Overview
x
Sampling Distribution of Given H0 is True
x2 x1 x3
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
9
or duplicated, or posted to a publicly accessible website, in whole or in part.
Analysis of Variance: A Conceptual
Overview
x
Sampling Distribution of Given H0 is False
x3 3 x1 1 2 x2
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
10
or duplicated, or posted to a publicly accessible website, in whole or in part.
Analysis of Variance and
the Completely Randomized Design
Between-Treatments Estimate of Population
Variance
Within-Treatments Estimate of Population
Variance the Variance Estimates: The F
Comparing
Test
ANOVA Table
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
11
or duplicated, or posted to a publicly accessible website, in whole or in part.
Between-Treatments Estimate
of Population Variance s 2
The estimate of 2 based on the variation of
the
sample means is called the mean square due
to
treatments and isk denoted2by MSTR.
j 1
nj (xj x)
MSTR
k 1
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
12
or duplicated, or posted to a publicly accessible website, in whole or in part.
Within-Treatments Estimate
of Population Variance s 2
The estimate of 2 based on the variation of
the sample observations within each sample is
called the mean square error and is denoted
by MSE.
k
2
(n j 1)sj
j 1
MSE
nT k
Numerator is
Denominator is the called
degrees of the sum of
freedom squares
associated with due to error (SSE)
SSE
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
13
or duplicated, or posted to a publicly accessible website, in whole or in part.
Comparing the Variance Estimates: The F
Test
If the null hypothesis is true and the ANOVA
assumptions are valid, the sampling distribution of
MSTR/MSE is an F distribution with MSTR d.f.
equal to k - 1 and MSE d.f. equal to nT - k.
If the means of the k populations are not equal, the
value of MSTR/MSE will be inflated because MSTR
overestimates 2.
Hence, we will reject H0 if the resulting value of
MSTR/MSE appears to be too large to have been
selected at random from the appropriate F
distribution.
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
14
or duplicated, or posted to a publicly accessible website, in whole or in part.
Comparing the Variance Estimates: The F
Test
Sampling Distribution of
MSTR/MSE
Sampling Distribution
of MSTR/MSE
Reject H0
Do Not Reject H0 a
MSTR/MSE
F
Critical Value
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
15
or duplicated, or posted to a publicly accessible website, in whole or in part.
ANOVA Table
for a Completely Randomized Design
Total SST nT - 1
SST’s degrees of
freedom
SST is partitioned
(d.f.) are partitioned
into SSTR and into
SSE. SSTR’s d.f. and SSE’s
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
d.f.
or duplicated, or posted to a publicly accessible website, in whole or in part.
Slide
16
ANOVA Table
for a Completely Randomized Design
SST
SST divided
divided by
by its
its degrees
degrees of
of freedom
freedom nnTT –– 1
1 is
is the
the
overall
overall sample
sample variance
variance that
that would
would be
be obtained
obtained if if we
we
treated
treated the
the entire
entire set
set of
of observations
observations as
as one
one data
data set.
set
With
With the
the entire
entire data
data set
set as
as one
one sample,
sample, the
the formula
formula
for
for computing
computing thethe total
total sum
sum of
of squares,
squares, SST,
SST, is:
is:
k nj
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
17
or duplicated, or posted to a publicly accessible website, in whole or in part.
ANOVA Table
for a Completely Randomized Design
ANOVA
ANOVA cancan be
be viewed
viewed as
as the
the process
process of
of partitioning
partitioning
the
the total
total sum
sum ofof squares
squares and
and the
the degrees
degrees of
of freedom
freedom
into
into their
their corresponding
corresponding sources:
sources: treatments
treatments andand erro
erro
Dividing
Dividing the
the sum
sum of
of squares
squares byby the
the appropriate
appropriate
degrees
degrees ofof freedom
freedom provides
provides the
the variance
variance estimates
estimates
and
and the
the FF value
value used
used to
to test
test the
the hypothesis
hypothesis of
of equal
equal
population
population means.
means.
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
18
or duplicated, or posted to a publicly accessible website, in whole or in part.
Test for the Equality of k Population
Means
Hypothese
s
H0: 1=2=3=. . . = k
Ha: Not all population means are equal
Test Statistic
F = MSTR/MSE
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
19
or duplicated, or posted to a publicly accessible website, in whole or in part.
Test for the Equality of k Population
Means
Rejection
Rule
p-value Approach: Reject H0 if p-value < a
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
20
or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for the Equality of k Population
Means:
A Completely Randomized Design
Example: AutoShine, Inc.
AutoShine, Inc. is considering marketing a
long-
lasting car wax. Three different waxes (Type 1,
Type 2,
In order to test the durability of these waxes,
and
5 newType 3) have been developed.
cars were waxed with Type 1, 5 with Type 2, and
5
with Type 3. Each car was then repeatedly run
through an automatic carwash until the wax
coating
showed signs of deterioration.
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
21
or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for the Equality of k Population
Means:
A Completely Randomized Design
Example: AutoShine, Inc.
The number of times each car went
through the
carwash before its wax deteriorated is shown
on the
next slide. AutoShine, Inc. must decide which
waxFactor . . . Car wax
toTreatments
market. Are. the threeI, waxes
. . Type Type 2,equally
Type 3
effective?
Experimental units . . . Cars
Response variable . . . Number of washes
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
22
or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for the Equality of k Population
Means:
A Completely Randomized Design
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
23
or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for the Equality of k Population
Means:
A Completely Randomized Design
Hypotheses
H0: 1=2=3
Ha: Not all the means are equal
where:
1 = mean number of washes using Type 1 wax
2 = mean number of washes using Type 2 wax
3 = mean number of washes using Type 3 wax
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
24
or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for the Equality of k Population
Means:
A Completely Randomized Design
Mean Square Between Treatments
Because the sample sizes are all
equal:
x (x1 x2 x3)/=3(29 + 30.4 + 30)/3 = 29.8
SSTR = 5(29–29.8)2 + 5(30.4–29.8)2 + 5(30–29.8)2 = 5.2
MSTR = 5.2/(3 - 1) = 2.6
Mean Square Error
SSE = 4(2.5) + 4(3.3) + 4(2.5) = 33.2
MSE = 33.2/(15 - 3) = 2.77
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
25
or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for the Equality of k Population
Means:
A Completely Randomized Design
Rejection Rule
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
26
or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for the Equality of k Population
Means:
A Completely Randomized Design
Test Statistic
F = MSTR/MSE = 2.60/2.77 = .939
Conclusion
The p-value is greater than .10, where F = 2.81.
(Excel provides a p-value of .42.)
Therefore, we cannot reject H0.
There is insufficient evidence to conclude that
the mean number of washes for the three wax
types are not all the same.
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
27
or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for the Equality of k Population
Means:
A Completely Randomized Design
ANOVA Table
Total 38.4 14
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
28
or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for the Equality of k Population
Means:
An Observational Study
Example: Reed
Manufacturing
Janet Reed would like to know if there is any
significant difference in the mean number of
hours
worked per week for the department managers
at her
three manufacturing plants (in Buffalo,
Pittsburgh,
and Detroit).
An F test will be conducted using a = .05.
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
29
or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for the Equality of k Population
Means:
An Observational Study
Example: Reed
Manufacturing
A simple random sample of five managers
from
each of the three plants was taken and the
number of
hours worked by each manager in the previous
week . . . Manufacturing plant
Factor
is shown on. the
Treatments next slide.
. . Buffalo, Pittsburgh, Detroit
Experimental units . . . Managers
Response variable . . . Number of hours worked
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
30
or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for the Equality of k Population
Means:
An Observational Study
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
31
or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for the Equality of k Population
Means:
An Observational Study
p -Value and Critical Value Approaches
1. Develop the hypotheses.
H0: 1= 2= 3
Ha: Not all the means are equal
where:
1 = mean number of hours worked per
week by the managers at Plant 1
2 = mean number of hours worked per
week by the managers at Plant 2
3 = mean number of hours worked per
week by the managers at Plant 3
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
32
or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for the Equality of k Population
Means:
An Observational Study
p -Value and Critical Value Approaches
a = .05
2. Specify the level of significance.
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
33
or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for the Equality of k Population
Means:
An Observational Study
p -Value and Critical Value Approaches
3. Compute the value of the test statistic.
(con’t.)
Mean Square Due to Error
SSE = 4(26.0) + 4(26.5) + 4(24.5) = 308
MSE = 308/(15 - 3) = 25.667
F = MSTR/MSE = 245/25.667 = 9.55
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
34
or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for the Equality of k Population
Means:
An Observational Study
ANOVA Table
Total 798 14
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
35
or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for the Equality of k Population
Means:
An Observational Study
p –Value Approach
4. Compute the p –value.
© 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
38
or duplicated, or posted to a publicly accessible website, in whole or in part.