Making Decisions For The Difference Between Two Independent Population Means
Making Decisions For The Difference Between Two Independent Population Means
HYPOTHESES
H o : A B or A B 0
H a : A B or A B 0 (two - tailed)
or
H a : A B or A B 0 (upper - tailed)
or
H a : A B or A B 0 (lower - tailed)
The appropriate test procedure depends upon whether or not we can assume that the
population variances are equal or not. In the example covered in this handout we will
examine methods used to determine whether or not the population variances can be
assumed to be equal for our study.
Test Statistic
x A xB x A xB
t ~ t distribution df n A n B 2
SE ( x A x B ) 1 1
s 2p
n A nB
where,
2 2
2 (n A 1) s A2 (nY 1) s B2 2 s A sB
sp or sp when n A n B
(n A 1) (n B 1) 2
2
s p = Pooled-estimate of the common variance
1
2 2
Assuming Unequal Population Variances/Standard Deviations ( A B )
Test Statistic
x A xB x A xB
t
SE ( x A x B ) sA
2 2
s B ~ t-distribution df = min( n A 1, n B 1 )
This is the conservative approach. As an alternative we
n A nB
could use Welch’s df formula, but for Welch’s we will
let the computer do the dirty work!
Goal: To be able to complete (and interpret the output from) a two-sample t-test in JMP.
Question of Do men and women have the same normal body temperature? Putting this statement
Interest: into a statement involving parameters that can be tested:
HO: F = M
HA: F ≠ M
F mean body temperature for
or females.
M mean body temperature for males.
HO: F - M= 0
HA: F - M ≠ 0
Intuitive Decision
In order to determine whether or not the null or alternative hypothesis is true, you could
review the summary statistics for the variable you are interested in testing across the two
groups. Remember, these summary statistics and/or graphs are for the observations you
2
sampled, and to make decisions about all observations of interest, we must apply some
inferential technique (i.e. hypothesis tests or confidence intervals)
One of the best graphical displays for this situation is the side-by-side boxplots. To get
side-by-side boxplots, select Analyze > Fit Y by X. Place Gender in the X box and
Temperature in the Y box. Place the mean diamonds on the boxplots and jitter the
points. The more separation there is in the mean diamonds, the more likely we are to
reject the null hypothesis (i.e data tends to support the alternative hypothesis).
Summary Statistics
x F 98.39
x M 98.10
s F .743
s M .699
n F 65
n M 65
Assumptions
3. Decide whether or not we wish to assume the population variances are equal.
3
Normality appears to
be satisfied here.
JMP gives four different tests for examining the equality of population variances. To use
the results of these tests simply examine the resulting p-values. If any/all are less than .10
or .05 then worry about the assumption of equal variances and use the unequal variance t-
Test instead of the pooled t-Test.
4
Here we can see that all of the p-values exceed the 0.05 (i.e. 5%). What does this mean?
What is your conclusion about the validity of the equality of the population variances
assumption?
Several new boxes of output will appear below the graph once the appropriate option has
been selected, some of which we will not concern ourselves with. The relevant box for us
will be labeled t Test as shown below for the mean body temperature comparison.
5
Because we have concluded
that the equality of variance
assumption is reasonable for
these data we can refer to the
output for the t-Test assuming
equal variances.
x A xB x A xB
t ~ t distribution
SE ( x A x B ) 1 1
s
2
p
What is the p-value? n A nB
where,
2
2 (n A 1) s A2 (nY 1) s B2 2 s A sB
sp or sp
(n A 1) (n B 1) 2
What is your decision for the test?
Write a conclusion for your findings.
x F 98.39
x M 98.10
s F .743
s M .699
n F 65
n M 65 6
x A xB x A xB
t ~ t distribution
SE ( x A x B ) 1 1
s
2
p
n
A n B
where,
Interpretation of the CI for ( F M ) 2
2 (n A 1) s A2 (nY 1) s B2 2 s A sB
sp or sp
(n A 1) (n B 1) 2
CI for ( A B )
7
Example 2: Gender Comparisons of Drinks Per Episode
for WSU Students
Is there evidence to suggest that the average number of drinks per episode for male
drinkers is greater than that for female drinkers? Using the WSU student survey data in
the file STAT 110 Survey we will examine this question.
Analysis in JMP
Using Analyze > Fit Y by X with Y = Howmuch, which is the number of drinks per
episode, and X = Gender we obtain the following. Select Oneway Analysis... > Normal
Quantile Plot to assess normality of the response for both groups.
8
Both distributions
are skewed right,
however our
sample sizes here
are quite large so
normality is less
critical.
Can we assume the population variances are equal? Select Oneway Analysis >
UnEqual Variances to check this assumption. The results are shown on the following
page.
What do we conclude?
9
Using the appropriate t-Test given the variance test results we select Oneway Analysis...
> t Test.
Conclusion:
Results
Additional Notes:
10