One-Sample T-Test: Example 1 Merchandise Shipment Times
One-Sample T-Test: Example 1 Merchandise Shipment Times
One-Sample t-Test
Example 1 Merchandise Shipment Times
Problem A mail-order catalog company ships orders from a large central warehouse. The warehouse must ensure that the orders are shipped within a specific time frame. If orders reach the shipping bay too quickly, goods may get damaged because of the back-log. Conversely, if orders reach the shipping bay too slowly, they may not be shipped on time, leading to customer dissatisfaction. Data collection The warehouse processes orders 24 hours a day. In the course of 3 days, the deviation of the processing time from the target time is recorded for each order. Deviation = Processing time Target time Tools
Run Chart Graphical Summary 1-Sample t
Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0
TRMEM14W.SQBS
1-4
One-Sample t-Test
Hypothesis testing
What is a hypothesis test A hypothesis test uses sample data to test a hypothesis about the population from which the sample was taken. The onesample t-test is one of many procedures available for hypothesis testing in MINITAB. For example, to test whether the mean duration of a transaction is equal to the desired target, measure the duration of several transactions and use the mean of these samples to estimate the mean for all transactions. This is an example of statistical inference, which is using information about a sample to make an inference about a population. When to use a hypothesis test Use a hypothesis test to make inferences about one or more populations when sample data are available. Why use a hypothesis test Hypothesis testing can help answer questions such as:
Are turn-around times meeting or exceeding customer expectations? Is the service at one branch better than the service at another? On average, is a call center meeting the target time to answer customer questions? Is the mean billing cycle time shorter at the branch with a new billing process?
For example,
Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0
TRMEM14W.SQBS
1-5
One-Sample t-Test
One-sample t-test
What is a one-sample t-test A one-sample t-test helps determine whether (the population mean) is equal to a hypothesized value (the test mean). The test uses the standard deviation of the sample to estimate (the population standard deviation). If the difference between the sample mean and the test mean is large relative to the variability of the sample mean, then is unlikely to be equal to the test mean. When to use a one-sample t-test Use a one-sample t-test when continuous data are available from a single random sample. The test assumes the population is normally distributed. However, it is fairly robust to violations of this assumption, provided the observations are collected randomly and the data are continuous, unimodal, and reasonably symmetric (see [1]). Why use a one-sample t-test A one-sample t-test can help answer questions such as:
Is the mean transaction time on target? Does customer service meet expectations? On average, is a call center meeting the target time to answer customer questions? Is the billing cycle time for a new process shorter than the current cycle time of 20 days?
For example,
Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0
TRMEM14W.SQBS
1-6
One-Sample t-Test
4 Click OK.
Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0
TRMEM14W.SQBS
1-7
One-Sample t-Test
Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0
TRMEM14W.SQBS
1-8
One-Sample t-Test
3 Click OK.
Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0
TRMEM14W.SQBS
1-9
One-Sample t-Test
H0: Data are from a normally distributed population. H1: Data are not from a normally distributed population.
The p-value from the Anderson-Darling test (0.986) assesses the probability that the data are from a normally distributed population. Using an of 0.05, evidence suggests that the data are from a normal population.
Note
When the data are not normally distributed, you can transform the data using Box-Cox transformation or use another procedure such as nonparameteric tests (1-sample sign test).
Conclusion Based on the test, you can assume that the data do not deviate substantially from a normal distribution. Whats next Proceed with the t-test.
Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0
TRMEM14W.SQBS
1-10
One-Sample t-Test
1-Sample t
1 Choose Stat Basic Statistics 1-Sample t. 2 Complete the dialog box as shown below.
For this example, the two-tailed test answers the question of interest: whether the mean processing time is on target or not. These alternatives can be stated as:
3 Click OK.
The null hypothesis (H0): is equal to 0 seconds. The alternative hypothesis (H1): is not equal to 0 seconds.
Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0
TRMEM14W.SQBS
1-11
One-Sample t-Test
Variable N Mean StDev SE Mean 95% CI T P Deviation 164 33.4549 25.4315 1.9859 (29.5335, 37.3762) 16.85 0.000
Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0
TRMEM14W.SQBS
1-12
One-Sample t-Test
If P is less than or equal to , reject H0. If P is greater than , fail to reject H0. (Technically, you never accept H0, you simply fail to reject it.)
Variable N Mean StDev SE Mean 95% CI T P Deviation 164 33.4549 25.4315 1.9859 (29.5335, 37.3762) 16.85 0.000
A typical value for is 0.05, but you can choose higher or lower values depending on the sensitivity required for the test and the consequences of incorrectly rejecting the null hypothesis. P-value The t-test results indicate that the sample mean is not equal to 0 seconds. Thus, the test answers the question, If is equal to 0 seconds, how likely is it to see a sample mean this different (or even more different) from 0 seconds? The answer is given as a probability value (P), which for this test is equal to 0.000. Because this value is less than (0.05), you can reject the null hypothesis and conclude that the average deviation is different from 0.
Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0
TRMEM14W.SQBS
1-13
One-Sample t-Test
Confidence intervals
What is a confidence interval A confidence interval is a range of likely values for a population parameter (such as ) that is based on sample data. For example, with a 95% confidence interval for , you can be 95% confident that the interval contains . When to use a confidence interval Use a confidence interval to make inferences about one or more populations from sample data or to quantify the precision of your estimate of . Why use a confidence interval Confidence intervals can help answer many of the same questions as hypothesis testing:
Is on target? How much error exists in an estimate of ? How low or high might be? Is the mean transaction time longer than 30 seconds? Is the mean daily revenue higher than $6,000?
For example,
Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0
TRMEM14W.SQBS
1-14
One-Sample t-Test
Variable N Mean StDev SE Mean 95% CI T P Deviation 164 33.4549 25.4315 1.9859 (29.5335, 37.3762) 16.85 0.000
Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0
TRMEM14W.SQBS
1-15
One-Sample t-Test
Final considerations
Summary and conclusions Based on the sample data, you can reject the null hypothesis at the 0.05 -level. The mean of the deviation is significantly different from 0 minutes. Orders are taking longer than expected, on average. The mean deviation from the target is estimated to be between 29 minutes 32 seconds and 37 minutes 23 seconds, with 95% confidence.
Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0
TRMEM14W.SQBS
1-16
One-Sample t-Test
Final considerations
Additional considerations Hypotheses A hypothesis test always starts with two opposing hypotheses. The null hypothesis (H0): Assumptions Each hypothesis test is based on one or more assumptions about the data being analyzed. If these assumptions are not met, the conclusions may not be correct. When using a one-sample t-test:
Usually states that some property of a population (such as the mean) is not different from a specified value or from that of another population Is assumed to be true until sufficient evidence indicates the contrary Is never proven true; you simply fail to disprove it States that the null hypothesis is wrong Can also specify the direction of the difference
The sample must be random Sample data must be continuous Sample data should be normally distributed
The t-test procedure is fairly robust to violations of the normality assumption, provided that observations are collected randomly and the data are continuous, unimodal, and reasonably symmetric (see [1]). Confidence interval The confidence interval provides a likely range of values for (or other population parameters). You can conduct a two-tailed hypothesis test (alternative hypothesis of ) using a confidence interval. For example, if the test value is not within a 95% confidence interval, you can reject H0 at the 0.05 -level. Likewise, if you construct a 99% confidence interval and it does not include the test mean, you can reject H0 at the 0.01 -level.
Increasing increases the chance of detecting a difference, but it also increases the chance of rejecting H0 when it is actually true (a Type I error). Decreasing decreases the chance of making a Type I error, but also decreases the chance of correctly detecting a difference.
Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0
TRMEM14W.SQBS
1-17