AP Stats Module 6 Notes
• Hypothesis Testing: a formal procedure for using observed data to decide between two
competing claims (called hypotheses). The claims are usually statements about a
parameter, like a population proportion p or the population mean μ.
• Null Hypothesis (Ho): the claim that we weigh evidence against in a significance test.
• Alternative Hypothesis (Ha or H1): the claim that we are trying to find evidence to support.
• Test Statistic: measures how far a sample statistic is from what we would expect if the null
hypothesis is true, in standard deviation units.
• P-Value: the probability of getting evidence for the alternative hypothesis Ha as strong or
stronger than the observed evidence when the null hypothesis Ho is true.
• How to calculate: either using a z/t table or technology (normalcdf() or tcdf()). Note that
invNorm() and invT() go the other way: they return a critical value from an area, not a P-value.
• Statistical Significance: when the p-value is smaller than alpha (α).
• One-sided Test: when the alternative hypothesis is either > or < the null value.
• Two-sided Test: when the alternative hypothesis is different from (≠) the null value.
• Significance Level (alpha value): the value that we use as a boundary for deciding whether
an observed result is unlikely to happen by chance alone when the null hypothesis is true. If
not given, assume α = 0.05.
The power of a test is the probability that the test will find convincing
evidence for Ha when the specific alternative value of the parameter is true.
Power = 1 - P(Type II Error)
Four ways to increase Power of a Hypothesis Test
1. Increase the sample size, n.
2. Increase the significance level, α.
3. Choose an alternative parameter value that’s further away from the null value.
4. For means, decrease the variability of the population, σ.
Relationship: As P(Type I Error) = α increases, P(Type II Error) decreases,
and Power increases.
For AP Stats, we are not required to know how to calculate Power.
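Although calculating Power is not required, a small sketch can make the four ways to increase Power concrete. The code below assumes a one-sided z test for a mean (Ho: μ = μ0 versus Ha: μ > μ0) with a known σ. The function name and all of the numbers are made up for illustration; they do not come from these notes.

```python
from math import sqrt
from statistics import NormalDist

def power_one_sided_z(mu0, mu_a, sigma, n, alpha):
    """Power of a one-sided z test of Ho: mu = mu0 vs Ha: mu > mu0
    when the true mean is mu_a (sigma is assumed to be known)."""
    z = NormalDist()
    z_crit = z.inv_cdf(1 - alpha)               # reject Ho when z >= z_crit
    shift = (mu_a - mu0) / (sigma / sqrt(n))    # distance from null, in standard-error units
    return 1 - z.cdf(z_crit - shift)            # P(reject Ho | mu = mu_a)

# Hypothetical baseline scenario
base = power_one_sided_z(mu0=100, mu_a=103, sigma=10, n=25, alpha=0.05)

# Change one thing at a time, matching the four ways listed above
bigger_n = power_one_sided_z(100, 103, 10, 100, 0.05)      # 1. larger n
bigger_alpha = power_one_sided_z(100, 103, 10, 25, 0.10)   # 2. larger alpha
farther_alt = power_one_sided_z(100, 106, 10, 25, 0.05)    # 3. alternative further from null
smaller_sigma = power_one_sided_z(100, 103, 5, 25, 0.05)   # 4. smaller sigma

for label, value in [("baseline", base), ("larger n", bigger_n),
                     ("larger alpha", bigger_alpha),
                     ("farther alternative", farther_alt),
                     ("smaller sigma", smaller_sigma)]:
    print(f"{label:>20}: power = {value:.3f}")
```

Each of the four changes should print a Power larger than the baseline, and P(Type II Error) is 1 minus the printed value.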
Interpreting p-value (*Be sure to replace [brackets] with context!)
Assuming that [the null hypothesis in context*] is true, there is a [p-value] probability of getting a sample statistic of [sample mean/proportion] or more extreme just by chance in a random sample of [n].
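To connect the test statistic, the P-value, and the interpretation template, here is a minimal sketch of a one-sample z test for a proportion. The function name and the data (116 successes in a random sample of 200, with Ho: p = 0.5 and Ha: p > 0.5) are hypothetical. The threshold of 10 in the Large Counts check is the usual AP convention and is an assumption here, since these notes do not state the number.

```python
from math import sqrt
from statistics import NormalDist

def one_prop_z_test(successes, n, p0, alternative="greater"):
    """Return (z, p_value) for a one-sample z test of Ho: p = p0."""
    # Large Counts check uses the null value p0 (threshold of 10 is assumed)
    if n * p0 < 10 or n * (1 - p0) < 10:
        raise ValueError("Large Counts condition is not met")
    p_hat = successes / n
    se = sqrt(p0 * (1 - p0) / n)        # standard deviation of p-hat if Ho is true
    z = (p_hat - p0) / se               # test statistic, in standard deviation units
    upper_tail = 1 - NormalDist().cdf(z)
    if alternative == "greater":
        p_value = upper_tail
    elif alternative == "less":
        p_value = 1 - upper_tail
    elif alternative == "two-sided":
        p_value = 2 * min(upper_tail, 1 - upper_tail)
    else:
        raise ValueError("alternative must be 'greater', 'less', or 'two-sided'")
    return z, p_value

z_stat, p_val = one_prop_z_test(successes=116, n=200, p0=0.5, alternative="greater")
print(f"z = {z_stat:.3f}, P-value = {p_val:.4f}")
print("Significant at alpha = 0.05" if p_val < 0.05 else "Not significant at alpha = 0.05")
```

With these made-up numbers the interpretation would read: assuming the true proportion is 0.5, there is about a 0.012 probability of getting a sample proportion of 0.58 or greater just by chance in a random sample of 200.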
Calculator Tip: When estimating a population mean and the sample size is less than
30, use the calculator to create a boxplot to check for strong skewness and outliers.
Note: Don’t forget that we must pool (p-hat c as seen above) when conducting a two-proportion
z-test and NOT pool (using both p-hat 1 and 2) when building a two-proportion z-interval.
*Additional Condition: When there are two populations, we must verify that the samples are independent of
one another. This is either told in the scenario or must be reasonably inferred from the scenario.
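As an illustration of the pooling note, the sketch below computes the pooled (combined) proportion and the test statistic for a two-proportion z-test of Ho: p1 - p2 = 0. The function name and the counts (45 of 100 versus 30 of 100) are hypothetical and chosen only to show the arithmetic.

```python
from math import sqrt
from statistics import NormalDist

def two_prop_z_test(x1, n1, x2, n2):
    """Two-sided two-proportion z-test of Ho: p1 - p2 = 0.
    Returns (pooled proportion, z, p_value)."""
    p_hat1, p_hat2 = x1 / n1, x2 / n2
    # Pool because Ho says both groups share one common proportion
    p_hat_c = (x1 + x2) / (n1 + n2)
    se_pooled = sqrt(p_hat_c * (1 - p_hat_c) * (1 / n1 + 1 / n2))
    z = (p_hat1 - p_hat2) / se_pooled
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return p_hat_c, z, p_value

p_c, z_stat2, p_val2 = two_prop_z_test(x1=45, n1=100, x2=30, n2=100)
print(f"pooled p-hat = {p_c:.3f}, z = {z_stat2:.3f}, P-value = {p_val2:.4f}")
```

The pooled proportion appears only in the standard error of the test. An interval makes no assumption that p1 = p2, which is why the interval uses p-hat 1 and p-hat 2 separately.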
• Many students lose credit on the AP Statistics exam when defining parameters because their description refers to the sample
instead of the population or because the description isn't clear about which group of individuals the parameter is describing. When
defining a parameter, we suggest including the word “all”, “true”, or “population” in your description to make it clear that you
aren't referring to a sample statistic.
• Terminology matters. Never just say “the distribution.” Always say the “distribution of [blank]” and be careful to distinguish the
distribution of the population, the distribution of sample data, and the distribution of a statistic. Likewise, don't use ambiguous
terms like “sample distribution”, which could refer to the distribution of the sample data or to the sampling distribution of a
statistic. You will lose credit on the free-response questions for misusing statistical terms.
• Notation matters. The symbols all have specific and different meanings. Either use notation correctly–or don’t use it at all. You can
expect to lose credit if you use incorrect notation.
• The free response section almost always has a question that asks students to calculate a probability of some sort. Students should
always check the necessary conditions before calculating a probability even if the question doesn't specifically ask for the
conditions. Students will not be asked to perform a probability calculation in a context where the conditions have not been met.
There may, however, be a question that focuses on just the conditions. In this case, the conditions may not be met.
• The Random and Independence conditions are the same for sampling distributions that involve proportions and means. The only
condition that changes is the Normality condition. When working with proportions we must check the Large Counts Condition and
when working with means we must check the criteria for the Central Limit Theorem.
• If a free-response question asks you to complete a hypothesis test, you are expected to do the entire four-step process. That
includes clearly defining the parameter, checking conditions, reporting calculations, and stating the conclusion. Don’t forget that for one
population there are three conditions, and when there are two populations there is a fourth condition (the independent samples
condition).
• When your sample size is fewer than 30 observations AND the population shape is not given to be approximately normal, it is not
enough just to make a graph of the data on the calculator when assessing Normality. You must sketch the graph on your paper to
receive credit. You don't have to draw multiple graphs; any appropriate graph will do.
• There is almost always one free-response question that asks students to perform a significance test. Students will most likely be
asked if the data provide convincing evidence for the alternative hypothesis, rather than if the data provide convincing evidence
against the null hypothesis.
• When the P-value is greater than the alpha level we fail to reject the null. Instead of failing to reject the null hypothesis, many
students use language that sounds like they accept the null hypothesis. Accepting the null hypothesis will always lose credit on the
AP Statistics exam. Instead, students are expected to say that “there is insufficient evidence to support the alternative in context”.
• On the AP Statistics exam, it is acceptable for students to use a confidence interval rather than the test statistic and P-value to
address a two-sided alternative hypothesis. However, if the alternative hypothesis is one-sided, students will lose credit for using
the confidence interval approach unless they explicitly address the imperfect link between the one-sided test and the confidence
level, for instance by adjusting the confidence level appropriately. Our recommendation for the AP Statistics exam is to always
stick with a significance test.
• Many students lose credit when defining parameters in an experiment by describing the sample proportion rather than the true
proportion. For example, “the true proportion of the men who had surgery and survived 20 years” describes the sample statistic
and not the population parameter.
• The formula for the two-sample z interval for p1 - p2 often leads to calculation errors by students. As a result, it is recommended to
use the calculator’s 2-PropZInt feature to compute the confidence interval on the AP Statistics exam. Be sure to name the
procedure (2-sample z-interval for p1 - p2) in the Calculations step and give the interval computed by the calculator.
• When identifying the parameter of interest, it is essential to state which proportion is p1 and which is p2. Because hypothesis
testing looks at whether there is a statistically significant difference between proportions, your alternative hypothesis is not
meaningful without knowing which proportion is represented by which symbol.
• For any two-sample hypothesis test or interval, you must check and state all four conditions for both samples. If you do not include
the work for both, you will not get credit for checking the conditions.
• “Significant” in the statistical sense does not mean “important”. It means simply “not likely to happen by chance alone”.
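Two of the tips above involve confidence intervals: the two-sample z interval for p1 - p2, and the link between a one-sided test and the confidence level. The sketch below is one way to check a calculator's 2-PropZInt output by hand and to find the confidence level that matches a one-sided test. The function names and counts are hypothetical. The matching rule used here (a one-sided test at level α pairs with a confidence level of 1 - 2α) is the standard adjustment, stated as an assumption because the notes do not spell it out.

```python
from math import sqrt
from statistics import NormalDist

def two_prop_z_interval(x1, n1, x2, n2, conf_level=0.95):
    """Two-sample z interval for p1 - p2 (no pooling)."""
    p_hat1, p_hat2 = x1 / n1, x2 / n2
    se = sqrt(p_hat1 * (1 - p_hat1) / n1 + p_hat2 * (1 - p_hat2) / n2)
    z_star = NormalDist().inv_cdf((1 + conf_level) / 2)   # critical value z*
    diff = p_hat1 - p_hat2
    return diff - z_star * se, diff + z_star * se

def matching_conf_level(alpha, one_sided):
    """Confidence level whose interval lines up with a test at level alpha."""
    return 1 - 2 * alpha if one_sided else 1 - alpha

# Two-sided question at alpha = 0.05: use a 95% interval
low95, high95 = two_prop_z_interval(45, 100, 30, 100, conf_level=0.95)

# One-sided question at alpha = 0.05: the matching interval is 90%, not 95%
level = matching_conf_level(0.05, one_sided=True)
low90, high90 = two_prop_z_interval(45, 100, 30, 100, conf_level=level)

print(f"95% interval for p1 - p2: ({low95:.4f}, {high95:.4f})")
print(f"{level:.0%} interval for p1 - p2: ({low90:.4f}, {high90:.4f})")
```

On the exam, still name the procedure and report the interval in context; this kind of check is only useful for practice.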
Please note the last line contains the various confidence levels. When in need of a t-critical value (t*), see where the confidence level
column intersects the appropriate degrees of freedom row. For z*, use the df = infinity row (last row) and the confidence level column.
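The z* values in the df = infinity row can be reproduced with technology. A minimal sketch using Python's standard library follows; the function name z_star is made up for this example.

```python
from statistics import NormalDist

def z_star(conf_level):
    """Critical value z* that captures the middle conf_level of the
    standard Normal distribution."""
    # The middle area conf_level leaves (1 - conf_level) / 2 in each tail
    return NormalDist().inv_cdf((1 + conf_level) / 2)

for conf in (0.90, 0.95, 0.99):
    print(f"{conf:.0%} confidence: z* = {z_star(conf):.3f}")
```

The standard library has no t distribution, so t* values for a finite df would need the table, a calculator's invT(), or a statistics package such as SciPy (scipy.stats.t.ppf).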