AP Stats Module 1 Notes
AP Stats Module 1 Notes
*Always have units. For example, height (in.) , age (years), or SAT (points).
Quantitative Data*
Dot Plot Stemplot (stem-and-leaf) Histogram Frequency Display (ogive) Box plot (box and whisker)
Quick display! Don’t forget to include the key! Make sure that everything is less than Always cumulative display Best for summary stats!
bin limit.
Don’t forget to label and Variations: Uses 5 # Summary
title the graph!
Back-to-back (Min, Q1, Med, Q3, Max)
Split stems:
0-4, 5-9
Unless multiplying by a negative, the shape of the distribution will not change. InvNorm()
From Percentile to z-score.
Common mistakes Z-Scores When stating InvNorm, you must
state what each value represents
• Always show your work! Z-Scores: How many standard InvNorm(percentile= , mean = ,
deviations a data point is away standard deviation = )
• Round to four decimal places! from the mean.
• Don’t confuse bar charts with histograms. These two types of graphical displays look similar, but bar charts are used with categorical
data and histograms are used with numerical data.
• When constructing graphical displays, make sure to label the axes and mark them with appropriate scales (including units).
• The legend/key is an important part of a stem-and-leaf display. Make sure to always include one whenever you make a stem-and-leaf
display
• If the class intervals are not all the same width, be sure to use the density scale on the vertical axis when making a histogram.
• Center and variability are two important aspects of a distribution. When describing the distribution of a numerical data set, be sure
to address both center and variability (spread).
• The measures used to describe center and variability for distributions that are approximately symmetric (mean and standard devia-
tion) are different from those used to describe center and variability for distributions that are skewed or that have outliers (median
and IQR). It’s the difference between reporting resistant statistics vs. sensitive ones.
• For distributions that are skewed or that have outliers, you should consider using the median to describe the center.
• The lower quartile, the median, and the upper quartile divide the data set into four parts, with 25% of the data in each part.
• Outliers convey important information about a data set. For this reason, boxplots that show outliers (modified boxplots) are usually
preferred over boxplots that do not show outliers.
• When asked to compare distributions, don’t forget to use comparative language (more than, less than, same as, etc.)
Sections 1.05-1.06
• Not all distributions are mound shaped. Using the Empirical Rule in situations where you are not convinced that the data distribution
is mound shaped and approximately symmetric can lead to incorrect statements.
• z-scores indicate both direction and distance from mean in standard deviations.
• Only use the empirical rule (68-95-99.7 Rule) when specifically told to do so.
• When using your calculator to determine the area under the curve of a Normal/Approximately Normal distribution, you MUST
indicate what each value that was entered into the calculator represents. For example, NormalCDF(lower= 89, upper= 105,
mean=99, standard deviation=5)
• NEVER refer to the shape of a distribution as Normal. Random variables rarely have a shape that is perfectly normal. At the very best,
they are approximately normal.
• High percentiles are NOT always a good thing! Examples include: golf scores, swimming races, and student debt.
• Only use NormalCDF and InverseNorm when the shape of the distribution is Normal/Approximately Normal.