8.3x Week 1
8.3x Week 1
Goals
● Quantify natural concepts like “____________” and “____________”
● Examine ________-shaped distributions
● Understand why many of the _________________ distributions are bell shaped
Median? Average?
Plan B:
● Measure variability around the mean
● Need to figure out a way to quantify this
Standard Units
● How many SDs above average?
● z = (value - average)/SD
○ Negative z: value below average
○ Positive z: value above average
○ z = 0: value = average
● When values are in standard units: average = 0, SD = 1
● Chebyshev: At least 96% of the values of z are between -5 and 5
SD = ______
Second Reason for Using the SD: If the sample is large, and you draw it at ____________ with
replacement…
Then, regardless of the distribution of the population,
the probability distribution of the sample sum (or of the sample average) is roughly
normal.
Sample Averages
● Often, we only have a sample; we don’t know much about the population from which it
was drawn
● The Central Limit Theorem states that the probability distribution of the average of a
large random sample is roughly normal, regardless of the ___________________ of the
population.
● This allows us to make ___________________ based on averages of large random samples.
(ignore “bootstrap” references)
Prediction
● To predict the value of a variable,
○ Identify attributes that are associated with that variable that you can
________________
○ Describe the relation between the attributes and the variable you want to predict
○ Use the relation to make your prediction
● P_______________
○ Any discernible “shape” in the scatter
○ Linear or nonlinear
No linear association
Describe how to calculate r from this table:
Y is the square of X
Correlation = ______