Week 7 & 8
Week 7 & 8
STATIS Y
A B I L I T
PROB
STATISTICS AND PROBABILITY
TOPIC:
SAMPLING WEEK 7
DISTRIBUTION
OBJECTIVES:
1. identify sampling distributions of
statistics (sample mean);
2. find the mean and variance of the
sampling distributions of the sample
mean; and
3. solve problems involving sampling
distributions of the sample mean.
What is sampling distribution?
Sampling distribution is a statistic that determines the
probability of an event based on data from a small
group within a large population. Its primary purpose is
to establish representative results of small samples of a
comparatively larger population. Since the population
is too large to analyze, the smaller group is selected
and repeatedly sampled, or analyzed. The gathered
data, or statistic, is used to calculate the likely
occurrence, or probability, of an event.
There are three primary factors that influence the
variability of a sampling distribution. They are:
• The method of choosing the sample: How the samples were chosen
can account for variability in some cases.
There are many different possible samples of the
same size that can be drawn from a given
population. A statistic such as mean can be
computed for each of the samples drawn.
To find the mean of sampling distribution of
means, use the following formula
Definition:
where sample mean The probability distribution
total number of observations that describes the probability
Alternative formula for
for each mean of all the
samples with the same
sample size is called
where sample mean
sampling distribution.
VARIANCE AND STANDARD DEVIATION
OF THE SAMPLING DISTRIBUTION WITH
REPLACEMENT
Example 1. Consider a population with values
1. Find the population Mean.
2. Find the population Variance.
3. Find the population standard deviation.
4. Find all possible samples of size 2 which can be drawn with
replacement from this population.
5. Find the mean of the sampling distribution of means.
6. Find the variance of the sampling distribution of means.
7. Find the standard deviation of the sampling distribution of means.
Solutio
n:
VARIANCE AND STANDARD DEVIATION OF
THE SAMPLING DISTRIBUTIONS OF MEANS
WITHOUT REPLACEMENT
Example 2. A population consists of three numbers (2, 4, 6). Consider all
possible samples of size 2 which can be drawn without replacement
from the population.
Find the following
a. Population Mean
b. Population Variance
c. Population Standard Deviation
d. Mean of each sample and the mean of the sampling distribution of
means
e. Variance of the sampling distribution of means
f. Standard deviation of the sampling distribution of the means
Solutio
n:
ACTIVITY & ASSESSMENT:
A population consists of three numbers (2, 5, 8). Consider all possible
samples of size 2 which can be drawn without replacement from the
population. Find the following: (45 POINTS)
Complete the table:
1 ______
2 ______ 0.0 0.00
3 3.5 ______ ______
4 ______ ______ ______
5 5.0 ______ 0.00
6 6.5 1.5 ______
TOPIC:
THE CENTRAL WEEK 8
LIMIT
THEOREM
OBJECTIVES:
1. define the sampling distribution of the
sample mean for normal population when
the variance is:
a. known
b. unknown
2. illustrate the Central Limit Theorem; and
3. define the sampling distribution of the
sample mean using the Central Limit Theorem.
Central Limit Theorem
If samples of size , where is sufficiently large, are
drawn from any population with a mean and a
standard deviation , then the sampling distribution
of sample means approximates a normal
distribution.
The sampling distribution of the sample
means taken with replacement from a population
with population mean and variance will approach
a normal distribution according to the Central
Limit Theorem.
KEY TAKEAWAYS
•The central limit theorem (CLT) states that the
distribution of sample means approximates a normal
distribution as the sample size gets larger, regardless of
the population's distribution.
•Sample sizes equal to or greater than 30 are often
considered sufficient for the CLT to hold.
•A key aspect of CLT is that the average of the sample
means and standard deviations will equal the population
mean and standard deviation.
•A sufficiently large sample size can predict the
characteristics of a population more accurately.
Understanding the Central Limit Theorem (CLT)
According to the central limit theorem, the mean of a sample of data will be closer
to the mean of the overall population in question, as the sample size increases,
notwithstanding the actual distribution of the data. In other words, the data is
accurate whether the distribution is normal or aberrant.
As a general rule, sample sizes of around 30-50 are deemed sufficient for the CLT
to hold, meaning that the distribution of the sample means is fairly normally
distributed. Therefore, the more samples one takes, the more the graphed results
take the shape of a normal distribution. Note, however, that the central limit
theory will still be approximated in many cases for much smaller sample sizes,
such as n=8 or n=5.
The central limit theorem is often used in conjunction with the law of large numbers,
which states that the average of the sample means and standard deviations will
come closer to equaling the population mean and standard deviation as the sample
size grows, which is extremely useful in accurately predicting the characteristics of
populations.
GENERAL STEPS:
Step 1: Identify the parts of the problem. Your question should
state: 1.the mean (average or μ)
2.the standard deviation (σ)
3.population size
4.sample size (n)
5. a number associated with “less than”
Step 2: Draw a graph. Label the center with the mean. Shade the
area roughly below . (i.e. the “less than” area). This step is optional, but it
may help you see what you are looking for.
Step 3: Use the following formula to find the z-score. Plug in the
numbers from step 1.
Step 6: Convert the decimal in Step 6 to a percentage.
In our example, 0.6554 = 65.54%.
That’s it!
Example problem:
1. As the sample size increases, the sampling distribution of the means approaches a
normal distribution.
2. The mean of the sampling distribution of means is equal to the mean of the
population.
3. The variance of the sampling distribution of means is equal to the variance of the
population divided by the sample size