Lect w6m12 f2023
Lect w6m12 f2023
1 2 3 4 5
1 2 3 4 5 Original distribution
Sampling distribution
n=2
1 2 3 4 5
2 3 4
Sampling distribution n = 4 Means (n=60)
Example: sampling distribution
of proportion
Some-
never times
always
Exact sampling mean
= (12*0.0 + 16*0.5 + 2*1.0)/ 30 Persons
population palways = 0.333
=(0+8+2)/30 = 10/30 = 0.333 = AB
CD
Persons
ef
True population proportion!
CLT sampling Not always Always
=0 =1
Standard deviation
of sample N=2 possibilities:
proportion if n*p ƥalways= 0 for AB,BA,AC,CA,AD,DA
was > 10 (it isn’t!)
BC,CB,BD,DB,CD,CD 12
= sqrt[(.333*.667)/2]
= 0.3333 outcomes
ƥalways= 0.5 for Ae,eA,Af,fA,Be,eB,Bf,fB
Ce,eC,Cf,fC,De,eD,Df,fD
ƥ = 0.0 ƥ = 0.5 ƥ = 1.0
16 outcomes
Exact sampling Standard deviation ƥalways= 1.0 for ef,fe 2 outcomes
= sqrt[{12*(1/3)*(1/3)+16*(1/6)*(1/6)+2*(2/3)*(2/3)}/2]
= 0.298 … not really that far off!
Example: z-probability for sampling vs sample (or pop)
All boxes distributions
from one
company is
A food company sells “18 ounce” boxes
the of cereal. Let x denote the actual amount
population of
products of cereal in a box of cereal. Suppose that
from that x is normally distributed with µ = 18.03
company (or a _
sample of all
produced by any
ounces and = 0.05. (or x=18.03, s=0.05)
company)
a) What proportion of the boxes will
contain less than 18 ounces?
18 18.03
P(x 18) P z
0.05
P(z 0.60) 0.2743
Example - continued
One case is b) A case consists of 24 boxes of cereal. What
one sample
of 24 boxes is the probability that the mean amount of
from the cereal (per box in a case) is less than 18
population of ounces?
boxes; mean
amount per The central limit theorem states that the
x
box ( ) for
that set of 24 x
distribution of is normally distributed so
is one Note: If it were not stated
observation on the previous page that the
from the population distrib. was normal,
we could NOT use this formula 18 18.03
distribution
…unless… the sample size was
P(x 18) P z
of all 0.05 24
larger… perhaps 44 instead of 24.
possible This is because the CLT states that P(z 2.94) 0.0016
means from the sampling distribution of mean
all 24-box is normal if population distribution is
case normal OR sample size is large
The 95% Confidence Interval for p
p(1 p)
p z critical value
n
Required Sample Size
(for a specified width B of the bound, and a rough guess p
of the true population proportion)
2
1.96
n (1 )
B
The bound on error of estimation, B,
associated with a 95% confidence interval is
(1.96)·(standard error of the statistic).
The bound on error of estimation, B, associated
with a confidence interval is
(z critical value)·(standard error of the statistic).
** note that since 0.5*0.5 = 0.25, while 0.1*0.9 = 0.09… n needed is lower when
one of the two options applies to a relatively small minority than when there is
a roughly “50-50” balance of the two options **