Statistical Estimation
Statistical Estimation
Sampling Distribution
Distribution of all possible values of a statistic computed from samples of
the same size randomly selected from the same population.
Due to random variation different samples from the same population will
have different sample means.
If we repeatedly take sample of the same size n from a population, the
means of the samples form a sampling distribution of means of size n.
μ
x i
N
18 20 22 24
21
4
σ
i
(x μ) 2
2.236
N
Now consider all possible samples of size
n=2
μx
x i
18 19 21 24
21
N 16
σx
i x
(x μ ) 2
N
(18 - 21)2 (19 - 21)2 (24 - 21)2
1.58
16
Properties
1. Estimation
2. Hypothesis testing
Sample statistic Population parameter
Statistical Estimation
E.g.
Estimates for the proportion of smokers among all people aged 15 to 24
in the population
The mean level of a certain enzyme among healthy men.
Point Estimation
xi
x = i =1
n
The value of the sample statistic will vary from sample to sample therefore to
simply obtain an estimate of the single value of the parameter is not generally
acceptable.
We need to take into account the sample to sample variation of the statistic.
[ x z . , x z . ] for estimating mean
2 n 2 n
if is unknown, it can be estimated by s.
[ p z . p (1 p ) / n , p z . p (1 p ) / n ] for estimating proportion
2 2
17
The 95% confidence interval is interpreted in such a way that,
under the conditions assumed for underlying distribution, you are
95% confident that the interval contains the true parameter.
The 99% CI is wider than 95% CI; the extra width meaning that
we can be more certain that the interval will contain the
population parameter.
But to obtain a higher confidence from the same sample, we must be
willing to accept a larger margin of error (a wider interval).
For a given confidence level (i.e. 90%, 95%, 99%) the width of the
confidence interval depends on the standard error of the estimate which
in turn depends on the:
1. Sample size:-The larger the sample size, the narrower the confidence
interval and the more precise our estimate.
Example:
Solution:
• we are 95% sure that the difference is between -0.22 and –0.02.
Note: that the negative signs merely reflect the fact that better
results were obtained by using the new treatment.