SLIDES Statistics-Estimation
SLIDES Statistics-Estimation
Point Estimation
Sample statistic
To estimate the value of a population parameter, we calculate a corresponding
characteristic of the sample, referred to as a sample statistic.
Exercise: Method 1 (p. 309).
Point Estimate
Data: Status of the Salary and Training Program of 2500 EAI Managers.
We refer to the sample mean x as the point estimator of the population mean µ.
Sampling Distributions
E(x) = µ
Finite
√ population Infinite population
( )
Standard Deviation: N−n σ σ
σx = √ σx = √
N−1 n n
In the case where the population size is large, we may use the later formulate to
calculate the standard deviation of x.
Central Limit theorem
( ( )2 )
σ
x ∼ N µ, √
n
E(p) = p
▶ In stratified random sampling (chọn mẫu phân tổ), the elements in the population
are first divided into separate groups (for example, department, age, etc.). After
that, a simple random sample is taken from each group.
Other Sampling methods
▶ In cluster sampling (chọn mẫu cả khối), the elements in the population are first
divided into separate groups called clusters. Each element of the population
belongs to one and only one cluster. A simple random sample of the clusters is
then taken. All elements within each sampled cluster form the sample.
Other Sampling methods
▶ Systematic Sampling (chọn mẫu ngẫu nhiên hệ thống): every k-th element is
chosen from the population.
As we know,
x ∼ N(µ, σx2 ),
√
where σx = σ/ n.
Example: Lloyd Data: infinite population with mean µ needed to estimate and
σ = 20, a sample of size n = 100 with x = 82. This is a point estimate of µ.
where
0.95 = 1 − 2f(1.96) ▶ 1 − α is the confidence coefficient.
( )
1 − 0.95 ▶ f is the density function of the
1.96 = f−1
2 standard normal distribution.
σ ▶ zα/2 is the z-value. That is
σx = √
n f(zα/2 ) = α/2
error: ϵ = 1.96σx
Common Z values
▶ µ1 , and σ1 are the mean, and the standard deviation of population 1, respectively,
▶ µ2 and σ2 are the mean, and the standard deviation of population 2, respectively,
▶ x is the sample mean of a simple random sample from population 1,
▶ y is the sample mean of a simple random sample from population 2.
▶ The point estimator of the difference between two populations means:
x − y.
▶ Interval estimator of µ1 − µ2 :
x − y ± error . (1)
Interval Estimator of µ1 − µ2 : Independent Samples
▶ The interval estimator of µ1 − µ2 when σ1 and σ2 are known.
√
σ12 σ22
x − y ± zα/2 + (2)
n1 n2
▶ Interval estimator of µ1 − µ2 when σ1 and σ2 are unknown.
√
s21 s2
x − y ± tα/2 + 2 (3)
n1 n2
In the matched samples, two methods are tested under similar conditions. It results in
a smaller sampling error. In this case, two samples have the same size n.
Interval Estimator of µ1 − µ2 : Matched samples
Sd
x − y ± tα/2 √
n
p1 − p2 .
▶ Interval estimator of p1 − p2 :
p1 − p2 ± error . (4)
E(p1 − p2 ) = p1 − p2
√
p1 (1 − p1 ) p2 (1 − p2 )
σp1 −p2 = +
n1 n2
The interval estimate of p1 − p2