Formula Sheet_Test 2 - STAT4001
Formula Sheet_Test 2 - STAT4001
Classical Probability:
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑓𝑎𝑣𝑜𝑢𝑟𝑎𝑏𝑙𝑒 𝑜𝑢𝑡𝑐𝑜𝑚𝑒𝑠
𝑃𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑜𝑓 𝑎𝑛 𝑒𝑣𝑒𝑛𝑡 =
𝑇𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑝𝑜𝑠𝑠𝑖𝑏𝑙𝑒 𝑜𝑢𝑡𝑐𝑜𝑚𝑒𝑠
Empirical Probability:
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑡𝑖𝑚𝑒𝑠 𝑒𝑣𝑒𝑛𝑡 𝑜𝑐𝑐𝑢𝑟𝑟𝑒𝑑 𝑖𝑛 𝑝𝑎𝑠𝑡
𝑃𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑜𝑓 𝑎𝑛 𝑒𝑣𝑒𝑛𝑡 ℎ𝑎𝑝𝑝𝑒𝑛𝑖𝑛𝑔 =
𝑇𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠
Probability Rules
Additional Rule: If A and B are disjoint events, then 𝑃(𝐴 𝑜𝑟 𝐵) = 𝑃(𝐴) + 𝑃(𝐵)
𝑃(𝐴 𝑎𝑛𝑑 𝐵)
𝑃(𝐴|𝐵) = 𝑖𝑓 𝑃(𝐵) > 0
𝑃(𝐵)
General Multiplication Rule: For any two events A and B
Bernoulli Trial
➢ There are only two possible outcomes (success and failure) for each trial.
➢ The probability of success, denoted p, is the same for each trial. The probability
of failure is q = 1 – p.
➢ The trials are independent.
Geometric Distribution: Random variable X represent the number of trials until the
first success happens
1 𝑞
𝑃(𝑋 = 𝑘) = 𝑞 𝑘−1 𝑝, 𝑘 = 1, 2, 3, ⋯ ; 𝜇 = 𝐸(𝑋) = , 𝜎 = 𝑆𝐷(𝑋) = √
𝑝 𝑝2
Poisson Distribution: X represents the number of events that occur over a given
interval of time or space.
𝑒 −𝜆 𝜆𝑘
𝑃(𝑋 = 𝑘) = , 𝑘 = 0, 1, 2, ⋯ ; 𝜇 = 𝐸(𝑋) = 𝜆, 𝜎 = 𝑆𝐷(𝑋) = √𝜆
𝑘!
2|Page
Uniform Distribution:
𝑋−𝜇
Standard Normal Value 𝑧= 𝜎
Normal Approximation:
Exponential Distribution:
1 1
𝑃(𝑠 ≤ 𝑋 ≤ 𝑡) = 𝑒 −𝜆𝑠 − 𝑒 −𝜆𝑡 , 𝑃(𝑋 ≤ 𝑡) = 1 − 𝑒 −𝜆𝑡 , 𝜇 = 𝐸(𝑋) = , 𝜎 = 𝑆𝐷(𝑋) =
𝜆 𝜆
Central Limit Theorem
The means of all the possible random samples with the same sample size has a
sampling distribution which is approximately a normal distribution. The larger the
sample size is, the better the approximation will be.
𝑝(1 − 𝑝) 𝑝𝑞
𝜇(𝑝̂ ) = 𝑝, 𝑆𝐷(𝑝̂ ) = √ =√
𝑛 𝑛
𝑝𝑞
The normal model 𝑁 (𝑝, √ 𝑛 ) is a sampling distribution model for the sample proportion.
3|Page
Sampling Distribution for Mean
When a random sample is drawn from any population with mean μ and standard
deviation σ, its sample mean, 𝑥̅ , has a normal distribution with the mean μ and the
standard deviation is
𝜎
𝑆𝐷(𝑥̅ ) =
√𝑛
Confidence Interval for Proportions
𝑝̂ 𝑞̂ 𝑝̂ (1 − 𝑝̂ )
Standard Error is 𝑆𝐸(𝑝̂ ) = √ =√
𝑛 𝑛
𝑝̂ 𝑞̂ 𝑝̂ 𝑞̂
Confidence Interval is 𝑝̂ ± 𝑧 ∗ 𝑆𝐸(𝑝̂ ) = 𝑝̂ ± 𝑧 ∗ √ , 𝑴𝒂𝒓𝒈𝒊𝒏 𝒐𝒇 𝑬𝒓𝒓𝒐𝒓, 𝑀𝐸 = 𝑧 ∗ √
𝑛 𝑛
𝑧 ∗ is the critical value based on the confidence level, n is the sample size
𝒑̂𝟏 𝒒
̂𝟏 ̂𝟐 𝒒
𝒑 ̂𝟐
(𝒑 ̂ 𝟐 ) ± 𝒛∗ 𝑺𝑬(𝒑
̂𝟏 − 𝒑 ̂𝟏 − 𝒑
̂ 𝟐 ), 𝑺𝑬(𝒑 ̂𝟐 ) = √
̂𝟏 − 𝒑 + ̂𝟏 = 𝟏 − 𝒑
, 𝒒 ̂𝟏 , ̂𝟐 = 𝟏 − 𝒑
𝒒 ̂𝟐
𝒏𝟏 𝒏𝟐
where 𝑝̂1 𝑎𝑛𝑑 𝑝̂2 are the sample estimates of the true proportions from population 1 and 2,
respectively. 𝑧 ∗ is the critical value based on the confidence level.
𝑧 ∗ is the critical value based on the confidence level, n is the sample size.
𝑒 −𝜆 𝜆𝑘
POISSON.DIST(k,λ,0) gives the value of cumulative probability 𝑃(𝑋 = 𝑘) = 𝑘!
𝑒 −𝜆 𝜆𝑖
POISSON.DIST(k,λ,1) gives the value of cumulative probability 𝑃(𝑋 ≤ 𝑘) = ∑𝑘𝑖=1 𝑖!
5|Page