DPBS 1203 Business and Economic Statistics
DPBS 1203 Business and Economic Statistics
Discrete Continuous
– binomial – uniform
– Poisson – normal
– exponential
– t
– chi-square
– F
The binomial distribution
Based on the notion of a “binomial experiment” consisting of a
sequence of trials
Requirements/assumptions for something to qualify as a binomial
experiment:
– Sequence of fixed number of n trials
– Each trial has 2 outcomes, arbitrarily denoted “success” and “failure”
– There is a fixed probability of success (failure) p (1-p) over all trials
– Trials are independent
– Under these assumptions, this is a sequence of Bernoulli trials
– The outcome of each trial is recorded in a random variable
Binomial random variables
Represent our sequence of Bernoulli random variables:
– X1, X2, …, Xn where Xi = 1 is called “success” and Xi = 0 is called
“failure”
– Under the assumptions made, this is a sequence of
independent and identically distributed (iid) random variables
Suppose n = 5. Then:
– One possible outcome of the sequence of trials is 0, 1, 1, 0, 1 .
– Another is 1, 1, 0, 0, 0. And so on.
Binomial random variables…
We can form another random variable from these Bernoulli
random variables
– Consider the random variable formed by summing these n
Bernoulli random variables:
X = X1 + X2 + … + Xn
X represents the number of successes in n trials (note that
order does not matter!)
X is called a binomial random variable
– Characterized by 2 parameters: n and p
– i.e. once we know the values of these parameters in a given
case, we know everything about the random variable and its
probability distribution
Binomial distribution
In the gender composition example, we can calculate:
P(MMF) = P(X1=1, X2=1, X3=0)= pp(1-p) = p2(1-p)
Now, there are 3 ways to obtain 2 males (the two others being MFM or
FMM), so
P(2 Males) = P(X = 2) = 3p2(1-p)
A multiple choice exam has 10 Here n = 10, p = 0.2, and it is reasonable to assume
questions, each of which has 5 choices that the number of correct answers is a binomial
of which only one is correct. random variable.
If a student uses merely random P ( X = 5)=10 C5 (.2) 5 (.8) 5
guesswork to answer the quiz, what is: 10!
= (.2) 5 (.8) 5 = .0264
– P(X = 5)? 5!5!
– P(X ≤ 1)? P ( X ≤ 1) = P ( X = 0) + P ( X = 1)
– P(X > 1)? 10! 10!
= (.2) 0 (.8)10 + (.2)(.8) 9
0!10! 1!9!
= .1047 + .2684 = .3758
P ( X > 1) = 1 − P ( X ≤ 1) = 1 − .3758
Binomial example…
In this example
– P(X = 5) is a binomial probability
– P(X ≤ 1) is a cumulative binomial probability
– P(X > 1) is sometimes called a survivor probability
Even with counting techniques, these probabilities can be cumbersome
to evaluate
So, values of binomial probabilities have also been tabulated
– Tables exist of both individual probabilities and cumulative probabilities
Using binomial tables
For P(X=5) when n=10, p=0.2:
– We need the column labelled “p=0.2” in the block of values
corresponding to “n=10”
– Now consider the row labelled “k=5” 0.0264
P(X ≤ 1) could be verified from a table of individual probabilities by
summing the relevant ones
– Or directly from a table of cumulative probabilities
Binomial tables…
Binomial Probability P(X=k)
p
n k 0.05 0.10 0.15 0.20 0.25 0.90 0.95
2 0 0.9025 0.8100 0.7225 0.6400 0.5625 0.0100 0.0025
1 0.0950 0.1800 0.225 0.3220 0.3750 0.1800 0.0950
2 0.0025 0.0100 0.0225 0.0400 0.0625 0.8100 0.9025
distribution 0.30
Probability
0.20
Probability
0.15
2 distributions? 0.10
0.00
0 1 2 3 4 5 6 7 8 9 10
Correct answers
Mean and variance of a binomial random variable
𝐸𝐸(𝑋𝑋𝑖𝑖 ) = 0 × (1 − 𝑝𝑝) + 1 × 𝑝𝑝 = 𝑝𝑝
𝑉𝑉𝑉𝑉𝑉𝑉(𝑋𝑋𝑖𝑖 ) = (0 − 𝑝𝑝)2 (1 − 𝑝𝑝) + (1 − 𝑝𝑝)2 𝑝𝑝
= 𝑝𝑝(1 − 𝑝𝑝)(𝑝𝑝 + 1 − 𝑝𝑝) = 𝑝𝑝(1 − 𝑝𝑝)
Thus, 𝑛𝑛 𝑛𝑛
The variance is 𝜆𝜆 the same with the mean and the standard
deviation is the square root of 𝜆𝜆.
Review
Introduction to random variables
Mathematical expectations
Discrete random variables
The binomial distribution (and Bernoulli trials), in depth
Other discrete distributions, in brief