Introduction To Probability Distributions
Introduction To Probability Distributions
Distributions
Dr. Pravin Kumar
Probability distributions list or describe probabilities for all
possible occurrences of a random variable.
• Discrete distributions
• Continuous distributions.
Discrete probability distributions
• Discrete probability distributions describes a finite set of
possible occurrences, for discrete “count data.”
• For example, the number of successful treatments out of 2
patients is discrete, because the random variable represent
the number of success can be only 0, 1, or 2.
• The probability of all possible occurrences—Pr(0 successes),
Pr(1 success), Pr(2 successes)—constitutes the probability
distribution for this discrete random variable.
Continuous probability distributions
• Continuous probability distributions describe an “unbroken”
continuum of possible occurrences.
• For example, the probability of a given birthweight can be
anything from, say, half a pound to more than 12 pounds (or
something like that).
• Thus, the random variable of birthweight is continuous, with
an infinite number of possible points between any two values.
Random Variables
• A random variable is a variable whose values are determined
by chance.
• A discrete variable is a variable that can assume only a
specific number of values. For example, the outcomes for the
roll of a die or the outcomes for the toss of a coin.
• A continuous variable is a variable that can assume all
values in the interval between any two given values. For
example, the temperature between 62°F and 78°F.
Discrete Probability Distribution
• A discrete probability distribution consists of the values a
random variable can assume and the corresponding
probabilities of the values. The probabilities are determined
theoretically or by observation.
• Graphic representation: Probability distributions can be
shown graphically by representing the values of X on the x
axis and the probability P(X) on the y axis.
• Example:
When three coins are tossed, the sample space is represented
as TTT, TTH, THT, HTT, HHT, HTH, THH, HHH; and if X is the
random variable for the number of heads, then X assumes the
value 0, 1, 2, or 3. Probabilities for the values of X can be
determined as follows:
Discrete Probability Distribution (cont’d)
The sample space is HH, HT, TH, TT, and each outcome has a probability of 1/4. Now, in the
long run, one would expect two heads (HH) to occur approximately 1/4 of the time, one
head (HT or TH) to occur approximately 1/2 of the time, and no heads (TT) to occur
approximately 1/4 of the time. Hence, on average, one would expect the number of heads
to be
• Example: A coin is tossed 3 times. Find the probability of getting exactly two
head.
This problem can be solved by looking at the sample space.
First, find the mean number l of errors. Since there are 200
errors distributed over 500 pages, each page has an average of
λ =200/500 or 0.4 error per page. Since X = 3, substituting into
the formula yields
Hypergeometric Distribution
area=1
a b x
a b x
2,000 5,000 x
0
Calculating Normal Probabilities…
• P(–.5 < Z < 1) looks like this:
0 1.6
-2.23 0 2.23
0 1.52
0 0.9 1.9
P(0.9 < Z < 1.9) = P(0 < Z < 1.9) – P(0 < Z < 0.9)
=.4713 – .3159
= .1554
Example
• The return on investment is normally distributed with a
mean of 10% and a standard deviation of 5%. What is the
probability of losing money?
X 0 10
P(X 0) P
5
P( Z 2)
.5 P(0 Z 2)
.5 .4772
.0228
Finding Values of ZA…
• Often we’re asked to find some value of Z for a
given probability, i.e. given an area (A) under
the curve, what is the corresponding value of z
(zA) on the horizontal axis that gives us this
area? That is:
• P(Z > zA) = A
Finding Values of Z…
• What value of z corresponds to an area under
the curve of 2.5%? That is, what is z.025 ?
Area = .50 Area = .025
Similarly
– P(-1.645 < Z < 1.645) = .90
Exponential Distribution…[Not on test]
• Another important continuous distribution is the exponential
distribution which has this probability density function:
8.60
Other Continuous Distributions…
• Three other important continuous distributions which will
be used extensively in later sections are introduced here:
• Student t Distribution, Looks like the standard normal
distribution (Z) after someone sat on it
• Chi-Squared Distribution,
• F Distribution.
Student t Distribution…[don’t really need to know formula]
Figure 8.24
• As the number of degrees of freedom increases, the t
distribution approaches the standard normal distribution.
Using the t table (Table 4) for values…
• For example, if we want the value of t with 10 degrees of
freedom such that the area under the Student t curve is .05:
Area under the curve value (t) : COLUMN
t.05,10
t.05,10=1.812
8.67
Chi-Squared Distribution…[Not on test]
Figure 8.27
F Distribution…[Not on test]
• The F density function is given by:
– P(X = 40) =
– P (Y < 18) =
Problem: Target the Mean