Probability Distributions - Discrete and Normal
Probability Distributions - Discrete and Normal
• Expected Value
• The expected value, or mean, of a random variable is a measure of its central
location.
E(x) = = xf(x)
• The expected value is a weighted average of the values the random variable
may assume.
• The weights are the probabilities
• Variance and Standard Deviation
• The variance summarizes the variability in the values of a random variable
Combining Probabilities and Monetary Values aids in identifying expected loss and expected gain
• E(XY)= E(X) E(Y) if X and Y are independent
• Cov(XY)= E(XY)- E(X)E(Y)
• Var(X) = E(X2)- [E(X)]2
• Var(a+bx)=b2var(X)
• Var(X+Y)= var(X)+2cov(X,Y)+var(Y)
• Cov(X,X)= var(X) (covariance between uncorrelated variables is 0)
• cov(aU + bV, cY + dZ) = ac cov(U, Y ) + bc cov(V, Y ) + ad cov(U,Z) + bd cov(V,Z)
❑Elements/case: Entities on which data are collected
❑Variables: Object upon which the data are collected
❑E.g.: data collected on sales, profit, the number of customers served by
a bank
❑Observations: measurements of an element
• Probability: Numerical measure of the
likelihood/chances/odds that an event will occur
• Random experiment: Gauging customer's overall
satisfaction using Net promoter score
• Sample space: Most preferred, preferred, …, less preferred,
not preferred
• Sample point: Experimental outcome (Observations!)
• Event: Collection of sample space (subset/entire of the
sample space)
• Each event is associated with a probability
• Events can be combined to get new events
Random Variable
numerical description of the outcome of an experiment
Probability
Distributions
Example: continuous-uniform,
Continuous Probability normal probability
Distribution distribution, exponential
probability distribution
DISCRETE PROBABILITY DISTRIBUTIONS
x = an outcome
f (x) = the probability of x
Binomial Probability Distribution
• Four Properties of a Binomial Experiment
• The experiment consists of a sequence of ‘n’ identical trials.
• Only two outcomes are possible on each trial: success and failure
• The probability of a success, denoted by p, does not change from trial to trial
• where p = probability of success and q = probability of failure, q = 1− p
• The trials are independent
x −
e • x = the number of occurrences in an interval
• A property of the Poisson distribution is that the mean and variance are equal.
• Poisson distribution approximates the binomial distribution when n (# trials) is large
and p (change of success) is small
• The hourly number of customers arriving at a bank
• The daily number of accidents on a particular stretch of highway
• The number of cases of a disease in different towns
• The number of mutations in set sized regions of a chromosome
• The number of dolphin pod sightings along a flight path through a region
• The number of particles emitted by a radioactive source in a given time
• The number of births per hour during a given day
x x
x1 x2 x1 xx12 x2
• A continuous random variable can assume any value in an interval on the real line
or in a collection of intervals.
• It is not possible to talk about the probability of the random variable assuming a
particular value.
• Probability of the random variable assumes a value within a given interval
• The probability of the random variable assuming a value within some given
interval from x1 to x2 is defined to be the area under the graph of the probability
density function between x1 and x2.
Exponential Probability Distribution
• The exponential probability distribution is useful in describing the
time it takes to complete a task.
• The exponential random variables can be used to describe:
• Time between vehicle arrivals at a toll booth
• Time required to complete a questionnaire
• Distance between major defects in a highway
• In waiting line applications, the exponential distribution is often used for
service times
• A property of the exponential distribution is that the mean and
standard deviation are equal. 1 − x /
f ( x) = e for x > 0
Normal Probability Distribution
• Most important distribution for describing a continuous random
variable
• widely used in statistical inference Standard Deviation
• Characteristics
• Symmetric; its skewness measure is zero. .5 .5
• Defined by its mean and its standard deviation
• Mean can be any numerical value: negative, zero, or positive. Mean
• Standard deviation determines the width of the curve: larger values result in wider, flatter
curves.
• Probabilities for the normal random variable are given by areas under the curve.
• The total area under the curve is 1 (.5 to the left of the mean and .5 to the right).
• Tails of the normal distribution approach but never touch the horizontal axis
Standard Normal Probability Distribution
• Normal distribution with a mean of 0 and a standard deviation of 1 is
said to have a standard normal probability distribution.
• letter z is used to designate the standard normal random variable
x−
z= =1
• z as a measure of the number of standard deviations x is from
z
0
Empirical Rule
99.72%
95.44%
68.26%
x
– 3 – 1 + 1 + 3
– 2 + 2
Page number: 286
Practice Problems
• Beginners’ Level
• Page 285 - Problem No 9 to 16
• Intermediate Level
• Problem no. 19, 21, 22, 23, 24, 25
• Page 296: Problem no. 40, 42, 43, 45, 46, 47
• Slightly Advanced
• Problem no. 20, 41,44, 48
• Step 1: Convert the required ‘x’ into its corresponding ‘z’ value
• Step 2: Use table to find the value for z
• Step 3: Understand if the question is for greater than or less than z
• Step 4: Find the area appropriately