Siegel Andrew Random Variables
Siegel Andrew Random Variables
Random Variables
Working with Uncertain Numbers
Chapter Outline
7.1 Discrete Random Variables 156 7.4 The Normal Approximation to the Binomial 173
Finding the Mean and Standard Deviation 156 7.5 Two Other Distributions: The Poisson and the Exponential 176
7.2 The Binomial Distribution 158 The Poisson Distribution 176
Definition of Binomial Distribution and Proportion 159 The Exponential Distribution 177
Finding the Mean and Standard Deviation the Easy Way 161 7.6 End-of-Chapter Materials 178
Finding the Probabilities 162 Summary 178
7.3 The Normal Distribution 164 Key Words 180
Visualize Probabilities as the Area under the Curve 165 Questions 180
The Standard Normal Distribution Z and Its Probabilities 166 Problems 180
Solving Word Problems for Normal Probabilities 166 Database Exercises 185
The Four Different Probability Calculations 172 Projects 185
Be Careful: Things Need Not Be Normal! 173 Case 186
Many business situations involve random variables, such as Three: The number of qualified people who will
waiting to find out your investment portfolio performance or respond to your “help wanted” advertisement for a
asking customers in a marketing survey how much they new full-time employee.
would spend. Whenever a random experiment produces a num- Four: The price per barrel of oil next year.
ber (or several numbers) as part of its outcome, you can be sure Five: The reported income of the next family to respond
that random variables are involved. Naturally, you will want to to your information poll.
be able to compute and interpret summary measures (such as
A random variable may be defined as a specification
typical value and risk) as well as probabilities of events that
or description of a numerical result from a random experi-
depend on the observed random quantity—for example, the
ment. The value itself is called an observation. For exam-
probability that your portfolio grows by 10% or more.
ple, “next quarter’s sales” is a random variable because it
You can also think about random variables as being where
specifies and describes the number that will be produced
data sets come from. That is, many of the data sets you worked
by the random experiment of waiting until next quarter’s
with in Chapters 2–5 were obtained as observations of ran-
numbers are in and computing the sales. The actual future
dom variables. In this sense, the random variable itself repre-
value, $3,955,846, is an observation of this random vari-
sents the population (or the process of sampling from the
able. Note the distinction between a random variable
population), whereas the observed values of the random vari-
(which refers to the random process involved) and an obser-
able represent the sample data. Much more on population and
vation (which is a fixed number, once it has been observed).
samples is coming in Chapter 8 and beyond, but the funda-
The pattern of probabilities for a random variable is called
mentals of random numbers are covered here in this chapter.
its probability distribution.
Here are some examples of random variables. Note that
Many random variables have a mean and a standard
each one is random until its value is observed:
deviation. 1 In addition, there is a probability for each
One: Next quarter’s sales—a number that is currently event based on a random variable. We will consider two
unknown and that can take on one of a number of
different values. 1. All of the random variables considered in this chapter have a mean and
Two: The number of defective machines produced next a standard deviation, although in theory there do exist random variables
week. that have neither a mean nor a standard deviation.
Thus, the expected profit is $3.65 million. This number the Defined names section of the Formulas Ribbon. The
summarizes the various possible outcomes (10, 5, 1, –4) mean (3.65) is the sum of the products of value times
using a single number that reflects their likelihoods. probability; hence, the formula is “=SUMPRODUCT
The standard deviation of a discrete random variable (Profit,Probability).” Give this cell (which now contains
indicates approximately how far you expect it to be from the mean) the name “Mean.” The standard deviation
its mean. In many business situations, the standard deviation (4.40) is the square root (SQRT) of the sum of the pro-
indicates the risk by showing just how uncertain the situation ducts of the square of value minus mean times probability.
is. The standard deviation is denoted by σ, which matches our Hence, the formula is
use of σ as the population standard deviation. The formula is
= SQRTðSUMPRODUCTððProfit − MeanÞ^2,
Note that you would not get the correct answer by sim- This
cell is
ply using the Σ key on your calculator to accumulate only named
“Mean”
the single column of values, since this would not make
proper use of the probabilities.
The standard deviation of profit for our example is
pffi
σ= f½ð10 − 3:65Þ2 0:20 + ½ð5 − 3:65Þ2 0:40 Figure 7.1.1 shows the probability distribution, with the
heights of the lines indicating the probability and the loca-
+ ½ð1 − 3:65Þ 0:25 + ½ð−4 − 3:65Þ 0:15g
2 2
pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi tion of the lines indicating the amount of profit in each case.
= 8:064500 + 0:729000 + 1:755625 + 8:778375 Also indicated is the expected value, $3.65 million, and the
pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi standard deviation, $4.40 million.
= 19:3275 = 4:40
TABLE 7.1.1 Finding the Standard Deviation for a Discrete Random Variable
Squared Deviation
Profit Probability Deviation from Mean Squared Deviation Times Probability
10 0.20 6.35 40.3225 8.064500
Sum: 19.3275
Probability
Good
These standard deviations confirm your suspicions.
0.40 Project Z is indeed the riskiest—far more so than either of
the others. Project X is the safest—a sure thing with no risk
0.30
at all. Project Y involves a risk of $5,000.
OK Which project should be chosen? This question cannot
Great be answered by statistical analysis alone. Although the
0.20 expected value and the standard deviation provide helpful
Lousy
summaries to guide you in choosing a project, they do not
0.10 finish the task. Generally, people prefer larger expected pay-
offs and lower risk. However, with the choices presented
here, to achieve a larger expected payoff, you must take a
–5 0 5 10 greater risk. The ultimate choice of project will involve
your (and your firm’s) “risk versus return” preference to deter-
Standard deviation: Standard deviation: mine whether or not the increased expected payoff justifies
$4.40 million $4.40 million the increased risk.3
Expected profit: $3.65 million What if you measure projects in terms of profit instead of
Profit (millions) payoff? Since each project involves an initial investment of
$12,000, you can convert from payoff to profit by subtracting
FIGURE 7.1.1 The probability distribution of future profits, with the $12,000 from each payoff value in the probability distribu-
mean (expected profits) and standard deviation (risk) indicated. tion table:
Profit = Payoff − $12,000
TABLE 7.1.2 Payoffs and Probabilities for Three Projects
Using the rules from Section 5.4, which apply to summa-
Project Payoff Probability ries of random variables as well as to data, subtract $12,000
from each mean value and leave the standard deviation
X 14,000 1.00
alone. Thus, without doing any detailed calculations, you
Y 10,000 0.50 come up with the following expected profits:
Example—cont’d X: $0
The means are easily found: $14,000 for X, 10,000 × 0.50 + Y: $5,000
20,000 × 0.50 = $15,000 for Y, and 0 × 0.98 + 1,000,000 × Z: $140,000
0.02 = $20,000 for Z. We could write these as follows:
EðXÞ = μX = $14,000 3. In your finance courses, you may learn about another factor that is
EðYÞ = μY = $15,000 often used in valuing projects, namely, the correlation (if any) between
EðZÞ = μZ = $20,000 the random payoffs and the payoffs of a market portfolio. This helps mea-
sure the diversifiable and nondiversifiable risk of a project. Correlation
Based only on these expected values, it would appear that Z (a statistical measure of association) will be presented in Chapter 11.
The nondiversifiable component of risk is also known as systematic or sys-
is best and X is worst. However, these mean values don’t tell temic risk because it is part of the entire economic system and cannot be
the whole story. For example, although project Z has the high- diversified away.
est expected payoff, it also involves considerable risk: 98% of
the time there would be no payoff at all! The risks involved
here are summarized by the standard deviations:
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 7.2 THE BINOMIAL DISTRIBUTION
σ X = ð14,000 − 14,000Þ2 × 1:00 = $0
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi Percentages play a key role in business. When a percentage
σ Y = ð10,000 − 15,000Þ2 × 0:50 + ð20,000 − 15,000Þ2 × 0:50 is arrived at by counting the number of times something
= $5,000 happens out of the total number of possibilities, the number
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi of occurrences might follow a binomial distribution. If so,
σ Z = ð0 − 20,000Þ2 × 0:98 + ð1,000,000 − 20,000Þ2 × 0:02
there are a number of time-saving shortcuts available for
= $140,000 finding the expected value, standard deviation, and prob-
abilities of various events. Sometimes you will be interested
Chapter | 7 Random Variables 159
in the percentage; at other times the number of occurrences The binomial proportion p is also called a binomial frac-
will be more relevant. The binomial distribution can give tion. You may have recognized it as a relative frequency,
answers in either case. Here are some examples of random which was defined in Chapter 6.
variables that follow a binomial distribution:
1. The number of orders placed, out of the next three Example
telephone calls to your catalog order desk. How Many Orders Are Placed? The Hard Way to Compute
2. The number of defective products out of 10 items This example shows the hard way to analyze a binomial
produced. random variable. Although it is rarely necessary to draw
3. The number of people who said they would buy your the probability tree, since it is usually quite large, seeing
product, out of 200 interviewed. it once will help you understand what is really going on
with the binomial distribution. Furthermore, when the
4. The number of stocks that went up yesterday, out of all
shortcut computations are presented (the easy way) you
issues traded on major exchanges. will appreciate the time they save!
5. The number of female employees in a division of 75 Suppose you are interested in the next n = 3 telephone
people. calls to the catalog order desk, and you know from experi-
6. The number of Republican (or Democratic) votes cast in ence (or are willing to assume4) that π = 0.6, so that 60%
the next election. of calls will result in an order (the others are primarily calls
for information, or misdirected). What can we say about
the number of calls that will result in an order? Certainly,
Definition of Binomial Distribution this number will be either 0, 1, 2, or 3 calls. Since a call is
and Proportion more likely to result in an order than not, we should probably
expect the probability of getting three orders to be larger than
Focus attention on a particular event. Each time the random
the probability of getting none at all. But how can we find
experiment is run, either the event happens or it doesn’t.
these probabilities? The probability tree provides a complete
These two possible outcomes give us the bi in binomial. analysis, as shown in Figure 7.2.1a, indicating the result of
A random variable X, defined as the number of occurrences each of the three phone calls.
of a particular event out of n trials, has a binomial Note that the conditional probabilities along the branches
distribution if are always 0.60 and 0.40 (the individual probabilities for
each call) since we assume orders occur independently
1. For each of the n trials, the event always has the same
and do not influence each other. The number of orders is
probability π of happening. listed at the far right in Figure 7.2.1a; for example, the second
2. The trials are independent of one another. number from the top, 2, reports the fact that the first and sec-
The independence requirement rules out “peeking,” as in ond (but not the third) callers placed an order, resulting in
two orders placed. Note that there are three ways in which
the case of the distribution of people who order the special at
two orders could be placed. To construct the probability dis-
a restaurant. If some people order the special because they tribution of the number of orders placed, you could add up
see other customers obviously enjoying the rich, delicious the probabilities for the different ways that each number
combination of special aromatic ingredients, and say, could happen:
“WOW! I’ll have that too!” the number who order the special
would not follow a binomial distribution. Choices have to be Number of Percentage Probability
made independently in order to get a binomial distribution. Callers Who Who Ordered,
The binomial proportion p is the binomial random Ordered, X p = X/n
variable X expressed as a fraction of n: 0 0.0% 0.064
1 33.3% 0.288 (= 0.096 + 0.096 + 0.096)
2 66.7% 0.432 (= 0.144 + 0.144 + 0.144)
Binomial Proportion 3 100.0% 0.216
p= X Number of occurrences
n = Number of trials This probability distribution is displayed in Figure 7.2.1b.
Now that you have the probability distribution, you can
find all of the probabilities by adding the appropriate ones.
(Note that π is a fixed number, the probability of occur- For example, the probability of at least two orders is 0.432 +
rence, whereas p is a random quantity based on the data.) 0.216 = 0.648. You can also use the formulas for the mean
For example, if you interviewed n = 600 shoppers and and standard deviation from Section 7.1 to find the mean
found that X = 38 plan to buy your product, then the bino- value (1.80 orders) and the standard deviation (0.849 orders).
mial proportion would be However, this would be too much work! There is a much
quicker formula for finding the mean, standard deviation,
p = X = 38 = 0:063, or 6.3% (Continued)
n 600
160 PART | II Probability
Number
Caller 1 Caller 2 Caller 3 of callers
placed order? placed order? placed order? placing
orders
0.60 0.216 3
Yes
0.36 No
0.60 0.40
Yes 0.144 2
0.60 No
0.40 0.60 0.144 2
Yes
0.24 No
0.40
Yes 0.096 1
No
0.60 0.144 2
Yes
0.24 No
0.60 0.40
Yes 0.096 1
0.40 No
0.40 0.60 0.096 1
Yes
0.16 No
0.40
0.064 0
(a)
0.4
Binomial probability
0.3
0.2
0.1
0
0 1 2 3
Number of callers placing orders
(b)
FIGURE 7.2.1 a. The probability tree for three successive telephone calls, each of which either does or does not result in an order being placed. There are
eight combinations (the circles at the far right). In particular, there are three ways in which exactly two calls could result in an order: the second, third, and
fifth circles from the top, giving a probability of 3 × 0.144 = 0.432. b. The binomial probability distribution of the number of calls that result in an order
being placed.
Chapter | 7 Random Variables 161
and π = 0.05, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, and
Example—cont’d 0.95. Here is the exact formula:6
are likely to be. Your budget allows 50 people to be tested.
From your discussions with the research firm, it seems rea-
sonable initially to assume that 35% of people will recall Binomial Probability That X Equals a
the ad, although you really don’t know the exact propor-
!
tion. Based on the assumption that it really is 35%, how n
accurate will the results be? That is, about how far will PðX = aÞ = π a ð1−πÞn−a
a
the measured recall percentage be from the assumed
value π = 0.35 with n = 50 for a binomial distribution? = n! π a ð1−πÞn−a
The answer is a!ðn−aÞ!
Example
How Many Logic Analyzers to Schedule for Manufacturing?
7. The “FALSE” and “TRUE” in Excel’s binomial distribution formula You pay close attention to quality in your production facil-
refer to whether or not the probability distribution is cumulative, i.e., ities, but the logic analyzers you make are so complex that
whether or not it accumulates probabilities for all of the previous (Continued)
(smaller) values of a as well.
164 PART | II Probability
0.4 0.7
0.6
0.3
0.5
Probability
Probability
0.2 0.4
0.3
0.1
0.2
0.0 0.1
0 1 2 3 4 5 6
Number of customers who call 0
10 11 12 13 14 15 16 17 18
FIGURE 7.2.2 The probability distribution of the number of major cus- Number of working logic analyzers (17 scheduled)
tomers who will call you tomorrow. These are binomial probabilities, with
each vertical bar found using the formula based on n = 6 and p = 0.25. The FIGURE 7.2.3 The probability distribution of the number of working
number a is found along the horizontal axis. logic analyzers produced if you plan to produce only 17. This is binomial,
with n = 17 and π = 0.97.
Example—cont’d 0.7
there are still some failures. In fact, based on past experience,
0.6
about 97% of the finished products are in good working
order. Today you will have to ship 17 of these machines. 0.5
The question is: How many should you schedule for produc- Probability
0.4
tion to be reasonably certain that 17 working logic analyzers
will be shipped? 0.3
It is reasonable to assume a binomial distribution for the 0.2
number of working machines produced, with n being the
number that you schedule and π being each one’s probability 0.1
(0.97) of working. Then you can compute the probability that 0
17 or more of the scheduled machines will work. 10 11 12 13 14 15 16 17 18
What happens if you schedule 17 machines, with no mar- Number of working logic analyzers (18 scheduled)
gin for error? You might think that the high (97%) rate would
help you, but, in fact, the probability that all 17 machines FIGURE 7.2.4 The probability distribution of the number of working
will work (using n = 17 and a = 17) is just 0.596: logic analyzers produced if you plan to produce 18. This is binomial,
with n = 18 and π = 0.97.
!
17
PðX = 17 working machinesÞ = 0:9717 0:030
17
= 1 × 0:595826 × 1 = 0:596 So if you schedule 18 for production, you have a 90%
chance of shipping 17 good machines. It looks likely, but
Thus, if you schedule the same number, 17, that you need you would still be taking a 10% chance of failure. This prob-
to ship, you will be taking a big chance! There is only a 59.6% ability distribution is shown in Figure 7.2.4.
chance that you will meet the order, and a 40.4% chance that Similar tedious calculations reveal that if you schedule 19
you will fail to ship the entire order in working condition. The machines for production, you have a 98.2% chance of ship-
probability distribution is shown in Figure 7.2.3. ping 17 good machines (9.2% + 32.9% + 56.1%). So, to be
It looks as though you’d better schedule more than 17. reasonably sure of success, you’d better schedule at least 19
What if you schedule n = 18 units for production? To find machines to get 17 good ones!
the probability that at least 17 working analyzers will be
shipped, you’ll need to find the probabilities for a = 17 and
a = 18 and add them up:
PðX ≥ 17Þ = PðX = 17Þ + PðX = 18Þ 7.3 THE NORMAL DISTRIBUTION
! !
18 18 You already know from Chapter 3 how to tell if a data set
= 0:9717 0:031 + 0:9718 0:030
17 18 is approximately normally distributed. Now it’s time to
learn how to compute probabilities for this familiar bell-
= 18 × 0:595826 × 0:03 + 1 × 0:577951 × 1
shaped distribution. One reason the normal distribution
= 0:322 + 0:578 = 0:900 is particularly useful is the fact that, given only a
mean and a standard deviation, you can compute any
Chapter | 7 Random Variables 165
0 10 20 30 40 50 60 70 80
σ σ
μ 5 5 10 10
20 40
(a) (b)
FIGURE 7.3.1 a. The normal distribution, with mean value μ and standard deviation σ. Note that the mean can be any number, and the standard deviation
can be any positive number. b. Two different normal distributions. The one on the left has a smaller mean value (20) and a smaller standard deviation (5)
than the other. The one on the right has mean 40 and standard deviation 10.
Visualize Probabilities as the Area under the The shaded area gives the probability
Curve of being between here and here
The bell-shaped curve gives you a guide for visualizing the FIGURE 7.3.2 The probability that a normally distributed random vari-
able is between any two values is equal to the area under the normal curve
probabilities for a normal distribution. You are more likely between these two values. You are more likely to see values in regions
to see values occurring near the middle, where the curve is close to the mean.
high. At the edges, where the curve is lower, values are not
as likely to occur. Formally, it is the area under the curve
that gives you the probability of being within a region, as
illustrated in Figure 7.3.2.
Note that a shaded strip near the middle of the curve will
have a larger area than a strip of the same width located
nearer to the edge. Compare Figure 7.3.2 to Figure 7.3.3
to see this.
8. The formula for the normal probability distribution with mean μ and
standard deviation σ is
FIGURE 7.3.3 The probability of falling within a region that is farther
1 2
pffiffiffiffiffi e−½ðx−μÞ/σ /2 : from the middle of the curve. Since the normal curve is lower here, the
2π σ probability is smaller than that shown in Figure 7.3.2.
166 PART | II Probability
Random Variables
0.0222 0.1562 0.4960 0.01 0.5040 1.01 0.8438 2.01 0.9778
–2.02 0.0217 –1.02 0.1539 –0.02 0.4920 0.02 0.5080 1.02 0.8461 2.02 0.9783
–2.03 0.0212 –1.03 0.1515 –0.03 0.4880 0.03 0.5120 1.03 0.8485 2.03 0.9788
–2.04 0.0207 –1.04 0.1492 –0.04 0.4840 0.04 0.5160 1.04 0.8508 2.04 0.9793
–2.05 0.0202 –1.05 0.1469 –0.05 0.4801 0.05 0.5199 1.05 0.8531 2.05 0.9798
–2.06 0.0197 –1.06 0.1446 –0.06 0.4761 0.06 0.5239 1.06 0.8554 2.06 0.9803
–2.07 0.0192 –1.07 0.1423 –0.07 0.4721 0.07 0.5279 1.07 0.8577 2.07 0.9808
–2.08 0.0188 –1.08 0.1401 –0.08 0.4681 0.08 0.5319 1.08 0.8599 2.08 0.9812
–2.09 0.0183 –1.09 0.1379 –0.09 0.4641 0.09 0.5359 1.09 0.8621 2.09 0.9817
–2.10 0.0179 –1.10 0.1357 –0.10 0.4602 0.10 0.5398 1.10 0.8643 2.10 0.9821
–2.11 0.0174 –1.11 0.1335 –0.11 0.4562 0.11 0.5438 1.11 0.8665 2.11 0.9826
–2.12 0.0170 –1.12 0.1314 –0.12 0.4522 0.12 0.5478 1.12 0.8686 2.12 0.9830
–2.13 0.0166 –1.13 0.1292 –0.13 0.4483 0.13 0.5517 1.13 0.8708 2.13 0.9834
–2.14 0.0162 –1.14 0.1271 –0.14 0.4443 0.14 0.5557 1.14 0.8729 2.14 0.9838
–2.15 0.0158 –1.15 0.1251 –0.15 0.4404 0.15 0.5596 1.15 0.8749 2.15 0.9842
–2.16 0.0154 –1.16 0.1230 –0.16 0.4364 0.16 0.5636 1.16 0.8770 2.16 0.9846
–2.17 0.0150 –1.17 0.1210 –0.17 0.4325 0.17 0.5675 1.17 0.8790 2.17 0.9850
–2.18 0.0146 –1.18 0.1190 –0.18 0.4286 0.18 0.5714 1.18 0.8810 2.18 0.9854
–2.19 0.0143 –1.19 0.1170 –0.19 0.4247 0.19 0.5753 1.19 0.8830 2.19 0.9857
–2.20 0.0139 –1.20 0.1151 –0.20 0.4207 0.20 0.5793 1.20 0.8849 2.20 0.9861
–2.21 0.0136 –1.21 0.1131 –0.21 0.4168 0.21 0.5832 1.21 0.8869 2.21 0.9864
–2.22 0.0132 –1.22 0.1112 –0.22 0.4129 0.22 0.5871 1.22 0.8888 2.22 0.9868
–2.23 0.0129 –1.23 0.1093 –0.23 0.4090 0.23 0.5910 1.23 0.8907 2.23 0.9871
–2.24 0.0125 –1.24 0.1075 –0.24 0.4052 0.24 0.5948 1.24 0.8925 2.24 0.9875
–2.25 0.0122 –1.25 0.1056 –0.25 0.4013 0.25 0.5987 1.25 0.8944 2.25 0.9878
167
168
TABLE 7.3.1 Standard Normal Probability Table (See Figure 7.3.5)—cont’d
z Value Probability z Value Probability z Value Probability z Value Probability z Value Probability z Value Probability
–2.26 0.0119 –1.26 0.1038 –0.26 0.3974 0.26 0.6026 1.26 0.8962 2.26 0.9881
–2.27 0.0116 –1.27 0.1020 –0.27 0.3936 0.27 0.6064 1.27 0.8980 2.27 0.9884
–2.28 0.0113 –1.28 0.1003 –0.28 0.3897 0.28 0.6103 1.28 0.8997 2.28 0.9887
–2.29 0.0110 –1.29 0.0985 –0.29 0.3859 0.29 0.6141 1.29 0.9015 2.29 0.9890
–2.30 0.0107 –1.30 0.0968 –0.30 0.3821 0.30 0.6179 1.30 0.9032 2.30 0.9893
–2.31 0.0104 –1.31 0.0951 –0.31 0.3783 0.31 0.6217 1.31 0.9049 2.31 0.9896
–2.32 0.0102 –1.32 0.0934 –0.32 0.3745 0.32 0.6255 1.32 0.9066 2.32 0.9898
–2.33 0.0099 –1.33 0.0918 –0.33 0.3707 0.33 0.6293 1.33 0.9082 2.33 0.9901
–2.34 0.0096 –1.34 0.0901 –0.34 0.3669 0.34 0.6331 1.34 0.9099 2.34 0.9904
–2.35 0.0094 –1.35 0.0885 –0.35 0.3632 0.35 0.6368 1.35 0.9115 2.35 0.9906
–2.36 0.0091 –1.36 0.0869 –0.36 0.3594 0.36 0.6406 1.36 0.9131 2.36 0.9909
–2.37 0.0089 –1.37 0.0853 –0.37 0.3557 0.37 0.6443 1.37 0.9147 2.37 0.9911
–2.38 0.0087 –1.38 0.0838 –0.38 0.3520 0.38 0.6480 1.38 0.9162 2.38 0.9913
–2.39 0.0084 –1.39 0.0823 –0.39 0.3483 0.39 0.6517 1.39 0.9177 2.39 0.9916
–2.40 0.0082 –1.40 0.0808 –0.40 0.3446 0.40 0.6554 1.40 0.9192 2.40 0.9918
–2.41 0.0080 –1.41 0.0793 –0.41 0.3409 0.41 0.6591 1.41 0.9207 2.41 0.9920
–2.42 0.0078 –1.42 0.0778 –0.42 0.3372 0.42 0.6628 1.42 0.9222 2.42 0.9922
–2.43 0.0075 –1.43 0.0764 –0.43 0.3336 0.43 0.6664 1.43 0.9236 2.43 0.9925
–2.44 0.0073 –1.44 0.0749 –0.44 0.3300 0.44 0.6700 1.44 0.9251 2.44 0.9927
–2.45 0.0071 –1.45 0.0735 –0.45 0.3264 0.45 0.6736 1.45 0.9265 2.45 0.9929
–2.46 0.0069 –1.46 0.0721 –0.46 0.3228 0.46 0.6772 1.46 0.9279 2.46 0.9931
–2.47 0.0068 –1.47 0.0708 –0.47 0.3192 0.47 0.6808 1.47 0.9292 2.47 0.9932
PART | II
–2.48 0.0066 –1.48 0.0694 –0.48 0.3156 0.48 0.6844 1.48 0.9306 2.48 0.9934
–2.49 0.0064 –1.49 0.0681 –0.49 0.3121 0.49 0.6879 1.49 0.9319 2.49 0.9936
–2.50 0.0062 –1.50 0.0668 –0.50 0.3085 0.50 0.6915 1.50 0.9332 2.50 0.9938
Probability
–2.51 0.0060 –1.51 0.0655 –0.51 0.3050 0.51 0.6950 1.51 0.9345 2.51 0.9940
Chapter | 7
TABLE 7.3.1 Standard Normal Probability Table (See Figure 7.3.5)—cont’d
z Value Probability z Value Probability z Value Probability z Value Probability z Value Probability z Value Probability
–2.52 0.0059 –1.52 0.0643 –0.52 0.3015 0.52 0.6985 1.52 0.9357 2.52 0.9941
Random Variables
0.0057 0.0630 0.2981 0.53 0.7019 1.53 0.9370 2.53 0.9943
–2.54 0.0055 –1.54 0.0618 –0.54 0.2946 0.54 0.7054 1.54 0.9382 2.54 0.9945
–2.55 0.0054 –1.55 0.0606 –0.55 0.2912 0.55 0.7088 1.55 0.9394 2.55 0.9946
–2.56 0.0052 –1.56 0.0594 –0.56 0.2877 0.56 0.7123 1.56 0.9406 2.56 0.9948
–2.57 0.0051 –1.57 0.0582 –0.57 0.2843 0.57 0.7157 1.57 0.9418 2.57 0.9949
–2.58 0.0049 –1.58 0.0571 –0.58 0.2810 0.58 0.7190 1.58 0.9429 2.58 0.9951
–2.59 0.0048 –1.59 0.0559 –0.59 0.2776 0.59 0.7224 1.59 0.9441 2.59 0.9952
–2.60 0.0047 –1.60 0.0548 –0.60 0.2743 0.60 0.7257 1.60 0.9452 2.60 0.9953
–2.61 0.0045 –1.61 0.0537 –0.61 0.2709 0.61 0.7291 1.61 0.9463 2.61 0.9955
–2.62 0.0044 –1.62 0.0526 –0.62 0.2676 0.62 0.7324 1.62 0.9474 2.62 0.9956
–2.63 0.0043 –1.63 0.0516 –0.63 0.2643 0.63 0.7357 1.63 0.9484 2.63 0.9957
–2.64 0.0041 –1.64 0.0505 –0.64 0.2611 0.64 0.7389 1.64 0.9495 2.64 0.9959
–2.65 0.0040 –1.65 0.0495 –0.65 0.2578 0.65 0.7422 1.65 0.9505 2.65 0.9960
–2.66 0.0039 –1.66 0.0485 –0.66 0.2546 0.66 0.7454 1.66 0.9515 2.66 0.9961
–2.67 0.0038 –1.67 0.0475 –0.67 0.2514 0.67 0.7486 1.67 0.9525 2.67 0.9962
–2.68 0.0037 –1.68 0.0465 –0.68 0.2483 0.68 0.7517 1.68 0.9535 2.68 0.9963
–2.69 0.0036 –1.69 0.0455 –0.69 0.2451 0.69 0.7549 1.69 0.9545 2.69 0.9964
–2.70 0.0035 –1.70 0.0446 –0.70 0.2420 0.70 0.7580 1.70 0.9554 2.70 0.9965
–2.71 0.0034 –1.71 0.0436 –0.71 0.2389 0.71 0.7611 1.71 0.9564 2.71 0.9966
–2.72 0.0033 –1.72 0.0427 –0.72 0.2358 0.72 0.7642 1.72 0.9573 2.72 0.9967
–2.73 0.0032 –1.73 0.0418 –0.73 0.2327 0.73 0.7673 1.73 0.9582 2.73 0.9968
–2.74 0.0031 –1.74 0.0409 –0.74 0.2296 0.74 0.7704 1.74 0.9591 2.74 0.9969
–2.75 0.0030 –1.75 0.0401 –0.75 0.2266 0.75 0.7734 1.75 0.9599 2.75 0.9970
–2.76 0.0029 –1.76 0.0392 –0.76 0.2236 0.76 0.7764 1.76 0.9608 2.76 0.9971
–2.77 0.0028 –1.77 0.0384 –0.77 0.2206 0.77 0.7794 1.77 0.9616 2.77 0.9972
169
170
TABLE 7.3.1 Standard Normal Probability Table (See Figure 7.3.5)—cont’d
z Value Probability z Value Probability z Value Probability z Value Probability z Value Probability z Value Probability
–2.78 0.0027 –1.78 0.0375 –0.78 0.2177 0.78 0.7823 1.78 0.9625 2.78 0.9973
–2.79 0.0026 –1.79 0.0367 –0.79 0.2148 0.79 0.7852 1.79 0.9633 2.79 0.9974
–2.80 0.0026 –1.80 0.0359 –0.80 0.2119 0.80 0.7881 1.80 0.9641 2.80 0.9974
–2.81 0.0025 –1.81 0.0351 –0.81 0.2090 0.81 0.7910 1.81 0.9649 2.81 0.9975
–2.82 0.0024 –1.82 0.0344 –0.82 0.2061 0.82 0.7939 1.82 0.9656 2.82 0.9976
–2.83 0.0023 –1.83 0.0336 –0.83 0.2033 0.83 0.7967 1.83 0.9664 2.83 0.9977
–2.84 0.0023 –1.84 0.0329 –0.84 0.2005 0.84 0.7995 1.84 0.9671 2.84 0.9977
–2.85 0.0022 –1.85 0.0322 –0.85 0.1977 0.85 0.8023 1.85 0.9678 2.85 0.9978
–2.86 0.0021 –1.86 0.0314 –0.86 0.1949 0.86 0.8051 1.86 0.9686 2.86 0.9979
–2.87 0.0021 –1.87 0.0307 –0.87 0.1922 0.87 0.8078 1.87 0.9693 2.87 0.9979
–2.88 0.0020 –1.88 0.0301 –0.88 0.1894 0.88 0.8106 1.88 0.9699 2.88 0.9980
–2.89 0.0019 –1.89 0.0294 –0.89 0.1867 0.89 0.8133 1.89 0.9706 2.89 0.9981
–2.90 0.0019 –1.90 0.0287 –0.90 0.1841 0.90 0.8159 1.90 0.9713 2.90 0.9981
–2.91 0.0018 –1.91 0.0281 –0.91 0.1814 0.91 0.8186 1.91 0.9719 2.91 0.9982
–2.92 0.0018 –1.92 0.0274 –0.92 0.1788 0.92 0.8212 1.92 0.9726 2.92 0.9982
–2.93 0.0017 –1.93 0.0268 –0.93 0.1762 0.93 0.8238 1.93 0.9732 2.93 0.9983
–2.94 0.0016 –1.94 0.0262 –0.94 0.1736 0.94 0.8264 1.94 0.9738 2.94 0.9984
–2.95 0.0016 –1.95 0.0256 –0.95 0.1711 0.95 0.8289 1.95 0.9744 2.95 0.9984
–2.96 0.0015 –1.96 0.0250 –0.96 0.1685 0.96 0.8315 1.96 0.9750 2.96 0.9985
–2.97 0.0015 –1.97 0.0244 –0.97 0.1660 0.97 0.8340 1.97 0.9756 2.97 0.9985
PART | II
–2.98 0.0014 –1.98 0.0239 –0.98 0.1635 0.98 0.8365 1.98 0.9761 2.98 0.9986
–2.99 0.0014 –1.99 0.0233 –0.99 0.1611 0.99 0.8389 1.99 0.9767 2.99 0.9986
–3.00 0.0013 –2.00 0.0228 –1.00 0.1587 1.00 0.8413 2.00 0.9772 3.00 0.9987
Probability
Chapter | 7 Random Variables 171
9. You know that this is below the mean because the standardized number FIGURE 7.3.8 The probability of a really good quarter, in terms of stan-
z = –1.67 is negative. The standardized number z will be positive for any dardized sales numbers. The shaded area is 1 minus the unshaded area
number above the mean. The standardized number z for the mean itself under the curve, which may be looked up in the table. The answer is
is 0. 0.0918.
172 PART | II Probability
Binomial probability
Binomial probability
0.10 0.3
0.2
0.05
0.1
0
0 0 1 2 3 4 5 6 7 8 9 10
0 10 20
FIGURE 7.4.2 The probability distribution of a binomial with n = 10 and
FIGURE 7.4.1 The probability distribution of a binomial with n = 100 π = 0.10 is not very normal because n is not large enough.
and π = 0.10 is fairly close to normal.
and standard deviation (from Section 7.3), it should not be Using the Normal Approximation to the Binomial
difficult for you to compute these approximate binomial (Whole Numbers a and b):
probabilities.
Here is convincing evidence of the binomial approxima- The Probability Is Approximated by the Probability
tion to the normal. Suppose n is 100 and π is 0.10. The That the Binomial Is: That the Corresponding Normal Is:
probability distribution, computed using the binomial for- Exactly 8 Between 7.5 and 8.5
mula, is shown in Figure 7.4.1. It certainly has the bell Exactly a Between a – 0.5 and a + 0.5
shape of a normal distribution. Although it is still discrete, Between 15 and 23 Between 14.5 and 23.5
with separate, individual bars, there are enough observa- Between a and b Between a – 0.5 and b + 0.5
tions that the discreteness is not a dominant feature.
To approximate a binomial (which is discrete, only
taking on whole-number values) by using a normal ran- Compare Figure 7.4.1 (n = 100) to Figure 7.4.2 (n = 10)
dom variable (which is continuous), we will do better if to see that, with smaller n, the distribution is not as normal.
we extend the limits by one-half in each direction in Furthermore, the discreteness is more important when n is
order to include all numbers that round to the whole num- small.
ber(s).12 For example, to approximate the probability that
a binomial X is equal to 3, we would find the probability Example
that a normal distribution (with the same mean and stan- High- and Low-Speed Microprocessors
dard deviation) is between 2.5 and 3.5. We need to do We often don’t have as much control over a manufacturing
this because the probability is zero that any normal ran- process as we would like. Such is the case with sophisticated
dom variable is equal to 3 and, actually, all values that microprocessor chips, such as some of those used in micro-
the normal produced that were between 2.5 and 3.5 computers, which can have over a billion transistors placed
would round to 3. Similarly, to find the probability that on a chip of silicon smaller than a square inch. Despite care-
a binomial is between 6 and 9, you would find the prob- ful controls, there is variation within the resulting chips:
Some will run at higher speeds than others.
ability that a normal (with same mean and standard devia-
In the spirit of the old software saying “It’s not a bug, it’s a
tion) is between 5.5 and 9.5. The probability of not being
feature!” the chips are sorted according to the speed at which
between two numbers is, as usual, one minus the probabil- they will actually run and priced accordingly (with the faster
ity of being between them. chips commanding a higher price). The catalog lists two
products: 2 gigahertz (slower) and 3 gigahertz (faster).
Your machinery is known to produce the slow chips 80%
of the time, on average, and fast chips the remaining 20% of
12. We assume here that you are looking for probabilities about a binomial
the time, with chips being slow or fast independently of one
number of occurrences X. If, on the other hand, you need probabilities for a another. Today your goal is to ship 1,000 slow chips and 300
binomial proportion or percentage p, you should first convert to X and then fast chips, perhaps with some chips left over. How many
proceed from there. For example, the probability of observing “at least should you schedule for production?
20% of 261” is the same as the probability of observing “at least 53 of If you schedule 1,300 total chips, you expect 80% (1,040
261” since you need at least 0.20 × 261 = 52.2 and can observe only chips) to be slow and 20% (260 chips) to be fast. You would
whole numbers.
Chapter | 7 Random Variables 175
To solve this problem using the binomial distribution Your coworker: “It looks pretty close: 437 out of 800 is
directly would require that you compute the probability for pretty close to 50–50, which would be 400 out of 800.”
300, for 301, for 302, and so on until you got tired. The nor- You: “But 437 seems lots bigger than 400 to me.
mal approximation to the binomial allows you to solve it Let’s find out if the extra 37 could reasonably be just
much faster using the standard normal probability table. randomness.”
You will need to know the mean and standard deviation of Your coworker: “OK. Let’s assume that each person is as
the number of fast chips produced: likely to be in favor as not. Then we can compute the
chances of seeing 437 or more.”
μðNumber of fast chipsÞ = nπ You: “OK. If the chances are more than 5% or 10%, then
= 1,650 × 0:20 = 330 the extra 37 could reasonably be just randomness. But if
pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi the chances are really small, say under 5% or under 1%,
σ ðNumber of fast chipsÞ =nπð1 − πÞ then it would seem that more than just randomness
pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
= 1,650 × 0:20 × 0:80 = 16:24807 is involved.”
You also need to standardize the bounds on the number To do the calculation, let X represent the following bino-
of fast chips needed, 300 and 650 (after extending them by mial random variable: the number of people (out of 800 inter-
a half to 299.5 and 650.5), using the mean and standard viewed) who say they intend to vote in favor. If we assume that
deviation computed just above: people are evenly divided on the issue, then the probability for
each person interviewed is π = 0.50 that they intend to vote in
z1 = Standardized lower number of fast chips = 299:5 − 330 favor. Now let’s find the mean and standard deviation of X
16:24807
using formulas for the binomial distribution:
= −1:88
μX = nπ = ð800Þð0:50Þ = 400
0.8
Example—cont’d
0.7
in the sample, assuming that the population is evenly
divided, is 1 – 0.995 = 0.005. This is very unlikely: a prob- 0.6
Poisson probability
ability of about a half a percent, or 1 out of 200. 0.5
You asked, what if the population were evenly divided,
0.4
and found the answer: A sample percentage of 54.6% (this
is 437/800) or more is highly unlikely. The conclusion is 0.3
that you have evidence against the What if scenario of evenly
0.2
divided voters. It looks good for the initiative!
0.1
0
0 1 2 3 4 5
Many other probability distributions are useful in statistics. FIGURE 7.5.1 The Poisson distribution with 0.5 occurrences expected is
a skewed distribution. There is a high probability, 0.607, that no occur-
This section provides brief descriptions of two such distribu- rences will happen at all.
tions with an indication of how they might fit in with some
general classes of business applications. The Poisson distri- 0.4
bution is often useful as a model of the number of events
that occur during a fixed time, such as arrivals. The expo-
nential distribution can work well as a model of the amount 0.3
Poisson probability
0.07
1. The number of orders your firm receives tomorrow. 0.06
2. The number of people who apply for a job tomorrow to 0.05
your human resources division.
0.04
3. The number of defects in a finished product.
0.03
4. The number of calls your firm receives next week for
0.02
help concerning an “easy-to-assemble” toy.
0.01
5. A binomial number X when n is large and π is small.
0
The following figures show what the Poisson probabil- 0 10 20 30 40 50
ities look like for a system expecting a mean of 0.5 occur-
Mean = 20
rence (Figure 7.5.1), 2 occurrences (Figure 7.5.2), and 20
Number of occurrences
and randomly with a constant rate over time, the waiting Waiting time
time between successive events follows an exponential between successive events
is exponentially distributed
distribution.17
Here are some examples of random variables that might
follow an exponential distribution: Time
17. Note that this implies that the total number of events follows a Poisson The interpretations are familiar. The mean or expected
distribution. value indicates the typical or average value, and the
Chapter | 7 Random Variables 179
standard deviation indicates the risk in terms of approxi- number of standard deviations above the mean, or below
mately how far from the mean you can expect to be. the mean if the standardized number is negative) by sub-
A random variable X has a binomial distribution if it tracting the mean and dividing by the standard deviation:
represents the number of occurrences of an event out of n
Number − Mean
trials, provided (1) for each of the n trials, the event always z = Standardized number =
Standard deviation
has the same probability π of happening, and (2) the trials
Number − μ
are independent of one another. The binomial proportion =
is p = X/n, which also represents a percentage. The mean σ
and standard deviation of a binomial or binomial proportion Finally, look up the standardized number or numbers in
may be found as follows: the standard normal probability table and use the following
summary table (where z, z1, and z2 are standardized num-
Mean and Standard Deviation for a Binomial bers) to find the final answer:
Distribution
Number of Proportion or
Computing Probabilities for a Normal Distribution
Occurrences, X Percentage, p = X/n To Find the Procedure
Mean EðXÞ = μX = nπ EðpÞ = μp = π Probability of Being
rffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi πð1 − πÞ Less than z Look up z in the table
Standard deviation σ X = nπð1 − πÞ σp =
n More than z Subtract above answer from 1
Between z1 and z2 Look up z1 and z2 in the table, and
The probability that a binomial random variable X is subtract smaller probability from larger
equal to some given number a (from 0 to n) is given by Not between z1 and z2 Subtract above answer
the following formula. Binomial probability that X equals a: (for “between z1 and z2”) from 1
!
n Probabilities for a binomial distribution may be approxi-
PðX = aÞ = π a ð1− πÞn−a mated using the normal distribution with the same mean
a and standard deviation, provided n is large and π is not
n! too close to 0 or 1. Since the normal distribution is contin-
= π a ð1 −πÞn−a
a!ðn −aÞ! uous, extend the limits by one-half in each direction (for
1× 2× 3× . . . ×n example, the probability that a binomial is exactly a
= π a ð1− πÞn−a
ð1 ×2 ×3 × . . . × aÞ½1 ×2 ×3 × . . . × ðn− aÞ whole number a is approximated by the probability that
the corresponding normal is between a – 0.5 and a + 0.5).
If occurrences happen independently and randomly over
The notation n! is n factorial, the product of the numbers time, and the average rate of occurrence is constant
from 1 to n, with 0! = 1 by definition. The notation over time, then the number of occurrences that happen in a
n n! fixed amount of time will follow the Poisson distribution,
= a discrete random variable. The standard deviation is the
a a!ðn − aÞ!
square root of the mean. If the mean is large, the Poisson
is the binomial coefficient, read aloud as “n choose a.” distribution is approximately normal and the standard
The normal distribution, a continuous distribution, is normal probability table may be used. Exact Poisson prob-
represented by the familiar bell-shaped curve. The probabil- abilities may be found using the following formula:
ity that a normal random variable will be between any two
μa −μ
values is equal to the area under the normal curve between PðX = aÞ = e
these two values. There is a normal distribution for each a!
combination of a mean μ and a (positive) standard deviation The exponential distribution is a very skewed contin-
σ. The standard normal distribution is a normal distribu- uous distribution useful for understanding such variables as
tion with mean μ = 0 and standard deviation σ = 1. You may waiting times and durations of telephone calls. It has no
think of the standard normal distribution as representing the “memory,” in the sense that after you have waited awhile
number of standard deviations above or below the mean. without success for the next event, your average waiting
The standard normal probability table gives the prob- time until the next event is no shorter than it was when
ability that a standard normal random variable Z is less you started. Its standard deviation is always equal to its
than any given number z. mean. The probability that an exponential random variable
To solve word problems involving normal probabilities, X with mean μ is less than or equal to a is PðX ≤ aÞ =
first identify the mean μ, standard deviation σ, and the prob- 1 − e− a/μ . There is no normal approximation for an expo-
ability asked for. Convert to a standardized number z (the nential random variable.
180 PART | II Probability
9. You can invest in just one of four projects on a lot of 11. Suppose that 8% of the loans you authorize as vice
land you own. For simplicity, you have modeled the president of the consumer loan division of a neighbor-
payoffs (as net present value in today’s dollars) of the hood bank will never be repaid. Assume further that
projects as discrete distributions. By selling the land, you authorized 284 loans last year and that loans go
you can make $60,000 for sure. If you build an apart- sour independently of one another.
ment, you estimate a payoff of $130,000 if things a. How many of these loans, authorized by you, do
go well (with probability 0.60) and $70,000 otherwise. you expect will never be repaid? What percentage
If you build a single-family house, the payoff is do you expect?
$100,000 (with probability 0.60) and $60,000 other- b. Find the usual measure of the level of uncertainty in
wise. Finally, you could build a gambling casino the number of loans you authorized that will never
which would pay very well—$500,000—but with be repaid. Briefly interpret this number.
a probability of just 0.10 since the final government c. Find the usual measure of the level of uncertainty in
permits are not likely to be granted; all will be lost the percentage of loans you authorized that will
otherwise. never be repaid. Briefly interpret this number.
a. Find the expected payoff for each of these four 12. Your company is planning to market a new reading lamp
projects. In terms of just the expected payoff, rank and has segmented the market into three groups—avid
these projects in order from best to worst. readers, regular readers, and occasional readers—and
b. Find the standard deviation for each of these four currently assumes that 25% of avid readers, 15% of
projects. In terms of risk only, rank the projects from regular readers, and 10% of occasional readers will
best to worst. want to buy the new product. As part of a marketing
c. Considering both the expected payoff and the risk survey, 400 individuals will be randomly selected
involved, can any project or projects be eliminated from the population of regular readers. Using the cur-
from consideration entirely? rent assumptions, find the mean and standard deviation
d. How would you decide among the remaining of the percentage among those surveyed who will
projects? In particular, does any single project dom- want to buy the new product.
inate the others completely? 13. A company is conducting a survey of 235 people to
10. Your quality control manager has identified the four measure the level of interest in a new product. Assume
major problems, the extent to which each one occurs that the probability of a randomly selected person’s
(i.e., the probability that this problem occurs per item being “very interested” is 0.88 and that people are
produced), and the cost of reworking to fix each one selected independently of one another.
(see Table 7.6.5). Assume that only one problem can a. Find the standard deviation of the percentage who
occur at a time. will be found by the survey to be very interested.
a. Compute the expected rework cost for each problem b. How much uncertainty is there in the number of
separately. For example, the expected rework cost people who will be found to be very interested?
for “broken case” is 0.04 × 6.88. Compare the c. Find the expected number of people in the sample
results and indicate the most serious problem in who will say that they are very interested.
terms of expected dollar costs. d. Find the expected percentage that the survey will
b. Find the overall expected rework cost due to all four identify as being very interested.
problems together. 14. An election coming up next week promises to be very
c. Find the standard deviation of rework cost (don’t close. In fact, assume that 50% are in favor and 50%
forget the nonreworked items). are against. Suppose you conduct a poll of 791 ran-
d. Write a brief memo, as if to your supervisor, describ- domly selected likely voters. Approximately how differ-
ing and analyzing the situation. ent will the percent in favor (from the poll) be from the
50% in the population you are trying to estimate?
15. Repeat the previous problem, but now assume that 85%
are in favor in the population. Is the uncertainty larger or
smaller than when 50% was assumed? Why?
TABLE 7.6.5 Quality Control Problems: Type, 16. You have just performed a survey interviewing 358 ran-
Extent, and Cost domly selected people. You found that 94 of them are
interested in possibly purchasing a new cable TV ser-
Problem Probability Rework Cost
vice. How much uncertainty is there in this number
Broken case 0.04 $6.88 “94” as compared to the average number you would
expect to find in such a survey? (You may assume that
Faulty electronics 0.02 12.30
exactly 25% of all people you might have interviewed
Missing connector 0.06 0.75 would have been interested.)
17. You are planning to make sales calls at eight firms today.
Blemish 0.01 2.92
As a rough approximation, you figure that each call has
a 20% chance of resulting in a sale and that firms make
Chapter | 7 Random Variables 183
their buying decisions without consulting each other. 21. Find the probability that you will see moderate
Find the probability of having a really terrible day with improvement in productivity, meaning an increase in
no sales at all. productivity between 6 and 13. You may assume that
18. It’s been a bad day for the market, with 80% of securities the productivity increase follows a normal distribution
losing value. You are evaluating a portfolio of 15 secu- with a mean of 10 and a standard deviation of 7.
rities and will assume a binomial distribution for the 22. Under usual conditions, a distillation unit in a refinery
number of securities that lost value. can process a mean of 135,000 barrels per day of
a.* What assumptions are being made when you use a crude petroleum, with a standard deviation of 6,000 bar-
binomial distribution in this way? rels per day. You may assume a normal distribution.
b.* How many securities in your portfolio would you a. Find the probability that more than 135,000 barrels
expect to lose value? will be produced on a given day.
c.* What is the standard deviation of the number of b. Find the probability that more than 130,000 barrels
securities in your portfolio that lose value? will be produced on a given day.
d.* Find the probability that all 15 securities lose value. c. Find the probability that more than 150,000 barrels
e.* Find the probability that exactly 10 securities lose will be produced on a given day.
value. d. Find the probability that less than 125,000 barrels
f. Find the probability that 13 or more securities lose will be produced on a given day.
value. e. Find the probability that less than 100,000 barrels
19. Your firm has decided to interview a random sample of will be produced on a given day.
10 customers in order to determine whether or not to 23. The quality control section of a purchasing contract for
change a consumer product. Your main competitor valves specifies that the diameter must be between
has already done a similar but much larger study and 2.53 and 2.57 centimeters. Assume that the production
has concluded that exactly 86% of consumers approve equipment is set so that the mean diameter is 2.56 cen-
of the change. Unfortunately, your firm does not have timeters and the standard deviation is 0.01 centimeter.
access to this information (but you may use this figure What percent of valves produced, over the long run,
in your computations here). will be within these specifications, assuming a normal
a. What is the name of the probability distribution of distribution?
the number of consumers who will approve of the 24. Assume that the stock market closed at 13,246 points
change in your study? today. Tomorrow you expect the market to rise a mean
b. What is the expected number of people, out of the of 4 points, with a standard deviation of 115 points.
10 you will interview, who will approve of the Assume a normal distribution.
change? a. Find the probability that the stock market goes
c. What is the standard deviation of the number of down tomorrow.
people, out of the 10 you will interview, who will b. Find the probability that the market goes up more
approve of the change? than 50 points tomorrow.
d. What is the expected percentage of people, out of c. Find the probability that the market goes up more
the 10 you will interview, who will approve of the than 100 points tomorrow.
change? d. Find the probability that the market goes down
e. What is the standard deviation of the percentage of more than 150 points tomorrow.
people, out of the 10 you will interview, who will e. Find the probability that the market changes by
approve of the change? more than 200 points in either direction.
f. What is the probability that exactly eight of your 25. Based on recent experience, you expect this Saturday’s
interviewed customers will approve of the change? total receipts to have a mean of $2,353.25 and a
g. What is the probability that eight or more of standard deviation of $291.63 and to be normally
your interviewed customers will approve of the distributed.
change? a. Find the probability of a typical Saturday, defined as
20. Suppose that the number of hits on your company’s total receipts between $2,000 and $2,500.
website, from noon to 1 p.m. on a typical weekday, fol- b. Find the probability of a terrific Saturday, defined as
lows a normal distribution (approximately) with a mean total receipts over $2,500.
of 190 and a standard deviation of 24. c. Find the probability of a mediocre Saturday, defined
a. Find the probability that the number of hits is more as total receipts less than $2,000.
than 160. 26. The amount of ore (in tons) in a segment of a mine is
b. Find the probability that the number of hits is less assumed to follow a normal distribution with mean
than 215. 185 and standard deviation 40. Find the probability
c. Find the probability that the number of hits is that the amount of ore is less than 175 tons.
between 165 and 195. 27. You are a farmer about to harvest your crop. To describe
d. Find the probability that the number of hits is not the uncertainty in the size of the harvest, you feel that it
between 150 and 225. may be described as a normal distribution with a mean
184 PART | II Probability
value of 80,000 bushels and a standard deviation of and you have projected a probability of 0.53 that a
2,500 bushels. Find the probability that your harvest typical individual will vote to strike.
will exceed 84,000 bushels. a. Identify n and π for this binomial random variable.
28. Assume that electronic microchip operating speeds are b. Find the mean and standard deviation of the num-
normally distributed with a mean of 2.5 gigahertz and a ber who will vote to strike.
standard deviation of 0.4 gigahertz. What percentage of c. Find the (approximate) probability that a strike will
your production would you expect to be “superchips” result (i.e., that a majority will vote to strike).
with operating speeds of 3 gigahertz or more? 33. Reconsider the previous problem and answer each part,
29. Although you don’t know the exact total amount of pay- but assume that 1,000 people will vote. (The probability
ments you will receive next month, based on past for each one remains unchanged.)
experience you believe it will be approximately 34. Assume that if you were to interview the entire popula-
$2,500 more or less than $13,000, and will follow a nor- tion of Detroit, exactly 18.6% would say that they are
mal distribution. Find the probability that you will ready to buy your product. You plan to interview a
receive between $10,000 and $15,000 next month. representative random sample of 250 people. Find the
30. A new project will be declared “successful” if you (approximate) probability that your observed sample
achieve a market share of 10% or more in the next percentage is overoptimistic, where this is defined as
two years. Your marketing department has considered the observed percentage exceeding 22.5%.
all possibilities and decided that it expects the product 35. Suppose 15% of the items in a large warehouse are
to attain a market share of 12% in this time. However, defective. You have chosen a random sample of 250
this number is not certain. The standard deviation is items to examine in detail. Find the (approximate)
forecast to be 3%, indicating the uncertainty in the probability that more than 20% of the sample is
12% forecast as 3 percentage points. You may assume defective.
a normal distribution. 36. You are planning to interview 350 consumers randomly
a.* Find the probability that the new project is selected from a large list of likely sales prospects, in
successful. order to assess the value of this list and whether you
b. Find the probability that the new project fails. should assign salespeople the task of contacting them
c. Find the probability that the new project is wildly all. Assuming that 13% of the large list will respond
successful, defined as achieving at least a 15% mar- favorably, find (approximate) probabilities for the
ket share. following:
d. To assess the precision of the marketing projections, a. More than 10% of randomly selected consumers
find the probability that the attained market share will respond favorably.
falls close to the projected value of 12%, that is, b. More than 13% of randomly selected consumers
between 11% and 13%. will respond favorably.
31. A manufacturing process produces semiconductor chips c. More than 15% of randomly selected consumers
with a known failure rate of 6.3%. Assume that chip fail- will respond favorably.
ures are independent of one another. You will be pro- d. Between 10% and 15% of randomly selected con-
ducing 2,000 chips tomorrow. sumers will respond favorably.
a. What is the name of the probability distribution of 37. You have just sent out a test mailing of a catalog to
the number of defective chips produced tomorrow? 1,000 people randomly selected from a database of
b. Find the expected number of defective chips 12,320 addresses. You will go ahead with the mass
produced. mailing to the remaining 11,320 addresses provided
c. Find the standard deviation of the number of defec- you receive orders from 2.7% or more from the test mail-
tive chips. ing within two weeks. Find the (approximate) probability
d. Find the (approximate) probability that you will pro- that you will do the mass mailing under each of the fol-
duce fewer than 130 defects. lowing scenarios:
e. Find the (approximate) probability that you will pro- a. Assume that, in reality, exactly 2% of the population
duce more than 120 defects. would send in an order within two weeks.
f. You just learned that you will need to ship 1,860 b. Assume that, in reality, exactly 3% of the population
working chips out of tomorrow’s production of would send in an order within two weeks.
2,000. What are the chances that you will succeed? c. Assume that, in reality, exactly 4% of the population
Will you need to increase the scheduled number would send in an order within two weeks.
produced? 38. You expect a mean of 1,671 warranty repairs next
g. If you schedule 2,100 chips for production, what is month, with the actual outcome following a Poisson
the probability that you will be able to ship 1,860 distribution.
working ones? a. Find the standard deviation of the number of such
32. A union strike vote is scheduled tomorrow, and it looks repairs.
close. Assume that the number of votes to strike follows b. Find the (approximate) probability of more than
a binomial distribution. You expect 300 people to vote, 1,700 such repairs.
Chapter | 7 Random Variables 185
39. If tomorrow is a typical day, your human resources 45. Assuming the appropriate probability distribution for the
division will expect to receive résumés from 175 job situation described in the preceding problem:
applicants. You may assume that applicants act inde- a. Find the probability that the system will last 100,000
pendently of one another. hours or more (twice the average lifetime).
a. What is the name of the probability distribution of b. The system is guaranteed to last at least 5,000 hours.
the number of résumés received? What percentage of production is expected to fail
b. What is the standard deviation of the number of during the guarantee period?
résumés received? 46. Compare the “probability of being within one standard
c. Find the approximate probability that you will deviation of the mean” for the exponential and normal
receive more than 185 résumés. distributions.
d. Find the approximate probability of a slow day, with
160 or fewer résumés received.
Database Exercises
40. On a typical day, your clothing store takes care of 2.6
“special customers” on average. These customers are Problems marked with an asterisk (*) are solved in the Self
taken directly to a special room in the back, are assigned Test in Appendix C.
a full-time server, are given tea (or espresso) and scones, Refer to the employee database in Appendix A.
and have clothes brought to them. You may assume that
1. View each column as a collection of independent obser-
the number who will arrive tomorrow follows a Poisson
vations of a random variable.
distribution.
a. In each case, what kind of variable is represented,
a. Find the standard deviation of the number of special
continuous or discrete? Why?
customers.
b.* Consider the event “annual salary is above
b. Find the probability that no special customers arrive
$40,000.” Find the value of the binomial random
tomorrow.
variable X that represents the number of times this
c. Find the probability exactly 4 special customers will
event occurred. Also find the binomial proportion
arrive tomorrow.
p and say what it represents.
41. In order to earn enough to pay your firm’s debt this year,
c. What fraction of employees are male? Interpret
you will need to be awarded at least 2 contracts. This is
this number as a binomial proportion. What is n?
not usually a problem, since the yearly average is 5.1
2. You have a position open and are trying to hire a new per-
contracts. You may assume a Poisson distribution.
son. Assume that the new person’s experience will follow
a. Find the probability that you will not earn enough to
a normal distribution with the mean and (sample) stan-
pay your firm’s debt this year.
dard deviation of your current employees.
b. Find the probability that you will be awarded
a. Find the probability that the new person will have
exactly 3 contracts.
more than six years of experience.
42. Customers arrive at random times, with an exponential
b. Find the probability that the new person will have
distribution for the time between arrivals. Currently the
less than three years of experience.
mean time between customers is 6.34 minutes.
c. Find the probability that the new person will have
a. Since the last customer arrived, three minutes have
between four and seven years of experience.
gone by. Find the mean time until the next customer
3. Suppose males and females are equally likely and that the
arrives.
number of each gender follows a binomial distribution.
b. Since the last customer arrived, 10 minutes have gone
(Note that the database contains observations of random
by. Find the mean time until the next customer arrives.
variables, not the random variables themselves.)
43. In the situation described in the previous problem, a cus-
a. Find n and π for the binomial distribution of the
tomer has just arrived.
number of males.
a. Find the probability that the time until the arrival of
b. Find n and π for the binomial distribution of the
the next customer is less than 3 minutes.
number of females.
b. Find the probability that the time until the arrival of
c. Find the observed value of X for the number of
the next customer is more than 10 minutes.
females.
c. Find the probability that the time until the arrival of
d. Use the normal approximation to the binomial dis-
the next customer is between 5 and 6 minutes.
tribution to find the probability of observing this
44. A TV system is expected to last for 50,000 hours before
many females (your answer to part c) or fewer in
failure. Assume an exponential distribution for the time
the database.
until failure.
a. Is the distribution skewed or symmetric?
b. What is the standard deviation of the length of time Projects
until failure?
c. The system has been working continuously for the 1. Choose a continuous random quantity that you might
past 8,500 hours and is still on. What is the deal with in your current or future work as an executive.
expected time from now until failure? (Be careful!) Model it as a normally distributed random variable and
186 PART | II Probability
estimate (i.e., guess) the mean and standard deviation. years inspires you to do a What if scenario analysis, recog-
Identify three events of interest to you relating to this ran- nizing that there is no obligation to extract the oil and that
dom variable and compute their probabilities. Briefly it could be extracted fairly quickly (taking a few months) at
discuss what you have learned. any time during the three-year lease. You are wondering:
2. Choose a discrete random quantity (taking on from 3 to What if the price of oil rises enough during the three years
10 different values) that you might deal with in your cur- for it to be profitable to develop the oil field? If so, then
rent or future work as an executive. Estimate (i.e., guess) you would extract the oil. But if the price of oil didn’t rise
the probability distribution. Compute the mean and stan- enough, you would let the term of the lease expire in three
dard deviation. Identify two events of interest to you years, leaving the oil still in the ground. You would let the
relating to this random variable and compute their prob- future price of oil determine whether or not to exercise the
abilities. Briefly discuss what you have learned. option to extract the oil.
3. Choose a binomial random quantity that you might deal But such a proposition is risky! How much risk? What are
with in your current or future work as an executive. the potential rewards? You have identified the following
Estimate (i.e., guess) the value of n and π. Compute the basic probability structure for the source of uncertainty in
mean and standard deviation. Identify two events of this situation:
interest to you relating to this random variable and com-
pute their probabilities. Briefly discuss what you have Future Price of Oil Probability
learned. 60 0.10
4. On the Internet, find and record observations on at least 70 0.15
five different random variables such as stock market 80 0.20
indices, interest rates, corporate sales, or any business- 90 0.30
related topic of interest to you. 100 0.15
110 0.10
Case
The Option Value of an Oil Lease Discussion Questions
There’s an oil leasing opportunity that looks too good to be 1. How much money would you make if there were no
true, and it probably is too good to be true: An estimated costs of extraction? Would this be enough to retire?
1,500,000 barrels of oil sitting underground that can be 2. Would you indeed lose money if you leased and
leased for three years for just $1,300,000. It looks like a extracted immediately, considering the costs of extrac-
golden opportunity: Pay just over a million, bring the oil to tion? How much money?
the surface, sell it at the current spot price of $76.45 per bar- 3. Continue the scenario analysis by computing the future
rel, and retire. net payoff implied by each of the future prices of oil.
However, upon closer investigation, you come across the To do this, multiply the price of oil by the number of bar-
facts that explain why nobody else has snapped up this rels; then subtract the cost of extraction. If this is nega-
“opportunity.” Evidently, it is difficult to remove the oil tive, you simply won’t develop the field, so change
from the ground due to the geology and the remote location. negative values to zero. (At this point, do not subtract
A careful analysis shows that estimated costs of extracting the lease cost, because we are assuming that it has
the oil are a whopping $120,000,000. You conclude that already been paid.)
by developing this oil field, you would actually lose 4. Find the average future net payoff, less the cost of the
money. Oh well. lease. How much, on average, would you gain (or
During the next week, although you are busy investigat- lose) by leasing this oil field? (You may ignore the time
ing other capital investment opportunities, your thoughts value of money.)
keep returning to this particular project. In particular, the 5. How risky is this proposition?
fact that the lease is so cheap and that it lasts for three 6. Should you lease or not?