1 Continuous Probability Distributions: Objectives
1 Continuous Probability Distributions: Objectives
com
1 CONTINUOUS
PROBABILITY
DISTRIBUTIONS
Objectives
After studying this chapter you should
• know the definition of E(X) and E(g(X)) for both discrete and
continuous distributions;
• know the definition of mean and variance in terms of
expectations;
• be able to do calculations involving linear combinations of
independent normal random variables;
• be able to calculate probabilities using the exponential
distribution.
1.0 Introduction
In the text Statistics, continuous probability distributions were
introduced in Chapter 7. Probability was represented by the
area under a curve, known as the probability density function.
A probability density function of a random variable, X, must be
non-negative for all values of X and the total area under the
curve must be 1.
1
www.vagupu.com
Chapter 1 Continuous Probability Distributions
1.1 Expectation
For a discrete probability distribution
E ( X ) = ∑ xP( X = x )
x
If g( x ) is a function of x then
E ( g( x )) = ∑ g( x )P( X = x )
x
If, for example, the random variable X can take the values 1, 2 or
3 with probabilities 0.2, 0.3 and 0.5 respectively, then
∞
E( X ) = ∫
−∞
x f ( x )dx
∞
and E ( g ( X )) = ∫ g( x ) f ( x )dx
−∞
Example
The random variable X has probability density function
Solution
2
∫
3 2
E( X ) = x ( 2 − x )dx
0 4
∫ (2x )
3 2
= 2
− x 3 dx
4 0
2
www.vagupu.com
Chapter 1 Continuous Probability Distributions
2
3 2 x4
= x3 −
4 3 4 0
3 16
= − 4
4 3
=1
E =
2
∫
1 1 3
and . x (2 − x )dx
X 0 x 4
2
∫
3
= (2 − x )dx
0 4
2
3 x2
= 2x −
4 2 0
= 1.5
E ( X − E ( X ))
2
(
σ2 = E [X − µ]
2
)
An equivalent formula is
( )
σ2 = E X2 − µ 2 (see section 1.4)
Example
A charity group raises funds by collecting waste paper. A skip-
full will contain an amount, X, of other materials such as plastic
bags and rubber bands. X may be regarded as a random variable
3
www.vagupu.com
=0 otherwise.
Solution
4
(a) ∫ k ( x − 1)(4 − x )dx = 1
1
∫ (− x )
4
k 2
+ 5x − 4 dx = 1
1
4
− x 3 5x 2
k + − 4x = 1
3 2 1
8 11
k − − = 1
3 6
4.5k = 1
9 2
k =1 ⇒ k =
2 9
4
∫
2
(b) Mean, E( X ) = x ( x − 1)( 4 − x )dx
1 9
2
− x 3 + 5x 2 − 4x dx
4
=
9 ∫ 1
4
2 − x 4 5x 3 4x 2
= + −
9 4 3 2 1
2 32 7
= − −
9 3 12
= 2.5
4
www.vagupu.com
Chapter 1 Continuous Probability Distributions
( ) ∫
4 2 2
E X2 = x ( x − 1)( 4 − x )dx
1 9
∫ (− x )
2 4
= 4
+ 5x 3 − 4x 2 dx
9 1
4
2 x 5 5x 4 4x 3
= − + −
9 5 4 3 1
2 448 17
= − −
9 15 60
= 6. 7
Hence
( )
variance = E X 2 − µ 2 = 6. 7 − 2.52 = 0. 45
and
2 8 7
= −
9 3 3
= 0.0741
Thus the charity will make more money in the long run if
the paper is not sorted.
5
www.vagupu.com
Chapter 1 Continuous Probability Distributions
Exercise 1A
1. A temporary roundabout is installed at a crossroads. 3. A technique for measuring the density of a
The time, X minutes, which vehicles have to wait silicon compound is a random variable, X, with
before entering the crossroads has probability probability density function
density function
f (x) = k −0.04 < x < 0.04
f ( x ) = 0.8 − 0.32 x 0 < x < 2.5 =0 otherwise
=0 otherwise (a) Find the value of k.
Find the mean and standard deviation of X.
(AEB) (b) Find the mean and standard deviation of X.
2. The random variable, Y, has probability density (c) Find the probability that the error is between
function −0.03 and 0.01.
f ( y ) = k (8 − 2 y ) 0<x<4 (d) Find the probability that the magnitude of the
=0 otherwise error is greater than 0.035.
(a) Verify that k = 0.0625 and that the median
is 1.172.
(b) Find the mean and standard deviation of Y.
(c) What is the probability that the mean of a
random sample of size 50 from this
distribution will lie between the mean and
the median? (AEB)
(i) E ( a + bX ) = a + bE ( X )
(ii) E ( X + Y ) = E ( X ) + E (Y )
(iii) E ( a1 X1 + a2 X2 + K + an Xn )
= a1 E ( X1 ) + a2 E ( X2 ) + K + an E ( Xn )
(iv) E ( XY ) = E ( X ) E (Y )
6
www.vagupu.com
Note: Brief proofs are included but will not be tested in the AEB
examination and may be omitted.
(i) (
V(X) = E [X − µ ]
2
)
( )
= E X 2 − 2µE ( X ) + µ 2
= E( X ) − 2µ2 2
+ µ2
= E( X ) − µ 2 2
(ii) V ( a + bX ) = b 2 V ( X )
Since
E ( a + bX ) = a + bE ( X )
= a + bµ ,
then
(
V ( a + bX ) = E [ a + bX − a − bµ ]
2
)
= E(b [ X − µ ] )
2 2
= b E ([ X − µ ] )
2 2
= b 2V ( X )
V ( X ± Y ) = V ( X ) + V (Y )
7
www.vagupu.com
[ {
V ( X ± Y ) = E X ± Y − µ x ± µ y }]
2
[ {
= E {X − µ x } ± Y − µ y }]
2
( ) [
= E [ X − µ x ] + E X − µ y
2
]
2
( [ ])
± 2E [ X − µ x ] Y − µ y
= V ( X ) + V (Y ) ± 2E ( X − µ x ) E (Y − µ x )
Now
E( X − µ x ) = E( X ) − µ x
= µx − µx
=0
Hence
V ( X ± Y ) = V ( X ) + V (Y ) .
V ( a1 X1 + a2 X2 + K + an Xn )
= a12 V ( X1 ) + a2 2 V ( X2 ) + K + an 2 V ( Xn )
[ X + X2 + K + Xn ]
E( X ) = E 1
n
8
www.vagupu.com
E( X ) =
1
n
(
E[ X1 ] + E[ X2 ] + K + E[ Xn ] )
=
1
(µ + µ + K + µ )
n
=
1
(nµ )
n
=µ
to be µ .
V ( X ) = V ( X1 + X2 +... + Xn )
1
n
=
1
n2
(
V ( X1 ) + V ( X2 ) + K + V ( Xn ) )
=
1
n2
(
σ 2 + σ 2 + K+σ 2 )
=
1
n2
(
nσ 2 )
σ2
=
n
You have now shown that the sample mean X is distributed
σ2
with mean µ and variance , a result which has been
n
used frequently in Statistics.
First consider
([
∑ ( Xi − µ ) = ∑ Xi − X + X − µ ] [ ])
2 2
= ∑ ( Xi − X ) + ∑ ( X − µ ) + 2 ∑ ( Xi − X )( X − µ )
2 2
9
www.vagupu.com
Now ∑ ( Xi − X )( X − µ ) = ( X − µ )∑ ( Xi − X )
(
= ( X − µ ) [ ∑ Xi ] − nX )
= ( X − µ )( ∑ Xi − ∑ Xi )
=0
Hence
∑ ( Xi − µ ) = ∑ ( Xi − X ) + ∑ ( X − µ )
2 2 2
= ∑ ( Xi − X ) + n ( X − µ ) .
2 2
(
∑ E [ Xi − µ ]
2
) = E(∑ [ X − X ] ) + nE([ X − µ ] )
i
2 2
( [ ] ) + nnσ
2
2
⇒ nσ 2 = E ∑ Xi − X .
Hence
( [
E ∑ Xi − X ] ) = (n − 1)σ 2
2
or
( [
E ∑ Xi − X ])
2
= σ2 .
( n − 1)
The expression on the left hand side was met in Statistics and is
( )
denoted σ̂ 2 . You have just shown that E σ̂ 2 = σ 2 . σ̂ 2 is said
to be an unbiased estimator of σ and this is why it is generally
2
10
www.vagupu.com
The variance of X + Y = V ( X ) + V (Y ) = 4 2 + 32 = 25 .
A child opens a new jar of jam and takes a spoonful out. The
weight of jam in the spoon is a random variable, Z, with mean
22 g standard deviation 2 g. Z is independent of X.
The weight of jam remaining in the jar is X − Z .
The variance of X − Z is V ( X ) + V ( Z ) = 4 2 + 2 2 = 20 .
Example
A machine produces rubber balls whose diameters are normally
distributed with mean 5.50 cm and standard deviation 0.08 cm.
11
www.vagupu.com
Solution
(5.60 − 5.50 )
(a) (i) z= = 1.25 . 0.89435
0.08
Using Normal tables, the probability of being less than
5.60 cm is 0.894.
1.25
(5.34 − 5.50 )
(ii) z1 = = −2.0
0.08
(5. 44 − 5.50 )
z2 = = −0. 75 .
0.08
Hence the probability of being between 5.34 cm and 1 – 0.77337
(0.05 − 0.20 )
–1.04 0.347
z1 = = −1.040
0.1442
(0.25 − 0.20 )
z2 = = − 0.347
0.1442
Note: Interpolation has been used in reading the normal tables, but
the effect on the final answer is small.
12
www.vagupu.com
Example
A baker makes digestive biscuits whose masses are normally
distributed with mean 24.0 g and standard deviation 1.9 g. The
biscuits are packed by hand into packets of 25.
Solution
(i) Since the sample is random the masses of the 25 biscuits
included in the packet will be independent of each other.
The distribution of the total mass will therefore be normal.
(
The distribution is N 600, 9.52 . )
To find the probability the total mass lies between 598 g and
606 g, note that
(606 − 600 )
z2 = = 0.632 .
9.5 –0.211 0.632
Probability between 598 g and 606 g
13
www.vagupu.com
Chapter 1 Continuous Probability Distributions
(6010 − 6000 )
z1 = = 0.333
30.04 0.9771
0.6304
(6060 − 6000 )
z2 = = 1.997 .
30.04
0.333 1.997
Probability of being between 6010 g and 6060 g
= 0.9771 − 0.6304 = 0.347 .
Thus
( 0 − 24)
z= = −1. 769 .
13.57
This gives the probability of this occurring as
1 − 0.9615 = 0.0385 .
Exercise 1B
1. A dispenser discharges an amount of soft drink 2. The weights of pieces of home made fudge are
which is normally distributed with mean 475 ml normally distributed with mean 34 g and
and standard deviation 20 ml. standard deviation 5 g.
(a) What is the distribution of the total amount (a) A bag contains 15 pieces of fudge chosen at
in two independent drinks? random. What is the distribution of the total
(b) If the capacity of the cups into which the weight of fudge in the bag? What is the
drink is dispensed is normally distributed probability that the total weight is between
with mean 500 ml and standard deviation 490 g and 540 g?
30 ml, what is the distribution of the (b) What is the probability that the total weight
difference between the capacity of a cup of fudge in a bag containing 15 pieces
and the amount dispensed? exceeds that in another bag containing
Assume the two are independent. 16 pieces? (AEB)
14
www.vagupu.com
Chapter 1 Continuous Probability Distributions
3. Audrey is a regular customer of Toto’s taxis. (b) Find the mean of Y, assuming a normal
When she rings from home the time, X, a taxi distribution.
takes to arrive is normally distributed with mean
19 minutes and standard deviation 3 minutes. (c) What is the distribution of T where T = X − Y ?
(X and Y may be assumed independent.) If
(a) (i) What is the probability of her having to both firms were rung at the same time, what
wait less than 15 minutes for a taxi? is the probability that Toto would arrive first?
(ii) What waiting time will be exceeded with (d) In order to catch a train Audrey needs a taxi
probability 0.1? within 10 minutes. Which firm would you
Audrey decides to try Blue Star taxis. The advise her to ring? Explain your answer.
standard deviation of her waiting time, Y, is (AEB)
7 minutes and the probability of Y exceeding
8 minutes is 0.97725.
∞
e − mx
[ ] ∫
∞ ∞ ∞
∫
1
mxe − mx dx = − xe − mx − −e − mx dx = − =m
0 0 0 m 0
( ) ∫ [ ] −∫
∞ ∞ ∞ 2
E X2 = mx 2 e − mx dx = − x 2 e − mx −2xe − mx dx =
0 0 0 m2
∞
∫
1
mxe − mx dx = .
0 m
Thus
2 1 1
V(X) = − = .
m2 m2 m2
1
Hence the standard deviation is .
m
15
www.vagupu.com
[ ]
x
P( X < x ) = ∫
x
me − mx dx = −e − mx = 1 − e − mx
0 0
F ( x ) = 1 − e − mx
P( X < a ) − P( X < b ) = F ( a ) − F ( b )
2e −2 x 0<x
f (x) =
0 otherwise
( )
F ( 2 ) − F (1) = 1 − e −4 − 1 − e −2 = 0.9817 − 0.8647 = 0.117 .
(
F ( ∞ ) − F (3) = 1 − 1 − e −6 = 0.0025 . )
The probability that the next interval is less than 1.5 seconds is
( ) (
F (1.5) − F ( 0 ) = 1 − e −3 − 1 − e 0 = 0.9502 . )
16
www.vagupu.com
1
This exponential distribution will have a mean of = 0.5 seconds.
2
Therefore the average number of cars passing the point per second
1
is = 2.
0.5
( )
F ( ∞ ) − F ( a ) = 1 − 1 − e − ma = e − ma .
P( X > a + b X > a ) .
[ ]
P ( X > a + b ) ∩ ( X > a ) = P( X > a + b ) = e − m ( a + b )
17
www.vagupu.com
Chapter 1 Continuous Probability Distributions
Exercise 1C
1. The interval, T minutes, between successive 3. The lives of electric light bulbs, T hours, follows
telephone calls to a school office follows an an exponential distribution with probability
exponential distribution with parameter 0.2. distribution function
Find the probability that the interval between the
next two telephone calls will be 0.004e −0.004t 0<t
f (t ) =
(a) between 3 and 6 minutes, 0 otherwise
(b) between 2 and 7 muinutes, What is the probability that a bulb will last
(c) longer than 8 minutes, (a) between 200 and 300 hours,
(d) less than 10 minutes. (b) between 250 and 350 hours,
2. A factory worker is employed to watch a monitor (c) more than 400 hours
and give a warning when the monitor signals that
(d) less than 100 hours?
action is needed to adjust the process. The
interval, X hours, between successive signals What is the mean and standard deviation of T?
follows an exponential distribution with What is the mean and standard deviation of the
parameter 0.08. mean life of 225 bulbs?
What is the probability that the interval between Find, approximately, the probability that the
the next two signals is mean life of 225 bulbs will be less than
(a) between 10 and 20 hours, 240 hours.
(b) between 6 and 16 hours
(c) longer than 50 hours,
(d) less than 2 hours?
The factory worker decides to read a newspaper
for a few minutes instead of watching the
monitor. How long can she read for if the
probability of missing a signal is to be less
than 0.01?
18
www.vagupu.com
2. The Sunset Times has an average of 1 typing (a) Find the probability that the total mass of the
error per 500 words. The distribution of X, the sealed tin and its beans
number of words between successive typing
(i) exceeds 550 g,
errors, has probability density function
(ii) lies between 466 g and 575 g.
0.002e −0.002x 0<x
f (x) =
(b) Calculate an interval within which
0 otherwise
approximately 90% of the masses of the filled
Find the probability that the number of words tins will lie.
between successive errors is
The tins are packed in boxes of 24, the mass
(a) between 400 and 600, of the box being normally distributed with
(b) between 200 and 800, mean 500 g and standard deviation 30 g.
(c) greater than 1000, (c) Find the probability that a full box weighs
less than 12.75 kg.
(d) less than 100.
5. Lin Ying belongs to an athletic club. In 800 m
What is the probability that a reader starting at a races her times are normally distributed with
random point in the paper will read more than mean 128 seconds and standard deviation
750 words before reaching an error? 4 seconds.
The editor, Mr B. Pad, always reads the first (a) What is the probability of her time in an 800
paragraph before the paper is published. How being between 120 and 130 seconds?
many words can the first paragraph contain if
the probability of it containing an error is to be (b) What time will she beat in 70% of her races?
not more than 0.05? Julie belongs to the same club. In 800 m races
3. The thickness of a certain grade of hardboard 85% of her times are less than 140 seconds and
stocked by a DIY shop is normally distributed 70% are less than 135 seconds. Her times are
with mean 7.3 mm and standard deviation normally distributed.
0.5 mm.
(c) Find the mean and standard deviation of
(a) What proportion of sheets will be between Julie's times, each correct to two significant
7 mm and 8 mm thick? figures.
(b) Sheets of the same grade bought from a (d) What is the probability that in an 800 m race
second shop contain 9.1% over 8 mm thick Lin Ying will beat Julie?
and 2.3% less than 7 mm thick. Assuming
that the thickness is normally distributed, (e) The club has to choose one of these two
find its mean and standard deviation correct athletes to enter a major competition. In
to the nearest tenth of a mm. order to qualify for the final rounds it is
necessary to achieve a time of 114 seconds or
(c) What is the distribution of Y – X where X and less in the heats. Which athlete should be
Y are the thickness of pieces of hardboard chosen and why?
selected at random from the first and second
shops respectively? Find the probability that 6. The contents of bags of oats are normally
X exceeds Y. distributed with mean 3.05 kg, standard deviation
0.08 kg.
(d) It is possible to buy batches of hardboard
from the first shop with any required mean (a) What proportion of bags contain less than
and with the standard deviation remaining 3.11 kg?
0.5 mm. What value of the mean should be (b) What proportion of bags contain between 3.00
chosen and 3.15 kg?
(i) to minimise the proportion of sheets (c) What weight is exceeded by the contents of
outside the range 7 mm to 8 mm (no 99.9% of the bags?
proof required),
(ii) so that 0.1% of sheets are less than (d) If 6 bags are selected at random, what is the
7 mm thick? probability that the mean weight of the
contents will be between 3.00 and 3.15 kg?
4. A certain brand of beans is sold in tins, the tins
being filled and sealed by a machine. The mass The weight of the bags when empty is normally
of beans in each tin is normally distributed with distributed with mean 0.12 kg, standard deviation
mean 425 g and standard deviation 25 g and the 0.02 kg. Full bags are packed into boxes each of
mass of the tin is normally distributed with mean which holds 6 bags.
90 g and standard deviation 10 g.
19
www.vagupu.com
Chapter 1 Continuous Probability Distributions
20