CH 4
1. Geometric Series:
$$1 + r + r^2 + r^3 + \dots = \sum_{x=0}^{\infty} r^x = \frac{1}{1-r}, \quad \text{when } |r| < 1.$$
This formula proves that $\sum_{x=0}^{\infty} P(X = x) = 1$ when $X \sim \text{Geometric}(p)$:
$$
\begin{aligned}
\sum_{x=0}^{\infty} P(X = x) &= \sum_{x=0}^{\infty} p(1-p)^x \\
&= p \sum_{x=0}^{\infty} (1-p)^x \\
&= \frac{p}{1-(1-p)} \qquad (\text{because } |1-p| < 1) \\
&= 1.
\end{aligned}
$$
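Both identities are easy to check numerically. The sketch below (the values of $r$ and $p$ are arbitrary illustrative choices, not from the notes) truncates each series and compares it with the closed form:

```python
# Truncated-sum check of the geometric series identity
# sum_{x=0}^inf r^x = 1/(1-r) for |r| < 1 (r values chosen arbitrarily).
for r in (0.5, -0.7, 0.99):
    truncated = sum(r**x for x in range(2000))
    assert abs(truncated - 1 / (1 - r)) < 1e-4, r

# The same identity makes the Geometric(p) probabilities sum to 1.
p = 0.3
total = sum(p * (1 - p)**x for x in range(1000))
assert abs(total - 1.0) < 1e-12
print("geometric series checks pass")
```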
$$= e^{-\lambda}\, e^{\lambda}$$
$$= 1.$$
Note: Another useful identity is: $e^{\lambda} = \lim_{n \to \infty} \left(1 + \dfrac{\lambda}{n}\right)^n$ for $\lambda \in \mathbb{R}$.
The name probability generating function also gives us another clue to the role
of the PGF. The PGF can be used to generate all the probabilities of the
distribution. This is generally tedious and is not often an efficient way of
calculating probabilities. However, the fact that it can be done demonstrates
that the PGF tells us everything there is to know about the distribution.
$$G_X(s) = E\big(s^X\big) = \sum_{x=0}^{\infty} s^x\, P(X = x).$$
2. $G_X(1) = 1$: $\quad G_X(1) = \displaystyle\sum_{x=0}^{\infty} 1^x\, P(X = x) = \sum_{x=0}^{\infty} P(X = x) = 1.$
$X \sim \text{Bin}(n = 4,\ p = 0.2)$

[Figure: the PGF $G(s)$ of $X \sim \text{Bin}(4,\,0.2)$, plotted for $-20 \le s \le 10$.]

Check $G_X(0)$:
$$G_X(0) = (p \cdot 0 + q)^n = q^n = P(X = 0).$$
Check $G_X(1)$:
$$G_X(1) = (p \cdot 1 + q)^n = 1^n = 1.$$
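Both checks can be confirmed numerically by evaluating the PGF directly from its definition $G_X(s) = \sum_x s^x P(X=x)$ (an illustrative sketch using the example's values $n = 4$, $p = 0.2$):

```python
from math import comb

# Evaluate the PGF of X ~ Bin(n=4, p=0.2) directly from its definition,
# and confirm G_X(0) = q^n and G_X(1) = 1.
n, p = 4, 0.2
q = 1 - p

def pgf(s):
    return sum(s**x * comb(n, x) * p**x * q**(n - x) for x in range(n + 1))

assert abs(pgf(0) - q**n) < 1e-12                  # G_X(0) = q^n = P(X = 0)
assert abs(pgf(1) - 1.0) < 1e-12                   # G_X(1) = 1
assert abs(pgf(0.5) - (p * 0.5 + q)**n) < 1e-12    # matches (ps + q)^n
print("binomial PGF checks pass")
```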
$$G_X(s) = \sum_{x=0}^{\infty} s^x\, p(1-p)^x = p \sum_{x=0}^{\infty} (qs)^x = \frac{p}{1-qs} \quad \text{for all } s \text{ such that } |qs| < 1,$$
where $q = 1 - p$.

[Figure: the PGF $G(s) = p/(1-qs)$, plotted for $-5 \le s \le 5$.]

Thus
$$G_X(s) = \frac{p}{1-qs} \quad \text{for } |s| < \frac{1}{q}.$$
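A quick numerical comparison of the truncated power series against the closed form $p/(1-qs)$, at several $s$ inside the radius $1/q$ (the value $p = 0.4$ is an arbitrary illustrative choice):

```python
# Compare truncated sums of the Geometric(p) PGF with p/(1 - qs)
# at a few s values with |qs| < 1 (here 1/q = 5/3).
p = 0.4
q = 1 - p
for s in (-1.2, 0.0, 0.9, 1.5):
    series = sum(s**x * p * q**x for x in range(500))
    closed = p / (1 - q * s)
    assert abs(series - closed) < 1e-9, (s, series, closed)
print("Geometric PGF series matches p/(1-qs)")
```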
The probability generating function gets its name because the power series can
be expanded and differentiated to reveal the individual probabilities. Thus,
given only the PGF $G_X(s) = E(s^X)$, we can recover all probabilities $P(X = x)$.
For shorthand, write $p_x = P(X = x)$. Then
$$G_X(s) = E(s^X) = \sum_{x=0}^{\infty} p_x s^x = p_0 + p_1 s + p_2 s^2 + p_3 s^3 + p_4 s^4 + \dots$$
Thus $p_2 = P(X = 2) = \frac{1}{2} G_X''(0)$.
Third derivative: $G_X'''(s) = (3 \cdot 2 \cdot 1)\, p_3 + (4 \cdot 3 \cdot 2)\, p_4\, s + \dots$
Thus $p_3 = P(X = 3) = \frac{1}{3!} G_X'''(0)$.
In general:
$$p_n = P(X = n) = \frac{1}{n!} G_X^{(n)}(0) = \frac{1}{n!} \left[ \frac{d^n}{ds^n} G_X(s) \right]_{s=0}.$$
s
Example: Let X be a discrete random variable with PGF GX (s) = (2 + 3s2 ).
5
Find the distribution of X.
2 3 3
GX (s) = s + s : GX (0) = P(X = 0) = 0.
5 5
2 9 2 2
GX (s) = + s : GX (0) = P(X = 1) = .
5 5 5
18 1
GX (s) = s : G (0) = P(X = 2) = 0.
5 2 X
18 1 3
G
X (s) = : GX (0) = P(X = 3) = .
5 3! 5
(r) 1 (r)
GX (s) = 0 r 4 : G (s) = P(X = r) = 0 r 4.
r! X
Thus
1 with probability 2/5,
X=
3 with probability 3/5.
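As a sanity check, the recovered distribution can be compared against the original PGF numerically (an illustrative sketch, not part of the notes):

```python
# The distribution recovered above: P(X=1) = 2/5, P(X=3) = 3/5.
pmf = {1: 2/5, 3: 3/5}

def pgf(s):
    return sum(s**x * px for x, px in pmf.items())

# E(s^X) should reproduce G_X(s) = (s/5)(2 + 3s^2) at every s.
for s in (-1.0, 0.0, 0.3, 1.0, 2.0):
    assert abs(pgf(s) - (s / 5) * (2 + 3 * s**2)) < 1e-12, s
print("recovered distribution matches the PGF")
```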
Fact: If two power series agree on any interval containing 0, however small, then
all terms of the two series are equal.
Practical use: If we can show that two random variables have the same PGF in
some interval containing 0, then we have shown that the two random variables
have the same distribution.
Another way of expressing this is to say that the PGF of X tells us everything
there is to know about the distribution of X .
As well as calculating probabilities, we can also use the PGF to calculate the
moments of the distribution of X. The moments of a distribution are the mean,
variance, etc.
Theorem 4.4: Let X be a discrete random variable with PGF $G_X(s)$. Then:

1. $E(X) = G_X'(1)$.

2. $E\big\{X(X-1)(X-2)\dots(X-k+1)\big\} = G_X^{(k)}(1) = \left.\dfrac{d^k G_X(s)}{ds^k}\right|_{s=1}.$

(This is the $k$th factorial moment of $X$.)
1. $G_X(s) = \displaystyle\sum_{x=0}^{\infty} s^x p_x$, so $G_X'(s) = \displaystyle\sum_{x=1}^{\infty} x s^{x-1} p_x$, and hence $G_X'(1) = \displaystyle\sum_{x=1}^{\infty} x\, p_x = E(X)$.

2. $$G_X^{(k)}(s) = \frac{d^k G_X(s)}{ds^k} = \sum_{x=k}^{\infty} x(x-1)(x-2)\dots(x-k+1)\, s^{x-k}\, p_x,$$
so
$$G_X^{(k)}(1) = \sum_{x=k}^{\infty} x(x-1)(x-2)\dots(x-k+1)\, p_x = E\big\{X(X-1)(X-2)\dots(X-k+1)\big\}.$$
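Theorem 4.4(2) can be checked numerically, say for $k = 2$ and $X \sim \text{Bin}(6,\,0.3)$ (arbitrary illustrative values): a finite-difference approximation to $G_X''(1)$, using the closed-form PGF $(ps+q)^n$, should match $E\{X(X-1)\}$ computed directly from the pmf.

```python
from math import comb

# Check E{X(X-1)} = G''(1) for X ~ Bin(n=6, p=0.3).
n, p = 6, 0.3
q = 1 - p
pmf = [comb(n, x) * p**x * q**(n - x) for x in range(n + 1)]

def G(s):
    return (p * s + q)**n          # closed-form binomial PGF

h = 1e-4
second_deriv = (G(1 + h) - 2 * G(1) + G(1 - h)) / h**2   # central difference
fact_moment = sum(x * (x - 1) * pmf[x] for x in range(n + 1))

assert abs(second_deriv - fact_moment) < 1e-4
assert abs(fact_moment - n * (n - 1) * p**2) < 1e-12     # known closed form
print(round(second_deriv, 6), fact_moment)
```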
Solution:
$$G_X(s) = e^{\lambda(s-1)}.$$

[Figure: $G(s) = e^{\lambda(s-1)}$, plotted for $0 \le s \le 1.5$.]

$$E(X) = G_X'(1) = \lambda.$$
For the variance, consider
$$E\big\{X(X-1)\big\} = G_X''(1) = \lambda^2 e^{\lambda(s-1)} \Big|_{s=1} = \lambda^2.$$
So
$$
\begin{aligned}
\text{Var}(X) &= E(X^2) - (EX)^2 \\
&= E\big\{X(X-1)\big\} + EX - (EX)^2 \\
&= \lambda^2 + \lambda - \lambda^2 \\
&= \lambda.
\end{aligned}
$$
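These two results can be verified with finite-difference derivatives of $G_X(s) = e^{\lambda(s-1)}$ (the value $\lambda = 2.5$ is an arbitrary illustrative choice):

```python
from math import exp

# Numerically check E(X) = G'(1) = lam and Var(X) = lam for Poisson(lam).
lam = 2.5

def G(s):
    return exp(lam * (s - 1))

h = 1e-5
G1 = (G(1 + h) - G(1 - h)) / (2 * h)              # ~ G'(1)  = E(X)
G2 = (G(1 + h) - 2 * G(1) + G(1 - h)) / h**2      # ~ G''(1) = E{X(X-1)}
var = G2 + G1 - G1**2                             # Var(X) = G''(1) + EX - (EX)^2

assert abs(G1 - lam) < 1e-6
assert abs(var - lam) < 1e-4
print(round(G1, 6), round(var, 6))
```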
One of the PGF's greatest strengths is that it turns a sum into a product:
$$E\big(s^{X_1 + X_2}\big) = E\big(s^{X_1}\, s^{X_2}\big).$$
This makes the PGF useful for finding the probabilities and moments of a sum of independent random variables.
If $X \sim \text{Poisson}(\lambda)$ and $Y \sim \text{Poisson}(\mu)$ are independent, then
$$G_{X+Y}(s) = G_X(s)\, G_Y(s) = e^{\lambda(s-1)}\, e^{\mu(s-1)} = e^{(\lambda+\mu)(s-1)}.$$
But this is the PGF of the Poisson($\lambda + \mu$) distribution. So, by the uniqueness of PGFs, $X + Y \sim \text{Poisson}(\lambda + \mu)$.
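The same conclusion can be checked without PGFs by convolving the two pmfs directly (the values $\lambda = 1.5$, $\mu = 2.0$ are arbitrary illustrative choices):

```python
from math import exp, factorial

# If X ~ Poisson(lam) and Y ~ Poisson(mu) are independent, the pmf of
# X + Y is the convolution of the two pmfs; it should equal Poisson(lam+mu).
lam, mu = 1.5, 2.0

def pois(k, rate):
    return exp(-rate) * rate**k / factorial(k)

for t in range(10):
    conv = sum(pois(x, lam) * pois(t - x, mu) for x in range(t + 1))
    assert abs(conv - pois(t, lam + mu)) < 1e-12, t
print("Poisson convolution matches Poisson(lam+mu)")
```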
Proof:
$$
\begin{aligned}
G_{T_N}(s) = E\big(s^{X_1 + \dots + X_N}\big)
&= E_N\Big\{ E\big(s^{X_1 + \dots + X_N} \,\big|\, N\big) \Big\} \quad \text{(conditional expectation)} \\
&= E_N\Big\{ E\big(s^{X_1} \dots s^{X_N} \,\big|\, N\big) \Big\} \\
&= E_N\Big\{ E\big(s^{X_1} \dots s^{X_N}\big) \Big\} \quad (X_i\text{'s are indept of } N) \\
&= E_N\Big\{ E\big(s^{X_1}\big) \dots E\big(s^{X_N}\big) \Big\} \quad (X_i\text{'s are indept of each other}) \\
&= E_N\big\{ (G_X(s))^N \big\} \\
&= G_N\big(G_X(s)\big) \quad \text{(by definition of } G_N\text{)}.
\end{aligned}
$$
$$E(T_N) = G_{T_N}'(1) = \left.\frac{d}{ds}\, G_N\big(G_X(s)\big)\right|_{s=1} = \Big. G_N'\big(G_X(s)\big)\, G_X'(s) \Big|_{s=1} = G_N'(1)\, G_X'(1) = E(N)\, E(X_1),$$
using $G_X(1) = 1$.
Solution:
$$\text{Let } X_i = \begin{cases} 1 & \text{if the heron catches a fish on visit } i, \\ 0 & \text{otherwise.} \end{cases}$$
Then $T = X_1 + X_2 + \dots + X_N$ (randomly stopped sum), so
$$G_T(s) = G_N\big(G_X(s)\big).$$
Now
$$G_X(s) = E(s^X) = s^0\, P(X = 0) + s^1\, P(X = 1) = 1 - p + ps.$$
Also,
$$G_N(r) = \sum_{n=0}^{\infty} r^n\, P(N = n) = \sum_{n=0}^{\infty} r^n (1-\theta)\theta^n = (1-\theta) \sum_{n=0}^{\infty} (\theta r)^n = \frac{1-\theta}{1-\theta r} \qquad (r < 1/\theta).$$
So
$$G_T(s) = \frac{1-\theta}{1 - \theta\, G_X(s)} \qquad (\text{putting } r = G_X(s)),$$
giving:
$$G_T(s) = \frac{1-\theta}{1 - \theta(1 - p + ps)} = \frac{1-\theta}{1 - \theta + \theta p - \theta p s}$$

[could this be Geometric? $G_T(s) = \dfrac{1-\alpha}{1-\alpha s}$ for some $\alpha$?]

$$
\begin{aligned}
G_T(s) &= \frac{1-\theta}{(1 - \theta + \theta p) - \theta p s} \\[4pt]
&= \frac{\left(\dfrac{1-\theta}{1 - \theta + \theta p}\right)}{\dfrac{(1 - \theta + \theta p) - \theta p s}{1 - \theta + \theta p}} \\[4pt]
&= \frac{\left(\dfrac{1-\theta}{1 - \theta + \theta p}\right)}{1 - \left(\dfrac{\theta p}{1 - \theta + \theta p}\right) s}\,,
\end{aligned}
$$
using $\dfrac{(1 - \theta + \theta p) - \theta p}{1 - \theta + \theta p} = \dfrac{1-\theta}{1 - \theta + \theta p}$.

This is the PGF of the Geometric$\left(1 - \dfrac{\theta p}{1 - \theta + \theta p}\right)$ distribution, so by uniqueness of PGFs, we have:
$$T \sim \text{Geometric}\left(\frac{1-\theta}{1 - \theta + \theta p}\right).$$
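This result is easy to test by simulation. The sketch below (the values $\theta = 0.8$, $p = 0.4$ are arbitrary, and `sample_T` is a helper written for this check, not part of the notes) draws many randomly stopped sums and compares the empirical pmf of $T$ with the Geometric pmf $\pi(1-\pi)^t$ where $\pi = (1-\theta)/(1-\theta+\theta p)$:

```python
import random

random.seed(42)
theta, p = 0.8, 0.4
# Predicted parameter from the PGF derivation:
pi = (1 - theta) / (1 - theta + theta * p)

def sample_T():
    """One realisation of T: N ~ Geometric(1-theta) visits, Bernoulli(p) catches."""
    n = 0
    while random.random() < theta:   # P(N = n) = (1 - theta) * theta**n
        n += 1
    return sum(random.random() < p for _ in range(n))

trials = 100_000
counts = {}
for _ in range(trials):
    t = sample_T()
    counts[t] = counts.get(t, 0) + 1

# Compare the empirical pmf with the Geometric(pi) pmf pi*(1-pi)**t.
for t in range(4):
    observed = counts.get(t, 0) / trials
    expected = pi * (1 - pi)**t
    assert abs(observed - expected) < 0.01, (t, observed, expected)
print("T matches Geometric(pi) with pi =", round(pi, 4))
```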
Here are the first few steps of solving the heron problem without the PGF.
Recall the problem:
Let $N \sim \text{Geometric}(1-\theta)$, so $P(N = n) = (1-\theta)\theta^n$;
Let $X_1, X_2, \dots$ be independent of each other and of $N$, with $X_i \sim \text{Binomial}(1, p)$
(remember $X_i = 1$ with probability $p$, and $0$ otherwise);
Let T = X1 + . . . + XN be the randomly stopped sum;
Find the distribution of T .
Without using the PGF, we would tackle this by looking for an expression for
P(T = t) for any t. Once we have obtained that expression, we might be able
to see that T has a distribution we recognise (e.g. Geometric), or otherwise we
would just state that T is defined by the probability function we have obtained.
$$P(T = t) = \sum_{n=0}^{\infty} P(T = t \mid N = n)\, P(N = n). \qquad (\star)$$
Back to the heron problem: we are lucky in this case that we know the distribution of $(T \mid N = n)$ is Binomial$(n, p)$, so
$$P(T = t \mid N = n) = \binom{n}{t}\, p^t (1-p)^{n-t} \quad \text{for } t = 0, 1, \dots, n.$$
Thus
$$
\begin{aligned}
P(T = t) &= \sum_{n=t}^{\infty} \binom{n}{t}\, p^t (1-p)^{n-t}\, (1-\theta)\theta^n \\
&= (1-\theta) \left(\frac{p}{1-p}\right)^t\, \sum_{n=t}^{\infty} \binom{n}{t} \big[\theta(1-p)\big]^n \qquad (\star\star) \\
&= \dots ?
\end{aligned}
$$
As it happens, we can evaluate the sum in $(\star\star)$ using the fact that Negative Binomial probabilities sum to 1. You can try this if you like, but it is quite tricky. [Hint: use the Negative Binomial$\big(t+1,\ 1-\theta(1-p)\big)$ distribution.]
Overall, we obtain the same answer that $T \sim \text{Geometric}\left(\dfrac{1-\theta}{1-\theta+\theta p}\right)$, but hopefully you can see why the PGF is so useful.
We have been using PGFs throughout this chapter without paying much attention to their mathematical properties. For example, are we sure that the power series $G_X(s) = \sum_{x=0}^{\infty} s^x P(X = x)$ converges? Can we differentiate and integrate the infinite power series term by term as we did in Section 4.4? When we said in Section 4.4 that $E(X) = G_X'(1)$, can we be sure that $G_X(1)$ and its derivative $G_X'(1)$ even exist?
(No general statement is made about what happens when |s| = R.)
Note: This gives us the surprising result that the set of $s$ for which the PGF $G_X(s)$ converges is symmetric about 0: the PGF converges for all $s \in (-R, R)$, and for no $s < -R$ or $s > R$.
This is surprising because the PGF itself is not usually symmetric about 0: i.e. $G_X(-s) \neq G_X(s)$ in general.
As in Section 4.2,
$$G_X(s) = \sum_{x=0}^{\infty} s^x\, (0.8)(0.2)^x = 0.8 \sum_{x=0}^{\infty} (0.2s)^x = \frac{0.8}{1 - 0.2s} \quad \text{for all } s \text{ such that } |0.2s| < 1.$$
This is valid for all $s$ with $|0.2s| < 1$, so it is valid for all $s$ with $|s| < \frac{1}{0.2} = 5$ (i.e. $-5 < s < 5$).
The radius of convergence is $R = 5$.
The figure shows the PGF of the Geometric($p = 0.8$) distribution, with its radius of convergence $R = 5$. Note that although the convergence set $(-5, 5)$ is symmetric about 0, the function $G_X(s) = p/(1-qs) = 4/(5-s)$ is not.

[Figure: $G_X(s) = 4/(5-s)$, plotted for $-5 < s < 5$.]
Radius of Convergence
At the positive end, as $s \uparrow 5$, both $G_X(s)$ and $p/(1-qs)$ approach infinity. So the PGF is (left)-continuous at $+R$:
$$\lim_{s \uparrow 5} G_X(s) = G_X(5) = \infty.$$
As in Section 4.2,
$$G_X(s) = \sum_{x=0}^{n} s^x \binom{n}{x} p^x q^{n-x} = \sum_{x=0}^{n} \binom{n}{x} (ps)^x q^{n-x} = (ps + q)^n,$$
by the Binomial Theorem.
Abel's theorem states that this sort of effect can never happen at $s = 1$ (or at $+R$). In particular, $G_X(s)$ is always left-continuous at $s = 1$:
$$\lim_{s \uparrow 1} G_X(s) = G_X(1) \quad \text{always, even if } G_X(1) = \infty.$$
Note: Remember that the radius of convergence $R \ge 1$ for any PGF, so Abel's Theorem means that even in the worst-case scenario when $R = 1$, we can still trust that the PGF will be continuous at $s = 1$. (By contrast, we cannot be sure that the PGF will be continuous at the lower limit $-R$.)
Abel's Theorem means that for any PGF, we can write $G_X(1)$ as shorthand for $\lim_{s \uparrow 1} G_X(s)$.
It also clarifies our proof that $E(X) = G_X'(1)$ from Section 4.4. If we assume that term-by-term differentiation is allowed for $G_X(s)$ (see below), then the proof on page 81 gives:
$$G_X(s) = \sum_{x=0}^{\infty} s^x p_x,$$
$$\text{so } G_X'(s) = \sum_{x=1}^{\infty} x s^{x-1} p_x \quad \text{(term-by-term differentiation: see below)}.$$
Abel's Theorem then guarantees that
$$E(X) = \sum_{x=1}^{\infty} x\, p_x = G_X'(1) = \lim_{s \uparrow 1} G_X'(s),$$
whether or not this limit is finite.
We have stated that the PGF converges for all $|s| < R$ for some $R$. In fact, the probability generating function converges absolutely if $|s| < R$. Absolute convergence is stronger than convergence alone: it means that the sum of absolute values, $\sum_{x=0}^{\infty} \big|s^x P(X = x)\big|$, also converges. When two series both converge absolutely, the product series also converges absolutely. This guarantees that $G_X(s) \cdot G_Y(s)$ is absolutely convergent for any two random variables $X$ and $Y$. This is useful because $G_X(s) \cdot G_Y(s) = G_{X+Y}(s)$ if $X$ and $Y$ are independent.
The PGF also converges uniformly on any set $\{s : |s| \le R'\}$ where $R' < R$. Intuitively, this means that the speed of convergence does not depend upon the value of $s$. Thus a value $n_0$ can be found such that for all values of $n \ge n_0$, the finite sum $\sum_{x=0}^{n} s^x P(X = x)$ is simultaneously close to the converged value $G_X(s)$, for all $s$ with $|s| \le R'$. In mathematical notation: $\forall \epsilon > 0$, $\exists n_0 \in \mathbb{Z}$ such that $\forall s$ with $|s| \le R'$, and $\forall n \ge n_0$,
$$\left| \sum_{x=0}^{n} s^x P(X = x) - G_X(s) \right| < \epsilon.$$
Fact: Let $G_X(s) = E(s^X) = \sum_{x=0}^{\infty} s^x P(X = x)$, and let $|s| < R$.

1. $$G_X'(s) = \frac{d}{ds} \sum_{x=0}^{\infty} s^x P(X = x) = \sum_{x=0}^{\infty} \frac{d}{ds}\big(s^x P(X = x)\big) = \sum_{x=1}^{\infty} x s^{x-1} P(X = x).$$
(term by term differentiation).

2. $$\int_a^b G_X(s)\, ds = \int_a^b \sum_{x=0}^{\infty} s^x P(X = x)\, ds = \sum_{x=0}^{\infty} \int_a^b s^x P(X = x)\, ds = \sum_{x=0}^{\infty} P(X = x) \left[\frac{s^{x+1}}{x+1}\right]_a^b \quad \text{for } -R < a < b < R.$$
(term by term integration).
The transition diagram below shows the symmetric random walk (all transitions have probability $p = 1/2$):

[Transition diagram: states $\dots, -2, -1, 0, 1, 2, 3, \dots$, with arrows of probability $1/2$ between each pair of neighbouring states.]
Question:
What is the key difference between the random walk and the gambler's ruin?
The random walk has an INFINITE state space: it never stops. The gambler's ruin stops at both ends.
However, there is a new and very useful piece of information that the PGF can
tell us quickly and easily:
what is the probability that we NEVER reach state j , starting from state i?
For example, imagine that the random walk represents the share value for an
investment. The current share price is i dollars, and we might decide to sell
when it reaches j dollars. Knowing how long this might take, and whether there
is a chance we will never succeed, is fundamental to managing our investment.
To tackle this problem, we define the random variable $T$ to be the time taken (number of steps) to reach state $j$, starting from state $i$. We find the PGF of $T$, and then use the PGF to discover $P(T = \infty)$. If $P(T = \infty) > 0$, there is a positive chance that we will NEVER reach state $j$, starting from state $i$.
We will see how to determine the probability of never reaching our goal in
Section 4.11. First we will see how to calculate the PGF of a reaching time T
in the random walk.
Solution:
Let $Y_n$ be the step taken at time $n$: up or down. For the symmetric random walk,
$$Y_n = \begin{cases} +1 & \text{with probability } 0.5, \\ -1 & \text{with probability } 0.5, \end{cases}$$
and $Y_1, Y_2, \dots$ are independent.
Recall $T_{ij}$ = number of steps to get from state $i$ to state $j$ for any $i, j$, and let $H(s) = E\big(s^{T_{01}}\big)$. Conditioning on the first step,
$$H(s) = \frac{1}{2}\Big\{ E\big(s^{T_{01}} \mid Y_1 = 1\big) + E\big(s^{T_{01}} \mid Y_1 = -1\big) \Big\}. \qquad (\star)$$
But $T_{-1,1} = T_{-1,0} + T_{01}$, because the process must pass through 0 to get from $-1$ to $1$.
Now $T_{-1,0}$ and $T_{01}$ are independent (Markov property). Also, they have the same distribution because the process is translation invariant (i.e. all states are the same):

[Transition diagram: states $\dots, -2, -1, 0, 1, 2, 3, \dots$, each transition with probability $1/2$.]
Thus
$$
\begin{aligned}
E\big(s^{T_{01}} \mid Y_1 = -1\big) &= E\big(s^{1 + T_{-1,1}}\big) \\
&= E\big(s^{1 + T_{-1,0} + T_{0,1}}\big) \\
&= s\, E\big(s^{T_{-1,0}}\big)\, E\big(s^{T_{01}}\big) \quad \text{by independence} \\
&= s\,(H(s))^2 \quad \text{because identically distributed.}
\end{aligned}
$$
Thus
$$H(s) = \frac{1}{2}\big(s + s(H(s))^2\big) \qquad \text{by } (\star),$$
so
$$s H(s)^2 - 2H(s) + s = 0.$$
Solving this quadratic in $H(s)$, and choosing the root for which $H(0) = P(T_{01} = 0) = 0$, gives
$$H(s) = \frac{1 - \sqrt{1 - s^2}}{s}.$$
$$H(s) = \tfrac{1}{2}\, s + \tfrac{1}{2}\, s\, H(s)^2.$$
Thus:
$$s H(s)^2 - 2H(s) + s = 0.$$
Solve the quadratic and select the correct root as before, to get
$$H(s) = \frac{1 - \sqrt{1 - s^2}}{s} \quad \text{for } |s| < 1.$$
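It is easy to verify numerically that this $H(s)$ satisfies the first-step equation $H(s) = \frac{1}{2}s + \frac{1}{2}sH(s)^2$ (an illustrative check, not part of the notes):

```python
from math import sqrt

# Check that H(s) = (1 - sqrt(1 - s^2))/s solves H = s/2 + s*H^2/2,
# the equation obtained by conditioning on the first step.
def H(s):
    return (1 - sqrt(1 - s**2)) / s

for s in (0.1, 0.5, 0.9, 0.99):
    assert abs(H(s) - (0.5 * s + 0.5 * s * H(s)**2)) < 1e-12, s

# H(s) approaches 1 as s approaches 1, consistent with T01 not being
# defective (H(1) = 1).
assert abs(H(0.999999) - 1) < 0.01
print("H(s) solves the first-step equation")
```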
In other cases, we will always reach state j eventually, starting from state i.
Thinking of $\sum_{t=0}^{\infty} P(T = t)$ as $1 - P(T = \infty)$: in other words,
$$\sum_{t=0}^{\infty} P(T = t) = 1 - P(T = \infty).$$
The term for $P(T = \infty)\, s^{\infty}$ is missed out. The PGF is defined as the generating function of the probabilities for finite values only.
$E(s^T) = H(s)$ for $|s| < 1$ because the missing term is zero: i.e. because $s^{\infty} = 0$ when $|s| < 1$.
$E(s^T)$ is NOT left-continuous at $s = 1$. There is a sudden leap (discontinuity) at $s = 1$, because $s^{\infty} = 0$ as $s \uparrow 1$, but $s^{\infty} = 1$ when $s = 1$.
We test whether $T$ is defective by testing whether or not $E(s^T)$ "jumps off the train": that is, we test whether or not $H(s)$ is equal to $E(s^T)$ when $s = 1$.
Let $H(s) = \sum_{t=0}^{\infty} s^t P(T = t)$ be the power series representing the PGF of $T$ for $|s| < 1$. Then $T$ is defective if and only if $H(1) < 1$.
1. We want to know the probability that we will NEVER reach state $j$, starting from state $i$.
2. Define $T$ to be the random variable giving the number of steps taken to get from state $i$ to state $j$.
3. The event that we never reach state $j$, starting from state $i$, is the same as the event that $T = \infty$. (If we wait an infinite length of time, we never get there.) So
$$P(\text{never reach state } j \mid \text{start at state } i) = P(T = \infty).$$
4. Find $H(s) = \sum_{t=0}^{\infty} s^t P(T = t)$, using a calculation like the one we did in Section 4.9. $H(s)$ is the PGF of $T$ for $|s| < 1$. We only need to find it for $|s| < 1$. The calculation in Section 4.9 only works for $|s| \le 1$, because the expectations are infinite or undefined when $|s| > 1$.
5. The random variable $T$ is defective if and only if $H(1) < 1$.
6. If $H(1) < 1$, then the probability that $T$ takes the value $\infty$ is the missing piece: $P(T = \infty) = 1 - H(1)$.
Overall:
$E(T)$ and $\text{Var}(T)$ cannot be found using the PGF when $T$ is defective: you will get the wrong answer.
When you are asked to find $E(T)$ in a context where $T$ might be defective, first check whether $T$ is defective; if it is, then $E(T) = \infty$ automatically, because $T$ takes the value $\infty$ with positive probability.
In the random walk in Section 4.9, we defined the first reaching time T01 as the
number of steps taken to get from state 0 to state 1.
Questions:
a) What is the probability that we never reach state 1, starting from state 0?
Solutions:
a) We need to know whether T01 is defective.
T01 is defective if and only if H(1) < 1.
Now
$$H(1) = \frac{1 - \sqrt{1 - 1^2}}{1} = 1.$$
So $T_{01}$ is not defective.
Thus
P(never reach state 1 | start from state 0) = 0.
We will DEFINITELY reach state 1 eventually, even if it takes a very long time.
b) Because $T_{01}$ is not defective, we can find $E(T_{01})$ by differentiating the PGF: $E(T_{01}) = H'(1)$.
$$H(s) = \frac{1 - \sqrt{1 - s^2}}{s} = s^{-1} - \big(s^{-2} - 1\big)^{1/2}.$$
So
$$H'(s) = -s^{-2} - \tfrac{1}{2}\big(s^{-2} - 1\big)^{-1/2}\big({-2s^{-3}}\big) = -\frac{1}{s^2} + \frac{1}{s^3 \sqrt{\dfrac{1}{s^2} - 1}}.$$
Thus
$$E(T_{01}) = \lim_{s \uparrow 1} H'(s) = \lim_{s \uparrow 1} \left( -\frac{1}{s^2} + \frac{1}{s^3 \sqrt{\dfrac{1}{s^2} - 1}} \right) = \infty.$$
So the expected number of steps to reach state 1 starting from state 0 is infinite:
$$E(T_{01}) = \infty.$$
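We can watch this limit diverge numerically by evaluating $H'(s)$ ever closer to $s = 1$ (an illustrative sketch):

```python
from math import sqrt

# H'(s) = -1/s^2 + 1/(s^3 * sqrt(1/s^2 - 1)) for the symmetric walk.
# As s increases toward 1, H'(s) grows without bound: E(T01) = infinity.
def Hprime(s):
    return -1 / s**2 + 1 / (s**3 * sqrt(1 / s**2 - 1))

values = [Hprime(s) for s in (0.9, 0.99, 0.999, 0.9999)]
assert values == sorted(values)   # strictly increasing toward infinity
assert values[-1] > 50
print([round(v, 2) for v in values])
```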
This result is striking. Even though we will definitely reach state 1, the expected time to do so is infinite! In general, we can prove the following results for random walks with up-probability $p$ and down-probability $q = 1 - p$, starting from state 0:

  Property        Reach state 1?    P(T01 = infinity)    E(T01)
  p > q           Guaranteed        0                    finite
  p = q = 1/2     Guaranteed        0                    infinite
  p < q           Not guaranteed    > 0                  infinite