Mathematicalphysics PDF
Mathematicalphysics PDF
Physics
LICENSE, DISCLAIMER OF LIABILITY, AND LIMITED WARRANTY
By purchasing or using this book (the “Work”), you agree that this license
grants permission to use the contents contained herein, but does not give
you the right of ownership to any of the textual content in the book or own-
ership to any of the information or products contained in it. This license
does not permit uploading of the Work onto the Internet or on a network
(of any kind) without the written consent of the Publisher. Duplication or
dissemination of any text, code, simulations, images, etc. contained herein
is limited to and subject to licensing terms for the respective products, and
permission must be obtained from the Publisher or the owner of the con-
tent, etc., in order to reproduce or network any portion of the textual mate-
rial (in any media) that is contained in the Work.
Mercury Learning and Information (“MLI” or “the Publisher”) and any-
one involved in the creation, writing, or production of the companion disc,
accompanying algorithms, code, or computer programs (“the software”),
and any accompanying Web site or software of the Work, cannot and do
not warrant the performance or results that might be obtained by using
the contents of the Work. The author, developers, and the Publisher have
used their best efforts to insure the accuracy and functionality of the textual
material and/or programs contained in this package; we, however, make
no warranty of any kind, express or implied, regarding the performance
of these contents or programs. The Work is sold “as is” without warranty
(except for defective materials used in manufacturing the book or due to
faulty workmanship).
The author, developers, and the publisher of any accompanying content,
and anyone involved in the composition, production, and manufacturing of
this work will not be liable for damages of any kind arising out of the use of
(or the inability to use) the algorithms, source code, computer programs,
or textual material contained in this publication. This includes, but is not
limited to, loss of revenue or profit, or other incidental, physical, or conse-
quential damages arising out of the use of this Work.
The sole remedy in the event of a claim of any kind is expressly limited to
replacement of the book, and only at the discretion of the Publisher. The
use of “implied warranty” and certain “exclusions” vary from state to state,
and might not apply to the purchaser of this product.
Mathematical
Physics
An Introduction
All of our titles are available in digital format at authorcloudware.com and other
digital vendors. The sole obligation of Mercury Learning and Information to
the purchaser is to replace the book, based on defective materials or faulty
workmanship, but not based on the operation or functionality of the product.
CONTENTS
Preface������������������������������������������������������������������������������������ xiii
2 Elementary Functions................................................ 49
2.1 Binomial expansion....................................................... 50
2.2 Maclaurin series............................................................ 55
2.3 Taylor series................................................................... 58
2.4 Equilibrium points........................................................ 59
2.5 Definition of ex (the exponential function)................... 63
2.6 The inverse function of ex............................................. 63
2.7 Derivative of the exponential........................................ 64
2.8 Integration of exponentials........................................... 67
2.9 Hyperbolic functions..................................................... 68
2.10 Inverse functions........................................................... 69
2.11 Inverse hyperbolic functions......................................... 71
2.12 Exercises........................................................................ 72
2.13 Problems........................................................................ 75
5 Matrices.................................................................... 147
5.1 Matrix representation.................................................. 149
5.2 Solution of systems of equations................................. 153
5.3 Products....................................................................... 156
5.4 The identity matrix...................................................... 159
5.5 Symmetric and antisymmetric matrices..................... 160
5.6 Inverses........................................................................ 161
5.7 The inverse of a 2 × 2 matrix...................................... 165
5.8 Determinants............................................................... 167
5.9 Properties of determinants......................................... 169
5.10 Solution of 2 × 2 linear systems.................................. 174
5.11 Solution of 3 × 3 systems............................................ 177
5.12 Homogeneous systems................................................ 179
5.13 A formula for the inverse matrix................................. 179
5.14 Eigenvalues and Eigenvectors.................................... 181
5.15 Matrices as transformations........................................ 186
5.16 Extension: Transformation groups.............................. 191
5.17 Exercises...................................................................... 192
5.18 Problems...................................................................... 195
Index��������������������������������������������������������������������������� 515
PREFACE
The book differs from other introduction to mathematical methods
at this level in several important areas.
First, it does not follow the usual presentation of a description of
the theory followed by examples and exercises. Rather we use exam-
ples to introduce the theory. This approach is not new; it goes back
to the methods by which the scribes of Ancient Babylon learned
mathematics: by example problems and, to judge from the numbers
of surviving cuneiform tablets, lots of them!
Second, to help the reader digest the text, it is broken up into
quite short sections (often a page or so) followed by exercises. It may
be tempting to skip the exercises, (especially if one is used to doing
only a selection of “end-of-chapter” problems) on the grounds that
one can get through the book more quickly that way. This is true, in
the same sense that watching a film speeded up x8 will get to the end
more quickly, but it will be without much understanding of the plot.
There are optional additional exercises at the end of each chapter;
the ones in the text represent the minimum we think you need.
Third, however, we have tried to avoid too many “plug-and-
chug” exercises, that is, exercises which you solve by following the
text but substituting some different numbers (“pattern matching”).
These are useful to reinforce memory, but they are not very useful
to develop or test understanding. Rather, we have tried to make the
exercises diagnostic in the sense that they do test understanding,
that is, they test the ability to use what has been learned in a slightly
different context. An instructor can therefore use these to target
support for students.
xiv • Mathematical Physics
Leicester, 2017
CHAPTER
1
DERIVATIVES AND
INTEGRALS
y y
f (x+h)
f (x+h)
f (x) f (x)
x x
x x+h x x+h
(a) (b)
FIGURE 1.1: The tangent at an arbitrary point x as the limit of line joining points (x, f(x)) and
(x + h, f(x + h)) as h gets smaller.
f ( x + h) − f ( x)
f ′ ( x ) = lim . (1.1)
h→ 0 h
Equivalently, in a commonly used alternative notation, we write
dx in place of h to emphasize that we are making a small increment
in x. We also sometimes write y = y(x) instead of y = f(x) to indicate
that y is a function of x. The definition is then written
dy y( x + d x) − y( x)
= lim . (1.2)
dx d x → 0 dx
For a function of the form y = f(x) we use the notation f ′(x),
y′ and dy/dx interchangeably. The definition expresses in an exact
manner the fact that the derivative at x is the slope of the tangent at
x. In order to use it we need a way of taking the limit. The follow-
ing example shows how this is done for a simple function such as a
power of x.
therefore
f ( x + h) − f ( x)
= 3 x2 + 3 xh + h2 ,
h
and
x ) lim ( 3 x2 + 3 xh + h2 ) ,
f ′ (=
h →0
= 3 x2 + 0 + 0,
= 3 x2 .
dy ( x + h ) −x n n
=y x=
, and lim n
.
dx h→ 0 h
1
( x + h )n = x n + nx n−1 h + n ( n − 1 ) x n−2 h2 + + h n ,
2
4 • Mathematical Physics
and
1
( x + h )n − =
x n nx n−1 h + n ( n − 1 ) x n− 2 h2 + + h n ,
2
and so
( x + h )n − x n 1
= nx n−1 + h n ( n − 1 ) x n− 2 + + h n− 2 .
h 2
( x + h )n − x n
lim = nx n−1 ,
h →0 h
and therefore
dy
= nx n−1 .
dx
You should already know (or be willing to take on trust for the
present) the derivatives in Table 1.1.
Note: the trigonometric functions are introduced in section A.4
of Appendix A and the logarithm in Section A.3. Throughout this
course ln(x) will be written for loge(x), the “natural logarithm.” It
is common to use log(x) = log10(x), and the use of any other base is
indicated explicitly, as in log2(x). The derivatives in the table may be
taken as given.
Derivatives and Integrals • 5
x a axa−1
sin(x) cos(x)
cos(x) − sin(x)
tan(x) sec2(x)
eax aeax
The derivative of the sum of two functions is the sum of the deriv-
atives. This means that if f(x) and g(x) are both functions of x then
d df dg
( f + g) = + (1.3)
dx dx dx
8x + 4 = 2(1 + 2x) × 2.
dy dy du (1.4)
= .
dx du dx
dy/du = − sin(u),
du/dx = 2x.
Derivatives and Integrals • 7
dy dy du
= ,
dx du dx
= − sin ( u ) ⋅ 2 x,
= −2 x sin ( x2 ) .
Alternatively, and more economically, you can write down the
chain rule directly:
d cos ( x2 ) dx2
d
dx
{ }
cos ( x2 ) = 2
dx
⋅
dx
= ( )
− sin ( x2 ) ⋅ 2 x.
du du
= 1,= cos ( x ) .
dx dx
Therefore, by equation (1.5),
dy
= x ⋅ cos ( x ) + sin ( x ) ⋅ 1 = x cos ( x ) + sin ( x ) .
dx
= u =
w 3
with w cos( 2 x ) ,
dw
= − 2sin( 2 x ) ,
dx
du du dw
= = ( 3 w2 ) ( − 2sin( 2 x ) ) ,
dx dw dx
= − 6 cos2 ( 2 x ) sin( 2 x ) .
dy du du
= u +u ,
dx dx dx
−6 x2 cos2 ( 2 x ) sin ( 2 x ) + 2 x cos3 ( 2 x ) .
=
Eventually you should be able to do some of the intermediate
steps in your head. So your working might look something like
this:
(x2 cos3(2x))′ = x2(cos3(2x)3)′ + 2x cos3(2x),
= x2 · 3 cos2(2x)(cos(2x))′ + 2x cos3(2x),
= x2 · 3 cos2(2x) sin(2x)(−2) +2x cos3(2x),
= −6x2 cos2(2x) sin(2x) + 2x cos3(2x).
You should cultivate the habit of working like this, resorting to
introducing u and v explicitly only if you get stuck.
If you are familiar with proof by induction, you may like the fol-
lowing example.
du du
u− u
dy dx
= dx , equivalently ( u / u )′ u′u − u ′u . (1.6)
dx u2 u2
Exercise 1.8
(i) Derive the quotient rule: By putting f = u, g = 1/v show that
if y = f g = u/v then dy/dx = (u′v′ − v′u)/ v2.
(ii) Use the quotient rule to find dy/dx if y = sin(2x)/x3 and check
your result with exercise 1.7(iii).
(iii) Use the quotient rule to find dy/dx if y = tan(x). (Recall that
tan(x) = sin(x)/ cos(x).)
dy 1
= .
dx cos( y)
We want the answer as a function of x so we have to get cos(y)
in terms of x given that sin(y) = x. We first substitute for cos(y)
using sin2(y) + cos2(y) = 1. Then,
dy 1 1
= = .
(1 − sin2 ( y) ) (1 − x )
12 2 12
dx
x if x≥0
f ( y) = (1.7)
− x if x < 0.
This function is plotted in Figure 1.2(a).
+1 if x > 0
=f ( x ) =0 if x 0 (1.8)
−1 if x < 0.
This function is plotted in Figure 1.2(b).
y
y
y = sgn( x)
y = |x|
(a) (b)
FIGURE 1.2: (a) The graph of the “modulus” function y = |x|. (b) The “sign” function y = sgn(x).
d2 y y′ ( x + d x ) − y′ ( x ) (1.10)
= lim .
dx 2 d x →0 dx
Similarly, we can define the third, fourth and higher order
derivatives. Often the nth derivative of a function y = f(x) is writ-
ten as f(n)(x), or dny/dxn. Note the parentheses around the super-
script (n) in f(n)(x) to distinguish it from (f(x))n. In practice we
continue to differentiate the result after each differentiating
(tidying up the expressions as we go) until we get to the required
derivative.
d 2 y d dy d
=
dx 2
=
dx dx dx
( nx n−1 )
= n ( n − 1 ) x n−2 .
Derivatives and Integrals • 15
Exercise 1.13 Find derivatives of the first five orders of the func-
tion f(x) = sin(x).
y y y f’ (x) > 0
f’ (x) = 0
f ’’(x) > 0
f ’’(x) = 0
f’ (x) < 0 f’ (x) > 0
x
f’ (x) = 0
f’ (x) = 0
FIGURE 1.3: (a) At a minimum the slope increases as x increases, so d(y′)/dx = y″ > 0. (b) At a
maximum, the slope decreases as x increases, so y ″ < 0. (c) At a stationary point of inflection
the slope is zero and the rate of change of slope is zero.
16 • Mathematical Physics
= dF ( x ) (1.11)
F ( x ) ∫=
f ( x ) dx if f ( x ) .
dx
∫ nax
n −1
=
dx ax n + c.
Compare with the integral we want. Put na = b, n – 1 = m.
Hence n = m + 1 and a = b/(m + 1), provided m ≠ –1. Then,
b
∫ bx dx
= x m+ 1 + c. (for m ≠ −1)
m
( m + 1)
Derivatives and Integrals • 19
dx
∫x ∫=
−1
=
dx ln x + c.
x
Note the absolute value of x on the right.
d
dx
{sin ( 2 x )} = 2 cos ( 2 x )
so that,
cos ( 2 x ) =
d 1
{
dx 2
sin ( 2 x )}
and therefore,
1
( 2 x ) dx
∫ cos= sin ( 2 x ) + c.
2
1/x In(x) + c
sin(x) – cos(x) + c
cos(x) sin(x) + c
tan(x) – ln | cos(x)| + c
ln(x) x ln(x) – x + c
eax a–1e ax + c (a ≠ 0)
1.10. INTEGRALS OF COMBINATIONS OF
FUNCTIONS
d d
dx
{ln ( f ( x ) ) + c} = dx
( ln ( u ) ) ,
1 du
= ,
u dx
1
= f ′ ( x ).
f ( x)
1
x ) ( f ( x ) ) dx
( f ( x)) + c .
n +1
∫ f ′ (=
n
Exercise 1.18 Show that
n+1
(Remember that to prove a given integral you differentiate the
anti-derivative.)
dx 1 dt
∫ ax + b = a ∫ t
.
dx dt
∫a 2
+x 2
= K∫
1 + t2
where K is a constant.
We want to replace the a2 with a 1 in the denominator, so we
take out factor a2 to get
dx 1 dx
∫a 2
+x 2
= 2∫
a 1 + x 2 a2
.
dx 1 a dt
∫a 2
+x 2
= 2∫
a 1 + t2
,
1 dt
= ∫ .
a 1 + t2
Derivatives and Integrals • 23
Exercise 1.19
(i) Find an explicit change of variable that converts
dx 1 dt
∫ b − ( x − a)
2 2
into ∫
b 1 − t2
(ii) Similarly, simplify
dx
∫
(1 + b ( x + a ) )
2 2
In harder questions where we are not given the answer, the best
approach is often to try to make the integrand as simple as possible.
There is usually more than one way to start and, if you do not suc-
ceed the first time, try again!
dx
Example 1.18 Find I = ∫ .
(1 − x ) 2 12
− sin (q ) dq
=∫ ,
sin (q )
= − ∫ dq ,
=− q + c.
24 • Mathematical Physics
I = – cos−1(x) + c
dx
∫ .
(9 − x )
2 12
(iii) ∫ cot(x) dx
(iv) ∫ (1 + x2)5x dx
u′ = 1, and v′ = – cos(x),
I = uv–∫ vu′dx,
= x(– cos(x)) – (– cos(x)).1 dx,
= –x cos(x) + sin(x) + c.
2 x2 − 5 x + 10
Exercise 1.23 Find ∫ x−4
dx
1
Example 1.24 Find ∫−1
x2 dx (the definite integral of x2
−1
between – 1 and 1) and also find ∫
1
x2 dx .
1
1 x3 1 1 2
∫−1 = 3 = 3 −− 3 = 3.
2
x dx
−1
And similarly
−1
−1 x3 1 1 2
∫ x dx = =− − =− .
2
1
3 1 3 3 3
In general
b a
∫ f ( x ) dx = −∫ f ( x ) dx.
a b
The area “under” the graph of a function (i.e. the area between
the graph and the x-axis, with areas below the axis counted as nega-
tive) is given by the definite integral of the function.
y y = f (x)
x
a b
Exercise 1.27 Using the result of Example 1.20, find the area
under the graph of y = x sin(x) between 0 and p.
30 • Mathematical Physics
Exercise 1.28
(i) At what point is sgn(x) discontinuous?
(ii) Draw the graph of the function y = sgn(x) between x = –1
and x = 2.
(iii) Find the area under the graph (as defined in Section 1.15).
y = f (x) for x < x1 y = g (x) for x ≥ x1 y = f (x) for x < x1 y = h (x) for x ≥ x1
y y
x x
x1 x1
(a) (b)
FIGURE 1.5: (a) An example of a function defined piecewise that is continuous. In the region
x < x1, the function is y = f(x), and in the region x ≥ x1 the function is y = g(x). The two functions
are equal at x1, and continuous across x1. But notice the derivative is not continuous; there is
a sudden change in the gradient at x1. (b) Now with a discontinuity at the point x1. The graph
is no longer continuous and the derivative is not defined at x1.
C − t if t ≤ 3
f (t) =
exp ( − t 5 ) if t > 3
Find the value of C that makes the function continuous over
the range t = 0 to t = 10, and then compute the definite integral
of f(t) over this range. See Figure 1.6.
The change in the definition of the function occurs at t = 3, so
we examine this point. Inserting this value into each formula
we get exp(–3/5) and C – 3. If the function is continuous over
this point, these two must be equal. Therefore
exp(– 3/5) = C – 3 ⇒ C = exp(– 3/5) + 3.
In order to integrate the above function we split the range
of integration (t = 0 − 10) into two parts, and integrate each
definition of function within its own range:
exp ( − t 5 ) dt.
10 3 10
∫ f ( t ) dt = ∫ ( C − t ) dt + ∫
0 0 3
2 0 = 3exp ( −3 5 ) + 9 2 = 6.1464...
3 3
∫ ( C − t ) dt = Ct − t
2
0
= 2.0674...
Ax + 1 for x≤1
f ( x) =
2 x for x > 1.
y
y = C–t
t=3 t = 10
2
y = exp(–t/5)
1
0
0 2 4 6 8 10 t
FIGURE 1.6: The piecewise function y = f(t) of example 1.26. The range of integration is split
into two subranges t = 0 − 3 and t = 3 − 10 and the area under the graph over t = 0 − 10 is
the sum of these two areas.
(ii) If A = 1
(a) sketch the function f(x) between x = 0 and x = 2
(b) calculate the area under the graph between x = 0 and
x=2
Let
x
F ( x ) = ∫ f ( t ) dt
a
34 • Mathematical Physics
dF 1 d 3
=
dx 3 dx
{ }
x= x=
2
f ( x ).
dF dg
= f ( g ( x)) . (1.15)
dx dx
Derivatives and Integrals • 35
x2
Example 1.29 Calculate dF/dx if F ( x ) = ∫ e− t dt
0
Exercise 1.31
e2 x
∫ ( ln( t ) )
12
(i) If F ( x ) = dt what is dF/dx?
1
1
∫ ( ln( t ) )
12
(ii) If G( x ) = dt what is dG/dx?
e2 x
(Do not attempt to compute the integrals explicitly! For (ii) begin
by finding the relation between G and F.)
p2
Example 1.30 Find ∫ cos3 (q ) dq .
0
Reduction formulae are usually obtained by integration by
parts. The key is to write the integrand as a suitable product.
Here we write (with the benefit of hindsight) cos3(q) = cos2(q)
cos(q), then
p2 p2
=I cos (q ) dq ∫
∫=
3
cos2 (q ) cos (q ) dq .
0 0
2 cos (q ) (1 − cos2 (q ) ) dq
p2
= ∫
0
p2 p2
= 2∫ cos (q ) dq − 2 ∫ cos3 (q ) dq
0 0
Notice that the last term above is twice the integral we started
with, I. So,
p 2
=I 2∫ cos (q ) dq − 2 I.
0
( 2 n −2 )( 2 n −4 ) ... ( 2 )
Cn = .
( 2 n −1)( 2 n −3) ... ( 3)
We begin with
( 2n − 2 )
Cn = C .
( 2 n − 1 ) n −1
Then, replacing n with n – 1,
( 2n − 4 )
Cn−1 = C ,
( 2 n − 3 ) n−2
and comparing these two equations we see that
( 2 n − 2 )( 2 n − 4 )
Cn = C .
( 2 n − 1)( 2 n − 3 ) n−2
If we continue until we get C1 on the right hand side, we find
that
Cn =
( 2 n − 2)( 2 n − 4) ... ( 2) C .
( 2 n −1)( 2 n − 3) ... ( 3) 1
At this point we can stop because we can integrate C1 explicitly
and also the reduction formula no longer applies. We have
p2
=C1 cos( q ) dq
∫= 1
0
Revision Notes
Further Questions
The sets of additional questions ending chapters divide into
two groups. Although they are not necessarily straightforward, the
first set comprises exercises that usually illustrate single concepts
or techniques from the text. The second set contains more diffi-
cult problems and requires a degree of insight usually drawing on
or illustrating a range of ideas, possibly using material from earlier
chapters.
1.20. EXERCISES
( ) xx
3. Find the derivatives with respect to x of xx and x .
4. Simplify x + x + 5 x − 6 x − 10
5 3 2
x2 − 2
5. Verify that the derivative of the function
3t
g ( t ) tan −1 t −
=
3 + t2
is equal to 4t4/(1 +t2)(3 +t2)2, and deduce that for t > 0
(3 + t2)tan−1t > 3t.
∫ x (2 − 5x )
5 3 23
(i) dx
42 • Mathematical Physics
∫ x (1 − x )
2 12
(ii) dx
sin x cos3 x
(iii)
∫ 1 + cos2 x dx
−1
(iv) tan x dx
∫ 1 + x2
x11
(v)
∫ x8 + 3 x4 + 2 dx
dx
14. Find ∫ 2 3 4 3 .
x −x
15. Indicating the method used, obtain the values A = 1, B = 2
for the constants in the partial fraction identity
A B 5 −x
f ( x) = + , where f ( x ) = .
6 −x 7 −x ( 6 −x)( 7 −x)
At which points P, Q does the curve y = f(x) meet the axes
0x, 0y? Show that the area of the region bounded by OP,
OQ and the arc PQ of the curve is ln(49/24).
16. In the partial fraction expression
2t A B
= + ,
(1 + t )(1 + 3t ) 1 + t 1 + 3t
you are given that A = 1; find the value of B. Hence show
that
1 2t2 p p
∫0 (1 + t 2 )(1 + 3t 2 ) dt= 4 − 3 3 .
17. Use integration by parts to show that
∫ ue du = ue u − e u + C,
u
2
(C a constant) and with the help of a suitable change of
variable deduce that
1 2
sin ( 2 t ) ln ( sin ( t ) ) dt =
p2
∫p 4
ln .
4 e
y
ln d ( y cos( y) ) = 1.
p
20. Show that ∫ p2 p
21. It is known that
4x A B C
= + +
( x + 1)( x − 1) 2
x + 1 ( x − 1) ( x − 1)
2
Find the value of the constant C and show that f(x) is dif-
ferentiable at x = 1.
24. Given that the sequence an is generated by the recurrence
relation
( n + 1)( n − 3 )
an = a n −1
( n − 1)!
with a0 = 1, find a1, a2, a3, a4 and a2001. Note that 0! = 1 – see
the discussion of factorials in Section 2.1.
44 • Mathematical Physics
1.21. PROBLEMS
1 dx
5. Evaluate
∫ 1 + x + x2
0
. By expanding the integrand, written
(i)
f(x), (ii) g(x) = ln(1 + x) – x/(1 + x/2), and state
for what values of x these expansions are valid.
8. Let f(x) be a differentiable function of x increasing on the
interval a < x < b. What condition is satisfied by the deriva-
tive f ′(x)? If f(x) is an increasing function show that 1/f(x)
is decreasing, except at points where f(x) = 0. Show that
46 • Mathematical Physics
p2
9. Let Jm = ∫0 cos (q ) dq . Calculate the value of J0
m
p2
and J1. Writing Jm = ∫ cos m −1 ( q ) d sin( q ) , show that
0
m −1
Jm = Jm − 2 where m is a positive integer. Deduce that
m
2 n −1 2 n −3 1 2 n 2 n −2 2
=J2 n ⋅⋅⋅ J0 =
and J2 n+ 1 ⋅⋅⋅ J1
2 n 2 n −2 2 2 n + 1 2 n −1 3
where n is a positive integer, and hence that
2
p 2.4.6...2 n 1 J2 n
= .
2 3.5... ( 2 n − 1 ) 2 n + 1 J2 n+1
Show that J2n–1 ≥ J2n ≥ J2n+1 for any n, and use the relation
J2 n −1 2 n + 1
=
J2 n + 1 2n
to deduce that
J2 n −1 J2 n
=
lim =
lim 1.
n →∞J2 n + 1 n →∞ J2 n + 1
p 2 2 4 4 6 2n − 2 2n 2n
= lim ⋅ ⋅ ⋅ ⋅ ⋅⋅⋅ ⋅ ⋅ .
2 n→∞ 1 3 3 5 5 2 n − 1 2 n − 1 2 n + 1
10. Let
1
∫ x n (1 − x ) dx
m
=
I n, m
0
Derivatives and Integrals • 47
2
ELEMENTARY
FUNCTIONS
You are probably familiar with the fact that a function like
(1 + x)n, where n is a positive integer, can be expanded in powers of x.
For example, for n = 1, 2, 3,
(1 + x)1 = 1 + x,
(1 + x)2 = (1 + x)(1 + x) = 1 + 2 x + x 2,
(1 + x)3 = (1 + x)(1 + 2x + x 2) = 1 + 3x + 3x 2 + x 3.
n ( n − 1 ) 2 n ( n − 1 )( n − 2 ) 3
(1 + x )n =1 + nx + x + x + . . . + xn (2.1)
2 ⋅1 3 ⋅ 2 ⋅1
Exercise 2.1
(i) Expand (1 + x)7
(ii) Write out the three lowest order terms in the expansion of
(1 + x)12
Factorials
For n a positive integer, we define n! (pronounced “n factorial”)
as the product
n! = n × (n – 1) × (n – 2) × · · · × 2 × 1. (2.2)
n! = n(n – 1) · · · 3 ⋅ 2 ⋅ 1,
and
(n – r)! = (n – r)( n – r – 1) · · · 3 ⋅ 2 ⋅ 1.
Taking the ratio of these two we have
n! n ( n − 1). . . 3 ⋅ 2 ⋅ 1
= .
( n − r )! ( n − r )( n − r − 1 ) . . . 3 ⋅ 2 ⋅ 1
We can now cancel terms from the right end of the numerator
and denominator. The terms in the denominator cancel with
the terms (n – r)·( n – r – 1) · · · 3 ⋅ 2 ⋅ 1 in the numerator,
leaving (n – r + 1) and larger terms on the top,
n!
= n ( n − 1). . .( n − r + 1) .
( n − r )!
5!
Exercise 2.4 Evaluate for r = 0, 1, 3, 4, 5. Hence write
r !( 5 − r )!
out equation (2.3) for n = 5 and compare with Example 2.1.
The coefficients in front of the xr terms, i.e. n!/r!(n – r)!, are the
binomial coefficients. Sometimes n!/r!(n – r)! is considered as a func-
tion of n and r and is called “binomial n, r” or “n choose r.” There
Elementary Functions • 53
are also other notations that are commonly used for this expression;
n
for example, or nCr mean the same thing. So you might see the
r
binomial series written as
n n
n
= (1 + x )n ∑ n Cr x r , =
or (1 + x ) ∑ x r .
n
= r 0= r 0r
Binomial Series
What happens if n is not a positive integer? We first consider
the case n = –1. Writing out the binomial series, equation (2.1), with
n = –1 gives the infinite series:
( −1)( −2 ) ( −1)( −2 )( −3 )
(1 + x )−1 = 1 − x + x2 + x3 + . . . (2.4)
2 3⋅2
= 1 – x + x2 – x3 + . . . . (2.5)
This is correct provided |x| < 1 because, then, the right hand side
is just a geometric series (section A.7) with ratio (–x). If |x| < 1, then
the left hand side is the correct expression for the sum of an infinite
geometric series. If |x| > 1 the RHS is the sum (or difference) of ever
larger terms which grows arbitrarily large in magnitude and so cannot
equal the LHS, which is a finite number.
In general, the binomial expansion of (1 + x)n is valid when n is
not a positive integer, and |x| < 1, but will be an infinite series of terms,
not the finite series of equation (2.1).
n ( n − 1 ) 2 n ( n − 1 )( n − 2 ) 3
(1 + x )n =1 + nx + x + x + . . . . (2.6)
2 ⋅1 3 ⋅ 2 ⋅1
Example 2.5 Write down the first three terms of the binomial
expansion of (1 + 2x)1/2 and state the range of x for which the
expansion is valid.
Using equation (2.6) we have
11
1 − 1
1 + (2x) + 2 x 2 + . . . (2.7)
2 2
(1 + 2 x )1 2 = ( )
2 2!
54 • Mathematical Physics
1 2
=1 + x − x +.... (2.8)
2
The expansion is valid for |2x| < 1, i.e. for –1/2 < x < 1/2.
Exercise 2.6 Use your answer from Exercise 2.5 (i) to obtain the
expansion
a a ax ax2
= + 2 + 3 +... ,
b− x b b b
valid for |x| < |b|. Use this expansion to show that a radio fre-
quency of 198 (= 200 – 2) kHz corresponds to a wavelength of
approximately 1515m. (Wavelength=wavespeed/frequency, and
the speed of radio waves is approximately 3 × 108 m s–1.)
5 5 1
x1 3 ≈ + x − x2
9 9 9
near x = 1. Verify that the value obtained when x = 1000/729 dif-
fers from the true value of x1/3 by less than 0.25%.
Exercise 2.8 If d is small compared with |x| and 1, find the expan-
sion (in powers of d) of
(3x + d )
(1 + d ) ( x − 3d )
2
Example 2.6 Find the first three terms of the Maclaurin series
of (1 + 2x)1/2.
We have
f ( x ) =+ (1 2 x ) ⇒ f ( 0 ) =
12
1,
1
f ′ ( x= ) (1 + 2 x )−1 2 ⋅ 2 ⇒ f ′ ( 0=) 1,
2
1
f ′′ ( x ) =− (1 + 2 x ) ⋅ 2 ⇒ f ′′ ( 0 ) =−1.
−3 2
2
56 • Mathematical Physics
Example 2.7 Find the Maclaurin series for sin(x). Write down
the general term.
We have
f ( x ) = sin ( x ) ⇒ f ( 0 ) = 0,
f ′ ( x ) = cos ( x ) ⇒ f ′ ( 0 ) = 1,
f ′′ ( x ) =− sin ( x ) ⇒ f ′′ ( 0 ) = 0,
f ′′′ ( x ) =
− cos ( x ) ⇒ f ′′′ ( 0 ) =
−1.
8
=−
1 x 2 + 3 x2 8 + . . . .
In Exercise 2.11 the series (i) is correct only for –1 < x < 1,
whereas the cosine series (ii) (and the sine series also) are valid for all
values of x (though not very useful for numerical computation if x is
large). The series (iii) is valid for –1 ≤ x ≤ 1. It is beyond the scope of
this course to explain how this can be worked out.
58 • Mathematical Physics
2! p!
(2.14)
Here f ¢(x0) stands for f ¢(x) evaluated at x = x0; f ≤(x0) stands for
f ≤(x) evaluated at x = x0 etc. Note that the Taylor series gives us f(x) as
a power series in (x − x0). The Maclaurin series is a special case of the
Taylor series with x0 = 0. By putting x = x0 + d we obtain an equivalent
form of the Taylor series
1 2 1
f ( x0 + d=
) f ( x0 ) + d f ′ ( x0 ) + d f ′′ ( x0 ) + . . . + d p f ( ) ( x0 ) + . . . (2.15)
p
2! p!
which resembles the Maclaurin series with x0 replacing 0, and d
replacing x. The series can be written compactly as
∞
d p ( p)
∑
f ( x0 + d ) =
p = 0 p!
f ( x0 ) .
Example 2.9 Find the Taylor series of the function x1/3 about
the point x0 = 1 up to terms of order (x – 1)2.
We have
f ( x) = x1 3 and x0 = 1 ⇒ f ( x0 ) = 1,
1 1
f ′ ( x ) = x −2 3 ⇒ f ′ (1 ) = ,
3 3
2 1 −5 3 2
f ′′ ( x ) =− ⋅ x ⇒ f ′′ (1 ) =− .
3 3 9
Therefore, for the Taylor series (Equation 2.14) we get
1 2
1 + ( x − 1) − ( x − 1) + . . . .
2
x1 3 =
3 9
This is just the binomial expansion of (1 + (x – 1))1/3.
Elementary Functions • 59
2
less than 0.00025%.
V(x) = x4 – 8x2.
Sketch a graph of V(x), find and classify the equilibrium points,
and obtain an approximate expression for V(x) near to the point
of equilibrium where x > 0.
At equilibrium, by definition V′(x) = 0. We have V(x) = x4 – 8x2,
so V′(x) = 4x3 – 16x = 0. This has solutions x = 0 and x = ±2 and
these are the points of equilibrium. Now V″(x) = 12x2 – 16.
60 • Mathematical Physics
V(x)
10
( x − 2 )2 ′′
V ( x=
) V (2) + ( x − 2) V′(2) + V (2) + . . .
2
( x − 2) 2
=−16 + ( x − 2 ) ⋅ 0 + 32 + . . .
2
≈ −16 + 16 ( x − 2 ) .
2
Note how we find all the stationary (equilibrium) points and then
consider them one by one. Some may be maxima, hence unstable
equilibria; some may be minima, hence stable equilibria.
V ′′ ( x0 )
≈ V ( x0 ) + d 2,
2
where we have neglected higher order terms because these will
be small if the distance from the minimum, d, is small. Then, the
potential
1
=
V (d ) V ( x0 ) + kd 2 ,
2
where V(x0) and k = V″(x0) are constants. For any physical system
close to a minimum V(d) ∝ δ2 and the force F = −dV/dx is proportional
to the distance, F = −kδ.
This means that, close to a minimum in the potential energy, every
physical system behaves in an analogous way to a harmonic oscillator.
V ( x ) V0 0 − 2 0 ,
=
x x
describes the potential energy of two atoms separated by
a distance, x. This has a minimum value of −V0 when x = x0.
Show that, near to x = x0, the restoring force felt by the atoms is
proportional to the distance, d, from the minimum.
62 • Mathematical Physics
V(x)
0.5V0
x0 1.5x0 2x0
0
x
–0.5V0
–V0
δ
V ( x0 ) = − V0
V′( x) = −12 V0 x012 x −13 + 12 V0 x06 x −7 ⇒ V ′ ( x0 ) =
0
( x ) 156V0 x0 x − 84V0 x0 x ⇒ V ′′ (=
V ′′= 12 −14 6 −8
x0 ) 72 V0 x0−2 .
Then
1 72 V 2
V ( x0 + d ) =−V0 + 2 0 d ,
2 x0
72 V
− 2 0 d =
F= − kd .
x0
Thus, the force is proportional to the distance from the
minimum and, close to their equilibrium separation, a pair of
atoms will behave like a mass on a spring.
Elementary Functions • 63
y
15
y
1.0
10
0.6
y = ex
y = e−x
5
0.2
x x
−4 −2 0 2 4 1 2 3 4 5
(a) (b)
FIGURE 2.1: The functions (a) ex and (b) e −x .
y y = ln(x)
1
x
1 2 3
−1
−3
−5
FIGURE 2.2: Graph of ln(x). Note that log(x) (for any base) is only defined for x > 0.
d x
dx
{ e } = ex . (2.17)
In words, the rate of change of ex is ex. This is a fundamental
result that distinguishes the number e; it is not true if we replace e by
any other number. (See Exercise 2.17.)
x2
f ( x ) =f ( 0 ) + xf ′ ( 0 ) + f ′′ ( 0 ) + . . . .
2!
Therefore
x2 x3 xn
ex = 1 + x + + +...+ +.... (2.18)
2! 3! n!
So in general
d x
dx
{ a } ∝ ax
f ( x) = 1 ⇒ f ( x0 ) =
2
e− x and x0 = e−1 ,
f ′( x) =
−2 xe− x ⇒ f ′ (1 ) =
2
−2 e−1 ,
f ′′ ( x ) =
−2 e x + 4 x2 e− x ⇒ f ′′ (1 ) =
2 2
2 e−1 .
y
1.5
2nd 3rd
1.0
0.5
4th 9th
0.0
1st
−0.5
x
0.0 1.0 2.0
FIGURE 2.3: The function exp(−x2) (solid blue line) together with its Taylor expansion to
various orders (see Example 2.14). The black dashed line shows the first order
approximation, and the long dashed grey line shows the Taylor expansion to second order;
3rd, 4th and 9th order approximations are also shown. Close to x = 1 the expansion is very
good with just a few terms. For example, the first order (linear) approximation is accurate
to within 2% for 0.82 < x < 1.12. Further from x = 1 we need to use a higher order expansion
to give a good approximation to the function.
∫ e dx= e x + constant.
x
∫ xe
− ax2
Exercise 2.19 Find (Hint: put u = −ax2.)
∫e
− x2
Note that the indefinite integral dx (without the additional
factor of x in the integrand) cannot be expressed in terms of elementary
functions. Do not waste time trying! This is known as the Gaussian inte-
gral and is very important in probability theory and quantum mechanics.
y y
4
y = sinh(x)
4
x
y = cosh(x)
−3 −2 −1 1 2 3
2
−2
−4
x
−2 0 2
(a) (b)
FIGURE 2.4: Graphs of the hyperbolic functions (a) cosh(x) and (b) sinh(x).
68 • Mathematical Physics
sinh ( x )
tanh ( x ) = ,
cosh ( x )
1
coth ( x ) = ,
tanh ( x )
1
cosech ( x ) = ,
sinh ( x )
1
sech ( x ) = .
cosh ( x )
d
Example 2.17 Show that cosh ( x ) = sinh ( x ) .
dx
d d e x + e− x e x − e− x
dx
{ =
cosh ( x )} =
dx 2
= sinh ( x ) .
2
Exercise 2.22 Show that sinh(x) has the same sign as x for all x
and hence that cosh(x) has a global minimum at x = 0.
3
y = f (x)
2
2
–1
y=f (x)
1
1
x x
–3 –2 –1 1 2 3 –3 –2 –1 1 2 3
–1
–1
–2
–2
–3
–3
(a) (b)
FIGURE 2.5: (a) A graph of a function, y = f(x). (b) The graph of the inverse function y = f−1(x)
is obtained by reflection in the line y = x.
TABLE 2.1: Table of some standard functions and their inverse functions. The
third and fourth rows give restrictions on the x and y values for which the inverse
function y = f −1(x) is defined.
(
Example 2.20 Show that sinh −1 ( x ) = ln x + x2 + 1 . )
We need to turn this into a problem about the sinh
function whose properties we know. So to begin we invert
y = sinh−1(x). Now we can use the definition of sinh,
x = sinh(y) = (ey − e−y )/2.
We try to turn this into an equation we can solve for y. To begin,
multiply through by 2ey and rearrange to give
e2y − 2xey − 1 = 0.
This is a quadratic in ey. Let u = ey (which means we are seeking
y = ln(u)); then,
u2 − 2xu − 1 = 0.
This is a simple quadratic equation that has solutions,
u=
x ± x2 + 1.
We choose the + sign because otherwise u < 0 and ln(u) does
not exist. Then, finally,
(
y = ln x + x2 + 1 . )
( )
Exercise 2.24 Show that cosh −1 ( x ) = ln x ± x2 − 1 and hence,
or otherwise, show that
d 1
dx
{cosh −1 ( x )} = ± .
x −1
2
Revision Notes
2.12. EXERCISES
p 1 d2 d3
3. Show that sin + d= 1 + d − − + . . . .
4 2 2 6
4. By combining known series, find the Taylor series of
1+ x
about x = 0, and use it to show that ln(1.25) is ap-
1− x
proximately equal to 488/2187.
5. Write down the first three terms in the binomial expansion
of 4 + x . Use this truncated series to estimate 4.5 and
calculate the percentage error in your result.
6. Find the first four non-zero terms in the Taylor series of
(ln x)2 about x = 1.
7. Find the first non-vanishing term of the Taylor series for
y = sin4(x) near x = 0.
8. Use a Taylor series to show that sin(32o) is approximately
1 2p
equal to + 3.
2 360
9. Write down the series for sin(q) in terms of q and deduce
the approximate formula
1
q ≈ sin(q ) + q 3 .
6
1
By considering the case q = p/6 obtain the estimate p = 3 .
8
10. By equating coefficients of xn on both sides of
(1+ x)n(1+ x)n = (1+ x)2n
deduce the identity
2
n
n 2n
∑ = n .
r =0 r
11. Sketch the graph of y = 1 + sin(x) for − p/2 £ x £ p/2 and
hence that of its inverse function. For what values of the
variable is the latter defined?
74 • Mathematical Physics
16. Define the functions cosh(x) and sinh(x), and deduce that
each is the derivative of the other and that cosh(0) = 1,
sinh(0) = 0. Hence obtain the Maclaurin series for cosh(x)
as
x2 x 4
1+ +
2! 4!
and show that
2.13. PROBLEMS
1 1+ x
1. From the formula tanh −1 ( x ) = ln( ) deduce that
2 1− x
d 1 and hence show (with the help of an
dx
( tanh −1 ( x ) ) =
1 − x2
integration by parts) that ∫ tanh −1 ( x ) dx can be written as
1
x tanh −1 ( x ) + ln (1 − x2 ) + C.
2
2. (a) S
how that the function tanh−1(x) may be written as
ln 1 + x− ln 1 − x and that
d d
( 1 − x2 )
−1
tanh −1 ( x ) = and tanh ( x ) =
1 − tanh 2 ( x ) .
dx dx
(b) Sketch the curves y = tanh(x), y = tanh−1(x),
y = 1 − tanh−2(x) and y = (1 − x2)−1, indicating any axis
crossings and the asymptotic behavior for each.
(c) Show that the curves y = tanh(x) and y = 1 − tanh2(x)
d 1
sinh −1 ( x ) = .
dx x +1
2
( x ) dx
∫ sinh =
−1
x sinh ( x ) − x2 + 1+ C.
76 • Mathematical Physics
a0 + a1t + a2t2+ . . .
all yield the same values for a0, a1 and a2.
(i) Substitute 3t+t2 for x in the series for ex and collect
powers of t.
2
t
(ii) Multiply together the power series for e3t and e .
1
Obtain the Taylor coefficients f ( 0 ) , f ' ( 0 ) ,
(iii) f '' ( 0 )
2
1
f (0), f '(0), f '' ( 0 ) using
2
f′(t) = (3 + 2t)f(t), and f″(t) = (3 + 2t)f′(t) + 2f (t).
CHAPTER
3
FUNCTIONS, LIMITS,
AND SERIES
y = x2 – 4x + 3 = (x – 2)2 – 1.
This has its smallest value (of −1) at x = 2 (since the smallest
value of the squared term is 0). So x = 2 gives a minimum. Next
determine the axis crossings – first the x-axis:
y
4
3
(x − 2)2 − 1 = 0,
2
⇒ (x − 2)2 = 1,
1
⇒ x − 2 = ±1.
x
1 2 3 4
−1
dy/dx = 12x2 − 3 = 0
80 • Mathematical Physics
when x = ±1/2.
To join smoothly onto the curve for x → ±∞, x = −1/2 must
give a maximum and x = 1/2 a minimum. This can also be
determined from the sign of d2y/dx2.
Also,
d2y/dx2 = 24x = 0
y
3
2
1
x
−1.5 −0.5 0.5 1.0 1.5
−1
−2
−3
Note that a quadratic has at most two zeros, while a cubic has at
most three and at least one zero. In general, a polynomial of degree
n (i.e. having highest order term xn) has at most n zeros; if n is odd
Functions, Limits, and Series • 81
dy/dx = −1/x2 ≠ 0,
for any finite x. And
d2y/dx2 = 2/x3 ≠ 0,
x
−1 1 2 3
−1
82 • Mathematical Physics
−1
1 2
=
y = 1 −
1−2 / x x
2 4
y =1 + + 2 +...
x x
y
5
1
x
−1 2 3 4 5
−5
y
y=x
3
2
y = 3–x
1
y=0 x
1 2 3
y = 1/(3−x) xp y = 1/(x−1)
x
1 2 3 4
−1
−2
From the graph y1 < y2 for x > 3 and between xp and 1. The
crossover point is xp, where 1/(3 – xp) = 1/(xp – 1), i.e. xp – 1 = 3 – xp
so xp = 2. So 1/(3 – x) < 1/(x – 1) for 1 < x < 2 and x > 3.
where
1
( x)
f=
2
( h ( x ) + h ( − x ) ) is even,
and
1
( x)
g=
2
( h ( x ) − h ( − x ) ) is odd.
y y y
–a a x x
–a a x
curve between −a and 0 is equal and opposite to the area under the
curve between 0 and a. For any odd function,
a
∫ g ( x ) dx = 0.
−a
a
0 if h ( x ) isodd
∫− a h ( x ) dx = a (3.1)
2 ∫0 h ( x ) dx if h ( x ) iseven.
1
(ii) ∫ cosh ( x ) dx .
−1
3.6. LIMITS
Limits by Substitution
Consider the function y = x2. As x tends to 4, for example, then
y gets and remains arbitrarily close to 16, the value of y when x = 4.
88 • Mathematical Physics
= =
lim y 16 or lim x2 16.
x→4 x→4
sin ( x )
x
1
( a ) lim , ( b ) xlim x ln ( x ) , ( c ) lim x1/ x , ( d ) lim 1 + .
x →0 x +
→0 x →+∞
x →+∞ x
Direct substitution gives the meaningless results 0/0, 0 × (−∞),
∞0, 1∞, respectively. It is hopeless to try to proceed further like this.
In particular 0/0 is not 1, but meaningless. And 0 × ∞ is not 0, or ∞,
or any other number, but meaningless.
Such expressions may nevertheless approach definite limits
(finite or infinite) − for example, 1, 0, 1, e, respectively, in the
above four cases. But we must use a different method to evaluate
these indeterminate limits. One method that sometimes works is
expansion in series. Often limits can be related to those already
known. Alternatively, a method that usually works is l’Hôpital’s
rule.
sin ( x )
Example 3.10 Find lim .
x x →0
Substituting x = 0 we get sin(0)/0 which is indeterminate. To
work out the limit we expand sin(x) as a series
x3
sin ( x ) x − + ...
x2
= 6 =1 − + ...
x x 6
Putting x = 0 in the resulting series leads to a determinate
result.
So
sin ( x )
lim = 1.
x →0 x
90 • Mathematical Physics
ex
Example 3.11 Find lim where n is a positive integer.
x →+∞ x n
= 0.
And we used the result of Exercise 3.10 in the last line.
ln ( x )
ln ( L ) = lim .
x →+∞ x
To relate this to example 3.12 let u = 1/x. So u → 0+ as x →+∞.
= − lim+ u ln ( u )
u→0
= 0.
Now we see that ln(L) = 0 which implies L = 1.
This is only valid when the limit (on the left) is indetermi-
nate, and both f(x0) and g(x0) are zero, or both are ±∞. Notice
that to apply the rule we differentiate both f(x) and g(x) and then
divide f ′(x) by g′(x). This is not the derivative of (f/g). Also, if the
92 • Mathematical Physics
limit on the right is also indeterminate, you may apply the rule
again.
ln ( x ) −∞
lim+ = which is indeterminate
x →0 1/ x ∞
Using l’Hôpital’s rule with f(x) = ln(x) and g(x) = 1/x we get
= lim
=
( ln ( x ) )' lim 1 / x
x →0 (1 / x ) ' x → 0 + −1 / x
+ 2
= lim+ ( − x=
) 0.
x →0
Exercise 3.12 For which values of the real number a does the
∞
integral ∫
1
x a dx converge?
1
Example 3.15 Evaluate ∫ 0
x −1/ 3 dx .
We have
1 1
∫0
x −1/ 3 dx = lim+ ∫ x −1/ 3 dx
l →0 l
3 1
= lim+ x2 / 3
l →0 2 l
3 3
= lim+ − l 2 / 3
l →0 2 2
3
= .
2
1
Exercise 3.13 For what values of a does the integral ∫0
xa dx
converge?
sin ( x )
→ 1as x → 0.
x
This will be true if
sin ( x )
lim → 1as x → 0.
x →0 x
This was proved earlier in Example 3.10.
1
(iii) ln ( n!) n ln ( n ) − n + ln ( n ) ,
2
1 1
(iv) ln ( n!) n ln ( n ) − n + ln ( n ) + ln 2p.
2 2
The second one is usually sufficient in physics and should be memo-
rized. Note that you cannot get back to equation (3.3) by exponentiating
(ii), although you can do so from the more accurate form (iv). In general
asymptotic equivalences do not remain valid when exponentiated.
Exercise 3.18 Use trial and error on a pocket calculator to deter-
mine how large n must be for formula (ii) to give an answer accurate
to within 10%. For this value of n, how accurate an estimate does this
imply for n! itself, and how accurate an estimate does (iv) give for n!?
cosh ( x )
=
(e x
+ x− x )
= 1 + e−2 x → 1as x → ∞.
e x /2 ex
n ( n − 1)
(i)
n2 − 1
n
a
(ii) 1 + (Hint: use the result of exercise 3.11(ii).)
n
S1 = a1
S=
2 a1 + a2
S3 = a1 + a2 + a3
n
Sn = ∑ ai
i =1
1 + 1/2 + 1/4 + . . . = 2.
There are very few series that can be summed in this way
because we cannot usually find a simple closed form1 for the nth
partial sum, Sn, from which to obtain the limit. (Note that this
is not the same thing as a formula for the nth term of the series
which is often easy!) Instead, we might try to get an approxima-
tion to the sum by summing a sufficiently large finite number
of terms. For example, an approximate value for the sum of the
series 1 + 1/2 + 1/4 + 1/8 + . . . obtained by summing the first four
terms is S S4 = 15/8 = 1.875 (which is in fact in error by 6.25 %).
If you add enough terms on your calculator you will end up with
the value of 2.000. . . . This is not a proof that the sum of the
series is 2. You have run out of decimal places on your calcula-
tor and you have no proof that the sum of the infinite number of
small terms you have not yet considered is less than the accuracy
of your calculator.
Here is another example where your calculator would mani-
festly be useless. Consider the series S = 1 + 1/2 + 1/3 + 1/4 + . . . .
The first 4 terms give S4 = 2.08. Is this a good approximation?
The next 4 terms give S8 = 2.7145. . . . What about 12 terms? Or
100? In fact, the sum creeps up and up very slowly. (The sum of
the first 100 million terms is less than 20.) We can guess that even
though each additional term is getting smaller there are so many
of them that the sum is always growing toward infinity. This is
true, but you cannot prove it on your calculator. (This will only
show that the sum is beyond the maximum number your calcula-
tor will store, not that it is beyond any number any calculator can
store!) The following is a proper proof that the series does not
have a finite sum.
1
A “closed form” expression is one involving a finite number of mathe-
matical operations on the standard functions. This definition is intentionally
a little vague as the set of allowed functions may vary depending on the
context.
100 • Mathematical Physics
The essence of the proof is to show that the given series is greater
than a series which is known not to have a finite sum. There is no
rule for how to do this – just educated guesswork. Here we use the
trick of bracketing terms together. With a suitable choice of how to
bracket the terms we have
where the inequality arises because each term in the series has been
replaced by something equal or smaller. But for the new series we
can form the partial sums:
1 1 1
S >1+ + + +,
2 2 2
which clearly goes on getting bigger and bigger, i.e. the series
diverges to +∞.
Thus, before we try to approximate a series by summing a
number of terms, we need to know whether the series does have a
finite sum (in which case we say the series converges) or whether it
diverges. Courses on pure mathematics and mathematical reference
books give a large number of methods for doing this, based essen-
tially on comparison with known series. (The large number of meth-
ods has arisen because often a particular test will be inconclusive;
one goes through them in order until one finds a test that works.) We
give an example of comparison in which the bracketing technique
given above is used to establish convergence.
2 3 4
(i) S = + + + ...
1×1 2 × 2 3× 3
32 − 2 2 4 2 − 2 2 52 − 4 2
(ii) S = + + + ...
13 23 33
(i)
The terms in the numerator are growing linearly,
those in the denominator quadratically, so the series
looks something like 1/n. So try to prove it diverges by
comparing with 1/n.
2 3 4
S= + + + ...
1×1 2 × 2 3× 3
1 2 3
> + + + ...
1×1 2 × 2 3× 3
1 1 1
= + + ...
1 2 3
102 • Mathematical Physics
( n + 2 )2 − ( n + 1 )2
an =
n3
2n + 3
=
n3
2 3
= 2 + 3.
n n
Hence it is convergent.
Revision Notes
3.13. EXERCISES
∑
∞
12. Evaluate the sum n =1
2 − n. Hence show that the series
∑
∞ 2
n =1
2 − n converges.
13. Evaluate the limit of (ex − 1)/x as x → 0. Deduce the asymp-
totic form of ex − 1 as x → 0 . Show that (ea/x − 1)−1 ~ x/a as
x → ∞.
3.14. PROBLEMS
1. The function
x3 + 4 x2 + Ax + 1 x3 + x2 − 4
=f ( x) −
x2 + 2 x x2 − 4
where A and p are constants. Show also that the left side
can be written as
4sin 4 ( 2x )
cos ( x )
for the sum of the series on the left. (You may quote the
Maclaurin series for ln(1 − x).) What is the limit of this
expression as x → 1 from below?
7. Write down the expansion of cos(q) in powers of q (valid for
all real q).
(a) By considering the identity cos(q) sec(q) = 1, show that
the first four coefficients in the expansion
sec(q) = a0 + a2q 2 + a4q 4 + a6q 6 + . . .
2 cosh ( x) −1 < e x
for x > 0,
and
2 cosh ( x)−2 e x
− 1 for x → 0 +.
x2 x2 x3
x+ + < − ln (1 − x ) < x + + .
2 2 3 (1 − x )
is divergent when x = 1.
∑ ( −1) a
n
partial sum n together with all subsequent
n=0
partial sums, is within 0.01 of S.
∫ ( x − 1) e
−x
11. (a) Show that xe x + C and also find
dx =
∫ ( x − 1) e
−x
dx .
(b) The function f(x) is defined by
( x − 1 ) e
−x
for x > 2,
f ( x) =
A ( x + 1 ) e x<2
x
for
1 −4
and is continuous at x = 2. Show that A = e .
3
X
(c) Evaluate, for X > 2, the integral ∫ f ( x ) dx . What is the
2
∞
meaning of ∫ f ( x ) dx and what is its value?
2
2
(d) Also find the values of ∫ f ( x ) dx ( Y < 2) and
Y
2 ∞ 8 −2
∫ f ( x ) dx and deduce that ∫ f ( x ) dx = e .
−∞ −∞ 3
12. (a) Find the values of the function (sin(x))/x and its first
two derivatives at x = 0. Hence or otherwise write down
the Maclaurin series for (sin(x))/x up to terms
in x2.
(b) What is the ratio of the values of x which make the
truncated Maclaurin series in part (a) and the original
function (sin( x)) /x zero? (Use the smallest possible
absolute value of x in the latter case.)
CHAPTER
4
VECTORS
The physical world (as we perceive it) exists in three spatial dimen-
sions. Physical quantities such as velocities – which have both
magnitude and direction – need three components for their specifi-
cation. We call these vectors. (In fact, even more than this, a chemi-
cal system, or an economic model may have, say, 20 components;
such a system can often be represented conveniently by a vector
in the appropriate number of dimensions (20 in this example): the
algebra is the same.) You are going to learn to do algebra with vec-
tors (add and multiply them) in a geometrically meaningful way.
The first part of the chapter deals with the algebra and the sec-
ond with the geometrical picture. You can think of this in two ways:
either that the algebra enables you to do complicated geometrical
things rather mechanically (by following the algebraic rules), or that
the geometry enables you to get a picture of the algebra. In either
B B B
C
A
A A
D
(a) (b)
FIGURE 4.1: (a) The vectors AB and BA . (b) The parallelogram law of vector addition.
112 • Mathematical Physics
case you have to learn to switch smoothly between the two methods,
and this requires practice. Your reward will be command of a math-
ematical tool that pervades physics.
are
vectors.
Such expressions are referred to as linear combinations
of AB and CD .
C v
µv w
B
u
O λu
A
(a) (b)
FIGURE 4.2: (a) AC = 1.5 AB . (b) Linear dependence, with w = lu + mv.
114 • Mathematical Physics
Exercise 4.1If ABis 2 units due north and AC is 1 unit due
east, what is AB + 2 AC ?
Exercise 4.2 A person swims across a river with a speed rela-
tive to the water of 4 ft s−1at right angles to the current which is
flowing at 3 ft s−1. Draw a figure to show the swimmer’s resultant
motion and check that it accords qualitatively with your intuition.
Exercise 4.3 Rain falls vertically at 10 m s−1; draw a vector diagram
to show how it appears to a person running at 5 m s−1.
Exercise 4.4 A, B, C, D are four points in space; and the seg-
ments AB, BC, CD, DA have mid-points P, Q, R, S, respectively.
Show that
(i) 2AP = AB
(ii) 2PQ = AC
(iii) PQRS is a parallelogram.
Usually, instead of writing AB , say, for the line segment represent-
ing a vector, we give a vector a name, v say. The bold font indicates that
this is the name of a vector; in hand written work we use v or v or v.
•• The magnitude of a vector v is written as |v| or v.
• A linear combination of two vectors u and v would be written
lu + mv.
• The zero vector is printed as 0 (and written 0 ), so that, for
example, u − u = 0.
Linear Dependence
We have seen that two vectors may be added to make a third in
the same plane; e.g.
w = lu + mv. (4.3)
Vectors • 115
If this can only be satisfied when the coefficients are all zero
(i.e. l = m = n = 0) then the vectors u, v, w are linearly indepen-
dent. There is a close connection between the maximum number
of linearly independent vectors and the dimension of a given space:
the dimension of a “vector space” is the largest number of linearly
independent vectors we can choose.
In three-dimensional space we can specify three non-coplanar
vectors arbitrarily (say u, v, w) and construct any other vector (say x)
as a linear combination of these:
x = lu + mv + nw.
Basis Vectors
We can use these observations to provide a useful way of specify-
ing vectors. We make a convenient choice of the maximum number
of independent vectors (two non-parallel vectors in two dimensions,
three non-coplanar vectors in three dimensions). These we call our
basis vectors. Any other vector in that space can then expressed as a
linear combination of these basis vectors.
116 • Mathematical Physics
j
k C
v
i
3
k
O j
1
j A B
2
i i
(a) (b)
FIGURE 4.3: (a) Basis vectors in 2D and 3D. (b) Diagram for example 4.1.
(v + v22 + v32 ) .
12
v = 2
1
Example 4.3 Find the unit vector in the direction of the vector
v = i + 2j + 3k.
To obtain the unit vector we must divide each component
by the magnitude of v. We know from example 4.1 that
v= 12 + 2 2 + 33 = 14 , so the unit vector is
v 1
vˆ = = ( i + 2 j + 3k ) .
v 14
Definition: Let q be the angle between the two vectors a and b. The
scalar product (or “dot product”) of two vectors a and b, written a · b,
is defined to be the number |a||b| cos(q):
a·b=b·a (4.6)
(the order of the two vectors in the scalar product does not affect
the result).
If n̂ is a vector of unit length making an angle q with a vector
v, then
= cos (q ) v cos (q )
nˆ ⋅ v nˆ v=
b
c
b+c
θ b
a a
|b|cos(θ) a·c
a·b
(a) (b)
FIGURE 4.4: (a) Projection of b and a. (b) The sum of projections onto vector a (which can
be assumed to be a unit vector).
120 • Mathematical Physics
Exercise 4.9 If a ⋅ b =
1 2 and |a| = |b| = 1, what is the angle
between a and b?
Exercise 4.10 Show that the scalar product may be taken in
either order; i.e. prove equation (4.6).
We shall need the distributive law for the scalar product
a · (b + c) = a · b + a · c. (4.7)
To establish equation (4.7) assume first that a is a unit vector.
Then equation (4.7) says that the orthogonal projection of b + c on
a is the sum of the projections of b and of c, a fact which is obvious
geometrically (see Figure 4.4(b)). For a general a (not a unit vector)
divide (4.7) through by |a| since then a /|a| is a unit vector. Note that
equation (4.7) also implies that
(b + c) · a = a · b + a · c.
Exercise 4.11 If a = 7i + 0.2j + 3k, find a · k.
Exercise 4.12 By expanding (a + b) · (a + b) derive the formula
|a + b|2 =|a|2 +|b|2 + 2|a||b| cos(q),
where q is the angle between a and b. Draw a figure to demon-
strate that this is the usual cosine rule of trigonometry. Also deduce
that |a + b| ≤ |a| + |b| (which is called the “triangle inequality”).
Vectors • 121
area=|a×b|=|a||b|sin(θ)
|a||b|sin(θ)
a×b
b
^
n
|b|sin(θ)
b a
θ θ
a
(a) (b)
FIGURE 4.5: (a) The vector product a × b. (b) The vector product a × b interpreted as a
vector with magnitude equal to the area of the parallelogram formed by a and b.
We won’t prove this relation here (the proof is quite tedious) but
you can look it up if you are interested in the details.
volume = (a×b)·c
a×b
c
a
FIGURE 4.6: The scalar triple product and the volume of a parallelepiped.
124 • Mathematical Physics
In terms of the notation [a, b, c] for the scalar triple product the
above properties can be written
b×c
A=
[ a, b, c ]
satisfies A · a = 1, A · b = 0, A · c = 0 (if [a, b, c] ≠ 0).
v1 =
[ v, b, c ] ,
[ a, b, c ]
and find similar expressions for v2 and v3.
b×c
A= ,
[ a, b, c ]
c×a
B= ,
[ a, b, c ]
a×b
C= .
[ a, b, c ] (4.10)
Vectors • 125
t
A
r
l a
O
FIGURE 4.7: Equation of a line: r= a + l tˆ .
r= a + l tˆ . (4.11)
126 • Mathematical Physics
a + lt = b + ms, (4.12)
a − b = ms − lt.
r = a + ls + mt,
128 • Mathematical Physics
Σ t R
s
A
a
r
O
FIGURE 4.8: Equation of a plane: r = a + ls + mt.
so,
r · (s × t) = a · (s × t) + ls · (s × t) + mt · (s × t)
= a · (s × t) + 0 + 0
= constant.
s× t
We let nˆ = ; then it follows from Example 4.8 that we can
s× t
write, for any r in the plane ∑,
r ⋅ nˆ =p, (4.14)
where p = a ⋅ nˆ = constant (by Example 4.8). Since equation (4.14)
holds if and only if r is in ∑, this is also (an alternative form of) the
equation of the plane ∑. In applications we use whichever of equa-
tion (4.13) and equation (4.14) is more convenient.
Exercise 4.24 Show that two distinct planes with unit normals
n̂1 and n̂ 2 intersect if and only if nˆ 1 × nˆ 2 ≠ 0 .
a3
k
a1 j
y
i O
a2
x
FIGURE 4.9: Vector components.
130 • Mathematical Physics
Exercise 4.27 With r = (x, y, z), a = (1, −2, 0) and t = (−1, 3, 0),
write the three coordinate equations for the line r = a + lt.
Deduce that this line is in the (x, y) plane and find m and c such
that it can be put into the form y = mx + c.
Vectors • 131
a1 b1 + a2 b2 + a3 b3 =−1 × 2 + 3 × 0 + 1 × −1 =−3.
Also
( a ⋅ a )1 2 = ( −1 ⋅ −1 + 3 ⋅ 3 + 1 ⋅ 1 ) =
12
a = 5,
b= ( b ⋅ b )1 2 = ( 2 ⋅ 2 + 0 ⋅ 0 + −1 ⋅ −1)1 2 = 5.
132 • Mathematical Physics
a⋅b −3 3
cos (q ) = = = − ,
a b 5 5 5
Take the inverse of cos in the correct quadrant to find the angle q:
q = cos−1 (−3/5) = π − cos−1 (3/5) = 2.214 ... radians
Exercise 4.30 Find a vector N and a scalar P such that the equa-
tion x − y + 2z = 3 can be written in the form r · N = P (where
r = (x, y, z) as usual). Hence write the equation in the form r ⋅ nˆ =p
where n̂ is a unit vector. Deduce that the equation represents a
plane and obtain the shortest distance from the origin to the plane.
For readers familiar with matrix algebra, we can write the scalar
product of two vectors as a row vector multiplied by a column vector
1
b
⋅ b ( a1 , a2 , a3 ) b2=
a= ↓ a1 b1 + a2 b2 + a3 b3 . (4.20)
b
3
Proof:
a × b = (a1i + a2j + a3k) × (b1i + b2j + b3k)
= a1i × (b2j + b3k) + a2j × (b1i + b3k) + a3k × (b1i + b2j),
expanding out using the distributive law and remembering that
i × i = j × j = k × k = 0. Then making use of i × j = −j × i = k etc,
we have that
= a1b2k − a1b3j – a2b1k + a2b3i + a3b1j – a3b2i
=( a2b3 – a3b2)i + (a3b1 – a1b3)j + (a1b2 – a2b1)k.
We can use either equation (4.21) or equation (4.8) to evaluate a
vector product – depending on the information available. It is important
to know both expressions and to be able to spot which is appropriate.
Exercise 4.31 If a = (−2, −2, 1) and b = (2, 0, −1), find a × b.
Show by explicit calculation that
(i) a × b is orthogonal to a and b,
(ii) (a × b) × a lies in the same plane as a and b. (Express it as
a linear combination of these two vectors.)
Exercise 4.32 Find the unit normal to the plane containing the
vectors s = (1, 1, 2) and t = (−1, −3, 0) and hence write the equa-
tion of the plane containing these vectors which passes through
the point (0, 1, −1) in the form ax + by + cz = d.
For readers familiar with matrix algebra, we can write the vector
product of two vectors as a determinant as follows:
i j k
a×b =a1 a2 a3
b1 b2 b3
= ( a2 b3 − a3 b2 ) i + ( a3 b1 − a1 b3 ) j + ( a1 b2 − a2 b1 ) k.
a1 a2 a3
a ⋅ (b × c) =b1 b2 b3
c1 c2 c3
= a1 b2 c3 − a1 b3 c2 + a2 b3 c1 − a2 b1 c3 + a3 b1 c2 − a3 b2 c1 .
This is sometimes known as the “bac minus cab” rule, the name
chosen to help recall the order of the vectors on the right side of the
equation.
Proof: Since no axes have been specified we may choose them to
simplify the problem. Choose the i-axis perpendicular to b and c, so
that b1 = c1 = 0. Then
Vectors • 135
Exercise 4.36 Deduce from equation (4.23) and Exercise 4.34 that
1
=X 2
a +l 2
(l b + l −1
( a ⋅ b ) a + a × b ).
x × a + lx = b.
x = aa + b b + g a × b.
Substitute in (4.24)
Exercise 4.37
(i) Complete the missing algebra steps in example 4.14.
(ii) Complete the missing algebra steps in example 4.15 to
obtain the same solution.
Exercise 4.38 Given vectors a and b such that a · b = 0, solve
the equation x × a = b subject to the condition x · a = k (a given
constant).
∑ Ai Bi .
A ⋅B=
i =1
1 if i= j
d ij =
0 if i ≠ j.
So, for example, Œ132 = −1 and Œ112 = 0. We can write the cross
product using Œijk and the summation convention in the compact form
(A × B)i = Œijk AjBk.
Vectors • 141
Revision Notes
4.16. EXERCISES
4.17. PROBLEMS
1. S
how that the equation of the straight line passing through
the points with position vectors c and d can be written as
r = c + lt,
Vectors • 143
r · n = a · n.
(a − c) ⋅ n
c+{ }t.
t⋅n
(iii) Find a vector perpendicular to both AB and AC .
(iv) Find the area of the triangle ABC.
(v) Find the equation of the plane through ABC in the
form ax + by + cz = d.
(vi) Find the perpendicular distance of this plane from the
origin.
5. T
he plane which has vector equation k · r = 1 (with k a
unit vector) intersects the sphere which has vector equa-
tion |r| = 2. If a and b are the position vectors of the two
diametrically opposite points on the circle of intersection
of the plane and the sphere show that (a − b) = 2a −2k.
Hence find the diameter of the circle.
6. T
he angle between two non-parallel unit vectors a and b is
q, and two further vectors A and B are defined by
Show that
show that
lx + my + nz = p.
a × x + bx + c = 0.
5
MATRICES
3x + 2y = 7
6x − y = 4
148 • Mathematical Physics
3x + 2y = 7
12x − 2y = 8
15x + 10y = 35
15x + 0y = 15
0x + 10y = 20
15x + 0y = 15
3 2 7
12 −1 = 4
and
3 2 7
12 −2 = 8
And so on, provided we have a rule that enables us to put back the x
and y. This rule must be such that
3 2 x 3 x + 27
= . (5.1)
6 −1 y 6 x − y
Let a = (a1, a2, a3) and x = (x1, x2, x3) be vectors. Consider the
scalar product a · x = a1x1 + a2x2 + a3x3 from Section 4.9. We can
write this product as
1
x
( a1 , a2 , a3 ) x2 = a ⋅ x = a1 x1 + a2 x2 + a3 x3 (5.2)
x
3
where corresponding terms reading across the row and down the
column are to be multiplied together (a1 × x1, a2 × x2, a3 × x3) and
the results added. (The arrows are there for illustration only.) This
can be extended thus:
a1 a2 a3 x1 a ⋅ x a1 x1 + a2 x2 + a3 x3
b1 b2 b3 x2 = b ⋅ x = b1 x1 + b2 x2 + b3 x3 (5.3)
c1 c2 c3 x3 x ⋅ x c1 x1 + c2 x2 + c3 x3
where the result of going across each row in turn on the left is placed
at the corresponding level on the right. The first array on the left
is called a matrix. More specifically it is a 3 × 3 matrix (there are 3
rows running across the page, 3 columns running down the page).
A vector is a special case: it can be written as a single column, in
which case it is known as a “column vector” or 3 × 1 matrix, or as a
single row, hence known as a “row vector” or 1 × 3 matrix. Both are
used in equation (5.2).
150 • Mathematical Physics
1 1 −1 x −3
(5.5)
2 1 −1 y =
0
4 −2 −3 z 4
Ax = h,
where A stands for the matrix on the left side and h for the column
vector on the right, we should like to “divide through by A” and
obtain the matrix solution
x = A−1h. (5.6)
2x + 2y − 2z = −6
4x + 2y − 2z = 0
8x − 4y − 6z = 8.
2 2 −2 x −6
4 2 −2 ⋅ y = 0 ⋅
8 −4 −6 z 8
Thus we have shown that
1 1 −1 2 2 −2
2×2 1 − =1 4 2 −2 ,
4 −2 −3 8 −4 −6
0x + 0y − 2z + x + y + z = −3
0x + y + 0z + 2x + 0y − z = 0
x − 5y − 6z + 3x + 3y + 3z = 4.
0 0 −2 1 1 1 x −3
0 1 0 + 2 0 −1 y =
0 .
1 −5 −6 3 3 3 z 4
152 • Mathematical Physics
3 0 −1 8 1 1 1 x
A= 0 1 5 , B= 5 2 −3=, h =
2 , and x y .
3 −2 3 0 1 −1 1 z
(the first index labels the row, the second the column) then aij is the
element of A in the ith row and jth column. An M × N matrix has M
rows and N columns. A square matrix has the same number of rows
as columns, i.e. has dimensions N × N.
Using this notation we can write down the rules above. For mul-
tiplication by a scalar B = λA we multiply all the elements of the
matrix by the scalar:
bij= λaij,
(x + y − z ) + ( +0 x + 0 y + 0z ) =−3
( 2x + y − z ) + ( −2 x − 2 y + 2z ) = 0 − ( −6 ) = 6.
( 4 x − 2 y − 3z ) + ( +0 x + 0 y + 0z ) =4
1 1 −1 0 0 0 x −3
2 1 −1 + −2 −2 2 y = 6 ,
0 0 z 4
4 −2 −3 0
or
1 1 −1 x −3
0 −1 1 y = 6 . (5.7)
4 −2 −3 z 4
1 1 −1 x −3 1 1 −1 x −3
R 2 → R2 − ( 2 × R1)
=
2 1 −1 y 0 = 0 −1 1 y 6 .
4 −2 −3 z 4 4 −2 −3 z 4
154 • Mathematical Physics
1 1 −1 x −3 1 1 −1 x −3
R 3 → R3 − ( 4 × R1)
=0 −1 1 y 6 =0 −1 1 y 6
4 −2 −3 z 4 0 −6 −1 z 16
1 1 −1 x −3
R 3 → R 3 − ( 6 × R 2 )
0 −1 1 y = 6
0 0 −5 z 20
x + y − z = −3
−y + z = 6 (5.8)
−5z = −20
1 1 −1 x −3
0 −1 1 y =
6
0 0 −5 z −20
R2 → ( 5 × R2 ) + R3
1 1 −1 x −3
R → −
0 −5 0 y =
3 R 3 / 5
10
0 0 1 z 4
R1 → R1 + ( R2 / 5 ) + R3
1 0 0 x 3
R 2 → − R 2 / 5
0 1 0 y =
−2
0 0 1 z 4
x + 2y + 3z = 10
2x + 5y + 8z = 24 (5.9)
3x + 8y + 14z = 39
Start from the left hand column using either of the rows above.
1 2 3 10 1 2 3 10
R 3 → R 3 − ( 2 × R 2 )
0 1 2 4 0 1 2 4 .
0 2 5 9 0 0 1 1
From the upper triangular form write out the equivalent
system and solve by back-substitution.
x + 2y + 3z = 10
y + 2z = 4
z=1
from which we see z = 1, and so y = 4 − 2 = 2 and x = 10 − (2 × 2)
− 3 × 1 = 3. Or,
z = 1, y = 2, x = 3.
x + y + 3z = 5
x + 6y + 10z = 17
−x + 4y − 5z = 8.
5.3. PRODUCTS
3 0 −1 8 24
0 1 5 5 = 5 ,
1 −2 3 0 −2
3 0 −1 1 2
0 1 5 2 = 7 ,
1 −2 3 1 0
3 0 −1 1 4
0 1 5 −3 = −8 .
1 −2 3 −1 4
3 0 −1 8 1 1 24 2 4
0 1 5 5 2 =−3 5 7 −8 .
1 −2 3 0 1 −1 −2 0 4
3 0 −1 8 1 1
=A 0 1 5 =
and B 5 2 −3 .
1 −2 3 0 1 −1
Find
(i) BA (the product of B and A in the indicated order)
(ii) AB
(iii) Does AB = BA in this case?
3 0 1
C 1 −1 1 ,
=
0 2 1
find
(i) C(AB),
(ii) (CA)B.
(iii) Verify that C(AB) = (CA)B in this case.
0 1 0 −i 1 0
=s 1 = , s2 = , s3
1 0 i 0 0 −1
Find a relation between s1s2 – s2s1 and s3.
K
c ij = ∑ aik bkj
k =1
b b12
a11 a12 a13 11 c 11 c 12
a b21 b22 =
21 a22
a23 c 21 c 22
b31 b32
where c11 = a11b11 + a12b21 + a13b31 (formed from the 1st row of A and
the 1st column of B as marked).
1 0 0
I = 0 1 0
0 0 1
1 0 0
0 1
IN =
0
0 0 1
160 • Mathematical Physics
Exercise 5.6
(i) For a general 3×3 matrix
show that IA = AI = A.
(ii) What is SA where S is the matrix
7 0 0
S = 0 7 0 ?
0 0 7
ATBT = (BA) T.
Note the order in which the matrices are multiplied. This result is
true for general matrices A and B.
Exercise 5.8
(i) The matrix
⋅ a b
⋅ ⋅ c
⋅ ⋅ ⋅
is antisymmetric. Fill in the missing elements.
(ii) For any matrices A and B, what are (AT)T and (A + B)T ?
(iii) Verify that if M is an arbitrary matrix, the matrices
1 1
2
( M + MT ) and
2
( M − MT ) are respectively symmetric
and antisymmetric. Deduce that every matrix is the sum of a
symmetric and an antisymmetric matrix.
5.6. INVERSES
Exercise 5.9 Are the following statements (a) true or (b) false?
1
2 0 0
2 0 0
(i) 0 − 1 0 is the inverse of ,
3 0 −3 0
0 0 0 1
0 1
5 0 −5 1 −1 2
(ii) 1 ,
−6 10 1 is the inverse of 1 2 1
25 −4 −1 2
7 5 3
1 −1 2 5 0 −5
(iii) 25 1 2 1 is the inverse of 1 .
−6 10
−4 −1 2 7 5 3
1 2 3
Example 5.3 Find the inverse of A= 2 5 8 .
3 8 14
We have to solve
1 2 3 x1 1
2 5 8 y1 = 0 etc.
3 8 14 z 0
1
To do this we construct a sequence of equivalent representations
that reduce the matrix to the identity matrix.
164 • Mathematical Physics
R2 − ( 2 × R1 )
1 2 3 1 0 0 1 2 3 1 0 0
R3 − ( 3 × R1 )
2 5 8 0 1 0 0 1 2 −2 1 0
3 8 14 0 0 1 0 2 5 −3 0 1
1 2 3 1 0 0
R 3 − ( 2 × R2)
0 1 2 −2 1 0
0 0 1 1 −2 1
R1 − ( 2 × R2 )
1 0 −1 5 −2 0
R 2 − ( 2 × R 3 )
0 1 0 −4
5 −2 ,
0 0 1 1 −2 1
and so finally we have
1 0 0 6 −4 1
R 1 + R3 0 1 0 −4
5 −2 . (5.10)
0 0 1 1 −2 1
The equivalent system is now
x1 6 x2 −4 x3 1
I y1 =
−4 , I y2 =
5 , I y3 =
−2 ,
z 1 z −2 z 1
1 2 3
from which we read off (x1, y1, z1) etc. It is clear that the matrix
to the right of the vertical bar in the final representation (5.10)
is the required inverse, i.e.
6 −4 1
−1
A =
−4 5 −2 .
1 −2 1
Matrices • 165
The rule for finding A−1 is therefore: convert the matrix (A|I)
by row operations of the above type to (I|B); if this can be done then
A−1 exists and is equal to B.
1 1 3
Exercise 5.10 Find the inverse B of A = 1 6 10 by the
−2 8 10
method indicated. Check that AB = I.
ax + by = h (5.11)
cx + dy = k (5.12)
a b x h
= . (5.13)
c d y k
dh − bk ak − ch
=x = , y .
(5.14)
ad − bc ad − bc
166 • Mathematical Physics
So
dax − bcx = dh − bk,
dh − bk
x= ,
ad − bc
as required. Similarly, to eliminate x multiply equation (5.11)
by c, multiply equation (5.12) by a, and then subtract.
1 h 1 h
x= ( d , −b ) , y = ( −c , a )
ad − bc k ad − bc k
or, combining these, as
x 1 d −b h
y = a k
(5.15)
ad − bc − c
Let
a b x h
A = , x = and h = .
c d y k
a b 1 d −b
A = , then we can write A −1 = .
c d ad − bc −c a
This is the general formula for the inverse of a 2 × 2 matrix.
Note that from this general form we can see that not all 2 × 2
matrices have an inverse. Namely, if ad − bc = 0, then A−1 will not
exist. (This is analogous to the fact that not all numbers have inverses;
namely 0 does not have an inverse because there is no number x
such that 0x = 1.)
To extend this to larger matrices we need to introduce the idea
of a determinant, which we shall do in the next section.
5.8. DETERMINANTS
a b
det =
ad − bc.
c d
Or, using more systematic notation,
a11 a12
A= → det A = a11a22 − a12 a21 . (5.16)
a21 a22
−3 −2
(iii) det
−4 −3
168 • Mathematical Physics
1 −1 0
5.5 If A 2
Example= 4 −3 , find det A.
−2 1 1
Expanding by the first row as in the definition (5.17) gives
4 −3 2 −3 2 4
det A =1 × det − ( −1 ) × det + 0 × det
1 1 −2 1 −2 1
= (1 × 7 ) + (1 × −4 ) + 0= 3.
Matrices • 169
Exercise 5.12
0 1 2
(i) Let the matrix B = −7 3 −4 . Verify that det B = −17.
1 2 3
(ii) For A as defined in Example 5.5, and B defined here, find
AB and verify that
det AB = det A · det B.
l a11 l a12
l A = .
l a21 l a22
So,
2 4 −3
Example 5.7 Find det 1 −1 0 .
2 1 1
First, interchange the first and second rows; then expand
by the top row. The zero means there are only two minors
Matrices • 171
2 4 −3 1 −1 0
det 1 −1 0 =−det 2 4 −3 =−3.
2 1 1
2 1 1
2 4 −3
4 −3
det 1 −1 0 =( −1)2 +1 ⋅ (1) ⋅ det
2 1 1
1 1
2 −3
+ ( −1 ) ⋅ ( −1 ) ⋅ det
2+2
2 1
=−7 + 4 =−3.
Note the signs here: recall from the definition that the
coefficient of aij has a factor (−1)i+j.
a11 l a11
Exercise 5.15 Show that det =0 and that
a12 l a12
a11 a12
det =0.
l a11 l a12
m m
a + a a12 + a22
det A =det 11 l 21 l .
a21 a22
Matrices • 173
m m
But we know from equations (5.18) that a11 + a21 =
0 =+
a12 a22 .
l l
Therefore
0 0
det A = = 0.
a21 a22
If μ ≠ 0 we instead add λ/μ×row 1 to row 2.
Property 7: If A and B are square matrices of the same size, then the
determinant of the product is the product of the determinants: det(AB)
= det A det B.
ax + by =
h a b x h
⇔ = .
cx + dy =
k c d y k
h b a h
det det
k d c k
=x = , y . (5.19)
a b a b
det det
c d c d
3 4
det = 3 ≠ 0,
6 9
so the system has a unique solution. By Cramer’s rule this is
5 4 3 5
det det
=9 9 9 6=9 −3
=x = = 3, y = −1.
3 4 3 3 4 3
det det
6 9 6 9
The two simultaneous equations could also be solved by the
elementary method of elimination.
3x + 4y = 5
.
6x + 8y = 9
3 4 5 R2 − (2 × R1) 3 4 5
6 8 8 0 0 −2
from which 0 = −2, a contradiction. The system therefore has
no solution.
Take another look at the two equations we began with. The left
sides are the same except for a factor 2, but the right sides are
not; hence they contradict one another.
3x + 4y = 5
.
6x + 8y = 10
We have that
3 4
det = 0,
6 8
so the system is singular. Using row reduction:
3 4 5 R 2 − (2 × R1) 3 4 5
6 8 10 0 0 0
Exercise 5.20
(i) Determine which of the following systems has a unique solution
2x − 3y + 3z = −2
2x − 2y + 6z = 1
x + λz = k
for (a) λ = 6, k = 1, (b) λ = 3, k = 3, (c) λ = 6, k = 7/2.
(ii) Use Cramer’s rule to find the unique solution.
(iii) Find all the solutions in case (c).
Matrices • 179
We have stated the theory in terms of 3×3 systems, but the argu-
ment applies to any n×n homogeneous system (i.e. any system of
the form Ax = 0 with A an n × n matrix. In particular if A is a num-
ber, a say (a 1 × 1 matrix) then ax = 0 has only the trivial solution
x = 0 if a ≠ 0 and infinitely many solutions if a = 0. The statements
above are just the (important!) generalizations of this result to simul-
taneous equations.
Let
a11 a12 a13
A = a21 a22 a23 .
a a32 a33
31
In Section 5.8, we defined the minor of an element of A. For
example,
a22 a23
• the minor of a11 is det a ,
32 a33
180 • Mathematical Physics
a21 a23
• the minor of a12 is det
a31 a33
and so on. In general, to obtain the minor of aij we cross out the
ith row and jth column from A and take the determinant of what
remains. We now define the cofactor of aij, call it cij:
cij = (−1)i+j × Mij where Mij is the (i, j) minor of A.
So
a22 a23 a21 a23
c 11 = det , c 12 = −det .
a32 a33 a31 a33
and so on.
Using cofactors the determinant for a 3 × 3 matrix can be written
In fact this result holds for any square matrix. From it we get a
formula for A−1:
1
A −1 = adj A. (5.23)
det A
we get
A−1Ax = A−1h.
x = A−1h
as the solution of the system. Thus, we see again that the system has
a unique solution if det A ≠ 0. This is what we set out to understand
in Section 5.1 (Equation 5.6).
Ax = λx (5.24)
where A is a given matrix and λ some real number. For example, if
A is a 2 × 2 matrix, this system would be explicitly
a11x + a12y = λx
a21x + a22y = λy (5.25)
or equivalently
a11 x + a12y − λx + 0y = 0
a21 x + a22y + 0x − λy = 0. (5.26)
182 • Mathematical Physics
a11 a12 x −l 0 x 0
a + =
21 a22 y 0 −l
y 0
Ax − λIx = 0
or
(A − λI)x = 0, (5.27)
0 1 −1
A= 1 2 1 .
−1 1 2
Matrices • 183
λy
Ax = λx
y
x
x λx
FIGURE 5.1: Illustration of an eigenvector. The vector x is (left) multiplied by the matrix A
and the result is another vector in the same direction as x but scaled in magnitude by a factor
λ (its eigenvalue). If x is an eigenvector of A (with eigenvalue λ), then so is any multiple of this,
i.e. αx is also an eigenvector (and it has the same eigenvalue). We often use unit eigenvectors,
which are scaled to have unit length (x • x = 1).
0 1 −1 l 0 0
det 1 2 1 − 0 l 0
−1 1 2 0 0 l
−l 1 −1
=det 1 (2 − l ) 1 =0.
−1 1 ( 2 − l )
Write out the determinant by expanding out the top row, i.e.
Do not expand out the cubic – you will not be able to factorize
it (unless you are given or can guess one solution). Thus,
−l 1 −1 x 0
1 (2 − l ) 1 y = 0 .
−1
1 ( 2 − l ) z 0
The eigenvectors corresponding to λ = 3 therefore satisfy
−3 1 −1 x 0
1 −1 1 y = 0 ,
−1 1 −1 z 0
which we can write as
−3x + y − z = 0,
x − y + z = 0,
−x + y − z = 0.
( A − I) x= ( A − 2I) x=0,
i.e.
−2 1 −1 x 0
1 0 1 y = 0 ,
−1 1 0 z 0
which gives
–2x + y – z = 0,
x + z = 0,
–x + y = 0,
1 −4 4
−4 −1 0 .
4 0 3
Find the other eigenvalues and the corresponding eigenvectors.
y´ P
θ x´
C
D
E
Q
θ
O x
A
FIGURE 5.2: Rotation of coordinates in the plane. A point P given in coordinates (x, y). The
same point is at coordinates (x¢, y¢) in axes that are rotated (counterclockwise) by an angle q
from the original axes.
cos (q ) sin (q )
R= .
− sin (q ) cos (q )
Alternatively, we may think of this as a transformation of the
point P with coordinates (x, y) to a new point P¢ with coordinates
(x¢, y¢) in a fixed coordinate system by rotation about the origin
through the angle −q:
x x′
y → y′ .
Exercise 5.23
(i) Show that RRT = I and RTR = I, i.e. R−1 = RT.
(ii) What matrix would represent the transformation of a point
by rotation about the origin through an angle +q in a fixed
coordinate system?
188 • Mathematical Physics
12 3 2
R= .
− 3 2 1 2
2 3 −1 3 2 3
2 3 2 3 −1 3
−1 3 2 3 2 3
1 0 0
P = 0 1 0
0 0 0
effects a projection onto the (x, y) plane, since it gives
x¢ = x, y¢ = y, z¢ = 0.
So any point (x, y, z) becomes
x x′ = x
y → y′ = y .
z z′ = 0
Note that det P = 0. A transformation represented by a singular
matrix P is a projection if P2 = P (i.e. repetition of the transformation
has no further effect).
Figure 5.3 shows an example using the projection matrix
1 0 0
P = 0 1 0 .
0 0 0
x
FIGURE 5.3: Left: an example of orthogonal projection. Points are projected onto the (x, y)
plane using the matrix P.
190 • Mathematical Physics
0 1
R=
1 0
(x, y)
x
(x-y, 0)
FIGURE 5.4: In general a projection will not be orthogonal. For example the matrix Q repre-
sents the projection in two dimensions as shown in the figure.
since then
x′ 0 1 x y
= y′ =
1 0 y x
so the transformation of a point (x, y) is
x y
y → x .
x′ 1 0 0 x
′
y = 0 1 0 y .
z′ 0 0 −1 z
Matrices • 191
Revision Notes
5.17. EXERCISES
1 0 2
1. If 0 1 3 find det A, and hence (or otherwise) show
−1 1 1
that the equations
x + 2z = 0
y + 3z = 0
−x + y + z = 0
have non-trivial solutions.
Matrices • 193
2x + 3y + 5z = 7
x + 7y + z = 6
3x + y + 2z = 9.
x + y + 2z = 1
2x + 2y − 2z = 3
−4x + 2y + 4z = 5.
1 0 −1 3 −2 1
6. Given A = 2 1 0 and B = 3 0 1 find
−1 2 −1 2 0 1
det(AB).What is det(BA)?
194 • Mathematical Physics
0 −1 2 0
7. Evaluate the determinant 4 0 0 1 .
0 2 2 −1
−1 −1 1 1
1 0 −1 0 1 3
8. If A = 2 2 5 and B = −1 1 1 what are AB, AT,
3 −1 2 −2 1 0
(ATBT)T where AT denotes the transpose of A?
1 0 4 1 −1 −2
9. If A = −1 1 0 and B = 0 0 1 find AB and
3 1 −1 4 0 1
verify by explicit calculation of both sides that det AB = det A
det B.
1 2
10. What are the eigenvalues of the matrix ?
2 3
1 2 3
11. Verify that is an eigenvector of and find a
1 3 2
second linearly independent eigenvector.
12. The matrix A and its eigenvalues, λ, and eigenvectors, x,
satisfy Ax = λx. Deduce the eigenvalues of A−1.
13. A linear transformation is represented by
x′ −1 1 x
y′ = 2 3 y .
ind the inverse transformation and hence the equation of
F
the line 2y + x = 0 in the (x¢, y¢) coordinate system.
1 0 −1
14. Find the eigenvalues of the matrix Q = 0 2 0 and find
−1 0 −1
the eigenvector corresponding to the eigenvalue λ = 2.
Matrices • 195
5.18. PROBLEMS
1 3 0
A = 3 −2 −1
0 −1 1
are 1, 3, and −4, and find corresponding eigenvectors.
Verify that these eigenvectors are mutually orthogonal.
If the eigenvectors are (x1, x2, x3), (y1, y2, y3), (z1, z2, z3) and
a matrix P is defined as
x1 x2 x3
P = y1 y2 y3
z z2 z 3
1
show that PAPT is a diagonal matrix.
2. Reduce to triangular form the linear equations
4x + 3y + 2z = 1
w + 4y + 3z = 2
2w + x + 4z = 3
3w + 2x + y + λz = u
where λ, u are any real numbers. For what values of λ, u
will the equations have a solution? When is the solution
unique? Taking u = 4, what is the solution for the cases
λ = 0 and λ = 5?
196 • Mathematical Physics
x + 3y + 2z = 4
2x + y + 3z = 5
3x + 2y + (1 + μ)z = 0
where μ is a constant, and hence show that there is a single
value of μ for which the equations have no solution. Find
the value of μ which makes y = 0, and find the correspond-
ing values for x and z.
4. The coordinates (x, y, z) of a point, P, satisfy the determi-
nantal equation
x y z 1
1 0 1 1
= 0.
2 1 0 1
1 3 2 1
By working out the determinant to express the equation
in a simpler form, show that P must lie on a certain plane.
Verify that the point with coordinates (1, 3, 2) lies on this
plane, and explain how this could be shown directly using
the determinantal form of the equation.
1 0 2
A 0 −1 0 satisfies the equation
5. Show that the matrix=
3 0 1
2 1 1
A = 0 2 3
0 0 5
satisfies the matrix equation
6
DIFFERENTIAL
EQUATIONS 1
d2 y
2
= −w 2 y. (6.2)
dt
d3 y d2 y dy
3
− 3 2
+ 4 + 2y =
40. (6.3)
dt dt dt
d2 u
+ u = k + 3 u2 . (6.4)
dq 2
dr
dt
( )(
=Ar 1 2 + 1 Br 1 2 + 1 . ) (6.5)
d 5
dx
{
x f ( x) 1 + y = }
5 f ( x ) x4 .
(6.6)
Example 6.2 What are the orders of equations (6.1) and (6.3)?
Equation (6.1) is first order because it contains only first
derivatives (i.e. dy/dt).
Equation (6.3) is third order because the number of the highest
derivative is 3 (i.e. d3y/dt3).
u¢¢ + u = k + 3u2.
dy
Example 6.3 Solve for y(x) the differential equation = x2
dx
Step 1: In this simple case we can integrate both sides
directly to find y(x)
x3
y ( x=
) ∫ x dx=
2
+C
3
where C is a constant.
Step 2: We can then write our solution to the differential
equation as
x3
y ( x=
) +C
3
dy ( x ) d x 3
= + C=
x
2
dx dx 3
Do not omit step 3 to save time. On the contrary, it can save a lot of
wasted work if you pick up a mistake at this point.
Exercise 6.2 Solve the following, and check your solutions:
dy x
(i) =e ,
dx
dy
(ii) = sin ( t ) .
dt
Differential Equations 1 • 203
dy
= f ( x ) g ( y) , (6.7)
dx
dy
∫ g ( y) = ∫ f ( x ) dx. (6.8)
It should then be possible to find the solution y(x) that satisfies the
differential equation.
Note that we cannot write equation (6.7) as y = g(y) ∫ f(x)dx as
many begining students are tempted to do, because y and hence g(y)
is a function of x.
dy
∫ y dx dx = ∫ x dx ,
or
∫ y dy = ∫ x dx .
204 • Mathematical Physics
y2/2 = x2/2 + C.
(e.g. y1/2) or any functions of y (e.g. sin(y) or ey), then the equation
is non-linear.
y¢ + P (x)y = Q(x),(6.10)
Differential Equations 1 • 207
and
where P(x), Q(x), F(x), G(x) and H(x) are referred to as the coeffi-
cients of y and its derivatives, and are functions of x only (or in some
cases are replaced by constants; see Section 6.7). For most of the
rest of this chapter the equations we shall look at will be linear and
usually of first or second order.
y¢¢ = − w2 sin(wx).
d2
2 {
A sin (w x ) + B cos (w x )} + w 2 ( A sin (w x ) + B cos (w x ) )
dx
d2 d2
= A 2 {sin (w x )} + w 2 sin (w x ) + B 2 {cos (w x )} + w 2 cos (w x )
dx dx
= A ( −w 2 sin (w x ) + w 2 sin (w x ) ) + B ( −w 2 cos (w x ) + w 2 cos (w x ) )
=0
Exercise 6.5
(i) Verify that y = 1 and y = t are two solutions of d2y/dt2 = 0 and
that y = A + Bt is a solution (A and B constants).
(ii) Let y = y1(x) and y = y2(x) be two solutions of y¢¢ + F(x)y¢ +
G(x)y = 0. Show that Ay1 + By2 is also a solution for arbitrary
constants A, B.
(iii) Given that ex and e−x are solutions of d2y/dx2 − y = 0,
explain why it follows that sinh(x) and cosh(x) are also
solutions.
Differential Equations 1 • 209
d
dx
{ Ax3 } = 3 Ax2 . (6.14)
This is a solution if 3Ax2 = x2, i.e. if A = 1/3. (Of course, the gen-
eral solution still involves a constant of integration C.) Similarly, if
you guess that the integral is some power of x but cannot remember
which one, you could try y = Axn where both A and n are to be deter-
mined by substitution.
y¢ = nAxn−1 = x2.
A = 1/n and n − 1 = 2,
n = 3, and a = 1/3
y = x3/3 + C.
We can use this same approach not only where we have forgotten
an exact integral but where we perhaps never knew one, as long as
we can guess the correct form of the solution. This may seem rather
mysterious the first time you see it, but it is not really mystery – it
is experience. You will gain this experience by working through this
chapter. If you are mathematically minded you will soon remem-
ber what sort of functions satisfy what sort of equations. If you are
physically minded you will use physical intuition to guess what sort
of behavior to expect from a given system (as represented by a dif-
ferential equation).
But what happens if we make the wrong guess? Let us see.
Suppose that, instead of trying y = Ax3 in equation (6.14), we guess
y = Ax4 with A a constant. Then dy/dx = 4Ax3 = x2 if A = 1/4x. This
is impossible. (We differentiated y assuming A was a constant and it
has turned out not to be.) So the guess is wrong and we stop here.
The failure tells us that we must start again with a better guess.
The final problem is that the “guesswork” does not feel like “real
math” and we would naturally prefer to be given an algorithm to fol-
low in order to solve these types of equation (i.e. a set of instructions
that always works). There is no algorithm. That is okay; after all inte-
gration is a process of educated guesswork too, and we have learned
to cope with that. Solving a differential equation involves a similar
process of educated guesswork, based on a correct identification of
the type of equation to be solved.
Differential Equations 1 • 211
Exercise 6.6
(i) By trying a solution of the form y = A cos(px) solve
y¢ = sin(2x) for y(x).
(ii) By trying a solution of the form y = (Ax+B)e2x solve y¢ = 3e2x
for y(x).
y = Aeλx,
y = Ae2x + Be−2x,
Differential Equations 1 • 213
y = Aeλx,
and therefore
λ2 − λ − 6 = 0.
( λ − 3) ( λ + 2) =0,
λ = −2 or λ = 3.
214 • Mathematical Physics
y = Ae−2x + Be3x,
y = Aeλx,
λ2 − 3λ − 4 = 0.
Differential Equations 1 • 215
(λ − 4)(λ + 1)=0,
λ= 4 or −1
y = Ae4x + Be−x
Note that in these cases where one of the roots of the auxiliary
equation is positive the general solution is unstable since the solu-
tion will tend to infinity as x → ∞.
Exercise 6.7
(i) Show that the equation y¢¢ + 7y¢ + 12y = 0 has the general
solution y = Ae–4x + Be–3x.
(ii) Show that this can also be written as y = e –7x/2(P cosh(x/2) +
Q sinh(x/2)) with P = A + B, Q = B – A.
Here we are going to look at the special case where we find λ has two
equal values when we solve the auxiliary equation, or repeated roots.
Let us look at an example, following the same steps as previously.
y = Aeλx.
λ2 + 4λ + 4 = 0.
y = Ae−2x + Be−2x
y¢ =u¢(x)e−2x − 2u(x)e−2x,
y¢¢+4y¢+4y = [u¢¢(x)−4u¢(x)+4u(x)]+4[u¢(x)−2u(x)]+4[u(x)] = 0.
y = (Ax + B)e−2x.
Step 3: Since this must hold for all x we can compare coefficients
of x0 (= 1) and coefficients of x1 (= x). So we have that
12C = 24, and
7C + 12D = 26.
yPI = De2x.
1
=
30D 5=
or D
6
220 • Mathematical Physics
1
y PI = e 2 x .
6
As with Case 1, sometimes this method will fail. The left hand
side equals zero if emx is a solution of the homogeneous equation (i.e.
the solution to y¢¢ + 7y¢ + 12y = 0, found by solving the auxiliary
equation), in which case we cannot solve for D.
Key point: If using yPI = Demx makes the left hand side = 0, then
the next guess is yPI = Dxemx. If this fails in the same way, the correct
approach is to multiply through by x and try again. So, next we would
try yPI = Dx2emx, and so on until the method works.
7 1
=y cos ( 3 x ) − sin ( 3 x ) .
50 50
Note that even though H(x) in the example above involves only
sin(3x) (and with integer coefficient −3) we find that the PI involves
both cos(3x) and sin(3x) (and unwieldy fractions).
We have already found PIs for when the right hand side is
24x + 26 and 5e2x. The solutions have been found in Examples
6.12 and 6.13. The required PI is therefore the sum of the two
yPI solutions for those examples
1 2x
yPI = 2 x + 1 + e
6
yPI = 2x + 1.
224 • Mathematical Physics
4C − 8C + 4C = 0,
−8C + 8C = 0,
2C = 2.
yPI = x2e−2x.
Step 3: Write down the general solution, which is the sum of the
CF and PI
dq
R +q C =
V0
dt
226 • Mathematical Physics
Rp + 1/C = 0
therefore
ln ( x −2 ) 1
=
R ( x ) e=
−2ln ( x )
e= .
x2
The method works for every first order linear equation. The key to
the method is remembering how to find the integrating factor.
Proof: We need to verify that multiplying through by the integrating
factor R(x) leads to a left hand side that can be put in the form of the
derivative of a product. According to the rule to be proved, we first
take the integrating factor
d
dx
(R=
( x ) y ) R ( x ) y ′ + R′( x ) y,
= R ( x ) y ′ + R ( x ) P ( x ) y.
out each time. You can use the result directly as in the following
example.
y¢¢ + y = 0 (6.21)
=n 0=
n
n +n − 2
y″=
+y
n 0
∑ ( n + n )( n + n − 1) a x + ∑ a=
nx
n
0.
x0 : 2a2 + a0 = 0
x1 : 3 × 2a3 + a1 = 0
xn : (n + 2)(n + 1)an+2 + an = 0
Thus
an an − 2
an + 2 =
− = ...
=
(n + 2)(n + 1) (n + 2)(n + 1)n(n − 1)
(−1)n / 2 a0
(n + 2)! n even
= ( n −1 ) / 2
(−1) a1
n odd
(n + 2)!
We deduce that
x2 x 4 x3 x 5
y= a0 1 − + + ... − a1 x − + + ... ,
2! 4! 3! 5!
d 2 y dy
x2 2
+ x + ( x 2 − n2 ) y =
0
dx dx
which is satisfied by
∞
( −1)m x
2m+n
y ( x ) J=
= n ( x) ∑
m =0 Γ ( m + 1) Γ ( m + n + 1) 2
.
232 • Mathematical Physics
d 2 Hn
+ (2 n + 1 − x2 )Hn =0
dx 2
d 2 Ln dL
x 2
+ (1 − x ) n + nLn =
0
dx dx
which, for integer n, has the polynomial solutions
2n + 1 − x n
L0 =
1, L1 ( x) =
1 − x, Ln+1 ( x) = Ln ( x ) − Ln−1 ( x ) .
(n + 1) n+1
d 2 Pl d 2 Pl
(1 − x2 ) − 2 x + l (l + 1)Pl ( x) =
0,
dx2 dx
where l is an integer. (The use of l instead of n is conventional, arising
from the association of Legendre polynomials with angular momen-
tum, which is customarily denoted by l in quantum mechanics.) The
polynomial solutions are
Differential Equations 1 • 233
(2 l − 1) (l − 1)
=
P0 ( x) 1,=
P1 ( x) x=
, Pl ( x) xPl −1 ( x) − Pl − 2 ( x).
l l
Revision Notes
7
COMPLEX NUMBERS
(including those that, like π but unlike 2, are not solutions of alge-
braic equations with integer coefficients), together with the ratio-
nals, make up the real number system.
Now to the main point. The real number system is not large
enough to allow us to solve every quadratic equation. For example,
the quadratic equation
z2 + 1 = 0
has no real roots (see Figure 7.1). This is the same as saying the
equation
z2 = −1 (7.1)
cannot be solved by a real number. We therefore introduce another
new type of number, call it −1 or i.
The “number” i is defined such that i2 = −1.
You can’t say what i really is except through this definition. The
two solutions of equation (7.1) are then z = ±i. It turns out that the
introduction of i enables us to take the square root of any number,
hence to solve any quadratic equation. (In fact, the introduction of
i is sufficient to solve any polynomial equation, not just quadratics,
as we shall see later.)
f (z)
6
4
2
–2 –1 1 2 z
FIGURE 7.1: Graph of the quadratic function f (z) = z2 + 1. As this does not cross the z-axis,
there are no real z values for which f (z) = 0; the quadratic equation z2 + 1 = 0 has no real roots.
Complex Numbers • 237
z2 = 4 × (−1).
So,
z =± 4 × ( −1 ) =± 2 × i =± 2 i.
which is true if
( Z − 1 )2 =
−4
Z − 1 =± 2 i
z= 1 ± 2 i.
z = − ( −2 ) ± ( 4 − 20 ) / 2
= 2 ± ( −1 ) × 16 / 2
= 1 ± 2i
238 • Mathematical Physics
Exercise 7.1 Find, in the form x + yi, all solutions of the quadratic
equations
(i) z2 − 6z + 25 = 0
(ii) 2z2 + 2z + 5 = 0.
u2 = 25 ⇒ u = ±5
2uv = 30 ⇒ v = ±3.
Put only the corresponding values together
Arithmetic
Let z1 = x1 + y1i and z2 = x2 + y2i be two complex numbers. In the
following list we put the result of an algebraic operation on z1 and z2
in standard form (x + yi).
= z2z1.
Exercise 7.4 By working out the real and imaginary parts of both
sides, show that z3(z1 + z2) = z3z1 + z3z2 (multiplication of complex
numbers is “distributive”).
Complex Conjugate
If z = x + yi we define the complex conjugate of z to be
z* = x − yi. (7.2)
= x2 − (y)2(−1)
= x2 + y2.
Division
The procedure for division of two complex numbers is similar to
that for multiplication. We write the complex numbers in standard
form, z1 = x1 + y1i and z2 = x2 + y2i, take the ratio and manipulate the
result into a simpler form.
z1 x1 + y1 i x1 + y1 i x2 − y2 i
= = × .
z2 x2 + y2 i x2 + y2 i x2 − y2 i
∗
The trick is to multiply top and bottom by z= 2 x2 − iy2 , as in the
expression on the right, which will give an expression with a real num-
ber as the denominator
x1 x2 + y1 y2 y1 x2 − x1 y2
= + 2 2 i
x22 + y22 x2 + y2
5+ i
Example 7.6 Express in the standard form x + yi.
3 + 2i
We multiply top and bottom by the complex conjugate of the
bottom
5 + i 5 + i 3 − 2i
= ×
3 + 2i 3 + 2i 3 − 2i
15 + 3 i − 10 i − 2 i2
=
32 + 2 2
17 − 7 i 17 7
= = − i.
13 13 13
Complex Numbers • 243
2 − 5i
Exercise 7.7 Write in the standard form x + yi.
2 + 3i
( z1 / z2 ) z1* / z2* if z2 ≠ 0.
*
Exercise 7.8 Show that=
1 1 z* x −y
= = + 2 2 i. (7.3)
*
z z z x +y x +y
2 2
Hence every complex number z (≠ 0) has an inverse. From this
it follows that if z1z2 = 0, then z1 = 0 or z2 = 0. (Proof: suppose z1 ≠ 0;
then we can multiply through by 1/z1 to get z2 = 0.)
z1 z1 z2* x1 x2 + y1 y2 y1 x2 − x1 y2
6. Division:= = + 2 2 i
z2 z2 z2* x22 + y22 x2 + y2
1 1 z* x −y
7. Inverse:= = + 2 i .
z z z x + y x + y2
* 2 2
244 • Mathematical Physics
−b ± (b 2
− 4 ac )
(7.4)
z= ,
2a
and if 4ac > b2 you use i to take the square root of the negative number.
Suppose, however, that one or more of the coefficients a, b or
c is complex. We can still use this formula. But is the square root of
a complex number a complex number, or do we have to introduce
further new numbers into our scheme? Let us start with the simplest
example, the square root of i itself.
( x + yi )
2
z=
2
= x2 − y2 + 2 xyi = i
x=
+ 1 / 2, and y =
+ 1 / 2, or
x=
− 1 / 2, and y =
− 1 / 2.
Therefore
1 i 1 i
z =+ or z=
− −
2 2 2 2
Complex Numbers • 245
z2 = (x + yi)2
= x2 − y2 + 2xyi = −5 + 12i
Equate real and imaginary parts and solve for x and y
x4 + 5x2 − 36 = 0,
(x2 + 9)(x2 − 4) = 0.
∴ x2 = −9 or x2 = 4
x2 = −9 is impossible for x real. Hence x = +2, and y = 6/x = +3,
or x = −2 and y = 6/x = −3. Therefore
z = 2 + 3i or z = −2 − 3i.
(x + y2 ) =( x2 − y2 ) + ( 2 xy )
2 2 2 2
( −5 ) + (12 ) =
=
2
169
2
(x 2
+ y2 )= 169= 13
Exercise 7.10 Use one of these methods to find the square root
of 5 + 12i.
Since we now know how to find the square roots of a complex
number we can now solve any quadratic by using the standard for-
mula. We know az2 + bz + c = 0 has roots
−b ± (b2
− 4 ac )
z= . (7.5)
2a
(2 − i) ± (2 − i) − 4 (1 ) ( 2 − 4 i )
2
z=
2
( 2 − i ) ± ( −5 + 12 i )
1/ 2
= .
2
Complex Numbers • 247
Therefore
( 2 − i) ± ( 2 + 3i)
z=
2
z=
2 + i or z=
− 2 i.
y y
z1+ z2
4
z1
2
2
–4 –2 2 4 x
z2
–2
2 4 x
–4
z = 3 – 4i
(a) (b)
FIGURE 7.2: (a) Argand diagram for the complex number z = 3 − 4i. (b) Argand diagram
showing the addition of two complex numbers z1 + z2. Note that the real part is along the
horizontal axis (x = Re(z)), the imaginary part is along the vertical axis (y = Im(z)).
Figure 7.2(b) shows an Argand diagram for the sum of two com-
plex numbers. The real and imaginary parts each add to form a third
complex number, just as the components of (Cartesian) vectors add
in two-dimensional space. Figure 7.3(a) shows an Argand diagram
for a complex number and its conjugate. When we form the con-
jugate we swap the sign of the imaginary part, which is a reflec-
tion about the x-axis in the Argand diagram. Figure 7.3(b) shows
an Argand diagram for a complex number z and the same number
multiplied by i repeatedly: z, zi, −z and −zi. Each multiplication by i
rotates the point in the Argand diagram by 90 degrees (π/2 radians)
about the origin, in the counterclockwise direction. Clearly there is
a close connection between complex numbers and two-dimensional
geometry, especially rotations and reflections.
y y
z z
4
iz
2
2
−−4 –2 2 4 x −−4 –2 2 4 x
–2
–2
–iz
−−4
−−4
z* –z
(a) (b)
FIGURE 7.3: (a) Argand diagram for the complex number z = 2 + 4i and its conjugate z* =
2- 4i. (b) Argand diagram showing the complex number z = 2 + 4i and the complex numbers
obtained by repeated multiplication by i: z, zi, −z and −zi.
Therefore
|z1z2| = |z1||z2|.
Exercise 7.14 Show that, for any complex numbers z1 and z2,
z1 z
= 1 .
z2 z2
( )
2
(ii) z =− 3 + i ,
1 + 3i
(iii) z = .
1+ i
y
z
y z
π/4 + 2π
r π/4
x
θ −3π/4
x
−z
(a) (b)
FIGURE 7.4: (a) A complex number z in the Argand diagram, showing its modulus and
argument (r, q) and its real and imaginary parts (x, y), which are polar coordinates and Car-
tesian coordinates in the Argand diagram, respectively. (b) The Argand diagram for z = 1 + i
and − z = −1 − i. The argument is defined up to multiples of 2π. The argument of z is π/4 or
π/4 + 2π = 9π/4 etc.; the argument of −z is − 3π/4 or −3π/4 + 2π = 5π/4 etc.
From Figure 7.4(a) we see that (|z| , arg(z)) are the polar coor-
dinates of z in the Argand diagram. So write r = |z|, 𝜃 = arg(z). Then
x = r cos(𝜃), y = r sin(𝜃) and we get the polar representation of a
complex number
(1 + 1 )
z=
1/ 2
=2.
1 i
=z 2 + ,
2 2
p p
= 2 cos + i sin .
4 4
q2 q3
= 1 − + ... + i q − + ... ,
2! 3
and notice these are the series expansions of cos(𝜃) and sin(𝜃).
= cos(𝜃) + i sin(𝜃).
1 iq
(q )
Example 7.13 Show that cos=
2
( e + e− iq ) .
e−i𝜃 is obtained from ei𝜃 either by complex conjugation
(replacing i by −i) or by letting q → −q.
Hence
eiq + e−ie = 2cos(q)
1 iq
(q )
∴ cos=
2
( e + e− iq )
Exercise 7.20 Show that ei𝜃1 = ei𝜃2 if (and only if) q2 = q11+2nnπ
(n = 0, ±1, ±2 ...).
z = rei𝜃 (7.13)
where r = |z| and q = arg(z). This form is very useful for taking
products and roots of complex numbers as the following examples
show.
Example 7.15 If z1 = r1 eiq1 and z2 = r2eiq2 find z1z2, |z1 z2| and
arg(z1z2).
We see from this example that to form the product of two com-
plex numbers we multiply the moduli and add the arguments.
iq z z
Exercise 7.22 If z1 = r1 e iq and z2 = r2 e 2 (z2 ≠ 0) find 1 , 1
1
z2 z2
and arg z1
z2
See Figure 7.5. (Remember the ±: there are two square roots!)
y z
2
r
1
θ z1/2
θ/2 x
–2 –1 1 2
z1/2 r1/2
–1
–2
FIGURE 7.5: The Argand diagram showing a complex number z and its two square roots, z1/2.
valid for n = 0, ±1, ±2, …. This can be used to give a proof of various
complicated-looking identities.
Equating real parts (we have no further use for the imaginary
parts)
cos3 (𝜃) − 3cos(q) sin2(q) =cos(3q).
Replace sin2(q) by 1 − cos2(q) and we have
1
4 cos3 (q ) = ( e iq ) + 3 ( e iq ) ( e− iq ) + 3 ( e iq )( e− iq ) + ( e− iq )
3 2 2 3
2
1 3 iq
= e + 3 e iq + 3 e− iq + e−3 iq
2
1 3 iq 3
= e + e−3 iq + e iq + e− iq
2 2
= cos ( 3q ) + 3cos (q ) .
Exercise 7.25 Let Cn stand for the sum 1 + cos(q) + cos(2q) + ...
+ cos[(n − 1) q]. The aim of this question is to find a closed-form
expression for the sum of this series to n terms.
(i) Use de Moivre’s theorem to show that Cn is the real part of
1 + e iq + ( e iq ) + + ( e iq )
2 n −1
.
e inq − 1 e
=
inq / 2
( einq / 2 − e− inq / 2 )
e iq − 1 e iq / 2 ( e iq / 2 − e− iq / 2 )
and hence that
( n − 1 ) sin ( nq 2 )
1 + cos (q ) + cos ( 2q ) + + cos ( ( n − 1 )q ) =
cos q
2 sin (q 2 )
This section has shown that one of the many uses of complex
numbers is in obtaining real trigonometrical identities. These can
also be established without the use of complex numbers, but often
not so simply.
258 • Mathematical Physics
z = 1, e2πi/3, e4πi/3.
Proof:
Let
P (z) = anzn + an−1zn−1 + ... + a 0,
Complex Numbers • 259
where a0, a 1 ,... an are real coefficients. Taking the complex con-
jugate of this equation gives
( P ( z=
)) a∗n ( zn ) + a∗n−1 ( zn−1 ) + + a0∗ ,
∗ ∗ ∗
Exercise 7.27 Why does this proof fail if an, an−1,..., a0 are not
real?
∫ f ( z ) dz = 0
where the integral is taken around a circle in Ω with center z0.
(To do the integral around a circle of radius R put z = z0 + Reiq, dz =
iReiq dq and integrate with respect to q from 0 to 2π. The theorem
is true for any closed curve, not just circles, but the technique for
evaluating the integral is then more complicated.)
The theorem can be used to prove Cauchy’s integral formula for
an analytic function:
1 f ( z)
f ( z0 ) = ∫
dz.
2p i z − z0
where z lies within the annulus. (If f(z) is analytic within a disc
centered on z0, that includes z0, then the bn will all be zero.) Cauchy’s
formula can be used to show that
1 f ( z) 1 f ( z)
=an = ∫
2p i ( z − z0 ) n +1
, bn ∫
2p i ( z − z0 )− n+1
,
1
An annulus is the region between two concentric circles.
262 • Mathematical Physics
where the integrals are taken around closed curves within the
annulus A, and that
∫ f ( z ) dz = 2p ib .
1
(7.16)
Revision Notes
7.12. EXERCISES
1. Write in the form x + iy, where x and y are real, the com-
plex number u/v, where u = 1 + 7i, v = 1+ i, and verify that
(i) |x + iy| = |u| / |v|, (ii) x – iy = u* / v*.
2. Write in the form x + iy, where x and y are real, each of the
complex numbers w = (2 + 3i) / (1 + i), z = 3eiπ/4, and their
conjugates w*, z*.
3. Write the complex number w =−1 + i 3 in the form rei𝜃
where r > 0 and −π < q < π, and indicate its position on the
Argand diagram.
4. Show the positions of the circle |z| = 5 and of the particu-
∗
lar complex number z0 = −3 + 4i and of z0 in the Argand
diagram, and also indicate the approximate positions of the
two square roots of z0.
5. Show the positions of the circle |z| = 5 and of the particular
complex number z0 = 5 − 12i in the Argand diagram, write
down the values of |z0| and tan(arg z0), and indicate the
approximate positions of the two square roots of z0.
6. Indicate the positions of the complex numbers z1 = 1 + i
and z2 = 2 i in the Argand diagram, verify that they are at
an equal distance from the origin, and explain why
arg(z1 + z2) must be 3π/8.
7. Write the complex number z = − 1 + 2i in the form rei𝜃
where r = |z| > 0 and −π < q < π, expressing q in terms of
the number a = tan–1 2. Show that z* = 5 / z. Also indicate
the positions of z, z* and z2 on the Argand diagram.
8. Express each of the complex numbers z1 = 1 − i and z2 = i
in the form rei𝜃 where r > 0,−π < q < π, and indicate their
positions in the Argand diagram. What is z3, if 0, z1, z2, z3
form a parallelogram?
9. Express the complex number (1 + i) / (3 + 4i) in the form
x + iy where x and y are real. Show that if it is written in the
form reiq where r > 0 and−π < q ≤ π then r = 2 / 5 , and
determine q correct to two decimal places.
264 • Mathematical Physics
7.13. PROBLEMS
and also give the real part of ez. By taking the imaginary
part of the expansion of ez in ascending powers of z, deduce
from (7.17) that
r 2 sin ( 2q ) r 3 sin ( 3q )
er cos(q ) sin(r sin (q ) =
r sin (q ) + + + ....
2! 3!
(7.18)
Complex Numbers • 265
Cn = (2 cos(ø))n cos(nø).
2p 4p
(iii) If <q < what is the behavior of Cn as n → ∞?
3 3
6. Verify that if x and y are real numbers with y > 0, and
1 1/ 2
( ) 1 1/ 2
p=
2
x + x 2
+ y 2 1/ 2
and =
q
2
− x + ( x 2
+ y )
2 1/ 2
( )
z =−2 + 2 i 3 are ± 1 + i 3 . Verify this by writing z in the
form reiq and finding r1/2eiq/2.
3
Solve the quadratic equation z2 + 2 z + 1 − i = 0.
2
7. Verify that z = i is a root of the equation
z3 + (2 + i)z2 − (1 + 4i)z − 2 + 3i = 0,
and hence find the other roots. Indicate the positions of the
roots in the Argand diagram.
N −1
8. Evaluate the sum ∑ exp ( 2p ipn / N ) where p and N are
n=0
integers (≥ 1), for the cases when p ≠ 0 and p = 0. By put-
ting p ≠ k – k ′and p = k + k′ where k, k′ are integers and
k ≠ 0, k′ ≠ 0, show that
N
N −1
2p kn 2p k′n , k = k′,
∑
n=0
cos cos
=
2
N N 0 otherwise
(7.19)
N
N −1
2p kn 2p k′n , k = k′,
∑
n=0
sin sin =2
N N 0 otherwise
(7.20)
N −1
2p kn 2p k′n
∑ sin
n=0
cos
N N
= 0. (7.21)
8
DIFFERENTIAL
EQUATIONS 2
y GS ( t ) Aw cos (w t ) − Bw sin (w t ) .
=
y GS ( 0 )= Aw × 1 − B × 0 = Aw= 0,
hence,
A = 0.
Finally, we substitute A = 0 and B = 1 in the general solution.
This is then the required solution which satisfies the differential
equation and also the extra conditions imposed at t = 0 (it is
often useful to check this). Thus
y= A cos(wt).
Differential Equations 2 • 269
Note that the first step is to obtain the general solution, by any
of the methods that you have learned so far (or will learn in future).
Once you have done this, you can then apply the initial or boundary
conditions.
Key point: When the conditions imposed are values of the solu-
tion and its derivatives for the same value of the independent variable
then these are called initial conditions. This is the case in examples
8.1–8.3 and in Exercise 8.1. The independent variable is often time,
and the conditions are often imposed at t = 0, as in Example 8.1
above. When the conditions are values of the solution for different
values of the independent variable, as in Exercise 8.2, then these are
called boundary conditions.
Let us now look at another example of how to apply initial con-
ditions where we have already previously obtained the general solu-
tion in the exercises in Chapter 6.
yGS(0) = (A × 1) + (B × 1) + 0 + 1 = A + B + 1.
270 • Mathematical Physics
Then,
yGS(0) = A + B + 1 = 1 or A = −B.
This gives
−4A − 3B = 1.
y = −e−4x+ e−3x+ x + 1.
yCF = Aeλx,
Differential Equations 2 • 271
then
y′CF = λAeλx,
y″CF = λ2Aeλx.
λ2Aeλx − Aeλx = 0.
λ2 − 1 = 0,
⇒ λ= ±1.
yPI = Cxex,
y′PI = Cex + Cxex,
y″PI = 2Cex + Cxex.
We can write the general solution as the sum of yCF and yPI:
1 x
yGS =Ae x + Be− x + xe .
2
To apply the initial conditions we also need y′GS so let us derive
that next:
1 x 1 x
′ = Ae x − Be− x +
yGS e + xe .
2 2
We are given that y(0) = 0 and y′ (0) = 1. Substitute x = 0 in yGS,
and then in y′GS:
yGS(0) = (A × 1) + (B × 1) = A + B = 0,
1 1
′ ( 0 ) = ( A × 1 ) − ( B × 1 ) + × 1 = A − B + = 1.
yGS
2 2
Finally we can solve for A and B and substitute them back into
the general solution, yGS. From above we have A + B = 0 and
A − B = 1/2. Adding these together we get
1 1 1
2A = , leading to A = and= B -=
A - .
2 4 4
Finally, substituting into the GS gives
1 x 1 -x 1 x
y= e - e + xe .
4 4 2
Differential Equations 2 • 273
Exercise 8.3 Solve the following for y(x) subject to the specified
conditions. (See Exercise 6.14).
(i) y″ − y = x subject to y(0) = 1, y′(0) = 2,
(ii) y″ + 4y′ + 3y = e−x subject to y(0) = 0, y′(0) = 2,
(iii) y″+ 2y′ + y = xe−x subject to y(0) = 0, y(1) = 1/e.
2
Example 8.4 Solve y¢ + y = 1 subject to y= 0 at x = 1.
x
The equation is first order but not separable, and does not
have constant coefficients, so we will use the integrating factor
method here to find the general solution. We begin by writing
down the integrating factor, R(x).
æ 2 ö
R ( x) = exp ççç ò dx)÷÷÷,
è x ø
2 ln( x) ln( x2 )
= e= e= x2 .
(x2y)′= x2.
We now integrate both sides to obtain our general solution,
yGS(x). We have
x3
x2 y = + C,
3
so that the general solution, which contains one arbitrary con-
stant, C, is
x C
y= + 2 .
3 x
We can now impose the boundary condition: at x = 1,
yGS (1) = 0, so
1 C
GS (1)
y= =
+ 0
3 12
1
and C = - .
and
3
Differential Equations 2 • 275
y 1
Example 8.5 Find the solution of y¢ - = 2 such that
y(x) → 0 as x → ∞. x x
The equation is first order but not separable, and does not have
constant coefficients, so we will again use the integrating factor
method to solve the equation. We begin by writing down the
integrating factor, R(x).
æ -1 ÷ö
R ( x) = exp ççç ò dx÷÷,
è x ø
- ln( x) ln( x-1 ) 1
= e= e= .
x
We then multiply through by the integrating factor
1 y 1
y¢ - 2 = 3 ,
x x x
which can be written as
æ y ö÷¢ 1
çç ÷ = .
çè x ø÷ x3
276 • Mathematical Physics
y = l Ael t ,
y = l 2 Ael t .
Next we substitute into the differential equation and form the
auxiliary equation
λ2Aeλt − Aeλt = 0.
λ2 − 1 = 0, so λ = ±1.
Hence we can write down the general solution, which has two
arbitrary constants.
yGS = Aet + Be−t.
Differential Equations 2 • 277
y = Aeλx,
y′ = λAeλx, and y″ = λ2Aeλx.
λ2Aeλx + Aeλx = 0,
λ2 + 1 = 0,
λ2 = −1 thus λ = ±i.
1 1
A = ( a - ib), B = ( a + ib).
2 2
λ2Aeλx + 4Aeλx = 0,
⇒λ2 = −4,
So λ = ±2i.
We can write the general solution in explicitly real form with a
and b as the arbitrary real constants, as in Example 8.7 above.
yGS = Ae2ix + Be− 2ix,
yGS = a cos(2x) + b sin(2x).
Note that in the examples above we have found that the complex
roots of our auxiliary equations are both of the form ±bi. This has
allowed us to write the general solution yGS directly as a real function
of cos and sin, instead of a function of complex exponentials. In the
next example we will see what happens when the complex roots are
of the form a ± bi.
( )
l = −8 ± 64 − 80 / 2 =−4 ± 2 i.
Key point: If the auxiliary equation has real coefficients but com-
plex roots, the roots will be of the form λ = a ± ib with a and b real. We
can then directly write the general solution of the form yGS = Ae(a+bi)x
+ Be(a−bi)x in terms of real functions as yGS = eax[a cos(bx)+ b sin(bx)].
Here a and b are related to A and B as a = A + B and b = i(A − B).
Here is an example from electric circuit theory.
d2 q dq
L 2
+ R + q C = 0. (8.1)
dt dt
where R, C and L are positive constants and R2 < 4L/C. Find
the charge as a function of time.
For a trial solution q = Aeλt the auxilliary equation is
Lλ2 + Rλ + 1/C = 0.
This is a quadratic equation that can be solved to give
1/ 2
R 1 çæ 2 4 L ö÷
l =- ± çR - ø÷÷ . (8.2)
2 L 2 L çè C
If (R2 − 4L/C) < 0, then the roots are complex numbers, and
we can write this as
1/ 2
R 1 4L
l=
− ± iwc , where wc = − R2
2L 2L C
1/ 2
1 R2
= − 2 .
LC 4 L
(8.3)
Differential Equations 2 • 281
Then the solutions have the form e(-R /2L±iwc)t, and the general
solution is
q(t) = e− Rt/2L (Feiwct + Ge− iwct).
Or, using the relation that eiwct= cos(wct) + i sin(wct), we can
write the equivalent form
q(t) = e− Rt/2L (Acos(wct) + B sin(wct)), (8.4)
where the constants A and B (or F and G) are determined by
the initial values of the charge and the current in the circuit.
5
Exercise 8.7 Find the general solution of y″ + y′ + y = 0.
2
so l = ± i .
282 • Mathematical Physics
z + 2 z + 3 z = 0
We will try zCF = Aeλt hence zCF = l Ael t , and zCF = l 2 Ael t .
We substitute into the differential equations and divide through
by Aeλt to obtain the auxiliary equation
l 2 + 2l + 3 =0,
−2 ± 4 − 12
l= ,
2
−8 4 2
=−1 ± =−1 ± i =−1 ± i 2.
2 2
Then,
( −1+ i 2 ) t ( −1− i 2 ) t
=zCF Ae + Be ,
=zCF e− t Ae i ( 2t
+ Be− i 2t
).
The general solution is
zGS = zCF + zPI.
We found zPI in example 8.13 above, so
= (
zGS e− t Ae i 2t
+ Be− i 2t
) − 12 (1 + i ) e 3 it
.
Differential Equations 2 • 285
{1
}
yPI = Re - (1 + i) e3 ix ,
2
or, using e iq = cos(q ) + i sin(q ) , we have
286 • Mathematical Physics
{1
yPI = Re - (1 + i)( cos(3 x) + i sin(3 x)) ,
2 }
{1
}
= Re - ( cos(3 x) + i sin(3 x) + i cos(3 x) + i 2 sin(3 x)) .
2
But i2 = −1, so
1 1
yPI = - cos(3 x) + sin(3 x) .
2 2
z = Ae4ix,
z′ = 4iAe4ix, and z″ = −16Ae4ix.
2
A = (1 + 8 i).
2
A = (1 - 8 i) .
65
2
Hence zPI = (1 - 8 i) e4 ix .
65
To find the real PI we take the real part of the complex PI,
yPI = Re(zPI):
yPI = Re {
2
65
(1 - 8 i) e4 ix , }
2
hence yPI =
hence
Re{(1 - 8 i}( cos( 4 x) + i sin( 4 x))}
65
2
= ( cos( 4 x) - 8 i 2 sin( 4 x))
65
2
= ( cos( 4 x) + 8 sin( 4 x)) ,
65
2 16
hence yPI =
hence cos( 4 x) + sin( 4 x) .
65 65
288 • Mathematical Physics
yGS = CF + PI
2 16
= e-4 x ( a cos (2 x) + b sin (2 x)) + cos (4 x) + sin (4 x).
65 65
Note that the solution contains both cos(4x) and sin(4x) terms
even though the inhomogeneous term in the differential equation
only has cos(4x). Physically this means that the response of the sys-
tem is out of phase with the driving force, 8 cos(4x). As discussed
above we could find a PI by guessing yPI = c cos(4x) + d sin(4x), with
c and d to be found, but this is usually much more long-winded and
the chances of making an error are higher.
Exercise 8.12 Use the method above to find a PI for each of the
following equations:
(i) y″ + 2y′ + 9y = 2cos(3x);
(ii) y″ + 7y′ + 12y = −150 sin(3x).
Show that the steady state current dq/dt has its maximum
amplitude when w2 = 1/LC. (This value of w is called the resonant
frequency of the circuit.)
(
yCF = e-t a cos( 2t ) + b sin( 2t ) . )
We can use the result of example 8.15,
1 1
yPI = - cos(3 t ) + sin(3 t ) .
2 2
290 • Mathematical Physics
(
y GS = -e-t a cos( 2t ) + b sin( 2t ) )
(
+e-t - 2a sin( 2t ) + 2 b cos( 2t ) )
3 3
+ sin(3 t ) + cos(3 t ) .
2 2
1 1 1
yGS ( 0 ) =e0 ( a cos ( 0 ) + b sin ( 0 ) ) − cos ( 0 ) + sin ( 0 ) =a − =0.
2 2 2
And so a = 1/2. We also have y GS (0) = 1, so
y GS ( 0 ) (
− e0 ( a cos ( 0 ) + b sin ( 0 ) ) + e0 − 2 a sin ( 0 ) + 2 b cos ( 0 )
= )
3 3
+ sin ( 0 ) + cos ( 0 ) =
1,
2 2
3
⇒ − a + 2 b + =1.
2
=y
1 −t
2
e cos ( 2t ) − 12 cos ( 3t ) + 12 sin ( 3t ).
Example 8.18 Find the general solution of the system for u(t),
v(t):
u + v =
0,
v − u =
0.
The system is linear with constant coefficients so we guess
exponential forms for both u and v with A ≠ 0, B ≠ 0.
=
Try u Ae =lt
and v Belt ,
=
then u l=
Aelt and v l Belt .
Substitute and divide through by eλt. This is possible since we
have chosen u and v to have the same exponential dependence.
λA + B = 0,
and λB − A = 0.
Eliminate A and B to get the auxiliary equation for λ, and solve
for λ.
B = A/λ,
λ2A + A = 0.
Since A ≠ 0
λ2 + 1 = 0,
so λ = ±i.
292 • Mathematical Physics
where
1
wL −
d = tan − 1 wC .
R
Hence show that an applied voltage Re(V0eiwt), with V0 real, gives
rise to a current
V0
= I cos (w t − d ) .
2
1
R2 + wL −
wC
294 • Mathematical Physics
x + rx + l=
x 0, r > 0
y″ + 2y′ + 4y = 3 cos(wx)
yPI ( x ) = ∫ G ( x, x′ ) f ( x′ ) dx′
∫ ( cos ( x ) sin ( x′) − sin ( x ) cos ( x′) ) / W ( x′) f ( x′) dx′
x
=
x0
y ( x ) = ∫ G ( x, x′ ) f ( x′ ) dx′
x
x0
Where
( u ( x ) v ( x′ ) − u ( x′ ) v ( x ) ) / ( W ( x′ ) x′ ≤ x
G ( x, x′ ) =
0 x' > x
Revision Notes
8.10. EXERCISES
d 2 y dy
17. Find a particular integral of the equation 2
+ + y=
e2 x.
dx dx
300 • Mathematical Physics
18. F
ind a particular integral of the differential equation
y″+ 7y′ + 4y = 8x2.
19. F
ind the general solution of the differential equation
y″ − y′ + 3y = 0.
20. The general solution of the equation ÿ − 2y + 5y = 0 is
8.11. PROBLEMS
z − 2 z + 2 z = 2 (1 − 5 t ) e2 it .
(iii) y″ − 4y′ + 3y = 0
(iv) y″ − 4y′ + 4y = 0.
12. Find the general solutions of the following differential
equations:
(i) y2y′ − 1 = cos(x)
(ii) x3y′ + x2y = 1
(iii) y″ − 3y′ − 4y = 0
(iv) y″− 6y′ + 9y = 0.
13. F
ind the general solutions of the following differential
equations:
(i) y′ = 3x2y
2x
(ii) cot ( x ) y' − y =
sin ( x )
(iii) y″ −10y′ + 25y = 0
(iv) y″− 7y′ + 12y = 6ex.
14. (i) Find the general solution of the differential equation
y″ + 6y′ + 13y = 0.
y″ − 5y′ + 4y = 8x − 6.
9
MULTIPLE INTEGRALS
Given a function f(x, y) of two variables (x, y), we can obtain the
integral
f ( x, y) dy.
y2
F ( x) = ∫ y1
∫ (∫ )
f ( x, y) dy dx. (9.1)
x2 x2 y2
=I F ( x ) dx
∫=
x1 x1 y1
304 • Mathematical Physics
f ( x, y) dydx (9.2)
x2 y2
I= ∫ ∫x1 y1
with the understanding that equation (9.2) means “do the integrals
in the order indicated in equation (9.1),” i.e. “from the inside, work-
ing out.” We call this a repeated integral (also an iterated integral or
a double integral). Note that the order indicated by the bracketing
in equation (9.1) is one convention. Some authors use an alternative
convention in which the first integral sign is associated with the first
differential and so on in sequence.
1 1
Example 9.1 Evaluate ∫ ∫ −1 0
x2 y dydx .
1
The inner integral is ∫0
x2 y dy . For this inner integral with
respect to y, x is a constant, thus
1
1 y2 x2 1
∫0
= ∫0
= =
2 2 2
x y dy x y dy x .
2 0 2
So
1
1 1 1 x2 x3 1
∫ ∫
−1 0
x2 y=
dydx ∫−1 2
= dx =
−1 3
6
.
1 ∞
Example 9.2 Evaluate ∫∫ 0 a
xe− xy dydx (where a > 0 is a given
constant).
∞
The inner integral is ∫a xe dy . Here x is treated as a constant
− xy
So
x =1
e− ax 1
− ( e− a − 1 ) .
1 ∞ 1
∫∫ ∫0
− xy − ax
xe dydx = e dx =
− a =
0 a
x=0 a
2 1
Exercise 9.1 Evaluate ∫∫
0 0
xy3 dydx .
1 p /2 2
Example 9.3 Evaluate ∫∫ ∫ zx2 sin( y) dzdydx .
0 0 0
= 2 x sin( y) .
2
So now
p /2 2 p /2
∫ ∫ zx2 sin( y) dzdy = ∫ 2 x2 sin( y) dy
0 0 0
p /2
= 2 x2 ∫ sin( y) dy
0
= 2x . 2
And finally
1 p /2 2 1 2
∫∫ ∫ zx2 sin( y) =
dzdydx ∫=
2 x dx 2
.
0 0 0 0 3
1 p /2 1
Exercise 9.2 Find ∫∫ ∫ xy sin( zy) dzdydx .
0 0 0
This is obvious for cases like Example 9.1 because the double
integral is a product of single integrals:
∫ (∫
1
−1
1
0 )
x2 ydy dx = ∫ x2 ( ∫ ydy) dx
1
−1
1
= ( ∫ x dx ) ( ∫ y dy )
1 1
2
−1 0
product
= of single .
integrals
which, reversing the sequence of steps,
= ( ∫ y dy) ( ∫ x dx)
1
0
1
−1
2
= ∫ y( ∫ x dx ) dy
1 1
2
0 −1
= ∫ ( ∫ yx dx ) dy.
1 1
2
0 −1
∫ ∫ f ( x, y) dxdy g( x ) h( y) dxdy
y1 x1 y1 x1
y0 x0
= ∫ ∫ y0 x0
= (∫ x0
x1
g( x ) dx )( ∫ y1
y0 )
h( y) dy .
∫ (∫ )
∞ 1
xe− xy dx dy,
a 0
e− y e− y e− y
∫ y y2
+ dy =
−
y
.
a xk b
12
1
∫1 2 ( 1 + cos( p x) cos( py) ) dx =
12
x + p sin( p x ) cos( p y)
−1 2
2
= 1+ cos( p y) .
p
Now evaluate the outer integral:
1/ 2 2
=
M ∫ 1 + cos( p y) dy
1/ 2 p
1/ 2
2
= y + 2 sin( p y)
p −1/ 2
4
= 1+ .
p2
310 • Mathematical Physics
Cr−a exp(−bνr1/4)dνdr
where C, a and b are positive constants and a > 3/4. If the inner
radius of the disc is r0 and the outer radius is taken to be infi-
nite, write down and evaluate the double integral giving the total
power radiated by the disc over all (positive) frequencies.
=I ∑
= dI ∑ f ( x , y ) dA
k in R
k
k in R
k k k (9.3)
I= ∫∫ f ( x, y) dA (9.4)
R
(This result is known as Fubini’s theorem and holds for almost all
integrals you are likely to encounter.) So we choose coordinates that
make the integrand or the region simple to deal with, and evaluate
the repeated integral.
y Left: A bounded, irregular
(xk, yk)
region in the (x, y) plane, par-
titioned into rectangular cells.
R In order to define the double
δy
integral of a function f(x, y)
over a non-rectangular region
R, we imagine the region parti-
tioned into rectangular cells of
area dA = dx × dy. And for each
δx x cell (numbered k) we compute
the volume over that cell,
Vk = f (xk, yk)dx dy, just as in the case of the rectangular region. We
then sum these small volumes over only those cells that fall entirely
within the region R:
V≈ ∑ f ( x , y ) d xd y.
k in R
k k
=I f ( x, y) dA ∫∫ f ( x, y) dx dy. (9.5)
∫∫=
R R
D, y runs from 0 to 2x. These are the limits for the inner (y)
integral. Thus, the inner integral is
2 y= 2 x
2 y
2x 2x
∫= x y dy x= ∫0 y dy x =
2 2
2 x4 .
0 2
y= 0
Now, to cover D we must let x range between 0 and 1. These
are the required limits for the outer integral, thus
∫ (∫ )
1
1 2x 1 x5 2
= x2 y = ∫0 = =
4
I dy dx 2 x dx 2 5 .
0 0
0 5
Note that in these examples we have drawn the diagram for you.
In general you will have to begin by sketching your own figure (see
Exercise 9.7).
y
z
2
2
y = 2x x=1
y
2
1
1
1
y = 2x
0
0 x
0
0 y=0 1 1 x
(a) (b)
FIGURE 9.1: (a) The region bounded by y = 2x, x = 1 and y = 0. (b) The function z = x2y with
the triangular region indicated on the (x, y) plane.
Multiple Integrals • 313
y
x=0
y
x=0
1 x2+ y2 = 1
1
y = 1–x
0 0
0 1 x 0 1 x
y=0 y=0
(a) (b)
FIGURE 9.2: (a) The region for Exercise 9.5 bounded by y = 1 − x, x = 0 and y = 0. (b) The
region for Exercise 9.6 bounded by x2 + y2 = 1, x = 0 and y = 0.
y
parallel to the x-axis. We see
2
that, within D, x runs from
x = y/2 x=1
x = y/2 to x = 1. For the outer
integral the range y = 0 to y = 2
1 covers the triangle so these are
the limits for the y integration.
0 x
0 y=0 1
I =∫
0
2
(∫y/2
1
)
x2 y dx dy
x =1
2 x3 2 y y3
= ∫
0
y =
3 x= y / 2
dy ∫ 0
1 −
3 8
dy
2
1 y2 y5 1 4 32
= − = −
3 2 8 × 5 0 3 2 8 × 5
1 4 2
= 2 − = .
3 5 5
∫ (∫ )
1 p /2
Example 9.7 Evaluate cosec ( x ) dx dy by
0 sin − 1 ( y )
0
1
∫ ∫
p /2
sin −1
( y)
cosec( x ) dx dy =
∫
0
p/2
( ∫ cosec( x) dy) dx
0
sin( x )
= ∫
0
p /2
( cosec( x) ∫ dy) dx
sin( x )
0
p /2 sin( x )
=∫ cosec( x ) y 0 dx
0
p /2
=∫ cosec( x ) sin( x ) dx
0
p /2 p
= ∫=
0
dx
2
.
316 • Mathematical Physics
Exercise 9.7 Sketch the regions of integration and insert the cor-
rect limits on the right hand sides of the following equalities:
∫ ( ∫ f ( x, y) dx ) dy = ∫ ( ∫ f ( x, y) dy) dx ,
1 y
(i) 0 0
∫ ( ∫ f ( x, y) dy) dx .
∞
∞
f ( x, y) dx dy =
(ii) ∫ ∫
0 y
P(x, y)
y y
dr
r
dA
r rdq
q dq
0
0 x x
(a) (b)
FIGURE 9.3: (a) Polar coordinates (r, q) and Cartesian coordinates (x, y). (b) The area ele-
ment in terms of polar coordinates: dA = rdrdq .
− 1/ 2
y2
Exercise 9.8 What is the function f ( x,=
y) x 1 + 2 in
x
polar coordinates (r, q)?
(i) r = constant
(ii) q = constant?
∫ f dA = ∫∫ f ( x, y) dxdy
we can write
∫ f dA = ∫∫ f ( r , q ) rdrdq .
This is useful if the limits of integration are more readily
expressed in (r, q) coordinates. The temptation to write drdq (instead
of rdrdq) must be resisted; dxdy is an area, but drdq is not – they
cannot be equal.
318 • Mathematical Physics
I= ∫∫ R
xy dxdy
f ( x, y) = xy
= ( r cos( q ) ) ( r sin( q ) )
= r 2 cos( q ) sin( q )
Next transform the element of area:
dxdy = rdqdr
p/ 2 1
I = ∫ ∫ r cos( q ) sin( q ) rdq dr
2
0 0
p/ 2 1
= ∫ sin( q ) d ( sin q ) ∫ r dr 3
0 0
p/ 2 1
1 1
= sin 2 ( q ) r 4
2 0 4 0
1 1 1
= × = .
2 4 8
Exercise 9.10
(i) Find the integral of the function f(x, y) = x2 + y2 over the
disc x2 + y2 ≤ 4.
(ii) A disc of radius 2m centered on the origin has a variable
surface density given by s(r) = 0.1r2 kg m–1. What is the
mass of the disc?
= (∫ −∞
∞
e− x dx
2
)( ∫ ∞
−∞
2
e− y dy )
= (∫ dx ) .
∞ 2
2
e− x
−∞
∞
But there are no elementary methods to evaluate ∫ e− x dx .
2
0
We can, however, transform the problem to polar coordinates.
The problem becomes: evaluate the integral of the function
2
f(r, q) = e− r over the whole plane. This is
2p ∞
∫ ∫
2
I = e− r rdrdq
0 0
= (∫ 0
2p
dq )( ∫ 0
∞ 2
e− r rdr )
∞
1 2
= 2p × − e− r = p
2 0
320 • Mathematical Physics
z ρ dz dV
P
O y y
x dφ
dρ
x ρdφ
φ
(a) (b)
FIGURE 9.4: (a) Cylindrical polar coordinates (r, φ, z) and Cartesian coordinates (x, y, z).
(b) The volume element in terms of cylindrical polar coordinates dV = pdpdφdz.
1
This notation is common in physics, but not universal. When working
on a particular problem, always check the definitions.
Multiple Integrals • 321
z = z.
The inverse relation is
(x + y2 )
1/ 2
r
= 2
,
f = tan −1 ( y / x ) , (9.9)
z = z.
dV = dr × rdφ × dz = pdpdφdz.
∫ f dV = ∫∫∫ f ( x, y, z) dx dy dz
we can write
dV = rdrdφdz,
while the limits of integration are 0 ≤ r ≤ 1, 0 ≤ z ≤ 1, 0 ≤ q ≤ 2p.
Put these together and evaluate the integral and we have that
1
r4
( x2 + y2 ) dV =
1 2p 1
∫ ∫0 ∫0 ∫0 4 [f ]0 [ z]0
2p 1
r 2
× r d r df dz=
V
0
1 p
= × 2p × 1 = .
4 2
Multiple Integrals • 323
dz
dA
r
y
a dφ
x adφ
FIGURE 9.5: The cylindrical element of area for a cylinder of radius r = a: dA = adφdz.
Exercise 9.14
(i) Find the integral of the function f(x, y, z) = xyz over the
wedge defined by 0 ≤ r ≤ 1, 0 ≤ φ ≤ p/2, 0 ≤ z ≤ 2.
(ii) The number per unit volume of bacterial fossils in a
cylindrical wedge of rock, 0 ≤ φ ≤ p/2, of radius 1m and
height 2m is 105 xyz m–3. How many fossils are in the rock?
∫ f dA = ∫∫ f (f , z ) adf dz
the surface integral of f over the cylindrical surface.
324 • Mathematical Physics
dA = 4dφdz.
cos (f )
Exercise 9.15 Find the integral of the function over the
(1 + z )
2
z
z r sin(θ)dφ
P dV
θ
θ r dr
O y dθ
φ y
x
r
dφ
x
(a) (b)
FIGURE 9.6: (a) Spherical polar coordinates (r, φ, q) and Cartesian coordinates (x , y, z).
(b) The volume element in terms of spherical polar coordinates dV = r2 sin(q)drdqdφ.
Multiple Integrals • 325
z = r cos (q ) . (9.10)
(x + y2 + z 2 )
1/ 2
r= 2
,
= (
q cos−1 z / ( x2 + y2 + z2 )
1/ 2
),
f = tan −1 ( y / x ) .
(9.11)
Note carefully (and remember) how the angles q and φ are defined:
this is the standard convention and you need to know it − q is the
co-latitude of P (not the latitude; i.e. it is measured down from the z-axis,
not up from the equator). The angle φ is measured in the (x, y) plane
from the x-axis.
Exercise 9.18
(i) Show that f(x, y, z) = ln(1 + x2 + y2 + z2) is spherically
symmetric.
(ii) Show that a spherically symmetric function is constant on
spheres centered on the origin.
∫ f dV = ∫∫∫ f ( x, y, z) dxdydz.
we can write
f(x, y, z) = z = r cos(q).
Multiple Integrals • 327
Next, write down the limits for the integration: these are given as
0
4 2 0
1 1 p
= × × 2p = .
4 2 4
∫ f dA = ∫∫ f (q ,f ) a sin (q ) dq df ,
2
328 • Mathematical Physics
z a sin(θ)dφ
adθ
dA
dθ
y
a
dφ
x
FIGURE 9.7: The spherical element of area for a sphere of radius r = a.
the surface integral of f over the spherical surface. Note the factor of
sin(q) in the integrand. The area is not a2dqdφ. You can see from the
figure that the same dqdφ near the poles has a smaller area than near
the equator. The factor of sin(q) takes this into account.
z = a cos(q).
dA = a2 sin(q) dqdφ.
0 ≤ q ≤ p/2, 0 ≤ φ ≤ 2p.
Multiple Integrals • 329
2 0 2
Exercise 9.21
(i) Find the integral of f(x, y, z) = 1 − z2 over the surface of the
sphere r = 1.
(ii) The moment of inertia of a uniform sphere of radius a
and unit surface density about an axis through its cen-
ter at the origin is given by the integral of a2 − z2 over
the surface of the sphere. Find the moment of inertia of
such a sphere.
d l cos( i)
dq =
r
where i is the angle between the normal to the element and the line
from O to the element (Figure 9.8b).
330 • Mathematical Physics
i
q P
dl
1 r
q
dq
O O
(a) (b)
FIGURE 9.8: (a) The length of an arc of the unit circle about point O is equal to the angle
(in radians) at O subtended by the arc. (b) A tiny length element dl viewed at a distance r
from point O is inclined by an angle i from the circle of radius r centered on O. The angle is
dq = (dl cos i)/r.
z z
dA dA
P
i
r
O O
y y
x x
(a) (b)
FIGURE 9.9: (a) The area element dA at point P, a distance r from point O, with angle i be-
®
tween the normal to the area element and the line OP. (b) The solid angle subtended by dA
from O can be thought of as the area of the image of dA projected onto a unit sphere from
point O.
dA = r2 sin(q) dqdφ.
Now work out the angle i at P, the angle between the radius
vector and the normal to the surface: here i = 0, because the
normal is along the radius vector, and so cos(i) = 1. Write down
the limits of integration, chosen here to cover the whole sphere:
0 ≤ q ≤ p, 0 ≤ φ ≤ 2p.
2p p r 2 sin (q )
=
W ∫=
dW ∫ ∫
0 0 r2
dq df
q =p 2p q =p
∫ sin (q ) dq ∫ df =
=
q =0
2p − cos (q ) q = 0 =
0
4p.
q P
O
y
x
FIGURE 9.10: The solid angle on a sphere.
z
q=0
A
q = p/4
φ=0 φ = π/4
(a) (b)
FIGURE 9.11: (a) Exercise 9.22. (b) Exercise 9.24.
Multiple Integrals • 333
∫ IA cos (q ) dW
hemisphere
z z
y y
x x
(a) (b)
FIGURE 9.12: (a) Example 9.17: z = x + y . (b) Example 9.18: z = (1 − x − y2)1/2 + 2. In both cases
2 2 2
the functions have axial symmetry (because they depend on x and y only through r2 = x2 + y2),
so we can draw the graph of the function z = f(0, y) on the (y, z) plane, and then rotate about
the z-axis.
Multiple Integrals • 335
Z2 = 1 − x2 − y2, Z > 0.
Revision Notes
After completing this chapter you should be able to
• Compute a given multiple integral
• Insert limits for an integral over a given plane region
• Insert the limits corresponding to a change in the order of
integration over a plane region
• Define cylindrical and spherical polar coordinates
• W
rite down the transformations between Cartesian
coordinates and cylindrical coordinates and spherical
polar coordinates and their inverses
• Write down the elements of area and volume in these
coordinate systems
• Use the appropriate elements of area and volume to
integrate over parts of spheres and cylinders and their
surfaces
• Define and compute a solid angle
• Recognize axial, cylindrical, and spherical symmetry
336 • Mathematical Physics
9.16. EXERCISES
∫ ∫
x2 ydx dy .
1 y
1. Evaluate 0
0
∫ ∫ f ( x, y) dx dy
2 y
1 1
5. S
how that the integral of the function f(x, y) = xy2 over the
region of the (x, y) plane bounded by the lines y = 0, x = 1
and the curve y2 = x is 2/21.
6. Let R be the tetrahedron given by x ≥ 0, y ≥ 0, z ≥ 0 and
x + y + z ≤ 1. Show that
1
∫∫∫ R
y dx dy dz =
24
where the integration is over the volume of the tetrahe-
dron.
7. D
raw a diagram to show the relation between Cartesian
coordinates (x, y, z) and standard spherical polar coor-
dinates (r, q, φ). What does the function f(x, y, z) = (x2 +
y2)1/2/z become in spherical polar coordinates?
Multiple Integrals • 337
8. D
raw a diagram showing the relation of standard spherical
polar coordinates (r, q, φ) to Cartesian coordinates (x, y, z).
Indicate on your diagram the standard cylindrical polar co-
ordinates (r, φ, z) and obtain the relation between (r, φ, z)
and (r, q, φ).
9. F
ind the integral of the function cos2(q) cos(φ/4) over the
surface of the unit sphere centered at the origin.
10. F
ind the integral of cos(2q) over the surface of the unit
sphere. (q is the angle between the radius vector and a
fixed direction.)
11. I f f = (x2+y2)z calculate the integral of f over the solid
cylindrical wedge of unit radius, unit height in x ≥ 0, y ≥ 0,
1 ≥ z ≥ 0.
spherical cap of angle a is the surface of a sphere, center
12. A
O, cut off by a right circular cone of semi-angle a with
vertex at O. Find the solid angle subtended at O by such a
spherical cap. Hence show that a circular disc of angular
diameter 2a at a very large distance (compared with its
radius) subtends a solid angle at a point on its axis of
approximately pa2.
13. F
ind the solid angle subtended by a circular disc of radius
3 cm at a point a distance 4 cm along the axis perpendicu-
lar to the disc through its center.
14. W
rite down an expression for the solid angle subtended at
the origin by an element of surface dS with normal n at a
point with position vector r. Use this expression to estimate
the solid angle subtended by a coin of radius 1.295 cm (or
approximately 5/4 cm) at a distance of 1/2 m.
338 • Mathematical Physics
9.17. PROBLEMS
1. W
rite down the element of volume dV in spherical polar
coordinates (r, q , φ). Find
exp ( ir cos (q ) − r )
I=∫ dV ,
V r
a aa dy dx
W∆ = ∫∫ .
( 1+ x + y2 )
0 0 2 3 2
Multiple Integrals • 339
(c) From (b) deduce an expression for the solid angle, Ωn,
subtended by an n-sided regular polygon of unit area
at a point on its axis unit distance from its center, and
show that this can be written as
1 p px n dy
Wn = 2 n∫ ∫ 32
dx.
0 0
y2
(1 + x )
2 32
1 +
1 + x2
By approximating the inner integral in the expression for
Ωn show that Ωn ≤ ΩD, and hence that a regular polygon
subtends a smaller solid angle than a disc of the same
area.
CHAPTER
10
PARTIAL DERIVATIVES
Let f(x, y) = x2 + xy2. This means that for each x and y (chosen inde-
pendently) there corresponds a value for f given by this expression.
Example 10.1 At (x, y) = (1, 0), f(x, y) = x2 + xy2 has the value
12 + 1 × 02 = 1; at (x, y) = (1, 2), f has the value 12 + 1 × 22 = 5.
342 • Mathematical Physics
Exercise 10.1
(i) What value does the function f(x, y) = x3y + y2 have at
(x, y) = (−1, 3)?
(ii) What value does g(u, v) = u3v + v2 have at (u, v) = (−1, 3)?
(iii) And at (u, v) = (2, 4)?
y
z
(a) (b)
FIGURE 10.1: The function z = (2 + x2 − y2 ) e1− x − y 4. (a) A surface plot. (b) A contour(level
2 2
curves) plot. The axes have been offset from the origin in (a) to avoid cluttering the plot.
y
z
(a) (b)
FIGURE 10.2: The function z = sin(x) sin(y). (a) A surface plot. (b) A contour (level curves)
plot. The axes have been offset from the origin in (a) to avoid cluttering the plot.
df
Example 10.2 If f(x, y) = x3/y what is (i) if y is temporarily
dx
df
kept constant and (ii) if x is temporarily kept constant?
dy
df 3 x2
(i) = (keeping y constant).
dx y
df x3
(ii) = − 2 (keeping x constant).
dy y
344 • Mathematical Physics
z z
y y
x x
plane: y = y0 plane: x = x0
(a) (b)
FIGURE 10.3: The partial derivative as the slope of a cross-section. (a) A function z = f(x, y)
plotted as a surface. A plane of constant y (= y0) is shown. The curve z = f(x, y0) is the
intersection of the surface with this plane. The gradient of the tangent of this curve at some
point P = (x0, y0) is the partial derivative of z with respect to x at P. (b) The same surface, plotted
with a plane of constant x (= x0). The curve z = f(x0, y) is the intersection of the surface with this
plane. The gradient of the tangent of this curve at P is the partial derivative of z with respect
to y at P.
346 • Mathematical Physics
2 2
directional derivative = ( −2 ) + ( −4 ) =−3 2.
2 2
dx 2
∂g
Exercise 10.7 If g(t, u, v, w) = t + u2v3w4, what is ?
∂v
Partial Derivatives • 349
Exercise 10.9 Let f = ex log(x + y). Find fx and fy and verify that
fxy = fyx.
∂f df ∂u
= . (10.3)
∂x du ∂x
where the curly ∂’s remind us that y is being held constant. Other-
wise there is nothing new here. (Notice we do not need the curly ∂’s
for df/du since f can be written as a function of u only.)
∂ 2 d ( r 2 ) ∂r ∂r
=
∂x
( r ) = 2r .
dr ∂x ∂x
∂ 2
∂x
( r ) = 2 x + 0 + 0.
Comparing these two we see that
∂r x
= .
∂x r
∂w 2x
= .
∂x sin ( 2w )
Partial Derivatives • 353
d
dx (∫
a
b
)
f ( x, t ) dt = ∫
a
b ∂f ( x, t )
∂x
dt (10.5)
d 1 sin ( tx )
dx ∫0
Example 10.7 Find dt .
t
Using equation (10.5) we have that
d 1 sin ( tx ) 1 ∂ sin ( tx )
∫
dx 0 t
dt = ∫
0 ∂x
t
dt
1
= ∫ cos ( tx ) dt
0
1 t =1
= sin ( tx ) t = 0
x
sin ( x )
= .
x
354 • Mathematical Physics
d 1 x d 1 1
dx ∫0
t dt = = − .
dx x + 1 ( x + 1 )2
Next we differentiate the left side (with respect to x) using
equation (10.5). (Recall that you learned how to differentiate
ax in Section 2.7.)
d 1 x 1 ∂
∫
dx 0
t dt = ∫
0 ∂x
( t x ) dt
= ∫ t x ( log ( t ) ) dt.
1
Therefore
1
∫ t ( log ( x ) ) dt = − ( x + 1)
1
x
2
.
0
( −1)m m!
∫ t ( log ( t ) ) dt
1 m
= x
= , for m 1,2,.
0
( x + 1 ) m +1
Partial Derivatives • 355
p2 d p2
∫ t sin ( tx ) dt = − cos ( tx ) dt .
dx ∫0
Exercise 10.13 Show that
0
p2 sin ( p x 2 ) p2
Given that ∫ cos ( tx ) dt = t sin ( t ) dt . , find ∫
0 x 0
∞ 1
Exercise 10.14 Given that ∫ 0
e−λt dt =
λ
, use the method of
∞
Example 10.8 to find ∫0
t n e−λt dt , for n = 1,2,. . . .
∞
Hence show that ∫0
x n e− x dx = n! and show that 0! = 1 using this
formula. (Thus, this formula agrees with the usual definition of n!
for n a positive integer, and gives a meaning to cases when n is not
a positive integer.)
The result agrees with the first term of the series expansion for
sin(u) with u = xy, as, of course, it must. You will find it extremely
helpful if you set out your work systematically like this.
Exercise 10.15 Let f(x,y) = 1 + exey and P = (0, 0). Find fx(P),
fy(P), fxx(P), fxy(P), fyy(P), and hence obtain the Taylor expansion of
f about the origin to second order.
Suppose we are given the function f(x, y). We can evaluate f and
its partial derivatives at the point P = (x0, y0). Let
∂f
= ( x0 , y0 ) , f x ( P )
f ( P ) f= ( x0 , y0 ) ,
∂x
and so on. Taylor’s theorem says we can approximate the function
f(x,y) near the point (x0,y0) by a finite number of terms from the fol-
lowing series:
f ( x, y=
) f ( P ) + ( x − x0 ) f x ( P ) + ( y − y0 ) fy ( P )
1
+ ( x − x0 ) f xx ( P ) + 2 ( x − x0 ) ( y − y0 ) f xy ( P ) + ( y − y0 ) f yy ( P )
2 2
2!
+… (10.8)
2!
+ ( y − 1 ) ( − p 2 4 ) + ,
2
and so
1 p2 2
sin ( xy ) =1 − ( x − p 2 ) + p ( x − p 2 )( y − 1 ) + ( y − 1 ) + .
2
2 4
The result is a power series (or a quadratic if we stop after the
second order terms). Near the point (p/2, 1) we can put X = x − p/2
and Y = y − 1 and write the series more succinctly as
1 p p2
sin ( xy ) =
1 − X 2 − XY − Y 2 +.
2 2 8
Exercise 10.16 Let f(x,y) = x3y4 + x4y3. Find the partial deriva-
tives of f up to second order at (1,1) and hence obtain the Taylor
expansion of f about the point (1, 1) to second order.
Partial Derivatives • 359
2!
+2 ( x − x0 )( z − z0 ) f xz ( P ) + ( y − y0 ) f yy ( P )
2
+2 ( y − y0 ) ( z − z0 ) f yz ( P ) + ( z − z0 ) f zz ( P ) + .
2
(10.11)
Extensions to more variables follow a similar pattern, but you
are unlikely to meet them.
To sum up then, if we need to find the behavior of a function
near a given point we can approximate the function by a polyno-
mial, usually a quadratic, which is much simpler than the original
function.
x
FIGURE 10.5: Near a stationary point the function f(x, y) is constant to first order (in all
directions).
minimum
FIGURE 10.6: Three different types of stationary point for a function of two variables.
Maximum (left), minimum (center) and saddle point (right). Near a stationary point the
function f(x, y) is constant to first order. As you can see from the figures, stationary points
are important for the analysis of stability.
362 • Mathematical Physics
So P = (0, 0) is a minimum.
Exercise 10.18
(i) Find and determine the nature of the stationary points of
the function
f(x,y) = x2 − 2x − y3 + y2 + 8y.
If neither condition holds, i.e. if for some (x, y), f(x, y) > f(x0, y0)
and for some (x, y), f(x, y) < f(x0, y0), then
• (x0, y0) is a saddle point (see Figure 10.6).
1 1
f ( x, y ) − f ( P )= ah2 + 2 bhk + ck2 = Q (10.15)
2 2
with Q = ah2 + 2bhk + ck2.
Now, if P is a maximum this means f(x, y) < f(P) near P, and so
Q < 0; or if P is a minimum then f(x, y) > f(P) near P and so Q > 0,
for all values of the displacements h and k. In both these cases Q ≠ 0
which means the quadratic equation
1. A minimum if
fxx(P) > 0, and fyy(P) > 0, and f xx ( P ) f yy ( P ) − f xy2 ( P ) > 0
2. A maximum if
fxx(P) < 0, and fyy(P) < 0, and f xx ( P ) f yy ( P ) − f xy2 ( P ) > 0
3. A saddle point if
f xx ( P ) f yy ( P ) − f xy2 ( P ) < 0 (if fxx(P), fyy(P) have opposite signs)
If the second derivatives, fxx, fyy and fxy, are all zero one would
need to investigate the behavior of the function in more detail to
correctly identify the type of stationary point.
Now consider the converse: given a(x, y) and b(x, y) can we write
adx + bdy = df for some f ? If such an f exists we say that adx + bdy is
an exact differential (it is also called a perfect differential). It is often
useful to know if adx + bdy is an exact differential, because, if it is,
then there exists a curve f(x, y) = constant, such that (a, b) is normal
to the curve at each point. As an example, let a = 3x2y + 1/y and
b = x3 − x/y2; does there exist an f such that
If you are good at this sort of thing you can guess (with the help
of Exercise 10.19) that f = x3y + x/y will do. Can we find a more
mechanical method?
Suppose we can find an f such that
df = adx + bdy.
df = fxdx + fydy,
ay = bx.
Although we shall not prove it, this is in fact a necessary and suf-
ficient condition for the differential to be exact.
Partial Derivatives • 367
But
∂x ∂y
= cos (q ) , and
= sin (q ) ,
∂r ∂r
and so we have that
∂f ∂f ∂f
= cos (q ) + sin (q ) .
∂r ∂x ∂y
You will find that the chain rule is important wherever it is neces-
sary to use different coordinate systems (e.g. Cartesians and polars).
Exercise 10.22 Complete Example 10.17 by finding fq in terms
of fx and fy. Hence obtain expressions for fx and fy in terms of fr
and fq . Deduce that if f = rq then
x x y
( x 2 + y2 ) .
12
q cos (q ) − sin (q ) =cos−1 − , where r =
fx =
r r r
df = fxdx + fydy.
370 • Mathematical Physics
which are exactly the chain rule equations (10.18) and (10.19).
df = (fx)vdx + (fv)xdv.
dv = (vx)ydx + (vy)xdy.
Substituting for dv
df = (fx)ydx + (fy)xdy.
i.e.
∂f ∂f ∂f ∂v
= + .
∂x y ∂x v ∂v x ∂x y
Partial Derivatives • 371
dx dy
=df f x + fy dt,
dt dt
and hence that
df dx dy
= fx + fy .
dt dt dt
Verify that this result can be obtained directly from the chain rule.
because different variables are being held constant on the two sides.)
dx = 1 · dx + 0 · dy,
show that
x u uy
xv = − ,
uy
and that
xu = vy/(uxvy − uyvx).
Revision Notes
10.17. EXERCISES
∂r ∂ 1
4. Let r2 = x2 + y2 + z2; find and ( ) , and show that
∂x ∂x r
∂2 1 1 3 x2
=
− + .
∂x2 r r3 r5
5. Let r2 = x2 + y2 + z2; if f(r) is a function of r only show that
∂f df x ∂ 2 f x2 d 2 f 1 df x3 df
= and = + −
∂x dr r ∂x2 r 2 dr 2 r dr r 3 dr
∂2 f xy d 1 df
and = .
∂x∂y r dr r dr
∂f ∂f
6. Let f(x, y) = xy2 sin(y/x); show that x +y = 3f .
∂x ∂y
7. Let r = (x2 + y2 + z2)1/2; show that
∂ ∂ ∂
( x + y + z )r n = nr n for any n ≠ 0.
dx dy dz
∂f
8. Let r = (x2 + y2 + z2)1/2 and f = 1/r. Find , write down
∂x
∂f ∂f
expressions for and and hence show that f satisfies
∂y ∂z
∂f ∂f ∂f
x +y +z = − f.
∂x ∂y ∂z
∞
∫
2
hence (or otherwise) evaluate x2 e− x dx given that
0
∞ p
∫
2
e− x dx = .
0 2
2
d ∞ e− x t ∞ p
∫ ∫
2
11. Evaluate 2
dx (t > 0)given that e− x dx = .
dt 0 x 0 2
12. Find the Taylor series of f(x, y) = x/y about the point
x = 1, y = 1 up to and including terms of the third order (i.e.
as a cubic polynomial).
13. Given that f(x, y) is stationary at the origin and its second
derivatives are fxx = 2, fyy = −2, fxy = 3, determine the nature
of the stationary point.
14. Write down the conditions for a(x, y)dx+b(x, y)dy to be a
perfect differential, and deduce that
1 x
dx − 2 dy
y y
is a perfect differential.
15. For what value of n is the expression xn[(6x5 + 3y2)dx − 2xydy] a
perfect differential?
2 2 x2 x3 2
16. Is e x y (1 + )dx − e x y dy a perfect differential? (Show
y y
your reasoning.)
17. Write down the condition for p(x, y)dx + q(x, y)dy to be a
perfect differential. Is
2x x2
dx − dy
y + x2 y2 + x 2 y
a perfect differential?
Partial Derivatives • 375
10.18. PROBLEMS
∂f ∂2 f
ting u = use the above results to express in terms
∂x ∂x2
∂ 2 f ∂ 2 f ∂ 2 f ∂f ∂f
of , , 2, and .
∂r ∂r ∂q ∂q ∂r
2
∂q
∂2 f
Obtain the corresponding expression for and show
that ∂y 2
∂ 2 f ∂ 2 f ∂ 2 f 1 ∂f ∂ 2 f
+ = + + .
∂x2 ∂y2 ∂r 2 r ∂r ∂q 2
376 • Mathematical Physics
f(x, y) = x4 − 2x2 + y3 − 3y
2
9. Find the stationary point of the function ln(x) − x/y2 − 2y
and determine its nature.
1
10. Show that f ( x, y ) = 3 x2 + 2 xy + y2 − 2 x + has one station-
2
ary point. Find the position (x0, y0) of the stationary point
and show that it is a minimum. Deduce that f(x, y) > 0 for
all x, y. Show that g(x, y) = exp(a(x + y))f(x, y) has a mini-
mum at (x0, y0) for all values of a. Show further that when a
≠ 0, g(x, y) has a second stationary point and determine its
nature.
CHAPTER
11
PARTIAL DIFFERENTIAL
EQUATIONS
∂y ∂u
yx = × = cos ( u ) × 1 = cos ( x + ct ) .
∂u ∂x
− sin ( x + ct ) , ytt =
yxx = − c2 sin ( x + ct ) .
Therefore
1
ytt = yxx
c2
or
1 ∂2 y ∂2 y
= . (11.1)
c2 ∂t 2 ∂x2
We say that the function y(x, t) = sin(x + ct) satisfies the partial
differential equation (11.1). But this may not be − in fact, is not − the
only solution. To find other solutions of equation (11.1) we must per-
form a reverse process: start from equation (11.1) and find the pos-
sible forms for y(x, t). In practice, solutions are usually required to
satisfy further subsidiary conditions (boundary conditions or initial
conditions, or a combination), as will be explained later.
Equation (11.1) is called the wave equation (in one space dimen-
sion). It describes the propagation, for example, of a wave on a uni-
form string under tension (y is the transverse displacement of the
string from its equilibrium position, x is the distance along the string
and t the time; compare Figure 11.1) or of a sound wave in a uniform
tube (y is the pressure in the tube) etc. The constant c is the speed
of the wave. In the case of the string, the physical interpretation of
this equation is that the transverse acceleration (perpendicular to the
string) of an element of string, ∂2y/∂t2, is a result of the net tension
acting on the element arising from the distortion of the string ∂2y/∂x2.
The following are four more examples of partial differential
equations in physics.
Partial Differential Equations • 381
∂2 f ∂2 f ∂2 f 1 ∂2 f
+ + = . (11.2)
∂ x 2 ∂ y2 ∂ z 2 c 2 ∂ t 2
Laplace’s equation:
∂2 f ∂2 f ∂2 f
+ + =0, (11.3)
∂x2 ∂y2 ∂z2
where f = f(x, y, z) might be, for example, the electrostatic poten-
tial, or the gravitational potential in empty space, or the velocity
potential for an inviscid fluid.
The “diffusion equation”:
∂ 2 y 1 ∂y
= , (11.4)
∂x2 D ∂t
where y = y(x,t), D = constant, describing, for example, the diffusion
of heat in a bar (y is then the temperature).
Schrödinger’s equation:
∂2y ∂y
= −i (11.5)
∂x 2
∂t
for the wave function y of a free particle.
These equations are linear (i.e. the unknown function and its
derivatives occur only linearly) and homogeneous (there is no term
involving only the independent variables and not the unknown func-
tion). They are also of second order (i.e. second derivatives occur but
not higher ones). A method of solution will be presented in section
11.11. However, in Sections 11.2–11.9 we explore an exceptional
feature of the wave equation (11.1): the existence of an explicit form
for the general solution.
x
FIGURE 11.1: Wave on a string at some instant of time t.
Partial Differential Equations • 383
∂2 y 1 ∂2 y
= . (11.6)
∂x2 c2 ∂t 2
Proof: We calculate fxx and ftt and show they satisfy equation (11.6).
Let u = x − ct, so f = f(u(x, t)), i.e. f(x − ct) is a function of a function
u = (x − ct). Differentiate f (u(x, t)) by the chain rule:
∂f df ∂u df
= =
∂x du ∂x du
∂2 f d df ∂u d 2 f
= = .
∂x2 du du ∂x du2
∂f df ∂u df
= = ( −c ) ,
∂t du ∂t du
384 • Mathematical Physics
and so
∂2 f d df ∂u 2
2 d f
= − c = c .
∂t 2 du du ∂t du2
Exercise 11.3 Complete the proof by showing also that g(x + ct)
satisfies equation (11.6).
y(x, t)
x
y(x, t)
FIGURE 11.2: A wave moving to the right. The circles in the top figure indicate the displace-
ment of a point at x = 1 with time t: y(1, t). The circles in the bottom figure indicate the
movement with time of a point of constant phase.
The phase speed, c, of the wave (to the left or right) should not
be confused with the transverse (“up and down”) velocity, ∂y/∂t, of a
point of the string. If the displacement of the string is given by equa-
tion (11.7), the transverse velocity of a point of the string is
∂y ∂
= f ( x − ct ) + g ( x + ct ) =
− cf ′ ( x − ct ) + cg′ ( x + ct ) ,
∂t ∂t
∂2 y ∂ ∂ ∂y ∂ ∂ ∂ ∂ ∂2 y ∂2 y ∂2 y
= + = + + y= 2 +2 + .
∂x2 ∂u ∂v ∂x ∂u ∂v ∂u ∂v ∂u ∂u∂v ∂v2
Partial Differential Equations • 387
∂2 y 2∂ y ∂2 y ∂2 y
2
= c 2 − 2 + .
∂t 2 ∂u ∂u∂v ∂v2
∂2 y ∂2 y
=
4 0,=
or 0.
∂u∂v ∂u∂v
This is now in a form which can be solved. We can write this as
∂ ∂y
= 0.
∂u ∂v
This implies that ∂y/∂v is independent of u, i.e. an (arbitrary)
function of v only:
∂y
= h ( v) ,
∂v
where h is an arbitrary function of v. Integrating this equation gives
y = ∫ h(v) dv + f (u),
where g and f are both arbitrary functions. This is the general solu-
tion – it involves two arbitrary functions. (Compare second order
ordinary differential equations, where the general solution involves
two arbitrary constants.) In terms of x and t we have
Example 11.3 Find the solution of the wave equation for the
displacement of a string y(x, t), given that
∂y
=y ( x,0 ) sin
= ( x ) and ( x,0 ) 0
∂t
i.e. the string is initially at rest.
Starting from the general solution for the displacement, the
general solution is
y(x, t) = f(x − ct) + g(x + ct).(11.8)
To impose the second initial condition we need ∂y/∂t:
∂y
− cf ′ ( x − ct ) + cg′ ( x + ct ) .
= (11.9)
∂t
Now impose the initial conditions on y and ∂y/∂t. We are given
y = sin(x) and ∂y/∂t = 0 initially (the string is at rest), and so
y(x, 0) = f(x) + g(x) = sin(x)(11.10)
and
∂y
( x,0 ) =
− cf ′ ( x ) + cg′ ( x ) =
0. (11.11)
∂t
Thus we have two simultaneous equations for f and g, equations
(11.10) and (11.11). If we now integrate equation (11.11) we
obtain
−f(x) + g(x) = k, k a constant. (11.12)
Partial Differential Equations • 389
For a string released from rest (and only in this case) the solu-
tion can be interpreted graphically as follows: the initial configura-
1
tion y = h(x) (in this case sin(x)) is made up of two waves: 2 h ( x )
moving to the right and 1 h ( x ) moving to the left. These combine to
2
give the solution
1 1
y= h ( x − ct ) + h ( x + ct )
2 2
for a string initially at rest in the configuration y = h(x).
Exercise 11.9
(i) Repeat the working of Example 11.3 to find the solution of
the wave equation for the displacement of a string, y(x, t),
released from rest under the initial conditions
∂y
=y ( x,0 ) e=
− x2
, ( x,0 ) 0.
∂t
(ii) Interpret the solution as was done for Example 11.3 above
(see Figure 11.3).
390 • Mathematical Physics
y(x, t)
FIGURE 11.3: Motion of a string released from rest, with an initially Gaussian displacement
y(x, 0) = exp(−x2). As time increases (from back to front in the figure), the displacement splits
into two oppositely moving parts (Exercise 11.9).
FIGURE 11.4: Displacement of string released from rest with an intially triangular displacement.
The sequence (a)–(d) shows the evolution with increasing time. (a) At t = 0 the two waves overlap
completely. (b) At 0 < t < L/(2c) the two waves are mostly overlapping. (c) At L/(2c) < t < L/c the
two waves are only partially overlapping. (d) At t > L/c the two waves have separated completely.
Partial Differential Equations • 391
a, − L < x < L
y ( x,0 ) = ,
0 otherwise
where L and a are given constants. By using the graphical inter-
pretation above draw a series of figures to illustrate the displace-
ment of the string at times t < L/2c, L/2c < t < L/c, t > L/c, where
c is the wavespeed. (Note that this is an idealized example: the
string is displaced to a height a between two end-points where it
is fixed (so has zero height). But a real string cannot be discon-
tinuous at the points x = ±L. The picture therefore approximates
a string that has been given a constant displacement along almost
the whole of its length 2L.)
2cg(x) = −cos(x).
So
1
f ( x) =
− g ( x ) =cos ( x ) ,
2c
and hence
1 1
y ( x, t ) = f ( x − ct ) + g ( x + ct ) = cos ( x − ct ) − cos ( x + ct ) .
2c 2c
If we use the rule that
A+ B A−B
cos ( A ) − cos ( B) =
−2sin sin
2 2
with A = x − ct and B = x + ct then the cosine terms can be
combined in this solution to give
1
y ( x, t ) = sin ( x ) sin ( ct ) .
c
The initial conditions therefore generate a standing wave
pattern.
Partial Differential Equations • 393
1 x 1
=g ( x) ∫ p ( x′ ) dx′ + q ( x ) ,
2c x0 2
and hence
1 1 x
f ( x)
= q ( x ) − ∫ p ( x′ ) dx′.
2 2 c x0
Finally, substitute for f and g in the general solution and we
have that
1 1 x − ct 1 1 x + ct
y ( x, t =
) q ( x − ct ) − ∫ p ( x′ ) dx′ + q ( x + ct ) + ∫ p ( x′ ) dx′.
2 2 c x0 2 2 c x0
and so
1 1 x + ct
y ( x, =
t) q ( x − ct ) + q ( x + ct ) + ∫ p ( x′ ) dx′. (11.8)
2 2 c x − ct
1 x + ct 1
( x, t )
y= ∫ sin ( x=
′ ) dx′ cos ( x + ct ) − cos ( x − ct ) .
2c x − ct 2c
Now look at these pictures from the point of view of the semi-
infinite string x > 0: the left-moving wave is reflected at the origin,
with a change of sign, into a right-moving wave. Effectively, a wave
coming in from infinity (or at least a long way away) reflects off the
fixed point at the origin (e.g. a wall) and changes its phase by 180°.
Partial Differential Equations • 397
y = A cos[k(x − ct) + f]
(or y = A sin[k(x − ct) + f]) the wave is called simple harmonic. Let
w = ck.(11.19)
Then y = A cos[k(x − ct) + f] = A cos[k(ct − x) − f] can be
written as
y = A cos(wt − kx − f).(11.20)
11.9. BEATS
phase
amplitude
y(x, 0)
x
wavelength
FIGURE 11.7: Beats in the sum of two simple harmonic waves of the same amplitude with
slightly different frequencies. The top curve shows A cos(wt − kx). The second curve shows
A cos(w′t − k′x). The two curves are overlaid to highlight where they are in phase and out of
phase, and the fourth curve shows the sum of the two waves.
∆w ∆k
y( x, t ) = 2 A cos
2
t − x cos w t −kx .
2
( ) (11.22)
If we let
∆w ∆k
A( x, t ) 2 A cos
= t − x ,
2 2
then equation (11.22) is
=y( x, t ) A( x, t ) cos w t −kx .
( (11.23) )
This is therefore a wave (the carrier wave) with frequency w
and speed w k = c , with varying amplitude A(x, t). The amplitude
A(x, t) is itself a wave of large wavelength, 4p/∆k, and low frequency,
∆w/2, the beat frequency. The beat pattern moves with velocity
∆w/∆k = c. (See Figures 11.7 and 11.8.)
FIGURE 11.8: The beat pattern (at t = 0) from the sum of harmonic waves with slightly
different wavenumbers: y(x, t) = Acos(wt − kx) + B cos(w′t − k′x). (a) Waves with the same
amplitudes, A = B = 1, and (k′ − k)/k = 0.1, (b) with same amplitudes, A = B = 1 and (k′ − k)/k = 0.05,
(c) with (k′ − k)/k = 0.1 and A = 0.4 and B = 1.
|A+B|
|A–B|
–|A–B|
–|A+B|
FIGURE 11.9: Beats in the sum of two simple harmonic waves of different amplitudes.
400 • Mathematical Physics
Exercise 11.16
(i) Show that if the frequency w and wavenumber k of waves
1
in a certain medium are related by w = k2 , then the group
velocity is double the wave velocity. 2
(ii) What is the relation between w and k if, instead, waves have
group velocity equal to half the wave velocity?
Partial Differential Equations • 401
dc 1 c
= 1 − ,
dw k vg
and hence that the group velocity is respectively greater than the
wave velocity, equal to it, or smaller, according as the wave veloc-
ity increases with the frequency, is constant (as in the case of a
stretched string), or decreases as the frequency increases.
Waves for which the group and phase velocities are equal are
called non-dispersive. In particular, if a wave is non-dispersive then
its phase speed is constant with frequency (and conversely). The
group speed provides a useful estimate of the modes (of different
frequencies) that transfer the most energy.
0 5 10 15 20 0 5 10 15 20
x x
(a) (b)
0 5 10 15 20 0 5 10 15 20
x x
(c) (d)
FIGURE 11.10: The beat pattern from the sum of harmonic waves with different wavenumbers
and different phase velocities: y1(x, t) = cos(wt − kx) (top curve) and y2(x, t) = cos(w′t − k′x)
(second curve). Here v1 = w/k = 5/5 = 1, while v2 = w′/k′ = 6.6/5.5 = 1.2. (a) At time t = 0 the
two simple waves are in phase, and they are in phase with their sum. (b)–(d) Times t = 2.5, 5
and 7.5 showing the effect of the different phase speeds of the simple waves on the sum. The
solid dots represent points of constant phase on each wave (y1, y2 and the carrier wave of their
sum); these all move with their respective phase velocities. The open circle shows a point of
constant phase on the envelope; this moves with the group velocity.
402 • Mathematical Physics
This solution has the property that y(0, t) = 0 and y(2p, t) = 0 for
all t. If we restrict our attention to the region 0 ≤ x ≤ 2p, this solution
Partial Differential Equations • 403
∂2 y 1 ∂2 y
= .
∂x2 c2 ∂t 2 (11.24)
Take the left side first. We have
∂2 y ∂2 X
= T,
∂x2 ∂x2
since T(t) is constant for partial differentiation with respect to x, and
similarly for the right hand side
∂2 y d 2T
= X .
∂t 2 ∂t 2
404 • Mathematical Physics
1 d2 X 1 d 2T
= .
X dx2 c2 T dt 2 (11.26)
In equation (11.26) we have been able to write our equation
in a very special form: the left side is a function of x only; the right
side is a function of t only, not of x. Thus, a function of x on the left
equals something that is independent of x on the right. The only way
this can happen is if each side of the equation equals a constant. We
have the freedom (at the moment) to choose this constant. For the
applications we shall be making it will turn out that the best choice
is for this constant to be negative. With hindsight we therefore write
this separation constant as −a2. (Writing + a2 for the constant would
lead a to be imaginary; this is not wrong but it is not so convenient.)
Thus equation (11.4) gives
1 d2 X 1 d 2T
=
−a 2
, −a 2c2 .
= (11.27)
X dx2 T dt 2
Thus we have separated out the x and t parts of the equation. The
dependence on x and t is sometimes said to be decoupled. Equations
(11.27) can be solved for X and T; possible solutions are
X = cos(ax) or sin(ax),
and
T = cos(act) or sin(act).
Since each of these four solutions are possible, we make linear
combinations to form a solution:
For the second line we have made use of the fact that
Partial Differential Equations • 405
Exercise 11.20 What are all the separable solutions of the wave
equation obtained by taking +a2 in equation (11.27)? Hence
explain why the choice of −a2 is made in applications involving
the wave equation.
∂2 f ∂2 f
+ =
0,
∂x2 ∂y2
use the method of separation of variables to find all solutions of
the form f(x, y) = X(x)Y(y) which vanish on x = 0 (for all y).
11.12. EIGENVALUES
y(l, t) = Asin(al)sin(act + f) = 0,
a = an = np/l, for n = 1, 2, 3, . . . .
Partial Differential Equations • 407
FIGURE 11.11: The first five eigenfunctions from Example 11.10. Bottom to top: n = 1, 2,
3, 4, 5.
The numbers an2 are the eigenvalues of the problem; the cor-
responding solutions for y are called the eigenfunctions of the prob-
lem. So for the example above the eigenvalues are (np/l)2 and the
eigenfunctions are
Asin(npx/l) sin(npct/l + f),
or, expressed differently,
sin(npx/l)(a cos(npct/l) + b sin(npct/l)).
In the case of the wave equation we also refer to the eigenfunc-
tions as the normal modes of the system and the eigenfrequencies as
the normal frequencies or harmonics.
408 • Mathematical Physics
Exercise 11.25 Show that for a string with the end at x = 0 fixed
and the end at x = l free, the wave equation has separable solu-
tions of the form
y(x, t) = sin(wx/c)(Acos(wt) + Bsin(wt))
Find the transverse velocity of the free end of the string at x = l
corresponding to this solution. If the free end at x = l is driven
with velocity v0 sin(pt) show that the displacement of the string
is given by
v sin( px / c)
y( x, t ) = − 0 cos( pt ).
p sin( pl / c)
from which it can be shown that r′(c′)2 = rc2 = F, say. Let an incident
wave of unit amplitude
yi(x, t) = eiwte−ikx
be incident from the left. (Note that the sign in the exponent is
related to the direction of propagation in the usual way.) A wave
yref(x, t) = Aeiwteikx
x
FIGURE 11.12: Wave incident on a join. In the text we show that w = w′ so this diagram is
often drawn omitting the factors eiwt.
410 • Mathematical Physics
1 + A = B.(11.29)
1
rw 2 a2 c.
2
1 2
A w kF
2
and the transmitted flux is
1 2
B w k ′ F.
2
Partial Differential Equations • 411
reflected flux
=R = A2
incident flux
transmitted flux k′ 2
=T = B.
incident flux k
x
FIGURE 11.13: A matter wave incident from the left on a potential barrier V0. The common
exponential factors e−iwt are often omitted.
412 • Mathematical Physics
2 mE
k= .
(Note the opposite signs in the exponentials in yi relative to
Section 11.13. This is dictated by the interpretation as matter waves.
Solutions of the Schrödinger equation are proportional to e− iEt and
positive momentum means e+ ipx . For waves on strings only the
relative sign between t and kx matters.)
Similarly the reflected wave is yr = Ae−iwte−ikx.
To the right of the origin the Schrödinger equation requires
p2
=
E + V0 ,
2m
or, with p = k′,
2 m(E − V0 )
k´ = .
So the transmitted wave is yt = Be−iwteik′x. Thus in x < 0 the wave-
function is
y1 = e−iwt(eikx + Ae−ikx),
and in x > 0
y2 = Be−iwteik′x.
There are two cases to consider, depending on whether E > V0
(k′ real) or E < V0 (k′ imaginary).
Partial Differential Equations • 413
or
1 + A = B.(11.31)
Note that this holds for all time only if we can cancel the
factors e−iwt, which requires w (or E) to be the same on both
sides of the barrier at x = 0. This corresponds to conservation
of energy. Similarly, the conservation of probability means
that the probability current must be continuous across x = 0,
so ∂y/∂x must be continuous. Thus
∂y 1 (0, t ) ∂y 2 (0, t )
= ,
∂x ∂x
or
ik − ikA = −ik′B.(11.32)
k − k′ 2k
=A = , B .
k + k′ k + k′
The reflection and transmission coefficients are the same as
for the string example (above). Again, we see an extra factor
of k′/k in the transmission region to take account of the dif-
ferent speed of the wave there.
(ii) The large potential step. In this case E < V0 so k′ is imagi-
nary. Thus A and B are complex and the definition of reflec-
tion and transmission coefficients requires a little more care.
The flux of energy is proportional to
E × particle probability density × speed ∝ E|ψ|2k.
414 • Mathematical Physics
2 2 Re(k′)
= =
R A , T B
Re(k )
Revision Notes
11.15. EXERCISES
∂x∂t
∂2 y 1 ∂2 y
5. If y(x,t) = exp[i(kx − wt)] satisfies 2 = 2 2 what is the
∂x c ∂t
relation between w and k?
6. Write down the normal modes of a stretched string with
wavespeed c which is of unit length fixed at its end-points.
Such a string is given a displacement y(x, 0) = 3 sin(2px)
and released from rest. Show that the subsequent displace-
ment of the string is y(x,t) = 3 sin(2px) cos(2pct).
7. A semi-infinite string 0 ≤ x < ∞ is fixed at x = 0. A wave
y = f(x + ct) impinges on x = 0 from the right. What is the
reflected wave?
416 • Mathematical Physics
11.16. PROBLEMS
1. Verify that
1 1 x + ct
y( x,=
t) F( x − ct) + F( x + ct) + ∫ G(s)ds
2 c x − ct
∂2 y 1 ∂2 y
is the solution of the wave equation = satisfying
∂x2 c2 ∂t 2
∂y
the initial conditions y(x, 0) = F(x), ( x,0) = G( x).
∂t
2. An infinite string is at rest along the x-axis. At time t = 0 it is
struck in such a way that its velocity is non-zero only in the
region |x| ≤ L. Show that, after a time L/c, at the point x = 0
the string is again at rest. What parts of the string are at rest
for times t ≥ L/c?
3. An infinite string is released from rest with the initial shape
a(l + x) if − l < x < 0
=y a(l - x) if 0 < x < l
0 otherwise
where a, l are constants. Show that after a time t = l/c the
string at the point x = 0 is again at rest. What parts of the
string are at rest for t > l/c?
4. An infinite string lies along the x-axis except for the region
−l < x < l, where it is deformed from the axis. At t = 0 the
string is released from rest. Show that, whatever the initial
deformation, the string is again at rest at the origin (i.e.
y(0, t) = 0) for t > l/c.
∂2 y 1 ∂2 y
5. Separate variables in the wave equation 2 = 2 2 and
∂x c ∂t
hence find the solution obeying the conditions
px ∂ px
=y( x,0) B=
sin and ( ,0) sin
l ∂t l
Partial Differential Equations • 417
12
FOURIER SERIES
12.1. INTRODUCTION
To see how this works, let us consider a sound wave. The vibra-
tion of a tuning fork produces a sound wave of a given frequency.
If we plot the pressure as a function of distance, x, or time, t, it
looks like a single sine wave, or a “pure tone” (Figure 12.1: left).
When a note is played on a flute, we get a more complex sound
(Figure 12.1: center). The note that we get is made up from the sum
of many pure tones: the fundamental and different harmonics with
420 • Mathematical Physics
FIGURE 12.1: Left: a pure sine wave, sin(wt). Center: example waveform, f(t), from a flute.
Right: the note from the flute is made up of the sum of the fundamental sine wave and a series
of harmonics. In this example, f(t) = sin(wt) + sin(2wt) + 0.2sin(3wt) + 0.4sin(4wt). This is
the Fourier series.
Fourier methods are used very heavily in signal and data analy-
sis. By Fourier analyzing a signal – essentially by expanding it in the
form of equation (12.1) – we can immediately tell which harmon-
ics are the important ones. For example, in the note from the flute
(Figure 12.1), the harmonic at frequency 2w has relatively large
amplitude, while the harmonic at 3w is small. If, for example, a
poorly designed speaker filtered out the harmonic at 2w it would
greatly change the character of the sound, while filtering out the
harmonic at 3w would have a much less discernible effect.
Fourier methods are also commonly used in mathematical phys-
ics. In this chapter, we will focus on using them to solve differen-
tial equations, and the wave equation in particular. We will examine
Fourier half range series and Fourier full range series, study some
applications of Fourier series, then finish by introducing Fourier
transforms and the convolution theorem.
Given any physically reasonable function, f(x), can we find the coef-
ficients, bn, such that equation (12.3) is satisfied? Remarkably, yes!
This is known as Fourier’s theorem.
Equation (12.3) is the Fourier half range sine series of a func-
tion, f(x). This is a very powerful result. It tells us that, within the
range 0 to L, we can write any (physically reasonable) function as a
sum of sine waves.
The Fourier half range sine series coefficients, bn, are given by
2 L np x
bn = ∫ f ( x)sin dx. (12.4)
L 0 L
a non-zero result for the integral only when the wavelengths are the
same and hence when n = m.
Substituting in the result from equation (12.35), we have
∞
mp x L L
∑=
L
∫0
=
f ( x)sin
L
dx b d
2
n =1
n nm bm
2
.
We can now use the results from equation (12.3) and equation
(12.4) to find the Fourier half range sine series for any function, f(x).
2 L np x 2 L np x
=bn =∫ f ( x ) sin dx ∫ sin dx
L 0 L L 0 L
L
2 L np x
= − cos
L np L 0
2 L
= −
L np
( cos ( np ) − 1) .
Fourier Series • 423
=
bn
2
np
(
1 − ( −1 )
n
)
0 if n iseven
(12.5)
= 4
if n isodd.
np
4 np x
1= ∑ np sin
n odd L
, (12.6)
∞
4 ( 2m + 1) p x
1= ∑ sin .
m =0 ( 2 m + 1) p L
4 px 1 3p x 1 5p x
1= sin + sin + sin + .
p L 3 L 5 L
y y
1 1
0 0
0 L x 0 L x
FIGURE 12.2: Left: the function f(x) = 1 in 0 ≤ x < L. Right: the first three partial sums of its
Fourier sine series, dashed line f1 = 4 sin(px/L)/p, dotted line f2 = f1 + 4 sin(3px/L)/(3p), solid
line f3 = f2 + 4sin(5px/L)/(5p). As we add each successive term from the infinite series we get
closer and closer to the exact representation of the function.
Figure 12.3 shows f(x) = x and the first three partial sums of the
Fourier sine series of f(x).
y y
L 1
0 0
0 L x 0 L x
FIGURE 12.3: Left: the function f(x) = x in 0 ≤ x < L. Right: the first three partial sums
of its Fourier sine series, dashed line f1 = 2L sin(px/L)/π, dotted line f2 = f1 – 2L sin(2px/L)/(2p),
solid line f3 = f2 + 2L sin(3px/L)/(3π).
Fourier Series • 425
x L if 0 ≤ x < L 2
g ( x) =
1 − x L if L 2 ≤ x < L.
Hint: the integral for bn can be split into the sum of two parts
b c b
of the form ∫=
f ( x)dx ∫ f ( x)dx + ∫ f ( x)dx where a < c < b. See
a a c
Figure 12.4 for a depiction of f(x) and the first three partial sums
of its Fourier sine series.
y y
0.5 0.5
0 0
0 L x 0 L x
FIGURE 12.4: Left: the function g(x) from exercise 12.2. Right: the first three partial sums of
its Fourier sine series, dashed line f1 = 4sin(px/L)/p2, dotted line f2 = f1 – 4sin(3px/L)/(9p2),
solid line f3 = f2 + 4sin(5px/L)/(25p2).
Armed with results 12.3 and 12.4, we can now find the coeffi-
cients An and Bn given a set of initial conditions.
Let us examine the general case when the string is given an
initial displacement, y(x, 0) = p(x), and an initial velocity, yt(x, 0)
= q(x). By substituting t = 0 into equation (12.8) we immediately
find that
∞
np x
y( =
x,0 ) p=
( x) ∑B
n= 1
n sin
L
.
This looks like the Fourier sine series of p(x) with Fourier series
coefficients Bn. So, to find the coefficients Bn we simply need to
apply the formula in equation (12.4):
2 L np x
p ( x ) sin
L ∫0
Bn = dx . (12.9)
L
We can follow a similar process to find the An. First, find the
transverse velocity of the string:
∂y ∞
np x np c np ct np c np ct
yt ( x, =
t) =
∂t
∑ sin
n =1 L
An
L
cos
L
− Bn
L
sin
L
.
So at t = 0
∞
An np c np x
yt ( x=
,0 ) q=
( x) ∑
n =1 L
sin
L
.
Again, this looks like the Fourier sine series of q(x) with Fourier
series coefficients Annpc/L. So to find Annpc/L we simply need to
apply the formula in equation (12.4):
An np c 2 L np x
= ∫0 q ( x ) sin dx,
L L L
giving
2 L np x
An =
n pc ∫ q ( x ) sin L dx . (12.10)
0
Fourier Series • 427
y y y y
1 1 1 1
0 0 0 0
L x L x L x L x
—1 —1 —1 —1
FIGURE 12.5: Wave from exercise 12.5. From left to right the panels show the displacement
(solid line) and transverse velocity (dashed line) of the string at t = 0, t = L/4c, t = 3L/4c and
t = L/c.
We can use the techniques from Section 12.5 to find the solu-
tions to other differential equations.
As a brief example, let us consider the solution to the Laplace
equation for the electrostatic potential f(x, y) on a metal plate,
Fourier Series • 429
2 L2 np x − np y L (12.12)
4 L2
( ( −1) )
∞
f (=
x, y ) ∑ ( ) n +1 n
−1 + − 1 sin e .
n =1 np n3 p 3 L
L 1 L
0.5
0 0 0
0 L 0 L
FIGURE 12.6: φ(x, y) from equation (12.12). Left: map of the electric potential. Right:
equipotential lines.
430 • Mathematical Physics
From the standard integral (Equation 12.36) all the terms in the
sum on the right of equation (12.16) are zero except for the one with
m = n. Hence, if m = n,
L mp x L
2 mp x L
∫ f (=
0
x ) cos
L
dx m ∫ cos
a=
0 L 2
am ,
The function f(x) = x and the first three partial sums in its Fourier
cosine series are shown in Figure 12.7.
y y
L L
0 0
0 L x 0 L x
FIGURE 12.7: Left: the function f(x) = x in 0 ≤ x < p. Right: the first three partial sums of its
Fourier cosine series, dashed line f1 = p/2 – 4cos(x)/p, dotted line f2 = f1 – 4 cos(3x)/(9p), solid
line f3 = f2 – 4 cos(5x)/(25p). As we add each successive term from the infinite series we get
closer and closer to the exact representation of the function f(x) = x.
432 • Mathematical Physics
p 4 1 1
x=− cos x + cos 3 x + cos 5 x + . . . .
2 p 9 25
This equation is true for any x where 0 ≤ x < p. We can substi-
tute a particular value for x into both the left and right sides and the
equality will still hold. Let us choose, for example, x = 0. Then all the
values cos nx = 1 and so
p 4 1 1
0= − 1 + + + . . .
2 π 9 25
giving
1 1
p 2= 8 1 + + + . . .,
9 25
which is a series expansion for p2.
1 0 ≤ x < L 2
f ( x) =
0 L 2 < x < L.
Expand f(x) as a Fourier cosine series and show that an = 0 if n is
even and that, if n is odd,
2
=
an ( −1)( n−1) / 2 . (12.18)
np
Write down the cosine series for f(x) in 0 ≤ x < L and deduce that
p 1 1 1
=1 − + − .
4 3 5 7
Fourier Series • 433
So far we have examined the sine and cosine Fourier series rep-
resentations of functions within a limited range, 0 ≤ x < L. However,
both sine and cosines repeat periodically. So, if we plot the Fourier
series representations outside of this range we will get functions that
repeat periodically with wavelength 2L.
Outside the given finite range, the Fourier series of f(x) repre-
sents a periodic extension of the function with f(x + 2L) = f(x).
To understand the periodic extension of Fourier series it is impor-
tant to first understand the symmetry of even and odd functions. There
is a short section on even and odd functions in Chapter 3.
y y y
–L 0 L 2L x —L 0 L 2L x —L 0 L 2L x
FIGURE 12.8: Left: Fourier sine or cosine representation of f(x) = x within 0 ≤ x < L. Center:
periodic extension of sine series representation of f(x). Right: periodic extension of cosine
series representation of f(x).
f(x)
f(x)
Cosines and hence the Fourier cosine
series must repeat every 2L. So the
shape of the waveform between –L and
L simply repeats, and we can sketch the
-L 0 L 2L 3L cosine series of f(x) = x3 between –L
and 3L.
x L if 0 ≤ x < L 2
g ( x) =
1 − x L if L 2 ≤ x < L.
Sketch the Fourier sine and Fourier cosine representations of
g(x) in –L ≤ x < 3L.
FIGURE 12.9: Some common waveforms: square wave (left), triangle wave
(center), rectified half wave (right).
The formulae for the Fourier series full range coefficients can
be derived in a similar way to the formulae for the sine and cosine
half range coefficients. We have
1 L
f ( x ) dx
L ∫− L
a0 =
1 L np x
an = ∫ f ( x ) cos dx (12.20)
L − L L
1 L np x
bn = ∫ f ( x ) sin dx.
L − L L
f ( x) = ∑ce
n =−∞
n
inp x L
, (12.21)
where
1 L
cn = ∫ f ( x ) e− inp x L dx. (12.22)
2L − L
Fourier Series • 437
Sometimes this form is more convenient than the sine and cosine
forms.
( a2 2 − b2 ( 2 i ) ) e− i 2 p x L + ( a1 2 − b1 ( 2 i ) ) e− ip x L + a0
f ( x ) =+
+ ( a1 2 + b 1 ( 2 i ) ) e i px L + ( a 2 2 + b 2 ( 2 i ) ) e i 2 px L +
= + c−2 e− i 2 p x L + c−1 e− ip x L + c0 + c1 e ip x L + c2 e i 2 p x L + ,
∑c ∫
L L
∫ f ( x ) e− imp x L dx = n e i( n− m )p x L dx.
−L −L
n = −1
∑ c 2 Ld
L
∫ f ( x ) e− imp x L dx = n nm . (12.23)
−L
n =−∞
1 n=0
cn =
( −1 ) L ( inp ) n ≠ 0.
n +1
−1
( −1)n+1 L ∞
( −1)n+1 L
1+ ∑
f ( x) = e inp x L
+∑ e inp x L .
n =−∞ inp n =1 inp
− L ∞ ( −1 ) i( − n)p x L ( −1 ) inp x L
−n n
f ( x) =
1+ ∑
p n=1 i ( − n )
e +
in
e
− L ∞ ( −1 )
n
=
1+ ∑
p n=1 in
e inp x L − e− inp x L
2 ( −1 ) L
∞ n +1
np x (12.24)
= 1+∑ sin ,
n =1 np L
where, in the last step, we have used the fact that
2 i sin ( np x L ) .
e inp x L − e− inp x L =
Fourier Series • 439
General Properties
a) Except for some pathological functions which do not occur in
physical problems, we can always expand a function, f(x), defined
in a finite interval as a Fourier series which will converge with
sum f(x) at all points at which f(x) is continuous.
b) If f(x) has a discontinuity at x = x0 its Fourier series will con-
verge to the average of the limit from the left and the limit
y from the right. The Fourier series of
a discontinuous function sums to a
value midway along the discontinuity.
f (x0)
1
x0 x f ( x0 ) lim
=
Œ →0 2
( f ( x0 + Œ ) + f ( x0 − Œ ) ) .
c) A Fourier series can be integrated term by term: the resulting
series always converges to ∫ f ( x ) dx .
d) Term by term differentiation of a Fourier series may produce a
divergent series. If the series produced by differentiation does
converge, then it is the Fourier series for f ¢(x).
Convergence of Series
The properties above apply to the full Fourier series containing
an infinite number of terms. In practice we often calculate the sum
of only a finite number of the terms in the series which we use as an
approximation. It is important to know when this is likely to give a
good approximation.
a) If f(x) or its periodic extension has discontinuities we expect an,
bn, cn to be of order 1/n and the convergence is slow.
b) If f(x) or its periodic extension is continuous we expect an, bn, cn
to be of order 1/n2 and the convergence is rapid.
c) The Gibbs phenomenon. At a discontinuity (at x0, say) conver-
gence of a Fourier series is slow and a finite sum of N terms will
persistently under- or over-estimate f(x) near x0. The size of the
440 • Mathematical Physics
=f ( x) ∑
= ce
n= −∞
∑c e
n
inp x L
n= −∞
n
2 inp x l
. (12.25)
Fourier Series • 441
F(k) is called the Fourier transform of f(x). The two functions, F(k)
and f(x), are called a Fourier transform pair; f(x) is the inverse Fou-
rier transform of F(k).
Unfortunately there is no standard definition in the literature
of what constitutes the transform and what constitutes the inverse
transform. The only requirement is that one of equations (12.26)
and (12.27) contains e-ikx and one contains eikx. Similarly there is no
set convention for the constant factors in front of these integrals.
We have used a factor 1/2p on the inverse transform and 1 on the
442 • Mathematical Physics
transform, but you will often see the opposite of this, or sometimes
1 2p is used in front of both. This means that, when reading the
literature, you should be careful to identify the conventions used.
The argument of the Fourier transform, k, has units that are the
reciprocal of the dimensions of the variable x. We will often use x to
denote position, in which case k is the wavenumber. Similarly, if the
variable is a time, t, the transform variable will be a frequency, often
denoted by w. Physicists often talk about using Fourier transforms
to transform from real space to reciprocal or k-space, or from the
time-domain to the frequency-domain.
( x s + iks 2 )
2
x2 s 2 + ikx = + k2s 2 4.
Then
∞ − ( x s + iks 2 )
2
F ( k ) = e− k s ∫
2 2
4
e dx,
−∞ (12.28)
∞
∫
− k2s 2 4 − x′2
=s e e dx′,
−∞
exp ( − x2 ) dx =
∞
the fact that ∫−∞
p, we find that the Fourier
transform of the Gaussian is
F ( k ) = s p e− k s
2 2
4
. (12.29)
12.17. CONVOLUTIONS
∞
( x)
C= exp − ( x − x′ ) g ( x′ ) dx′ . You will often see a similar
∫ −∞
2
FIGURE 12.10: Image processing with convolutions. A sharp image of a circle (left) is con-
volved with a Gaussian response function to give the blurred image (right).
1 1 1
0 0 0
—10 —5 0 5 10 —10 —5 0 5 10 —10 —5 0 5 10
FIGURE 12.11: Example convolution. Left: sharp “input” signal, g(x) = 1, |x| < 3. Center:
Gaussian response function, f(x) = exp(−x2). Right: blurred output signal,
∞
C( x)
= ∫ f ( x − x′ ) g ( x′ ) dx′ .
−∞
C ( k ) = F ( k ) G ( k ) . (12.31)
= F ( k) G ( k).
1 ∞ C ( k ) ikx
g ( x) =
2 π ∫−∞ F ( k )
e dk. (12.32)
Exercise 12.20 Write down the analogous results for the con-
volution integral and convolution theorem, in the time and fre-
quency domain.
∞
Exercise 12.21 If= c ( t ) ∫ f ( t − t ′ ) g ( t ′ ) dt ′ show that C(w) =
−∞
F(w)G(w) where C(w), F(w) and G(w) are the Fourier transforms
of c(t), f(t) and g(t), respectively.
Note how this differs from the Fourier transform: we replace the
complex exponential by a real exponential and integrate only from
0 to ∞. The Laplace transform is useful for finding the solutions of
linear differential equations with constant coefficients, particularly
equations with time as the independent variable, because it readily
allows us to incorporate the initial conditions. It has the disadvan-
tage that the inverse transform is not straightfoward; in particular
∞
f ( t ) ≠ ∫ F (s )e − st ds in general. In practice, one can look up tables
0
of transform pairs.
Fourier Series • 447
x + x =f ( t ) . (12.33)
− x ( 0 ) − sx ( 0 ) + s2 X ( s ) .
=
Revision notes
0 if m= n= 0
L np x mp x (12.35)
∫ sin sin dx = L
L L 2 d mn otherwise
0
L if m= n= 0
L np x mp x
∫0 cos cos dx = L (12.36)
L L 2 d mn otherwise
if m= n= 0
L np x mp x 0
∫ −L
sin
L
sin
L
dx =
Ld mn otherwise
(12.37)
L np x mp x 2 L if m= n= 0
∫−L
cos
L
cos
L
dx =
Ld mn otherwise (12.38)
L np x mp x
∫−L
sin
L
cos
L
dx = 0 (12.39)
L
∫ e i( n− m ) πx L dx = 2 Ld nm . (12.40)
−L
L mp x np x 1 L ( m − n) p x ( n + m)px
∫=
0
sin
L
sin
L
dx ∫
2
0
cos
L
− cos
L
dx
( m − n ) πx
L
1 L
= sin
2 ( m − n) p L 0
450 • Mathematical Physics
( m + n) p x
L
1 L
− sin
2 ( m + n ) p L
0
= 0 if m ≠ n, ( all the sine terms are zero ) .
L mp x np x L np x
∫0
sin
L
sin
L
dx = ∫ sin 2
0 L
1 L 2 np x
=
2 ∫0 1 − cos
L
dx
L
= .
2
i (n − m )p x i (n − m )p x
L
L L
∫− L exp dx =
L i ( n − m ) p
exp
L
− L
L
= e i ( n − m )p − e − i ( n − m )p
i (n − m )
= 0,
L i( n − m)px L
∫−L
exp
L
dx = ∫ dx
−L
= 2 L.
Fourier Series • 451
12.21. EXERCISES
12.22. PROBLEMS
1. Find the Fourier series for f(q) = q2, −p ≤ q < p, and by set-
ting q = 0, show that
p2 ∞
( −1) n +1
=∑ .
12 n=1 n2
2. Expand the function f(x) = x as a half range Fourier cosine
series in the range 0 ≤ x < L.
By considering the series for a suitable value of x, show that
∞
1 p2
∑ (2 n + 1 )
n =0
2
=
8
.
1 1
( x)
4. If f = ( ( x) ( h ( x) − h ( − x)) ,
h ( x ) + h ( − x ) ) and g=
2 2
show that f(x) is even and g(x) is odd.
Derive the following results:
a
0 if h ( x ) isodd
∫− a h ( x ) dx = a
2 ∫0 h ( x ) dx if h ( x ) iseven.
Hint: first split the integral into two regions, a part from
−a to 0 and a part from 0 to a, then change variable from x
to x¢ = −x in the region where x < 0.
Fourier Series • 453
13
INTRODUCTION TO
VECTOR CALCULUS
z
y
1
0
x
–1
–1 0 1 2 3 4
x
(a) (b)
FIGURE 13.1: Two different ways to visualize a scalar field f(x, y). (a) A heat-map or intensity
image, where the shade represents the value of f. (b) A surface plot where z = f(x, y) is the
height of the surface.
4
3
2
y
1
0
−1
−1 0 1 2 3 4
x
FIGURE 13.2: A vector field V(x, y): a vector is defined at each point (x, y). Here, vectors are
drawn as arrows at grid points.
the subscripts indicating that the components are along the x-, y- and
z-axes (not to be confused with shorthand notation for the partial
derivatives of a scalar V(x, y, z). If there is any danger of confusion, the
derivatives would be denoted by V,x etc. with the additional comma.)
A vector field is simply a vector-valued function, i.e. an assign-
ment of a vector (three real numbers) to each point of a region of
space. Examples of vector fields are the electric field in space and
the velocity of fluid in a pipe.
Examples:
•• A = (x, y, z) is a vector field with Ax = x, Ay = y, Az = z at the
point (x, y, z).
•• B = (x3, x2y, y2(z + 1)) is a vector field.
•• V = (0, 0, 2) is a vector field with V equal to this constant
value throughout space.
Example 13.1 Let f(x, y, z) = xz; find ∇f and its value at the
point P with coordinates (3, 2, 17).
According to the definition we have to work out the three
partial derivatives. Recall that z is a constant for differentiation
with respect to x etc.
Introduction to Vector Calculus • 459
Thus:
∂f ∂f ∂f
= z=, 0,= x, (13.5)
∂x ∂y ∂z
and so we have
∇f = (z, 0, x). (13.6)
Substituting the values of x, y and z at P = (3, 2, 17) gives
x y z
∇f = , , .
( x2 + y2 + z2 ) ( x2 + y2 + z2 ) ( x2 + y2 + z2 )1 2
1 2 1 2
(13.10)
We can extract a common factor from each component:
∂f ∂f ∂f
f+ δ x + δy+ δz
∂x ∂y ∂z
f ( x , y, z)
δr
δz
δx
r
δy
∂f
f + δx
∂x ∂f ∂f
f+ δx+ δy
∂x ∂y
FIGURE 13.3: Changes in f(x, y, z). We start at point r and end up at point r + dr by moving
first along the x-axis, then the y-axis, and then the z-axis.
Consider first a function of two variables f(x, y). The level curves
are defined to be the curves f(x, y) = k for the various possible values
of the constant k. They are the curves along which f(x, y) has a level,
i.e. constant, value.
This idea is familiar from meteorology. If f = pressure, level
curves are isobars; or if f = temperature, level curves are iso-
therms. And in geography, if f = height of land, level curves are
contour lines.
The surface f(x, y, z) = k, with k a constant, is said to be a level
surface of the function f(x, y, z). A set of level surfaces is generated
by taking a range of values for k. The notion of level surface captured
by this definition is not the same as the non-technical use of “level” to
mean a “flat” surface – it is the function that is “flat,” not the surface.
Introduction to Vector Calculus • 463
4
3
2
y
1
0
–1
–1 0 1 2 3 4
x
FIGURE 13.4: Level curves of of the function from Figure 13.1.
The vector (0, 0, 1) points along the z-axis, and hence is normal
to the (x, y) plane.
464 • Mathematical Physics
∇f
Note that − is also a unit normal to the surface (but it points
∇f
in the opposite direction).
Introduction to Vector Calculus • 465
4
3
2
1
0
–1
–1 0 1 2 3 4
FIGURE 13.5: Level curves and gradient vectors of of the function from Figure 13.1. Notice
how the gradient vectors are perpendicular to the level curves (contours). Also notice how the
gradients are larger (longer arrows) when the contours are closer together.
We have shown that ∇f/|∇f | and −∇f/|∇f | are unit normals, and
so
1
n= ± ( −2 x, −2 y,1 ) . (13.27)
( 4 x2 + 4 y2 + 1 )1 2
Next, we substitute the values of x, y and z at the given point
(1, 1, 1):
n = (−2/3, −2/3, 1/3) or (2/3, 2/3, −1/3). (13.28)
Since the surface expands outwards from the z-axis
above z = −1, the outward normal points “down,” so is
(2/3, 2/3, −1/3).
Exercise 13.6 Find the outward pointing unit normal to the sur-
face x2 + y2 = 8 at the point (2, 2, 1).
Exercise 13.7 Find the outward pointing unit normal to the sur-
face f(x, y, z) = x2 − y2 at the point (−20, −20, 0).
Introduction to Vector Calculus • 467
∂ ∂ ∂
∇= (
⋅ A i + j + k ⋅ iAx + jAy + kAz )
∂x ∂y ∂z
∂A ∂Ay ∂Az
= x+ + . (13.31)
∂x ∂y ∂z
We have used the results for the dot products of the basis vec-
tors: i · i = 1, i · j = 0, and so on.
468 • Mathematical Physics
rx = x, ry = y, rz = z. (13.32)
∇· r = 1 + 1 + 1 = 3. (13.34)
–4 –2 0 2 4
x
FIGURE 13.7: The vector field r = (x, y, z) shown in the plane z = 0. See Example 13.5.
Introduction to Vector Calculus • 469
∇ · A = 4 × 12 + 02 = 4. (13.38)
or equivalently
∂ ∂ ∂
∇= (
× A i + j + k × iAx + jAy + kAz )
∂x ∂y ∂z
∂A ∂Ay ∂Ax ∂Az ∂Ay ∂Ax
= i z − + j − + k − .
∂y ∂z ∂z ∂x ∂x ∂y
(13.41)
using the usual rules for vector (cross) products (see Section 4.4).
This is the same as equation (13.40).
Introduction to Vector Calculus • 471
∂Ax ∂Az
( ∇ × A )=y − = 3 x − 0= 3 x; (13.44)
∂z ∂x
and finally for the z component:
∂Ay ∂Ax
(∇ × A )z = − = 0 − 0= 0. (13.45)
∂x ∂y
Thus we have that ∇ × A = (−2y, 3x, 0). It is a vector field
so has three components each of which is a function of x, y
and z.
4.0
4
2
3.0
y
y
0
2.0
–2
1.0
–4
1.0 2.0 3.0 4.0 –4 –2 0 2 4
x x
(a) (b)
FIGURE 13.8: Two examples of vector fields plotted with the z-axis suppressed. (a) The vec-
tors always lie along the x-axis with y-and z-components zero. The curl is therefore zero every-
where. If we placed a small paddle wheel in the field it would not be made to rotate. (b) The
vectors change direction as we move around the space; if we placed a paddle wheel anywhere
in the field it would begin to rotate clockwise. For example, at the point (2, 0) there is a stron-
ger “current” on the right than on the left side. This means there is a non-zero curl; at any
point the curl vector here points into the page in agreement with the “right hand screw rule”
(see last paragraph in Section 13.7).
y
x
More generally, if
y 1
x
13 3
= ∫= dz . (13.56)
0 2 2
The field has 3/2 units of flux through Y.
Exercise 13.14 Calculate the flux of the vector field (x3 + 1, yz, 2)
through
(i) the surface X1, in the y-z plane bounded by the lines y = 0,
y = 1, z = 0, z = 1,
(ii) the surface X2, in the plane x = 1 bounded by the lines y = 0,
y = 1, z = 0, z = 1.
(iii) Hence obtain the difference in the fluxes through X2 and X1.
478 • Mathematical Physics
v V
y
u(y2) u
y2
Q R
P S
y1
u(y1) x
x1 x2
FIGURE 13.11: The rectangle PQRS. Above is shown the vector V at a point, and its compo-
nents in the x-and y-directions.
where the minus sign for u arises because the tangential component
of V is positive in the direction QR, opposite to RQ. Similarly, for the
contribution from PS,
(length of PS) × (tangential component of V along PS )
= (x2 − x1) × u(y1).
Vx ( x, y2 , 0 ) dx = − ∫ Vx ( x, y2 , 0 ) dx
x1 x2
∫x2 x1
(13.62)
Vx ( x, y2 ,0 ) dx.
x2
∫ x1
(13.63)
480 • Mathematical Physics
∫ ( − V ( x, y , 0 ) + V ( x, y , 0 ) ) dx
x2
x 2 x 1 (13.64)
x1
∫ ( − Vx ( x, y2 ,0 ) + Vx ( x, y1 ,0 ) ) dx
x2
circulation =
x1
∫ (1 + x ) dx =
− ∫ (1 + x2 ) dx.
0 1
circulation from RQ = 2
1 0
1
Q R
P S x
1
FIGURE 13.12: The square PQRS (see Example 13.12).
Introduction to Vector Calculus • 481
0
=
circulation from QP ∫=
0.y dy
1
0.
∫ (1 + x ) dx.
1
circulation from=
PS 2
(13.66)
0
1
circulation from SR = ∫ y dy.
0
− ∫ (1 + x2 ) dx + ∫ y dy + ∫ (1 + x2 ) dx.
1 1 1
circulation PQRS =
0 0 0
1
1 y2 1
=
circulation PQRS ∫0
= y dy =
2 0 2
.
around the unit square in the (x, y) plane, centered on the origin
with sides parallel to the axes.
482 • Mathematical Physics
13.10. DIVERGENCE THEOREM
(GAUSS’S THEOREM)
b ( x ) dydz b=
∫∫= 2 ( x ) ∫∫ dydz
2 Ab ( x2 ) . (13.68)
X2
X2
The net flux out of the volume is the sum of the net flux out of
each face. As the vector field is along the x-direction, crossing
only the surfaces X1 and X2, these are the only faces with
Introduction to Vector Calculus • 483
=∆flux (∫ z2
z1 ∫
y1
y2
dydz )(∫
x1
x2
∇ ⋅ B dx , )
(13.75)
or
z2 y2 x2
∆flux= ∫ ∫ ∫
z1 y1 x1
∇ ⋅ B dxdydz= ∫
V
∇ ⋅ B dV.
(13.76)
divergence theorem tells us that the net flux through the surround-
ing surface is also zero.
More generally, if B = (b(x, y, z), 0, 0) the definition of the flux
of B through the rectangle X2 gives
∫ b ( x , y, z ) dydz
z2 y2
flux of B through the plane surface X 2 = ∫ 2
z1 y1
(13.77)
and similarly for X1. Now the difference in flux between X2 and X1 is
b ( x2 , y, z ) − b ( x1 , y, z ) dydz.
z2 y2
=∆flux ∫ ∫
z1 y1
(13.78)
Bx ( x2 , y, z ) − Bx ( x1 , y, z ) dydz
z2 y2
( ∆flux ) x
= ∫ ∫
z1 y1
(13.79)
z2 y2 x2 ∂Bx
=∫ ∫ ∫x1 dx dydz.
z1 y1
∂x
Similarly
z2 x2 y2 ∂By
( ∆flux ) y =
∫z ∫x ∫y dydxdz. (13.80)
1 1 1 ∂y
and
y2 x2 z2 ∂Bz
( ∆flux ) z =
∫y ∫x ∫z dzdxdy. (13.81)
1 1 1 ∂z
The flux out of the volume V is the sum of the net flux through
each pair of faces:
= ∫ V
∇⋅ B dV ,
(13.83)
∫ B ⋅ dS= ∫
S V
∇ ⋅ B dV , (18.84)
through the closed surface of the unit cube bounded by the six
planes x = −1, x = 0, y = −1, y = 0, z = −1 and z = 0.
∫ (V ( x , y , 0 ) − V x ( x , y 1 , 0 ) ) dx
x2
= x 2
x1
(V ( x )
, y , 0 ) − V y ( x 1 , y , 0 ) dy .
y2
+∫ y 2
y1
Now
y1 ∂V
Vx ( x, y2 ,0 ) − Vx ( x, y1 ,0 ) =
∫y2 ∂y ( x, y,0 ) dy
x
and
x2 ∂Vy
Vy ( x2 , y,0 ) − Vy ( x1 , y,0 ) =
∫x1 ∂x ( x, y,0 ) dx
and so
x2 y2 ∂V y2 x2 ∂Vy
−∫ ∫
circulation = x
dydx + ∫ ∫ dxdy
x1 y1 ∂y y1 x1 ∂x
x2 y2 ∂Vy ∂Vx
= ∫ ∫
x1 y1
∂x
−
∂y
dxdy.
Making use of the fact that the integrand is the same as the
z−component of the curl ∇ × V (see Equation 13.39) we have
that
Circulation of V round C= ∫ (∇ × V )
S z
dA. (13.85)
∫
C
V ⋅ d=
l ∫ ( ∇ × V ) ⋅ dS.
S
(13.86)
circulation= ∫ (∇ × V )
S z
dS.
(13.87)
3
Provided they are orientable–roughly speaking, that they have two sides
and so one can define a consistent choice of normal vectors to the surface
at every point, unlike a Möbius strip.
Introduction to Vector Calculus • 489
circulation= ∫ (∇ × V )
S z
dS.
∇ × V = (z + 1, 0, y).
(∇ × V)z = y.
So we have
1 1 1 1
circulation=∫
0 ∫ (∇ × V )
0 z
dxdy = ∫
0 ∫
0
y dxdy,
and so
1 1 1 1
circulation= ∫ dx ∫ y dy =1 × = .
0 0 2 2
around the unit square in the (x, y) plane, centered on the origin,
is zero.
Exercise 13.19
(i) The steady drift of a straight section of river can be described
by the velocity field
v = νi. (13.88)
A lobster pot consisting of a cube of side 1m (designed by a
physicist!) is immersed in the water with all its sides parallel
or perpendicular to the flow. Write down the flux of v into
490 • Mathematical Physics
each of the sixsides and hence the net flux into the cube.
Verify this result using Gauss’s divergence theorem.
(ii) The pot is now moved to a section of the river approaching
a weir where the water is speeding up with a constant accel-
eration a, so the velocity field becomes
∂f
( ∇f ) i =
∂xi
∂Ai
∇⋅A =
∂xi
∂Ak
( ∇ × A ) i =Œijk
∂x j
using the notation of Section 4.15. Once one gets the hang of this nota-
tion the derivation of vector identities becomes a mechanical process.
1 ∂y 1 ∂y 1 ∂y
∇y = , , ,
h1 ∂x1 h 2 ∂x2 h 3 ∂x3
1 ∂ 1 ∂ 1 ∂
=∇⋅A ( h2 h 3 A1 ) + ( h1 h 3 A 2 ) + ( h1 h 2 A 3 ) ,
h1 ∂x1 h2 ∂x2 h 3 ∂x3
1 ∂ ( h 3 A 3 ) ∂ ( h 2 A2 ) 1 ∂ ( h1 A1 ) ∂ ( h 3 A 3 )
=∇×A − , − ,
h 2 h 3 ∂x2 ∂x3 h1 h 3 ∂x3 ∂x1
1 ∂ ( h 2 A2 ) ∂ ( h1 A1 )
− .
h1 h 2 ∂x1 ∂x2
492 • Mathematical Physics
Revision Notes
∫ A ⋅ dS= ∫
S V
∇ ⋅ A dV
Introduction to Vector Calculus • 493
∫C
A ⋅ d1 = sum of integrals along each edge of C.
∫
C
A ⋅ d=
l ∫ ( ∇ × A ) ⋅ dS
S
13.14. EXERCISES
1. F
ind the gradient of the function f (x, y, z) = x2 + 2xyz + z2
in the direction parallel to the vector (1, −1, 2) at the point
(0, 2, 3).
2. F
ind expressions for the divergence and the curl of the vec-
tor function
3. C
alculate the flux of the vector field E = (y3, x2y, y2z)
through a rectangular surface with corners at (0, 0, 0),
(0, 0, −2), (0, 1, 0) and (0, 1, −2).
494 • Mathematical Physics
13.15. PROBLEMS
∫ ( A × r ) ⋅ dr =
C
2p A,
∫ div ( r 2 r ) dV ,
V
1
3∫
r ⋅ dS,
A
PREREQUISITES
This appendix contains some basic facts and methods that will be
review for most students, as well as some useful reference material.
Consult it as necessary.
Real Numbers
Real numbers are values that represent a quantity along a line.
Imagine each real number as a point on a line with the negatives to
the left of zero and the positives to the right. There are two types of
points: rational numbers that can be represented as fractions or as
either finite or recurring decimals and irrational numbers that are
not fractions and are represented by an unending and non-repeating
decimal expansion.
There is one point about the decimal expansion of real numbers
that you may need reminding of: the recurring expansion 0.999999
(the bar shows that the 9 repeats forever) is the same as 1.00000,
and similarly for other numbers with recurring 9s. (You can see why,
once you try adding something to 0.999999, such as 0.00001 or
0.0000001. There is nothing you can add that does not give a num-
ber that exceeds 1.)
498 • Mathematical Physics
Functions
A function of a real variable is a rule that assigns a real num-
ber to each permitted value of the variable. For example the square
function x2 assigns the square of x for any real number x. We often
write y = f (x) for a function of x. Here x is the independent variable
and y the dependent variable.
Proofs
You need to be quite clear about the notion of proof in mathemat-
ics. A finite number of verifiable instances of a general result does not
constitute a proof. Take the following example, the expansion of
(1 + x)n = 1 + nx + ... ,
where we want to prove that the coefficient of x in the expansion is
n. It is obviously true for n = 1. For n = 2 we can get the result by
explicitly multiplying out (1 + x)(1 + x) = 1 + 2x + x2. You might see
how to go on to verify the result for n = 3. But we cannot prove that
the result is true in general by enumerating instances, because we
might eventually come across a value of n for which the result does
not hold. For example, take the (incorrect) statement that every
even number, 2n, is followed by a prime number. This is true for
n = 1, 2 and 3 where, 2n = 2,4 and 6, which are followed by 3, 5, and
7, but it is clearly not true for n = 4 since 2n = 8 is followed by 9,
which is not prime.
There are various valid methods of proof in mathematics. One
example is proof by mathematical induction.
Suppose that x > 1; then it follows from the first of the above
statements that x2 > 1.
Suppose x < 1; then it does not follow that x2 < 1 (because according
to the second statement x < 1 is necessary but not sufficient). Another
way of thinking about this is in terms of the direction of implication:
• x > 1 implies x2 > 1 so x > 1 is a sufficient condition (‘‘suffi-
cient’’ to imply the result).
• x < 1 is implied by x2 < 1 so the result x < 1 is a necessary
consequence.
500 • Mathematical Physics
A.3. LOGARITHMS
We define the function y = loga x (‘‘ the logarithm to the base a of x’’)
for x > 0 and a > 0, as the number, y, satisfying the relation x = ay..i.e. the
log of a positive number is the power to which the base must be raised
to get that number.
x1 = a y1 , x2 = a y2 and x1 x2 = a y .
Then
x1 x2 = a y1 a y2 = a y1 + y2 = a y .
The elementary definition of the functions sin, cos, and tan gives
them in terms of the ratios of sides of a right-angled triangle,
o a o
sin( q )
= = , cos( q ) =
and tan( q ) .
h h a
sin( q )
tan( q )
= and sin 2 ( =
q ) + cos2 ( q ) 1.
cos( q )
+ + - +
q q
- - - +
sin cos
- +
q
+ -
tan
FIGURE A.3: The signs of the trigonometric functions in the various quadrants.
tan q
1.0
6
sin q cos q
4
0.5
2
q q
0.0
0
π/2 π 3π/2 2π π/2 π 3π/2 2π
−6 −4 −2
−0.5
−1.0
(a) (b)
FIGURE A.4: (a) The sin and cos functions. (b) The tan function.
Combining Angles
We can combine angles:
sin(a + b) = sin(a) cos(b) + cos(a) sin(b),
cos(a + b) = cos(a) cos(b) − sin(a) sin(b),
and
a + b a −b
sin (a ) + sin ( b ) =
2sin cos ,
2 2
a + b a −b
cos (a ) + cos ( b ) =2 cos cos .
2 2
And also split them up:
sin(2a) = 2 sin(a) cos(a),
cos(2a) = 2cos2(a) −1 = 1 − 2sin2(a).
Related Functions
Here are some related trigonometrical functions and the rela-
tions between them that are important to know:
1 1 1
=sec (a ) = , cosec (a ) = , cot (a ) ,
cos (a ) sin (a ) tan (a )
where
2x + 5
One basic way to divide a polynomial is to guess the form of the
answer and compare coefficients. In this question the answer
must start with an x2 and hence will be of the form ax2 + bx + c +
R(x) where the remainder, R(x), will be of the form d/(2x + 5).
We can therefore write
2 x3 + 9 x2 + 4 x − 20 d
2x + 5
= ( ax 2
+ bx + c ) +
2x + 5
so
2x3 + 9x2 + 4x − 20 = (2x + 5)(ax2 + bx + c) + d
= 2ax3 + (5a + 2b)x2 + (5b + 2c)x + 5c + d.
Now we can equate the coefficients, starting with the highest
power of x. This allows the unknowns to be read off directly:
equating the coefficients of x3 gives a = 1; equating the
coefficients of x2 gives b = (9 − 5 × 1)/2 = 2; then, similarly,
c = (4 − 5 × 2)/2 = −3 and d = (−20 − 5 × (−3)) = −5. Therefore
2 x3 + 9 x2 + 4 x − 20 2 5
= x + 2x − 3 − .
2x + 5 2x + 5
4 3 2
Exercise A.8 Find x + 5 x + 11 x + 13 x + 7 .
2
x + 2x + 3
3
Exercise A.9 Express in partial fractions.
( x + 4 )( x − 1)
=
(x 2
+ 5x + 4) + x
,
x2 + 5 x + 4
x
= 1+ ,
( x + 4 )( x + 1)
4 1
=
1+ − .
3 ( x + 4 ) 3 ( x + 1)
510 • Mathematical Physics
A.7. SERIES
Arithmetic Series
An arithmetic sequence (or arithmetic progression) is a succes-
sion of terms, each of which differs from the previous one by adding
a constant.
For example
1, 3, 5, 7, 9,.. . (A.1)
The sequence may be finite or infinite. If the sequence starts at
a and each term differs from the previous one by a constant d then
the nth term is
an = a + (n − 1)d,
giving a1 = a, a2 = a + d, a3 = a + 2d, and so on. The unevaluated sum
of the terms of a sequence is called a series. For example, the series
obtained from the sequence (A.1) is
1 + 3 + 5 + 7 + 9 + ... . (A.2)
The sum of the first N terms of an arithmetic sequence is
N N
SN=
=
n
n 1=
∑ a= ∑ ( a + ( n − 1) d ).
n 1
Geometric Series
A geometric sequence (or geometric progression) is a succes-
sion of terms in which each term differs from the previous one by a
multiplicative constant. For example
1, (1/2), (1/4), (1/8),(1/16),... . (A.3)
The sequence may be finite or infinite. If the sequence starts at
a and each term differs from the previous one by a constant r then
the nth term is
an = ar n−1
giving a1 = a, a2 = ar, a3 = ar2, and so on. The sum of the terms
is called a geometric series. The series obtained from the sequence
(A.3) is
1 + (1/2) + (1/4) + (1/8) + (1/16) + ... . (A.4)
The sum of the first N terms of a geometric sequence is
N N
=
SN
=n 1=
∑=
an
n 1
∑ ar n −1
.
1−r
. (A.5)
For example, in the case of the series (A.4) we have r = 1/2 and
a = 1. Therefore S = 2. For | r | ≥ 1 the sum Sn either diverges with
N or it oscillates with increasing amplitude.
APPENDIX
B
THE GREEK ALPHABET
alpha a A nu n N
beta b B xi x Ξ
gamma γ Γ omicron o O
delta d ∆ pi p Π
epsilon e E rho r R
zeta z Z sigma s Â
eta h H tau t T
theta q Θ upsilon t ϒ
iota i I phi φ Φ
kappa k K chi c C
lambda l Λ psi y ψ
mu m M omega w Ω
INDEX
Active transformation, 186, 191 Binomial coefficients, 52
Adjoint matrix, 180 Binomial expansion, 55
Algebraic methods, 84 binomial series, 53–54
Algebraic operations, 192 definition, 50–51
Algebraic rules, 111 factorials, 51–53
Analytic fucntion, 261 Boundary conditions, 267–277,
Angular momentum, 139, 141 289–290
Anti-derivatives, 18
Antisymmetric matrices, 160–161 Carrier waves, 399
Arbitrary function, 49, 387 Cartesian coordinate systems,
Arbitrary point, 357–358 320, 324, 455, 491
Arcsin, 11, 70 Cauchy-Riemann relations, 261
Argand diagram, 247–252, Cauchy’s integral formula, 261
255, 262 Chain rule, 65, 228, 350,
Associative law, 158 367–369,
Asymptotes, 79 372, 380, 383, 386, 459.
Asymptotically equivalent, See also “Function of a
94–96 function”
Asymptotic approximations, 103 consistency with, 369–371
Auxiliary equations, 213, 215 Closed form expression, 99
with complex roots, 277–281 Coefficients, 207
with repeated roots, Column vector, 149, 150
215–217 Commute, 157–158
Axially symmetric, 321 Complementary functions, 223,
224, 233, 271, 284
Back-substitution, 154 Complex algebra, basic rule
“Bac minus cab” rule, 134 of, 239
Basis vectors, 115–118 Complex conjugate, 240–241,
Beats, 397–400 283
Bessel functions, 231–232 Complex exponential, 252–255
516 • Mathematical Physics