Advanced Molecular Quantum Mechanics
Advanced Molecular Quantum Mechanics
Advanced Molecular
Quantum Mechanics
An Introduction to Relativistic
Quantum Mechanics and the
Quantum Theory of Radiation -
R. E. Moss
This book contains a lucid account of the
theoretical physics that is necessary for
the appreciation of many of the recent
advances in chemical physics.
Following a summary of vector methods,
classical mechanics and special relativity
are introduced and used to develop
electromagnetic theory. With this back-
ground the Dirac equation for the electron
is considered in detail, this theory
leading naturally to the concept of elec-
tron spin. The extension to many electrons
is then discussed, the end product being a
Hamiltonian containing all the terms of
chemical interest. Finally, the classical
theory of electromagnetic radiation is
quantized and it is shown how quantum
field theory may be used to describe the
interaction of radiation with matter.
The level at which this work is written
should make it particularly suitable for
graduate chemists and chemical physi-
cists. Only the quantum mechanics and
mathematics normally covered in under-
graduate chemistry courses are assumed
and, even so, the pertinent parts are sum-
marized early in the book.
The Author
Richard Moss carried out research in the |
University of Cambridge and at Columbia »
peas }
yon?» > 4
¥
anc?ed
19 ;
oul ort
“ole eae
PARMKCIE INET
£5 00
Digitized by the Internet Archive
in 2023 with funding from
Kahle/Austin Foundation
https://archive.org/details/advancedmolecula0000moss
Ae -
Advanced Molecular Quantum Mechanics
\
ex, BH
ae we €
pApR\,
‘2
NI Veen
; LO.
‘a ap
STUDIES IN CHEMICAL PHYSICS
General Editor
A. D. Buckingham, Professor of Chemistry, University of Cambridge
Series Foreword
The field of science known as ‘Chemical Physics’ has
greatly expanded in recent years. It is an essential part of
both physics and chemistry and now impinges on biology,
crystallography, the science of materials and even on
astronomy. The aim of this series is to present short,
authoritative and readable books on different topics in
chemical physics at a level that is appreciated by the non-
specialist and yet is of prime interest to the expert—in fact,
the type of book that we all welcome and enjoy.
I was grateful to be given the opportunity to help plan
this series, and warmly thank the authors and publishers
whose efforts have brought it into being.
A. D. Buckingham,
University Chemical Laboratory,
Cambridge, U.K.
R. E. MOSS
Lecturer in Chemistry
University of Southampton
Gh
CHAPMAN AND HALL: LONDON
First published 1973
by Chapman and Hall Ltd
11 New Fetter Lane, London EC4P 4EE
© 1973 R. E. Moss
This book is primarily intended for graduate chemists and chemical physicists.
Indeed, it is based on a graduate course that I give in the Chemistry Depart-
ment of Southampton University.
Nowadays undergraduate chemistry courses usually include an introduction
to quantum mechanics with particular reference to molecular properties and
there are a number of excellent textbooks aimed specifically at undergraduate
chemists. In valence theory and molecular spectroscopy physical concepts are
often encountered that are normally taken on trust. For example, electron
spin and the anomalous magnetic moment of the electron are usually accepted
as postulates, although they are well understood by physicists. In addition, the
advent of new techniques has led to experimental situations that can only be
accounted for adequately by relatively sophisticated physical theory. Relativis-
tic corrections to molecular orbital energies are needed to explain X-ray photo-
electron spectra, while the use of lasers can give rise to multiphoton transitions,
which are not easy to understand using the classical theory of radiation. Of
course, the relevant equations may be extracted from the literature, but, if the
underlying physicsisnot understood, this is a practice that is at best dissatisfy-
ing and at worst dangerous. One instance where great care must be taken is in
the use of spectroscopically determined parameters to test the accuracy of elec-
tronic wave functions. The practice of employing classical analogies to add
small terms to the molecular Hamiltonian, as and when they are needed to ex-
plain new phenomena, is no longer satisfactory. It is necessary to start with as
complete a molecular Hamiltonian as possible to make sure that important
contributions to spectroscopic parameters are not omitted.
However, the physics literature that deals with such topics as relativistic
1X
x Preface
Southampton
July, 1972
Contents
Preface page ix
Classical Mechanics 37
Inertial frames and Galileo’s relativity principle eH)
The principle of least action 38
Lagrange’s equations of motion 32
The Lagrangian for a system of particles 41
Constants of motion 43
The Hamiltonian 46
Bibliography 47
Special Relativity 48
Einstein’s principle of relativity 48
The interval 50
The Lorentz transformation 51
Contraction, dilation and paradoxes 54
The transformation of velocities 56
The relativistic mechanics of a free particle 57.
Four-vectors 60
Bibliography 62
Author Index Za
AS
eel
This introductory chapter has two purposes. The first is to provide a brief
résumé of the quantum theory that is assumed to be familiar to the reader
and, in particular, those results that are used later. A deep discussion of the
foundations of quantum mechanics will not be given and in this chapter the
results will be quoted without proof; the justification for this policy is that
there are a number of excellent texts available. Nevertheless, some of the
mathematics employed in this presentation is discussed in more detail later.
For example, an acquaintance with vector and matrix algebra is assumed here,
although Chapter 2 is devoted to this subject.
The other objective of this chapter is to point out the unsatisfactory fea-
tures of non-relativistic quantum mechanics. This theory is based on a number
of postulates, the justification of which is that the results derived from them
are in agreement with experiment. However, in a relativistic theory some of
these postulates are unnecessary and here we are thinking in particular of elec-
tron spin and the phenomena associated with it.
For convenience just one particle is considered here, but the extension to
many-particle systems is readily made. Moreover, it should be noted that the
present section is not intended to be a logical development of quantum theory.
The state of a one-particle system is described by a wave function W(r, f),
which is a function of particle coordinates and time. The probability of the
particle being found in a small volume dz at position r and time ¢ is given by
W(r, t)*(r, f)dr, where the asterisk indicates complex conjugation. Since the
2 Advanced Molecular Quantum Mechanics
Qa, = fUFAW,
dr. (1.3)
This is the expectation value of A in the state m and in the Dirac notation is
written as (n|A|n). In general, expressions of the type f¥mAW,d7 or
(m|A |n) are referred to as matrix elements of the operatorA between states
mand n.
In quantum mechanics the operators are all linear so that the result of
operating on a linear combination of wave functions is:
® = Dc,V,. (1.6)
However, ® is not necessarily itself an eigenfunction of A, but a determination
of the physical property associated with A will always give one of the a,, as
the result; the probability that a particular result a, is obtained being c#,c,.
The expectation value of A in the state represented by ® may still be deter-
mined, but it will be a weighted ayerage of all the eigenvalues a, of A.
A rule is still needed to construct the operator for a particular observable.
It is usual to take the classical expression for the observable and to replace
the position r and linear momentum p by the operators r and p = — ih(0/dr),
respectively, where the symbol / represents ./— 1 and his related to Planck’s
constant A by h = h/2m; the vector operator (0/dr) has components (0/0),
(0/dy) and (0/0z). If both r and p appear in the classical expression then care
must be taken that the resulting operator is Hermitian. In addition, the total
energy E of a system may also be replaced by the operator ih(0/02).
Classically the energy of a system is the sum of its kinetic and potential ener-
gies. For a particle of mass m the former is p?/2m and the latter is a function
of position and time:
B= 2m Vest). (1.7)
On quantization the right-hand side of this equation becomes the Hamiltonian
operator:
HW = in(av/ar); (1.9)
4 Advanced Molecular Quantum Mechanics
Hy = Ey; a
ve = Cnn: (1.13)
and the best approximation of this form to the true lowest eigenvalue of H is
obtained by the variation of the coefficients c,. This leads to a set of coupled
equations:
~ GCA = ES en Gn = 0, (1.14)
where d,,,,, is the matrix element of H between ¢,, and ¢, and S,,, is the
overlap integral f ¢7,¢,d7, which is unity for m =n but is only zero for m#n
if the ¢, are orthogonal to one another.
The other method involves perturbation theory, in which the Hamiltonian
is split into two parts:
I= I dee (1.15)
and that the eigenfunctions W,, of the total Hamiltonian may be written as a
linear combination of them. We then have:
and
Wn = Wet D Woinl(Eo—Eo)+...
mtn
(1.17)
Since A(0) is time independent, the derivative of this equation with respect
to time gives:
dA/dt = (i/h)(HA —AH) = (/h) [H, A], (1.24)
These results are analogous to those for P and lL, except that in this case the
values that the quantum numbers may adopt are j= 0, (WieReey OS pear el
8 Advanced Molecular Quantum Mechanics
The existence of orbital angular momentum and its associated magnetic mo-
ment does not explain all the features of the fine structure of atomic spectra
and, in particular, Zeeman splittings. In non-relativistic quantum mechanics it
is necessary to postulate that in addition the electron possesses an intrinsic
angular momentum or ‘spin’ of $ in units of hf together with a spin magnetic
moment of magnitude eh/2m.
The spin angular momentum corresponds to the case j = $ mentioned at
the end of the previous section and by combining spin and orbital angular mo-
mentum the other half-integer values for 7may be obtained. The spin angular
momentum operator is usually deignasted As and the quantum numbers corre-
sponding to j and m; are s(= ) and m,=+ s The eigenfunctions of the
operators s” and s, may then be written as |s, m,) or just |m,), since s is always
5; in addition, the notation |a) and |6) is often used in place of |+ 4) and
|— 5), respectively.
In what follows it is often more convenient to use the operator o = 2s
instead of s itself. The commutation relations for the components of o are
then:
[Gp ssOylen 21025, .[CjmO,| = 210ea, (Osx Onl eee, lod)
and the way in which these operators act on the spin eigenfunctions |a) and
|B) is given by:
0,|a) = |B); 0,18) = |a);
Oyla) = i|B); oy |B) = —ila); (1°32)
From these relations the effect of operators constructed from the components
of o may readily be determined. In particular, it may be shown that |) and
|8) are both eigenfunctions of the operators 0%, 03 and o2 with the same eigen-
value 1. Thus we may write:
0, = 0 = o= 1, (1.33)
As we know the commutator of two operators A and B is [A,B] =
(AB — BA). The anticommutator of these operators may also be defined; this
Non-relativistic Quantum Mechanics 9
is just (4B + BA) and is written in the abbreviated form [A, B],. The anti-
commutation properties of the components of o may now be found and as an
illustration we evaluate [0,, oy], by using Equations 1.31:
pastes G,1et.o¢locsa,]),
= = sile, o,]. (1.34)
Now, since 0; can be replaced by unity (Equation 1.33), it must commute
with o, and [0,, oy], vanishes. Similarly, the other pairs of components of
may be shown to anticommute so that:
a0 1 0 1 1 0
0». = - Oy = 3 Oz — = (1.36)
j if
These matrices also have the property that the square of each is equal to the
two-by-two unit matrix in agreement with Equation 1.33.
With this formalism the spin functions |a) and |) are written as two-
component column vectors:
1 0
|a) -() Tees |) (1537)
0 1
the complex conjugates of these being the corresponding row vectors. Using
these with Equations 1.36, the relationships of Equations 1.32 may be re-
covered and indeed the elements of the matrices in Equations 1.36 are just the
matrix elements of the corresponding operators between the spin functions.
Electron spin can now be incorporated into the quantum mechanical de-
scription of the electron by using classical analogies to add appropriate spin-
10 Advanced Molecular Quantum Mechanics
dependent terms to the Hamiltonian and using wave functions which are pro-
ducts of spatial and spin functions. The resulting Schrodinger equation can
then be expressed as one equation using the formalism employed in Equations
1.32 or as a pair of coupled equations using the matrix representation of the
spin operators (Equations 1.36) and spin functions (Equations 1.37)
It must be emphasized that although this theory is largely successful it is
based on the postulate of a ‘spinning’ electron with angular momentum fis and
associated magnetic moment — efs/m. The electron spin g factor, which is the
magnitude of the ratio of these two quantities measured in units of e/2m, is
thus 2. This result is contrasted by the orbital g factor of 1 and consequently
the sping factor and the spin magnetic moment are often said to be anomalous.
The reason for this is that, unlike an orbiting electron, it is not possible to con-
struct a satisfactory classical model for a spinning electron.
Any model in which the electron has its mass and charge distributed over a
finite volume explains the existence of both the angular momentum and the
magnetic moment when the electron spins about an axis. However, the g
factor cannot be explained so easily. The simplest model consists of a sphere
with uniform mass and charge densities, but any model in which the mass and
charge are distributed in the same way leads to a spin g factor of 1 just as it
does for the orbital motion of an electron. Of course, if the electron consists
of conducting material the charge might be expected to reside on the surface
of the sphere and this modification of the model leads to a spin g factor of
5/3. Although this is not the required result, it does suggest that models may
be devised with g = 2 and indeed this is so; for illustration we quote just one
such model. In this the electron is a solid cylinder of uniform density and all
the charge resides on the curved surface. The angular momentum is mr?w/2,
where m is the mass, r the radius and w the angular velocity of the cylinder,
and the magnitude of the associated magnetic moment is er?w/2, so that
g = 2 as required.
Nevertheless, this model, and others like it that give a classical g factor of 2,
are rather far-fetched and cannot be taken as a justification for the introduc-
tion of the concept of electron spin. In non-relativistic quantum mechanics
the existence of electron spin angular momentum and its associated magnetic
moment must be regarded as postulates.
All these remarks apply specifically to the electron, but nuclei may be de-
scribed in a similar fashion. They too possess spin angular momenta and mag-
netic moments, although the spin is not restricted to 5 as it is for the electron,
and these properties must also be incorporated in the theory in a phenomeno-
logical manner.
Finally, we might mention a further postulate that is necessary when deal-
Non-relativistic Quantum Mechanics 11
ing with systems of many identical particles. This is the Pauli exclusion
principle, which states that wave functions must be antisymmetric with respect
to the interchange of like particles with half-integer spin and symmetric with
respect to interchange of like particles with zero or integer spin. The former
category of particles, which includes the electron, are called fermions and obey
Fermi-Dirac statistics, while the latter are bosons and are subject to Bose-
Einstein statistics.
The velocity of the electron in the first Bohr orbit of a one-electron atom is
Zac, where Z is the atomic number, a(~ 1/137) is the fine structure constant
and c is the velocity of light. In the hydrogen atom the relativistic corrections
expected are small although they are observable. However, in heavier atoms
these corrections can become quite large for inner electrons despite the shield-
ing of the nuclear charge by other electrons. For this reason alone a relativistic
theory is needed particularly as techniques such as X-ray photoelectron spec-
troscopy are now being used to study the inner electrons in atoms and mole-
cules.
At this point it might be mentioned that we will only be interested in special
relativity. The effects of general relativity, whichis concerned with gravitation,
are negligible in atoms and molecules. This may be seen by considering the
hydrogen atom. In the first Bohr orbit the total energy of the electron is
approximately 2 X 10°'*J, whereas the potential energy due to the gravi-
tational attraction of the electron by the proton is of the order of 2 X 107°7J.
As it happens the incorporation of special relativity in quantum mechanics
gives rise to far greater benefits than might be expected. In particular, it leads
quite naturally to the concept of electron spin, so that this need no longer be
regarded as a postulate. Indeed, both the spin angular momentum and the spin
magnetic moment are predicted by the theory and the spin g factor is found
to be exactly 2. (In fact the spin g factor of the electron differs slightly from
2 and this discrepancy can only be explained by quantum electrodynamics,
which is beyond the scope of this book, although an indication of its origin is
given in Chapter 9).
Morevoer, the spin-dependent terms in the Hamiltonian arise directly from
a relativistic theory and it is not necessary to use dubious classical analogies
to construct them as in the non-relativistic approach. Here we might cite the
spin-orbit interaction, for which classical analogy actually leads to a term
in the Hamiltonian that is twice as large as is needed to explain experimental
observations. It is possible to explain the necessary correction factor classically,
but only if special relativity is included. A section in Chapter 5 is devoted to
V2 Advanced Molecular Quantum Mechanics
this topic, but no problems arise if relativistic quantum mechanics is used from
the outset.
Not only are terms in the Hamiltonian with classical analogues explained
readily in the relativistic theory, but also there are some terms that have no
classical analogues and only appear if special relativity is allowed for.
The theory to be developed in this book applies specifically to particles of
spin 5, and in particular to the electron. However, it turns out that nuclei may
be included adequately in atomic and molecular Hamiltonians by treating them
as electrons but with anomalous masses, charges, spins and magnetic moments.
Finally we note that the Pauli exclusion principle is derivable from relativ-
istic theory, but only by using quantum field theory and as far as electrons are
concerned this is too advanced a topic to be covered in this text, although a
qualitative discussion is given in Chapter 12.
Having made all these assertions about the results that may be obtained by
using relativistic quantum theory, we have now to demonstrate them. However,
before this may be done a certain amount of background theory must be de-
veloped and the next few chapters are devoted to this aim.
Bibliography
Quantum mechanics
*References consisting of the author’s name only are given in full in the Gen-
eral Bibliography in Appendix C.
CHAPTER TWO
It has already been remarked that the reader is assumed to be acquainted with
vectors and matrices, indeed it is unlikely that he will have survived the first
chapter without a rudimentary knowledge of these subjects. However, the
techniques of vector and matrix algebra are used widely in this book and a
summary of these subjects is desirable. This summary could have been rele-
gated to an appendix, but there are a number of sophistications that may well
be unfamiliar, since they do not appear to be covered in elementary mathe-
matical textbooks. The advantages of these sophisticated techniques are that
they provide us with an abbreviated notation and can considerably reduce the
manipulations involved particularly in vector theory.
We start by recalling the elementary theory of three-dimensional vectors.
With this background, the repeated subscript summation convention can be
introduced together with the Kronecker delta, 6;;, and the €;;, notation; it is
the introduction of the latter that leads to the greatest economies. These tech-
niques are used to review the multiplication and differentiation of vectors. The
properties of matrices and determinants are then summarized using the new
notation where appropriate. Finally the possibility of having vectors with four
or more components is mentioned.
13
14 Advanced Molecular Quantum Mechanics
expressed as
A = Ae GA ve, tt 4.67, Cay)
If A denotes the length of vector A, then the scalar product of A with itself
gives A”:
A.A = A2+A}+A?2 = A?. (2.3)
An equivalent way of expressing a scalar product is:
where 6@ is the angle between the two vectors. This angle is clearly unchanged
by a rotation of the axis system, so that the scalar product of two vectors has
the important property of being invariant to rotation. From Equation 2.4 we
note that the scalar product of two perpendicular vectors is zero; two such
vectors are said to be orthogonal. The scalar products of the unit orthogonal
vectors among themselves are thus:
e,. ey e,
C= A, B= A. ay | asl (2.7)
By ¥ z
Any one of the four preceding equations shows immediately that the vector
product of a vector with itself vanishes, while in general the magnitude of the
vector A , B is given by AB sin 0, where again 6 is the angle between A and B.
The vector products of the unit orthogonal vectors among themselves are thus:
ee, = ey.
Equation 2.2, which defines the scalar product of two vectors, may be written
in the alternative form:
A.B = 2 A;B;, (On| 1)
where the sign indicating summation over 7 has been omitted. The question
arises as to when summation over a subscript is intended and when it is not.
This problem may be solved by introducing the convention that, if a subscript
is repeated, that is, if it occurs twice in an expression, then summation over
all the possible values of the subscript is implied; if a subscript appears only
once in the expression, then we do not sum over its possible values. This is the
repeated subscript convention for summation. It will be seen in the examples
in the remainder ofthis chapter that it is not possible for a subscript to appear
more than twice; if it did the meaning of the expression would be ambiguous
16 Advanced Molecular Quantum Mechanics
and in fact a mistake would have been made. Other checks can be applied to
equations, since a subscript that appears once in one expression must appear
once and only once in all the other expressions in the equation. In addition, a
subscript that appears twice in one expression must appear twice or not at all
in the other terms of the equation. All this should become clearer as examples
are met in the rest of this chapter.
This section will be concluded by using the convention to abbreviate some
of the equations that have been encountered already; in doing this a number
of dangers associated with the convention will be met. For example Equation
2.1 may be written as:
A = Ae; (2.13)
and Equation 2.3 appears as:
A.A = A;A;. (2.14)
The first example presents no problems, but in the second the right-hand side
could have been written as A?, which obscures the fact that the subscript / is
repeated; it is wise to leave it in the expanded form used in Equation 2.14.
Finally, it is tempting to rewrite Equation 2.5a as e;.e; = 1, but this is not cor-
rect, since the subscript i is repeated and summation over the possible values
of imust be performed:
We cannot write Equation 2.5a using the general subscript i for x, y and z,
because repeating the subscript i automatically implies summation. However,
Equation 2.5a could be written as:
6; = 1 fori =f;
2.18
Vector and Matrix Algebra 17
However, the great utility of the Kronecker delta in vector algebra will only
be appreciated when one more symbol has been introduced and this is done in
the next section.
The definition of €;;,, which is known as the permutation symbol or the unit
antisymmetric tensor in three dimensions, is introduced immediately; it will
then be seen how it can be used to simplify equations that have been met al-
ready. If ijk is an even permutation of xyz, then €;;, takes on the value one,
while an odd permutation gives €;;, the value minus one:
There is still the possibility that a pair of the subscripts may be identical, in
which case €;;, has the value zero:
It can now be seen that Equation 2.6 for the vector product of two vectors
may be written as:
C; = (Ay B)i = ijn AjBr- (2.24)
18 Advanced Molecular Quantum Mechanics
This is because the subscripts j and k are both repeated on the right-hand side
of the equation so that, for example:
= A,B,—A,By (225)
since interchange of any pair of subscripts in €;;, must change its sign. In ad-
dition, Equations 2.10, which state the result of taking the vector product of
a pair of unit orthogonal vectors, can now be written as one simple equation:
So far we have only met products involving two vectors. The triple scalar
product, which is written as A.(B , C), can be expanded using the new notation:
From its definition cyclic permutation of the subscripts i,j,k does not alter
the value of €;;,, so that cyclic permutation of the vectors A, B and C does
not alter their triple scalar product:
A A (B A C) = €;€;;,A;(B A C),
The combination of two permutation symbols in one expression has not been
encountered before. In addition, the subscript k is repeated in the product
Vector and Matrix Algebra 19
Eijk €kim (= €ij€kim), SO that the sum rules involving the permutation symbols
must be considered before Equation 2.31 may be simplified.
The permutation symbol €;;, has been used to express the vector product of
two vectors in terms of their components (Equation 2.26). This was also ac-
complished in Equation 2.7 using a determinant. It would not be surprising if
€;jk Could be expressed as a determinant and indeed it is possible, one way
being:
This may easily be checked. From the rules relating to the interchange of rows
in determinants, it can be seen that a cyclic permutation of the subscripts i,j,k.
does not alter the value of the determinant, but a non-cyclic one changes its
sign; thus explicit consideration of, say, €,, shows that this definition is con-
sistent with Equation 2.23a. Agreement with Equation 2.23b may be demon-
strated by noting that if any pair of subscripts are identical, then the determi-
nant will be zero, since two rows are identical. (Any reader unfamiliar with
the manipulations of determinants is reminded that these are summarized in
the section on matrices later in this chapter).
Using the rule for multiplying two determinants together (similar to that
for matrices), the general product of two permutation symbols may be express-
ed as one determinant:
aio:
€ijrElmn — | J*
(2.33)
Sp Onn dkn
this result is best obtained by transposing the second determinant before multi-
plication. Although each subscript appears three times in theabove determinant,
20 Advanced Molecular Quantum Mechanics
expansion leads to a sum of six terms, each of which contains a particular sub-
script once and only once, so there is no inconsistency with the repeated sub-
script convention.
However, if the product €;;,€jmn is under consideration, summation over
i is implied and we have:
EijrEimn — 5 ji 5; 5jn
= 8ji(5jm5rn a 6jnOkm)
Before using this result on the vector triple product, the problem that initi-
ated this discussion of sum rules, two other sum rules will be established. The
first arises when there are two pairs of identical subscripts in the product of
two permutation symbols, for example €;;,€;;,. Equation 2.35 gives immedi-
ately:
€ije€iin = 54j5kn — SjnOrj
38kn — Skn
= 25pn. (2.36)
Finally, for three pairs of identical subscripts:
Returning to the problem of the triple vector product in Equation 2.31, use
of Equations 2.35, 2.19 and 2.12 gives:
A (By) ©) = ei€pij€e1mAjBiCn
= €(5;15jm — 5im5j1)AjBiCn
eA ;(B;C; — B;C;)
(A.C)B — (A.B)C. (2.38)
Vector and Matrix Algebra 21
2.6 Examples I
(a) Show that the quadruple vector product (A , B).(C , D) may be expanded
as:
(A , B).(C , D) = (A.C)(B.D) — (A.D)(B.C). (2.39)
Solution:
(A ,B)(C,D) = €;jxAj;Br€imnCmDn
= (6j)m5kn — SjndemJAjBrCmDn
= A;B,CD, — AjB,C,D;
(b) Express as one equation the commutation relations in Equations 1.31 for
the components of the spin vectore:
Solution:
[0;, 0] = 2i€;;,,0, (2.41)
or multiplying by S€iin€n> expanding the commutator and using Equations
2.26 and 2.36:
ojo = 2ie. (2.42)
(c) Prove the following identity for any two vectors, A and B, that commute
with the spin vector o, but not necessarily with each other:
Solution:
and
[9;, oj]. = 26;;. (2.22)
To prove Equation 2.43 we note that, since o commutes with A and B, the
term (o.A)(o.B) is identical to the first term on the right-hand side of Equa-
tion 2.39, provided C and D are both set equal too. Equation 2.39 cannot be
used directly, since the order of vector components was not preserved in its
proof, but we can start with the third line of Equation 2.40:
So far two types of quantity have been encountered, scalars and vectors. Both
scalars and vectors can be functions of time and space coordinates or other
variables. (These two types of functions are usually called scalar fields and
vector fields. An example often given for a scalar field is the temperature,
since this can vary in space and time, but is represented by a scalar quantity
at each point, while an example of a vector field is heat flow, which needs a
vector at each pointin space and time to specify it.) Situations frequently arise
where it is necessary to differentiate a scalar or a vector. Differentiation with
respect to time presents no problems and will not be considered here, but
differentiation with respect to spatial coordinates leads to the vector operator
V.
Let us consider a scalar function $(x, y, z). For infinitesimal increases in
x, y and z the corresponding change in @ is given by:
while the first vector consists of the scalar being operated on by a vector
operator with components ((0/0x), (0/dy), (/dz)). This operator is called
‘del’ and has the symbol V:
ll e,.0/dx
V = + e,d/dy + e,0/dz
The gradient of a scalar function $(x, y, z) is a vector and from Equation 2.45
may be written:
Vo = e,(0¢/dr;) = e;Vi¢. (2.46)
For a small change dr in the position vector r of the point (x,y, z) the corre-
sponding change in ¢ is, from Equation 2.44:
dé = (V¢).dr. (2.47)
To see what V¢ means physically we note that, if k is a constant, the equation
6(x,y,z) = k represents a surface. If dr is taken to be tangential to the surface,
that is the change in position dr takes us to a new point in the same surface,
then dd = 0 and
(V¢).dr = 0. (2.48)
Since dr is tangential to the surface and can be in any arbitrary direction in the
surface, Vd must be perpendicular to the surface. Then again, if dr were not
tangential to the surface, but were to take us to some new surface, d= k’,
Equation 2.47 indicates that the increase in ¢ will be greatest when dr is parallel
to V@, that is perpendicular to the surface. Thus V¢ represents in magnitude
and direction, the maximum rate of increase of the scalar function ¢.
Suppose Equation 2.47 were to be integrated between two points 1 and 2
along some line joining the points. That is for each infinitesimal element dr on
24 Advanced Molecular Quantum Mechanics
the line, we calculate (V¢).dr and then add the contributions from all the ele-
ments of the line. The result would be:
The divergence of the vector A has already been defined as the scalar V.A; in
terms of components:
Let us consider the way in which electric charge (or heat etc.) flows out of a
volume element dt= dxdydz centred on the point x,y, z (Fig. 2.1). If the
Face 2 —— Face |
Fig. 2.1. The volume element dxdydz used to demonstrate the significance of
the divergence of a vector.
The combined charge loss rate through the two faces is thus (0/,./0x)dxdydz.
The charge flow across the other faces of the volume element may be found
in the same way giving for the total charge loss rate from the volume element:
V.j+(dp/ar) = 0 (2.55)
expresses mathematically the law of conservation of charge, that is charge can
be neither created nor destroyed and any charge lost by a volume element
must go through the surface of thé element.
The rate of loss of charge can also be thought of as an integral over the
surface of the volume element. An infinitesimal surface element may be re-
presented by ds, which is a vector normal to the surface element and with
magnitude equal to the area of the element. Thus the total loss rate from the
volume element is f j.ds, so that from Equation 2.54:
VA = €;€;jnVjAr- (2.58)
Its physical significance is not as easy to see as in the case of the gradient or
the divergence, but consideration of an area element is instructive. For con-
venience it is assumed to be parallel to the xy plane (Fig. 2.2), but generaliz-
ation to other orientations is possible. The circulation of the vector A round
Fig. 2.2. The surface element dxdy used to demonstrate the significance of the
curl of a vector.
the boundary of the surface element is the line integral round this closed loop
of the component A along the line element, that is $ A.dr. This integral may
be broken down into contributions from the sides of the area element, in the
same way that the contributions from each face of a volume element were
considered when the divergence of a vector was discussed. If the integration
round the boundary of the area element is carried out in the direction of the
arrows in the figure, the contribution from side | is:
Ax,
y+3dy, z)dx = — [A, + 300A, /dy)dy]dx. (2.60)
The combined contribution from these two sides is — (0A,./dy)dxdy. The
sides 2 and 4 may be treated in the same manner to give for the total line inte-
gral around the boundary of the area element:
the left-hand side is a line integral round a closed loop while the right-hand
side is a surface integral over any surface bounded by the loop. Equation 2.63
is known as Stokes’ theorem.
2.11 Examples II
In this section are given a number of useful relationships involving the vector
differentiation of products; these are also listed in Appendix B for convenience.
If A and B are vectors and @¢ is a scalar, then:
There are six possible ways in which V can operate twice on a scalar @ or a
vector A: V.(V¢); (V.V)A; V(V.A); V , (Vo); V.(V , A); V A(V , A). There
are no meaningful combinations of the symbols, that are not included in this
list.
The first case, V.(V¢), may also be written without ambiguity as (V.V)¢ or
even V*¢ and the scalar operator V’ expanded in terms of components:
V = V,V; = (02/dx?) + (07/dy”) + (02/827); (2.69)
it occurs frequently in physics and is known as the Laplacian. Since it is a
scalar, the Laplacian can also operate on a vector and an example of this possi-
bility appears second in the original list; however, this possibility is of little
interest as is the third possibility, V(V.A).
The next two cases both vanish as can be seen by rewriting them in alterna-
tive but equivalent ways:
and
V (Vd) = (VAV)o=0 (2.70)
Ve(Vi un Ge = (2.71)
in each case the operator (V , V) vanishes. These two identities have an inter-
esting use. Suppose that the curl of some vector B vanishes, that is (V , B) = 0.
Then Equation 2.70 tells us that, if B is expressed as the gradient of an appro-
priate scalar, the condition that its curl is zero is automatically satisfied. Simi-
larly, from Equation 2.71 it can be seen that, if a divergenceless vector is
written as the curl of another vector, its divergence is guaranteed to be zero.
Finally, we note that the remaining possibility, V , (V , A), may be expand-
ed using Equation 2.68 provided it is remembered that V and A do not com-
mute and their order must be preserved:
ae Pe ia ae ee a Pe IE (2-73)
Va = Viri = 5; = 3. Cas)
On the other hand the curl of r vanishes:
= — f/r’. (2.78)
We could proceed in the same way to evaluate V°(1/r), which will also be met
later. From Equation 2.78 this is:
(Vir) — rr)
76, TA ote
Tr)
= 3p ope 0! (2.79)
This suggests that V*(1/r) vanishes everywhere, but is this true when r = 0 and
(1/r) is infinite? Equation 2.78 shows that V(1/r), which is expected to vary
roughly as 7”, also becomes infinite at r = 0 and thus poses no problems. But
30 Advanced Molecular Quantum Mechanics
so that 5(x) is zero everywhere except at the origin where it is infinite in such
a way that the area under it is unity.
Thus 5(x) has very strange properties compared with the mathematical
functions usually encountered. However, it can be related to the usual type of
functions in the following way. The function:
gets spikier at x = 0, and in the limit when n becomes infinite the function
has the same properties as the Dirac delta function 6(x) in that it is zero every-
where except at x = 0, where it is infinite in such a way that the area under it
is unity and Equations 2.82 and 2.83 are satisfied. Thus an alternative defi-
nition of 5(x) is:
Other limits may be used to represent 5(x) and one of these will be encoun-
tered in Chapter 13.
n=\6
ZI
Fig. 2.3. A plot of y= (n/7)2 exp (— nx?) forn = 1, 4 and 16.
\ x
It should be noted that because of its definition the Dirac delta function
can only be used in integrals as in Equation 2.82; however, in this book the
delta function usually arises in quantum mechanical operators, the matrix ele-
ments of which involve integration. Consequently, an operator expression such
as x6(x) may be replaced by zero:
xd(x) = 0, (2.87)
since Equation 2.82 tells us that an integration involving it will vanish. In ad-
dition, Equation 2.82 may be used to show that:
V6 = —pleo. (2.90)
In the special case of a point charge q at the origin, the scalar potential at a
point ris given by:
@ = q/4neor. (2.91)
p = qd(r) (2.93)
is an appropriate description. Substitution into Equation 2.92 gives Equation
2.80, which is the desired result.
In a later chapter the expression V,(7;/r°) will be met. If i#/, reference to
Equation 2.79 shows that it may be expressed as:
since 6;; = 3 (Equation 2.21) and by symmetry the contributions from each
of the three termsin the summation on the left-hand side of Equation 2.95 are
equal.
Vector and Matrix Algebra 33
At =A. (2.102)
34 Advanced Molecular Quantum Mechanics
A square matrix A has an inverse if there exists a matrix A”! such that:
Aves As (2.104)
on the other hand A is unitary if its inverse is the same as its adjoint:
ASSAY (2.105)
As for the transpose the inverse of the product of two square matrices is the
product of the inverses taken in reverse order:
It is easy to show that the trace of the product of two matrices 77(AB) is un-
changed by reversing the order of multiplication of the matrices:
X2
x= : 5) (2:143)
i |
\Kai
x= Ay, (2.114)
and we recall that the inverse of a matrix is found by finding for each element
A;; its cofactor, which is (— 1)'* times the determinant obtained by removing
row 7 and column /, transposing the resulting matrix and dividing each element
by the determinant of A. That is
(A AvAl (2.115)
where Ajj is the cofactor of element A;,;. If the determinant of the matrix A
vanishes, A is said to be singular and can possess no inverse.
If y = Ax were substituted into Equation 2.112, it would read:
(A—2XI)x = 0, (2.116)
which is the general form of the eigenvalue equation found in quantum mech-
anics; the non-trivial solutions of this set of n equations consists of n values of
36 Advanced Molecular Quantum Mechanics
the eigenvalue ) each with its corresponding eigenvector x. The possible eigen-
values are found by solving the polynomial equation:
iA—AI| = 0 (2.117)
and the eigenvectors x, corresponding to each eigenvalue A, are determined
by back-substitution into Equation 2.116 followed by solution of the result-
ing simultaneous equations. The eigenvectors x, are column vectors, which can
be written side by side as a square matrix X. This matrix X is such that:
Most of this chapter has been devoted to three-dimensional vectors, each com-
ponent of which corresponds to one of the three spatial dimensions. In the
last section it was seen that these vectors can be thought of as 3 X 1 matrices
and be manipulated accordingly. However, a 3 X 1 matrix is just a special case
of an n X 1 matrix, and such properties as scalar and vector products can be
generalized from three to many dimensions, although it is hard to see the sig-
nificance of say a vector product in n dimensions. But is there any point in
doing this? It is merely noted here that when special relativity is considered in
Chapter 4, it will be convenient to think of time as a fourth dimension, and
that the theory may be developed in terms of four-dimensional vectors or four-
vectors. Further discussion of this topic is postponed till then.
Bibliography
Classical Mechanics
In this book it is assumed that the motion of electrons and nuclei is indepen-
dent of their dimensions, that is, they may be regarded as point masses or
particles. The justification for this assumption is that electrons and nuclei are
very small compared with the atoms and molecules in which they are found;
the radii of electrons, nuclei and atoms are of the order of 10°7'S, 10°7!4 and
10°'° m, respectively. The position of an electron or a nucleus at a particular
time can thus be specified by its vector position r, that is by three cartesian
coordinates. In general a system of N particles requires 3N coordinates to
specify its instantaneous configuration and the system is said to have 3N de-
grees of freedom.
Although a system can be characterized instantaneously by 3N cartesian
coordinates, other coordinates may be used. Any set of 3N coordinates q,,
(n=1,...,3N), that completely specify the system, is acceptable, and these
are called generalized coordinates; for example, one of these generalized co-
ordinates could be the separation between two particles. However, to be able
to measure coordinates at all, we must have a frame of reference, that is a sys-
tem of coordinate axes and, since the evolution of a system with time is of
interest, a clock fixed in this frame of reference.
57
38 Advanced Molecular Quantum Mechanics
There are an infinite number of possible frames of reference, but the only
ones of interest in classical mechanics are those in which space is homogeneous
and isotropic and time is homogeneous, in agreement with experience. These
frames are called inertial frames and in particular a freely moving particle (one
that experiences no external forces) has a constant velocity in such a frame.
There are still an arbitrarily large number of inertial frames, but these move
uniformly with respect to one another.
Now experiment shows that the laws of physics are the same no matter
what inertial frame is used; this is Galileo’s principle of relativity. Thus, the
same phenomena will be described by similar equations in two different inertial
frames. Suppose that the position of a particle is r in a frame K and r’ in
another frame K’, that is moving uniformly with a velocity v relative to the
first frame K. If the two frames are coincident at time ¢ = 0, then rr andr’ are
related by the Galilean transformation:
r=r+vt. (3.1)
It will be noted that time is here assumed to be the same in the two frames,
t=t'. It is this assumption that time is absolute that distinguishes a classical
theory from a relativistic one; special relativity will be considered in the next
chapter, and for the remainder of this chapter it will be assumed that time is
indeed independent of the inertial frame used. With this assumption an equa-
tion expressing a physical law must be invariant to the transformation (Equa-
tion 3.1); that is, the equation must have the same form in different inertial
frames.
takes the least possible value. Thus, the action is a line integral along the path
taken by the system between f, and f, and the path actually followed is such
that the action is least. (Strictly speaking this is only true for an infinitesimally
short path and the condition for a finite path is that the action should have a
stationary value; however, it will be seen that in deriving the equations of mo-
tion from the principle this point is of no consequence.)
It is not easy to visualize the physical meaning of the least action principle.
However, there are other minimum principles in physics, the most transparent
being Fermat’s principle of least time, which states that light travels between
two points in such a way that the time taken is a minimum. Use of this princi-
ple leads immediately to the rectilinear propagation of light in a uniform
medium, the laws of reflection at a surface and the laws of refraction for light
crossing the boundary between two different media. For detailed discussions
of this principle the reader is referred to the bibliography at the end of the
chapter.
q,Odt
5S = fio 6Lq,
ae fe RM d (aL
SS (®) 7: a (e)-2(] 6q dt-= 0. (3.5)
Now the first term in this equation vanishes, since g is required to have the
fixed values gq and qy at times t, and f,, so that 5g(t,) and 5q(tp) are both
{Sl Fae ee
ee aS ES ee
Biswsn |e The path taken by a one-coordinate system between (qq, t,) and
(4p, ty); q(t) is the path along which the action is assumed to be least and q(t)
+ 6q(t) is a slightly displaced path.
zero. Thus, the integral in Equation 3.5 must vanish and this must be true for
any arbitrary infinitesimal change in path 5q. The only way that this can hap-
pen is for the integrand to vanish also:
d(ac\ af _ 4
dt\aqg) aq or
Classical Mechanics 41
H
dt \ag,
a OE
E gts
FYE bes
ps
(ead 2s, SV): (3.7)
These equations are known as Lagrange’s equations of motion (or Euler’s equa-
tions in the general context of the calculus of variations, which is the mathe-
matical technique that has been used here). These partial differential equations
may be solved for the particular system under consideration to give the g,, ex-
plicitly as functions of time; in general, there will be 6N arbitrary constants,
which may be fixed by initial conditions.
Of course, the form of the Lagrangian must be known before the equations
of motion may be solved. Before considering this in the next section, it is
noted that the equations of motion are unaltered by multiplication of the
Lagrangian by an arbitrary constant; however, the same constant must be used
for all Lagrangians, if isolated systems are to be brought together and treated
as one with a combined Lagrangian. In addition, a Lagrangian is undetermined
to the extent that addition of a total time derivative of a function f(@n,t)of
coordinates and time will not affect the equations of motion. This may be
seen by noting that addition of (df/dt) to the Lagrangian adds to the action
integral (Equation 3.2) a term:
since the g,, are assumed fixed at t, and f,. Consequently such a term can have
no influence in the variation procedure used in developing the equations of
motion.
It has already been stated that the principle of least action is consistent with
Newton’s laws and indeed they may be derived from the principle via Lagrange’s
equations of motion. The derivation given here is not completely rigorous, but
the consistency of the two approaches should be apparent. It is first necessary
to obtain an appropriate Lagrangian and this is done for a free particle before
_considering a collection of particles.
The Lagrangian of a free particle in an inertial frame cannot depend on the
vector position r of the particle, since this would imply that space was not
42 Advanced Molecular Quantum Mechanics
L OL
d Ds =) Ole asa Ss CONStANt. (3.9)
dt \ar or
(Se (3.10)
The Lagrangian in the K’ inertial frame of Section 3.1, which is related to the
present frame by the transformation of Equation 3.1, is then given by:
ticles. If the particles are distinguished by the subscript i, the total Lagrangian
may be written:
t= Z ymir} — Vir;) = T(t)— Vir). (3.12)
The first term is the kinetic energy T while V is the potential energy; the reason
for these names and the presence of the minus sign will become apparent in
the next section. The repeated subscript summation convention is not appro-
priate for the first term, since the subscript i differentiates particles and not
vector components. Substitution of Equation 3.12 into Lagrange’s equations
of motion (Equation 3.7) gives three equations for each particle:
Since the kinetic energy can also be a function of the generalized coordinates,
the generalized force F,, must be defined as:
F, = (£/@qn). (3.15)
Earlier it was stated that the Lagrangian can also be a function of time, and
this occurs if the particles are subject to some sort of time-dependent external
potential; this possibility will be encountered later.
For a closed system, that is one that is not subject to an external potential,
there are a number of functions of the coordinates and velocities that remain
constant throughout the motion. Of these constants of motion the most im-
portant are those that are a direct consequence of the homogeneity and isotropy
of time and space. Here they will be derived using cartesian coordinates, al-
though the extension to generalized coordinates will be discussed.
44 Advanced Molecular Quantum Mechanics
The homogeneity of time implies that for a closed system the Lagrangian
does not depend explicitly on time. Thus:
we 7 ede) * Ga)
S Ciba VOL\ 4a da . |of
se at alae)
i en
Rearrangement of this equation shows that:
does not vary with time and is a constant of motion. Substitution of the ex-
plicit Lagrangian (Equation 3.12) for a system of particles now gives:
EB = ¥ qn(0Lll0gn)
—£ = Tan, dn) +Vdn)- (3.20)
Since space is homogeneous an arbitrary infinitesimal displacement & of a
system of particles must leave the Lagrangian unaltered:
2 (o£/dr;) = 0, (3.22)
d(ae\ dw(ae
S (2) ' Plea =O) (3.23)
U
Classical Mechanics 45
pi = (dL£/dr;); (3.24)
substitution of the Lagrangian (Equation 3.12) shows that this is equal to m;t;.
In addition, substitution of Equation 3.24 into Lagrange’s equations of motion
and use of Equation 3.13 shows that:
where F; is the force on particle 7. From this equation it is apparent that Equa-
tion 3.22 shows that & F; = 0 and in particular for a system ofjust two par-
l
ticles we may recover Newton’s third law, that to every action there is an equal
and opposite reaction.
Equations 3.24 and 3.25 may be generalized:
UE Ft (3.27)
Finally, the isotropy of space implies that the Lagrangian is invariant to in-
finitesimal rotations of the axis system. Use of this condition leads to a third
constant of motion, the angular momentum. The rotation of a system of par-
ticles through an angle 66 may be represented by a vector 66 directed along
the axis of rotation, the sense of the rotation being given by the right-hand
screw rule. If the axis passes through the origin of the coordinates, the ro-
tation changes a position vector r by an amount 6r= 66, r and the change
in the time derivative of r is given by 5r = 60 , r. The corresponding change
in the Lagrangian is thus:
bL£ II >» [(0L/dr;).6r; + (0.£/0r;).54;]
is a constant of motion. This is the angular momentum and as for the linear
momentum it is an additive property of a system of particles.
d£ = ¥(0L/8qn)ddn + ¥ (BL/24n)
Gn. (3.30)
where again a closed system is assumed so that the Lagrangian does not depend
explicitly on time. Substitution of Equations 3.26 into Equation 3.30 gives:
df = = (Pnddn + Pndgn)
or on rearrangement:
and reference to Equation 3.27 shows that it is just the energy of the system.
Hamilton’s equations of motion may now be obtained directly from Equation
3:32
Bibliography
Classical mechanics
Special Relativity
The theory of classical mechanics, which was discussed briefly in the previous
chapter, is classical in two senses. First, it is not a quantum mechanical theory
and is thus not appropriate on a molecular scale. Secondly, it takes no account
of special relativity. The use of the word ‘classical’ is thus ambiguous, since in
different contexts it may mean non-quantal or non-relativistic or both. It is the
purpose of this chapter to remove one of the restrictions and develop a rela-
tivistic but non-quantum-mechanical theory for a free particle.
The relativity principle, that the laws of physics are the same in all inertial
frames, has been met already in Section 3.1. However, the Galilean transform-
ation (Equation 3.1) used there to relate inertial frames, is based on the as-
sumption that time is absolute, that is the time interval between two events
will be the same no matter what inertial frame is being used. We will see that
this assumption is not in fact correct.
Experiment shows that, if two particles interact in some way, a change in
one does not have an instantaneous effect on the other, but that there is a
finite time lapse between the cause and the effect. This in itself points out a
difficulty with classical (non-relativistic) mechanics, since the interaction is de-
scribed by a potential energy that is taken to be a function of the particle
positions alone, an assumption that implies the instantaneous propagation of
interactions. If there is a finite delay between cause and effect, there must be
a maximum velocity of propagation of interaction that determines the mini-
mum delay before a change in one part of a system begins to affect another
48
Special Relativity 49
Although time is not absolute and the time lapse between two events depends
on the inertial frame used, there is another quantity, the interval, that is inde-
pendent of the inertial frame.
To demonstrate this the invariance of the velocity of light is first expressed
mathematically. Suppose that in an inertial frame K a signal leaves the point
X1, ¥1,2Z1 at time ¢, with velocity c and arrives at another point x2, V2, Z2 at
time ty. The distance between the two points may be related to the time taken
by the signal to travel between them:
is of importance; S12 is called the interval between the two events. Since the
velocity of light is the same in all inertial frames, Equations 4.1 and 4.2 show
that, if the interval s;. between two events in K is zero, the interval sin K’ for
the same two events also vanishes.
If the two events occur at two world points that are infinitesimally close to
one another and whose coordinates are separated by dx, dy, dz, dt in K, then
the interval ds is given by:
imply that different points in space and time are not equivalent, in contradic-
tion to the homogeneity of space and time. Thus it can only depend on the
relative velocity v of the two inertial frames K and K’. However, the isotropy
of space demands that k cannot depend on the direction of the relative motion
of the two frames and & can only be a function of the magnitude of v. If ds”
had been expressed in terms of ds’* the corresponding equation to Equation
4.5 would have been:
ds? = k'ds’?, (4.6)
where k’ is the same function of the magnitude of — vas kis of the magnitude
of + v. Consequently k = k' and comparison of Equations 4.5 and 4.6 shows
that k must be + 1. The negative solution can be dismissed immediately, since
ds’? must tend to ds? as the relative velocities of the two frames tends to zero,
so that:
dss ds (4.7)
Similarly ds’ must tend to ds as v tends to zero and ds = ds’. Integration to
give finite intervals then gives the final result that the interval between two
events is invariant to the inertial frame used:
Si = Sp, (4.8)
and this invariance may be attributed to the fact that the velocity of light is
the same in all inertial frames.
Now that the interval between two events has been shown to be invariant to
the inertial frame used, the explicit transformation between space and time
coordinates in different frames can be found. Two inertial frames K and K’ are
again considered and for simplicity it is assumed that the x and x’ axes coin-
cide, and the y and z axes are parallel to the y’ and z’ axes, respectively; the
velocity of the frame K’ relative to K is v along the x axis (see Fig. 4.1).
It has already been noted that time is not absolute and that it is convenient
to think of a four-dimensional space in which one of the coordinates is related
to time. The time and space coordinates may be made equivalent by introduc-
ing the new time coordinate:
T = ict, (4.9)
since the interval (Equation 4.3) between two events is then given by:
so that, if 7 is the coordinate associated with the fourth dimension, the inter-
val is related to the distance between two points in four-dimensional space.
The transformation from one inertial frame K with coordinates x, y,z,7 to
another K’ with coordinates x’, y’,z’, 7’ can thus be thought of as a transform-
ation in four-dimensional space. Since the interval is independent of the inertial
frame, the length of a line must be preserved in such a transformation, that is
only orthogonal transformations are of interest.
Fig. 4.1. The relationship between the inertial frames K and K’.
x = x’ cos@—7' sin;
} (4.11)
T =x’ sinO +7 cos@.
To relate @ to the relative velocity v ofK and K’, the motion of the origin of
the K’ frame is considered, that is x’ is set equal to zero. Substitution in Equa-
tions 4.11 gives:
x = —T sind; T= T cosO, (4.12)
or combining these:
tan @ <=. ==)7: (4.13)
Special Relativity 53
Now, if the origins of the two frames are assumed to coincide at T= t = 0, the
origin of the K’ frame has the coordinate x = vt = — ivt/c in the K frame,
where Equation 4.9 has been used to relate ¢ and r. Substitution in Equation
4.13 now gives:
tan0 = iv/c. (4.14)
Using familiar trigonometric relations, sin @ and cos 8 may now be recovered:
% = ¥(x' —iv7'/c);
(4.17)
tT II= y(ivx'/o+7').
Finally the time coordinate ¢ may be reintroduced using Equation 4.9 to give:
which are appropriate Galilean transformation equations for the situation under
consideration (cf. Equation 3.1).
54 Advanced Molecular Quantum Mechanics
from the last of Equations 4.19. In the method used here the times at which
xX, and x2 are determined is not important, since the rod is at rest in K. From
Equation 4.23 it can now be seen that the length of the rod in K’, the frame
in which it is moving, is less than the length in its rest frame K. This shortening
is known as the Lorertz (or Fitzgerald) contraction.
A similar phenomenon occurs when time lapses are considered. For a clock
that is at rest in the K frame and is consequently moving at velocity —v in the
K' frame, the time between two events that occur at eee ater wander ets
At = t, — t. The corresponding time interval, Ar’ = t3 — f',, in K' may be ob-
tained from Equations 4.19:
Both the Lorentz contraction and time dilation are real phenomena, though
after more than half a century the paradoxes associated with them are still
discussed frequently and there are those who remain unconvinced. We will
confine our discussion to one experimental manifestation of time dilation and
one of the paradoxes associated with it.
Mu-mesons, which are created by cosmic rays in the upper atmosphere at a
height of approximately 10* m, are observed on the ground. Their observed
lifetime in the laboratory is only 2 X 10°°s suggesting that those reaching the
ground must travel at velocities of the order of 10c! However, the lifetime
measured in the laboratory is appropriate for the rest frame of a mu-meson
and, as a result of time dilation, the lifetime will be greater by a factor of
(1 — v?/c?)? in a frame in which the mu-meson is moving with velocity v. Mu-
mesons with velocities close to that of light can thus reach the ground before
they decay.
One of the more famous paradoxes is the so-called clock paradox. If two
clocks are synchronized in the rest frame and one then undergoes a round-trip
at high velocity, on its return the travelled clock will be found to be running
slow relative to the other as a result of time dilation. The paradox arises when
it is argued that the two clocks should run at the same rate, since there is a
symmetry between them in that their roles could be reversed by choosing a
different rest frame. However, although this symmetry between the clocks
exists when they are moving uniformly with respect to one another, it is not
present during the acceleration and deceleration that the travelling clock must
experience during its journey.
This equation expresses the result of adding two velocities, and can be seen to
reduce to the classical result, w = u’ + v, for low velocities. In addition, it is a
simple exercise to show from Equation 4.30 that the relativistic sum of two
velocities can never exceed the velocity of light.
Sa Ld
b ds, (4.31)
where k is a constant and a and bare two world points. The sign of k is nega-
tive, since the action is required to be a minimum for actual motion and it was
seen in Section 4.4 that fe ds is maximum for a straight world line and could
be made arbitrarily small by a suitable choice of a curved world line.
From Equation 4.4 ds is given by:
£ = ke—jke(ulcy+...; (4.36)
the first term may be ignored, since it is a constant and makes a contribution
to the action integral (Equation 4.34) that is independent of the path of the
integral. If k is taken to be — mc the second term reduces to }mu? and the
remaining terms all vanish in the classical limit. The relativistic Lagrangian for
a free particle is thus:
£= —me1 — Jc). (4.37)
Special Relativity oY
From this Lagrangian the momentum and Hamiltonian of the particle may
be determined from the definitions of the previous chapter. Using Equation
3.24 the momentum of the particle is:
4.7 Four-vectors
It has already been noted that time may be thought of as a fourth dimension
and that the location and time of an event may be represented by a world
point in four-dimensional space. The position of this world point may be speci-
fied by a vector with four components ry (a= 1, 2,3, 4), where ry, r2 and
r3 are the spatial coordinates x, y and z respectively and rg is related to the
time coordinate. To distinguish this type of vector from those in three-dimen-
sional space, they are called four-vectors and here Greek subscripts are used to
label their components.
By analogy with three-dimensional vectors the scalar product of the position
four-vector with itself is defined as:
where the repeated subscript convention is again used, but since a four-vector
is involved, the summation is over the four possible values of the subscript a.
If r4 is to chosen to be 7 = ict (Equation 4.9), then Equation 4.44 becomes:
Fat Oe ae ay (4.45)
Fy C1
— Weal). rr rr:
(4.46)
rg = Y(ivrilc + 74),
where y retains its original meaning (Equation 4.16).
A general four-vector may now be defined as one having components that
transform in precisely the same way as the rg under a Lorentz transformation.
With this definition the scalar product of any two four-vectors, for example
A, By, is invariant to such transformations and is independent of the inertial
frame used, as can be seen by expansion of the product.
Although it was not at all obvious, there were four quantities in the pre-
vious section that constitute the components of a four-vector. The way in
which the components of the momentum p (Equation 4.38) of a particle be-
have under a Lorentz transformation may be established using Equations 4.28
and 4.29, which describe the transformation of velocities. The algebra is simple
but tedious, and will not be given here, the result being that the components
of p are the spatial components, p;, p2 and p3, of a four-vector the correspond-
ing time component being p4= i#(/c, where H is the particle’s Hamiltonian or
Special Relativity 61
energy (Equation 4.39). It follows from the invariance of the scalar product
of two four-vectors that pyPq should be invariant:
rp = ope (4.50)
Now under the transformation the operator (0/0r,,) transforms as:
(0/8rq) = (8rg/Arq)(8/9r8)
= dog (0/0rg), (4.51)
which transform in the same way as the sixteen quadratic products ryrg, that
is:
Fag = Lay4Bé Frys a (4.52)
Bibliography
Special Relativity
Feynman, Leighton and Sands: Volume I, Chapters 15—17.
Goldstein, H. (1950), Classical Mechanics, McGraw-Hill, New York: Chapter 6.
Landau and Lifschitz: Chapters 1 and 2.
Rindler, W. (1966), Special Relativity, Oliver and Boyd, Edinburgh.
CHAPTER FIVE
So far we have been concerned with the mechanics of a free particle, and the
possibility of a particle interacting with an electromagnetic field must now be
considered. This willlead naturally to a discussion of the electromagnetic field
itself. It has already been noted that Maxwell’s equations, which describe elec-
tromagnetic fields, were found to be invariant to Lorentz transformations
rather than Galilean transformations and that this discovery preceded Einstein’s
theory of special relativity. Consequently it is appropriate that electromagnetic
theory should be developed from a relativistic stand point, although this was
not the case historically. The approach used here is a logical one, but of course
the final justification of the theory is that it agrees with experiment.
5.1 Units
It is at this point that a decision must be made about units. In mechanics the
transition between different systems of units is trivial and usually involves
only factors of 10. However, in electromagnetic theory the appearance or non-
appearance of the factors 47, c, €9 and Uo depends on the system of units
adopted.
In the past the system of units used by chemists and chemical physicists
has usually been the mixed or Gaussian system, which is based on centimetre-
gramme-second (CGS) units, but in which magnetic quantities are measured in
electromagnetic units (e.m.u.) and electrostatic quantities in electrostatic
units (e.s.u.). Because of the way these units are defined, equations connecting
the two sorts of quantity contain factors of c. This appearance of c is not a
consequence of special relativity, but merely reflects the separate initial
63
64 Advanced Molecular Quantum Mechanics
where s is the interval and the integration is performed between the world
points @ and b. If the particle interacts with an electromagnetic field, further
The Interaction of Charged Particles with Fields 65
terms have to be added to this action integral, and these terms will involve
quantities characteristic of the particle and the field. The extra terms must, of
course, be Lorentz invariant, but beyond that little can be said about them
and an appeal to experiment is necessary. That is, the justification of the terms
that will be added is that the equations that they lead to are in agreement with
experimental observations.
Experiments indicate that only one new parameter, the charge q, is needed
to characterize a particle as far as its interaction with an electromagnetic field
is concerned, but that four quantitiesA, (a= 1, 2, 3, 4), which are functions
of time and the position of the particle, are needed for the field. The Ay are
the components ofa four-vector and thus under a Lorentz transformation they
behave ina similar manner to the rg, the components of the position four-vec-
tor. The spatial components, A;, A2 and A3, are the components of a three-
dimensional vector A, which is called the (magnetic) vector potential, while
the time component A q is related to the electric or scalar potential ¢ by:
Aa = idle. (5.2)
A theory that agrees with experiment may be developed by taking the inter-
action terms in the action integral to be:
i oqgAcin. (5.3)
the integrand is proportional to the invariant formed by the scalar product of
the potential four-vector and the infinitesimal position four-vector, which has
components dry.
The total action integral thus appears as:
u = dr/dt. (5.6)
and comparison with the definition (Equation 4.34) of the action integral in
terms of the Lagrangian £ shows that:
=nm+qA, (5.9)
where tris called the mechanical momentum, that is the momentum in the ab-
sence of the field; p is the total momentum conjugate to the velocity u and is
distinguished from w by being referred to as the canonical momentum. The
momentum of a particle is thus modified by the addition of aterm involving
the vector potential A.
Equation 3.33 gives the Hamiltonian of the particle:
H = pu—L = mcX(1—uJc?2)?
+ go. (5.10)
This may be expressed more appropriately in terms of the canonical momen-
tum p in the same way that Equation 4.41 was obtained for the case of no
field:
H c[mc? + (p—qA)’]? + a
= ¢(m2c? + 1)? + go. (5.11)
Comparison of Equations 5.9 and 5.11 with Equations 4.38 and 4.41 shows
that to generalize from a free particle to a particle in a field, all that is necessary
is to replace p by wm= (p—qA) and KH by (H —q@). For example, the non-
relativistic limit of the Hamiltonian (Equation 5.11) is, by comparison with
the non-relativistic field-free Hamiltonian (Equation 4.43):
HK = mce?+qotn7/2m. (5.12)
In the SI system of units the charge g has dimensions TA, while the vector
and scalar potentials, A and ¢, have dimensions MLT 747! and ML?T 34“1, res-
pectively. For the mixed system of units Equations 5.9 to 5.12 are only modi-
fied in that the mechanical momentum must be rewritten as:
m= p—(qicA, (5.13)
where the factor c_! has no relativistic significance; however, there are other
hidden differences, since the dimensions of g are modified to M!2L3?7T~! and
A and ¢ have the same dimensions M21 }27~!.
The Interaction of Charged Particles with Fields 67
a (al) _ = by:
dr\du ory Pet Gi
where use has been made of the vector relation, Equation 2.66, and the fact
that u does not depend explicitly on r. Substitution of Equations 5.16 and
5.17 into Equation 5.15 followed by rearrangement gives finally:
in this form it is known as Lorentz’s equation. The vector B is called the mag-
netic flux density and is defined by:
B= V,A, (5.20)
The components of the magnetic flux density B and the electric field strength
E are related to the components of the four-potential A, and the position
four-vector rg by Equations 5.20 and 5.21. We might enquire how these com-
ponents behave under a Lorentz transformation. It will be seen that they trans-
form in the same way as quadratic products of the components of two four-
vectors, that is as the components of a four-tensor of second rank.
The sixteen quantities Fg (a, B = 1, 2, 3, 4), which are defined as:
Ag = agAg. (5.23)
In addition the operators (0/07) were shown to be the components of a four-
vector (Equation 4.51):
(0/dra) = ag(0/0rg), (5.24)
so that:
Fog = AquyAg6 Fs; (5:25)
comparison with Equation 4.52 shows that the Fg are indeed the components
of a four-tensor. However, explicit consideration of these components (Equa-
The Interaction of Charged Particles with Fields 69
tion 5.22), and use of Equations 5.20 and 5.21 shows that they are related to
the components of B and E; this relationship may be indicated by a matrix-
like array:
0 £. =B 5 HEC
a 0 Be ei) ea
(fan) = : : (5.26)
B, —B, 0 —iE,|c
EESopens! RS Cola ak
PiesFa 0
The electromagnetic field vectors B and E may be determined from the poten-
tials A and ¢ by Equations 5.20 and 5.21. However, since it is Band E that de-
termine physically observable phenomena such as the force on a charged
particle (Equations 5.19), it is pertinent to ask to what extent A and ¢ are
determined by B and E. It will be seen that there is some freedom in the choice
of these potentials and this may be attributed to the fact that integration of
Equation 5.20 or Equation 5.21 introduces an arbitrary integration constant.
Let A and ¢ be replaced by some new potentials given by:
where the fact (Equation 2.70) that the curl of a gradient vanishes has been
used. The electric field corresponding to ¢’ and A’ may be obtained from
Equation 5.21:
The Interaction of Charged Particles with Fields 71
E’
—V¢' — (0A'/dt)
—V$—V(df]at) — (@A/dt) + (AVs/dt) = E. (5.33)
Thus the potentials are not uniquely defined, since the electric and magnetic
fields are unaltered by the transformation of the potentials described by Equa-
tions 5.31; such a transformation is called a gauge transformation.
It is clear that a change of gauge must not affect the equations describing
physical phenomena and that these must be gauge invariant. For instance, we
might enquire what happens to the Lagrangian (Equation 5.8) under the gauge
transformation (Equations 5.31). Substitution shows immediately that the
Lagrangian is modified by the addition of:
where B and E are both constant vectors, substitution in Equations 5.20 and
5.21 and the use of vector relations from Chapter 2 shows that the correspond-
ing fields are just B and E:
where | is the angular momentum of the particle and p is the associated mag-
netic moment. This term represents the energy of the interaction between
the magnetic field and a magnetic moment u = (g/2m)l.
Equations 5.20 and 5.21 relate the electromagnetic fields to the potentials:
since for example they do not indicate how E depends on time. In fact, these
are just two of the four Maxwell’s equations, which are required for complete
specification of a field. The other pair of equations cannot be derived from the
theory that has been developed so far and it is necessary to go back to the ac-
tion integral and add terms for the field itself; up till now the action integral
has only included terms for a charged particle and for its interaction with an
electromagnetic field. Toavoida major diversion, the derivation of these equa-
tions is postponed until the next chapter, but they are quoted here for the
sake of completeness, and because one of the equations is needed in the next
section.
The second pair of Maxwell’s equations is:
V.B = 0
|OFop
,Woy , Fre _ og,
VAE = —(0B/at)) %y %a rg
(5.55)
praia |OF iol Ten
V,B = c?[j/eot+ (0E/30)]) 8 — €0e”
In the mixed system of units these equations are modified by the replacement
of B by B/cand €g by 1/47, but of course the dimensions of all the quantities
involved are also altered.
Finally, we should note that the permeability of free space uo often appears
in electromagnetic equations, but it is related to the permittivity of free space
€9 by Uo€o = c ”, so that equations may be written in terms of Mo, €o or both.
The quantity Uo has the fixed value of 4m X 10°? kg ms 7A ’ in the SI system,
so that € 9is uncertain to the extent that there is an experimental uncertainty
associated with the value of the velocity of light c. Of course use of Wo instead
of €9 or vice versa can introduce factors of c, for which special relativity is not
responsible.
The space surrounding an isolated point charge q has spherical symmetry. This
means that ata pointr relative to the charge as origin the vectors B and E must
be along the radius vector r. In addition, the magnitude of the fields can only
depend on the distance from the charge.
Although Gauss’ law (Equation 5.50) has not yet been derived, it should be
familiar to the reader and may be used in its integral form (Equation 5.53) to
obtain the magnitude F of the electric field at a distance r from the charge.
By considering a sphere of radius r centred at the charge, the surface integral
on the left-hand side of Equation 5.53 becomes:
E = q/4neor’, (5.58)
and, since the direction of the field is along r:
E = qr/4ne€or’. (5.59)
The electric field is given in terms of the potentials by Equation 5.21:
since the charge is stationary and the magnetic vector potential A must be
independent of time. When it is recalled that V(1/r) is equal to — r/r* (Equa-
tion 2.78), it is seen that an appropriate scalar potential is:
@ = q/4neor. (5.61)
To obtain the magnetic field B the same arguments may be employed ex-
cept that Equation 5.47 is used instead of Gauss’ law. It is immediately appar-
ent that the magnetic field is zero, and that the magnetic vector potential A
may be taken to be zero also.
From the previous section we know that, in the rest frame K “ofa charge q,
the potentials at a distance r’ from the charge are given by:
Aa 0; @ = q/4neor’. (5.62)
The potentials in a frame K, in which the charge is moving, may be obtained
by performing a Lorentz transformation, since A‘, A, A, and i¢’/c are the
components of the potential four-vector. If the velocity of the particle in K is
u in the x direction, Equation 4.46 gives:
A, V4,
+ubiG)., | Ay ae A.
(5.63)
a7= A; ¢ = y(uA,+¢’),
where y = (1 — u/c?) 7, or substituting Equations 5.62 and generalizing to a
particle moving with velocity u:
Although these potentials have been established for a specific situation, they
apply generally if r is interpreted as the position of the charge relative to the
observer (in the previous section r was the position of the observer relative to
the charge). The important thing to note is that the expressions 5.68, which
are known as the Liénard-Wiechert potentials, must be evaluated at the time
the signal leaves the charge. The potentials are thus expressed in terms of the
so-called retarded values of r and u. Clearly the observed potentials only de-
pend on the instantaneous velocity and position of the particle at the retarded
time, the time at which the signal is emitted by the charge, so that Equations
5.68 apply to a charge moving in an arbitrary manner. The retarded potentials
may be expressed in terms of non-retarded quantities provided that there is
sufficient information about the motion of the particle. This is fairly simple
for a uniformly moving charge, but rather than do this here another method
is presented, in which the retardation effect is automatically accounted for.
Retarded potentials arose because the event to which the Lorentz trans-
formation 5.66 applies is the emission of the signal from the charge and the
relative velocity of the two frames must be taken as the charge’s velocity at
this event. However, if the charge is moving with constant velocity, the same
rest frame K’ for the charge is appropriate at both the emission of the signal
and the observation of the potentials. The position r’ of the charge in K’ may
be evaluated in terms of r at any time, and here the time of observation t =
t' = Oischosen. The inverse of the transformation (Equations 4.21) then gives:
78 Advanced Molecular Quantum Mechanics
r= rt+(y—1)(unu/v’, (5.69)
from which it is easy to show that:
The Lagrangian for an isolated particle with charge q,, mass m, and velocity
u, in the presence of electromagnetic potentials A and ¢ is given by Equation
5.8 as:
£L = —mc(1 —u3/c?)? + qyA.u,
— 419. (5.74)
For the remainder of this section it will be assumed that u, <c. If the poten-
tials at r,, the position of this particle, are entirely due to a second slow-moving
particle distinguished from the first by the subscript 2, then they are given by
Equations 5.73:
A (q2/8m€9cr) [ug/¢ + (u2.r)r/r°c =F O((u2/c)*)];
(5.75)
= (q2/4meor)[1 + O((u2/c)*)],
where r= (1.— r,) is the position of electron 2 relative to electron 1. Substi-
tution in Equation 5.74 gives:
these results are to be applied to molecular problems and in the ground state
of the hydrogen atom, for example, (q¢1q2/87€or) is of the order of me?(u/c)?
(see Section 8.7). Thus:
= ris U;-Pj — Af
Piri = Dae aA
(5.82)
>I PP Gidi,
1=1,2
where A; and ¢; are the external potentials at particle 7. This result will be used
in Chapter 9 when the quantum mechanical theory for two electrons is con-
sidered.
The Interaction of Charged Particles with Fields 8]
The concept ofelectron spin was first introduced to account for fine structure
in atomic spectra. It was postulated that the electron has an intrinsic angular
momentum fs and an associated magnetic moment uw = — (eh/m)s, where — e
and m are the charge and mass of the electron; in a magnetic field the energy
associated with this moment is —y.B. However, a discrepancy soon became
apparent, since an electron moving with velocity u (<c) under the influence
of an electric field E, should according to Equations 5.29 experience a mag-
netic field:
B = (E,u)/c’ (5.83)
in its own rest frame. The energy of interaction between the spin magnetic
moment of the electron and this magnetic field should thus be — w(E , u)/c’,
but in atoms where the electric field is provided by the nucleus experiment
showed that this expression was a factor of two too large. This discrepancy
can be explained in terms of a relativistic effect, if the concept of spin is ac-
cepted, and this may be done without considering quantum mechanics.
To do this the concept of precession must be introduced. In a magnetic
field B an electron experiences a torque u , B by virtue of its intrinsic magnetic
moment pw. Now this torque will change the angular momentum of the electron
and the rate of change of the angular momentum is just equal to the torque:
Thus the change in the angular momentum is perpendicular to itself and the
magnetic field, so that the magnitude of the angular momentum vector and
the angle @ that it makes with the magnetic field vector do not change, but
instead the angular momentum precesses about the direction of B (Fig. 5.1).
The angular velocity w, of this so-called Larmor precession is in the same di-
rection as B and its magnitude may be found by considering a short time df,
in which the precession angle d@ is w,dt. This angle is given also by:
As
Che
Fig. 5.1. The precession of the angular momentum of the electron about the
direction of the magnetic field.
Let us now consider an electron moving with velocity u along the x axis of
an inertial frame K in which there is an electric field E directed along the y
axis. In the rest frame of the electron the precession will have an angular vel-
ocity about the z axis of:
pe eUlinic., (5.87)
where Equations 5.83 and 5.86 have been used. Now by virtue of its charge
the electron will experience a force due to the electric field and will suffer a
resulting acceleration:
@ = —eH/m. (5.88)
in the y direction. It will be seen that as a result of this acceleration the axes
of the rest frame of the electron appear to be themselves precessing when
viewed from the frame K.
After a short time dt the situation will be as in Fig. 5.2. The rest frame K”
of the electron is moving with velocity u' = adt in the y’ direction relative to
a frame K’, which is itself moving with velocity wu in the x direction relative to
the K frame; the frames are assumed to be coincident at t=O andu' <u <c.
Use of Equations 4.28 and 4.29 shows that the velocity v of the K” frame rela-
tive to the K frame has components:
Vy = U; vy = u/ylu), (5.89)
where y(u) = (1 —u?/c?)
u“/c*) 2.
?. On On the
the other
other h hand the velocity
ity v v" of theK frame
relative to the rest frame of the electron has components:
The Interaction of Charged Particles with Fields 83
Fig. 5.2. The relationship between the inertial frames K, K’ and K”.
just half its magnitude in the rest frame of the electron (Equation 5.87).
84 Advanced Molecular Quantum Mechanics
Since the energy of interaction between a magnetic field and the spin mag-
netic moment of the electron is directly proportional to the angular velocity
of precession, we conclude that, when the magnetic field is due to the motion
of an electron relative to a frame in which there is an electric field, special
relativity introduces a correction factor of ;; the energy of interaction is thus
—u.(E , u)/2c? and this is just the spin-orbit interaction term when the elec-
tric field arises from a nucleus. Later it will be seen that this correction factor
arises naturally from the Dirac equation, which is a relativistic quantum mech-
anical equation for the electron.
Bibliography
Electromagnetism
Coulson, C. A. (1948), Electricity, Oliver and Boyd, Edinburgh.
Feynman, Leighton and Sands: Volume II.
Landau and Lifschitz: Chapters 3, 4 and 5.
Jackson, J. D. (1962), Classical Electrodynamics, Wiley, New York.
Panofsky, W. K. H., and Phillips, M. (1962), Classical Electricity and Magnet-
ism, Addison-Wesley, Reading, Mass.
Rindler, W. (1966), Special Relativity, Oliver and Boyd, Edinburgh: Chapter 6.
The classical field theory that is developed here is not used until Chapter 12,
where its quantization is considered. Consequently, a reading of this chapter
may be delayed until then, provided that the validity of Gauss’ law (Equation
5.50) is accepted; it is recalled that the proof of this equation, which is used
in Section 5.7, was postponed until this chapter, together with the proof of
Equation 5.51.
The reason for this postponement was that, although the Lagrangian for a
charged particle subject to an electromagnetic field had been obtained, the
Lagrangian for the field itself had not. There is clearly a fundamental differ-
ence between a particle, which is concentrated at a point, and a field, which
is spread out through space. The Lagrangian for a particle is a function of its
position and velocity, while for the Lagrangian of a field some new variables,
which are themselves functions of position r and time ¢, are necessary. This
means that our ideas about Lagrangians, Hamiltonians and equations of motion
will need some modification, when a continuous, rather than a discrete, system
is under consideration. To see how the theory needs to be modified we first
discuss the mechanics of a continuous system.
85
86 Advanced Molecular Quantum Mechanics
Here it is assumed that the Lagrangian does not depend explicitly on time,
and that 6n, the change in n, is zero att, and t,and also at the limits of the
space integration in Equation 6.3. Substitution of Equation 6.3 in Equation
6.5, expansion and integration by parts gives:
lo
65 = 8| |L(n, i, an/ax)dxae
~
OL OL OL F)
ll —— 6n +—6n+——— 6 (—
pee” E ay SPRCOnan)e aney ee
The Classical Theory of Electromagnetic Fields 87
_ ee)
ple p[ae_a fat)_a/
Syuatlenl
a Vi.
Parl aOnen Co
where the integrated parts have been omitted, since they vanish as a result of
the assumed boundary conditions. Now 5S must vanish for an arbitrary choice
of 5n, so that:
ab afar) _a/_a_)_, Be
dn at\dn} ax\a(an/ax)) ee)
This is the equation of motion, which is to be compared with Equation 3.6 for
a discrete system.
For the example used here, substitution of the Lagrangian density (Equa-
tion 6.4) in Equation 6.7 gives:
p = (AL/dn), (6.9)
while the Hamiltonian density is:
H = np—L. (6.10)
In our example the momentum density is just un and the Hamiltonian density,
which is given by:
H = 3[un? + Y@n/dx)’], (6.11)
is clearly the sum of kinetic energy and potential energy densities.
So far attention has been confined to a one-dimensional system, but the
extension to three dimensions is trivial. Remembering that the subscript 7 can
be x, y or z and that the repeated subscript summation convention is being
used, Equation 6.7 generalizes to:
where 7 is now a function of the position vector r and the time f; by using
four-vector notation this equation may be written in a more abbreviated form:
88 Advanced Molecular Quantum Mechanics
ab_
Ore (_2L_) (a@=nlarg) )
(an) ,i
(2) l (6.14)
Use of the equation of motion (Equation 6.13) and the fact that 0(0n/0rg)/drq
is the same as 0(07/0r,,)/0rg now shows that:
on OL
Leg = Soph a= lee
eh! fa | sod)
It should be noted that the Tyg are not uniquely determined by Equation 6.16,
since its validityisunaffectedif dU ggy/dry is added to Tyg, where Uggy is any
tensor antisymmetric in the subscripts B and ¥.
Equation 6.16 may be put in a more useful form by interpreting the left-
hand side as the four-divergence of a four-vector TJ with components Tyg. It
can then be rewritten as:
so that the time derivative of the integral of Tq4 over all space is:
0
apt Lead? = f (07x, /dt)dt
ll =I) Veer
where Gauss’ theorem (Equation 2.57) has been used. If it is assumed that
and its derivatives vanish at infinity, then so do the components of T, and the
surface integral on the right-hand side of Equation 6.19 vanishes. Thus f Ty 447,
the integral of 7, over all space, is a constant in time and we may define the
four-vector:
Gy = k fTy,dt = constant, (6.20)
Gg = kf Tagdt = k f [L — (0n/0t)(0L/0(0n/dt))|
dr. (6.21)
Equations 6.9 and 6.10 now show that, if k has the value — i/c:
G4 = (i/ce)
f Hdt = (i/c)K, (6.22)
where # is the Hamiltonian. Thus G, is just the time component of the 4-
momentum, as may be seen by reference to Section 4.7. Consequently the
four-vector:
Ge Cie OT (6.23)
characterized by more than one variable and, for example, these could be the
components ofa vector. For such a vector field there is an equation of motion
similar to Equation 6.13 for each component of the vector. In addition, there
is a momentum density conjugate to each field variable and the Hamiltonian
density generalizes to:
H = Xm (0L/0n,)
—L, (6.24)
where the subscript 1 distinguishes the field variables. Similar remarks apply
to the tensor Tg and the four-momentum Gq.
In the last chapter it was seen that an electromagnetic field may be character-
ized by the components Ag of the potential four-vector. These four com-
ponents are functions of position r and time f, and they might be expected to
be appropriate field variables, so that there are four equations of motion (Equa-
tion 6.13) for the field. However, before we can proceed any further an appro-
priate Lagrangian must be found and to do this the action integral is again
considered.
For a system consisting of charged particles and an electromagnetic field
the action integral consists of three parts, one for the particles, one for the
field and one for the interaction between the two. When the equations of mo-
tion of a particle were obtained, only those terms in the Lagrangian which
depend on the position and velocity of the particle were needed; that is the
electromagnetic potentials were regarded as known and an explicit term for
the field was unnecessary. However, in finding the equations of motion for a
field, the potentials are the variables and the motion of the charged particles
is assumed to be known. In this case the action for the particles alone is un-
important, since it is not a function of the field variables. The action due
to the field itself has yet to be derived, but the interaction term is already
known (Equation 5.3), although it will be necessary to devote the next sec-
tion to rewriting it in terms of the current four-vector.
A number of restrictions may be made on the action integral for the field.
In the last section it was seen that the Lagrangian density is a function of the
field variables and their partial derivatives with respect to space and time co-
ordinates (if it were to depend explicitly on time, this would imply that time
is not homogeneous). However, explicit dependence on the potentials them-
selves is not possible, since they are not uniquely determined and the Lagrangian
density must be invariant to gauge transformations. Thus, only quantities of
the type (0A,/drg) are involved, and these are related to the Fug, the com-
ponents of the electromagnetic field tensor (Equation 5.22).
The Classical Theory of Electromagnetic Fields 91
It has also been noted that the Lagrangian density must be an invariant, if
the Lorentzinvariance of the equations of motion is to be ensured. An invariant
(Equation 5.30) has already been constructed from the F yg and this suggests
that for an electromagnetic field alone an appropriate action integral is:
“) 0A
E.B = (ic/8)€agysFaghys = (ic/2)— |€apysAp — |. (6.26)
Oy, ory
where €ys5 is the unit antisymmetric tensor of rank four analogous to €;;, for
three dimensions; the details are left to the reader.
It remains to choose a value for the constant a in Equation 6.25 and clearly
this will depend on the system of units employed. For the SI system agree-
ment with experiment is obtained by taking a to be — c7€o/4 (= — 1/4 uo),
where €o and Mo are respectively the permittivity and permeability of free
92 Advanced Molecular Quantum Mechanics
space; it may easily be checked that this choice makes the action integral
dimensionally correct. That a@ must be negative is due to the fact that the
quantity (0°A/d¢*) appears with a negative sign in Faghag. However, it must
appear with a positive sign in the action, since it may be made arbitrarily large
by varying the potentials sufficiently rapidly, and if it had a negative sign the
action could not be minimized as required by the principle of least action. The
Lagrangian density for a field is thus:
DP = CeohapFaght = —yeolC
Ba Ee). (6.27)
By substitution of Equation 6.27 in the equations of motion (Equation
6.13) the field equations for free space could now be obtained, but the more
general situation in which electric charges are present is considered after the
Lagrangian density for the interaction of charges with a field has been deter-
mined.
The action integral for the interaction of a point charge q with an electromag-
netic field is (Equations 5.3):
f2qA da. (6.28)
Now that the field itself is under consideration, it is desirable to be able to
treat any distribution of charge and not just a single point charge. To do this
the charge density p is introduced. The total charge in a region of space is then
the appropriate volume integral of p, that is f pd7. A point charge q may be
included in this description, since its contribution to the charge density may
be written in terms of a Dirac delta function, g5(r — rq), where r, is the vector
position of the charge.
For an infinitesimal region of space dz the included charge is dg = pdr.
Thus we can write:
Ja = p(drg/dt), (6.30)
is a component of the so-called current four-vector. The spatial components
of this four-vector can be written as j = pu, where uis the charge velocity at a
The Classical Theory of Electromagnetic Fields 93
particular point, so that j is the flow of charge per unit time per unit area
across a surface; the time component icp is proportional to the charge density.
If the charge q is replaced by the volume integral of the charge density, the
action (Equation 6.28) becomes:
The results of the two preceding sections show that the Lagrangian density for
an electromagnetic field in the presence of a charge distribution is:
L = jyAg — €€oF
ygF ag/4, (6.32)
where the components of the electromagnetic field tensor are given by:
a) a) 0As OA 0As 0A
jan + (C*e0/4)=—Org \0(0Aq/drg) es Org}4) ea
[\Or, \Ory Ore
This is just the second pair of Maxwell’s equations written in four-tensor nota-
tion; it was quoted (Equation 5.52) and interpreted in the last chapter.
For convenience Maxwell’s equations are repeated here in full and abbrevi-
ated form:
V.B =
= 0 | OFap , Fey , WF ya _ . 6.37
(6.37a)
V,jE=—(aB/at)) %y Fa (6.37b)
V.j+dp/at = 0; (6.40)
this is just the charge conservation equation 2.55. This result could have been
obtained using four-tensor notation by operating on the combined equation
for Equations 6.37c and 6.37d with the operator 0/dr,, and summing over the
subscript a (that is taking the four-divergence of Equation 6.36):
Ojn/Ory = 0, (6.42)
which is just Equation 6.40 written in terms of four-vectors.
We started this section with the Lagrangian density (Equation 6.32) for an
electromagnetic field in the presence of a charge distribution. The results of
Section 6.1 can be used to obtain the momentum and Hamiltonian of the
The Classical Theory of Electromagnetic Fields 0)
field, and for this it is assumed that no charges are present, that is jg = 0.
The Lagrangian density is then — €7€oF ag agl4 and its derivative with re-
spect to 04,/drg is:
Now the left-hand side is just equal to — V7B from Equation 2.72 and the fact
that V.B vanishes (Equation 6.37a). Substitution of Equation 6.37b for V , E
followed by rearrangement then gives:
It is noted that these potentials also satisfy the Lorentz gauge condition (Equa-
tion 5.36).
Since the vector potential has been chosen to be divergenceless, Equation
6.55 reduces to:
V*A — (07A/0t7)/c? = 0. (6.59)
Again we have a wave equation, this time with the vector potential as variable.
The wave equations 6.52 for B and E in free space may be recovered from
Equation 6.59 by operating on it with V , and 0/df, respectively, and using
Equations 6.53. However, it is the wave equation for A that is used in the next
section.
The wave equation 6.59 represents three equations, one for each component
of the vector potential A. The general solution of one of these equations is:
A, = fk —wt), (6.60)
where f is any function of (k.r — wt) and w is an arbitrary constant, while the
arbitrary wave vector k is restricted in that its magnitude has the value w/c;
these assertions may be checked by direct substitution of Equation 6.60 in
Equation 6.59. Of course, boundary and other conditions may be used to re-
strict the solution 6.60, but first it is convenient to write it in an alternative
form and this is done by solving Equation 6.59 by a different method.
Equation 6.59 is solved by the separation of variables method by substitut-
ing:
A(r, t) = q(AQ), (6.61)
98 Advanced Molecular Quantum Mechanics
to give two differential equations, one for q(t) and one for the components of
A(r):
d’q/dt? + w*q = 0; V7A + (w/c7)A = 0, (6.62)
where q(0), A, w and k are arbitrary except that the magnitude of k is restrict-
ed to w/c; in addition, if A(r, 4) is to be finite for all values of r and ¢, both k
and w must be real.
In Equations 6.63 e is a unit vector, which is called the polarization vector
and which can only be perpendicular to the wave vector k. This restriction
arises because A(r, f) is to satisfy the Coulomb gauge condition (Equation
6.58), so that A(r) must be divergenceless. Application of this condition then
gives:
e.k = 0, (6.64)
which shows that e and k must be orthogonal. For a given ¢ and k there is
another unit vector that is orthogonal to both, so that there are two linearly
independent choices of e and these are distinguished by the subscript A = 1, 2;
the vectors €;,€ 2 and k are mutually orthogonal.
Clearly any pair of functions of the form 6.63 is also a solution of the
wave equation 6.59, so that the general solution is a sum over all possible
values for k and all polarizations A. However, in writing down this general
solution we insist that A(r, ¢) be real, since the fields B and E derived from it
must be real. This may be accomplished by taking:
where y,, 2yy and ”,, are integers; these solutions are similar to the wave
functions for a particle in a box.
It may now be demonstrated that the A;,, are orthogonal to one another:
A= bal? (6.70)
is appropriate and the orthonormalization condition (Equation 6.69) becomes:
Although k can only take on discrete values (Equation 6.68), the side of
the cube may be made arbitrarily large and a sum over k may be replaced by
an integral:
% fik) > S fk)p(k)dk. (6.72)
The function p(k) is a number density and is defined so that p(k)dk is the
number of discrete values that k may adopt between k and k + dk. From Equa-
tion 6.68 it is apparent that:
B= Vi A= RON i[dxa(Ok
, Ann — Tka(DK
y Akal (6.77)
and
E = —(0A/0t) = 2 1Ok [dxxaQAna —dka(Akal; (6.78)
it may easily be seen that a given wave vector and its contributions to B and E
are mutually orthogonal. From now on the dependence of the g,, on time
will be omitted for brevity.
The contribution of the electric field (Equation 6.78) to the energy (Equa-
tion 6.76) is:
Addition of Equations 6.79 and 6.82 now gives the Hamiltonian for the
electromagnetic field:
It is important to note that the total energy for the field is the sum of the
energies of the fields associated with each wave vector k and polarization A. In
addition, the Hamiltonian is expressed entirely in terms of the q,(¢), which
are the solutions of the first of the differential equations 6.62.
Using the same techniques the total momentum of the field G (Equation
6.48) may also be evaluated:
Since the summation is over all k, both positive and negative, the first and last
terms in the parentheses make no contribution to the total sum. (This depends
on the order of gy, and g_, as well as that of qi, and q*,, being unimpor-
tant; this is of course true classically and it will be seen to be true also in the
quantized theory of Chapter 12.) Thus:
and as for the energy the total momentum may be thought of as the sum of
contributions from the individual wave vectors and polarizations.
It is now convenient to define the new real functions of time:
Onn = Ara
t+ ka; Pax = —1Wx
(Gun — Vir): (6.86)
Reference to Equations 6.62 shows that the Q,, are just the real solutions of
the differential equation:
d’?Q/dt? + w’Q = 0, (6.87)
while from Equations 6.63 it is seen that the Py, are the time derivatives of the
Onn: b
Pyx = (dQxa/dt) = Qxx- (6.88)
102 Advanced Molecular Quantum Mechanics
G= 2% Gao, (6.95)
where
O 5 |= 7(k/w, (Pa + RO?)
(k/, Hy. (6.96)
The Classical Theory of Electromagnetic Fields 103
Bibliography
The classical expression for the energy (or Hamiltonian) of a particle with mass
m and charge g moving under the influence of electromagnetic potentials A
and @ is (Equation 5.11):
w = p—a@A. (7.2)
The usual prescription for the transition from a classical equation to a quan-
tum mechanical equation is to interpret the energy and the canonical momen-
tum as operators:
E = ih(a/dt); =p = —ifV, (7.3)
104
Relativistic Wave Equations 105
Except for the mc? term this is just the usual time-dependent Schrédinger
equation for a charged particle in an electromagnetic field.
The presence of mc? is merely due to the use of a different origin for the
energy scale. It may be removed formally by introducing a phase factor into
the wave function:
y—w' = Wexp(—imc’t/h). Ca)
The left-hand side of Equation 7.6 then becomes:
Thus the energy origin has been shifted by changing the phase factor of the
wave function; clearly this can have no effect on the physical properties calcu-
lated from the wave function.
106 Advanced Molecular Quantum Mechanics
In the last section it was assumed that the gauge of the electromagnetic
potentials can be chosen in quantum mechanical equations just as it can be in
classical equations. This assumption is now justified.
It is recalled that in a gauge transformation new potentials:
Thustry is merely multiplied on the left by the phase factor in Equation 7.17,
and the same is true of 77W, m*y and so on. If (ihd/dt — q@)W is now exam-
ined, it will be seen that the effect of the transformation is the same:
The wave equation 7.4 has the disadvantages that it is not obviously Lorentz
invariant, and that it is difficult to interpret and use. The first attempt to avoid
these difficulties was tp return to the classical Equation 7.1 and rewrite it as:
HY = ih(dw/dt), (7.25)
the time and space variables may be separated by the substitution:
where & is an arbitrary constant. The probability density y*y is thus propor-
tional to x*x and is time independent. Integration of y*w over all space shows
that a particle described by the wave Equation 7.25 has a constant probability
of being somewhere in space.
This is no longer true for a wave equation, such as the Klein-Gordon equa-
tion, in which the second derivative with respect to time appears. To demon-
strate this a simpler example is considered:
The variables may again be separated by the substitution of Equation 7.26, but
the time-dependent part 9(f) now satisfies the equation:
so that y*w is in general time dependent and so is its integral over all space.
Consequently, the probability of a particle described by the Klein-Gordon
equation being somewhere in space varies with time.
In actuality many of the problems associated with the Klein-Gordon equa-
tion can be overcome, and it has been used to describe certain fundamental
particles, such as mesons, which can decay. In this connection it can also be
noted that the wave equation 6.59 for the electromagnetic field is just the
Klein-Gordon equation for a particle with zero mass and charge. Of course, if
a wave equation is to describe a decaying particle, some of the basic ideas of
quantum mechanics must be reformulated. For example, although the Klein-
Gordon equation is not consistent with the conservation of the number of
110 Advanced Molecular Quantum Mechanics
The last chapter was devoted to showing that direct quantization of classical
equations does not lead to a wave equation for the electron. We will now dis-
cuss the Dirac equation for the electron. Although much of this chapter will
be concerned with its justification, it will be assumed from the outset that it
is indeed appropriate to the electron to the extent that we will refer to an elec-
tron with charge — e, where e is positive, rather than a particle with an arbit-
rary charge qg; in addition, m will now be taken to be the mass of the electron.
Let us briefly summarize the problem that faced Dirac. For a particle with
mass m and charge — e in the presence of electromagnetic potentials (A, ¢) the
classical relativistic expression for the energy is:
111
4 Advanced Molecular Quantum Mechanics
the substitutions 8.3 and the introduction of a wave function then gives the
Klein-Gordon equation. This too has a disadvantage in that the second deriva-
tive with respect to time appears with the consequence that the probability
that the particle is somewhere in space is time dependent. In addition, the
Klein-Gordon equation cannot account for the property of spin, so that, al-
though it might be appropriate for other particles, it does not apply to the
electron. It was this unsatisfactory situation that led Dirac to consider an
alternative equation.
In the Klein-Gordon equation the time and space coordinates appear symmetri-
cally, but as second derivatives. Dirac argued that it is necessary for the time
coordinate to appear as a first derivative and, in order to preserve the relativ-
istic equivalence of space and time, it is necessary for the space coordinates to
appear as first derivatives as well. This would also be in contrast to the Schro-
dinger equation in which the time derivative is first order, but the space deriva-
tives are second order. To simplify the ensuing arguments we consider a free
electron, that is one in the absence of electromagnetic potentials. In addition,
the notation can be simplified by introducing the operator Po such that:
(The subscript 0 is used rather than 4, because po is not the time component
of the four-vector py = — ih(0/0r,,), although they are related by p4= ipo).
The equation that Dirac proposed has the form:
or in vector notation:
[Po—a.p —Bmc]y = 0, (8.6)
where the nature of @ and B needs to be considered more closely. In Equation
8.6 time and space coordinates do appear symmetrically, and one might expect
this equation to be invariant to Lorentz transformations, a necessary require-
ment of a relativistic equation; that the equation is indeed Lorentz invariant
will be confirmed later.
Since Equation 8.6 is to be an equation linear in space and time derivatives,
aand 6B must be independent of p and po. Also, we are considering the situation
where there are no electromagnetic fields so that all points in space-time are
equivalent and the operator [9 —a.p — Bmc] cannot involve x, y, z or f, since
this would introduce space-time inhomogeneity and anisotropy, although this
The Dirac Equation 113
operator can (and does) contain first derivatives with respect to these coordi-
nates; consequently @ and £ are also independent of x, y, z and t. To obtain
further information about « and 6 we return to the Klein-Gordon equation,
which for a free particle can be written in our current notation as:
a =a, = 02 =f = 1;
(0,0 + GyQ,) = (Aya, +a,0,) = (4,0, +4,0,) = 0; (8.10)
( } F =< (/ 0
0, = Se Oyu = ~~ 6, = (8.13)
tL 6 eS i8) 0 -1
XY + YX = Q'AQQ 'BQ+Q'BQQ'AQ
= Q'(AB+ BA)Q = 0. (8.15)
Similarly, if A? equals the unit matrix, then so does X*. This fact, combined
with the fact that X has been chosen to be diagonal, means that the diagonal
elements of X must be either + 1 or— 1.
The next step in our proof is to show that the trace (the sum of the diag-
onal elements) of the matrix X vanishes. To do this we use the fact that for
two matrices R and S the trace of the product matrix RS is the same as the
trace of the product matrix SR; this can easily be proved by expanding
these products in terms of the elements of R and S (see Equation 2.108). Thus,
the product matrix XY” has the same trace as YXY, and also the same trace as
X, since Y? is required to be the unit matrix:
[0,014.05 (8.20)
[e,.0] = [Bo] = 0.
The Dirac Equation : 117
For example:
or again:
A.Qo + Aa, = PO,B + Boo, = 0,(of8+Bp) = 0. (8.22)
When a matrix representation of the a; has been found, an equivalent repre-
sentation can be produced merely by performing the same similarity transform-
ation on each of the matrices in our original representation. However, the
representation can be fixed by stipulating that our representation is such that
6 is diagonal. In the consideration of general matrices it was seen that, for
the square of a diagonal matrix to equal unity, its diagonal elements can only
be + 1 or — 1 and that, for it to anticommute with other matrices, its trace
must be zero. Thus, 8 must have the same number of elements equal to + 1 as
equal to — 1. The fact that 6 commutes with o,, 0, and o, means that it can
only be:
RANOe FORO
OL ome
= (8.23)
Daly 40s ao
0 0 0-1
or of course minus this; we choose it to be as shown in Equation 8.23.
It remains to find the matrix p, which is to anticommute with 6 and to
commute with o,, 0, and o,, all of which are now known explicitly. From its
anticommutation property with B it can only have the form:
0 0 a b
0 0 Cc d
p= (8.24)
e if 0 0
g h 0 0
Commutation with o, demands that b, c, f and g are all zero, while commu-
tation with 0, (or o,) necessitates that d = a and h = e. Finally, for p’ to be
the unit matrix ge = 1 and making the choicea =e = 1:
ie AO Pew)
O wrOucSONM I
ne (8.25)
jie Sees 0)
Oe die 0 >-0
118 Advanced Molecular Quantum Mechanics
Substituting Equations 8.25, 8.23 and 8.18 into Equations 8.19 now gives us a
set of four four-by-four matrices representing the a;:
Om. OF Osakt 0 0 O -i
Oat Oe. sia traptigen 1a te
We leita 00g ee ee ee OC ne
NRO Ome 0 ae Oe 20
(8.26)
(Veep Reger Herow mig hes
0 0 O-1 gms hee Re:
oo . al aaOREO ge a eon Pow aie ae
Ot 0 0 Ne oy Mos 4
and as would be expected they are all Hermitian.
Although we do not prove it here, this is, to within a similarity transforma-
tion, the only four-by-four matrix representation of the a;. The particular
representation chosen here is called the standard representation. However, it
should be noted that as far as determining eigenvalues is concerned it does not
matter which representation is used, since, although the operator in the Dirac
equation:
[Po —a.p— Bmcly = 0 (8.27)
i a; = Q'a;Q, (8.28)
multiplied by a four-by-four matrix Q”!. Also the total Dirac operator can be
expressed as a four-by-four matrix, and this operates on W. This is only mean-
ingful if y is represented by a four-component column vector or spinor:
V1
pele (8.31)
se
Wa
and if Po is thought to be multiplied by a four-by-four unit matrix. Thus,
when expanded fully, the Dirac equation consists of four coupled equations
in the four components of yw:
From the considerations in Section 7.2 it is at once apparent that this equation
is gauge invariant, since it contains only the operators (p + eA) and (iA(0/dt)
+ e¢). A change in gauge merely multiplies the wave function by a phase fac-
tor, exp (— ief/h), where f is the scalar function specifying the gauge trans-
formation. In the special case of the time-independent problem Equation 8.34
reduces to:
[6mc? + ca.nr—edly = Ey (8.35)
and the operator on the left-hand side of this equation may be referred to as
the Dirac Hamiltonian.
The Dirac equation has now been developed, but it has yet to be shown that
it is appropriate to the electron. It has not been derived rigorously, nor can it
be; it came about because of an insistence on a linear equation in Po, D,,, Py and
pz, and a consideration of the consequences. Its justification is the same as
that for the Schrodinger equation — it works. Later in this chapter the Lorentz
invariance of the Dirac equation will be demonstrated and the nature of the
negative energy solutions will be considered. However, we will first show that
the equation is appropriate to the electron, in that it leads to the concept of
spin, since any further discussion would be pointless if this were not so.
Later in this chapter the relationship between the Dirac equation and the non-
relativistic formulation of quantum mechanics will be considered in detail.
Here, we will content ourselves with showing that, when the Dirac equation is
reduced to a non-relativistic form, two new properties arise naturally. These
properties are the intrinsic magnetic moment and the intrinsic angular momen-
tum, that are associated with spin.
The time-independent Dirac equation 8.35 can be written as:
Hy = By, (8.36)
The Dirac Equation Pel
It is tempting to say that the second term in this equation is zero, since it in-
volves the vector product of rm with itself, but this is not true. In considering
(mtr) the explicit form ofm (Equations 8.33) must be used:
Although the first and last terms in this equation do vanish, the middle term
does not, since p and A do not commute; this term may be simplified further,
but in doing so it must be remembered that it is an operator and must always
be thought of as operating on a wave function, even though the wave function
is not included explicitly. Thus:
12 Advanced Molecular Quantum Mechanics
a > a II e[p,pAtA
yp]
i — mc +o. (8.46)
where J’ is small compared with mc’. In expanding the left-hand side of Equa-
tion 8.45 it can then be assumed that H’*/c? is negligible as are other terms
involving c:
(H + e¢)?/c? ~ mc? + 2mK' + 2me?. (8.47)
Combining Equations 8.45 and 8.47 gives finally:
ihcp(o,Pp)i, (8.54)
where again Equations 8.19 and the properties of p (Equations 8.20) have
been used. Now the commutation properties of the components of o are
(Equation 2.41):
[o;,0;] = 2i€; jn0p; (8.56)
= —2icp(opPii (8.57)
showing that o does not commute with the Dirac Hamiltonian either. However,
comparison of Equations 8.54 and 8.57 shows that (Al + sh o) does commute
with #:
[Al, + $ho;, H] = 0, (8.58)
and is thus anew constant of motion. If we regard this as the total angular mo-
mentum, then in addition to the orbital angular momentum Al there is an in-
trinsic angular momentum sho and by identifying o with 2s this intrinsic
angular momentum of As is just what is assumed in non-relativistic theory (see
Chapter 1).
The Dirac equation thus leads to an intrinsic magnetic moment of — 2ugs
and an intrinsic angular momentum of hs. The ratio of these gives a g factor of
2. These results agree with the assumed electron properties that are introduced
phenomonologically into the non-relativistic theory of the electron. It would
appear that the Dirac equation is indeed an appropriate equation for the elec-
tron. However, we still have to verify the Lorentz invariance of the equation
and examine the negative energy solutions more closely before its suitability
is fully confirmed.
It is often said that electron spin is a consequence of special relativity, since
it was first explained by the relativistic theory of Dirac. However, it is only
fair to say that there is another point of view. The non-relativistic Hamiltonian
(Equation 7.10) for a particle contains the operator 17/2m. Replacement of
this by (o.1)’/2m, where the components of o are the Pauli spin matrices, also
leads to an intrinsic magnetic moment of the right magnitude, as may be seen
from Equations 8.39 to 8.45. The same method may be used to develop a rela-
tivistic theory, the result being the Dirac equation; this approach is described
in, for example, the book by Sakurai (see bibliography).
The Dirac Equation M25
Finally, it should be pointed out that the present section has itself raised a
problem in that | and s, which commute with the non-relativistic Hamiltonian,
do not commute with the Dirac Hamiltonian; the reasons for this will be dis-
cussed in Section 8.10.
Although the Dirac equation was constructed so that time and space coordi-
nates appeared symmetrically, it is not at all obvious from Equation 8.34 that
itisin fact Lorentz invariant. It need hardly be said that for the Dirac equation
to be an acceptable relativistic equation it must be Lorentz invariant.
In most advanced texts the invariance with respect to a general Lorentz
transformation is demonstrated. Here we will content ourselves with the
specific transformation considered in Section 4.3. That is, we consider two
inertial frames K and K’, for which the axes x and x’ coincide and the axes
y and z are parallel to y’ and z’ respectively; the velocity of the K’ frame is v
relative to the K frame. A general Lorentz transformation can be reduced to
this specific one merely be redefining the origins and the orientation of the
axes within the two frames.
It will be recalled that the coordinates, x, y, z and t, in the K frame are re-
lated to x’, y’, z’ and t', the coordinates in the K’ frame, by Equations 4.18:
eos COSC
ier si 0. eas
(8.60)
7A a A t = t' cos @ —i(x'/c) sin @,
A. =
II A, cos) —1(6 Jc) sin @; A, = Aj;
(8.62)
Am ¢ = ¢' cos
0 —icA, sin 0.
The operators 0/074 have also been shown to transform as the components of
a four-vector (Equation 4.51), so that, recalling the definition of po (Equation
126 Advanced Molecular Quantum Mechanics
and W*w does indeed transform like the coordinate t, while comparison with
Equations 8.60 shows that it is W*ca, that transforms like the space coordi-
nate x; it willbeseen in Section 8.10 that cais the electron’s velocity operator,
so that w*cay is indeed the probability current density.
Finally, it must be demonstrated that the electron’s charge is conserved,
that is the change in the charge density in an infinitesimal volume element
must be compensated by a flow of charge across the faces of the volume
128 Advanced Molecular Quantum Mechanics
In the present connection the charge density p and the current density jboth
differ by a factor —e from the probability density y*y and the probability
current density *caw, respectively, so that Equation 8.72 may be rewritten as:
aW*W)/at
+ V.Wy*cay) = 0. (8.73)
That the relationship is obeyed may be demonstrated by multiplying the Dirac
equation 8.34 on the left by y*:
W*[Gh/c)(OW
/dt) + (e/c)dy] — Ya. [—ihVY + eAY] —Bmep"y = 0.(8.74)
The Hermitian conjugate of this equation is:
where use has been made of the fact that the components of « are Hermitian
so that (Wa) =aw. By subtracting Equation 8.75 from Equation 8.74 and
dividing by (ih/c):
which is merely an expanded form of Equation 8.73. Thus, the Dirac equation
is consistent with the conservation of charge.
We have already seen that the Dirac equation for a free electron has four solu-
tions, which is twice as many as is needed to explain spin. This multiplicity of
solutions arose because half the solutions have negative energies. In fact, this
happens in any relativistic theory, even classical ones, since the energy always
involves a square root (see for example Equation 8.1), which can have both
positive and negative values. Classically the positive square root is taken, the
presence of negative solutions causing no embarrassment. This is because classi-
cally the energy must vary continuously, and energies between + mc? and
— mc’ are not permitted, so that, if the energy is positive at one time it cannot
become negative at a later time. However, quantum mechanically discontinuous
transitions can occur so that the negative energy solutions cannot be ignored.
The solutions of the Dirac equation are represented by a wave function
which is a four-component vector or spinor:
The Dirac Equation 129
y= (8.77)
ieee.
(Os AO
Ota, wee Oas0
oS (8.78)
O08 10
0 0 OSI
in the matrix representation that we are using, Y,; and W2 may be thought of
as positive energy components and 3 and wW, as negative energy components.
For solutions with positive energy the main components will be Y, and W2, but
small amounts of Y3 and 4 will be mixed in by the terms a.m which couple
positive and negative energy components. In this section the relationship be-
tween the positive energy solutions and the negative energy solutions will be
examined, and it will be shown that if the positive energy solutions correspond
to a particle with electric charge — e, then the negative energy solutions may
be thought of as being for a particle with the same mass, but with charge + e.
The Dirac equation for a particle with charge — e:
Yo 1b a. (8.81)
Using the anticommutation relations (Equation 8.11) of B and the compo-
nents of a, together with the fact that af = —a,, the following properties of
y may be demonstrated:
[Po
— (e/c)d —«.(p —eA) —Brnc]yp* = 0. (8.83)
Comparison of this equation with Equation 8.79 shows that, if W is a solution
of the Dirac equation, then yw* is a solution of the same equation, except
that the sign of the charge has been reversed. That is, if y is a solution of the
Dirac equation for a particle with charge — e subject to electromagnetic poten-
tials, then yw* is a solution of the Dirac equation for a particle of charge + e
subject to the same electromagnetic potentials. For this reason ¥ is referred to
as the charge conjugation operator.
It can easily be shown that if w is a positive energy solution then yw™ is
a negative energy solution, since from Equations 8.26, 8.77 and 8.81:
special cases (see Section 8.10). In both methods the Hamiltonian is obtained
as a power series, which converges for electrons moving with velocities much
less than the speed of light. Thus there are a number of problems shared by
the two methods, the choice of expansion parameter for the power series, the
estimation of the size of the various terms in the series and the decision on
how far to take the expansion. The remainder of this section will be devoted
to these questions.
A number of quantities have been used as the power series expansion par-
ameter, c', (u/c) and a, where wu is the speed of the electron and a is the
fine structure constant (not to be confused with the Dirac operator @ or its
components). The quantity c”' is frequently used, but it is not really suitable,
since it is not dimensionless and consequently its value depends on the system
of units employed. Both (u/c) and @ are dimensionless and we will shortly see
that they are in a sense equivalent. However, we will make the more professional
choice of the fine structure constant a, which is defined by:
y = a (8.91)
p_
1 W3
Y= [ : y= | ; (8.92)
WY Wa
[Po + (e/c)¢]W.—(o.my_ = 0
(8.93)
[Do + (e/c)¢]W_-—(o-m W, = 0).
+ 2mew_
These equations are coupled and each involves two-by-two matrices and two-
component wave functions.
The second of these coupled equations can be used to express the small
components W_ in terms of the large components W,:
This may be expanded by using Equation 8.100 and Equation 8.102 itself,
together with the identity:
which is obtained with the help of Equation 5.21. To the order of accuracy
that we require the result is:
Ky, = [1 —(em)*/4m’c?
+ O(a*)]y,. (8.104)
Equations 8.100, 8.103 and 8.104 may now be used to expand the Hamil-
136 Advanced Molecular Quantum Mechanics
N = [1+ @mK?(o.1)/4m?c?]?
= 14+ (0.m?/8m?c?
+ O(a’). (8.109)
The wave function W), satisfies the time-dependent Schrodinger equation
8.97, but as we have seen the Hamiltonian H is not Hermitian. The new norma-
lized wave function x, satisfies a similar equation:
HK’ = N#N~'—iAN[d/dt,
N“]. (8.112)
To obtain the final Hamiltonian for the positive energy states it only re-
mains to substitute for V (Equation 8.109) and H (Equation 8.105) in Equa-
tion 8.112 and to perform some manipulations. Since N commutes with (o:7)
and the result is only required to be accurate to order mc’a*, we may write:
In this section the positive and negative energy states are decoupled using a
method due to Foldy and Wouthuysen. The non-relativistic Hamiltonian is ob-
tained by a series of unitary transformations, by which the coupling terms are
progressively eliminated. However, before the details of the transformation
are considered it is necessary to look more closely at the properties of the
coupling terms.
138 Advanced Molecular Quantum Mechanics
KH = bmc?—ed+ca.r
= Bmc?+&+0. (8.117)
The symbol & (= — e) represents an even operator, that is one that has no
matrix elements between positive and negative energy components, while ©
(= cam) is an odd operator having only matrix elements between positive and
negative energy components. The term Bmc? is an even operator, but it is ex-
pedient to keep it separate from the other even operator (— e¢@), since they are
of different orders of magnitude. This is also true of the two parts of ca.n, but
keeping them together ensures the gauge invariance of any resulting Hamil-
tonian.
Whether an operator is even or odd is determined by the matrix involved in
its representation. Operators involving 6 or the unit matrix, which are both
diagonal, are even (&), while those involving a,, a, or a, (Equations 8.26) are
odd ((). Other matrices encountered in this chapter can also be classified as
even or odd, o,,, 0, and o, being even and p being odd. The conditions for an
operator to be even or odd can be expressed more formally; an even operator
must commute with 6:
[&,6] = 0, (8.118)
while an odd operator must anticommute with £:
S = —iBO/2mc?, (8.125)
where the facts that © anticommutes with 6 (Equation 8.119) and 6? = 1 have
been used. Thus, with this choice of S part of the first commutator cancels the
odd operator appearing in H, the first term of Equation 8.124. Complete sub-
stitution for S and H in Equation 8.124 now gives the transformed Hamiltonian;
in doing this the expressions can be simplified so that 6 only appears linearly
and only terms up to order mca“ are retained, the order being determined on
the basis & ~ mc?a* and © ~ mc’a. The result may be written as:
HK, = Bme?+8&,+O,, (8.127)
where
&, = & + BO’/2mc*— [O, [iha/at—&,O] /8m?c*
— BO*/8mFc§ + O(mc?a°) (8.128)
and
©, = Bliha/at — &,0] /2mc? —O7/3m?c* + O(me’a5). (8.129)
It can be seen that the explicit operators in ©,, the odd part of the transformed
Hamiltonian, are both of order mc’a°. Thus, the odd operator © in the Dirac
equation, which is of order mc’a, has been removed by the transformation at
the expense of introducing odd operators of order mc?a? and higher.
This process may be repeated to remove the new odd operators. That is by
using:
S, = —i8O,/2mce? (8.130)
another transformation may be performed on Equation 8.127 to remove the
odd operator© ;. The resulting Hamiltonian is:
KH, = Bmc?
+ &.+0>, (8.131)
where
& = & + BO?/2mc? — [O, [ihd/dt— &,O]/8m?c4
— BO*/8m3c® + O(mc?a®) (8.132)
and
O2 = O(mc?a*). (8.133)
Only two applications of a unitary transformation have been necessary to re-
duce the Hamiltonian to one in which the largest odd terms are of order me’a°,
and these may be removed by a further unitary transformation to give the
Hamiltonian #3, which is correct to order mc*a‘. This reduced Hamiltonian
H3 operates on the transformed wave function w”. In the subsequent dis-
cussion, the primes and subscripts will be dropped, since it will always be ob-
vious when the Hamiltonian is a non-relativistic approximation to the Dirac
Hamiltonian. However, it should be noted that the wave function appropriate
The Dirac Equation 141
Hy = ih(dy/dr), (8.134)
where
H = Bmc? + & + BO?/2mc? — [O, [ihd/at — &,O] ]/8m2c*
— BO*/8mFc® + O(me7a°). (8.135)
It is now necessary to substitute € =— eg and © = c(am) into Equation
8.135. The first term for which the substitution is not trivial contains
©?, but
we saw earlier (Equations 8.39 to 8.45) that:
= —iheca, (8.139)
where the electric field E is given by Equation 5.21; this result is similar to
that quoted in the last section (Equation 8.103). Finally:
[O, [in0/at
— &,O]] = —ihec?[(a-m), (a.E)] = — ihec?[(6-m), (0 E)]
= — ihec?[w.E—E.n+io(m,E—E ,t)|
KH = mc?—eot+n?/2m (a)
+ (eh/m)(s.B) (b)
— (eh/4m?c?)s.(r , E—E yt) (c)
(8.142)
— (1/8m3c?)n* — (eh/2m3c?)(s.B)n* — (d)
+ (eh?/8m?c*)(V.E) (e)
+ O(mc?a°).
The appropriate Hamiltonian for the negative energy states may be obtained
from Equation 8.141 by replacing B by — 1.
The Hamiltonian 8.142 for the positive energy states could have been de-
rived using the method of small components. Indeed, the Hamiltonian obtained
by substitution of Equations 8.136 and 8.140 into Equation 8.116 of the
previous section only differs from Equation 8.142 by the rest energy
mc’, but it will be recalled that in the method of small components only the
positive energy states were considered and the energy zero was adjusted. In
fact, the method of small components can also be used to obtain the non-
relativistic Hamiltonian for the negative energy states and there is no reason
to suppose that this method does not give the same results as the Foldy-
Wouthuysen transformation to any order of accuracy; this has been demon-
strated explicitly for the special case of the free electron, for which the posi-
tive and negative energy states can be completely decoupled.
The Dirac Equation 143
where /ik and £ are the eigenvalues of p and KH (= ihd/dt), respectively, and
u(k) is a four-component vector, which is independent of the position r. Sub-
stitution in Equation 8.143 gives:
which is a set of four coupled equations for the components of u(k). Since F
and the components of k are numbers, Equation 8.146 represents an ordinary
eigenvalue problem; by expanding the determinant of the four-by-four matrix
the eigenvalues are found to be:
this fact was referred to in Section 8.5. Thus the operator appropriate to a
component of the velocity r is proportional to the corresponding component
of a. Now the square of a component of a is equal to unity, so that the eigen-
values of a component of « can only be + 1. This suggests that any determi-
nation of the component of velocity of an electron must give + c, in contrast
to the non-relativistic result that the velocity operator is p/m. This result fur-
ther suggests that the instantaneous speed of the electron can only be 3c, but
the components of a do not commute with one another, so that two compo-
nents of the velocity cannot be precisely measured simultaneously. In addition,
it might be thought that the result that an instantaneous component of velocity
can only be +c is in contradiction to reality, but it should be remembered
that any experimental determination must involve an average over an appreci-
able time interval. Thus, if the electron’s velocity is oscillating very rapidly
about a mean corresponding to the observed value, an instantaneous component
of velocity can still be + c.
This suggestion can be substantiated by considering how the velocity oper-
ator varies with time:
where the anticommutation properties (Equation 8.11) have been used. Now
for a free particle H and p are independent of time, so that Equation 8.149
may be treated as a differential equation in a and solved provided the order of
H and «are not changed. Solution of this equation gives:
O = c(a.p);
from Equation 8.125. The presence of the operator p”' should not cause con-
cern, since it will only appear in expressions where it is multiplied by positive
odd powers of p; it isintroduced so that abbreviated notation may be used for
power series in the following manipulations. It is now apparent that the unitary
transformations may be “summed” and applied all at once, the total unitary
operator being:
an [exp (iS,,)] = exp [2 iss): (8.159)
This equality is only true because the operators S,, commute with one another;
if they did not, ambiguities would arise about the order of the operators when
the exponential term was expanded as a power series, and this is one of the
reasons why this technique cannot be used for the case of an electron in a
general electromagnetic field. Thus, the complete removal of odd operators
may be accomplished with one unitary transformation defined by the operator:
KH =
II (cos f+ RETR sin f)(6mc? + c(a.p))
B(mc? cos
f + ep sinf)+ mc*(«.p)p"'((p/mc) cos f — sin f).
(8.166)
It only remains to express sinfand cos fin terms of (p/mc). From Equation
8.161, tan f = (p/mc) so that, using trigonometric relations:
All this manipulation to obtain a result we already had may seem rather
unnecessary. However, the important thing to note is that when the positive
energy states are decoupled by a unitary transformation (Equation 8.153), all
operators must be subjected to the same transformation, not just the Hamil-
tonian. By determining how operators transform, we will be able to see how
operators before and after the transformation are related to one another; the
description of the electron before the transformation may be referred to as
the Dirac representation, and that after the transformation as the Foldy-
Wouthuysen representation. For example, the momentum operator p is the
same in both representations, since it commutes with S’, but the position
operator rin the Diracrepresentation has for its operator in the Foldy-Wouthuy-
sen representation r’, which is given by:
[S’,r] = —3hmc*(E,p)
*B(a.p)p + zhp *fB[(@.p)p — pa], (8.170)
where
E, = + [mc*+ c*p?]?. (8.171)
Substitution of Equation 8.170 into Equation 8.169 and evaluation of the
commutators leads to an expression, which may be split up into a number of
summable power series, giving:
r = r—hihck;
Bat sihc*E, (E, + mc’) '[email protected])p
— tne EE, + me’)(o pp). (8.173)
Once again the appearance of reciprocal operators causes no problems, since
they are merely a shorthand notation for the corresponding power series.
Equation 8.173 shows that the position operator r in the Dirac representation
has, as its operator in the Foldy-Wouthuysen representation, r plus three other
terms, which depend on the momentum operator, thus indicating that the
transformation is a non-local one; that is a point in the Dirac representation
150 Advanced Molecular Quantum Mechanics
do not commute separately with the Dirac Hamiltonian; that is, they are not
physical observables, although they are in the non-relativistic Schrodinger
treatment. However, the operators (R , p) and = which are given by:
highly oscillatory motion that has a frequency of order 2mc?/h and an ampli-
tude of order A/mc. If we consider an electron moving in a circle of radius
h/2mc with a frequency of 2mc?/h, its velocity must be just c in agreement
with the earlier assertions that the instantaneous value of a component of the
electron’s velocity can only be +c. The angular momentum associated with
this motion is just the product of the linear momentum me and the radius of
the circle A/2mc, so that the intrinsic or spin angular momentum of the
electron is /2 in agreement with experiment. Unfortunately this model cannot
be correct, since, although the electric current associated with this motion
gives rise to an intrinsic magnetic moment, the predicted g factor is unity as
in most classical models.
Except for the Darwin term (Equation 8.142e) all the terms in the non-
relativistic approximation to the Dirac Hamiltonian (Equation 8.142) can now
be understood, since they are related to electron spin or have classical relativ-
istic counterparts. However, the Darwin term may be understood in terms of
the Zitterbewegung, since it is just a correction to the electrostatic interaction
between the electron and the electric potential to allow for the smearing out
of the electron’s charge by the Zitterbewegung.
The position operator r in the Dirac representation has as its operator in the
Foldy-Wouthuysen representation the mean position r plus the oscillatory
term, which will be given the symbol 8. Corresponding to the electrostatic
interaction term — e¢(r) in the Dirac representation is the term — eg(r +8) in
the Foldy-Wouthuysen representation, and this must be averaged over the
Zitterbewegung. Now $(r +8) may be expanded as a Taylor series in § about r:
5° = —(h/2mc)*(Ba).(Ba)
= (h/2mc)?B*a* = 3(h/2mc)’, (8.188)
since B* = 1 and a? = 3 from Equations 8.10.
Bibliography
Although we now have a suitable wave equation for one electron in the
presence of external electromagnetic fields, we are not yet in a position to
write down the Hamiltonian for a general molecule. In this chapter we move
towards this goal by first considering the problem of just two electrons before
generalizing the theory to the case of many electrons.
In fact, a correct relativistic wave equation for two electrons cannot be
written down in closed form, but it is possible to obtain an approximate Hamil-
tonian provided the kinetic energy of the electrons is very much less than their
rest energy, mc’; this is just the situation that pertains in molecules.
There are a number of possible approaches to this problem of two electrons.
One way is to start with the non-relativistic approximation of the Dirac Hamil-
tonian for each electron. The interaction between the two electrons then arises,
since each contributes to the electromagnetic potentials experienced by the
other. Although this method has a number of associated difficulties, it does
give a physical picture of the origin of the individual terms in the final Hamil-
tonian. A more sophisticated approach is to seek an approximate wave equation
for two electrons, which is the counterpart of the Dirac equation for one elec-
tron. Such an equation is the Breit equation, but to be useful it has to be re-
duced to a non-relativistic form.
Both these approaches give the same non-relativistic approximation to the
Hamiltonian for two electrons, although the policies adopted to circumvent
problems in the first method are chosen to ensure this agreement. Of course,
the most satisfactory approach is to use quantum electrodynamics, but we
content ourselves with noting that the Hamiltonian derived here is the same
as that obtained by Itoh. Even so, it is necessary to include certain radiative
corrections in a phenomenological manner.
155
156 Advanced Molecular Quantum Mechanics
expression for the vector potential is important. The choice may be guided by
insisting that the resulting Hamiltonian for two electrons be Hermitian and
symmetric in the two electrons. By using the order given above these conditions
will be satisfied, but the real justification is that the final Hamiltonian is the
same as that obtained by more sophisticated methods.
The charge of the electron is responsible for the potentials (Equations 9.2),
but the spin magnetic moment of the electron also provides a contribution to
the vector potential and this is found by considering a point magnetic moment,
although it is not clear that this is permissible. As our model we take a slowly
moving particle of charge g and mass m moving in a circle about the origin;
the associated magnetic moment is (see Section 7.1):
and this may be expanded as a power series in a about r by noting that for any
function f(r — a):
Sita fit) a. Vit) a (9.5)
1/|r—al reaVELir)
he
Il Urania (9.6)
where use has been made of the facts that u = da/dt and r is a constant vector.
Equation 9.7 gives the instantaneous potential at position r. When the aver-
age is taken the first term vanishes, since it is the time derivative of a periodic
158 Advanced Molecular Quantum Mechanics
function; this may easily be checked explicitly for the model employed here.
The remainder of Equation 9.7 may be written in terms of w by noting that
from Equation 2.38:
(a,u),r = u(a.r)— a(un), (9.8)
A = (1/4necc*)(a,nirt.... (9.9)
When a is allowed to tend to zero, the vector potential at a position r relative
to a constant magnetic moment yp is obtained:
In the case of the electron the spin magnetic moment is (Equation 8.49)
u = — (eh/m)s, so that the associated vector potential is:
By Fleas (9.13)
The Wave Equation for Many Electrons bog
is introduced; it is noted that rin Equation 9.11 is just r;;, but that rin Equa-
tions 9.2 is —r,;. In addition, the subscripts are added to the charges and
masses ofthe electrons, so that the origin of the terms in the final Hamiltonian
may be seen more easily. If A and ¢ are now the external potentials:
and
$; = ¢—e;/4n€or;; + O(mc’a‘/e); (9.15)
these potentials satisfy the Coulomb gauge. Although the highest neglected
terms in the expression for the vector potential are formally of order mca‘/e,
we have only taken Equation 9.14 to be correct to the required accuracy of
mca?/e, because of the uncertainties involved in this derivation of the Hamil-
tonian. Similarly the scalar potential (Equation 9.15) has only been assumed
to be correct to order mc’a4/e.
— (1/8mjec?)\n}—(eh/2mic*)\(s;.B;)n; (4)
+ (e;h?/8mjc?)(V.E;) (e)
+ O(mc’a®)
and the wave function y has two components.
For two electrons the Hamiltonian can be written as:
H=H,+H, = (2 ts (9.17)
resulting terms, so that in the final Hamiltonian any potentials that appear are
just the external potentials. However, at this stage a further difficulty appears,
since for any effect that one electron has on the other there will be a recipro-
cal effect, and a decision has to be made as to which interactions should be
counted twice and which should not. That some interactions should only be
counted once, may be seen by referring to Section 5.9. There the Lagrangian
(Equation 5.76) for one charged particle contains terms arising from the inter-
action of its charge with the potentials due to the charge of another particle.
However, in the Lagrangian (Equation 5.77) for both particles together these
same terms appear once, not twice, since the interaction is symmetric between
the two particles. If the Lagrangians for the individual particles had been added
together a correction factor of ;would have been needed in these interaction
terms. Similar remarks apply to the corresponding Hamiltonians. In deciding
which interaction terms should only be counted once (or which should have
correction factors of +), we may be guided by their symmetry with respect to
interchange of the two particles, but the ultimate decision will be made so that
agreement with other methods is achieved.
For simplicity it is supposed that the external potentials, A and @, vanish,
since they can be reintroduced later by the usual replacement:
The influence of the scalar potentials due to the electrons is discussed first.
The term — e;¢; in 9.16a makes a contribution:
since f2; = —fy2 (Equation 9.13). This expression represents the spin-orbit
interaction, which is the interaction between the spin and orbital motion of
the same electron. It arises from the electric field produced by the other elec-
tron, since by virtue of its motion the electron experiences a magnetic field
which interacts with its spin magnetic moment.
The term 9.16e may be dealt with more easily, since from Equations
OD eand:2:95:
Again it is clear that each of the two possible contributions to the Hamiltonian
is not symmetric between the two electrons and that both must be counted,
so that the term 9.16e is responsible for:
the last term is negligible and only the middle term need be considered further.
The orbital parts of the vector potentials due to electrons 2 and 1 give rise to
similar terms when substituted in H , and H2, respectively. Hence, as is indicated
162 Advanced Molecular Quantum Mechanics
Since it is the spin of 7 but the momentum of / that appears in this term, it is
not symmetric between the two electrons, so that the total contribution to the
Hamiltonian is:
since s; and V commute. The first term is just 475(1;;)s; from Equation 2.95,
while the second term may be evaluated using Equation 2.96:
— (e;h/4n€omjc’)[(87/3)5
(r;;)8; — s,[r}, + 3(s;-xitij/ris]. (9.34)
When the orbital part (9.30) of B; is substituted into 9.16b, two different
terms arise, one for each electron, so that the contribution to the Hamiltonian
is:
This term is the same as 9.29 and provides another contribution to the spin-
other-orbit interaction. In this case the spin of each electron interacts with the
magnetic field produced by the orbital motion of the other, whereas the
term 9.29 is interpreted as being due to the motion of each electron in the
presence of the magnetic field due to the spin of the other. It may be argued
that these two interactions are really one and the same and that only one of
the terms, 9.29 and 9.35, should be counted. This is in fact necessary if agree-
ment is to be obtained with the Hamiltonian obtained from the Breit equation.
Finally, the spin part (9.34) of B; must be substituted into 9.16b and for
each electron an identical term arises, which is consequently only included
once in the final Hamiltonian:
fields are reintroduced by the replacements 9.18, the total Hamiltonian for
two electrons is given by:
+ (€,€h/4me9m
myc’) [(81.82)/ri2 — 3(81-T12)(F 12-82) /Ti2
— (81/3)5(112 (81-82 )]
H —— Hy tH + Hi, (9.38)
cannot be correct, since the wave equation with this Hamiltonian is not even
approximately Lorentz invariant. The correction terms can only be obtained
rigorously by using quantum electrodynamics. In principle they may be ob-
tained to any order of accuracy, but in practice they can only be obtained ap-
proximately. It is beyond the scope of this book to go into the details, but
essentially second-order perturbation theory is used to describe the emission
of a virtual photon by one electron and its absorption by the other and vice
versa; to do this the eigenvalues and eigenfunctions of the approximate Hamil-
tonian (Equation 9.40) are used. The result is an additional interaction term:
— (€ €2/87€
oF12)[(a1 2) + (@1 1 12)(42-0 2)/ria] > (9.41)
which is known as the Breit operator. Because perturbation theory is used and
because of the approximations involved in its derivation, this operator must
only be used for slowly moving electrons and then only in first order. The mat-
rix representation of the operator (Equation 9.41) also involves matrices of
order 16; in this case they are related to direct products of the matrix repre-
sentations of the components of the Dirac operators a, anda.
Although we are not ina position to derive the Breit operator, it can be seen
that it is approximately correct by referring to the classical (non-quantum-
166 Advanced Molecular Quantum Mechanics
mechanical) Hamiltonian for two electrons, that was derived in Section 5.9.
Equation 5.81 shows that the classical interaction Hamiltonian consists of the
Coulomb term together with:
needed for the reduction of the Dirac equation and only an outline of the
method is given. For simplicity the time-independent problem is discussed,
but there is no reason to suppose that the time-dependent Hamiltonian does
not have the same form.
In the Foldy-Wouthuysen reduction (Section 8.9) of the Dirac equation
the operators in the Dirac Hamiltonian are classified as even (&) or odd (©)
depending on whether they have no matrix elements or only matrix elements
between the positive and negative energy components of the wave function.
More formally even operators commute and odd operators anticommute with
the Dirac operator 8. The odd operators are removed progressively by the ap-
plication of a sequence of unitary transformations.
A more complicated classification of operators must be used for two elec-
trons, since the wave function has sixteen components, which cannot be
labelled simply as positive or negative energy components. However, the wave
function can be written as:
Wa4
y= We (9.45)
where, for example, Y,, has four components for which both electrons have
positive energy, while for y,- electron 1 has positive energy but electron 2
has negative energy. We are interested in decoupling Y,, from ,-, Y_, and
W__ to obtain a four-component Hamiltonian for two electrons, which both
have positive energy; this Hamiltonian is required to be correct to order mc7a*.
It will be seen that the Breit Hamiltonian (Equation 9.44) can now be
written:
H = Bym,c? + Bymyc? + (&8) + (80) + O©&)+ (OO). (9.46)
by analogy with Equation 8.117. This is just an extension of the Foldy-
Wouthuysen notation, so that, for example, an even-even operator (&&) such
as B, or B> is even with respect to both electrons 1 and 2. On the other hand
the components of «2 are even-odd operators (&O) and are even and odd with
respect to electrons 1 and 2, respectively. As a further illustration, y,, is only
connected with y_, by an odd-even operator (O&).
These operators may be defined more formally. For an operator to be even
with respect to electron 1 it must commute with B,, but if it anticommutes
with B, it is odd with respect to electron 1. Thus:
168 Advanced Molecular Quantum Mechanics
II II
[((G&), Bi] lI ((&0), 61] lI [(G&), Ba] [(O&),Bo] 0;
(9.47)
[O&), 61]. = (OO), Bi], = [(&O), B21. = [OO), Bel.
It is now easy to show that the product of say an odd-even operator with an
odd-odd operator gives an even-odd operator:
(9.49)
(& ©): O25 > Ary > A, 5
e; 0 (eo Te,) 0
0 e 0 (e =re,)
ae . (9.51)
(e, Flee) 0) =e; (9)
0 (Guat Les) 0 ee
ieee ah Oo 04 venOrhiO
Oat? BONN Oo «Ole Ot a
By = iene , (9.52)
OO Mieeare bleed Clowmed.(Piaiea ts
The Wave Equation for Many Electrons 169
where
a. te. Te.) 0 0
( 2 re.) —e, 0) 0
c— (9.53)
0 0 ez (e, — iey)
0 0 (e,- 1 ie,) cy
These matrix representations satisfy all the relationships they are expected to.
The operators 8; and the components ofa; all anticommute with one another
and the square of each is the unit matrix of order sixteen. In addition, 6, and
the components of @, all commute with 6, and the components of a, as they
must, since they operate on different electrons.
We are now in a position to identify the operators in Equation 9.46:
(O8) = cay.m
~ me’a; Cy
Although Equation 9.62 is the simplest acceptable choice for S, its substi-
tution in Equation 9.57 and the subsequent expansion is quite laborious even
when the commutation relations 9.55 are used; consequently only the result
is given here. Two applications of the transformation are necessary to obtain
a Hamiltonian, which is of the desired type and correct to order mc’a*. The
order of magnitude of the terms is estimated on the basis (Equations 9.54)
that the (&&) and (OO) operators in Equation 9.46 are of order mc7a’, while
the (©) and (O&) operators are both of order mc7a. The result is:
and
+ O(mce’a’),
—(O&)*/8mic> = — (1/8m3c?)4 — (ef/4m3c?)(o,.B, )7?
(9.68)
with similar expressions for the terms involving (0)? and (&0)*. However,
the remaining terms are more complicated, but liberal use of the techniques
of Chapter 2 and relationssuch as Equation 8.40 shows quite straightforwardly
that:
— (€ :€2h7/8€qc7)5(t12)/m}, (9.69)
with a similar expression for the term involving [(&O), [(&O), (&&)] ] except
that the subscripts 1 and 2 are interchanged. With even more labour it may also
be shown that:
De iene
i=1 lnie? — e6; + 1?/2m, + (eh/2m;)(6;.B;)
— (e;h/8m}c?)o;. [m; , E; — E; ,mi] — (1/8mpec?)n?
— (e;h/4mjc?)(o;.B;) 1? + (e;h?/8mjc? (V.E;)]
+ (€1€2/47€o712)
— (87/3)6 (r12)(1.02)]
The Dirac equation for the electron predicts a spin magnetic moment of
—(eh/2m)e and a spin angular momentum of she (Section 8.4). The g factor
of the electron is the ratio of the magnitude of these two quantities measured
in units of (e/2m). Consequently the Dirac equation leads to a g factor for the
electron of exactly 2.
The transition to the notation of non-relativistic quantum theory can be
made by replacing o by 2s. This has been done in Equations 8.142 and 9.37,
which are the non-relativistic approximations of the Dirac and Breit Hamil-
tonians, respectively. Now in these approximate Hamiltonians each of the terms
involving the spin of an electron arises by virtue of its spin magnetic moment;
the spin angular momentum does not appear as such in the Hamiltonian. Thus,
rather than replace o by 2s, we could have replaced the spin magnetic moment
of the electron, —(eh/2m)o, by — g(eh/2m)s, where g is exactly 2. Since
(eh/2m) is just the Bohr magneton pg, this replacement may be written:
When Dirac first put forward his theory, the experimental value of the g
factor was also 2. However, since then experiment has shown that it differs
slightly but significantly from 2. It is now one of the most accurately known
constants and one of the more recent experimental determinations gives it as:
electron’s magnetic moment, but since one is preferred there is a small negative
contribution that accounts for the deviation of the g factor from 2.
When these quantum electrodynamic arguments are expressed quantita-
tively, the correct term is found to be a/7, where a is the familiar fine structure
constant (Equation 8.85). Higher-order corrections have also been calculated,
although a large number of interaction terms have to be considered. One of
the more recent theoretical estimates for the g factor of the electron is:
The coefficient of (a/7)’ is well established, but attention has only recently
been directed to the (a/m)* term. Equation 9.74 gives a value for the g factor
that is in good agreement with the experimental value of Equation 9.73.
A Hamiltonian for two electrons falls short of our goal of a molecular Hamil-
tonian. Here the extension to a many-electron Hamiltonian is considered; the
inclusion of nuclei is discussed in the next chapter.
The two-electron Hamiltonian (Equation 9.71) may readily be extended
to the case of many electrons provided it may be assumed that there are no
interactions that involve three or more particles. This assumption cannot be
correct, since one electron can influence a second by perturbing the motion of
a third. However, the correction terms necessary may be shown to be of the
order of mc’a®. In addition, the work of Itoh, who used quantum electrody-
namics, confirms that to order mc?a* there are no three-body terms.
In writing the many-electron Hamiltonian the subscripts i andj are used to
distinguish electrons. If the opportunity is taken to introduce the radiative
corrections discussed in the previous section by making the replacement 9.72,
the resultant Hamiltonian is:
+ gup(s;-B;) (b)
+ (eh?/8m?c*)(V.E;) (d)
— (1/8m°c?)
1} — (gug/2m?c?)(s;.B;)7? (e)
+ os {e°/8m €or; ; (f)
The Wave Equation for Many Electrons ‘ATI
Since this Hamiltonian involves a summation over j #i, it has been necessary
to introduce a factor of i into (f), (g) and (j), so that these interactions are not
counted twice.
In Equation 9.73 the subscripts have been dropped from the constants m,
e and g, since the origin of each term may be ascertained by referring back to
Equations 9.37 or 9.71. However, for completeness the identification of each
term is repeated here:
(a) an electron’s rest energy, its energy of interaction with the external
electric potential and its kinetic energy, respectively;
(b) the Zeeman term representing the interaction of an electron’s spin mag-
netic moment with the external magnetic field;
(c) the interaction of an electron’s spin magnetic moment with the magnetic
field it experiences by virtue of its motion relative to the external elec-
tric field;
(d) the Darwin term representing the correction to the second term in (a)
due to an electron’s Zitterbewegung;
(e) relativistic corrections to the kinetic energy in (a) and the Zeeman term
(b);
(f) the Coulomb interaction between pairs of electrons;
(g) the retarded orbit-orbit interaction of one electron with the electromag-
netic field due to the relative motion of another;
(h) the spin-orbit term representing the interaction of the spin magnetic
moment of an electron with its own orbital motion due to the electric
field of another electron;
(i) the spin-other-orbit term representing the interaction of the spin mag-
netic moment of one electron with the orbital motion of another;
(j) the spin-spin term comprising a dipole-dipole interaction between the
spin magnetic moments of two electrons together with a Fermi-contact-
type interaction;
178 Advanced Molecular Quantum Mechanics
(k) a correction to the Coulomb term (f) analogous to the Darwin term (qd).
In the Hamiltonian (Equation 9.75) the mechanical momentum 1; =
(p; + eA;) appears in many terms. It is expedient not to expand these terms,
since this ensures that the Hamiltonian is gauge invariant. However, it should
be noted that when, for example, m;/2m is expanded, an orbital Zeeman term
appears together with a contribution to the diamagnetism (Section 7.1).
Bibliography
The derivation of the two-electron Hamiltonian from the Dirac equation
Carrington, A., Levy, D. H., and Miller, T. A. (1970), ‘Electron resonance of
gaseous diatomic molecules’, Adv. Chem. Phys., 18, 149.
McWeeny, R., and Sutcliffe, B. T. (1969), Methods of Molecular Quantum
Mechanics, Academic Press, New York: Appendix 4.
179.
180 Advanced Molecular Quantum Mechanics
complication associated with nuclei which have spins greater than or equal to
1 is that they have quadrupole moments. These are discussed in the next sec-
tion, which is concerned with effects associated with the finite size of the
nucleus; for the present nuclear quadrupole contributions to the Hamiltonian
are merely noted and labelled as Hg.
Associated with a nuclear spin I, is a spin magnetic moment:
Even in the case of the proton, where my = m, and Zy = 1, the nuclearg fac-
tor gq is not 2 and as for the electron it is anomalous. However, the anomaly
is much larger than for the electron and the protong factor is in fact 5-5855
even though its spin is .. In this connection we might note that the magnetic
moment of the neutron, which is of the same magnitude as that of the proton,
is completely anomalous, since the neutron is chargeless and would thus be
expected to have no magnetic moment at all. These anomalous magnetic mo-
ments have been explained in terms of the interaction of the proton and neu-
tron with their virtual meson charge clouds, although meson theory is not in
as satisfactory a state as quantum electrodynamics. Larger nuclei also have
anomalous magnetic moments and some are even negative. Finally, we should
note that in practice the ratio (Z./m,) in 10.2 is absorbed into gy, so that
nuclear magnetic moments are given by gyUyIg (Equation 10.1).
We are now in a position to generalize the many-electron Hamiltonian
(Equation 9.75) to a molecule by assuming that nuclei are Dirac particles with
anomalous magnetic moments and spins. In doing this it is helpful to keep in
mind the interpretation of the terms in Equation 9.75 and to consult Equa-
tions 9.37 or 9.71 to see which particle is responsible for quantities such as
mass and charge in an individual term. The molecular Hamiltonian may then
be written as:
Han Soe (10.3)
where H, is the many-electron Hamiltonian (Equation 9.75) and H,, and Hen
are, respectively, the nuclear Hamiltonian and the electron-nuclear interaction
Hamiltonian.
Written explicitly the nuclear Hamiltonian is:
182 Advanced Molecular Quantum Mechanics
where
TW, = Pa — Zeal Ae: (10.5)
The terms in Equation 10.4 represent, respectively, nuclear rest energy, the
energy of interaction between nuclear charge and the external electric potential,
nuclear kinetic energy, nuclear Zeeman energy and nuclear-nuclear Coulomb
repulsion. The interaction Hamiltonian appears as:
the interaction between molecular rotation and nuclear spin magnetic mo-
ments, that is seen in molecular beam studies. Terms like this have analogues
in the two-electron Hamiltonian and are quite plausible, but their inclusion
must be based on phenomenological arguments.
In this section two additional nuclear effects are mentioned, which may be
attributed to the fact that nuclei are not point charges but have finite extent;
of course, both the angular momentum and the magnetic moment associated
with nuclear spin might also be attributable to this.
Apparently electrons can penetrate nuclei and move about inside them. In
atoms only s electrons have a finite probability of being at the nucleus, and
when such electrons are inside the nucleus the electric potential they experi-
ence is not given by Ze/47€or, but is presumably less than this. Consequently the
energy of an s electron is shifted upwards because of the finite size of the
nucleus, but the magnitude of this effect is dependent on the structure of the
nucleus involved. In particular, different isotopes of the same element have
different shifts, since the nuclei are different; isotope shifts have been detected
experimentally, but the effect is small and has only been observed in the spec-
tra of heavy atoms (Z > 50). A more important consequence of this effect is
in Mossbauer spectroscopy, where the observed transitions are between differ-
ent nuclear energy levels. Since the size of a nucleus alters during these tran-
sitions, the exact energy change involved depends on the environment of the
nuclei and in particular the distribution of the electrons in the molecule. These
effects must be treated phenomenologically in view of the uncertainty about
nuclear structure.
The other consequence of finite nuclear size is that many nuclei have quad-
rupole moments. One model that is frequently given takes the nucleus to be a
collection of (charged) protons and (uncharged) neutrons. If the position of
proton p in nucleus a relative to the centre of charge of nucleus @ is rpq,
the electrostatic interaction energy between the nucleus and an electron at
position feq is:
eo. = 2 € [Amegitan'= thal. (10.7)
Normally the electron is well outside the nucleus so that Equation 10.7 may
be expanded as a power Series in fpq about req. This may be accomplished by
noting that:
f(r—a) = [1—(a.V) + xa.V) +... 1f@). (10.8)
184 Advanced Molecular Quantum Mechanics
Ulta —trols, Uleack Geoitey) taee helen am BUT gels) clea eee
(10.9)
a term involving 5(teq) has been omitted, since it is assumed that rea> rp
When Equation 10.9 is substituted into Equation 10.7 the first term that
arises is — D e7/41€oreg = —Zge’/4m€oreg, Which is the familiar Coulomb
Dp
term. The next contribution involves 2 ery,, which is the electric dipole
D
moment of nucleus a, and vanishes, since rpq is the position of a proton
relative to the centre of charge of the nucleus. Although the model used here
is a simple one, it is a general result, both experimentally and theoretically,
that nuclei have no electric dipole moments. The theoretical demonstration of
this fact is based on the assumption that the states of nuclei have definite
parity, that is they are either symmetric or antisymmetric with respect to the
inversion of all coordinates. The expectation value of the electric dipole
moment operator, which is of odd parity, must then vanish.
The third and fourth terms in Equation 10.9 are both quadratic in the com-
ponents of rpq and may also be factorized into the product of an electronic
term and a term dependent on nuclear structure:
rat Teel ot se ul 3 (rey tay) /2S Grveat par Sieh a OrcetaairoRlea Orca
(10.10)
provided the electron is always taken to be outside the nucleus so that the
delta function contained in the right-hand side of Equation 10.14 may be ig-
nored. Equation 10.13 may be extended by summing over all the electrons in
the molecule and, with a little modification, the contribution of other nuclei
may be included. The total nuclear quadrupole interaction energy Hg is then
given by a summation of the expression 10.11 over all’ the nuclei in the
molecule.
The largest of the terms that have been neglected in the expansion (Equa-
tion 10.9) are cubic in the components of rpq and are related to the electric
octupole moment of the nucleus. However, they vanish, since parity arguments
show that nuclei do not have electric octupole moments. Thus the next non-
zero contribution to Equation 10.7 involves the electric hexadecapole moment
of the nucleus, but at present this is of no experimental significance. In this
connection we might note that, of the nuclear magnetic moments, the mono-
pole and quadrupole moments are zero, but that the dipole and octupole mo-
ments do not vanish; the dipole moment has already been considered, but, al-
though magnetic octupole moment effects have been detected in iodine, indium
and gallium atoms, they are at present at the limit of experimental significance.
The molecular Hamiltonian that has been derived here is expressed in terms of
the position vectors of the individual particles. This form has the disadvantage
that it does not reflect the basic physical features of molecules. A molecule is
essentially a semi-rigid nuclear framework supporting a distribution of elec-
trons which is responsible for the chemical bonds. (In this and the following
section molecules are considered, but atoms can always be regarded as special
cases.) Because nuclei are much more massive than electrons, nuclear and elec-
tronic motion are largely separated from each other and have very different
energies; in addition, molecular translation, rotation and vibration also have
characteristically different energies from each other. Consequently, it is ex-
pedient to write the Hamiltonian of a molecule in such a way that translation,
rotation, vibration and electronic motion are separated as much as possible.
To do this it is appropriate to change the molecular variables from the position
vectors of individual particles to translational, rotational, vibrational and elec-
tronic coordinates.
Since the Hamiltonian includes relativistic corrections, it might be thought
that a Lorentz transformation should be used in transforming to molecular
coordinates. However, the frame of reference in which physical measurements
are made is not changed so that the transformation is merely a change of
186 Advanced Molecular Quantum Mechanics
where rQ is the equilibrium position of nucleus a and the 1,, are constant vec-
tors chosen so that the normal coordinates are orthogonal to each other and
to the molecular rotations and translations; the P, are then the vibrational
momenta conjugate to the vibrational coordinates Q,. The vector N is the
rotational angular momentum of the nuclei and differs from the total angu-
lar momentum J of the molecule by the electronic and vibrational angular
momenta, L and G respectively:
where W trans is a translational wave function and depends only on the position
R of the centre of mass of the molecule, W,,; is a rotational wave function
depending on the angles specifying the orientation of the molecule, Wy, is a
function of the (vibrational) normal coordinates Q, and the electronic wave
function W., depends only on the electronic coordinates r; (and possibly on
Q, as well); Weisp and Wnuesp are electron and nuclear spin functions, respec-
tively. The translational part of the problem may easily be separated off,
since at present there appear to be no significant interactions between trans-
lation and other types of molecular motion. However, there are important
interactions between electronic, vibrational and rotational motion and, in ad-
dition, external electromagnetic fields may be present.
In principle the molecular Schrodinger equation could now be solved by
setting up the Hamiltonian matrix and using appropriate techniques to
The Molecular Hamiltonian 189
This expression for (ZL;) still contains the operator J;, but the introduction of
orbital angular momentum into the ground state is due to molecular rotation
The Molecular Hamiltonian 19]
and Equation 10.27 still has to be averaged over the rotational states.
The total Hamiltonian is from Equations 10.22 and 10.25:
Bibliography
In non-relativistic theory the hydrogen atom is one of the few cases for which
the Schrodinger equation may be solved exactly. For this to be possible the
proton and electron are assumed to be spinless, and the only role that the pro-
ton plays is to provide an attractive Coulombic potential; the finite mass of
the proton may, of course, be taken into account by using a reduced mass
for the electron.
If the proton is regarded as merely an infinitely massive source of electro-
static potential, the Dirac equation for the electron may also be solved exactly.
This chapterisdevoted to its solution using this model for a one-electron atom,
although the more general case where the nucleus has charge Ze is considered,
since the results demonstrate that relativistic effects are increasingly im-
portant for heavy atoms. It is noted that the effect of the finite mass of the
nucleus could be largely accounted for by using a reduced mass for the elec-
tron, although unlike the non-relativistic case this procedure is not exact. Even
with the simplification of an infinite mass for the nucleus, the mathematics
involved is tedious though straightforward and for this reason only an outline
will be given. However, the reader should by now have acquired some facility
inthe manipulations involved in evaluating commutators, for example, and no
difficulties should be encountered.
194
The Hydrogen Atom 195
In addition, this brief review facilitates comparison of the relativistic and non-
relativistic solutions.
The non-relativistic Hamiltonian is:
Ey = —me2
8h egn = — mZ0 nn’, (it 2)
where a (= e*/2heoc) is the fine structure constant (Equation 8.85). Using
spherical polar coordinates (7, 6, ¢) for the position of the electron relative to
the nucleus the corresponding eigenfunctions may be written as the product
of radial and angular parts:
p = Zrlao (11.6)
and do (= 4m€oh"/me?) is the Bohr radius. In general these radial functions ex-
hibit nodes, that is they vanish for certain values of r; in particular, the func-
tion R,,,(r) has (n — /— 1) nodes.
196 Advanced Molecular Quantum Mechanics
The angular functions Y;,,(0, 6) are of more interest, since they will be
used to construct the solutions of the Dirac equation. Specifically they are
spherical harmonics and may be written as:
Yim (0,0) = [(21+ 1) — m)!/4m( + m)!]? Pim (cos 0) exp (im), (11.7)
where
ie l+m l+m
In addition, their behaviour under the operators /,. and l, may be expressed in
the following manner:
respectively, and the behaviour of the functions under the operators s, and
sy is given by equations analogous to Equations 11.10 for the corresponding
orbital angular momentum operators. However, perturbation theory using
these new basis functions cannot be used directly, since the operator s.l in H,
(Equation 11.17) has non-zero off-diagonal matrix elements between levels
which are degenerate in zeroth order; the other operators in d(; only connect
levels with different 1. This problem may be overcome by choosing new basis
functions, which are linear combinations of the old, but which are diagonal in
the operator s.l. One way of doing this is to diagonalize the matrix of s.l for a
given value of n, but a different approach is used here.
The root of the problem is that the basis functions are characterized by the
quantum numbers m, and m,, but that the perturbation H, does not commute
with the components of the operators l and s. On the other hand /” and s* do
commute with #,, so that / and s are still good quantum numbers. However,
the components of the total angular momentum operator:
Resor 1 = 25.
4~1. (11.20)
whose eigenvalues are given the symbol k. This new quantum number conveys
no new information, but may be used instead of / or j. The reason for its
introduction here is that its use facilitates the solution of the Dirac equation
later in this chapter.
The restrictions on the values of k may easily be determined. From the defi-
nition of the operator K (Equation 11.20):
K?
1 + 2(0.1) + (@.1)?
ll 1+2@1)+2 +ic.d,b
L+ol)+?, (11.21)
where use has been made of the relation 8.40 and the fact that 1,1 is just il.
Substitution for (¢.I) using Equation 11.20 now gives the operator equation:
= KA—Ky (1:22)
so that:
Aha 1) = K(k 1); (11.23)
—(p4)/8m3c? = (mc?Z4a4/2n3)
[3/4n — 1/(1+ 5)], (11.27)
while the spin-orbit coupling term gives:
where Equations 11.11 and 11.19 have been used. For /= 0 the expression
11.28 is strictly indeterminate, since 7=s= 5 and both the denominator and
numerator vanish. However, agreement with experiment is obtained if it is
taken to be zero, a choice which is confirmed by the exact solution of the
Dirac equation. Finally, the Darwin-type term makes a contribution:
(11.31)
In the non-relativistic theory all levels with the same value of n are degenerate,
but this degeneracy is lifted when the relativistic corrections are included.
However, for a given n the energy of a level only depends on the quantum
number j and not on 1. If only the spin-orbit coupling had been included, the
energy would have been a function of / as well, but the relativistic kinetic
The Hydrogen Atom 201
energy Correction ensures that levels characterized by the same values of n and
j, but different values of /, are degenerate. This degeneracy persists even if the
Dirac equation for a one-electron atom is solved exactly, as will be seen in
Section 11.7.
If the notation used for a level is ni; the first few levels are:
Isi(k= +1)
[2s (kK = + 1), 2ps(k = — 1), 2Paalk = + 2)
[3 (k= + 1), 3ps(k = — 1], [3panlk = + 2), 3dga(k = —2)],
3ds2(k = + 3)
= (11.32)
where degenerate levels are enclosed in square parentheses. Of course, each of
these levels has a further (27 + 1)-fold degeneracy corresponding to the poss-
ible values of m;, but this degeneracy is only removed by an external magnetic
field. Finally, it should be noted that, although the Dirac theory predicts that,
for example, the 2s1 and 2p levels are degenerate, experiment shows that in
the hydrogen atom the 281 level has a slightly higher energy than the 2p1 level.
This effect, which is called the Lamb shift, may be explained by quantum
electrodynamics and is considered briefly in Section 11.9.
In the preceding section use was made of basis functions which are eigenfunc-
tions of the operators j”, 1”, s”, K andj,, but they were not related explicitly
to the original basis functions which are eigenfunctions of J”, s”, J, and s,. Their
identity must now be considered and a number of additional properties estab-
lished, since they are to be used in the exact solution of the Dirac equation.
Since they are linear combinations of the original basis functions, they must
be two-component functions, which can be written as:
pe (ob (11.33)
W2
TAP sea is Wi
Ky = =k ; (11.36)
hy auhl 1 aoa W2 V2
Expansion of this relation gives two equations connecting W, and W2, but only
one is needed:
C1 + L)Wit Ue — ily2 = ky. (11.37)
On substitution for ~, and W2 (Equations 11.34) and use of Equations 11.9
and 11.10 this gives:
y= fy, (11.39)
where
C1Y im,
X= : (11.40)
C2Y 1,m)+1
The constants c, and c2 may easily be obtained, since their ratio is given by
Equation 11.38 and, if f(r) is normalized, normalization of y requires that
|e,|? + |egI?=1. In quoting the results two possibilities may be recognized
depending on whether & is greater than or less than zero; these two cases are
distinguished by a superscript + or —, respectively. For k positive (Equations
11.26a):
The Hydrogen Atom 203
V = fOXim;= 10 ("
a0cae ‘) (11.41)
where ari
= [Gt+mp/yP?; ck = [G—m) /2/}3, (11.42)
while for k negative (Equations 11.26b):
1Yj+4 ,m; =2
Y TUX: = ro (° ia | (11.43)
C2 j+h, my+3
where
= [(J—mjt+D2UG+DP; of = —[G+m,t+D2G4+
Di. (11.44)
It is also easy to confirm that these two-component angular functions
es behave in the expected manner under the various operators:
Thus for 6 = 0:
+ + 1 a ! 0 Smj,+4
lone
x ae eina — ss =
ml j
alan)
= 2
ie cea
had
= [Gi[7+ + 3)/47]
D/Any? Eads = Xing
x? (11.51 )
(OLXin = Ximp-
J
(11.52)
since, although it has been determined for the special case 9 = 0, aj is a con-
stant and independent of 0, as may be seen by noting that the square of (o.r)/r
is just 1 by Equation 8.40.
does commute with H, j? and j,, and may also be used to classify the eigen-
functions of the Dirac Hamiltonian. It is similar to the operator (Equation
11.20) used in the non-relativistic theory and is denoted by the same symbol,
but since K now possesses a four-by-four matrix representation no confusion
should arise. It may be noted that the operator B(¢.1) does not commute with
the Dirac Hamiltonian and the non-relativistic operator was chosen to be
(o.1+ 1) rather than oI so that the same eigenfunction k can be used in both
the exact and the approximate theory.
Since H, j*, 7, and K all commute with each other, the solutions of the
Dirac equation are simultaneous eigenfunctions of these operators. However,
an eigenfunction Y must now have four components and may be written in
terms of two two-component functions, W, and W_:
We
v= | ) (11.55)
The last of these three relations is obtained by using the explicit form of the
Dirac matrix 8; here the operator (0.1 + 1) has a two-by-two matrix representa-
tion and is just the operator (Equation 11.20) used earlier. Comparison of
Equations 11.56 and 11.45 now shows that the two-component functions
W, and W_ may be taken to be the functions 11.41 and 11.43 of the previous
section; again two possibilities arise depending on whether k is positive or
negative. Explicitly:
This result may easily be derived by using Equation 11.52 and recalling that
the four-by-four matrix representation of o is just the direct product of the
two-by-two representation with the two-by-two unit matrix.
The components of the solutions of the Dirac equation have now been expressed
as the products of radial and angular functions. To proceed further it is neces-
sary to express the Dirac Hamiltonian itself in polar coordinates. Substitution
of Equation 11.57 into the Dirac equation will then give equations for the
radial functions f(r) and g(r).
To perform the transformation to polar coordinates, we use the fact that,
from Equation 2.38:
r,t, V) =20.V)—rVv. (11.59)
Rearrangement of this equation shows that the operator V may be split into a
radial and an angular part:
V = (/)[rV)—r,
0, V)I. (11.60)
The radial part involves (1/r)(r.V), which may readily be shown to be just
(0/dr), while the angular part contains (r , V), which is related to the orbital
angular momentum operator Al. Thus, the operator (a.p) is given by:
where the fact that r.] vanishes has been used. Finally, multiplication of both
sides of Equation 11.54 by 6 followed by rearrangement gives:
If, in addition, the fine structure constant a and the energy parameter €
(= E/mc’) are introduced, then after division throughout by fc the equations
become:
[(mce/h)(1 — €) — Za/r|
F — (0G/dr) — kG/r 0
(11.68)
[—(mce/h)0 + €) — Za/r|G + (OF /ar) — kKF/r Il 0}.
To solve the radial equations (11.68) series solutions are sought, but in doing
this two conditions may be imposed. First, we may restrict ourselves to bound
states, so that € (= E/mc’) is less than unity. Secondly, for bound states it
must be possible to normalize the eigenfunctions.
A starting point for the solution is provided by solving the radial equations
for the special case when r tends to infinity and they reduce to:
208 Advanced Molecular Quantum Mechanics
(dG/or) ee
= (mc/h)
hi — €)
—e)F Mie
the original equations (11.71) show that the constants cp and Cg are related by:
lag ik PA a a A ee (11.78)
where one (or both) of ag and bo is non-zero. Substituting in Equations 11.77
and equating the coefficients of p**”"! on the two sides of the equations gives:
h = +(k?—Za2)?. (11.81)
A decision may be made between the two possibilities for A by invoking
the condition that the eigenfunctions of the Dirac Hamiltonian must be nor-
malizable for bound states. Since the angular functions are already normalized,
this normalization condition reduces to one for the radial functions:
From the second of Equations 11.79 the ratio a,/b, is seen to be:
dolby = —n'[[k
+ Zo(1 — ey 7], (11.88)
so that this possibility is acceptable provided the denominator does not vanish
as well as the numerator n’. Now for n’ = 0:
N= (k?—Z0?)? = Zae(1 —€?)?, (11.89)
where Equations 11.81 and 11.83 have been used. Solution of this equation
for k gives:
k =\4 Zoli ey ?; (11.90)
The Hydrogen Atom ail
for the negative solution the denominator in Equation 11.88 vanishes and
ao/bo is indeterminate, but the positive solution is permissible. Thus there are
solutions for n'=0, but they are only allowed if the quantum number k is
positive.
To complete the solution of the radial equations for bound states it is only
necessary to normalize the radial functions, that is to determine the magni-
tudes of do and bo. This is not a trivial problem, since F and G involve an ex-
ponential and, because A is non-integral, fractional powers of p. Nevertheless,
normalization is possible using gamma functions, which have to be evaluated
numerically and may be obtained from tables; the inclusion of the details here
would serve little purpose.
Although the eigenfunctions of the Dirac Hamiltonian have now been deter-
mined, the eigenvalues E (= emc”) have yet to be found. Now from Equation
11.83:
e(1—e)? = (n' +)/Za, (11.91)
where A is (k?— Za”)? and n’ is a positive integer or zero. Solution of this
equation gives:
E = mc’e = mce?[1+ (Zaf(n’ +d))’]?. (11.92)
At this stage it is convenient to introduce:
n=nt+kl. (11.93)
Since n’ = 0, 1,2,...and the minimum value of |k| is + 1, the new quantum
number 7 can be any positive integer but not zero. In addition, for a given
value of n the maximum value that |k| can take is n, although k = — n is not
allowed, since this corresponds to n' = 0 for which k cannot be negative. Thus,
the possible values of k for a given n are 1,2,...,n if k is positive and — 1,
—2,...,—(n—1) if k is negative. When it is recalled that k is +(j + +), it
may be seen that j, k, and n can have just the same values as in the approximate
two-component theory (Equations 11.26) and in particular the new quantum
number 7 may be identified with the non-relativistic principal quantum num-
ber 7. In the four-component theory / is no longer a good quantum number,
although the large and small components, y, and W_, are separately eigenfunc-
tions of /* as may be seen from Equations 11.41, 11.43 and 11.57. It was for
this reason that the quantum number k was introduced earlier. Nevertheless,
it is convenient to use the notation 11.32 for the different levels, the / value
for the large components being employed.
212 Advanced Molecular Quantum Mechanics
Substituting for n’ and A in Equation 11.92 gives finally for the eigenvalues:
ratio of the contributions to the probability density, it is seen that the small
components are responsible for about ten per cent.
To compare and contrast relativistic and non-relativistic atomic orbitals,
approximate Dirac eigenfunctions for n= 1 and n=2 are used. These are
given in the table and are correct to order Za. They are normalized to this
order provided each is multiplied by the factor 12(Z/ao)?”, where do is the
Bohr radius; in addition, the dimensionless parameter p (Equation 11.76) is to
this approximation just Zr/ao, the same quantity as that employed in the non-
relativistic theory (Equation 11.6). Although these functions become less ac-
curate as Z increases, they exhibit the same qualitative features as the exact
functions, and are easier to handle; the results obtained may be generalized to
higher values of 7 without difficulty.
The probability density distribution |W|? is the real interest, but radial
parts of the individual components of W are considered first; the angular parts
involve spherical harmonics and do not need special consideration. For each
four-component function there are two radial functions, one for the large
components f(r) and one for the small components g(r). For f there are n’ or
(n' —1) nodes in the radial function depending on whether k is positive or
negative, respectively. If the value of / appropriate to the large components is
used, we may say that f has (n —/— 1) nodes. This result is the same as that
for non-relativistic orbitals and indeed, to the order of accuracy used in the
table, the radial functions of the large components are the same as those in the
non-relativistic theory. For g the number of radial nodes is n’ regardless of the
sign of k; this means that when k is positive the large and small components
have the same number of radial nodes as each other, and in the special case
when n’ = 0 and k can only be positive f and g are both nodeless. The impor-
tant thing to note is that, although the large and small components may have
the same number of radial nodes, a node in g never occurs at the same value of
p as a node in the corresponding radial function f.
When | |? is considered it is immediately apparent from the table that it
may be factorized into a radial and an angular part. This is because, although
|W,|? and |W_|* have different radial dependences, their angular behaviour is
similar and the ratio of |W,|? to|W_|?is independent of @ and ¢.
The radial part of |W |? is thus a linear combination of ||? and |g|?. Since
the large components predominate, the radial dependence is very like that of
the corresponding non-relativistic orbitals, but unlike these the Dirac orbitals
have no radial nodes. Although f has nodes at the same places as the corres-
ponding non-relativistic radial functions, g never vanishes at these points. As
an example the 2s: orbital may be considered. At p = 2 both fand the non-
relativistic 2s radial function possess a node, but the node in g occurs at
Table 11.1. Eigenfunctions of the Dirac Hamiltonian, for a one-electron atom
correct to order Za. Each function should be multiplied by a factorm 2(Z/ao)*””,
where dg is the Bohr radius; p is the dimensionless parameter Zr/do.
Mate
Ge \
e?
1. By
3iZae” cos 0 siZae
1. —9
sin= 0e7'*
=
1. - ° i 1. =
5iZae ? sin be’? — ziZae™? cos 0
eP(1 — p/2)
1 0 1 e?( 1 — p/2)
2/2 siZae "(1 — p/4) cos 6 2V2 | tiZaé?(1 — p/4) sin de!
5iZae°’?(1 — p/4) sin be!% — siZaé’?(1 — p/4) cos 0 /
m= + 5 aye tell
Wy ne
e °/20 cos 0 e-/29 sin Oe*
1 e-P/2sin A el? 1 — e256 cos 0
4V6 \ —(3/2)iZae"(1 — p/6) 4/6 0
0 — (3/2)iZaeP"(1 — p/6)
The Hydrogen Atom 2V9
2Paa(n = 2,j= 3/2, k= + 2,n'=0)
m; = +3/2 iy = 3/2
e °’? sin Oe!” 0
1 0 1 | eo sin 6e7%?
8 | (1/4)iZae~’*p sin 0 cos be!* 8 (1/4)iZae~?’?p sin? Oe?
(1/4)iZae~?!?p sin? Oe? —(1/4)iZae?”p sin 6 cos 0e*%
m, = +3 m; = —%
e °? » cos 6 56°/29 sin Oe 1%
1 |—3e?? p sin Oe? 1 |e?"p cos @
4/3! — (1 /8)iZae”’?p(1 —3 cos?4) 4/3 (3/8)iZae~?p sin @ cos de %?
(3/8)iZae~?’*p sin 6 cos be!” (1/8)iZae™?’?
p(1 —3 cos?
6)
j=3/2,\mz\/2
(1+3 cos®@)
j=3/2, \mp=3/2
4sin2@
/=5/2, l= 3/2
sin@@ (16-15 sin@@)
Fig. 11.1. The angular distribution for selected Dirac orbitals. Each is cylin-
drically symmetric about the z axis.
cated, but two limiting cases may be distinguished. For |m;|= 5 the angular
dependence resembles a dumb-bell with its axis along the z axis; for j = 3/2
the angular part of |W |? is proportional to (1 + 3 cos? 0) and is plotted in Fig.
11.1. At the opposite extreme, when |; | takes its maximum value of/, the
distribution is toroidal and an example of this case is also given; the specific
case shown is also for j = 3/2, the angular dependence being proportional to
4 sin’ @. For intermediate values of m, the orbitals are also toroidal, but with
a number of lobes, the example in the figure being for 7= 5/2 and m, = 3/2,
for which |W |? is proportional to sin? (16 — 15 sin? 6).
Although the first angular function shown in the figure is not unlike that of
a non-relativistic p, orbital, it has no node perpendicular to the z axis. The
other angular functions are also nodeless, since they are cylindrically symmet-
ric about the z axis. The lack of angular nodes is a general property of Dirac
orbitals. |
We have already seen that relativistic orbitals have no radial nodes, so that
it may be concluded that except at the nucleus itself they have no nodes at all;
there are no radial nodes because the eigenfunctions of the Dirac Hamiltonian
have four components and the small components prevent them occuring, while
the lack of angular nodes may be attributed to the fact that the electron has
spin .
This is in sharp contrast to non-relativistic orbitals, which in general have
both angular and radial nodes, a feature which raises the question of how an
electron gets from one part of an orbital to another, if there is a node in be-
tween. However, this problem no longer arises if relativistic atomic orbitals are
used, since they are nodeless.
Both the exact (Equation 11.94) and approximate (Equation 11.95) eigen-
values of the Dirac Hamiltonian for a one-electron atom show that levels
characterized by the same values of the quantum numbers n and j are de-
generate. However, this degeneracy is in fact removed by the Lamb shift,
which is of quantum electrodynamic origin. The largest manifestation of this
effect occurs for the m = 2 levels in the hydrogen atom and is illustrated in
Fig. 11.2, which shows the successive effects on the non-relativistic energy of
including relativistic effects and the Lamb shift; the energy separations are
given in terms of frequency (1 GHz = 10?Hz = 0.03336 cm‘) and are to be
compared with the non-relativistic energy lowering of 8.22 X 10°GHz relative
to the rest energy of the electron.
218 Advanced Molecular Quantum Mechanics
n=2
\
VeNS 2:737 GHz
ene
ty ars sitive, Sse
\ 2 Pyjp(k=2)
\
\
\
\
\
\
\
\ 10-949 GHz
\
\
\
\
\
\
25,9 (A=N)
-
\ Be 1-058 GHz
Se ee eee ee eee
22172 (k=-1)
Fig. 11.2 The effect of relativistic corrections and the Lamb shift on the
energies of the n = 2 levels in the hydrogen atom.
— e(5*)
V7 o(r)/6, (11.98)
where ¢(r) is the scalar potential and & is the instantaneous deviation of the
electron from its mean position. For the Zitterbewegung the time average
(8*) is of the order of (A/mc)?.
The Lamb shift may be thought of as arising from an additional smearing
out of the electron’s charge that is superimposed on that due to the Zitter-
bewegung. This arises from the quantization of the electromagnetic field,
which is the subject of the next chapter. Nevertheless, we may anticipate this
discussion to the extent of recognizing that the quantized field has zero-point
energy in the same way that a quantized harmonic oscillator does. This result
is not unexpected, since the classical field may be described in terms of a
collection of classical harmonic oscillators (Section 6.7). Consequently, even
a field-free vacuum contains electromagnetic energy in the form of so-called
‘vacuum fluctuations’, which interact with the electron and cause it to under-
go a rapid random motion. Detailed calculations, which cannot be presented
here, show that for an electron in an atom the mean square deviation of this
random motion is:
(5?) = (2a/7)(h/mc)*|n(2n?/a*), (11.99)
which is smaller than that due to the Zitterbewegung by one order in a.
If the result (Equation 11.99) is accepted, the crude model of Section
8.10 may be used to estimate the magnitude of the Lamb shift. In addition
to (57) the expression 11.98 involves V7¢(r) and in a one-electron atom this
is just — Zed(r)/E9 as may be seen from Equation 11.15. Thus the Lamb
shift is given by the operator:
(e?Za/37€9)(h/mc)?5(4)In(2n?/a’). (11.100)
The expectation value for an electron in a non-relativistic s orbital with
principal quantum number n is from Equations 11.11 just:
mce?Z*a5(4/31n?)In(2n?/a’), (11.101)
while in other atomic orbitals it vanishes. Although the model used is crude,
more sophisticated approaches give the same result for non-relativistic
orbitals.
For the hydrogen atom 11.101 gives 1.040 GHz for the Lamb shift of the
2s level; the 2p levels are unaffected. This is to be compared with 1.058 GHz,
220 Advanced Molecular Quantum Mechanics
the experimental separation of 251 and 2p:. The discrepancy has been ex-
plained to within 1 MHz by relativistic, higher-order quantum electrodynamic
and other effects. These corrections also produce small shifts in levels other
than s levels. Lamb shifts have also been observed in the hydrogen atom for
n>2 and in other one-electron atoms and they have all been explained
quantitatively. They also occur in many-electron systems, but are more com-
plicated; electron-electron interactions are of course present and full calcu-
lations have only been performed for the simplest species.
So far this chapter has been devoted to the relativistic treatment of one-
electron atoms in which the nucleus is assumed to be stationary and spinless.
It has already been remarked that the finite mass of the nucleus may largely
be accounted for by using a reduced mass for the electron and, except for
a few remarks about the influence of nuclear spin, this section is concerned
with many-electron atoms.
The presence of a nuclear spin magnetic moment in a one-electron atom
may be effectively allowed for by using perturbation theory, since the cor-
rections introduced are relatively small. If non-relativistic atomic orbitals are
employed as spatial basis functions, it is appropriate to use a perturbation
based on the approximate Hamiltonian of chapter 10. In particular the
nuclear hyperfine interaction arises from the Fermi contact term (10.6f); the
dipole-dipole term averages to zero for an atom because of its spherical sym-
metry. However, it is only permissible to use first-order perturbation theory
for the Fermi contact interaction, since it has its origin in the Breit operator.
Indeed, for the hydrogen atom the estimated hyperfine splitting is infinite if
second-order perturbation theory is used, although the cause of this diver-
gence has also been attributed to treating the proton as a point particle.
An alternative approach is to employ Dirac atomic orbitals but, although
it is tempting to use, for example, the Fermi contact operator as a pertur-
bation, this is not permissible, since it is a non-relativistic operator. If it is
used, even first-order perturbation theory gives infinite answers, since Dirac
orbitals with 7= 4 have weak singularities at the nucleus as the exact radial
functions of Section 11.6 show. The correct procedure is to substitute the
vector potential due to the nuclear spin magnetic moment into the Dirac
Hamiltonian. This gives:
this operator using all Dirac orbitals with the same n as basis functions gives
the same results as the non-relativistic theory except for a correction of rela-
tive order (Za)*. For example, the correction factor for the 1s: level is
(1 + 3(Za)?/2 +... .); correction factors for other levels have been tabulated
in the literature, but except for large Z they are relatively unimportant.
For many-electron atoms perturbation theory must again be used to esti-
mate the effect of small terms in the Hamiltonian, such as those involving
nuclear spin. However, the choice arises as to what electronic basis functions
to use. The usual approach is to obtain eigenfunctions of the non-relativistic
Hamiltonian, which consists of the kinetic energy of the electrons and all the
Coulomb interactions. Even these can only be obtained approximately and in
general consist of a linear combination of determinants involving one-electron
functions that have been optimized using a self-consistent field criterion, the
starting point for the iterative procedure being eigenfunctions for one-electron
atoms. For inner electrons and heavy nuclei the relativistic corrections are
relatively large and their calculation by perturbation theory is not very satis-
factory. This is becoming a serious drawback, since attention is now being
focused on inner electrons by the use of experimental techniques such as
X-ray photoelectron spectroscopy. This situation has led to the use of the
Dirac Hamiltonian for the individual electrons, since allowance for the relati-
vistic corrections involving individual electrons is then automatic; in particu-
lar, account is taken of such things as spin-orbit coupling and the relati-
vistic corrections to the kinetic energies of the electrons. An appropriate
procedure is described briefly here; for the details reference must be made
to the research literature, since the mathematical techniques used are outside
the scope of this book.
The approximate Hamiltonian for a many-electron atom may be taken as:
eee l t
E e7/4n€or ij,
J
(11.103)
where H; is the Dirac Hamiltonian for electron 7 and includes the Coulomb
interaction between the electron and the nucleus. The term representing the
Coulomb interaction between the electrons is not even approximately Lorentz
invariant, but the terms that have been omitted from the full Hamiltonian
may be treated as perturbations on the eigenfunctions of 11.103. Indeed,
to describe the interaction between two electrons better the Breit operator
9.41 may be added, but as we have emphasized repeatedly it is only ap-
propriate to use it in a first-order perturbation treatment. Other perturbations
that could be included are those involving the nuclear spin magnetic moment
and quadrupole moment; electron spin is of course included automatically
in the Hamiltonian 11.103. The problem then is to find eigenfunctions of
222 Advanced Molecular Quantum Mechanics
11.103 that can be used as basis functions for the perturbation treatment.
Now the eigenfunctions of the operator 2 H; in 11.103 are just products
t
these products is taken. The Pauli exclusion principle restricts the possible
linear combinations to those which are antisymmetric with respect to inter-
change of any pair of electrons and this linear combination may conveniently
be written as a determinant, the elements being (four-component) Dirac
functions. For closed shell configurations only one determinant is needed, but
for open shell configurations a linear combination of determinants is neces-
sary, although the number involved may be reduced by choosing the com-
binations to be eigenfunctions of the total angular momentum operator iz
This means that an iterative procedure must be adopted, the Dirac orbitals
for a one-electron atom being used to calculate initial potential energies
V(r;).
The improved one-electron functions so obtained are then used to obtain
better potentials and the process repeated until a self-consistent field is
achieved. This procedure is quite laborious and in practice the whole calcu-
lation may be accomplished in one step using the variation method; this has
the added advantage that the exchange interaction is included, if determin-
ental wave functions are used.
In Section 11.7 it was seen that for one-electron atoms eigenfunctions
with the same values of m and j are degenerate. These degeneracies of one-
electron functions do not occur for many-electron atoms, since the potential
experienced by an electron is no longer Coulombic. The energy of an electron
then depends on how effectively it is screened from the nucleus by the other
electrons. The lower the value of/ associated with the large components of a
one-electron function the lower is its energy; the same situation occurs in the
non-relativistic theory.
The wave functions obtained using this self-consistent field approach are
only approximate, but they may be improved by using techniques, such as
configuration interaction, which are well established in the non-relativistic
theory of many-electron atoms.
The relativistic theory for atoms is thus similar to the non-relativistic
theory, although the details must be modified to allow for the fact that
relativistic eigenfunctions have four components; the amount of labour in-
volved is greater and the mathematical techniques employed are more sophis-
ticated. In addition, it is only comparatively recently that relativistic
calculations of atomic structure along these lines have been attempted in any
great number and at present they are by no means routine. For these reasons
the details are not given here and reference must be made to the specialist
literature.
Finally, a few comments may be made about calculations on molecules
using relativistic atomic orbitals. Since molecules do not possess spherical
symmetry, acentral-field approximation is not poss.vle and the wavefunctions
cannot be factorized into radial and angular parts. A popular approach in
non-relativistic theory is to construct one-electron molecular orbitals, which
are linear combinations of a limited basis set of atomic functions, the variation
principle being used to calculate the coefficients or other adjustable para-
meters. In principle there is no reason why this should not be done using
relativistic atomic orbitals as a basis. However, although this approach has
been used for atoms themselves, at the present time little attention appears
224 Advanced Molecular Quantum Mechanics
to have been directed towards the use of relativistic atomic orbitals to de-
scribe molecular electronic structure. Recently attempts have been made to
solve the non-relativistic self-consistent field equations for simple molecules
by using numerical techniques and in principle this approach could also be
used in the relativistic theory.
Bibliography
Many-electron atoms
Grant, I. P. (1970), ‘Relativistic calculation of atomic structures’, Adv. Phys.,
19, 747: a review of relativistic calculations on atoms.
CHAPTER TWELVE
Before considering the electromagnetic field, the more familiar particle quan-
tization is reviewed, since it provides a guide for the quantization of field
variables. Classically the Hamiltonian of a particle may be expressed in terms
of its position r and its momentum p conjugate to r. Quantization may then
be accomplished by making r and p time-independent operators and in par-
ticular by identifying p with the operator — ih(0/dr). This procedure is equiv-
alent to imposing the condition that the operators p and r satisfy the com-
mutation relations:
[pi rj] Someone [Pi Dj] = [7:7] =,0. Gof =a Pez). (12.1)
AC 1) = & [taMAralr)
+ diaMAall, (12.2)
where
Qur(t) = Aa(0) exp (— iw,1) (12.3)
and
Aua(t) = (€oV)? ey exp (ik.r). (12.4)
Here the wave vector k and the angular frequency w, are related by wy, =
kc, and V, the volume under consideration, may be made arbitrarily large. In
addition, for each value of k there are two independent polarization vectors
€, and these are distinguished by the subscript A(= 1, 2). The two possi-
bilities for €,, are orthogonal to each other and because of the Coulomb
gauge condition they are both orthogonal to k:
The energy of the field is then given by the classical Hamiltonian (Equation
6.83):
H = & wx (Gindiar + VrVin: (12.6)
Although this Hamiltonian is expressed in terms of pairs of variables,
Qua and qyy, they do not form conjugate pairs and are not appropriate for
quantization. This situation was remedied in Section 6.7, where the new real
variables:
Ow = Wat dm; Pay st OW OE) (12.7)
were introduced, the inverse relation being:
but a problem arises for the electric field operator, since it is given by
E=— dA/dt and q,y, and qj, are both time-independent operators. The
origin of this difficulty is that we are using the Schrodinger picture in which
operators are time independent and the wave functions evolve with time.
The alternative formalism involves time-dependent operators and time-inde-
pendent wave functions. This is the Heisenberg picture, but it may be made
to coincide with the Schrodinger picture at t= 0. Heisenberg’s equation
of motion (Equation 1.24) shows that:
a result that may easily be obtained from Equations 12.8 to 12.10; this is
analogous to the classical result and may be used in the Schrédinger picture
at time ¢ = 0. The electric field operator is then:
H = 4€0Sf(c?B?
+ E*)dr. (12.18)
Quantum Field Theory 229
The manipulations involved are the same as those in the classical derivation of
Section 6.7 except that it is now important to preserve the order of gq, and
dx» since they do not commute. However, this was anticipated in the classi-
cal treatment so that Equation 6.83 is also valid quantum-mechanically:
G = % Ga, (12.21)
where
but these are merely a consequence of the definition of the electric and
magnetic fields in terms of the potentials. The divergence of the electric field
is easily shown to vanish, since in the gauge chosen ¢ vanishes and A satisfies
the Coulomb condition (V.A = 0), so that:
V.E = —a(V.A)/dt = 0. (12.24)
The remaining equation is more difficult to obtain. It is first noted that,
from Equations 12.16 and 12.17:
where
V i(k, Ana) = k(V.A,))— (K.V)A,, = — ik? Ayn (12.27)
has been used. Since w, = ck, comparison of Equations 12.25 and 12.26
shows that:
VRB c ?(dE/df), (12.28)
G = $hk(a*a + aa*)
= nk(N + 4), (12.44)
so that:
G|n) = Ak(n + 4)|n). (12.45)
The eigenvectors |m) may all be expressed in terms of |0), since repeated
use of Equations 12.41 shows that:
In the previous section the energy of the eigenfunction |n) for an individual
radiation oscillator was seen to be fiw(n + 4). The quantum number n may
be regarded as the number of quanta of energy Aw in the oscillator in ad-
dition to the zero-point energy of 4hw. For this reason n is known as the
occupation number and, since it is the eigenvalue of N, this operator is
called the number operator.
Quantum Field Theory 233
An operator specific to one wave vector and polarization only acts on the
corresponding part of the eigenfunction, so that, for example, use of Ny;
tells us how many quanta of type k;, A; are present:
as would be expected.
In specifying a state of the field it is only necessary to mention those
quantum numbers 1, that differ from zero, although all the possibilities for
k and A must be remembered in performing a summation such as in Equation
12.52 because of the zero-point energy. Thus, if there are no quanta present,
Lp Degas ee", On ap oe may be abbreviated to |0) and this is known as
the vacuum state.
234 Advanced Molecular Quantum Mechanics
Any state of the radiation field may be constructed from the vacuum
state by using the appropriate creation operators. For a state of the field with
just one quantum of type k, X:
Finally, we note that, since all creation operators commute with one another
(Equations 12.49), it does not matter in which order the quanta are created.
12.4 Photons
In the last section it was seen that the energy of an electromagnetic field is
the sum of the quanta of energy associated with the individual radiation
oscillators. These quanta may be identified with particles or photons and it
is the purpose of this section to show that this identification is consistent
with photons being massless particles which obey Bose-Einstein statistics.
If we consider just one photon with wave vector k and polarization A, we
know that it has energy how, and, from Equation 12.45, momentum hk.
Now Equation 4.41 relates the energy & and momentum p of a particle:
E = c(mc?
+ p?)2, (12.56)
so that the mass of the photon is given by:
m = (E*—c*p*)i/c? = h(w?
— c?k?)i/c?. (12.57)
Since w, = ck, Equation 12.57 shows that the photon has no mass.
In this formalism there can be any number of photons in the same state,
that is having a particular wave vector, polarization and energy. In addition, in
creating photons from the vacuum state the order of creation is immaterial
so that the state of the radiation field is symmetric with respect to inter-
change of any two photons; this fact depends on the commutation of all
creation operators. Particles with these properties satisfy Bose-Einstein stat-
istics, so that photons are bosons. These remarks apply to any type of particle
or quantum of energy, for which the creation operators all commute; for
Quantum Field Theory 235
field and this energy is infinite, since the sum is over all possible wave vectors.
From Equation 12.45 the field momentum of the vacuum state may be
seen to be PA $hk, but no problem arises here, since for every wave vector k
.
in the summation there is also a wave vector — k, so that the field momentum
vanishes.
Nevertheless, it is disturbing that the vacuum state has an infinite energy.
This problem may conveniently be ignored by taking the energy of the
vacuum state to be the origin of the energy scale. This may be achieved
formally by starting with the classical expression:
H = % hora: (12.59)
In addition, we must impose the rule that in products, such as those in-
volved in Equation 12.18, creation operators must always be placed on the
left of annihilation operators, since, if they are not, it is possible to have a
creation operator acting directly on the vacuum state and this leads to in-
finite quantities. This restriction may seem a little contrived, but a self-
consistent theory can be developed; in effect the zero-point energy of the
236 Advanced Molecular Quantum Mechanics
k,A
where Equation 12.4 has been used. The sum (Equation 12.63) is infinite
even if the volume V is allowed to become infinite, since the number density
(Equation 6.74) for k is proportional to V. Similarly we may obtain:
and by substituting this and Equation 12.63 into Equation 12.18 and inte-
grating over the volume V the zero-point energy, D 4/w,, may be recovered.
Thus, in the vacuum state the electric and magnetic fields fluctuate wildly,
although their expectation values vanish. These are the so-called vacuum
fluctuations. However, it is not at present clear whether the vacuum fluc-
tuations and the zero-point energy are one and the same thing. As we
have already remarked the latter may be formally removed from the theory,
but the former may not be avoided so easily. Indeed, the vacuum fluctuations
do lead to observable effects. For example, they have already been invoked
(Section 11.9) in a crude explanation of the major part of the Lamb shift.
However, further discussion of this topic is not possible here, since treating
the electron as a point charge leads to infinities and the circumvention of
this difficulty involves the more sophisticated techniques of mass and charge
renormalization.
The theory that has been developed in this chapter is appropriate to photons,
but it may readily be adapted to describe other particles that satisfy Bose-
Einstein statistics. It is based essentially on the commutation relations (Equa-
tion 12.30) for the creation and annihilation operators:
where
[A, B], = AB + BA. (12.67)
As for bosons we may define a vacuum state |0) in which there are no
particles present. A single-particle state may then be constructed by oper-
ating on the vacuum state with a creation operator:
where r * s. However, it is not possible to have two particles in the same state,
since from Equation 12.66:
N= O)-=) Dab.
| Os. 0:
N,|1,) II N,bs|0) = b*b,b*|0)
Bibliography
Photon spin
Grandy: pages 138-140.
240 Advanced Molecular Quantum Mechanics
Vacuum fluctuations
Grandy: pages 141-143.
Sakurai: pages 32-36.
Second quantization
Sakurai: pages 28-29 and 143-156.
Schiff, L. I. (1968), Quantum Mechanics, McGraw-Hill, New York: pages 498-
508.
CHAPTER THIRTEEN
The quantized field Hamiltonian Hy was obtained in the last chapter and, as
has already been remarked, the non-relativistic approximation is used for the
electron’s Hamiltonian H,. The interaction Hamiltonian H;,,; must now be
discussed.
The non-relativistic approximation to the Hamiltonian of an electron in
an electromagnetic field was derived in Chapter 8. This is a semiclassical
Hamiltonian in the sense that the radiation field was treated classically. From
this Hamiltonian (Equation 8.142) we may select those terms involving the
electromagnetic potentials to obtain the semiclassical interaction Hamiltonian:
This differs from Equation 13.3 in that there is no term in A” so that it is not
obvious that the two interaction Hamiltonians can ever lead to the same
results for some phenomena. This is a reflection of the fact that the Dirac
theory of the electron is really a many-particle theory, since it is necessary
to assume that all the negative energy states are occupied by electrons
(Section 8.6). For a correct treatment this has to be taken into account,
since the Dirac operator « connects positive and negative energy states. How-
ever, we will not go into the details, since for the generalization to molecules
it is more convenient to use Equation 13.3 as the interaction Hamiltonian.
where ¢;, is the derivative of c, with respect to time and Vz, is the time-
independent quantity:
This may be solved iteratively for R;; and so long as the perturbation is small
compared with (£; — £,,) the resulting series converges rapidly:
final state f, The probability of the system being in state f at time ¢ is just
|c,(t)|? and from Equation 13.16 this is given by:
The transition rate w,; from i to fis obtained by taking the time derivative of
this probability:
Lt, (nlm?
+1?)]= 8(2). (13.21)
If x *O the limit tends to zero, while for x =O the limit is infinite. In
addition, it may easily be shown that:
Wei = (2n/h)|Rei|?5(E7
— Ej); (13.23)
here, use has been made of the fact (Equation 2.88) that 5(cx) =c 18(x),
where c is a constant. In this connection we might note that 5(E — Ej) has
the dimensions (energy)', so that wy; has the correct dimensions of (time)?.
Equation 13.23 is to be used in conjunction with Equation 13.18, which in
the limit as n tends to zero becomes:
In this section the matrix elements of the individual terms in the perturbation
(Equation 13.3) are considered, since they are needed in the application of
Equations 13.23 and 13.24. For convenience, it is assumed that only one
type of photon is present, the generalization to all types being made later.
The state of the unperturbed system is then given by an eigenfunction of
Ho (= He + Hf):
|A, nxn) = |A)|ng,), (13.25)
where |A) and |.) are eigenfunctions of He and Hf, respectively.
The perturbation (e/m)(A.p) is linear in creation and annihilation oper-
ators, so that its only non-zero matrix elements are of the type:
= (im(Eg — E,)/he)‘Bler|A>
must be taken and, since €,,.k vanishes (Equations 12.5), the results are un-
altered if the scalar —e7r?/3 is added to each of the diagonal elements of T.
This gives a new tensor QO with elements:
(B, ny — 1|(eh/m\s.B)|A)
= i(Feyn/204, Veo)? (Ky € nx) (BI (€h/m)s| A), (13.39)
where the long wavelength approximation has been used. The similarity to
the other magnetic dipole matrix element (Equation 13.38) is easily seen.
The electric quadrupole and magnetic dipole contributions to the matrix
elements of the interaction Hamiltonian are in general smaller than the
electric dipole contribution by a factor of order kr, where k is the reciprocal
of the wavelength of the radiation and r is of molecular dimensions. They are
important when the electric dipole matrix element Pg, vanishes or when
special techniques are employed; for example, it is the magnetic dipole con-
tribution that is usually involved in magnetic resonance. In the rest of this
chapter only the electric dipole contributions are considered, since we are
interested in principles rather than details, but in practice the other contri-
butions may be dealt with in much the same way.
250 Advanced Molecular Quantum Mechanics
There is one term in the interaction Hamiltonian that has yet to be con-
sidered, this being (e?/2m)A?. Since it is quadratic in creation and annihil-
ation operators, it can connect states differing by zero or two photons and in
general two types of photon must be considered. Now (e7/2m)A? is a much
smaller perturbation than (e/m)(A.p), so that the long wavelength approxi-
mation may again be made and only the first term in the expansions of the
exponentials need be retained. The resulting matrix element connecting states
with the same total number of photons is:
A+ 7>B, (13.41)
where y is a photon. Initially just one type of photon is considered, but
eventually the summation over all types must be made. Thus, the initial
state 7 is |A, n,) and the final state fis |B, ny, — 1).
The transition rate is given by Equation 13.23:
where fw,, E,4 and Ep are the energy of the photon and the molecular
states A and B respectively.
So far just one type of photon has been considered and, in addition,
Equation 13.43 is an operator equation. It is now necessary to sum over
the photon polarization \ and all the possible wave vectors k. To do this it is
expedient to replace the summation over k by an integration over w and the
solid angle Q. At the same time a number density p(w) must be introduced
and from Equation 6.75 this is just Vw?/(27c)°. Thus we make the replace-
ment:
i f (Voo?/(21¢)?)dwd&. (13.44)
My = Mw)n?c3/hw®. (13.46)
Using 13.44 and Equation 13.46 the transition rate (Equation 13.43)
now becomes:
The sum over \ and the angular integration is performed first, the relevant
part of Equation 13.47 being:
The situation is illustrated in Fig. 13.1; x, is the angle between Pg, and
€,, and the angular integration is over 0 and ¢. The expression 13.48
2352 Advanced Molecular Quantum Mechanics
reduces to:
x f [Pzal7 cos? x, dQ = |Pgal?fsin?0dQ, (13.49)
have been used. The angular integral is now readily evaluated and 13.48
becomes (877/3)|P3,|*, so that the transition rate is:
where Bg, is the Einstein coefficient for absorption and, because energy
is conserved in the transition:
CA, (13.54)
The Interaction of Radiation and Matter 255
may be treated in very much the same way. If just one type of photon is con-
sidered the initial and final states are |B, m,,) and |A, ny, + 1) respectively
and from Equation 13.27 the transition rate for emission is:
it is noticed that for emission the transition rate may be split into two parts.
The first is proportional to m,, and hence to the intensity of the radiation.
This gives rise to stimulated emission, since it depends on radiation being
present. The second part is independent of n,, and has no counterpart in ab-
sorption. It is responsible for spontaneous emission, since it does not vanish
when there is no radiation present initially. These two parts are considered
separately.
The stimulated emission term is similar to the absorption transition rate
(Equation 13.43) and may be processed in the same way. Reference to
Equation 13.52 shows that after the inclusion of all types of photons the
stimulated emission rate is given by:
Wri TI (w2)|P.4p17/3€oh”
= By pl(w), (13.56)
where w again satisfies the energy conservation condition (Equation 13.53).
The Einstein coefficient for stimulated emission B, <p is precisely the same
as that for absorption Bg.,. It is emphasized that this emission only occurs
if there is radiation of the appropriate frequency already present and this is
why it is said to be stimulated (or induced).
When 7, vanishes, Equation 13.55 reduces to the transition rate for
spontaneous emission. This equation is for one type of photon only and must
be summed over k and i. The sum over k may be replaced by an integral
using the prescription 13.44 and the sum over \ and the angular integration
may be performed as for the absorption case, since all directions of emission
and polarization are equally likely. This gives:
\I w?|Ppal?/3nc%
€oh
eA ey (13:57)
y
ce A e
B
A+Y—B8 B-—Aty
absorption emission
At+tyty-B. (13.63)
Thus, the initial state is |A, m,,,/x’,’) and the final state is |B, m,, —1,
ny/x’ — 1). The transition rate wy; is given by Equation 13.23, so that a non-
zero reaction matrix element R;; is needed. The first term in the expansion
13.24 of Ry; is V;;, but this vanishes. In the first place the perturbation
(e/m)(A.p) only connects states differing by one photon. Secondly, although
(e?/2m)A? connects states differing by two photons its matrix elements are
proportional to (B|A), which is zero in this case; this depends on the use of
the long wavelength approximation, but higher terms in the expansion of
exp (ik.r) make negligible contributions.
For a non-zero R;; it is necessary to go to the second term in its expansion
and this is just:
z Vin Vil (Ei — En). (13.64)
A few words are appropriate about the molecular states J, which are called
intermediate or virtual states. They are definite molecular states and as such
they are eigenfunctions of the molecular Hamiltonian. However, these states
cannot be observed in the two-photon process. Over the entire process energy
is conserved, but it is not necessarily conserved while the molecule is in an
intermediate state. The reason for this is that the lifetime of an intermediate
state is so short that Heisenberg’s uncertainty principle allows there to be some
uncertainty in its energy. Strictly speaking, the intermediate states include
continuum states and an appropriate integral over these states should be
added to the sum over the discrete states in Equation 13.65; however, they
are usually unimportant because the corresponding energy denominators are
very large.
B B
(a) T ig
sf Y
A 1 A
Y Y
A ye A
(b) a
B 1
(c)
/
if
16 A B
BoAt+yty (13.67)
are given. Finally, the negligible absorption and emission processes due to the
perturbation (e?/m)A? are described by the diagrams in Fig. 13.3(c); in this
case there are no intermediate states, since the operator A? connects states
differing by two photons and in principle can contribute to the first term
V;; in the expansion 13.24 of Rj.
In the two-photon processes discussed in the last section both photons are
either absorbed or emitted. However, it is possible for one photon to be
absorbed and one to be emitted:
A+ y>Bty; (13.68)
this process is referred to as scattering. If y and y’ both have the same energy
and A = B we have Rayleigh scattering, while the more general phenomenon
where y and y’ are of different energy and 4 ¥ B is known as Raman scat-
tering.
B uf B
y ‘
je
i rf
” A
(a) (b) (c)
where the completeness of the intermediate states J has been used. Use of
Equation 13.30 now shows that:
Now the first term vanishes, since it is proportional to the matrix element
(Al [(€x,-1), (€x/n'-4)] |A), so that the reaction matrix element R,; is finally
given by:
Although Equation 13.74 has been derived for a single bound electron
R,; has the same form for a molecule and can be rewritten as:
Rei = — [eaten + 1)]? (eo/2Veo) €4,-4A lelA)-€,'n’, (13.75)
where a@ is the molecular polarizability tensor. This concludes our demon-
stration that for low energy photons the Rayleigh scattering may be described
in terms of the molecular polarizability.
For example, in high resolution gas phase studies the Doppler effect and
pressure broadening are important. However, even if these contributions are
eliminated there is a residual line width, known as the natural line width,
which cannot be removed.
Another difficulty is that the reaction matrix element (Equation 13.69)
for photon scattering involves the denominator (E, — E,;+ hw), where hw
is the energy of an incident photon. If Aw is equal to (EL;— £4) for an inter-
mediate state /, then Ry;, and hence the transition rate wy;, both become
infinite. This does not, of course, occur in practice, although in the region
where hww=E,;—E, the scattering probability is observed to go through
a sharp maximum. This phenomenon is called resonance fluorescence.
Both the natural line width and resonance fluorescence may be attributed
to the fact that excited states have finite lifetimes and that this is not ac-
counted for in the theory of Section 13.2. Heisenberg’s uncertainty prin-
ciple, which is stated in Equation 12.60, shows that the lifetime 7 of a state
and the uncertainty in its energy are related by:
These are qualitative considerations, but the results arise naturally in more
advanced theories. Here, we confine ourselves to an unsophisticated demon-
stration that finite line widths are predicted by quantitative treatments. To
do this we consider a system consisting of just two states, a and b. Corre-
sponding to Equation 13.10 there is then a pair of coupled differential
equations:
Ca = — (i/h)Vaycy exp [i(Ea — Ep)t/h]
(13.77)
Cy = — A)Voala exp [iE, — Eq)t/h] }.
To simplify the subsequent development it is assumed that at time t=0
only state a is populated, so that
The first of Equations 13.77 may now be used with Equations 13.79 and
13.80 to determine 7;':
Bibliography
Units
This book employs the “Systéme International d’Unités’ (SI). (It is also known
as the (rationalized) MKSA system, since four of the base units employed are
the metre, kilogram, second and ampere.) A discussion of the advantages of
using this system is given in Section 5.1. Here a brief summary is given, with
particular reference to those units used in this text. For the details of these and
other units, reference should be made to the bibliography.
A.1 SI units
length metre
mass kilogram
time second
electric current ampere
thermodynamic temperature kelvin
luminous intensity candela
amount of substance mole
265
266 Advanced Molecular Quantum Mechanics
Physical quantity
Name Symbol
(c) Derived units are employed for all other physical quantities. They are said
to be coherent, since they are obtained by multiplying and dividing the base
units without introducing any numerical factors whatsoever. Some of these
have special names and symbols:
Physical quantity
energy joule
force newton
pressure pascal ia}
power watt
electric charge coulomb
electric potential
volt
difference
electric resistance | ohm
electric
farad
capacitance
magnetic flux weber (ey
inductance henry
magnetic flux
tesla Ae!
@hesbinc)
<a)
Lap
tes
eect
=a)
density
frequency hertz xN
There are other derived units which have no special names or symbols. Many
of these are obvious, such as that for density (kg m~*), and only the pertinent
electromagnetic units are given here. For completeness electric charge, electric
potential and magnetic flux density are included, although their units have
special names.
Units 267
Symbol for
Physical quantity SI unit Dimensions
electric charge (e
| w I =
electric field strength E Vm!
electric displacement D Cm?
permittivity €(D = €E) Fm! wn Ey N
ile
tles
is | w | _
electric potential ¢ Vv Nn
\
electric charge density p Che w
(d) Decimal multiples (in steps of 10%) of SI units are indicated by prefixes:
A.2 Conversion from the mixed (Gaussian) CGS system to the SI system
(a) The conversion of equations expressed in the mixed CGS system to the
corresponding SI equations only poses problems if electromagnetic quantities
are involved. For the situations considered in this book the conversion may
be accomplished using the following table, the quantity in the CGS column
being replaced by the expression in the SI column. Before the conversion all
quantities must be expressed in mixed CGS units and after the conversion all
must be in SI units.
268 Advanced Molecular Quantum Mechanics
This table must not be used to provide numerical conversion factors for units,
since powers of 10 must be introduced to allow for the change from centi-
metres and grams to metres and kilograms; this topic is considered below.
(b) Conversion factors for units are given in the following table:
energy erg rg |
calorie* : 4.2)
wave number (cm7') 1.986001 0. 29
electron volt 1.6022 X10 @ 1.6022 X
(ev)t Oy SA
force dyne 10° 10°°N
electric charge+ e€.$.u. (4765) 10272 33334C10 Ee
electric field e.s.u. cm7! cro? 2.998 X 10?
strength+ Vm"!
electric dipole debye (108 (Ameo)? 107 72” 3:334x%10-8
moment €.s.u.cm)
magnetic flux gauss (G) (el 4a) O10"
density
Fundamental Constant
Bibliography
The Symbols Committee of the Royal Society (1971), Quantities, Units and
Symbols, The Royal Society, London.
McGlaschan, M. L. (1971), Physico-Chemical Quantities and Units (second
edition), The Royal Institute of Chemistry (London), Monographs for
Teachers, number 15.
Taylor, B. N., Parker, W. H., and Langenberg, D. N. (1969), ‘Determination
of e/h, using macroscopic quantum phase coherence in superconductors.
Implications for quantum electrodynamics and the fundamental physical
constants’, Rev. Mod. Phys., 41, 375: with the exception of the electron
g factor (see Section 9.5) the values of the fundamental constants quoted
here are taken from this reference.
Coulson, C. A. (1958), Electricity, Oliver and Boyd, Edinburgh: Chapter 14.
APPENDIX B
General relations. The scalar ¢ and the components of the vectors A, B,....
are all assumed to commute with one another; V is the vector operator with
components (0/dr;).
Vir = 3,
270
Vector Relations in Three Dimensions 271
Vi/r) = —+/r’,
Vi/r) = —V.@/r?) = — 475(9),
The derivation of other relations is often facilitated by the use of €;;, (see
Chapter 2). The following sum rules may be useful:
€kEimn = On Rn On Obs
€yrEyn = 25en>
€ik Eijk = (os
APPENDIX C
General Bibliography
The following references are cited a number of times in this book. They are
referred to by the authors’ names in the bibliographies at the ends of the
chapters.
Die
Indexes
=
_<
#7 4S
,¢ e264 x
Ly =
*s
=. _
- =
~ < SS
S ~
SeeSn a
as =
— =
=~ _
—=
ec oe tarot: 4 a at
—— . ne Ca S pits - ne ii Soa wh ee , r .
i ~
~ My
~
ny = Pe OF P| e= Sede
sai
is
sis Sf adh, TS
oe
ie oad — sero
S
a -
: j
iP. ‘We @
siGme,
on (e0
Ae”
a ¢ Com pe, : >
™ Pati’
Ex, "Ol Ses
Author Index
2S
276 Advanced Molecular Quantum Mechanics
Landau, L. D., 12, 47, 62, 84, Sakurai, J. J., 103, 124, 153,
NOS), 272 224, 239, 240, 264, 272
Langenberg, D. N., 269 Salpeter, BE. Ex) 178s 1805192;
Leighton, R. B., 36, 47, 62, 84, 2245272
D2 Sands, M., 36, 47, 62, 84, 272
Levine, M. J., 178 Schiff, L. I., 240
Levy, Dal Us. 93 Series, G. W., 224
Lifschitz, E. M., 12,47, 62, 84, Shorey BD =aWalo2
LOS e202) Sikolov, A. A., 178, 224
Lowdin, P-O., 153, 154 Soliverez, C. E., 193
Strandberg, M. W. P., 193
Margenau, H., 36 Strocchi, F., 224
McGlaschan, M. L., 269 Sutcliffe, B. T., 178
McLachlan, A. D., 193 Szabo, A., 224
McWeeny, R., 12, 153, 178
Menzel, D. H., 192
Taylor, B. N., 269
Messiah, A., 192
edder i Me ain
Miller, T. Aq, 178, 193
Ternov, I. M., 178, 224
Moss, R. E., 154, 192
Thomas, L. H., 84
Murphy, G., 36
Murrell, J. N., 12, 224
Uhlenbeck, G. E., 12
Panofsky, W. K. H., 84, 103
Parker, W. H., 269
Van Vleck, J. H., 153, 193
Pauling, L., 12
Velenik, A., 224
Phillips, M., 84, 103
Powell, R. E., 224
Presenta ReaD tis Wallace, R., 264
Ine, Wil, lel, Ik. IOVS} Walter, J., 12
Wesley, J. C., 178
Ramsey, N. F., 192 Wilson, E. B., Jr:,.12, 192
Rich, A., 178 Wouthuysen, S. A., 153, 154
Rindler, W., 62, 84 Wright, J., 178
Rose, MoE. 12, 153, 154, 178;
224, 272
Royal Society (London) Symbols Ziman, J. M., 239
Committee 269 Zivkovic, T., 224
Subject Index
2a,
278 Advanced Molecular Quantum Mechanics
\ 4
\
\
\ \
}
\\
iH‘\
\
\
im
:
.
=F
Studies in Chemical Physics
A series edited by A. D. Buckingham, Professor of.
Theoretical Chemistry, University of Cambridge