Barrett-1991-Holonomy and Path Structures
Barrett-1991-Holonomy and Path Structures
9, 1991
PROLOGUE
The following text was written in 1985, forming the author's Ph.D.
thesis. In preparing it for publication I have edited and amended it in a
number of places, particularly updating the citations of work which has
appeared since 1985, or has been brought to my attention since then. But I
have tried to keep to my original intentions regarding the particular point
of view about fundamental physics which I tried to express in 1985.
The work in Section 2, on Yang-Mills fields (connections), has been
developed in innumerable ways over the years from Dirac onward, with
many of the developers working separately, it seems, particularly those in
the physics community. The introduction provides a guide to the literature
which I know about on this subject--but is surely not complete, and I expect
that more of this fragmented literature will come to light as the years pass.
The material in Section 2 provides the necessary synthesis for the two sec-
tions which follow.
~Department of Physics, The University, Newcastle upon Tyne, NE1 7RU England.
1171
0020-7748/91/0900-1171$06.50/0 ct) 1991 PlenumPublishingCorporation
1172 Barrett
9 ~p,(2i) i_<1/2
l ~ p 2 ( 2 i - 1) i>1/2
Holonomy and Path Structures in GR and YM Theory 1173
1. I N T R O D U C T I O N
Quantum gravity has very much taken its cue from the conventional
field point of view. One considers the space of all metrics on a given manifold
and tries to form a statistical, probabilistic, theory on this space, which
satisfies the conventional ideals of a quantum theory. In other words, the
attempted theory purports to deal only with the pure field, and does not
consider as part of its scope the measuring bodies which Einstein considered
to be basic to his conception of "space." We know now that this conception
of quantum gravity has all sorts of objectional features (which, of course,
may be due to other aspects of the theory) (see, e.g., Isham, 1981). One of
the unpleasant aspects is the necessity of considering the diffeomorphism
symmetry of the configuration space. This arises because quantum field
theory is based on the local vector space structure of the configuration
space (Isham, 1984), and so is based on the space of all metrics. However,
diffeomorphism-related metrics are considered to describe the same physical
object, and so one is forced to "factor through" the diffeomorphism sym-
metry. Now this symmetry is an essentially unphysical one, and arises
because the space of all metrics is really the wrong concept, it does not have
a one-to-one correspondence with real physical quantities. What one should
really work with are the set of distinct geometries. The diffeomorphism
symmetry is a symmetry of a representation, via a particular differentiable
manifold.
This article is about a different representation of the geometry of the
gravitational field, one in which the test bodies play a crucial role. It is an
analysis of what the relationships between the measured quantities of the
motion of the test bodies are. It describes the effect of the geometry of space-
time on the geometries of the test-particle motions. The key idea is suggested
by the quote of Einstein above: a point on a manifold is defined as the set
of all particles which arrive "there." In other words, we have to say what a
particle path is, and what it means for particle paths to be "in coincidence."
Particle paths shall be defined so that they all start at the same place.
A particle path is defined by its geometry--the specification of the angles
through which the particle bends, in which directions, and at what proper
distances along the path. In other words, the intrinsic geometry (proper
distances along the path) and extrinsic geometry (parallel transport of vec-
tors along the path) are specified.
The information in the gravitational field is involved in grouping to-
gether, in a particular way which will be described more fully below, all the
possible particle paths into sets which represent the particles whose paths
are "in coincidence." These sets form the points of the space-time manifold.
This is actually all that is required! What one finds is that the geometry of
the particle paths is sufficient to specify the geometry of the resulting space-
time. The result is a theory of gravity in which the test particles are firmly
Holonomy and Path Structures in GR and YM Theory 1175
mixed up with the phenomenon of the field itself. In fact, the field is "made
out of" the motions of the test bodies. There is no conception of an inde-
pendent gravitational field divorced from the theory of the propagation of
matter.
So the presence of gravity is felt by the fact that it alters the sets of
coincident particle motions. The coincidence is specified by an equivalence
relation on the set of all possible particle geometries. I have termed the
information that goes into this equivalence relation the "holonomy" of the
gravitational field, the word holonomy being using in rather a loose sense.
The point is that two particle paths end at the same point if they form a
closed loop on the space-time manifold, which starts and ends at their mutual
starting points. So to specify the equivalence relation one needs to know the
geometry of the particles which move in closed loops only, and in addition
the Lorentz group holonomy element (in the strict sense of the word holono
omy) of the closed loop. This is the way in which the gravitational "field"
is specified. The mathematical details of this are best left to the following
sections.
So, with the motivation the analysis of gravity, why is a large section
of the article about classical Yang-Mills fields? The reason is that Yang-
Mills theory is very similar to general relativity, but, as is usually the case,
it is a simpler theory. The Yang-Mills result (the representation theorem of
Section 2) both provides a comparison with general relativity, and also a
technical result which is of use in building the linear connection of the
gravitational field. Moreover, the technical tools used on the way in Section
2 bear a strong resemblance to, and in fact motivate, the techniques of
Section 3.
Table I contains a comparison of the main features of Yang-Mills
theory in Section 2 and gravity in Section 3; the notation has been deliber-
ately chosen to enhance the similarity.
The strong resemblances displayed in the comparison throw fresh light
on the debate about the status of gravity as a gauge theory. From the point
of view of holonomy, gravity has no diffeomorphism symmetry: one just
specifies the set P and the map h. The "gauge freedom" of gravity is still
present, however, in a rather subtle form. The set P (see Section 3) contains
the geometries of the paths which form closed loops. Suppose, for the sake
of argument, that the paths are composed of n linear sections (straight lines).
Then, to specify an element of P (and hence give information about the
gravitational field) one can arbitrarily specify the first n - 1 pieces. Then the
geometry of the last section of the path is entirely fixed by the requirement
that it be in P. We can say that the gauge freedom of gravity is the freedom
to specify the first n - 1 sections at will, and the real information (the "holon-
omy") is the fixed value of the geometry of the last section. This notion of
1176 Barrett
Table I
Construct YM Gr
G Structure group Lorentz group
M Base space Tangent space
Geometry of loops f~M P
Holonomy mapping H: ~ M ~ G h: P - ~ G
Equivalence p(1) =p'(1), There exists 7r~P:
relation on P M x G, g' = H ( p - ~ o p')g zr= ( H ( ~r)p- ~) o p',
(p, g) ~(p', g') g' = h(zc)g
Constructed set P M x G / ~ Bundle Manifold and frame bundle
Lifting Lifting function l. Inverse development
map A, lifting in
frame bundle
whose smoothness gives Chartson the bundle Charts on the manifold
and which d e f i n e s Connection Metric and connection
2. T H E H O L O N O M Y R E P R E S E N T A T I O N OF GAUGE FIELDS
2.1. Introduction
the configuration space is the space of orbits of the action of the gauge group
f9 in the space d of all connections on some given bundle. This orbit space
has been much studied" because one can view Yang-Mills quantum field
theory as a theory of integration on the orbit space, albeit in an imprecise
fashion (Atiyah, 1980; Singer, 1981; Babelon and Viallet, 1981). There is
also a canonical version, where geodesics on orbit space correspond to the
Hamiltonian flow (see also Narasimhan and Ramadas, 1979).
For good physical reasons, then, one would like to have more infor-
mation about the configuration space d/fg. To achieve a different charac-
terization of the configuration space, it is possible to consider the holonomy
mapping of the connection as the fundamental object. This mapping takes
the loop space f~M of the base manifold M into the structure group G of
the bundle by mapping a loop into its holonomy element. The loop space is
the set of all piecewise smooth paths in M which start and end at an arbi-
trarily singled-out point, d e n o t e d . , in M, with a topology which will be
discussed later. The holonomy element of a loop is defined in terms of the
horizontal lifting of the loop into the total space of the bundle. The two
endpoints of the lifting define a G-element which translates one of the end-
points to the other, using the G-action on the bundle. This is the holonomy
element. Section 2.2 describes how a connection gives rise to the concepts
of horizontal lifting and holonomy, and why the machinery of bundles is
the most appropriate.
Discussion of the holonomy in electromagnetism goes back to Dirac
(1931) in the context of magnetic monopoles. Aharonov and Bohm (1959)
discussed parallel transport as a phase shift of a Schr6dinger wavefunction,
and recognized the physical significance of the holonomy operator for a
closed loop, being more directly involved in the behavior of the Schr6dinger
wavefunction than magnetic forces. Some of these ideas were used in an
attempt to find gauge-invariant quantizations of electromagnetic and other
gauge fields (Mandelstam 1962a,b, 1968a,b; Bialynicki-Birula, 1963).
There are two important facts about the holonomy mapping. First,
different physical configurations give rise to different holonomy mappings,
and second, the set of restricted gauge transformations fg. c fg, that is, ones
which act trivially at the basepoint, leave the holonomy mapping invariant.
This means that, if we consider gauge equivalence to be restricted in this
way, the physical configurations are faithfully and uniquely represented by
their holonomy mappings. The residual gauge freedom of field rotations at
the basepoint is parametrized by G ~ fg/fg., a finite-dimensional Lie group,
in contrast to the original infinite-dimensional gauge group f#. Thus, the
restricted, or pointed, configuration space d/fg. is equivalent to a set of
certain types of mappings OM~G, namely those which arise as holonomy
mappings.
1178 Barrett
if there is a bundle isomorphism B~B' taking F to F' and b to b', and such
that it is the identity on M. This definition is more useful here, since the
holonomy mapping involves only the base manifold M in its definition, the
reconstruction process manufactures "new" bundles as well as connections.
Note that since a preferred point in the fiber over 9 is preserved, these
triples are equivalent to orbits of the restricted gauge group in the space of
potentials. To be precise, if C is a particular bundle with basepoint c, and
~ - c ~ ~- is defined as the subset of triples (B, F, b) where B is isomorphic
to C, and d and f#, are the potentials and restricted gauge group of (C, c),
then ~ .c is in bijective correspondence with d / f # . , in the obvious way.
The two main results are as follows.
similar to the one used here, differentiable bundles and infinitesimal connec-
tions, but Kobayashi's axioms do not seem to include any mention of differ-
entiability (see H3 below). The reconstruction theorem appears in a more
general guise in papers on bundle topology which appeared shortly after-
ward. Milnor (1956) considered locally trivial bundles ("fiber" bundles) over
a simplicial complex, with a bundle slicing function playing the role of a
connection. Milnor's analogue of H2 is particularly elegant. The construc-
tions are all topological, rather than differentiable.
The most general setting is that of Lashof (1956), who considered topo-
logical principal bundles (not even necessarily locally trivial) with a general
lifting function playing the role of the connection. The lifting function is
much more general than an infinitesimal connection because the lifting may
not commute with the structure group action. This is also connected with
the fact that in this formulation, paths which differ only by reparametriza-
tions and other similar operations (cf. H2) are not considered to be equiva-
lent. Lashof has a notion of equivalence of maps ~ M ~ G which renders the
equivalence classes in a bijective correspondence with the set of inequivalent
G-bundles over M. This is thus a coarse version of the representation
theorem, which does not distinguish between different connections on the
same bundle. Interestingly, Lashof's constructions are rather close to the
gravitational constructions presented here.
The differential aspect of the subject was explored by Teleman
(1960, 1963) shortly afterward. He later wrote two papers which are
probably the most comprehensive overview of all of these sorts of results.
In the first (Teleman, 1969a), the reconstruction theorem appears (Theo-
rem 3) in a general guise, a connection being a lifting function which
satisfies analogues of H1 and H2. Various special cases (simplicial, differ-
ential, complex analytic) are discussed in the second paper (Teleman,
1969b).
The constructions were later rediscovered by physicists (Giles, 1981;
Anandan, 1983; Barrett, 1985, 1989; Fischer, 1986). For the most part, they
confined themselves to the algebraic part of the constructions rather than
the topological aspect. Chan and Tsou (1986) and Chan et al. (1986) found
an interesting extension of the reconstruction to presenting the data in the
form of a connection on loop space f~M, with applications to monopoles._
The last paper also contains references to many other associated papers in
the physics literature. It is interesting to note that Dirac's (1931) paper on
monopoles discussed a homomorphism f~M~ U1, with M three-dimensional
Euclidean space minus a point, which by the reconstruction theorem immedi-
ately translates into a connection on a bundle, the modern understanding of
monopoles. The inequivalent bundles correspond to the different monopole
charges which Dirac found.
1180 Barrett
n(co2 o o)1) = n ( c o l ) H ( ~ )
H(co) = exp f v iF dx I A dx 2
~f(s)t(1 - t) exp(it/s), sr 0
Vt(s, t)
(0, s=0
where R 2 has been identified with the complex numbers. The parameter t is
the time along the path, and s is the family parameter. If f is a continuous
real function with f ( 0 ) = 0, then ~t is continuous, and so fit is continuous in
the compact-open topology. The family describes a continuous set of loops
which, as s ~ 0 , simultaneously wind around faster and faster, and shrink to
zero radius. The area integral is easily computed, putting r(t)=f(s)t(1- t)
and O(t)= t/s, then the area enclosed by loop s is
f If2(s)/6Os, s~ 0
It2 dO = ~0, s=0
and this is exactly where the topology and differentiable structure of space-
time come into use. 2 Thus, we want to postulate that the holonomy is a
smooth map (~4)n~G. So, to consider the space o f piecewise linear loops,
the smoothness requirement is:
For any integer n > 0, if V: (~4) n--*~2M is the family of piecewise linear
loops with n + 1 linear pieces, then the composite map HVt: ~ 4 ~ M ~ G is
a smooth map.
It would be quite possible, and physically well motivated, to work with
the space of piecewise linear loops. However, the linear structure of the base
space (here R 4) is not really relevant, and we want the results to hold equally
well on any differentiable manifold. It is also technically rather inconvenient.
So the remedy is to allow the above situation to relax under arbitrary
diffeomorphisms of the base space.
The final form of the axiom H3 encapsulates the notion that a smooth
finite-dimensional family of loops has a smoothly-varying holonomy image
in G. A smooth finite-dimensional family of loops is a map ~: U~f~M with
U an open subset of ~ for any n, which is smooth in the sense that the
associated map
~t: Ux I ~ M
(u, i) ~ ~(u)[i]
is continuous, and smooth (C ~) on the subintervals
U• for io=O<il<...ik<l=i~+l, n=0,1 ..... k
H3. For any smooth finite-dimensional family of loops ~t: U~f~M, the
composite map H ~ : U ~ f ~ M ~ G is smooth.
The collection of all such maps ~t from any such U into f~M defines the
induced topology on f~M. This is the finest topology which makes all these
maps continuous. Any map which obeys the axiom H3 is continuous
(Spanier, 1966; Dugundji, 1966).
21 amindebted to Dr. R. W. Tucker for bringing to my attention the fact that if parts of the
loop consideredare spacelike,then it does not have the direct interpretation of being composed
of particle paths. One just has to consider it as a figure in space-time, perhaps as marked out
by, and measured with, the aid of auxiliary particles.
1184 Barrett
fibers of bundle
(o, 1) ~o
I !
I
Fig. 1
without altering the endpoints of the lift in Bh. The map h carries p to the
path 0 in M and q to the trivial path in M (Figure 1). Since the canonical
homomorphism B h ~ B carries horizontal lifts into horizontal lifts, it follows
that there is a horizontal lift of 0 in E which has the same endpoints as the
trivial lift of the trivial loop in M, and so the endpoints of the 0 lift'are the
same point. Hence the holonomy of 0 is the identity element of G. It should
be noted that the homotopy h may be smooth only on subintervals
I x [in, in+ 1], but that the conclusion is unaffected.
If r then H(c01 o c0~-l)=id, and since H(c0zl)=H(co2) -1, it
follows, using H1, that H(o,~)= H(c0~). Property H3 follows from a slight
modification of the proof in Kobayashi and Nomizu (1963, p. 74).
Since these conditions are satisfied, the representation map ~ : ~,~,~/f,
which takes (B, F, b) to its holonomy map, is defined.
(P, g)-~q
with PE the space of paths in E starting over ,, and q is the path
q: I~E
i--+{K(p, i), g}
the contraction K(p, i) representing the section of path p from 0 to i:
K: P M x I-+PM
K(p, i)[j] =p(ij)
Now having a preferred point b in the fiber z - l ( , ) provides an iso-
morphism between this fiber and the group G. So the lifting function l, gives
1186 Barrett
for each path p in the base starting at * and a point g in the fiber over 9 a
path in E starting at g which is carried back onto p by the projection zr. For a
differentiable bundle with a connection a lifting function is given by assigning
l , ( p , g) the unique horizontal curve starting at g which covers p. The lifting
described here may, a priori, not be of this type. But with the aid of axioms
H I - H 3 we will show that it is.
As yet E has no differentiable structure. Suppose U is a contractible
open set o f M. Then there exists a smooth family of paths gt: U--*PM such
that gt(u) ends at u. Using the endpoint map e: P M ~ M , e ( p ) = p ( 1 ) , this
condition can be restated as egt= id.
The chart Cv for Jr- 1(U) c E is
For the present, we shall assume the proof of Lemma 3, which states
that l, is the horizontal lifting of a connection on B. It is clear that the
connection is uniquely specified by the lifting function. Thus, the construc-
tion of a triple (B, F, b) is complete. This defines the construction map
~: ~ - o ~ , .
Proof o f the Reconstruction Theorem. To complete this proof, it remains
to show that ~cg= id, or, in other words, that if (B, F, b) is constructed
from H ~ and has holonomy mapping H', then H = H ' . This is
straightforward. []
Proof o f the Representation Theorem. The reconstruction theorem
shows that ~cg=id, so it remains to show that <g~=id. Suppose that
~(B, F, b) = H and C~(H) = (B, F, b), b = (e, Jr, m, g), a n d / ~ = (E, ~, M, G),
and that l, is the lifting function of (B, F, b). Consider the map
[* e'
PM x G ~ PE ' E
This factors through the relation Rn on P M • G to give a map ~b:/7~E. The
map ~b is a bijection and commutes with the G-actions on E and E.
Lemma 2. c~ is a diffeomorphism.
Proof To show that ~bis a smooth bijection, it is enough to show that
the charts Cv defined for/~ map smoothly to E. The map ~bCv is
(cP,id) l, e'
UxG ~ PMxG" ' PE ' E
which is smooth due to the fact that a smooth family of paths lifts to a
smooth family. Since ~b is the identity on M and an isomorphism of the
fibers, it is clear that ~bCV,is a chart for E, and so q~is a diffeomorphism. []
It is also easy to check that the induced mapping of paths takes the
lifting function of (B, F, b) into the lifting function of (B, F, b). Hence 4) is
a bundle isomorphism preserving M, F, and b.
Since the lifting property of l, guarantees that ~,Fc = id, it follows from
Lemma 3 that F defines a connection.
Proof of Lemma 3. The first point is that the lifting defined by l, is
local in the sense that the lift of a section J c I of a path p c P M depends
only on the section pllJ o f p . More precisely, i f p ' c P M agrees with p on J,
then on J the lifts o f p and p' are related by right translation with a fixed G-
element. So the tangent vector of a lift at a point c c E can depend only on
the classes of paths in M which agree in some open neighborhood of ~(c).
For example, it might depend on any of the derivatives of the path p at the
point 7c(c). It shall be shown, however, through Lemmas 4-8, that the three
conditions H1-H3 in combination are sufficiently restrictive for the tangent
to the lift to be determined only by a linear mapping of the first derivative
o f p at r
Lemma 4 eliminates a possible pathology of the holonomy mapping
(see also Section 2.5).
Lemma 4. Suppose ~t: I ~ f ~ M is a smooth one-parameter family of
loops with r t, the trivial loop; then (d/di)(H~t)(O)= O.
Proof Since only the behavior of ~t near the p o i n t , c M is important,
we can assume without loss of generality that M is the m-dimensional vector
space Nm with * as the origin. The strategy is to embed t~ in an m-dimensional
family of loops &: Im"--~'~ m. This is defined by its associated map co
co: I m x I--* ~m
(Hv,) (H A)
n GSn
~'"I ~ . (1,1)
M
M IxI ) ~(O,s)
Y
Fig.2
t ~ ( 0 , 0 , . . . , N(s,, t) . . . . ,0) is a thin loop, and the holonomy element is
the identity for all values ors,. Hence ff/Ss,)(H(o)(O)= 0. Then the conclu-
sion of Lemma 4 follows. 9
Returning to the proof of Lemma 3, the local property of the lifting
means that it is possible to define the lifting of all paths in M, not only the
ones which start at .. This lifting has the product property: If q lifts p, q'
lifts p', and q(1) =q'(0), then q'o q lifts p' op. It also enjoys the obvious G-
invariance and reparametrization invariances. There is a smoothness prop-
erty which is discussed more below, and a thin equivalence property: If q
lifts p, q' lifts p', p is thinly equivalent to p' [p- 1 op, is a thin loop based at
p(0)], and q(0)=q'(0), then q(1)=q'(1).
Instead of considering directly the lifting of tangent vectors, it is useful
to consider, as a generalization, the lifting of one-parameter families of paths
~: I ~ M I [MIc Map(/, M ) denoting the unpointed piecewise smooth path
space] which shrink at parameter zero to the constant path at some point
in M (Figure 3). Such a family is specified by the smooth map u/: I x I ~ M ,
with N0, t)= x. The first factor will be denoted s, the shrinking parameter,
the second, t, is the time along the path. The quantity
v'-v ~ 1) 8~,(0, O)
Os Os
generalizes the tangent vector of a path in M. For ifp: I--*M is a path, then
putting ~s, t) =p(st), it follows that v ~= (dp/di)(O) and v~= 0. The vectors
v~ v ~ will be called the boundary vectors of gt.
Now suppose that ~: I x I ~ E lifts ~t, i.e., for fixed s, the path ~. (s, t) is
a lift of ~t(s, t). The function s~;~ (s, 0) determining the starting point of the
1190 Barrett
t
a/~ 1)
lxI (0 0) M -
generalized
tangent
vector
Fig. 3
Since t ~ Vt(0, t) is the trivial path, putting c = A (0, t), both w ~ and w~ lie
in T~E.
The first part of Lemma 3 is a consequence of the following result.
W 1 -- W0 ~--.rC(~)l -- /.)0)
k=o
ag~(~
ds
Using the Leibnitz rule on the defining equation of the g's gives
w~ = w~ + b dgk(O___))
ds
g.
bo
@~r ~176
I , t ,
V(s, 1 ) + ~ [V(s, 1 ) - ~ (s, 1)1, s#O
/
~2 \ ds 2 ds 2 ]' s=O
w~_wo_aa(o,1) aa(O,O)=o
8s 8s
Similarly, w ~ - w~ = 0. 9
The next lemma shows that only the difference v 1 - v~ is important.
L e m m a 8. Suppose we have two smooth one-parameter.families of
paths ~ and ~t' in M shrinking to the point *, with lifts 2 and Z' shrinking
to the point b ~ n - l ( * ) just as in Lemma 7, but with v ' 1 - v'~ v I - v~ then
it follows that w 'l - w '~ = w ~ - w ~
P r o o f Again we can suppose that M = Rm, with * = 0. Suppose four
vectors v 1, v~ v'1, v'~ ~m are given, with v'~ - v '~ v I - v~ The two functions
v a n d ~': I x I ~ M
g: (s, t)--*s[(1 - t ) v ~ tv 1]
and ~'. The 9 is defined by mapping I x I x I into R'~ by the linear map which
extends
~t and ~,' factor through q~: gt=q~cr and ~'=qbcr' with cr and
or': I x I ~ I x I x I defined by
If 2 . lifts q~, then ~ , a and 2oct' lift ~t and gt', and hence ~*(0, 0, 0)(4) =
w1 - w~ w' 1 - w'~ The lemma is proved for the particular maps N, N' con-
structed here by linear interpolation in t. But Lemma 7 shows that any other
families of paths will give the same result. I
lxl lxlxl M
Fig. 5
1194 Barrett
If v ~, v ~ Vtl, and v'~ are the boundary vectors of ~ and g', then v~ v'1, and
v ~ and v '~ are the boundary vectors of Z. The product property of the lifting
implies that
and so F is linear.
The conditions of the lemma are thus satisfied at the point c = b. The
same proof can be applied at other points because the holonomy mapping
based at other points in E obeys the same axioms H1-H3. 9
2.1.5. Conclusion
I/ p M
Usually one can freely specify the starting point 5t (0) of 2 in z-~(p(O)).
The lifting is usually required to vary continuously with respect to both the
path in M and the starting point in E, but the details of this will be omitted
here.
For a gauge theory (defined by potentials on the base) one can define
the parallel transport map r: MI~G. The horizontal lift of a path p into the
trivial bundle M x G starting at (p(0), g) is the path i~(p(i), gr(K(p, i))),
where K is the contraction in path space: K(p, i) is the path which traverses
the section [0, i] of path p, i.e., K(p, i)[j] =p(ij). The holonomy of a loop,
which was defined in Section 2.1, can now be seen to be just the parallel
transport operator for the loop.
Any lifting function can be specified by giving, for each path p in the
base space, a vector field w in the section of the bundle above the path, so
that the lifts A, are the integral curves of the vector field. The vector field
must project down onto the tangent vector field of p in the base, and so it
is determined up to an arbitrary vertical component. The horizontal lifting
is a very special type of lifting function. For each point ceE, there is a linear
mapping Fc: T~(c)M~T~E such that for any path p, w=d~/di=F(dp/di).
The image of F is called the horizontal distribution of TE, and at a point e
it defines the horizontal subspace of TeE. To put this another way, the
important point is that for any two paths passing through the same point,
p(i) = z(c), and having the same tangent vectors at that point, the tangent
vectors of the lifts agree. The vector w depends only on the tangent vector
of the path at that point, and on no other feature, local or nonlocal, of the
path. That is what distinguishes the horizontal lifting of a connection from
any other lifting.
After this slight digression on lifting functions, we come back to the
original question: why are bundles essential? The short answer is that in the
holonomy representation nontrivial bundles automatically appear alongside
the trivial ones (we are supposing that the base space actually has some
nontrivial bundles, so that it is not a contractible space, as, for example, ~n
is). The reason for this is that the concept of parallel transport "unifies" the
infinitesimal aspect of the connection one-form (on the base space) with the
topologically nontrivial global aspect of the finite transformations of the
transition functions on overlapping charts. In bundle language it "unifies"
l 196 Barrett
the connection and the topology of the bundle. To put it crudely, the only
difference between an electromagnetic field on a circle and a Moebius band,
as far as parallel transport and holonomy are concerned, is that in one case
the structure group (U1 or ~) is connected and in the other case (Z2) it is
not. Conventionally, one would ascribe the holonomy in the first case to the
presence of a field, and in the second case to the topology of the bundle.
When one comes to consider the set of all holonomy mappings, it is rather
artificial to separate out those that belong to different bundles. For different
connections on the same bundle, the holonomy mappings are in the same
homotopy class, so perhaps one could restrict attention to only one homo-
topy class of maps. This seems a cumbersome idea, and since there is nothing
to be lost by considering different bundles, it is not pursued.
There is a second reason for using bundles in Yang-Mills theory. The
bundle constructions have a very deep analogy with the reconstruction of
the manifold of general relativity, which is described in later sections. The
analogy is that the base space of Yang-Mills theory corresponds to the
tangent space in gravity, and the total space of the bundle corresponds to
the differentiable manifold of general relativity. Thus, to explore the relation
between the two theories, it is essential to use the theory of bundles for
Yang-Mills fields.
0),-~ a ) o t
(.0 o ( 0 - 1 ~ t
where f~ is the loop on the left-hand side of the three homotopy equivalences.
The map h is not continuous, but the overall map f~h is. The homotopies
are thus actually all thin equivalences, and so f ~ M / 0 (0 is the thin equivalence
Holonomyand Path Structures in GR and YM Theory 1197
where the integral is over the region V which the loop bounds. One can
make this expansion precise: If gt: I ~ M is a smooth family of loops with
gt(0) = t, then the second derivative of the holonomy is related to the curva-
ture tensor F on the base manifold by
d2(H~)
as 2 (0)=F(A)
= -~2
- ~o xtUdx vl = -~2
- f v dx EuA dx vl
A'v Os2 v Os2
d(H(g)
- - (0) = 0
ds
This is the content of Lemma 4 of Section 2.1.
Note: The quantity A vv is a tensor at the point ,. One can take the
~2/0s2 inside the integral to define it as
fo dxt~ d2xVl
dt ds ds dt
3. H O L O N O M Y AND GRAVITY
3.1. Introduction
The last section was concerned with Yang-Mills theory in its own right.
This section presents the analogous constructions for general relativity. The
Yang-Mills work provides both the general framework for a similar treat-
ment of gravity and some specific results which are of use in this section.
The work on gravity is not complete in the same way as the Yang-Mills
results, and so a less formalized presentation is used. The difficulties are
of a fairly technical nature; mainly questions about differentiability and
differentiable structure, and are still open problems. However, the goals are
the same: we are looking for the holonomy representation of a gravitational
field configuration, an axiomatization of holonomy, a reconstruction theo-
rem for the field configuration, up to a diffeomorphism, and finally a repre-
sentation theorem for the gravitational configuration space. Perhaps the
reader can keep in mind the more formal development of the Yang-Mills
theory.
Holonomy and Path Structures in GR and YM Theory 1199
3.2. Y a n g - M i i l s
E = PM • G/R
3.3. Gravity
The gravitational field is described by a differentiable manifold X with
a metric, and a connection on the bundle O(X) of orthonormal frames. As
before, we pick an arbitrary basepoint * in X and a frame f a t ,, so that the
fiber over X is identified with the Lorentz group G. To achieve the holonomy
description of gravity, we need the idea of development. The development of
a curve in the base space p s P X is defined by horizontally lifting the curve
into O(X). Then, using the canonical ~n-valued l-form on O(X) (the
t
"vierbein"), e, the integral C(t)= ~o e, integrating along the lift of p, gives a
curve in ~ , which is identified v i a f w i t h the tangent space at ,, viewed as
an affine space with a metric, i.e., Minkowski space M. For the rest of the
section, M is now definitely Minkowski space. Intuitively, we can think of
the development as the curve in PM with the same geometry as the path p,
that is, it bends through the same angles at the same proper distances as p.
It has the same intrinsic and extrinsic geometry as p. One can also define
the curve C as the unique path in M obeying C(0)--0 and
dC_.dtrK(P'i~(~t)
where r/~w.,) is the parallel transport map: Tp(,)X~ T,X. Note that a closed
loop in PX will not in general develop a closed loop in M.
Now we can see how to describe the manifold X:
point of X = subset of PM
where p is in x e X if p is the development of a path which ends at x. The
gravitational field is described:
point of O(X) = subset of PM x G
where the G element is defined in exactly the same manner as for Yang-
Mills theories: the frame is parallel transported back to the fiber above ,,
using the connection. This second set of subsets is a very powerful object.
It describes the set X, its differentiable structure, its metric, the frame bundle
O(X), and the connection on it, as will be shown in detail below. In the
same way as for Yang-Mills theory, there is an underlying equivalence
relation yielding these subsets of PM x G as equivalence classes. This relation
Hoionomy and Path Structures in GR and YM Theory 1201
it is clear that P M / R ' is the set X, and that the projection Jr: E---,X defined
by {p, g} R~ {p} R' is intrinsic to the construction. It is also extremely plausible
that the differentiable structure, metric, and connection are implicit in the
constructions. As far as the metric goes, we can see that X is "made of"
sections of paths in M, and so distances can be measured by measuring the
corresponding distances in M. This can be formalized by defining the inverse
development map A,
A: PM--* PX
A(p)[i] = {K(p, i)}
on the open interval (0, ~ ) . The two points 0 in each copy are the two
points which cannot be separated by open sets. The tangent space is R. If
the metric on the manifold is taken to be just the standard metric on ~, the
development of a path on the manifold is the path in E one gets by identify-
ing the two "bottom legs" ( - ~ , 0] of the manifold. Now the holonomy set
will be identical to that of R, as the base manifold, and so the reconstruction
process will give back as the manifold only ~. In other words, the two
geometrically indistinguishable "legs" ( - ~ , 0] get identified. If, on the other
hand, the two legs are given different geometries [this is not possible in one
dimension!--so imagine a two-dimensional example E x R identified
on R • (0, ~)], then the holonomy description may be inconsistent. For
example, two loops with identical development but which are partly in
different "legs" may have different Lorentz group holonomy elements. Hence
the mapping h: P ~ G would not be definable.
Completeness of a manifold can be defined in terms of the development
mapping 6: P X ~ P M which takes paths to their developments. A manifold
is said to be complete if I m ( 8 ) = PM. In physical terms, every conceivable
particle motion, as defined by its geometry, can take place, and ends at some
point in the manifold. This notion of completeness coincides with the notion
of b-completeness (Hawking and Ellis, 1973). Completeness is defined in
this way because for a pseudo-Riemannian metric the manifold does not
have the metric space structure that a positive metric would give. In the
case of a positive metric, the metric space completeness coincides with
development completeness (Kobayashi and Nomizu, 1963).
For the reasons given above, then, a gravitational field configuration
will be defined as a connected, Hausdorff manifold which is complete in the
sense that the development map is complete, with a metric of Lorentz signa-
ture, and a connection on the bundle of orthonormal frames, so that the
connection is metric compatible, but may have nonzero torsion.
Now we turn to the details of the axiomatization. We start with P and
h. From it H: P ~ A is defined in the same way as before: H(p)=p(1)h(p).
Then a product operation 9 on P is defined:
p~ * p2 = (H(p2)pl) op2
and an inverse operation p~/~:
p = H ( p ) - I p -1
The first and second axioms are as follows.
G1 (Homomorphism). If p~,pz6P, then pl *pz~P, and h(pl *p2) =
h(p2)h(pO.
G2 (Inverse). I f p ~ P , then p~P.
Hoionomy and Path Structures in GR and YM Theory 1205
Now at some stage it has to be shown that if P and h are derived from
a gravitational field, i.e., manifold X, metric, etc., then the axioms hold. In
order to motivate the axioms, this will be done as the axioms are stated.
Proof of G1 and G2 for a gravitational field configuration. P is the
development of the loop space of the manifold, and the product and inverse
operations on P are just the product and inverse operations of the loop space.
The homomorphism condition on h is just the corresponding condition for
the holonomy mapping of the Lorentz connection. 9
Since a path which is thinly equivalent to a loop is also a loop, and the
holonomy mapping of a connection agrees on thinly equivalent loops, it is
natural to propose the following.
G3 (Thin equivalence). P contains complete thin equivalence classes of
paths. The map h agrees on these equivalence classes.
Attempted Proof of G3for a Gravitational Field. The proof of this is not
complete, and is one of the technical problems referred to at the beginning of
the section. The proof rests on the following conjecture: For p, p'~PX, p is
thinly equivalent to p' iff S(p) is thinly equivalent to 6(p'). The problem in
proving this is that in general the homotopy which establishes one of the thin
equivalences does not develop (or inverse-develop) to a homotopy which
establishes the other one. This is easily seen to be the case if the image of
one of the thin loops is not a simply connected set. However, one can
establish the proof for a special class of thin loops, ones which can be
transformed to the trivial loop by a finite number of operations of either (1)
reparametrization of the loop, or (2) replacing subsections of the path of
the form p-1 op with the constant path at p(0). It is not actually clear
whether this is a more restricted class of thin loops than the original defini-
tion. There may be a pathological example which shows that it is a restricted
class.
There are three possible ways out of the problem: establish the conjec-
ture, modify the definition of thin loops to a more restricted set (all the
proofs would work with the restricted notion of thin loop given above), or
prove that this restricted set is actually all the thin loops. In any event, the
lack of a proof does not seem too serious. The proof will, however, be
assumed in the following. 9
Returning to the axiomatic development, a few facts can be established,
from the axioms.
Proposition. H agrees on thin equivalence classes.
Proposition. If P is not empty, t~P and h(t)=id, t being the trivial
loop.
1206 Barrett
Proof Define O=p ;-1 op2. Then OsP and H(O)=id, and so
(H(O)p;-') op2=O
that is, p~ "~pz by R'. 9
Before going any further, this is the right point to introduce the smooth-
ness axioms. Clearly, so far there is no reason to suppose that X is a mani-
fold; in fact, with the axioms presented so far it may be very far removed
from a smooth finite-dimensional manifold. For example, take P to be a
proper subgroup of f~R n, for example, say by removing the loops passing
through some of the points of Rn: P = f~U, with U a subset o f R n, and h to
be trivial: h(m)= id for all og~P. Then P and h will satisfy the axioms G 1 -
G3, but the reconstruction will yield something rather bizarre, certainly not
a manifold. The paths which end at points outside U will not be related by
R' to any other paths. So if p is such a path, it will form an independent
point of X. The set X will contain "too many points" in the sense that if
one considers deforming p locally, then X is locally the same "dimension"
as the path space PM. Similarly, one can also imagine that too many points
may be identified, leading too small a dimension for X. It may also happen
that the paths are identified by R' in a chaotic, discontinuous manner, not
admitting any smooth structure for the set X.
With these points in mind, the axiom G4 should be tentatively (and
imprecisely) stated as:
U , PM ~ PX , X
is smooth. At this point, to check that we are on the right track, we can
1208 Barrett
compare this expression with the construction used in the Yang-Mills theory
to construct charts on the total space E of the bundle
(~,id) 1. e'
Cv: U x G , PMx G ' PE' ' E
which is very similar. It has already been remarked that the inverse-develop-
ment function A plays the same role for the manifold construction as l, did
for the bundle construction of the Yang-Mills theory.
Without going too deeply into the technical problems involved [which
start with the fact that ~ U) is not open in PM], we shall assume, as axiom
G4, what should be the principal conclusions of the above tentative proposal.
G4. There is a unique four-dimensional smooth structure on X such
that for any smooth family fi: U ~ P M the map eAfi: U ~ X is smooth. Paths
are mapped nondegenerately into X: if p ~ P M and ( d p / d i ) ( i ) ~ 0 , then
[dA(p)/di](i) # 0 .
The smoothness for the map h is straightforward:
G5. For any smooth family fit: U ~ P , the map hfi: U---,G is smooth.
The proof strategy only for the remainder of the reconstruction will be
sketched, as the work is not yet complete, and a more thorough exposition
would be premature. The main points to show are:
1. A can be used to construct "Riemann normal coordinates" around
any point x ~ X . For example, around the point ,, the family p: M ~ P M ,
p(m)[i] = im (i.e,, radial straight lines) should, in some neighborhood of the
origin, map invertibly to X. The derivative at the origin is nonzero on
account of the nondegeneracy condition in G4.
2. A is a 1-1 mapping of P M to PX.
3. For paths p, q ~ P M , p is thinly equivalent to q if and only if A(p) is
thinly equivalent to A(q).
4. The mapping h can, by virtue of the 1-1 correspondence of P and
D_.X, be regarded as a holonomy mapping I'bY~G. The axioms H1-H3 are
established by the points above.
5. The reconstruction theorem is applied to h. The construction is the
same as that given by the relation R directly on P M x G. The resulting G-
principal bundle needs to be interpreted as the frame bundle of X. At a point
e~ E, the frame is defined as the mapping
Oc: M ~ T.(c)X
dp dA(p) (1)
g~;i (1)-~ di
for any point (p, g ) E P M x G which is in the equivalence class of c. One has
Holonomy and Path Structures in GR and YM Theory 1209
to show that this mapping agrees on all the different possible paths p, and
is linear. This is the same as the problem of defining a connection in the
Yang-Mills theory, and one should be able to use the Yang-Mills result by
defining the affine holonomy mapping D.X~A, and constructing its bundle
(which should be the isometric affine frame bundle) and connection. Its
connection splits into two parts, the Lorentz connection, and the inverse of
the map 0.
The nondegenerateness of 0, which is essential to the notion of a frame,
follows directly from the nondegenerate property in G4.
6. Finally, the metric on X follows once the frames are established,
essentially from the fact that the reconstructed bundle contains only a subset
of frames, the orthonormal ones. The map 0c is used to map the metric on
the Minkowski space M to the tangent space at x on X. These metrics will
agree for all the points c in the fiber above x, because the frames are all
related by Lorentz transformations in M.
4.1. Introduction
where R is the curvature two-form and r the torsion two-form. The two
quantities on the left-hand sides are the Einstein tensor and the modified
torsion tensor. When the right-hand sides are set to zero the equations
become the vacuum Einstein equation and the equation r = 0, expressing the
connection form in terms of the metric.
Let us examine the Einstein equation first. What is needed is a precise
expression of the idea that over a sufficiently "small" three-surface V, so
that the curvature over its extent can be neglected, the integral of the Einstein
tensor over V is equal to the matter energy-momentum passing through the
surface V,
along the unique path which links it with v. An alternative way of looking
at this is that a special type of gauge has been picked, that in which the
connection form o~ is zero in the directions along the curves, and the inte-
gration is performed in this gauge. At the point v, curves radiate in all
directions, and so a~= 0.
The quantity I(V) = 89Sv R~bA ecgabcd' defined by the aid of a particular
spray of curves from v, is the same up to third order in a small parameter s,
for any spray of curves. What this means precisely is that if one has a smooth
one-parameter family of three-disks V(s) shrinking to the point v at s = 0
(i.e., a smooth map to M of a regular cone whose base is a standard three-
disk), and one attaches two different smooth families of sprays to these, then
the two different integrals I(V(s)) agree up to third order in s at s = 0. Clearly,
since the integration is over a three-dimensional region, this just follows
from the equality of the integrands at the point v. Note that for this result
it is important that V(s) is parametrized smoothly, as defined above.
This formula is not particularly exciting as it stands, but can be rewritten
in an interesting way. The same spray of curves can be developed into
Minkowski space M. Suppose that the manifold has a b a s e p o i n t . , and that
c is an arbitrary curve which connects 9 to v. The paths that were chosen in
V are connected to c and then developed into M, the tangent space of the
point .. Then the region Vcan be mapped into M by mapping a point u s V
to the endpoint of the development of the path leading to u.
Let us formalize this briefly. We started with a family of paths
~: V ~ P o X (PoX is the path space of the manifold X, based at v). These
ended at the point in question: ~(u, 1)=u. The map x: V ~ M was defined
by x(u)= 6(~t(u) o c)(1), where 6 is the development map P , X ~ P M .
Again, one can regard this as a special type of coordinate gauge fixing,
x providing a particular type of coordinate chart, such that dx = e for vectors
along the family of curves. The point is that co and x obey
co(v) = O, dx(v) = e
so that the "coordinates" are very particular ones: they are adapted to the
geometry of the manifold at v.
The local conservation of energy-momentum dp"(v) = 0 (which actually
only holds if the torsion or curvature vanishes at v) suggests that the integral
over V can be rewritten as a boundary term. In fact, the relevant expression
is
which again holds up to third order. This formula is true because, using
Stokes' formula, the integrand at v is equal to 89 times
translational part of the affine holonomy for a "small" loop at the end of
the curve e.
The corresponding integral expression is
M a b ~ ~abcd ; V
gl~ X "1"-~l.~ X
with the equality again holding to third order. The two terms in the integrand
might be called the first moment of translational curvature and a second
moment of (rotational) curvature. The equation states that the integral of
these two moments of curvature over a small two-sphere t?V is equal to the
matter angular momentum passing through the two-sphere.
This equation can be regarded as the second field equation because if the
first field equation holds, one can subtract the "orbital" angular momentum
X(V[a)Pb] from the total angular momentum Mab, and the result is the equa-
tion relating the modified torsion tensor to the spin density.
The origin dependence of the total angular momentum Mab is what one
would expect. If the angular momentum is measured from a different origin,
or indeed the same origin but connected to it by a different path c', then the
coordinate of v changes by x ' - x , and the angular momentum changes by
M ' - M = (x' - X[a)Pb]. In addition, if there is a change of frame, all the vector
indices are rotated by the Lorentz transformation. The linear momentum
was independent of change of origin, but behaved as a vector under a change
of frame.
Due to the origin dependence of M~b, it is possible to sort out the
intrinsic spin of the matter from the orbital angular momentum, thus resolv-
ing the ambiguity noted by Kibble (1961). Roughly speaking, the spin part
of the angular momentum is the part that cannot be transformed away by
the change of origin mentioned above. The details of how this works in
ordinary flat space is contained in Penrose and MacCullum (1973).
The two small-sphere expressions for linear and angular momenta pre-
sented here have appeared previously in different contexts, due to the fact
that the small-sphere limit is the same as the weak-field limit (Penrose, 1982).
The expressions, for zero torsion, therefore agree with the twistor expressions
for quasilocal momenta (Penrose, 1982; Kelly et al., 1986).
A suitable representation of the equations of motion has been achieved
with the aid of the notion of gravitational holonomy, particularly the use of
development. The linear and angular momenta of matter passing through
small two-spheres are equated with integrals of moments of the two curva-
tures, rotational and translational. All these quantities have fairly immediate
significance in the holonomy description. The curvatures are essentially the
holonomy elements for small loops, as explained in Section 2.5. The other
1214 Barrett
ACKNOWLEDGMENTS
Thanks are due to the following for discussing various areas: Jeeva
Anandan, Rob Baston, Bernard Kay, Michael Singer (Yang-Mills axioms),
Chris Clarke (Hausdorff property), David Elworthy (path space topology),
and Luke Hodgkin and Graeme Segal (literature). I am indebted to Chris
Isham, Tom Kibble, Robin Tucker, and Nick Webber for general discussion
and criticism.
REFERENCES
Aharonov, Y., and Bohm, D. (1959). Physical Review, 115, 485-491.
Anandan, J. (1983). Holonomy groups in gravity and gauge fields, in Proceedings Conference
Differential Geometric Methods in Physics, Trieste 1981, G. Denardo and H. D. Doebner,
eds., World Scientific, Singapore.
Atiyah, M. F. (1980). Geometrical aspects of gauge theories, in Proceedings International
Congress Mathematics, 0. Lehto, ed., Helsinki, pp. 881-885.
Babelon, O., and Viallet, C. M. (1981). Communications in Mathematical Physics, 81, 515-525.
Barrett, J. W. (1985). The holonomy description of classical Yang-Mills theory and general
relativity, Ph.D. thesis, University of London.
Barrett; J. W. (1986). Classical and Quantum Gravity, 3, 203-206.
Barrett, J. W. (1987). Classical and Quantum Gravity, 4, 1565-1576.
Barrett, J. W. (1988). Classical and Quantum Gravity, 5, 1187-1192.
Barrett, J. W. (1989). General Relativity and Gravitation, 21, 457466.
Bialynicki-Birula, I. (1963). Bulletin de FA cadbmie Polonaise des Sciences, 11, 135.
Cartan, E. (1922). Comptes Rendus, 174, 437-439.
Chan, H.-M., and Tsou, S. T. (1986). Acta Physica Polonica B, 17, 259-276.
Chan, H.-M., Scharbach, P., and Tsou, S. T. (1986). Annals of Physics, 166, 396~21.
Dirac, P. A. M. (1931). Proceedings of the Royal Society of London A, 133, x-xxi.
Dugundji, J. (1966). Topology, Allyn and Bacon, Boston.
Durhuus, B. (1980). Letters in Mathematical Physics, 4, 515-522.
Einstein, A. (1922). The Meaning of Relativity, 6th ed., Chapman and Hall, London.
Fischer, A. E. (1986). General Relativity and Gravitation, 18, 597-608.
Giles, R. (1981 ). Physical Review D, 24, 2160-2168.
Hawking, S. W., and Ellis, G. F. R. (1973). The Large Scale Structure of Space-Time,
Cambridge University Press, Cambridge.
Isham, C. J. (1981). Quantum gravity--An overview, in Quantum Gravity 2, A Second Oxford
Symposium, C. J. Isham, R. Penrose, and D. W. Sciama, eds., Clarendon Press, Oxford.
Isham, C. J. (1984). Topological and global aspects of quantum theory, in 1983 Les Houches
Summer School Lectures "'Relativity Groups and Topology", North-Holland, Amsterdam.
Kelly, R. M., Tod, K. P., and Woodhouse, N. M. J. (1986). Classicaland Quantum Gravitation,
3, 1151-1167.
Kibble, T. W. B. ( 1961). Journal of Mathematical Physics, 2, 212-221.
Holonomy and Path Structures in GR and YM Theory 1215