Releat
Releat
Contents
1. Setting the Stage
Reference Frames; The Principle of Relativity; Electromagnetism; Clock
Synchronization; Three Kinds of Time; Spacetime Diagrams
Reference Frames
So where do we start? With the concept of a reference frame.
I hope you recall something about reference frames from your study of
Newtonian physics. Basically a reference frame is a coordinate
system: xx, yy, and zz axes, laid out with respect with some origin, along
with a consistent method of measuring time. You can picture a reference
frame as a three-dimensional lattice of meter sticks, with a clock located at
each lattice point, as in this vivid illustration from the book Spacetime
Physics:
Using this structure we can measure both the location and the time of any
localized event, such as the collision of two billiard balls or the flashing of a
strobe light. In practice we never actually set up such a cumbersome
structure, but we need to remember that whatever method we do use to
measure locations and times of events must be equivalent to this one, in the
sense that it yields all the same numbers for position and time measurements.
(Also, in practice, we can often get by with a system for measuring positions
and times in just a two-dimensional plane, or even just along a one-
dimensional line, where our events of interest take place.)
When we set up a reference frame, we must make several arbitrary
choices: the location of the origin (labeled the “reference clock” in the
illustration above), the orientations of the three spatial axes, the origin of time
(when the clocks read zero), and, crucially, the state of motion of our
reference frame. Different choices will give us different measurements for
the xx, yy, zz, and tt coordinates of any given event, so we say that these
positions and times are relative.
If you’re not yet completely comfortable with the concept of a reference
frame and the idea that measured quantities can be relative, I highly
recommend this classic (albeit corny) film on the subject:
My reference frame has its origin at the post office, xx axis pointing
east, yy axis pointing north, and t=0t=0 at high noon. Your reference frame
has its origin at the library (5 blocks due east from the post office), xx axis
pointing north, yy axis pointing west, and t=0t=0 at 9:00 a.m. (when the
library opens). Suddenly, a door slams (Event S). Some time later, a dog
barks (Event B). Using my reference frame, I determine that the coordinates
of Event S are x=3x=3 blocks, y=2y=2 blocks, t=1t=1 hour, while the
coordinates of Event B
are x=−1x=−1 block, y=−3y=−3 blocks, t=3t=3 hours. (a) What
coordinates do you measure for these events? (b) Use
my xx and yy coordinates and the Pythagorean theorem to calculate the
distance between them (just in space, ignoring time). Then calculate the
distance again using your coordinates, and comment on the result.
Electromagnetism
But by the late 1800s, physicists had convinced themselves that the principle
of relativity does not apply to the laws of electromagnetism. That’s because
Maxwell’s equations for the electric and magnetic fields predict that
electromagnetic waves (including visible light) travel at a speed of
1ϵ0μ0−−−−√=c=3.00×108 m/s.(1)(1)1ϵ0μ0=c=3.00×108 m/s.
Question: 300 million meters per second with respect to what? Between 1865
and 1905, everyone assumed the answer was: with respect to some preferred
frame of reference, in which the (hypothetical) medium that transmits
electromagnetic waves (then called the ether) is at rest.
But Einstein wasn’t so sure. He was familiar with a common
demonstration that you may have seen, in which we move a coil of wire
relative to a magnet, inducing a current measured by a galvanometer:
Clock Synchronization
Recall the picture of a reference frame above, with a clock at every grid
location. To measure the time when an event occurs, we look at the reading
of whatever clock lies nearest to the event. But to compare the times of
events that happen in different places, we need to make sure our clocks are all
synchronized with each other. How do we do that?
You might think we could pick up our reference clock and carry it
sequentially to the locations of all the other clocks, setting each of them to
match it. As we’ll later see, that method won’t work: accelerating the
reference clock into motion, then stopping it when it arrives near some other
clock, will affect its measurements. (You don’t need to actually believe this
yet—just accept that the accelerations could affect the measurements, and
agree not to use this method of clock synchronization.) Fortunately, there’s a
much better way to synchronize the clocks: Just look at them! But when you
do, be sure to take into account that the light from different clocks might have
taken different amounts of time to reach you. If you’re the same distance
from two different clocks (as measured by all those meter sticks!), then you
should always see them reading the same time. If you’re looking at a clock
that’s one light-second away (300 million meters, or about 3/4 the distance to
the moon), then you should see it running exactly one second behind the
clock at your location, because the light from the distant clock took that long
to reach you. Because the speed of light is the same with respect to all inertial
reference frames (and doesn’t depend on the direction the light travels), we
can be confident that this light-travel-time adjustment will always work as
expected.
Suppose you’re standing still (with respect to the ground) and looking at a
clock tower 200 meters away. If your wristwatch reads exactly 8:18 a.m. and
the clock on the tower is perfectly synchronized with it, what time should
you see on the tower’s clock face?
You run a single lap around a track, while your coach, standing at your
starting (and ending) location, times your lap with a stopwatch. Event A is the
start of your lap while Event B is the finish. What kind(s) of time between
these events is/are measured by the stopwatch? What kind(s) of time between
these events is/are measured by your wristwatch? Explain carefully.
You are standing near a railroad track when a train rushes past you, moving at
constant velocity. Let Event A be the locomotive (at the front of the train)
passing you, and let Event B be the caboose (at the rear of the train) passing
you. You measure the time between these events with your wristwatch, while
the train’s engineer (in the locomotive) and conductor (in the caboose)
measure the time between these events using their pocket watches, which they
have carefully synchronized. What kind of time do you measure between the
two events? What kind of time does the train’s crew measure?
Spacetime Diagrams
To visualize the times and locations of various events, I now want to
introduce a tool called a spacetime diagram. It’s really just a plot of one-
dimensional position (x) and coordinate time (t), like we use for one-
dimensional motion when studying basic kinematics. But there are two new
twists: First, it’s conventional to plot x horizontally and t vertically. This
reversal may seem strange at first, but I think you’ll soon get used to it.
Second, we space the tick marks equally along both axes, such that our unit
of distance is whatever distance light travels in one unit of time. For instance,
you’ve probably heard of measuring distances between stars in light-years,
where a light-year is the distance that light travels in one year
(nearly 10161016 meters); this will be our unit of distance if we measure
time in years. If instead we measure time in seconds, then our distance unit is
one light-second, or 300 million meters (about 3/4 of the distance to the
moon). Or, for events occurring in a smaller laboratory, we can measure time
in nanoseconds and distance in light-nanoseconds (one light-nanosecond is
0.3 meters, or about a foot). The slow-moving objects we’re used to don’t
travel very far in a nanosecond, and would take a long time to travel a light-
second, let alone a light-year. But for the fast-moving objects that make
relativistic effects apparent, these unit choices will be very convenient.
Here is a spacetime diagram, calibrated in seconds and light-seconds, on
which I’ve plotted several events:
Notice that an event, localized in both space and time, is represented on the
diagram by a point. We plot each point on the diagram according to its
coordinates (x,t)(x,t) in some particular inertial reference frame; if we used a
different inertial frame then the appearance of the diagram would change (as
we’ll see in detail in Lesson 4). But at least from the perspective
of this inertial reference frame, the diagram shows us at a glance that Event A
(starship’s warning sirens sound) occurs at t=1t=1 second
and x=2x=2 light-seconds; that Event B (deflector shields are raised) occurs
at the same place, two seconds later; that Event C (enemy ship fires photon
torpedoes) occurs at the same time as B, four light-seconds away to our right;
and that Event D (science officer raises eyebrow) occurs three seconds later
still and at x=3x=3 light-seconds.
Now think about a sequence of events that all happen to a particular
object: perhaps the flashes of a strobe light, or the beats of a person’s heart. If
we plot these events on a spacetime diagram and connect the dots together,
we have a record of that object’s (or person’s) motion:
We refer to the line or curve connecting all events that happen to a particular
object as that object’s worldline—its line through the “world” of space and
time. Often we draw an upward-pointing arrow on a worldline, to remind us
that the object’s history flows from bottom to top.
Here are some more worldlines:
Object 1 is at rest, always at the same xx value. Object 2 is moving to the
right at a constant velocity of 1/3 the speed of light (one light-second of
distance in each three seconds of time), while object 3 is moving to the left
(in the −x−x direction) at 2/3 the speed of light. Notice that the faster an
object’s motion, the shallower the slope of its worldline. Object 4 is initially
moving to the right but then slows down, stops, and gradually begins moving
to the left. Object 5 is a light pulse, moving rightward at the speed of light:
one light-second per second. A light pulse worldline always lies at a 45-
degree angle on a conventionally calibrated spacetime diagram.
Again, each of these spacetime diagrams is plotted from the viewpoint of
one particular inertial reference frame; let’s call it the Home Frame. If instead
you measure events with respect to some Other Frame that’s moving to the
right at 1/3 the speed of light (with respect to the Home Frame), then your
spacetime diagram will show Object 2 at rest, with a vertical worldline, and
Object 1 moving to the left at 1/3 the speed of light. Motion is relative!
Lesson 4 explains in detail how to translate a spacetime diagram from one
inertial reference frame to another.
For a delightful animated explanation of reference frames and spacetime
diagrams, I recommend the Minute Physics video Spacetime Diagrams. The
video even shows how to add a yy axis to a spacetime diagram, to depict
motion in two spatial dimensions. But it doesn’t use the convention of
calibrating the space and time axes so that light signal worldlines are always
at 45 degrees.
At t=0t=0 an uncrewed rocket is launched from earth, traveling in
the +x+x direction at 4/5 the speed of light (with respect to earth). After 10
seconds, as measured in earth’s frame of reference, the rocket explodes. A
burst of light from the explosion travels back toward earth, where authorities
detect the light some time later. Draw a calibrated spacetime diagram that
accurately shows these objects and events, as observed in earth’s reference
frame. Label the launch event, the explosion event, and the detection-of-light
event, as well as the worldlines of earth, the rocket, and the light burst.
Look again at the spacetime diagram above with events labeled A through D.
This diagram is drawn from the perspective of some Home reference frame.
What velocity would some Other reference frame need to have, with respect
to the Home frame, in order for observers in the Other frame to observe
Event B and Event D to occur at the same place? Explain carefully.
At t=0t=0 I use a match to light the fuses on all the firecrackers, and
simultaneously hurl them in both directions, at an assortment of speeds. Some
go fast, while others go slow. Some go in the +x+x direction, while others
go in the −x−x direction. I give one of the firecrackers a velocity of zero,
holding it in my hand for reference.
Eventually all the firecrackers explode, and I carefully record the places
and times of these explosions. How do I do that? I could station an assistant
at each clock, with instructions to record the location and clock reading when
an arriving firecracker explodes. Or I could just watch for the explosions,
using binoculars to view the tape label and clock reading at each explosion
event. Of course the light from these explosions will take time to reach me,
and that delay will be longer for the more distant explosion events, so I don’t
expect to actually see all the explosions at the same time. But if I didn’t know
anything about special relativity, I would still expect all the explosion events
to occur at the same time. Plotted on a spacetime diagram, the explosion
events should (I expect) all lie on a horizontal line:
On this diagram my own worldline coincides with the tt axis, because I’m
at x=0x=0 and not moving (with respect to my own reference frame). The
vertical red line is the worldline of the firecracker that I’m holding in my
hand, and its explosion event is plotted at x=0x=0 and t=10t=10 seconds. I
expect all the other explosion events to also occur at t=10t=10 seconds, as
shown.
But that’s not what actually happens.
The firecracker that I’m holding in my hand really does explode
at t=10t=10 seconds, as expected. But the other firecrackers
explode later than t=10t=10 seconds, by an amount that’s tiny if they’re
moving slowly but that grows quite large if they’re moving at nearly the
speed of light (with respect to my reference frame). Plotted on a spacetime
diagram, the explosion events actually lie on a hyperbola that’s defined by
the formula t=(10 s)2+x2−−−−−−−−−−√t=(10 s)2+x2 (with the
understanding that xx is measured in light-seconds):
As you can see, the hyperbola is quite flat near the middle of the diagram, so
for slow-moving firecrackers we could easily mistake it for a horizontal line.
Meanwhile, the explosion events for fast-moving firecrackers can occur at
arbitrarily large distances (in either direction), and at arbitrarily late times,
because the hyperbola extends infinitely far in both directions, asymptotically
approaching the 45-degree worldlines of the light flashes traveling outward
from the match that I used to light the fuses.
Does this shocking result mean that something was wrong with those
firecracker fuses after all? No! Each fuse truly measures exactly 10 seconds
between the fuse-lighting event (call it Event A) and the explosion event (call
it Event B). But there’s no logical necessity to our expectation that the time
between these two events as measured by the fuse must be the same as the
time between them as recorded in my inertial reference frame (and plotted on
my tt axis). For the fuse, which is present at both events and doesn’t
accelerate along the way, measures the spacetime
interval ΔsABΔsAB between the fuse-lighting event A and the explosion
event B, whereas the clocks in my inertial reference frame, no one of which
is present at both events, instead measure coordinate time ΔtABΔtAB.
The mathematical relationship between coordinate time and the spacetime
interval is summarized in the equation for the hyperbola given above. For an
arbitrary pair of events A and B, the equation reads
ΔtAB=(ΔsAB)2+(ΔxAB)2−−−−−−−−−−−−−−−−√,(2)
(2)ΔtAB=(ΔsAB)2+(ΔxAB)2,
where ΔsAB=ΔsAB= 10 seconds in the firecracker example, and again with
the understanding that ΔxABΔxAB is measured in units of the distance that
light travels in one time unit (e.g., light-seconds if the times are in seconds).
This relationship is called the metric equation of special relativity, and is
summarized in this simple spacetime diagram:
We can write the metric equation in many other ways. For instance, if we
square both sides and move the ΔxΔx term to the left, we obtain
(ΔtAB)2−(ΔxAB)2=(ΔsAB)2.(3)(3)(ΔtAB)2−(ΔxAB)2=(ΔsAB)2.
I like this version because it puts the frame-dependent coordinate differences
(which of course must be measured in the same reference frame) on one side
and the unique spacetime interval on the other. There is a deep analogy
between the metric equation and the Pythagorean formula for calculating
distances in a two-dimensional plane—but the metric equation has a minus
sign where the Pythagorean formula has a plus.
If you’d rather measure ΔxΔx in more conventional units, then
the ΔxΔx term in the metric equation requires a conversion factor. For
instance, if ΔxΔx is in meters and the times are in seconds, then to
convert ΔxΔx to light-seconds we divide by the number of meters in a light-
second, 3×1083×108. More generally, to convert ΔxΔx to appropriate
light-travel units we must divide by the speed of light:
(ΔtAB)2−(ΔxABc)2=(ΔsAB)2.(4)(4)(ΔtAB)2−(ΔxABc)2=(ΔsAB)2.
Yet another variation is to notice that Δx/ΔtΔx/Δt is the velocity of the
clock (e.g., a firecracker’s fuse) that measures ΔsΔs, with respect to our
inertial reference frame. Denoting this velocity vv, we can then
insert Δx=vΔtΔx=vΔt to write the metric equation in terms of vv instead
of ΔxΔx:
(ΔtAB)2(1−v2c2)=(ΔsAB)2,orΔtAB=ΔsAB1−(v/c)2−−−−−−−−√.
(5)(5)(ΔtAB)2(1−v2c2)=(ΔsAB)2,orΔtAB=ΔsAB1−(v/c)2.
This is the form of the metric equation that’s most often written in
introductory textbooks, although these books usually call it the “time dilation
equation”, and instead of ΔsΔs they often use the notation ΔτΔτ or Δt′Δt′.
I’ve been assuming that we orient our reference frame’s xx axis so
Events A and B are separated only in the xx direction, not yy or zz. To drop
this assumption, just replace (Δx)2(Δx)2 in the metric equation with the
square of the spatial distance between the events, (Δx)2+(Δy)2+
(Δz)2(Δx)2+(Δy)2+(Δz)2. When we write the metric equation in terms
of vv instead of ΔxΔx, that means v2=v2x+v2y+v2zv2=vx2+vy2+vz2.
When applying the metric equation, the most common difficulty is
figuring out whose clocks measure ΔtΔt and whose clock measures ΔsΔs. I
find it helpful to keep referring back to the last figure above, which shows
that ΔtΔt is always longer than ΔsΔs, and that ΔsΔs is the time interval as
measured by the unique, nonaccelerated clock that is present at both events.
Suppose that one particular firecracker, among those described above, has a
velocity (with respect to your reference frame) of 2/3 the speed of light. Draw
a spacetime diagram showing its worldline, its explosion event, and the light
from the explosion traveling back to the origin. What are
the xx and tt coordinates of the explosion event? At what time do you
(standing at the origin) see the light from this explosion? Label your
spacetime diagram to show your answers.
Suppose that one particular firecracker, among those described above,
explodes at x=−12x=−12 light-seconds. Draw a spacetime diagram
showing its worldline, its explosion event, and the light from the explosion
traveling back to the origin. What is this firecracker’&s velocity? At what
time do you (standing at the origin) see the light from this explosion? Label
your spacetime diagram to show your answers.
Suppose that one particular firecracker, among those described above, has a
velocity (with respect to your reference frame) of 200 kilometers per second
(slightly faster than NASA’s fastest-ever space probe). What are
the xx and tt coordinates of its explosion event?
For the events A, B, C, and D shown in the first spacetime diagram in the
previous lesson, use the metric equation to calculate the spacetime
intervals ΔsABΔsAB, ΔsBDΔsBD, ΔsADΔsAD, and ΔsCDΔsCD. How
does ΔsADΔsAD compare to the sum ΔsAB+ΔsBDΔsAB+ΔsBD? What
happens if you try to calculate ΔsACΔsAC or ΔsBCΔsBC? Can you
generalize your answers to these questions?
You wish to travel to the Vega star system, 25 light-years from earth. Being
impatient, you would rather not spend more than 15 years of your own time
on the journey. How fast must your spaceship travel? How long does your trip
take, according to observers on earth (or on Vega, which is more or less at
rest with respect to earth)? Draw an accurate spacetime diagram showing the
worldlines of earth, Vega, and your spaceship.
Repeat the previous exercise for a trip to Polaris (the North Star), which is
430 light-years distant. Assume again that the trip should take no more than
15 years of your own time. Sketch a spacetime diagram to convey the idea of
your calculation, but don’t worry about making it quantitatively accurate
(which would be difficult). Is there any limit to how far out into the universe
you can travel within a human lifetime?
People often describe the metric equation with the ambiguous phrase moving
clocks run slow. Explain why this phrase can be misleading, and give an
example in which it is the “stationary” clock that “runs slow”.
Let Event A be a particular bounce of the light pulse off the bottom
mirror, and let Event B be the next bounce off the bottom mirror, after a
single round trip. Because the light travels a total distance 2d2d in between
these events, and it moves at speed cc, we can immediately write
2d=cΔsAB,(7)(7)2d=cΔsAB,
where ΔsABΔsAB is the time between the two events as measured by our
clock. Our clock measures the spacetime interval because it is present at both
events (and we won’t allow the apparatus to accelerate).
Now let’s view these same events from an inertial reference frame in
which the whole apparatus is moving to the right at some constant speed. The
illustration below shows three successive images of the mirrors from this
perspective (at the times of the three light pulse bounces), along with the path
of the light pulse. Notice that in this frame of reference the light pulse is
traveling diagonally, so it travels farther. But the principle of relativity
(together with the laws of electromagnetism) requires that the measured
speed of light still has the same value, cc, in this new frame of reference.
Because the light travels a greater distance at the same speed, it must take
more time.
To measure the time between Event A and Event B in our new frame of
reference, we require a pair of previously synchronized clocks, at rest in this
frame, one present at each event. (These two clocks are not shown in the
illustration, which shows only the clock attached to the bottom mirror.) The
distance between these two clocks is ΔxABΔxAB, so each of the two
diagonal legs of the light-pulse path is the hypotenuse of a right triangle with
height dd and base ΔxAB/2ΔxAB/2. We can therefore write the total
distance traveled by the light pulse, using the Pythagorean theorem, as
Total distance=2d2+(ΔxAB/2)2−−−−−−−−−−−−
−√=(2d)2+(ΔxAB)2−−−−−−−−−−−−−√.(8)(8)Total
distance=2d2+(ΔxAB/2)2=(2d)2+(ΔxAB)2.
But since the pulse travels at speed cc, this distance must
equal cΔtABcΔtAB. Meanwhile we have already seen
that 2d=cΔsAB2d=cΔsAB, so this equation becomes
cΔtAB=(cΔsAB)2+(ΔxAB)2−−−−−−−−−−−−−−−−−√,(9)
(9)cΔtAB=(cΔsAB)2+(ΔxAB)2,
and this is just an algebraic rearrangement of the metric equation as written
above (with the factor of cc explicit so we can express ΔxABΔxAB in
traditional distance units if we like).
Length Contraction
Now let’s return to the cosmic-ray muon experiment described in the
previous lesson. The principle of relativity tells us that it is equally valid to
analyze this experiment from the reference frame of one of the muons, in
which it is at rest and the earth’s surface is rushing upward toward it at, say,
99.5% of the speed of light. But as I calculated above, the time interval
between Event A (summit of Mt. Washington rushes past muon) and Event B
(ground at sea level smashes into muon) is only 0.6 microseconds in this
frame of reference. How is that possible, if Mt. Washington’s height is 6
light-microseconds?
The answer is that in this frame of reference, the mountain is not 6 light-
microseconds high. Instead it is only 0.6 light-microseconds high (about 600
feet), because whenever we observe an object from a reference frame in
which it is moving at speed vv, it appears shorter, along the direction of
motion, by a factor of 1−(v/c)2−−−−−−−−√1−(v/c)2. In the muon’s
reference frame, the situation looks something like this:
With the mountain’s height contracted by a factor of 10, it passes the
stationary muon in 0.6 microseconds.
More generally, if we denote an object’s true length (in the frame in
which it is at rest) as L0L0 and its measured length (in the frame in which it’s
moving, along the direction of this length, at speed vv) as LL, then the
general formula for this relativistic length contraction effect is
L=L01−(v/c)2−−−−−−−−√.(11)(11)L=L01−(v/c)2.
So LL for a moving object is always less than L0L0, and the difference
between LL and L0L0 is negligible when v≪cv≪c.
By now you’ve surely noticed that the expression 1−(v/c)2−−−−−−
−−√1−(v/c)2 comes up a lot in relativity. For convenience we therefore
often use a standard abbreviation for it, or actually for its reciprocal:
γ=11−(v/c)2−−−−−−−−√.(12)(12)γ=11−(v/c)2.
The symbol is the Greek letter gamma, and this quantity is often called
the Lorentz factor, after the Dutch physicist H. A. Lorentz, who derived
many of the formulas of relativity several years before Einstein (but arguably
didn’t fully understand their meaning, as Einstein did). The Lorentz factor
equals 1 when v=0v=0, and increases to infinity as vv approaches the speed
of light. In terms of the Lorentz factor, the metric equation
reads ΔtAB=ΔsAB⋅γΔtAB=ΔsAB⋅γ and the length contraction formula
reads L=L0/γL=L0/γ. The main downside of using this abbreviation is that
in some situations there can be more than one relevant velocity, and then you
need to be clear about which velocity your γγ abbreviation depends on.
A 50-foot (50 light-nanosecond) log is lying on the ground. A bird flies past
the log, just above it and parallel to its length, at 3/5 the speed of light. Let
Event A be the bird passing the first end of the log, and let Event B be the
bird passing the other end of the log. (a) Draw an accurate spacetime diagram,
from the viewpoint of earth’s reference frame, showing the worldlines of both
ends of the log, the worldline of the bird, and Events A and B. (b) What is the
time between Events A and B, as measured by the squirrels sitting on the log?
(c) What is the time between Events A and B, as measured by the bird? (d)
From the bird’s point of view, the log is rushing past at 3/5 the speed of light.
How far does the log move, during the time between Events A and B,
according to the bird’s calculations? Explain carefully.
While peacefully watching cloud formations in the desert, you suddenly see a
roadrunner zip by (beep, beep!) at half the speed of light, pursued by a coyote
running at the same speed. According to your measurements, the coyote is ten
meters behind the roadrunner. How far behind does the roadrunner think the
coyote is? (Hint: If the two creatures were holding a pole between them, in
whose reference frame would the pole be moving?)
By how much is the length of a 100-meter-long commuter train contracted in
a reference frame in which it is moving at 30 m/s? (Hint: You may find it
helpful to use the binomial approximation, (1+ϵ)n≈1+nϵ(1+ϵ)n≈1+nϵ,
which is accurate when |nϵ||nϵ| is much less than 1.)
For simplicity I’ve drawn this diagram so that the origin events of our two
reference frames coincide. I’ve drawn Events F and S at appropriate points on
my worldline, so that the origin event O is again half-way between them.
Question: Where on this diagram should I locate Event B?
To answer this question we use the startling fact that you also measure
the light pulse to move at exactly one light-second per second, despite the
fact that the strobe lamp that emitted the pulse is moving (with respect to
you) at half that speed. That seems impossible, right? But please suspend
your disbelief for a while, so we can work out the logical consequences. If
you really measure the light to be traveling at one light-second per second,
then we must draw the light-pulse worldlines at 45-degree angles even on
your spacetime diagram. I started them on the diagram above. Extrapolating
each of them to the right, we can locate Event B at the unique point where
these 45-degree lines intersect:
And what we find is that even though Events O and B are simultaneous
in my reference frame, they are not simultaneous in yours: You observe Event
B to occur after Event O, at some positive tt value. More generally, when
two events occurring in different places are observed to be simultaneous from
one frame of reference, they will be observed to occur at different times from
a frame of reference that is moving, with respect to the first, along the
direction that separates the events. This phenomenon is called relativity of
simultaneity.
To emphasize the relativity of simultaneity, I’ve added another element to
the diagram: the x′x′ axis. What do I mean by the x′x′ axis? It’s the line
connecting all events that happen at time zero in my reference frame, that is,
at t′=0t′=0. (Compare the xx axis, which connects all events that happen
at t=0t=0, and the tt and t′t′ axes, which are lines connecting all events that
happen at x=0x=0 and x′=0x′=0, respectively.) In our case, the x′x′ axis
must connect events O and B, because they both occur at t′=0t′=0.
Notice that the x′x′ axis is sloped upward from the xx axis by the same
amount that the t′t′ axis is sloped rightward from the tt axis. Or, equivalently,
the angle between the xx and x′x′ axes is the same as the angle between
the tt and t′t′ axes. In the present case, where my frame is moving with
respect to yours in the +x+x direction at v/c=1/2v/c=1/2, the t′t′ axis has
slope 2 while the x′x′ axis has slope 1/2. (I won’t present a rigorous proof
that the slopes are always related in this way, but the proof isn’t hard. If
you’d like to try it, draw another light-signal worldline passing through Event
O and then look for similar triangles.)
A spacetime diagram showing two sets of axes, for two different
reference frames, is called a two-observer spacetime diagram. To use the
diagram quantitatively, we can add gridlines for both coordinate systems:
For the unprimed frame (“yours”, shown in blue), each vertical gridline
connects all the events happening at a particular place (as measured in your
frame), while each horizontal gridline connects all the events happening at a
particular time (as measured in your frame). We can use whatever unit we
like for the interval between gridlines, with the understanding that the space
interval is however far light travels in one time interval.
For the primed frame (“mine”, shown in green), each mostly-vertical
gridline connects all the events happening at a particular place (as measured
in my frame), while each mostly-horizontal gridline connects all the events
happening at a particular time (as measured in my frame). Importantly, I’ve
spaced these gridlines so they correspond to the same time and space
intervals as the gridlines in your frame. How did I do this? Using the metric
equation! For instance, if Event P lies on the t′t′ axis where the t′=1t
′=1 gridline crosses this axis, then my wristwatch measures a spacetime
interval ΔsOP=1ΔsOP=1. But according to the metric equation, your clocks
should measure Event P to occur at ΔtOP=γ⋅ΔsOP=1/1−(1/2)2−−−−
−−−−√=1.155ΔtOP=γ⋅ΔsOP=1/1−(1/2)2=1.155, as shown in this
enlarged portion of the diagram:
Here I’ve plotted Event A at t=5t=5 and x=4x=4, as indicated by the blue
dashed lines. Then, using the green dashed lines, we can read off the
approximate primed-frame coordinates t′≈3.5t′≈3.5 and x′≈1.7x′≈1.7.
Similarly, I plotted Event B at t′=−3t′=−3 and x′=4x′=4, as indicated by
the second pair of dashed green lines. But as the second pair of dashed blue
lines shows, the approximate coordinates of this same event in the unprimed
frame are t≈−1.15t≈−1.15 and x≈2.9x≈2.9.
These coordinate transformations have the crucial property that if you
square the time and space coordinates and then subtract, you get the same
thing in either reference frame:
(t′)2−(x′)2=t2−x2.(13)(13)(t′)2−(x′)2=t2−x2.
For instance, for Event A, the right-hand side is 52−42=952−42=9, while
the left-hand side is approximately (3.5)2−(1.7)2≈9(3.5)2−(1.7)2≈9. But
computed in either frame, this quantity is just the square of the spacetime
interval between the origin event (call it O) and Event A, that
is, (ΔsOA)2(ΔsOA)2. Because the spacetime interval between two events is
the unique time between those events as measured by a single non-
accelerating clock that’s present at both events, we must obtain the same
spacetime interval when we compute it using the coordinates in either
reference frame. This “invariance” of the spacetime interval is analogous to
how you can calculate the squared distance between two points in ordinary
two-dimensional space as (Δx)2+(Δy)2(Δx)2+(Δy)2, and you’ll get the
same result no matter how you orient your xx and yy axes. (Note, however,
that the “Pythagorean theorem” for spacetime intervals has a minus sign
where the ordinary Pythagorean theorem for two-dimensional space has a
plus.)
As you might guess, there are also equations that you can use to carry out
these kinds of transformations between primed and unprimed spacetime
coordinates. They’re called the Lorentz transformation equations, and you
can find them in just about any textbook on relativity. They’re analogous to
the trigonometric equations that transform coordinates in ordinary two-
dimensional space when we rotate the axes (x,y)(x,y) by some angle. (The
Lorentz transformation equations can be written in terms of
the hyperbolic sine and cosine of the “angle” whose hyperbolic tangent
is v/cv/c.)
Event C occurs at t=1t=1 and x=−2x=−2 in the unprimed reference frame.
What are its coordinates in the primed reference frame represented in the
diagrams above, moving rightward at half the speed of light with respect to
the unprimed frame? Show your construction on a copy of the two-observer
spacetime diagram. Also check that (t′)2−(x′)2=t2−x2(t′)2−(x′)2=t2−x2.
Event D occurs at t′=1t′=1 and x′=4x′=4 in the primed reference frame
represented in the diagrams above. What are its coordinates in the unprimed
frame, which is moving leftward at half the speed of light with respect to the
primed frame? Show your construction on a copy of the two-observer
spacetime diagram. Also check that (t′)2−(x′)2=t2−x2(t′)2−(x′)2=t2−x2.
At high noon, a solar flare erupts on the surface of the sun. Half an hour later,
at 12:30, a comet crashes into Jupiter, 780 million km away from the sun.
(These data are as measured in earth’s reference frame, which is moving at
negligible speed with respect to the sun and Jupiter.) Meanwhile an alien
spaceship zips by at 0.8c0.8c, headed in the direction from the sun toward
Jupiter. (a) Draw a spacetime diagram, calibrated in minutes of time and
light-minutes of space, showing the sun, Jupiter, the flare eruption, and the
comet crash. (b) Add t′t′ and x′x′ axes for the alien spaceship’s reference
frame, and determine which event (solar flare or comet crash) occurs first in
the aliens’ frame. (c) Which of the two events do the aliens see first? Draw
worldlines to represent the light from each event traveling toward the alien
spaceship, and show that the answer depends on where the spaceship is
located within its reference frame (which I haven’t specified).
Redraw the two-observer spacetime diagram in the text above from the
viewpoint of the primed reference frame, so the t′t′ axis points straight up and
the x′x′ axis points straight to the right. Since the unprimed frame is now
moving to the left at half the speed of light, this means that the tt axis will
point up and to the left, with a slope of −2−2. Plot Event A according to its
unprimed coordinates, and check that its primed coordinates are the same as
what I found above. Plot Event B according to its primed coordinates, and
check that its unprimed coordinates are the same as what I found above.
Draw a calibrated two-observer spacetime diagram for the case where the
primed frame moves at v=0.7cv=0.7c with respect to the unprimed frame.
What are the primed coordinates of an event that occurs
at t=5t=5 and x=6x=6? What are the unprimed coordinates of an event that
occurs at t′=−2t′=−2 and x′=4x′=4?
Draw a two-observer spacetime diagram for an imaginary universe in which
time is absolute, so t′=tt′=t for every event. Label the axes and include
gridlines.
Combining Velocities
Now let’s move on to a completely new example. Suppose I’m at rest in the
primed frame, moving with respect to you in the positive direction at half the
speed of light, and I toss a baseball forward at half the speed of light with
respect to me. How fast is the baseball moving with respect to you?
If you didn’t know anything about relativity, you would probably answer
this question by adding one-half to one-half to obtain one, that is, one times
the speed of light. But by now you may be more wary of such simple
answers.
To answer this question I’ve carefully drawn the baseball’s worldline on a
two-observer spacetime diagram below. I started the worldline at the origin
and then, looking only at the diagonal green gridlines, measured one unit of
space (along the x′x′ axis) and two units of time (parallel to the t′t′ axis), to
find another event along the worldline, under the assumption that I measure
the baseball to be moving at half the speed of light. I then repeated this
process to extend the worldline further, and also extended it backward from
the origin. The events that I used to draw this line are highlighted with red
dots, and you’ll notice that they’re all at intersections of the green gridlines.
Amazingly, the baseball’s worldline is steeper than 45 degrees, indicating
that you measure the ball to be moving somewhat slower than the speed of
light. And to find its actual speed, you can just look at the blue gridlines! I’ve
conveniently chosen the numbers in this example so the baseball’s worldline
passes exactly through the blue grid point at t=5t=5 and x=4x=4 (check
this!), meaning that you measure the ball’s speed to be only 4/5 the speed of
light.
This example is a special case of the famous Einstein velocity
transformation formula. Before I write the formula in general, I need to
carefully define symbols for the three different velocities that we’re talking
about:
uxux = velocity of the baseball with respect to you (in the unprimed
frame);
u′xux′ = velocity of the baseball with respect to me (in the primed
frame);
vxvx = velocity of me (the primed frame) with respect to you (the
unprimed frame).
(Of course the object doesn’t have to be a baseball, but “baseball” seems
easier to remember than “object”.) Let’s also agree that all three of these
velocities are to be measured as fractions of the speed of light. The general
formula is then:
ux=u′x+vx1+u′xvx.(14)(14)ux=ux′+vx1+ux′vx.
Notice that the numerator of this formula is what we would expect if we
didn’t know anything about relativity: just add the two velocities! But the
denominator contains a “correction” term that’s the product of the two
velocities, measured as fractions of the speed of light. At ordinary speeds
these fractions would be tiny and their product would be tinier still, so we
could simply neglect this correction term. But when u′x=vx=1/2ux
′=vx=1/2, we obtain
ux=12+121+12⋅12=11+14=45,(15)(15)ux=12+121+12⋅12=11+14=45,
just as we already saw from the diagram.
Applying the velocity transformation formula to other examples can be
tricky, because it’s not always obvious which reference frame should be the
primed frame, which should be the unprimed frame, and which object should
correspond to the baseball. There are always multiple correct ways to set up
these correspondences, but you need to be consistent. Any of the three
velocities is allowed to be negative, and you often need to pay special
attention to minus signs. The best advice I can give you is to draw a picture
showing which way things are going and which direction you’re calling
positive; then write out, in English, exactly what you mean by each of the
three symbols. As a check, remember that if you neglect the correction term
in the denominator, you should get the answer you would expect if you didn’t
know about relativity.
What if, instead of a baseball, I “toss” a light pulse? Assuming that the pulse
moves forward at the speed of light with respect to me, while I move at half
the speed of light with respect to you, how fast does the light pulse move with
respect to you? Answer this question using a two-observer spacetime
diagram, then answer it again using the Einstein velocity transformation
formula. Finally, do it again (using both methods) for a light pulse that I
“toss” in the backward direction.
You are fleeing from Planet Vogsphere at speed 0.99c0.99c (with respect to
the planet) when your spaceship’s antimatter drive malfunctions, making
further acceleration impossible. Knowing the Vogons are in hot pursuit, you
climb into your escape pod and set it to be launched forward at the maximum
speed of 0.95c0.95c (with respect to your spaceship). Once the pod is
launched, how fast is it going with respect to Vogsphere?
A supersonic jet, moving with respect to the ground at 1000 m/s, fires a
supersonic missile in the forward direction at a speed of 1000 m/s with
respect to the jet. What is the missile’s speed with respect to the ground? By
what percentage does the answer differ from the naive prediction, 2000 m/s?
A distant quasar is moving away from earth at speed 0.35c0.35c. The quasar
emits a jet of plasma in the direction toward earth. Astronomers on earth
measure the jet to be approaching at speed 0.27c0.27c. What is the jet’s
velocity with respect to the quasar?
instead of writing out three separate equations for the xx, yy,
and zz components. But there’s a more important reason besides brevity.
When we write that one vector equals another, we’re making a statement
about the vectors themselves, independent of how we orient our coordinate
axes to define their xx, yy, and zz components. This means that if a vector
equation is true in one coordinate system, it must also be true in any rotated
coordinate system. Writing the laws of physics in terms of vectors doesn’t
ensure that these laws are correct, but at least it ensures that they’re
consistent with the principle that space doesn’t have any “preferred
directions”; our choice of coordinate axes is arbitrary.
In a completely analogous way, writing the laws of physics using four-
vectors will ensure that if these laws are true in one inertial frame of
reference, then they will be true in all inertial frames of reference. In other
words, using four-vectors ensures that the equations we write will be
consistent with the principle of relativity.
With this principle in mind, let’s now think about some of the laws of
physics.
In Newtonian mechanics, after you learned the kinematic concepts of
position, time, velocity, and acceleration, you went on to study dynamics:
force, mass, Newton’s second law, and the laws of conservation of
momentum and energy.
We could now revisit each of these concepts in the context of relativity,
but it turns out that the relativistic version of Newton’s second law isn’t
nearly as useful as we might have guessed. Instead it’s more efficient to skip
over the concept of force and go straight to the relativistic version of
momentum.
Momentum Conservation
Let’s consider a simple one-dimensional momentum conservation problem. A
1-kg block is gliding frictionlessly (or drifting through space) at exactly 20
m/s, toward an identical 1-kg block that’s initially at rest. The blocks then
collide and stick together, conserving momentum because they form an
isolated system:
This view of the collision is from what I’ll call the “Home” reference frame.
Because the initial momentum of the system is 20 kg m/s and momentum is
conserved, the final velocity of the combined 2-kg block must be exactly 10
m/s.
Now let’s view this same collision from what I’ll call the “Other”
reference frame, which is moving to the right at exactly 10 m/s with respect
to the Home frame. In the Other frame the final velocity of the combined
blocks is zero, while the initial velocity of Block 2 is exactly −10 m/s:
But what’s the initial velocity of Block 1? If we didn’t know about relativity
we would simply subtract 10 m/s (the Other frame’s velocity) from 20 m/s
(Block 1’s velocity in the Home frame) to obtain 10 m/s. But the Einstein
velocity transformation tells us that this isn’t exactly right. For if we work
backwards, combining the 10 m/s velocity of the block in the Other frame
with the 10 m/s velocity of the Other frame with respect to the Home frame,
we would get a value very slightly less than 20 m/s for the block’s velocity
back in the Home frame. In order for this velocity to come out to exactly 20
m/s, the velocity of Block 1 with respect to the Other frame must instead be
very slightly greater than 10 m/s.
And now we have a problem: As viewed from the Other frame, the final
momentum of this system is exactly zero but the initial momentum is not; in
fact it is slightly positive. By assuming that momentum was conserved in the
Home frame, I’ve proved that momentum is not conserved in the Other
frame. Momentum conservation is therefore incompatible with the principle
of relativity, which requires that the laws of physics are valid in all inertial
reference frames.
So what do we do? One option would be to simply give up, and conclude
that momentum conservation isn’t a law of physics after all. That’s
conceivable, but it would be a sad outcome and it wouldn’t explain why
momentum conservation works so well at low speeds.
To make the discrepancy more dramatic, consider the collision example
above but multiply all the speeds by 107107, so Block 1 is initially moving at
200,000,000 m/s (2/3 the speed of light), and the blocks’ final speed is
100,000,000 m/s (1/3 the speed of light). (Try to ignore the absurdity of two
“blocks” simply sticking together after such a violent collision!) If the Other
frame is again moving along with the blocks after the collision, what is the
initial speed of Block 1 in the Other frame, according to the Einstein velocity
transformation rule? What is the system’s initial (Newtonian) momentum in
the Other frame?
Relativistic Momentum
Fortunately, there’s another option: Modify the definition of momentum!
Perhaps the formula we’re using for momentum is only approximately
correct—accurate enough at low speeds, but inaccurate at higher speeds.
And what is our definition, exactly? Well, it’s mass times velocity, for
instance,
px=mvx=mdxdt(old definition).(17)(17)px=mvx=mdxdt(old
definition).
In Newtonian mechanics, this formula defines a perfectly good vector
component because xx itself is a valid vector component,
while mm and dtdt are scalars, that is, numbers that are the same in all
coordinate systems. (The quantity dxdx is basically the difference between
two xx values, final minus initial, but subtraction of vectors works
component-wise, so this subtraction doesn’t affect the status of the expression
as a valid vector component.)
And now, perhaps, you can see the issue: In four-dimensional relativistic
spacetime, the denominator dtdt is no longer a scalar because it is
a coordinate time interval, different in different reference frames. But there’s
a straightforward fix! Instead of putting the coordinate time difference in the
denominator, we can use the proper time difference dτdτ (which for
infinitesimal time intervals is the same as the spacetime interval dsds). This
is the time interval as measured by the object’s own clock, so it is a true
scalar quantity, independent of any reference frame. Our new definition of
momentum is therefore
px=mdxdτ=γmdxdt(new definition),(18)
(18)px=mdxdτ=γmdxdt(new definition),
where in the final expression I’ve used the metric equation to
relate dτdτ to dtdt:
dτ=dt1−(v/c)2−−−−−−−−√=dtγ.(19)(19)dτ=dt1−(v/c)2=dtγ.
The Lorentz factor γγ is very close to 1 at low speeds, which is why we never
notice that we need it in everyday situations. But at speeds close to the speed
of light, the extra factor of γγ in the definition of momentum makes a big
difference.
In three spatial dimensions, by the way, the Lorentz factor depends on all
three components of the velocity:
1γ=1−(v2x+v2y+v2z)/c2−−−−−−−−−−−−−−−−−√.(20)
(20)1γ=1−(vx2+vy2+vz2)/c2.
Surprisingly, this implies that the xx component of an object’s momentum
depends on all three components of its velocity! Meanwhile, the momentum
vector also has yy and zz components,
py=mdydτ=γmdydt,pz=mdzdτ=γmdzdt,(21)
(21)py=mdydτ=γmdydt,pz=mdzdτ=γmdzdt,
each of which also depends, through γγ, on all three velocity components.
How fast would an object need to be moving for its relativistic momentum
(mdx/dτmdx/dτ) to exceed its Newtonian momentum (mdx/dtmdx/dt) by
one percent? Express your answer as a fraction of the speed of light and also
in meters per second.
pt=γm≈m+mv22c2(when v≪c).(25)(25)pt=γm≈m+mv22c2(when
v≪c).
So at low speeds, the time component of the momentum four-vector is approximately equal to
the mass, plus a small correction term that’s starting to look a lot like kinetic energy (another
conserved quantity!). There’s an extra factor of c2c2 in the denominator, but an overall
constant factor won’t affect whether this quantity is conserved.
What we therefore do is multiply ptpt by c2c2, and refer to this quantity as simply EE,
the relativistic energy of the object:
E=ptc2=γmc2 (at any v)≈mc2+12mv2(when v≪c).(26)(27)
(26)E=ptc2=γmc2 (at any v)(27)≈mc2+12mv2(when v≪c).
For the special case of an object at rest, we have simply E=mc2E=mc2 (an equation that
you may have seen before); this quantity is called the rest energy of the object. At low speeds, an
object’s total energy is its rest energy plus the familiar Newtonian formula for kinetic energy (to
a good approximation). And at high speeds, we can compute its total energy from the exact
formula γmc2γmc2, and we define the kinetic energy to be the amount by which this exceeds
the rest energy:
Kinetic energy=E−mc2(at any v).(28)(28)Kinetic energy=E−mc2(at
any v).
In any case, the relativistic energy must be a conserved quantity—and it’s the total energy that’s
conserved, not the rest energy or kinetic energy separately.
Calculate γγ for v/cv/c equal to 0.1, 0.01, and 0.001. Compare each value to
what you get using the binomial approximation, and comment on the results.
Relativistic Energy
Let me now summarize this remarkable story. First I showed that the old Newtonian definition of
momentum is incompatible with the principle of relativity. To fix this problem, I suggested a
modification to the definition of momentum that seems to have the desired vector properties. But
if momentum conservation is to be a true law of physics, valid in all inertial reference frames,
then the momentum vector can’t have just three components; it must also have a time
component, which must also be conserved. Finally, I showed that this time component is closely
related to the familiar concepts of mass and energy. I inserted a factor of c2c2 to give it the
right units, and arrived at the relativistic version of energy conservation.
The implications of relativistic energy conservation are plentiful and astounding.
In the case v=0v=0, the equation E=mc2E=mc2 tells us that the mass of any object
is effectively a measure of its total energy content when it is at rest; the factor of c2c2 merely
converts between mass units and energy units. But look at the numbers! A 1-kg object at rest has
a total energy of
E=(1 kg)(3×108 m/s)2=9×1016 J,(29)(29)E=(1 kg)(3×108 m/
s)2=9×1016 J,
or a little over 20 megatons (of TNT explosive equivalent). It’s probably a good thing that
nobody knows a quick and practical way to convert all this energy to other forms!
Several of the exercises below involve decays and other reactions of subatomic
particles, so I’ve gathered the relevant masses and rest energies into the following
table. Note that in atomic and nuclear physics it is common to measure masses in
atomic mass units, also called daltons (abbreviated u or Da), where 1 u is the
approximate mass of a proton or neutron, and is defined as exactly 1/12 the mass of a
carbon-12 atom. It is also customary to measure energies in electron-volts (eV), where
1 eV is the energy that a one-volt battery gives to each electron it pushes around a
circuit, 1.60×10−191.60×10−19 joules. The table gives rest energies in MeV,
where M (mega) is the metric prefix for 106106. For comparison I’ve also included
masses in kilograms.
Particle
Symb
Mass (kg) Mass (u)
mc2mc2 (M
ol eV)
Photon γγ 00 00 00
Neutrino νν ∼0∼0 ∼0∼0 ∼0∼0
Electron/ 9.11×10−319.11×1 0.000550.00
positron
e∓e∓ or β∓β∓ 0.510.51
0−31 055
1.88×10−281.88×1 0.113430.11
Muon μ±μ± 105.66105.66
0−28 343
2.41×10−282.41×1 0.144900.14
Pion (neutral) π0π0 134.98134.98
0−28 490
2.49×10−282.49×1 0.149830.14
Pion (charged) π±π± 139.57139.57
0−28 983
1.67×10−271.67×1 1.007281.00
Proton pp 938.27938.27
0−27 728
Neutron nn 1.67×10−271.67×1 1.008661.00 939.57939.57
0−27 866
Estimate the mass increase of a cup of tea as its temperature increases from
room temperature to the boiling point. (It takes 4.2 J of energy to raise each
gram of water by one degree Celsius.)
Although the metric equation is good news for space travelers who wish to
reach distant stars within a human lifetime, the energy requirements pose a
challenge. Suppose, for instance, that you wish to take a trip on a starship that
travels at 4/5 the speed of light. If you and your luggage have a combined
mass of 100 kg, how much kinetic energy must you (including your luggage)
acquire to reach cruising speed? Convert your answer to kilowatt-hours
(kWh), then look up the current per-kWh cost of electrical energy in your
area. Use these numbers to estimate the cost of a ticket on this starship.
Discuss the assumptions behind your estimate, and the practicality of
interstellar travel.
The sun has a mass of 2×10302×1030 kg and radiates energy at a rate of
about 4×10264×1026 watts. How much mass does the sun lose in each
second? At the rate it’s going, how long would it take the sun to lose one
percent of its mass?
Starting from Avogadro’s number 6.022×10236.022×1023, determine the
number of kilograms in one atomic mass unit (to four significant figures). Use
this conversion factor to check a couple of the entries in the Particle Masses
and Rest Energies table above. Then calculate the rest energy in MeV of a
hypothetical particle whose mass is exactly 1 u, and use this conversion factor
to check a couple of the entries in the table.
Technetium-99m (99mTc99mTc) is an excited, metastable state of the isotope
technetium-99 that is commonly used in diagnostic medical procedures. It
spontaneously decays to ordinary 99Tc99Tc with a half-life of 6 hours,
emitting a photon (gamma ray) with energy 0.140 MeV. By about what
fraction does the mass of the technetium nucleus (or atom) decrease when it
loses this energy? (Hint: The atomic mass number 99 tells you that a mole
of 99Tc99Tc has a mass of 99 grams, where a mole
equals 6×10236×1023 atoms. The kinetic energy of the recoiling Tc nucleus
is negligible, but it’s a nice exercise to verify this using momentum
conservation.)
An alpha particle, or helium-4 nucleus, consists of two protons plus two
neutrons, held together by the strong nuclear force. Notice from the table
above that its mass is somewhat less than the total mass of its constituent
protons and neutrons. How much energy would you need to provide, in MeV,
in order to separate these constituents from each other? (This quantity is
called the binding energy of the nucleus.) What is the binding energy as a
fraction of the alpha particle’s rest energy?
A uranium-238 nucleus decays (into thorium-234, with a half-life of 4.5
billion years) by emitting an alpha particle. If the uranium nucleus is initially
at rest, then the alpha particle comes out at a speed
of 1.42×1071.42×107 m/s. (a) Calculate the kinetic energy of the emitted
alpha particle in MeV. How accurate is the nonrelativistic
formula K=12mv2K=12mv2? (b) To conserve momentum, the thorium
nucleus must recoil as the alpha particle is emitted. What is its recoil speed?
(c) What fraction of the rest energy of the uranium nucleus is converted into
kinetic energy by this decay?
A free neutron (not bound inside a nucleus) is unstable, decaying with a half-
life of about 10 minutes into a proton, an electron, and a neutrino (technically
an antineutrino, but the distinction doesn’t matter here):
n⟶p+e−+ν.n⟶p+e−+ν.
Referring to the table for the needed mass data, determine the total kinetic energy of the products of
this decay. What fraction of the neutron’s rest energy is this?
The nuclear fusion reaction that powers our sun combines four protons and
two electrons to form a helium nucleus, also called an alpha particle, along
with several photons and neutrinos:
4p+2e−⟶α+photons+neutrinos.4p+2e−⟶α+photons+neutrinos.
What is the net gain in kinetic energy during this reaction? What fraction of the reactants’ rest energy is
converted to kinetic energy? (Refer to the table for the needed mass data.)
The neutral pion has a very short half-life of less than 10−1610−16 seconds.
It normally decays into a pair of photons (gamma rays). What are the energies
of these photons, assuming that the pion is initially at rest? Why is it
impossible for a neutral pion to decay into a single photon?
The positron is the electron’s so-called antiparticle, with the same mass as an
electron but opposite electric charge. When a positron interacts with an
electron, they can annihilate into two or more photons:
e++e−⟶photons.e++e−⟶photons.
(a) Prove that there must be at least two photons produced in this reaction: annihilation into a single
photon is not possible. (b) If the positron and electron are initially at rest and just two photons are
produced, what are the photons’ energies?
The LEP collider at the CERN laboratory in Geneva accelerated electrons and
positrons to a maximum energy of 104,000 MeV (104 GeV) each. How does
the energy of such a particle compare to its rest energy? What is the velocity
of such a particle, expressed as a fraction of cc? (Hint: Solve for v/cv/c in
terms of γγ and then use the binomial expansion to determine the amount by
which v/cv/c differs from 1.)
Often an object’s energy and momentum are of more interest to us than its
velocity, so we want an equation that relates energy to momentum and mass,
with velocity eliminated. Show that for any object, this relation is
E2−|p⃗ c|2=(mc2)2,E2−|p→c|2=(mc2)2,
where p⃗ p→ represents the three spatial components of the momentum. Like the metric equation,
this equation tells us to subtract the squares of the three spatial components of a four-vector from the
square of its time component, resulting in a quantity that is frame-independent (a so-called Lorentz
scalar). What does this equation tell us about a massless particle such as the photon?
A charged pion usually decays (with a half-life of 18 ns) into a muon plus a
neutrino. Use both energy and momentum conservation to determine the
neutrino’s energy and the muon’s kinetic energy. (Hint: Use the energy-
momentum-mass relation from the previous problem, rather than working
with the muon’s velocity as a variable.)
A gamma-ray photon collides with a free electron that is initially at rest. This
problem and the next explore some possible outcomes. (a) First use
momentum and energy conservation to prove that the electron cannot simply
absorb the photon (a reaction we would write as e−+γ⟶e−e−+γ⟶e−).
This means that the simplest possible reaction is e−+γ⟶e−+γe−+γ⟶e−
+γ, with a single photon in the final state; this reaction is called Compton
scattering. (b) Suppose that the Compton-scattered photon comes straight
back, opposite to the initial photon’s direction of motion, while the electron
recoils in the initial photon’s direction. Use relativistic momentum and energy
conservation to derive a formula for the final photon energy in terms of the
initial photon energy and the electron’s rest energy mc2mc2. (c) Discuss
what happens in the limiting cases where the initial photon’s energy is much
less than and much greater than the electron’s rest energy.
Repeat part (b) of the previous problem for the general case in which the final
photon’s direction makes an angle θθ with the initial photon’s direction. Now
there are two nontrivial momentum components, which are separately
conserved. You should obtain the Compton formula,
1Ef=1Ei+1mc2(1−cosθ),1Ef=1Ei+1mc2(1−cosθ),
where EiEi and EfEf are the initial and final photon energies, respectively. Discuss the predictions of
this formula for some interesting values of θθ and Ei/mc2Ei/mc2.
If you would like to see special relativity applied and extended in the context of
electromagnetism or gravity, here are some treatments of those subjects that I recommend:
Magnetism, Radiation, and Relativity is a brief set of online materials I created long ago
showing how magnetism is a consequence of length contraction, and electromagnetic radiation
is a consequence of the cosmic speed limit. These materials are at the level of a calculus-based
introductory physics course.
Exploring Black Holes, by Edwin F. Taylor, John A. Wheeler, and Edmund Bertschinger. This
unpublished book (an earlier edition was published but is now out of print) picks up
where Spacetime Physics leaves off, introducing many of the applications of general relativity at
a level that uses calculus but no more advanced mathematics. You can freely download the
book for personal use.
A General Relativity Workbook, by Thomas A. Moore. This is a textbook for a complete
undergraduate course in general relativity, carefully introducing the mathematics needed to
deal with arbitrarily curved spacetime. The workbook format is great for self-study, and the
price is reasonable.
Colophon
I typed this HTML document directly into a text editor (BBEdit). The source file is
human-readable, so you can use your browser’s View Source or Page Source
command to see the (minimal) styling code. MathJax is typesetting the equations. I
created the illustrations in Adobe Illustrator, then exported them in png format at
double the (nominal) displayed resolution for sharpness on high-resolution displays.
To label the illustrations I used LaTeXiT (included in the MacTeX distribution),
exporting the output as pdf with outlined fonts. For the pdf version I manually ported
the document to LaTeX.