Math For Physics
Math For Physics
YORK UNIVERSITY
1.2. Unit vectors. Vectors can be specified by two different methods. One, more
general, method is to specify them by their length, and by their orientation in
space. When one adopts this method it is clear that they can be moved around
in space and remain the same. The second method depends on a choice of coordi-
nate system. If we defines orientations that specify what we mean by horizontal,
vertical, and depth displacements (such as the edges of a rectangular room we find
ourselves in), then we can represent vectors by a triple of numbers:
~r = (x, y, z) . (1.1)
While this representation depends on a choice of what the (x, y, z) axes are (and
therefore appears to be less general), we nevertheless use it, since it is easy to
work with. In physics one prefers the following notation: one defines the axes of
the coordinate system by unit-length vectors î, ĵ, k̂ and uses vector addition to
assemble the position vector according to:
~r = xî + y ĵ + z k̂ . (1.2)
We can work in the plane by assuming z ≡ 0, and illustrate in Fig (1) what we
mean by the three objects in this case, namely the relevant unit vectors î, ĵ, and
the components of the position vector ~r along the axes (x along î, and y along ĵ).
1.4. Vector magnitude. The Cartesian coordinate system defined by î, ĵ, k̂ has
perpendicular axes. The Pythagorean theorem allows us to calculate the length
of a vector given by its components such as in Eq. (1.2). In the plane this is very
obvious from the triangle:
p
r = x2 + y 2 . (1.4)
Note that r is a (positive) scalar. We need to be careful in our notation to distin-
guish between the vector and its magnitude. A physics example: a car is moving
with a speed of 96 km/h. This statement does not say which direction the car
moves in! The statement: a car moves with velocity
~v = 3.0î + 2.0ĵ m/s (1.5)
states specifically the direction of motion. This is a vectorial statement, and
provides a complete understanding of the motion. If this velocity vector does not
change with time, we can understand the motion of the car by tracking the change
in the position and applying this velocity vector for each instant in time.
The speed of the car which moves with the velocity (vector) given by Eq. (1.5)
is obtained by applying the magnitude calculation. We denote the components of
the velocity vector as vx and vy , i.e., ~v = vx î + vy ĵ, and compute in analogy with
Eq. (1.4):
q
v = vx2 + vy2 . (1.6)
4 YORK UNIVERSITY
For our example (1.5) the speed is v = 3.6 m/s. For intermediate calculations you
should keep an additional digit, i.e., it would be prudent to record v = 3.61 m/s.
In fact, the on-line problem-solving environment used in our courses (WebAssign)
assumes that final results are answered with one extra digit (it is not coded properly
for significant digits, and works with a small-percentage window around the ‘exact’
answer to mark your result - sometimes the correctly rounded answer will fall
outside this acceptance window).
1.5. Dot product. Note that you can compare two vectors by their magnitudes.
This is possible, since the magnitudes are real (non-negative) numbers, and these
numbers are ordered. The vectors themselves cannot be ordered. You can have
for two vectors ~r1 and ~r2 the relation x1 > x2 and y1 < y2 , which means that
comparing the two vectors directly is meaningless. You can compare their lengths,
however. A number of physics laws involving vectorial quantities require us to form
products of vectors. The first product, called the scalar product, or dot product
is available in many vector spaces, e.g., in the plane. It associates a scalar number
with any pair of vectors. There are two ways to define it, an algebraic one (in the
Cartesian, or component representation), and a geometric one. Let us start with
the algebraic definition:
~a · ~b ≡ ax bx + ay by . (1.7)
We will avoid dots to denote multiplication of scalars to emphasize that the dot
product is formed from two vectors. The right-hand side of (1.7) is a scalar, which
is a real number without any restrictions (it can be negative). The definition of a
vector magnitude (length) can be traced back to the dot product of a vector with
itself (in which case one obtains the square of the vector length, which is positive).
The dot product (1.7) is easily generalized to more than two dimensions. In this
MATHEMATICS INTRO FOR PHYSICS 5
course we will use at most N = 3 dimensions, but in special relativity one needs
N = 4, and later (in the context of function spaces, Fourier series, etc.) physicists
use higher N , even N → ∞:
N
X
~a · ~b ≡ ai b i . (1.8)
i=1
From the geometric version (1.9) we see that the dot product of two vectors
becomes negative when the angle between them is obtuse (larger than 90 degrees).
To find the angle α we place the vectors tail to tail. A graph of the cosine function
shows that for π2 < α < π the result is negative. A useful geometrical interpretation
of the dot product is that it provides the product of the component of ~b along
~a, which is bcos(α), times the magnitude a. The construction works also the
other way around, since the dot product is symmetric in its two factors. It does
not matter whether the angle α is measured from ~a towards ~b, or the other way
around. This works out, since the cosine function has the symmetry property
cos α = cos (−α). In many other instances we will be careful when measuring
angles, by agreeing that they are positive when increasing in the counter-clockwise
(CCW) direction.
An important physics application of the scalar product can be found when cal-
culating the work ∆W done by a force F~ along some displacement ∆~r. The work
is a scalar quantity and takes into account only that component of the force that
acts along the displacement.
1.6. Vector components and component vectors. We now return to the no-
tion of assembling a vector from its components along the axes to prevent some
confusion. The motion on an inclined plane shown here is a typical problem where
one can simplify matters by a clever choice of coordinate system. In the standard
set-up where î is chosen along the horizontal, and ĵ along the vertical the motion
is two-dimensional, i.e., would require functions x(t) and y(t) for a mathematical
description.
Yet, the motion is one-dimensional along the surface, since the normal force from
the surface cancels the gravitational force component pressing the object into the
surface. Thus, it is clever to choose the direction of motion as one basis vector,
and the perpendicular direction as the other. In order to take advantage of this
trick we need to know how to resolve vectors into components along any possible
direction.
In terms of given î and ĵ any vector ~a can be written as the sum of component
vectors:
~a = ~ax + ~ay = ax î + ay ĵ . (1.10)
Here ~ax and ~ay are component vectors that make up ~a by vector addition, while
ax and ay are scalars, which are called the x− and y−components of ~a. A rather
confusing point is that we might think about ax and ay as being the magnitudes
of ~ax and ~ay . However, this is only true if vector ~a points in the first quadrant
when its tail is at the origin! In general, the components ax and ay can be (and
will be) negative. This leads sometimes to notational confusion: often, when using
trigonometry or geometry to find out relations, we need the magnitudes, rather
than the components, in which case we should, e.g., specify |ax |, or at least provide
a sketch of the situation to make things clear.
MATHEMATICS INTRO FOR PHYSICS 7
The vector components can be found by using the representation (1.10) and
dotting into the equation (on both sides) with î or ĵ (a trick used often in physics,
so get used to it). This is called taking the projection of the equation along î or
ĵ. For example:
î · ~a = î · (ax î) + î · (ay ĵ) = ax . (1.11)
Here we used the properties that î · î = 1, and î · ĵ = 0. Figure out these properties
using both the algebraic and the geometric versions of the dot product!
1.7. Magnitude and direction from the components. The representation of
vectors in terms of components is easy to work with, but has the drawback that
we need to make a choice of axes (basis vectors î, ĵ, k̂), and specify what they are
when describing ~a in terms of (ax , ay , az ). We stated already in Eq. (1.6) that
the Pythagorean in 2d (and the dot product in 3d) let us calculate the magnitude
a = |~a|. How do we get the direction of the vector?
In the plane, and if the vector points into the first quadrant, things are easy.
From geometry/trigonometry we know that
ay
tan θ = . (1.12)
ax
8 YORK UNIVERSITY
Applying the idea of inverse functions, we solve (1.12) for θ. The arctan, or inverse
tan function is implemented on calculators, and formally we write:
ay ay
θ = arctan = tan−1 . (1.13)
ax ax
You should explore this inverse function on your calculator to make sure to un-
derstand that it has nothing to do with 1/ tan x. An inverse functions undoes the
effect of a given function; for another important pair of such functions you can
study ex and ln x.
Things, are, however, a little trickier than just finding the key for arctan on
your calculator. Depending on whether the argument is positive or negative, the
calculator arctan function can only answer in the range 0... π2 , or − π2 ...0, i.e., the
angle will always be in the first or fourth quadrant. Using negative angles to
denote locations in the fourth quadrant is not such a bad thing, in principle (for
significant digits it is much more reasonable to write −0.1◦ , than 359.9◦ ). However,
sometimes vectors do point into the second or third quadrants, and the calculator
is misleading us! Many programming languages provide an arctan function with
two independent arguments to solve this problem. However, in this course you’ll
have to do it yourselves. You have to look at the sign of ax and ay to determine
the quadrant. A negative argument into the arctan function can happen when
EITHER ax OR ay is negative. This happens in quadrants II and IV. The calcu-
lator always goes for quadrant IV, as stated above. You need to add π (or 180◦ )
if ax < 0 and ay > 0. Likewise, the case when BOTH ax AND ay are negative,
your calculator can’t recognize this, since the signs cancel in the division before the
arctan is evaluated. The angle given by the calculator will be in the first quadrant,
and again, you’ll need to correct manually by adding π (or 180◦ ).
It is conventional to measure angles from the positve x−axis as positive in the
CCW sense. Draw a sketch to state your case clearly.
1.8. Vector product or cross product. Midway through the course you will
find this mathematical tool to help you understand three-dimensional space, and
you will learn how to think properly about angular momentum and torque. Later
in magnetism the tool will become indispensable. There is a product of two vectors
which span (define) a plane, that results in a vector, which, however, points in a
perpendicular direction to this plane. The two vectors ~a and ~b will be assumed
to not point in the same (or exactly opposite) directions - otherwise they would
not define a plane. To define this vector (or cross) product ~c ≡ ~a × ~b we need to
specify magnitude and direction. The magnitude is given by
c = ab sin α , (1.14)
where α is chosen such that it falls into the range 0 < α < 180◦ . The latter
guarantees that the above is positive, i.e., describes a magnitude. Concerning the
direction, we have two choices left: perpendicular to the plane defined by ~a and ~b
MATHEMATICS INTRO FOR PHYSICS 9
Figure 5. The right-hand rule uses the first three fingers spread
out, with the middle finger pointing perpendicularly to the plane
spanned by thumb and index finger to define a right-handed three-
dimensional system of unit vectors. Use of the left hand would
make the z− axis point into the screen - avoid this! The middle
finger is curled naturally inward. This right-hand rule is used to
find the direction for the cross product result ~a × ~b, where ~a and
~b are represented by thumb and index finger, respectively. Order
matters here: ~b × ~a = −~a × ~b !!!
2. Calculus
2.1. Preliminaries. Our textbook works at the pre-calculus level. Yet, the course
is supposed to be calculus-based. Why do we do this, and how can we put the
calculus in? There are multiple reasons for our choice. The main reason is that
the physics laws are valid and can be formulated without taking limits. The
formalism of calculus was developed by Newton, Leibnitz and others to derive nice
symbolic results in mechanics. The physics principles, such as Newton’s laws, the
gravitational law, or the laws of electricity and magnetism by themselves have to
be valid when using finite intervals. Finite-interval arithmetic is used commonly
in computer programs to solve problems for which a symbolic answer cannot be
found using Calculus tools. Thus, the pre-calculus version of physics has acquired
some modern validation.
The main problem with so-called calculus based texts is that very quickly they
move from simple ideas such as ‘the slope of a curve is given by the first deriva-
tive’ to ‘calculate the area under the curve’. This happens very early in classical
mechanics, yet in mathematics it is covered later under integral calculus (and not
MATHEMATICS INTRO FOR PHYSICS 11
covered in your typical high-school calculus course). To make things worse, one
needs quickly a new type of integral (line or curve integral to calculate the work
done by a force acting on a moving object in two or three dimensions), then the
moment of inertia (a volume integral with potentially complicated boundaries).
In electricity and magnetism we need another type of integral to measure how a
force field permeates some surface. It doesn’t stop there: we also want to solve
differential equations, and we even write down some differential equations with
partial derivatives (the wave equation) before the course is over. Past experiences
with a deeply conceptual book that is based on calculus (by Randy Knight) have
generated student complaints ‘why are we doing new mathematics all the time?’.
Modern books addressing College Physics (college in the USA is not a commu-
nity college, but really another name for university, usually without a graduate pro-
gram) have found a way to emphasize the important laws without over-burdening
the student with advanced mathematics (which is really covered only in the second
year of university mathematics). The book by N. Giordano matches Knight’s text
in emphasis on the concepts. Very elegantly, it deals with topics such as Gauss’ law
in electricity without the mathematical burden. The present summary of calculus
ideas is provided to put the calculus into the course where it is most appropriate
and straightforward, namely the mechanics part. The first-year student in On-
tario (and most likely elsewhere in Canada) is ready for this, since it really relies
only on high-school calculus as a prerequisite. In a very straightforward way does
this chapter introduce the idea of how area is measured under a curve. It also
reviews some of the trigonometric and the exponential function to highlight their
derivative properties - something that has been eliminated from the high-school
curriculum when grade 13 (or OAC) was finally eliminated.
Let us make just one important comment here: the reason why physicists work
liberally with objects such as ∆t, and even treat the symbol dt as if it were a finite
∆t, is that physics measurements (or physical reality) only exists for finite time
intervals ∆t = t2 − t1 . Below a certain scale, the notion of classical mechanics
begins to fail, the boundaries of a material object become fuzzy, since the laws
of quantum physics take over. It is a beautiful result that in reality one can
reduce ∆t by orders of magnitude when defining the instantaneous velocity of an
object while finding that v( t1 +t
2
2
) = xt22 −x
−t1
1
becomes independent of t2 − t1 , but a
measurement cannot take the limit ∆t → 0. Thus, physical reality cannot depend
on the question whether a limit exists or not. The mathematical convenience of
expressing results, however, does depend very much on this ability.
upon two initial conditions that make the motion unique. For a given time in-
using the displacement vector ∆~r. We can reduce these expressions to known
quantities by using the Cartesian representation:
dx dy dz
~v (t) = î + ĵ + k̂ , (2.4)
dt dt dt
Whether it will be easy to carry out the program or not will depend on the forces
acting on the particle, since they may depend on all three coordinates x, y, z in
some non-trivial way.
2.4. Area under a curve. We showed that the relationship between velocity
and position follows from the definition of velocity; when looking at the velocity
graph vs time, it should tell us what the slope is at any time in the corresponding
position vs time graph. How can we figure out an x(t) graph from a given v(t)
graph?
Let us begin from Eq. (2.1) and solve it for the displacement (while dropping
the avg label by assuming that ∆t is sufficiently small):
∆x = v(t)∆t . (2.6)
Let us assume that the particle is located at time t at position x(t), at which it
has velocity v(t) which will not change appreciably over the short interval ∆t. At
the end of this interval the particle is located at x(t + ∆t) = x(t) + ∆x. From
the graph shown for a linearly increasing v(t) (which corresponds to constant-
acceleration motion for some larger time interval) we can infer that the statement
x(t + ∆t) = x(t) + v(t)∆t (2.7)
MATHEMATICS INTRO FOR PHYSICS 15
establishes that the displacement is given by the area under the curve v(t) for the
segment t → t+∆t. More precisely, the area bounded by the vertical lines t, t+∆t,
by the t-axis and by the trace of v(t) yields the displacement. Eq. (2.7) makes it
clear that we start from a given x(t), and the area law tells us to which position
we advance after the time ∆t. Wait, you might think, there is an inaccuracy here:
the area under the curve is a rectangle plus a triangle, whereas you only accounted
for the rectangle part! The small rectangle is of higher order, it is proportional to
∆t∆v, and will drop out in a limiting process.
The same area-under-a-curve trick will work to get the velocity function v(t)
from a known a(t). Solving for the motion from a given acceleration function
a(t) involves the repeated application of ‘integration’. Let us illustrate this by the
constant-acceleration example known as a ‘physics law’ from high school, which
merely becomes an almost trivial example in university:
We start with known acceleration a = const. The area under the horizontal line
from fixed initial time t0 to arbitrary time t is found to be
Note that the area-law does not tell us what v(t0 ) is! It only tells us how the
velocity changes from the value at time t0 . Since we never use absolute time,
we could set our stopwatch to t0 = 0. Then Eq. (2.8) for a given a value would
represent a family of curves that start at t = 0 at any height v(0) = v0 , and are
straight lines with slope a. We would state the answer as v(t) = v0 + at. The
meaning of v0 is that of an initial condition. We can start the free-fall motion
of a rock by releasing it from our hand (v0 = 0), or by tossing it (up or down),
therefore giving it some initial velocity.
Let us pick one such choice in the graph (with a > 0 and v0 > 0). The area
under this curve should give us the displacement of the particle. The area between
t0 = 0 and arbitrary time t, the t-axis and the graph consists of two parts: a
rectangular piece of area v0 t, and a triangular piece given by ‘one half times base
times height’, i.e., 21 t(at). We arrive at
1
∆x = v0 t + at2 . (2.9)
2
MATHEMATICS INTRO FOR PHYSICS 17
Again, the area law has nothing to say about an initial condition, namely the
initial position x0 = x(t0 ) = x(0), and the full answer becomes
1
x(t) = x0 + v0 t + at2 . (2.10)
2
You can verify that differentiation of (2.10) results in the velocity (2.8), and further
differentiation in the statement that the acceleration is a constant function of time
a(t) = a.
Area-under-a-curve laws appear in other branches of physics in various contexts.
They also appear in mathematical descriptions of biology, economics, engineering,
or other fields where physicists have pioneered a deeper understanding of concep-
tual relationships. The calculus challenge of re-tracing the steps that lead to x(t)
when we have some function a(t) requires the notion of integrals or anti-derivatives.
It is needed when areas can no longer be obtained from simple geometric shapes.
Note that the area formula for a circle is derived in calculus. In high school you
were simply using a given formula A = πR2 (a rabbit out of a hat?).
2.5. Anti-derivative. Given our simple example, where the position was given
by a general quadratic function (2.10), and the velocity by a linear function (2.8)
where we noted that the velocity follows by differentiation of the position function,
we should be able to reverse the process. Thus, in general, when we try to find a
position function that corresponds to a given velocity function, we are looking for
the function whose derivative is v(t). In integral calculus you will find that this is a
hard process, and that simple functions can have complicated (or unknown) func-
tions as their anti-derivatives. Sometimes they are sufficiently important that we
call them ‘special’, study their properties and equip calculators and programming
languages to provide easy access to them.
In integral calculus you prove the important theorem that if f (t) and F (t) satisfy
f (t) = dFdt
, then the area under the curve f (t) from t1 to t2 (between the vertical
lines) is given by F (t2 )−F (t1 ). Note that area is computed with sign here, positive
and negative areas (by being above and below the t-axis) do cancel. In this course
you may have to apply this rule occasionally. In upper-year courses you will use
it on many occasions.
In mathematics you prove what the derivative functions are for some elemen-
tary functions. We need these properties in first-year physics for sine, cosine, the
exponential function (apart from the simple power law). While you should com-
mit some of this to memory, you may be given a table. sin0 = cos, cos0 = − sin,
exp0 = exp, where the prime denotes differentiation with respect to the argument.
If the velocity is given by the sine function, then the position will look like an
inverted sine function (plus a constant).
2.6. Chain rule. The differentiation rule for composite functions is of utmost
importance in physics, yet it was perhaps practiced too little in your mathematics
eduction. A composite function can be denoted as f (g(t)), i.e., your ‘outermost’
18 YORK UNIVERSITY
Figure 11. The trace of the velocity over time for a particle re-
leased by a compressed spring: the particle starts from rest, in-
creases its velocity, reaches a maximum velocity when the spring
is in its natural state (equilibrium), then it starts expanding the
spring and slows down. The area under the curve which yields the
displacement is approximated by a sum obtained from sampling the
particle velocity at ∆t = 0.5 s intervals. The sum of the rectangles
will approach the area under the curve when the sampling rate is
increased, i.e., the interval ∆t will be decreased towards zero. In
integral calculus this area is computed from the anti-derivative.
would be expressed as ∆N = Nfin − Nin , and since Nfin < Nin we see that the
derivative on the left-hand side of (2.17) is negative. In dimensionless units the
function describing this behaviour is
exp(−t) ≡ e−t , (2.18)
and the meaning of the time constant τ = 1 unit in this case is that it takes time
τ for the population to drop by almost a factor of three.
We will use the exponential function to describe how mechanical energy is con-
verted in a real-life oscillator into heat (the energy loss is proportional to the
motional energy), or how electrical oscillations die down due to damping (elec-
trical energy is converted into heat in a resistor), The oscillations themselves are
described by a class of functions that satisfy the law (again in dimensionless time):
d2 f
= −f (t) . (2.19)
dt2
Both the sine and cosine functions solve this equation, based upon their relation-
ships under differentiation: sin0 = cos and cos0 = − sin. The details of this relation
MATHEMATICS INTRO FOR PHYSICS 23
are discussed in the lectures on oscillations, but the basic properties are needed to
understand the mathematical description of circular motion.
powerful relationship, namely Euler’s theorem which shows the relation between
24 YORK UNIVERSITY
the exponential and basic trigonometric functions (which you might have wondered
about in the previous subsection):
eiα = cos α + i sin α (2.21)
2.9. Taylor series. It is possible to verify Euler’s theorem (2.21) by observing
the Taylor series expansions for the three functions involved:
t3 t5 t7
sin t ≡ t − + − ± ... , (2.22)
3! 5! 7!
where the factorial n! ≡ 1 · 2 · 3 · ... · n, and
t2 t4
cos t ≡ 1 − + ± ... , (2.23)
2! 4!
while
t2 t3 t4
exp t ≡ 1 + t + + + + ... (2.24)
2! 3! 4!
The proof of these expansions comes late in your Calculus education, since it is an
application of integral calculus. The meaning, however, is important in physics:
a function can be approximated locally by starting with the function value, and
having some derivative information for the local point t = t0 :
f 0 (t0 )h f 00 (t0 )h2 f 000 (t0 )h3
f (t0 + h) = f (t0 ) + + + . (2.25)
1! 2! 3!
The main idea of the Taylor series (2.25) is to replace a complicated function f by a
polynomial. Eqs. (2.22-24) state that relation for the three ‘complicated’ functions
used in this course using t0 = 0 as the expansion point. Computer arithmetic only
knows elementary operations (add, multiply and their inverses). Calculators and
floating-point units in CPUs use expansions based upon (2.25) to compute numbers
for a given argument t. When modelling nature we use expansions such as (2.22-
25) to simplify laws. An example in this course is when the pendulum equation is
reduced to a simple oscillator for small-amplitude oscillations.
This summary of what will be used in the first-year courses should serve as
motivation to study mathematics ideas in some depth. It is meant as a shortcut
so that one can apply them without being help up by the (sometimes tedious)
process of proving them.