Math Overboard! - (Basic Math For Adults) Part 1 (PDFDrive)
Math Overboard! - (Basic Math For Adults) Part 1 (PDFDrive)
No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or
by any means, electronic, mechanical, photocopying, recording, or otherwise, without the written
permission of the author.
ISBN: 978-1-4575-1481-4
This paper is acid free and meets all ANSI standards for archival quality paper. Printed in the United States
of America
Contents 1 Addition and Multiplication
1.1 The decimal system for whole numbers
1.2 Addition of whole numbers
1.3 Multiplication of whole numbers
1.4 Binary arithmetic
1.5 Decimal-point numbers
1.6 Scientific notation
1.7 The rules of arithmetic
1.8 The number line
1.9 How to learn mathematics
1.10 Review problems
6 Analytic Geometry
6.1 The rectangular coordinate system
6.2 Straight lines
6.3 Circles
6.4 Transformations
6.5 Conic sections
6.6 Review problems
Diagnostic Test
Common Errors
Greek Alphabet
Mathematical Symbols
Index
Acknowledgements
Special thanks are owing to my wife Janet, who set the book in type, using the
mathematical typesetting program TeX. Getting all the equations, graphs and
problem solutions in the right position was a mammoth undertaking.
My daughter Jennifer drew the cover picture. The rear-cover pboto was taken
by Deborah Kieselbach.
Thanks also to colleagues who encouraged me to write and publish Math
Overboard! including Marc Mangel, Michael Mesterton-Gibbons, Peter Rastall,
Jim Zidek and Leah Keshet.
Colin Clark
July, 2012
How to use this book Math Overboard
will help you to raise your level of
understanding of basic mathematics,
from Arithmetic to Algebra, Geometry,
Trigonometry and other topics. You are
assumed to have studied these topics (or
most of them) in school, but like many
former math students you now realize
that your understanding of elementary
mathematics is not satisfactory. You need
help, and this is it!
Math Overboard can be used in various ways. For example, you can look up
a specific topic, or review an entire subject, or even review all of elementary
math. If you’re not sure where to begin, try the following:
1. Do the Diagnostic Test on page 405. This will help to identify the areas you
need to work on.
2. Read about Common Errors in Mathematics, page 411. Understanding
mathematics well enough to avoid these errors is essential for success in
later math and science courses.
3. To decide whether you need to study a specific chapter, start by trying to
solve the Review Problems at the end of the chapter. If you have difficulty
with these problems, you clearly need to work through the chapter.
Math Overboard is based on the principle that to learn mathematics you
must take the trouble to understand it. Many students get into the habit of trying
to learn math by memorization alone. This is a sure recipe for disaster. Working
with Math Overboard will help you to supplement memorization with
understanding. The whole subject will begin to “make sense.” You will develop a
new mental outlook, which will help you immensely in your future studies.
Let me mention one example, adding fractions:
My question is, how can you check whether this is correct? Try to answer this
before reading on.
Because the decimal system is based on ten, we need ten digits (0, 1, 2, 3, 4, 5,
6, 7, 8, 9). The word “digit” also means finger (or toe), of which we have ten.
Well, you know the rest of the story. If humans actually had six fingers per hand,
no doubt our number system would use twelve as its base, and it would need 12
“digits.”
There is one important modern device that uses a base different from ten: the
computer (including the hand calculator). Because they are electronic devices,
computers and calculators use base two. This is called the binary system. Luckily
for us, we don’t have to know anything about binary numbers to use a calculator.
We just punch in a decimal number and the calculator itself automatically
translates it into binary notation, which the user never sees at all. Still, binary
arithmetic is kind of interesting (it’s actually a bit easier than decimal
arithmetic). If you want, you can read about it later.
By the way, you may be wondering about the term “decimal system” – what
became of the decimal point? Well, decimal points will be discussed later on in
this chapter.
Numbers versus numerals
Is 7 a number? Is it the same thing as “seven”? Or If we were to be very
precise, we would say that there are numbers, names of numbers, and numerals.
“Seven” is the English name of the number seven, and 7 is the symbol, or
numeral, for this number. Other names for seven include “sept” (French),
“sieben” (German), and so on. Other numerals include VII (Roman).
But what exactly is the number seven itself? Well, seven is the number of little
circles on the next line.
Or, seven is the number of Snow White’s dwarfs – Happy, Doc, Sleepy, Dopey,
Grumpy, Sneezy, and Bashful. “Oh, daddy, everyone knows what seven is.”
Right, but there is a difference between the number seven, its name “seven,” and
its numeral 7. Just as there is a difference between a chair, the name “chair,” and
the symbol .
If you ask somebody “What’s this?”
they will probably answer “a chair.” They know perfectly well it’s not a chair –
it’s only a crude drawing of one. But we all happily accept the slight inaccuracy,
which is preferable to circumlocution: “That’s a crude sketch that looks a bit like
a chair.” So, in this book we will not hesitate to say “the number 7,” and not “the
number whose numeral is 7.” This should never cause any confusion.
Powers of ten
The phrase “the powers of 10” refers to the numbers
and so on. An easy, organized way to write these numbers is to use exponents.
and so on. (The expression 107 is read as “ten to the seventh.” But for reasons you
can perhaps guess, 102 is usually read as “ten squared,” rather than “ten to the
second.” Also 103 is read as “ten cubed.” Do you see why? Think of a large square
with 10 units on a side. How many little one-unit squares are there? One
hundred, or 102.)
In other words, for any n, the expression 10n (“ten to the n’th”) equals 10 × 10
× ··· × 10 (n times). So how would you write 1 million using exponents? One
billion (i.e., a thousand million)? Answer: 1 million (1,000,000) is 106. One
billion (1,000,000,000) is 109.
When written out in full, 10n becomes 1 followed by n zeros. For example,
102 = 100 (two zeros), 107 = 10,000,000 (seven zeros).
A comment on how to write large numbers, such as 1,000. This book uses
commas between every third digit, starting from the right. Alternative methods
used by other authors are:
Problem 1.1 Actually, we can take the n’th power of any number. For example 32
= 3 × 3 = 9, or 26 = 2 × 2 × 2 × 2 × 2 × 2 = 64. Your problem is, write out the
powers of 2 (i.e. 2n) for n = 1, 2, 3, up to n = 10.
Note: to see the solution to any problem, look at the bottom of the next even-
numbered page.
You can probably see what the powers of 10 have to do with decimal
notation. For example,
(In case anyone is confused, × means “times” and + means “plus.”) In other
words, each digit, according to its position, tells us how many multiples of each
power of 10 are contained in the number. This is the positional system, based on
powers of 10 – in other words, the decimal number system.
Problem 1.2 Think about this number: Write it in a different way. What
would it be in ordinary decimal notation? [This problem could really blow your
mind!]
The number zero
Our decimal system depends critically on the use of the zero symbol, 0. For
example, 204 is two hundreds plus no tens plus four units. All this seems
elementary to us today, but historically speaking, people had great difficulty
accepting the idea of zero. For example, although the Babylonians in 1600 BC
used positional notation (with base 60, not 10), they simply left a blank where we
would put a zero. Greek astronomers by 100 AD did use zeros, but in Western
Europe the decimal system with zeros was not adopted until the 17th century.
But is zero really a number? We have a name, zero, and a symbol, 0, but does
that mean that zero has to be a number? This rather philosophical difficulty
apparently helped to delay the acceptance of decimal notation for many
centuries. The modern view is this: if we want to say that zero is a number, fine.
There’s no law against it. What must be done is to develop a logically consistent
and useful numbering system. When we consider subtraction in Chapter 2, we
will find that zero is indispensable as a number, not just a symbol.
The number 0 can be considered as a counting number – it counts no, or 0
objects. Thus we would have 9 + 0 = 9, and so on. More on this in the next
section.
The numbers 0, 1, 2, 3, … comprise the system of whole numbers. This
system excluding 0, in other words, the system 1, 2, 3, … is called the system of
natural numbers. Other number systems will be discussed later in this chapter,
and in Chapter 2.
Problem 1.3 The main feature of the natural number system is that any given
natural number n has an immediate successor n + 1. Your problem is to specify
exactly the rule for adding 1 to a natural number given in decimal notation, and
to illustrate with examples.
Problem 1.4 Add 627 and 415 (writing one above the other). Note the repetitive
steps. Specify the addition algorithm in detail.
There are several things worth noting about the addition algorithm. (1) The
algorithm includes starting and stopping instructions. (2) It is repetitive. (3) You
can probably perform the algorithm without even understanding what you’re
doing. (Of course, to be able to use the addition algorithm, you have to know
your “plus table:” 7 plus 5 equals 12, and so on. But this was presumably
programmed into your brain in first grade.)
The advantage of any algorithm is that it is repetitive, and “mindless.”
Computers can be (and are) programmed to perform algorithms, and a
computer certainly doesn’t understand what it’s doing. The mindless nature of
algorithms can also be a major educational disadvantage, if mastering algorithms
is stressed over understanding mathematics. Learning an algorithm is easy.
Gaining understanding is more difficult. But both are important – you need to
master the algorithms, and also understand the mathematics.
I once had a remarkable experience of this in an engineering math course I
was teaching. One problem I put on the Christmas exam had two parts; part (a)
used an algorithm from the course, and part (b) asked the students to explain in
plain words what the answer to part (a) meant. Of fifty-two students, 51 got (a)
completely right, but only one had even the vaguest idea what it meant. This
certainly convinced me that students can easily master an algorithm without
having the faintest idea about what they’re doing. (I thought that I had stressed
the meaning of the topic when I taught it; the underlying concept was pretty
basic to the whole course. It was shocking and embarrassing to find that most of
the students had missed the whole point.)
But why does it matter? If a student memorizes the algorithms (techniques)
in a course, he or she will probably pass the course. Isn’t that good enough?
Understanding everything is a lot more work, so who needs it? The answer to
these questions is subtle, in part because the very meaning of “understanding” is
a complicated matter. I will return to this question at various times in this book,
but let’s agree for now that understanding a topic in mathematics includes:
1. Knowing why the particular topic (algorithm, formula, etc.) is true.
2. Knowing how the topic is related to other topics.
3. Being familiar with the uses of the topic, both elsewhere in mathematics,
and in practical applications.
The disadvantages of trying to rely on memorization, rather than
understanding, are:
1. Anything that is memorized without understanding is quickly forgotten, or
incorrectly recalled at a later time. Once fully understood, however, a
mathematical technique or concept is usually remembered for life. The
mental effort exerted in understanding some topic appears to establish
permanent synapses in the brain, much more so than rote memorization.
2. Failure to fully understand the mathematics taught at a particular stage will
often return to haunt the student in later courses. Advanced mathematical
topics often depend strongly on understanding elementary material. For
example, learning calculus requires a secure understanding of basic algebra
and geometry, as well as a mastery of techniques in these subjects.
Problem 1.5 Explain in some detail why the addition algorithm works.
Addition as counting
Both addition and multiplication are closely related to counting. We adults
don’t actually count anything when doing arithmetic, because we memorized our
plus and times tables as children, and learned the algorithms. Nowadays we have
electronic calculators, so perhaps less emphasis needs to be given to teaching
arithmetic skills to children. Does this mean that teachers can stop teaching
arithmetic altogether? I don’t think so.
Every citizen needs to be “number-conscious” (a better word is numerate), in
order to deal with numbers in everyday life, and to have some “feel” for the
meaning of numbers. Arithmetic is as important as ever. But endless drill
exercises aren’t. Better to teach children the practical uses of arithmetic. Most
schools do seem to be going in the right direction these days.
Politicians and salesmen are always out to take advantage of people’s
innumeracy. Why does every price end with 9? A bottle of milk is $1.99. Why
not call it $2? Most ads for new cars these days only list the monthly lease charge,
not the price of the car. I can’t believe anyone would ever lease a new car. It’s
much more expensive than buying the car outright, even if you have to borrow
from the bank. But many people don’t, or can’t do the arithmetic to compare
these options. They couldn’t even do it using a calculator, perhaps.
Not paying off your monthly credit card promptly and completely is another
way to throw money away. Wouldn’t a decent school system use such examples
to illustrate the uses of math?
But let’s get back to pure math for a while. We need to discuss the “rules of
arithmetic.” One of these rules is
Rule 1 is true because of the basic relation between addition and counting.
What is this relation? If you ask a seven-year-old to add 3 and 5, she (or he) may
count on her fingers 1,2,3,1,2,3,4,5 and then recount up to finger 8. Now ask her
to add 5 and 3. Why does she think she gets the same answer?
What I’m leading up to is this. We need to formulate a basic definition of a +
b, the sum of a and b. (I’ll talk later about the role of definitions in mathematics.
Few students graduating from high school seem to have any idea how utterly
important definitions are. This seems to be part of the “memorize, don’t try to
understand” syndrome.)
But the total set of objects is the same for both cases. Therefore you get the same
result. This is an instance of a fundamental principle.
That’s all there is to Rule 1. Thus a + b means first count a objects, then b
more. And b + a means count the b objects first, then a more. You always will get
the same total (unless you make a mistake). No doubt your grade one teacher
explained this very well, and you’ve never thought about it since. Okay, what
about the next rule?
What does this mean, and why is it true? First, what about the brackets, ()? In
mathematics, brackets specify the order of performing multiple operations, with
quantities inside brackets being calculated first. Thus, 4 + (5 + 6) means 4 + 11,
which becomes 15. (The brackets can be removed when the calculation inside
them is finished.) This is a different calculation than (4 + 5) + 6, which means 9
+ 6, which turns out to be 15 again.
Now we can see why Rule 2 is valid. We have three sets of objects, one with a
objects, one with b, and one with c. Expression a + (b + c) says first count the b
and c objects, then continue on with the a objects. Expression (a + b) + c does it a
different way. But the counting principle tells us we have to get the same answer
both ways.
Rules 1 and 2 are second nature to most people. But think how convenient
they are. More generally, given any list of whole numbers, we could group them
in any way and add the groups in any order, and always get the same sum.
(Before calculators – or even with them – one way to check a long sum was to do
it twice, in different order. If you get the same answer both times, it’s probably
correct. If not, it’s certainly incorrect!)
Rules 1 and 2 may seem pretty obvious. Students rarely make mistakes in
using these two rules. But they do make mistakes with some of the other rules of
Arithmetic (as will be discussed later). Mistakes are easy to make when one relies
entirely on memorization of rules and formulas – because we all are subject to
faulty memories.
In today’s world, mathematical mistakes can have drastic consequences.
When a nurse makes a mistake in dispensing a dose of medicine, a patient may
die. When an engineer makes a mistake in calculating a safety allowance, a
building may collapse. Mathematical education must emphasize methods for
eliminating mistakes, and this is a leading principle of Math Overboard!
Here, then, are some fundamental ways of avoiding mistakes.
Problem 1.6 Rule 2 says that brackets aren’t needed in addition. We can simply
write 2 + 15 + 8, without any brackets. To see that brackets are sometimes
needed in other situations, calculate 2 × (15 + 8) and compare it with (2 × 15) +
8.
Here a × b is read as “a times b.” For example, 3 × 5 = 15, as you can check by
actually counting 3 groups of 5 objects:
There are several ways of expressing the product of two numbers a and b:
Problem 1.7 See if you can think up a good explanation for Rule 3. If not, please
read the explanation given in Solution 1.7.
Next, how would we deduce the Associative Law of Multiplication? Here we
have three numbers a, b, and c, so we need to think of a stack of objects (for
example, children’s square blocks), as shown. Then a × (b × c) equals the total
number of blocks in a layers, each containing b × c blocks (left diagram). On the
other hand, (a × b) × c equals the number of blocks in c “slices,” each containing
a × b blocks. No matter how you arrange to count these blocks you get the same
answer. This proves Rule 4.
Rules 3 and 4 tell us that to multiply three numbers, for example on a hand
calculator, we can group them and multiply them in any order. This is similar to
Rules 1 and 2, which tell us we can add numbers in any order. Because of these
facts, we don’t need to use parentheses when writing sums or products. Thus 21
+ 105 + 273 is unambiguous, and so is 21 × 105 × 273.
Our next rule, which combines addition and multiplication, is also explained
by the counting principle.
Note that, according to the convention that a × b can also be written as ab,
Rule 2 could be stated as
Let’s start with an example: 3 × (4 + 9) = 3 × 13 = 39, whereas (3 × 4) + (3 ×
9) = 12 + 27 = 39. Try another example for yourself.
Before explaining the reasoning behind Rule 5, let us consider the notation
involved. The left side, a × (b + c), is straightforward; to calculate this, first you
add b and c, then you multiply the result by a; see the example. (Recall the rule of
brackets: operations inside brackets are performed first.) The expression on the
right side, (a × b) + (a × c), also uses brackets. First you do the multiplications,
then the addition.
Now for the explanation of Rule 5, which once again is based on the counting
principle. First, consider a × (b + c). By the definition of multiplication, this is
the total number of objects in a groups of (b + c) objects each. The diagram
shows the example 3 × (2 + 4).
One way to count up these dots is in groups: first 3 × 2, then 3 × 4, and add.
This is what the symbols (3 × 2) + (3 × 4) mean. A second way to count the dots
is to first count one line (2 + 4), then combine the 3 lines, getting 3 × (2 + 4). By
the counting principle, these are equal:
We don’t even have to calculate the numerical values to know that the two
results will be the same.
I hope you see that the above argument could be used to prove that Eq. 1.6 is
true for every possible case. The special example used here, 3 × (2 + 4), is strictly
representative of the general case, a × (b + c). We can therefore conclude that in
general
Multiplying by 10
How do you multiply a given number by 10? How much is 763 × 10? Answer:
7,630 – you just put a zero on the end of the number. Can you explain why this
works? Take a moment to think about this before reading on.
It has to do with positional notation and powers of 10. Thus
and therefore
(Did you notice the use of the Distributive Law in this argument? If not, look
at it again.) Similarly, to multiply a number by 100 (i.e. 102), you put 2 zeros on
the end of the number, and so on.
Nested brackets
The above calculation used nested brackets, that is, brackets inside other
brackets. How are these handled? Remember the rule of brackets: operations
inside brackets are carried out before operations outside these brackets. This rule
can be applied repeatedly. If one pair of brackets contains an expression that
itself involves further brackets, then the operation inside those brackets must be
done first. For example:
Here, the inner brackets are dealt with, and removed, first. Try this example
Answer: 11 × 13 = 143. This example used two kinds of brackets, round brackets
(), and square brackets [ ]. This makes the whole expression easier to read, but is
not required; the above example could be written as (8 + 3) × ((2 × 6) + 1).
Again, the innermost brackets are dealt with first.
Using brackets correctly is important. Weak students often make errors in
using brackets, and sometimes get the wrong answer because of such an error.
For example, what would this mean:
This means that, in any expression involving both multiplication (×) and
addition (+), the multiplications are carried out before the additions, unless
indicated otherwise by brackets. Thus
However
Here brackets are needed to specify that the addition is done before the
multiplication.
The precedence rule allows us to use fewer brackets than would otherwise be
needed. This in turn makes mathematical expressions easier to read – provided
you are familiar with the rule. However, whenever you are in doubt, the brackets
can be left in. For example, 3 × 6 + 2 looks confusing to me, and I would usually
write (3 × 6) + 2. But I don’t find xy + 2 confusing, and I would never
misinterpret this as x(y + 2).
Another example: where are the “hidden” brackets in the expression 3xy +
2z. Evaluate this expression for the case x = 2, y = 5 and z = 4. (Answer: 38.)
Would this be the same as 3x(y + 2z)? (Answer: no, the latter evaluates to 78 in
this case.)
Problem 1.9 Remove all brackets: (a) (3x)(y + 4); (b) (7p)(4 + 2q).
The multiplication algorithm
The multiplication algorithm that most people use is illustrated by the following
example, for the calculation of 67 × 3:
You probably do the carryover in your head, and normally do such a simple
multiplication on one line: 67 × 3 = 201. This calculation is again based on the
powers-of-ten, positional (decimal) notation:
As in doing addition, you start with the units column (7 × 3 = 21) and keep
track of the carryover. Do another example or two for yourself.
A more complicated example is done in the same way:
If you’re like me, you hardly ever do long multiplications any more, what
with hand calculators being readily available. But I think it’s worthwhile knowing
how to, and also understanding why the old school algorithm is valid. It’s all part
of acquiring a firm foundation for all your mathematical knowledge. I’m not
embarrassed to be rusty at multiplication, but I would be embarrassed if I had no
idea how to do it by hand, or why the hand method is valid.
1.4 Binary arithmetic
You don’t have to read this section if you don’t want to; it’s for curiosity’s sake.
However, understanding the binary number system can strengthen your
understanding of the decimal system. Besides, it’s fun. The good news is that
binary arithmetic is much easier than decimal arithmetic. The bad news is that
the binary representation of a given number is longer to write down than the
decimal representation. (Because this section is for really keen readers, I’ve
condensed the writing style quite a bit. You’ll have to figure out some of the
details on your own.)
The binary number system is much like the decimal system, except that it
uses base 2 instead of base 10. Consequently it uses only two symbols, 0 and 1.
Every whole number can be written as a string of 0s and 1s. For example, the
binary number 10110 is the number twenty-two, i.e. 22 [in base 10], written in
binary form. The symbols 0 and 1 in a binary number are called “bits.”
How do we know what the number 10110 is? Well, reading from the right,
the bits are the units bit, the two’s bit, the four’s bit, the eight’s bit, etc. This
means that
We have special English names for some of the powers of ten, so we can read off
any base-ten number. Thus 9,536 becomes “nine thousand, five hundred and
thirty-six.” We don’t have names for the powers of 2, however, because nobody
uses base-two numbers in everyday life. Thus 10110 is just “one oh one one oh
(base 2).”
Problem 1.10 Write the numbers from one to ten in base two.
Multiplying by two. How do you multiply by two in binary notation? Figure this
out for yourself, and compare with multiplying by ten in decimal notation.
This ruler has unit length, and we label the ends 0 and 1. Now we mark off the
ruler into ten equal lengths. Each little segment has length (one-tenth) of a
unit. We can label the points and so on.
Thus, the point labeled is three-tenths of the distance from 0 to 1. To be
and and so on. Now we have our ruler neatly labeled, using decimals.
Although this picture doesn’t show all the decimal labels, you can easily locate
any number such as 2.7 on this ruler. You also realize that 2.7 means two and
seven-tenths:
Notice that a whole number can also be considered to be a decimal point
number. For example, 3 = 3.0.
Addition. For simplicity, let’s stay with single-digit decimals a bit longer; the
principle is the same for all decimal-point numbers. How do we add two such
numbers, for example 1.2 + 2.5? Nothing could be simpler. First add the decimal
parts, then the whole-number parts, so 1.2 + 2.5 = 3.7. In terms of tenths
The new algorithm is this: line up the decimal points, and use the usual
addition algorithm.
The same addition algorithm still implies: line up the decimal points and add
as always. For example
This just keeps track of thousandths, hundredths, tenths, units, tens, and so
on.
Problem Find 261.09 + 77.62. Check using your calculator.
Powers of Ten
Perhaps you recall our discussion of the powers of 10, and how this relates to the
positional decimal notation for whole numbers. For example,
The same powers of ten scheme applies also to decimal numbers, once we use
minus exponents:
which clearly shows the positional significance of all the digits, both before and
after the decimal point. This is our modern decimal system as used throughout
the world today.
Multiplying or dividing by 10
Earlier we explained that, to multiply a whole number (written in decimal
notation) by 10, we simply tack a zero on the end of the number. For example,
672 × 10 = 6,720. Each digit of the number gets “upgraded” when we multiply by
10. The 1s digit becomes the 10s digit, while the 10s digit becomes the 100s digit,
and so on. The new units digit is 0. Read the discussion of Multiplying by 10
again if you don’t remember this.
A similar thing happens when we multiply a decimal point number by 10.
Each digit gets “upgraded” by 1. Example:
(Do you see why 10−2 × 10 = 10−1 on the second line? This says that 10
hundredths equal one tenth. That’s exactly what hundredths are: ten hundredths
make one tenth.)
The rule for multiplying a decimal point number by 10 is: shift the decimal
point one digit to the right. Thus 4.18 × 10 = 41.8. This amounts to upgrading
each digit by one. Actually, the same rule works for whole numbers, if we
remember that a whole number is also a decimal number. For example
Problem 1.12 What is the rule, in terms of shifting the decimal point, for
multiplying by 100? by 1000? Give examples.
If multiplying a number by 10 amounts to upgrading its digits by 1, which
means shifting the decimal point one position to the right, what does dividing by
10 do? You’re right – it “downgrades” each digit by 1, which means that the
decimal point is shifted one place to the left. For example, 37.4 ÷ 10 = 3.74. We
discuss division in Chapter 2, but this particular case is easy to describe:
First, check that this is correct: to multiply 3.5 by 108 we shift the decimal point 8
places to the right, which puts in 7 zeros after the 5, as above. “Scientific
notation” means that a number is written in the form
This can be done by shifting decimals, but the following is probably easier:
Similarly
(You may have never before seen commas after the decimal. This is because
scientific notation is usually used for such numbers.) In cases like this, scientific
notation is much easier to read than normal decimal notation.
Problem 1.14 Express the U.S. debt for June 7, 2010 in scientific notation. Also
the distance to Andromeda.
If you have a scientific calculator, it can use scientific notation, using the E-
symbol. For example 2.6E5 means the same as 2.6 × 105. But you can also do the
calculation on your non-scientific calculator. For example, let’s calculate the per
capita federal debt in the U.S. The population is 308,000,000, or 3.08 × 108.
Therefore
You can use your calculator, for the number parts, and keep track of the powers
of ten yourself:
In June 2010 every U.S. citizen was in debt about $42,400, on behalf of the federal
government. If you’re married with 2 children, your family’s share of the debt is
$169,600. No wonder taxes are so high!
The interesting thing about this calculation is that it is virtually impossible to
do at all if you don’t understand scientific notation, but quite easy if you do. A
rough calculation might be just as informative: the debt was about 1.2 × 1013 and
the population about 3 × 108. Therefore the per-capita debt was about $.4 × 105,
or $40,000. Close enough. Again, you can’t even do this rough calculation if you
don’t know scientific notation. (Division is discussed fully in Chapter 2.)
Problem 1.15 Try to “guesstimate” the total value of all residential property in
the U.S. (or in your own country).
I hope you are convinced that scientific notation is useful, and not just for
scientists. I think it should be taught in school. Then newspapers could start
using scientific notation, both in the financial columns and in articles about
science (fat chance?). Large numbers in the paper are usually stated in words -
million, billions, trillions, and so on. For your information, here is what these
words mean, in base-ten and scientific notation.
There are more such names, for example, a quadrillion is 1015, but these are
seldom used. Billions, or trillions are large enough to describe such things as the
world’s population, and national debts. (By the way, to make matters even more
confusing, the British use the word billion to mean 1012, not 109. Universal use of
scientific notation would remove this confusion.)
Numerical precision
Mathematically speaking, the decimal number system is capable of arbitrary
precision. In real life we can never obtain (and never require) unlimited
precision. For example, no one even knows what the exact population of the U.S.
is at any given moment. The census can’t track every last American. Some people
are traveling, others are dying, others are being born. Anyway, the census is only
taken once every 10 years, so there’s additional uncertainty between censuses.
When a number is written in scientific notation, the number of digits is
called the “number of significant digits.” For example, the U.S. population figure
3.08 times 108 has 3 significant digits. If you read somewhere that the latest figure
for the U.S. population is 308,409,618, be suspicious. This figure is bogus – the
population can’t be known to the last person.
Listing the U.S. population as 308,409,618 gives a false sense of precision.
Better to simply say that the population in December, 2009 was approximately
308 million.
Similar comments apply to almost every instance of numerical data,
including financial data, scientific values, and so on. The precision with which
such values are listed should give some indication of the accuracy with which the
values are known.
Next, what about addition and multiplication, in terms of significant digits?
How much is
The answer may surprise you: 4.21 × 106. Purely mathematically, you would
get 4.21033 × 106, but this is over precise. To leave the answer in this form would
imply a precision of 6 significant digits, whereas the original data are only precise
to 3 digits.
Multiplication is treated in the same way:
Your calculator will give 4.21 × 3.3 = 13.893, but this is once again over precise,
and should be rounded off to 13.9. (“Round off to 3 figures” means to replace
13.893 by the closest 3-digit number, 13.9. Scientific calculators can be set to
display a given accuracy, and will then automatically round off all results to this
accuracy.)
Remember, avoid over precision when working with actual data, which are
always of limited precision. And be suspicious of data or statistics presented with
high precision. In many cases they will have been calculated without due
attention to the limits of precision involved. Calculators and computers can give
answers with many digits, but some of these digits will not be meaningful in
terms of the known precision of the input numbers.
Orders of Magnitude
Scientists and other people sometimes use the term “orders of magnitude.” To
say that A is about 3 orders of magnitude larger than B just means that A is
approximately 103 × B. For example, let’s compare the population of China with
that of New York City. In rough figures, China has about a billion (109) people,
and New York about ten million (107). Therefore the population of China is
about two orders of magnitude greater than that of New York, or about 100
times as large.
Another example: compare the speed of light to the speed of sound. At sea
level, sound travels about 300 meters/sec; the speed of light is 300,000 km/sec.
Thus we are comparing (in meters/sec)3 × 108 with 3 × 102, so that the speed of
light is about six orders of magnitude greater than the speed of sound at sea level.
Problem 1.16 (a) Using a calculator, find 1.76 × 35.3, rounding off appropriately.
(b) A chickadee weighs about 10 grams, and a swan about 10 kilograms. By how
many orders of magnitude does the swan’s weight exceed the chickadee’s? (One
kilogram equals 1,000 grams.)
Fortunately, the five Rules of Section 1.1 remain valid for all decimal
numbers. As a reminder, here they are again:
Here the five rules are written in the simplest possible way. For example, ab
means a × b. Also the expression in Rule 5, ab + ac means (a × b) + (a × c). This
is in accord with the Precedence Rule: × precedes +.
Since these rules are important, you should again understand why they’re
true. You remember that, for whole numbers (which are the counting numbers),
each rule is a consequence of the counting principle. But decimal numbers are
also a kind of counting number. They don’t count units, but tenths, hundredths,
and so on.
For example, consider the number 6.4. This equals which is the same as
that is, 64 tenths. (Why? Well, ten tenths equals 1, so 60 tenths equals 6.
What other kinds of numbers are there? This will be discussed fully in
Chapter 2; the other types include
negative numbers
infinite decimals, either repeating or non-repeating.
All these numbers correspond to points on an extended number line, as will
be explained in the next chapter.
Larger and smaller
The symbol < is called the inequality sign, but we read it as “is less than” or
“is smaller than.” For whole numbers a and b, the statement a < b means that a
occurs before b, in terms of counting. Thus 17 < 19. Also, because decimal
numbers can be thought of as counting numbers, we can readily compare any
two decimal numbers. For example, 2.05 < 2.1 because while
If the whole number parts of a and b are equal, we look at the tenths digit:
If the tenths digits are the same, we compare the 100ths digits, and so on. (Yet
another algorithm!) Example: 2.1409 < 2.141.
In terms of the number ruler (pointing to the right) a < b means that a lies to
the left of b.
Sometimes it is convenient to use the symbol >. Thus a > b is read as a is
greater than b, or also a is larger than b. To be specific,
All numbers considered so far are positive. However, negative numbers will
be studied in Chapter 2, and Rule 2 is not valid if c is negative.
These rules are almost self evident for the case of whole numbers a, b, and c.
For example, Rule 3 says that if a comes before b, and b before c, then a comes
before c. Rules 1 and 2 are equally obvious. To show that the rules are also valid
for decimal point numbers, we remember that a decimal point number can be
thought of as a type of counting number. Therefore the rules also apply to such
decimal point numbers.
Problem 1.17 List all whole numbers x for which 9 < x < 15. (This double
inequality means that 9 < x and x < 15.) How many decimal point numbers
satisfy this double inequality?
Problem 1.18 Using Rules 1-3, show that: if a < b then a2 < b2. (Hint: Use Rule 2
twice.) This might stump you for a while! Assume that a and b are positive.
1.9 How to learn mathematics
Most people find mathematics confusing, at times. This book tries to eliminate
confusion as much as possible. Nevertheless, you the reader may still get
confused occasionally. Everyone who studies mathematics experiences mental
blocks once in a while, something that just doesn’t seem to make sense. This is
not a sign of stupidity! The question is, what should you do to overcome a mental
block? Here are three possible approaches:
1. Forget it, it’s probably not important anyway.
2. Forget about understanding the point, just memorize the result.
3. Take the time to identify the difficulty, and then try to resolve it.
One of the reasons, I think, that many people “drop out” of math is that they
start adopting strategies 1 or 2. Either of these strategies is a sure-fire recipe for
eventual failure. In learning mathematics, any point of confusion must be
eliminated as soon as it occurs. If not, everything that follows on from the point
of confusion will also probably be confusing. Pretty soon the whole subject
becomes incomprehensible – and then hateful.
Suppose that you have encountered a difficulty, and that you wish to adopt
strategy 3. How should you proceed? First of all, you may not even know that
you didn’t fully understand some particular point. The Problems that occur
throughout this book are designed to help you to quickly recognize any points of
confusion. Be sure to pause and solve these Problems. Don’t be too hasty in
looking up the solution, especially if the Problem seems confusing. Try hard to
work out the solution on your own. Then check, and make sure you understand
the given solution. Can you make up another similar problem, and solve it?
Still, you may sometimes encounter a confusing point not related to any
given Problem. While reading this book, you should always be asking yourself
“Do I understand this point?” Hopefully, the answer will usually be a firm “Yes.”
If not, here are my suggestions.
1. Pause immediately. (Well, you might just glance at the next sentence to see
if it explains the problem.)
2. Try to pinpoint exactly what the difficulty is. If it’s an unfamiliar word, try
to find out where in the book the word was first used. Look in the index.
3. If it’s a logical, or “mathematical” question, try to express your difficulty in
simple words. Or, if it’s a certain sentence that doesn’t make sense, find out
why not. Re-read the whole paragraph. Rewrite the sentence another way.
4. Perhaps the problem is one of ambiguity - something that could be
interpreted in two or more different ways. Write out both, or all, reasonable
interpretations. Are they really reasonable? Which, if any, is likely the
intended meaning? (We authors are sometimes guilty of ambiguity, no
matter how hard we try to avoid it. No one’s perfect!)
5. If the difficulty is that the mathematics has become too complicated for you
at this point, see if you can invent a simpler problem based on the same
idea. Try to solve the simpler problem – maybe this will give you the clue to
the more complicated one.
6. If nothing works, put the book aside until tomorrow. Then try rereading the
whole section. (Of course, you can also think about the matter in the
meantime, for example just before going to sleep. I often used this approach
successfully when I was in graduate school.)
Everyone has difficulties with mathematics at some time, even professional
mathematicians. Overcoming these difficulties is a challenge. At first your self-
confidence plunges. You may get angry at yourself. But persevere. Eventually you
will crack it, often with a sudden flash of insight. Then the elation will be
sublime. Or maybe you will kick yourself for being so stupid. In any case,
overcoming a mathematical difficulty is an encouraging experience, one that will
improve your confidence in the future. Don’t ever say to yourself that you can’t
“do” mathematics. Worst of all, don’t ever abandon understanding in favor of
memorization.
The next time you seem to be confused, try the 6-step procedure described
above. Remember, no one is immune to the occasional “mental block” while
studying mathematics. I bet not one student in a hundred ever makes a
conscious effort to overcome mental blocks. No math teacher of mine ever
warned me about blocks, or explained how to fix them. There are countlessly
many ways to misunderstand a given mathematical topic, far too many for a
teacher (or book author) to anticipate them all. Everyone’s blocks are different.
You have to learn to recognize them, and then overcome them. Any time you
seem confused, stop and try to discover what’s confusing you. The six-step
approach works for me, and I recommend you try it.
4. Show how the Distributive Law a(b + c) = ab + ac follows from the counting
principle. (Use a diagram.)
(b) If I told you that 1 mile is 1.60934 km, what would your answer be?
7. Review the basic definitions of addition and multiplication for whole numbers,
based on the counting of objects.
10. [Optional] Computer programmers sometimes use the octal (base 8)) system.
Write out the first 20 whole numbers in octal notation.
11. List in increasing order: 30.7, 84.1, 0.9, 16.5, 0.99.
12. Use the Distributive Law a(b + c)= ab + ac to prove that a(b + c + d)= ab + ac
+ ad.
Note: Solutions to the Review Problems are given at the end of the book.
Solution 1.1 The first ten powers of 2 are: 2, 4, 8, 16, 32, 64, 128, 256, 512, and
1024.
This shows how the 10 in 13 (obtained from 7 + 6) gets “carried over” to the 50,
making it 60. This is how carryovers work in general. You start by adding the
units position because any carryover from there may affect the sum in the tens
position, including the carryover, and so forth.
This argument applies to any addition, and explains the usual way of doing it, as
in Problem 1.4. Try a more difficult example, first using the usual carryover
method, and then writing out the units 10s, 100s, and so on. Use a calculator to
check the answer.
If we count these objects row by row, we are counting 3 groups of 5 objects, i.e. 3
× 5. And if we count them column by column we are counting 5 groups of 3
objects, i.e. 5 × 3. By the counting principle, these must be the same so 3 × 5= 5 ×
3.
(At this stage, some readers may be asking “What’s the point? I know that 3 ×
5 = 15 and 5 × 3 = 15, so they’re the same. Why do I have to draw all those dots?”
What would you say to help such a reader?)
Solution 1.11 The way to do this is to find the biggest power of 2 in the given
number, subtract it, and repeat the process:
Solution 1.12 To multiply a number by 100, shift the decimal point two places to
the right, appending zeros if necessary. Examples: 4.18 × 100 = 418; 6.5 × 100(=
6.50 × 100) = 650.
Solution 1.13 (a) 9.03 × 1, 000 = 9, 030. (b) 9.03 ÷ 100 = .0903. (If you didn’t get
these answers, try writing the calculation out in terms of powers of 10, as in the
text. Then do the calculation by shifting the decimal point.)
Solution 1.14 The debt was $1.3055 × 1013. The distance to Andromeda is 1.2 ×
1018 miles.
Solution 1.15 First, I guess that, on average about 3 people live in each
residential unit. Therefore the number of residential units (houses and
apartments) in the U.S. is approximately 3 × 108 ÷ 3= 108. Next, I suppose the
average unit to be worth about $200, 000 = $2 × 105. Therefore, the total value of
U.S. residential property is something like $108 × 2 × 105 = $2 × 1013 (in words,
20 trillion dollars). You can see that this is about times the federal debt of $13
× 1012, which puts the debt in some perspective. It also suggests that the debt
can’t be allowed to grow much larger, or else Americans will be paying about as
much to service government debt charges as they pay on their mortgages.
Solution 1.16 (a) 62.1. (b) 3 orders of magnitude.
Solution 1.17 The whole numbers are x = 10, 11, 12, 13, and 14. There are
infinitely many decimal point numbers between 9 and 15.
Solution 1.18 If a < b then a × a < b × a (Rule 2), or a2 < ab. Also, since a < b we
have a × b < b × b,or ab < b2. Therefore, by Rule 3, a2 < b2.
Chapter 2
Subtraction and Division
2.1 Subtraction
In popular language, to subtract means to take something away. This is also the
mathematical meaning of the term. For example,
says that “12 take away 5 leaves 7,” although we more commonly say “12 minus 5
equals 7.” If we start with 12 items and then remove 5, there will be 7 items left.
This is the basic definition of subtraction, in terms of counting:
Now, if 5 objects are removed from 12 (leaving 7) then replacing the 5 objects
restores the original number, 12. In other words
We could just as well use this idea as our basic definition of subtraction:
Please check that the statement in Box 2.2 exactly fits the preceding example.
Let me reassure the reader about the use of symbols m, n and p here. Making
sense out of a mathematical statement containing symbols always requires a
special effort. You first need to understand that the symbols stand for
unspecified whole numbers. To comprehend what the statement means, it helps
to pause and make up a few numerical examples, such as
So now you realize that subtraction is intimately related to addition, and Box
(2.2) specifies how.
Many people find it easier to remember Eq. 2.2 if it is written as
This at least keeps the letters in the same order. Also, in Chapter 4 (Algebra) we
will learn about “transposition” in equations, and Eq. 2.2a involves the
transposition of the symbol n across the equals sign.
Problem 2.1 Use Eq. 2.2a to check that (a) 9 − 8 = 1; (b) 92 − 45 = 47; (c)28 − 13
≠ 14.
Problem 2.2 What goes awry with the definition of subtraction if m < n?
Consider some examples.
Remember, first you must do the calculation inside the brackets. Therefore 12 −
(7 − 4) = 12 − 3 = 9. But this is the same as writing
(both are equal to 9). The two minus signs in front of 4 have turned into + 4! In
other words, in 12 − (7 − 4) you first take 4 away from 7 before taking anything
away from 12. From 12 you take away 4 less than 7. It’s exactly the same as
adding 4 to 12 − 7.
At this stage I usually get a bemused “Oh, thanks,” as the new acquaintance
ambles off. No, seriously, this is the explanation. I’m sure it makes sense to you!
Please pause to make sure that you do understand the way the number line
works here. Adding corresponds to moving up the number line, while
subtraction moves down. We will use this idea often in this chapter.
Showing that two minuses make a plus: 12 − (7 − 4) = 12 − 7 + 4
I hope you remember (at least vaguely) how to do this. Also, remember that
you can immediately check the answer by addition: 233 + 82 = 315.
Problem 2.3 Write out the subtraction algorithm (for whole numbers) in detail.
Negative numbers
The “take-away” concept of subtraction doesn’t help if the number to be
subtracted is larger than the number it’s subtracted from. For example, how
much is 12 − 23? The counting idea in Box 2.1 doesn’t apply here. But there are
lots of practical situations where we want to be able to subtract a larger number
from a smaller one. One example is temperature. Financial gains and losses is
another. (See Chapter 3 for other examples)
Negative numbers provide the answer. They are best understood in terms of
the extended number line:
Problem 2.4 Draw (a freehand sketch is OK, or you could use a ruler) a vertical
line, pointing upwards. Locate points 0 and 1 arbitrarily on the line. Now locate
(approximately) the numbers 0.3, −0.8, −2.25 on this number line.
We will call the system of all positive and negative decimal point numbers
(including 0) the full decimal-point number system. This number system
includes the following kinds of numbers:
whole numbers: 0,1,2,3, …
integers: …, −3, −2, −1, 0, 1, 2, 3,…
decimal-point numbers (positive, negative, and zero): ±n.d1d2 …dk
A positive number, for example 1.6, can be written with or without a + sign
(1.6= + 1.6), but we normally omit the + sign. Of course, a negative number
always has the − sign. (Many school textbooks insist that positive and negative
numbers are indicated by small + or − signs written above and before the digits:
+
5.1, −2, etc. These would be called “positive 5.1” and “negative 2.” This notation
is unnecessary, and will not be used in this book. It is almost never encountered
in later math courses, or in science or economics. Some people do say “negative
2” instead of “minus 2.” Use which ever you prefer.)
Using negative numbers, we can now subtract without constraint. For
example, 7 − 10 = −3. You just figure out the deficit. How much is 12.3 − 18.6?
How do you calculate it? What’s the general method? First, calculate 18.6 − 12.3=
6.3, then attach a minus sign. This can be written as
The general method is the same. To calculate a − b (where a and b are positive
numbers):
Problem 2.5 Arrange the following numbers in increasing order: 5.6, −2.1, −8.3,
0, −8.8.
Magnitude
The magnitude of a number is its absolute size, regardless of sign. The symbol |a|
is read as “the magnitude of a” (or else “the absolute value of a”), and |a| equals
a itself if a > 0 (or a = 0), and |a| equals the positive counterpart of a if a < 0. For
example,
In terms of the number line, |a| measures the distance from the point a to 0,
regardless of direction.
Addition
Addition is easy to understand in terms of the number line.
In other words, adding a positive number moves up the number line, whereas
adding a negative number moves down the number line. For example
How about (−8) + 3 or (−8) + (−3)? I hope you get the answers −5 and −11,
respectively.
Problem 2.6 Add: (a) (−5) + (−8); (b) (−5) + 8; (c) 5 + (−8); (d) 8 + (−8). Also
add these in commuted order: (−8) + (−5), etc.
As you can see, it is always possible to add two numbers by referring to the
number line. But we need an algorithm for addition, which will give the correct
sum without needing to bring in the number line. Here is the complete addition
algorithm for adding a + b, for decimal point numbers:
Cases 1 and 2 are not new. Cases 3 to 5, which involve adding a negative number,
are new. Here are some generic examples of these cases:
Case 3: (−9) + (−6) = −(9 + 6) = −15.
Case 4: 4 + (−13) = 4 − 13 = −9.
Case 5: (−20) + 15 = 15 − 20 = −5.
Please pause to check (i) that the answers agree with the up-or-down the number
line prescription given in Box 2.4, and (ii) that the Addition algorithm of Box 2.5
has been correctly used.
You probably figured this out as (8 − 5) − 6= 3 − 6= −3. You could have also
done the calculation as 8 + ((−5) + (−6)) = 8 + (−11) = −3. This illustrates the
associative law.
Problem 2.8 Calculate (with care): (a) (−17) + 6 + (−9); (b) 82 + (−101) + (−5);
(c) (−33) + (−34) + (−35).
Subtraction
Subtraction of general numbers is defined as the inverse of addition.
This says, to subtract b you simply add −b. We already have an Addition
algorithm, Box 2.5, so Eq. 2.9 is all we need for our subtraction algorithm. Again,
check the above generic examples in this light, especially examples (2) and (4).
For example (2):
Problem 2.9 Calculate (a) mentally: 12 + (−15); −25 − (−8); 61 − (+ 20); −33 −
(−33); (b) by calculation: −14.73 + (−8.07); 521.9 − 7, 650.0; 24.75 − (−24.65);
−1.87 + 7.63 − 5.09.
In Chapter 4 you will learn about “transposing” terms across the = sign.
When a term is transposed, it changes sign. Thus, from a + (−a) = 0 we get a = −
(−a), by transposing the term (−a). This is probably how most students think of
“two minuses equal a plus.” There are many explanations for this fact. Yet
another explanation is based on the number line.
Study these two diagrams carefully, as they fully explain the relation of
addition and subtraction in the full decimal-point number system.
Problem 2.10 Calculate (carefully!) (a) −1.8 − (−7.4 − 7.6). (b) −(4.3 − 6.1) + (2.7
− 7.0). (c) 3.5 − (2.6 − (4.2 − 5.8)).
Note how “two minuses make a plus” comes in here. Alternatively, you can think
of the “take away” example given at the very beginning of this Section.
2.2 Multiplication
Our next task is to discuss multiplication for the full number system. What
would we want 6 × (−9) to be? It should be 6 times as negative as −9, do you
agree? Therefore
You just multiply 6 × 9 and tack on a minus sign.
This takes care of multiplying a positive times a negative number. Next, how
about (−5) × (−7)? Well, first (5) × (−7) = −35. We would therefore expect that
(−5) × (−7) = 35. Each minus sign reverses direction on the number line. Two
minus signs, when multiplied, reverse direction twice, which amounts to no
change in direction.
You might be wondering, does this prove that two minuses (when multiplied)
make a plus, or is it just an assumption? The answer is that it is a definition,
motivated by the need for consistency, as in the example.
Our complete definition of multiplication is:
Let’s get some practice using these Laws. First, with numbers:
Problem 2.12 Calculate: (a) −2 × (13 − 6 × 6); (b) (−8 + 3) × (3 − 3) + 7; (c) (−5 −
2) × (−8 + 2).
Be sure you follow these examples. Chapter 4 goes into Algebra in much more
detail.
Problem 2.13 Remove brackets: (a) 2c − 3(c − d); (b) x(y − 2z); (c) (w + 2)(w −
3).
How do we know that the above laws of arithmetic are valid in the full
number system? A professional mathematician (like me) would be able to give a
completely logical proof, but no self-respecting student (like you) would sit still
for it. It’s sufficient that you remember how these rules were justified for whole
numbers, and accept as reasonable the statement that the Laws of Arithmetic are
universally valid. Of course, you might like to check out a few examples chosen
at random. For example, let’s check that 2 × (3 − 5) = (2 × 3) + (2 × (−5)):
(Remember, you could do a thousand examples such as this, but that still
wouldn’t prove that the five Laws of Arithmetic are valid.)
I expect that you consider these equations to be obvious, but from strictly logical
standpoint we should prove them by using the basic definition of multiplication,
Eq. 2.11. Let’s start with the first, 1 × a = a. This is certainly true if a is positive,
or zero (remember that 1 × a is what you get by counting up one group of a
objects). If a is negative, Box 2.11 says to first multiply 1 × |a|, which equals |a|,
and then attach a minus sign:
But when a is negative we have − |a| = a, right? (Try a generic example if you
don’t see this.) This completes the proof of Eq. 2.12.
Equation 2.13 is just a restatement of the first Convention listed above.
To end this section, let’s make sure that you fully understand the important
special result
(This is another of those stumbling points for some math students.) What you
need to remember from now on is that the minus sign (−) can always be
interpreted as (−1) × , and also that two minuses when multiplied always give a
plus. The fact that −(b − c)= −b + c is then apparent, by applying the Distributive
Law: −(b − c)= (−1)(b + (−1)c)= −b + c.
Problem 2.14 Calculate (a) 7 − (12 − 15); (b) (7 − 12) − 15; (c) 7 − 12 − 15; (d)
−7 − 12 − 15.
or sometimes, 15/5 = 3. Both are read as “15 divided by 5 equals 3.” Also, 15/5 =
3 can be read as “15 over 5 equals 3.”
Now, if the 5 groups of 3 candies are counted up together, there will of course
be 5 × 3 = 15 candies altogether. Thus
Problem 2.15 (a) How could you check that 27, 024 ÷ 563 = 48 without using
long division or a calculator: (b) List all numbers n that divide (i.e., evenly) into
30, and find q = 30 ÷ n for each such number. Same for 31.
This is the first sense in which children learn about division. They say that “5
divides into 15 three times.” However “7 doesn’t divide into 15,” or “you can’t
divide 15 by 7.”
Later on, the children are told that you can divide 7 into 15 after all, if you
use “fractions.” Or, as an alternative, you can use “remainders.” Then (“15
over 7”) is an example of a fraction, whereas 15 ÷ 7 = 2 with remainder 1 is an
example of division with remainder.
Are these two types of division related, or perhaps the same? In practical
terms, if there are 15 candies and 7 children, then each child can get 2 candies
and one will be left over. This is quite different from each child getting
candies, which would mean that each child actually got candies. The one
leftover candy is sliced into 7 equal pieces and distributed. However, the two
types of division are closely related mathematically, as we shall see.
First, we consider division with remainder.
Here the condition 0 ≤ r < n means that the remainder r is greater than or
equal to zero and also less than n. Both q and r are assumed to be whole
numbers, or zero. Also, in Box 2.15, n is called the divisor (it does the dividing)
and m is the dividend.
For the candies example we have
In other words, 7 goes into 15 two times with remainder 1. Check that this agrees
with Box 2.15, with quotient q = 2 and remainder r = 1. Another example: divide
53 by 8, with remainder. Answer: 53 = 6 × 8 + 5. To do this division mentally,
you think of the candies example: 8 goes into 53 six times, leaving remainder 5.
Problem 2.17 In terms of Eq. 2.15 what is the condition that n divides evenly
into m? Give an example.
2. Select the first few (left-most) digits of the dividend, until they themselves
make a number ≥ the divisor. Call this number the “first segment” of the
dividend. Here the first segment is 195, because 23 > 19, while 23 ≤ 195.
3. Determine, by trial and error, the largest multiple of the divisor that is ≤ the
first segment. Write this number (always a digit between 1 and 9) on the top
line, above the last digit of the first segment. This is the first digit of the
quotient.
4. Multiply this quotient digit by the divisor, and write the result below the
first segment. Draw a line, and subtract. The number you get must be
greater than or equal to zero, and less than the divisor. If not, you chose the
wrong digit on the top line. (Here 0 ≤ 11 < 23, so it’s OK.)
5. “Bring down” the next digit of the dividend, if there is one. This produces
the next “segment.”
6. Repeat steps 3–5 for this segment, until you run out of digits to bring down.
Each cycle gives another digit (between 0 and 9) of the quotient.
7. The division is now finished. The quotient is on the top line, and the
remainder is on the last line. Thus 1956 = 23 × 85 + 1. Let’s check that the
result is correct.
Problem 2.19 A pirate ship has 54 pirates, including the captain. They capture a
ship carrying a treasure chest with 896 gold doubloons. The rule is that the
treasure is divided evenly among everyone, but the captain also gets any
remainder. Who gets what?
These are all the primes less than 30. You should check mentally that all these
numbers have no proper factors, and that all the omitted numbers do have
proper factors. For example, 8 = 2 × 4, and 21 = 3 × 7, etc. (Non-prime numbers
are sometimes called composite numbers.) Do you see any pattern in the above
list of primes, which would allow you to continue the list without actually
checking each new number? If so, you are probably mistaken – no one has ever
found such a pattern.
Here is one reason why prime numbers are considered interesting:
The product so obtained is called the prime factorization of n. For example:
Fractions
As we pointed out above, children first learn that, for example, “you can’t divide
7 by 3.” But later they learn about division with remainders: 7 divided by 3 equals
2 with remainder 1. Finally, they learn that 7 divided by 3 is a perfectly good
number, denoted by (“seven over three,” or “seven thirds”). This number is an
example of a fraction, that is, a number of the form ,where m and n are natural
numbers. We need to explain how such fractions are interpreted and then to
develop the arithmetic of fractions. This takes some care, so please follow the
discussion carefully. Errors in working with fractions are common among
students who are experiencing difficulty with their college math courses.
Consider a line segment of length 1 unit, that is, the part of the number line
between 0 and 1. If n is a natural number, we imagine dividing this unit interval
into n equal parts. (The figure shows the case n = 3.)
The number (pronounced “one over n”) represents the length of one of
these pieces. We can mark off the numbers etc., as shown. Thus is
the number obtained by moving to the right from 0 by m times the length . In
this way, the number can be thought of as a kind of counting number, which
counts up the total length of m equal “pieces,” each of length . The expression
is called a fraction. Note that m and n are whole numbers (0, 1, 2,…), and n ≠
0. The meaning of is given in:
The number m is called the numerator and n the denominator, of this
fraction. Thus the denominator specifies the size of the “pieces” , and the
numerator specifies how many pieces are included in the fraction . For
example, the fraction is the total length of 17 [numerator 17] pieces, each of
size [denominator 5].
The fact that fractions are, in this sense, counting numbers, means that the
arithmetic of fractions is quite similar to the arithmetic of the counting numbers
(i.e., natural numbers). We will now develop this arithmetic methodically.
First, we have
where q is a whole number, and r < n. (To follow this general argument, I suggest
that you keep track of a numerical example of your own choice.) This last
equation means that
In the case that m < n, the fraction is called a proper fraction. Therefore, to
locate an improper fraction, we use division with remainder to rewrite it as a
whole number plus a proper fraction. (Note that a proper fraction is a number
between 0 and 1.) Next, can you see how to reverse this procedure? How would
you write as an improper fraction? See if you can figure this out before
reading on. The answer is that
Problem 2.23 Express as improper fractions:
Cancellation
Question: locate the fraction on the number line. Can you see a simple way to
express this number? Answer: from a sketch you will see that is exactly half-
way between 0 and 1:
This means that in fact
Similar simplifications hold for many other fractions, because of the following:
Here the letters m, a and b all represent positive whole numbers. The above
example fits this framework as follows:
Here the common factor 3 can be “cancelled out,” and disappears from the
fraction.
Try to do a similar calculation with First, you factor the numerator and
denominator:
by cancellation of the common factor 4. The fractions and are located at the
same spot on the number line.
to explain the Cancellation Law. Consider the number line between 0 and 1,
marked into 15 equal segments.
Next, we group these segments into groups of 5, as shown. Three such groups
make up the whole length, which means that each group has total length
This tells us that
(and also that but we already know this). In other words, the fraction
has 2 groups of 5 little segments in the numerator, and 3 groups of 5
little segments in the denominator, so this fraction equal
The same argument applies in general. Namely, represents a group of m
little segments out of b groups of m little segments, which is the same as
Errors in cancellation
Cancellation is frequently used in Algebra and other topics. Unfortunately,
errors are often made in cancellation. Students are sometimes taught to “strike
out” common factors, as in
I strongly advise you not to use strike-outs, for the following reasons:
1. Strike-outs are easily used incorrectly.
2. Strike-outs make your calculations look messy and hard to check.
3. Strike-outs are completely unnecessary.
For example,
is perfectly understandable. Although strike-out might be OK for this example,
what about
What’s the final result? (I have often encountered this kind of situation while
grading math exams. Often the student doesn’t seem to know what the final
result is!) The answer is that
(because 4 = 4 × 1, right?).
Another common mistake is to cancel numbers that aren’t factors, as in
Of course, you might make this mistake without striking out the 2s, but for some
reason the habit of using strike-outs seems to encourage this kind of error. A
student who has understood the logic behind the Cancellation law is not likely to
make this error.
Let me be explicit:
Problem 2.25 Identify the factors in: (a) 36 × 8; (b) 36 + 8; (c) 36x + 8y; (d) 36(x
+ y); (e) 36xy.
Slash notation
Henceforth we will often write the fraction as m/n. This saves space in
printing. Both are read as “m over n.” (Another possibility is m ÷ n, which we
sometimes use.)
Both are located between 0 and 1 on the number axis, but which is larger? The
answer is not obvious just from inspection. However we can answer the question
by using the Cancellation Law in an unexpected way:
The fractions on the right have the same denominators, so it is now evident that
the second fraction is the larger. The same method works for any pair of
fractions. Try it on 13/6 and 20/9 (answer: 20/9 is larger).
Any two fractions a/b and c/d can be rewritten to have the same
denominators:
Be sure that you understand this, because we will use it again later, when adding
and subtracting fractions.
Thus you see that 20/9 is the larger number. But notice also that both these
fractions appear to involve repeating decimals. We next explain all this.
Note that the pattern now repeats indefinitely, giving the result
To check this, note that
I won’t spend a lot of time on this. It is not an approximation, but an exact result.
One way to explain it is to write
The repetition need not be a single digit, but can be a group of digits, as in
If you will work out this example for yourself, you may understand why the
decimal expansion of any fraction must always repeat, if it doesn’t terminate.
So how can it be proved that the decimal expansion of a fraction m/n must
terminate or repeat? Upon doing the long division, once the same non-zero
remainder occurs a second time, the pattern repeats from then on. Since there
are only n − 1 possible non-zero remainders in m/n,you in fact must either
eventually get remainder 0, or repeat an earlier remainder. For example, to see
how this works, find the decimal expansion of 1/7. How long is the repeating
group of digits?
Problem 2.28 Find the decimal expansion of (a) 5/6; (b) 2/9 by long division.
where the 1s occur in longer and longer groups, separated by single zeros. Since
this is a nonrepeating decimal, it is not a rational number. Such numbers are
called irrational. Another example is (the square root of 2) – we will show in
Section 2.4 that is irrational. So are and so on. To repeat,
rational numbers have terminating or repeating decimals. Any other decimal
number is irrational.
In passing, note that our notion of decimal-point numbers has suddenly
expanded, to include the possibility of infinitely many digits to the right of the
decimal point. These digits may occur in a repeating pattern (in the case of a
rational number), or a nonrepeating pattern (irrational number). The resulting
“real number system” is discussed further in Section 2.4.
Addition of fractions
We next study the arithmetic of fractions – addition, subtraction, multiplication,
and division. Of course, since any fraction is equal to a decimal-point number,
we might dispense with fractions altogether. But because fractional expansions
are widely used in math and science, you need to master the arithmetic of
fractions.
The basic definition for adding fractions is
(I switch to capital letters because this seems to make the formulas easier to
look at.) Here the letters A, B and C represent whole numbers. An example:
Recall that is the total length of 3 little segments of length Hence is the
total length of 3 + 7 or 10 such little segments, which says that
The same argument applies to any example, as in Box 2.21. Another example:
Eq. 2.21 tells us how to add two fractions if they have the same denominators.
But how would you add
If you said go to the bottom of the class! That’s incorrect. Can you think of
some way to use Eq. 2.21 for this case?
Well, we could first rewrite the two fractions so that they have the same
denominator, as explained in the section on Order of fractions:
Therefore
(which could be reduced to This is called the method of common
denominators. It applies to any addition of fractions.
The proof of this formula uses the same method as in the numerical examples.
Now, should you memorize Eq. 2.22? Of course not! Just remember how to
do the calculation.
Also, note that you can easily – and should – check any addition of fractions:
(You can check this in the same way as before.) This might result in a negative
fraction, however. No problem! The combined system of positive and negative
fractions (and zero) is called the system of rational numbers.
Try this method on the rest of Problem 2.32. For example, in (d) you can use the
denominator 24; this is called the least common denominator. In simple cases you
can find the least common denominator by trial and error. But you can always
use BD as your denominator.
Multiplication of fractions
Now, where have we got in developing the arithmetic of fractions? We have
positive and negative fractions including 0, and we can add or subtract such
fractions, always obtaining another fraction.
Next comes multiplication. Here’s the rule:
This formula is easy to remember (“to multiply fractions, multiply their
numerators and multiply their denominators”). It is also very easy to use.
For example
So the foregoing formula for multiplying is easy to use and easy to remember.
But why is it true? How come
for example? The formula, Eq. 2.24, tells us this, but what’s the reasoning behind
this formula?
I now have to admit a little secret. Nowhere in this book have we yet defined
multiplication for anything but whole numbers. If you look back to Section 1.5,
you will find the multiplication algorithm for decimal-point numbers. Did you
understand why this rule was correct? Why is 0.7 ×0.9= 0.63, for example? How
can you base your mastery of math on a complete understanding if your author
doesn’t explain these things? The explanation is now forthcoming. It’s a bit
finicky, and I suggest that you don’t spend too much time on it unless you’re
really interested. But do try Problem 2.24, and continue reading from there on.
The first step is to define what is meant by multiplication of fractions. We
need a definition that makes good sense. Our definition will apply to a special
case:
The graph shows the case n = 5.
The line segment from 0 to a is divided into 5 equal segments, each having
length
For example, suppose that a = m, a whole number. Consider the example m =
8. It happens that
This might seem obvious to you, but keep in mind that these two expressions
actually have different definitions (what is the definition of ?). To prove that is
the same as according to our definition, we need to show that is one-fifth
the distance from 0 to 8, or in other words that 5 segments of length will have
length 8. But the total length is just which equals 8 by cancellation.
Clearly, this example is entirely generic, so we can conclude that
whenever n and p are whole numbers (n ≠ 0). This is a special case of Eq. 2.24
(write to see this).
Did you understand that argument? I admit it’s pretty confusing! It just
seems to be proving the obvious in a non-obvious way. Perhaps you have known
that for many years, so you can’t see the point of “proving” a
particular case from scratch. But mathematics always has to proceed from
carefully stated definitions of basic concepts to usable results, or algorithms. The
multiplication formula, Eq. 2.24, is just that – the multiplication algorithm for
fractions. Like most other algorithms, it is not obviously true. Its truth has to be
deduced from the basic definition.
The last argument applies to any number a, so let us take We then
conclude that
for all whole numbers m, n, p, and q. This is exactly the same as Eq. 2.24 (but
with different symbols, which were used to emphasize that they are whole
numbers).
Dear reader, I offer my apologies for this extremely abstruse discussion. No
elementary math text that I have seen makes any attempt to explain the logic
behind the multiplication formula, Eq. 2.24. So is it essential that you take the
effort to understand this completely? Maybe not, for the simple reason that the
multiplication formula is easy to remember and use correctly. Try the next
problem.
Perhaps you would like to explain why this is incorrect. What is 1/x − 1/2,
actually? Answer, (2 − x)/2x, which certainly doesn’t look anything like 1/(x − 2).
Indeed, if you try some value of x, such as x = 1, you immediately see how silly
the above equation is. The value x = 1 is said to be a counterexample to the
above equation.
Problem 2.35 Use counterexamples to show that the following “calculations” are
incorrect. What is the correct version?
These equations are easy to remember and use, but they take a bit of explaining.
Equation 2.28 is in fact a definition. The expression has not been defined
previously in our discussion. This number would be located at −m times the
length 1/n, on the negative number line, and this is what −(m/n) has been
defined as. As a check, try (−7) ÷ 3 on your calculator (punch in “7 ÷3 =
“), and observe that this equals −(7 ÷ 3). Note: means “Change Sign.”
Your calculator may have a different symbol. The display for a negative number
will have a − sign somewhere.
Next, Eq. 2.30 can be deduced from the Cancellation Law (Eq. 2.20), if we
assume that this law must also hold for negative numbers (a consistency
assumption).
and this is Eq. 2.30. Once again, you can use your calculator to check this
equation.
Finally, Eq. 2.29 can be deduced by a similar argument:
Pause to review
Dear reader, you have almost completed your review journey through
Arithmetic. Let’s pause here to briefly review the voyage so far, from Chapters 1
and 2. The experience has been one of progressive enrichment.
1. Counting, or natural numbers
– addition and multiplication as counting
– the decimal representation of natural numbers (positional significance)
– algorithms for addition and multiplication
– the Counting Principle implies the Laws of Arithmetic
2. Positive decimal-point numbers
– extension of the idea of positional significance of digits
– algorithms for addition and multiplication
– the number line
– Laws of Arithmetic remain valid
3. Integers
– subtraction as the inverse of addition
– integers: 0, ±1, ±2, …
– addition and subtraction; algorithms
– Laws of Arithmetic remain valid
4. Fractions
– division with remainder
– fractions m/n
– location on number line
– addition and multiplication
– Laws of Arithmetic remain valid
5. Rational numbers
– subtraction of fractions
– rational numbers ±m/n
– positive and negative number line
– addition, subtraction, and multiplication
– Laws of Arithmetic remain valid
The one step that is yet to be discussed is division of rational numbers. When
this is completed, we will have a number system (the rational numbers) in which
all four operations, + , −, ×, ÷ can be carried out. (However, there are yet further
numbers on the number line, called irrational numbers. We discuss these
numbers briefly later.)
Division
We will use this definition of division of rational numbers:
Here, is read as “a over b,” or “a divided by b.” This can also be written as
a/b, or a ÷ b. Thus Eq. 2.31 could thus be expressed as
This definition of division is of course familiar in the case that a and b are
whole numbers and b divides evenly into a. For example 8/4= 2 means that 8 = 4
× 2. Now we can extend the definition to allow for a and b to be rational
numbers.
Equation 2.31 or 2.32 just says that “division is the inverse of multiplication.”
For example, consider We have
Can you see what c must be? What can you multiply by, to get Think about
it for a moment.
Answer: because
Any example could be done in this way, which is like guessing the answer.
However, the following formula avoids guessing:
Here, A, B, C, D are assumed to be whole numbers (with B, C, D ≠ 0).
This is the famous invert and multiply rule: To divide by a fraction p/q invert
this fraction and then multiply. (Invert just means to switch the numerator and
denominator.) For example,
Write this as
What has to go into the space (?) to make this work? It has to cancel out the
factor and replace it by Do you get it? In fact
Problem 2.37 Calculate (a) (16/9) ÷ (64/81); (b) (12/13) ÷ (3/13); (c) (11/50) ÷
(−7/150); (d) (12/7) ÷ 6.
The longer line serves to group things above and below it separately. However,
using brackets is usually preferable.
Division by zero
We saw earlier that m/0 is an undefined, or “illegal” expression, if m is a whole
number. More generally, division by 0 is always undefined: a/0 is meaningless, or
illegal, if a is any number. I’m often asked “Professor, why is division by zero
impossible?” My whimsical answer is “Try it on your calculator.” But this is no
explanation. The proper answer is “Well, what is the basic definition of
division?” Remember,
I have included the long-division algorithm for the sake of completeness. I’m
not convinced that every one needs to remember it.
Problem 2.40 Use the Laws of Arithmetic to prove that if ab = ac for some a ≠ 0,
then b = c.
Problem 2.40 indicates another type of cancellation – you can “cancel” the
common factor a in the equation ab = ac (if a ≠ 0). As previously, I recommend
against using “strike-out” in this situation, as it can cause confusion. For
example, suppose you obtain the equation 18x = 6 when solving some math
problem. You write this as So what is x? Better to leave off the
strike-outs, and note that 6 × 3x = 6 implies 3x = 1, so x = 1/3.
Problem 2.40 makes use of a mathematical principle that we use frequently,
without being aware of it. The principle is
For example,
Now, I don’t want to bore you with further technical details. But a very
perceptive reader might ask, how do we know, for example, that the formula for
multiplying fractions
Absolute value
The absolute value (or magnitude) of a real number x is the positive distance of
x from 0, on the number line. This implies that
Here, for any a > 0 the expression designates the square root of a; this is
defined as the positive number b whose square b2 equals a. Thus
For example because 52 = 25. (Many students think that
but this is not how the symbol is used in math.)
Note that Eq. 2.38 says that finding the square root of a positive number is
the reverse operation to finding the square of a positive number. Reverse
operations are common in mathematics.
and this even makes sense if you don’t know what the symbols mean. The real
number system is the basis for much of higher mathematics. For example, the
calculus is entirely based on the real number system.
Repeating decimals
Every repeating decimal is equal to a rational number, that is, a fraction m/n. But
how do you calculate the fraction? Consider the example
We have found the fractional form of x. (A quick check on your calculator will
confirm this.)
Do you see the general method? Can you find the fraction corresponding to
0.4545… in your head, now? Answer: 45/99 = 5/11. Work this out more
carefully, if you don’t see it. Note that you multiply by 100 in this case. In
general, if the repeating group has n digits, you would multiply x by 10n.
In general, a repeating decimal may begin repeating only after some number
of nonrepeating digits, as in
Problem 2.42 Which if any of 151, 153, 157, 159 are primes?
that is, Q is the product of all primes up to N ,plus 1. Then Q is not evenly
divisible by any of the primes up to N , because there is always a remainder of 1.
Therefore, either Q is itself a prime, or it has a prime factor greater than N. In
any case, no matter how many primes we start with, there’s always another,
larger prime.
Factoring
Given a natural number n we can, in principle, determine which primes divide n.
For simplicity we start by considering the smallest primes 2, 3, 5, etc., reducing
the size of the factoring problem whenever we discover a prime factor. For
example
Since 19 is a prime, this is as far as we can go. We say that 684 has now been
factored as a product of primes. We might write it more compactly as
where 22 = 2 × 2, etc.
A natural number that is not a prime is called a composite number. The
composite numbers are 4, 6, 8, 9, 10, etc.
This trial-and-error algorithm can be used in principle to factor any given
whole number. Therefore
Irrationality
We next consider a fact about whole numbers that was known to the early Greek
mathematicians. The fact in question is that numbers like √2 are not expressible
as a fraction m/n of whole numbers. This fact was known to the school of
Pythagoras, and was at first considered to be an embarrassment, more or less on
superstitious grounds. The Greeks were familiar with fractions, which seemed a
good enough number system. Thus was a “bad” number, although it could
easily be realized geometrically, as the hypotenuse of 45° right triangle (see
Chapter 5).
Let m be any whole number. What can we say about the prime factorization
of m2? Consider two examples:
As these examples indicate, any prime that occurs as a factor of m2 must
occuran even numberof times as a factor. For example, 202 has four factors of 2,
while 212 has zero (zero is an even number).
We can now give Euclid’s proof that is irrational. Suppose on the
contrary that is rational, in other words that = m/n for certain whole
numbers m and n. Then 2 = (m/n)2 = m2/n2, and this means that
If we now consider the prime factorizations of m2 and n2, we see that m2 has an
even number of factors of 2, but 2n2 has an odd number of factors of 2. Therefore
we cannot have m2 = 2n2 after all. The assumption that is rational
has led us to a contradictory conclusion. (This type of argument is called
“reductio ad absurdum” in Latin – reduction to an absurdity.) Therefore we
must conclude that is irrational.
To check whether you followed this argument, try modifying it to prove that
is irrational. What happens if you use the argument to try to prove that is
irrational? Also, prove that (the cube root of 2) is irrational. Here’s how to
start: suppose = m/n. Then 2 = (m/n)3 = m3/n3, so that m3 = 2n3. Why is this
an absurdity?
The Greek mathematicians were also interested in proving, or disproving,
that π is rational, but this problem defeated them. It was finally proved in the
19th century that π is irrational. One sometimes reads in the paper the “amazing
fact” that the decimal expansion of π will never repeat. This is true (because π is
irrational), but don’t the journalists realize that the decimal expansion of also
never repeats? The latter fact is fairly easy to prove– indeed, we have just proved
it.
Errors in arithmetic
We have finally completed our review of Arithmetic. The subject is quite
complicated. Getting through Arithmetic takes many years of schooling. Having
completed Arithmetic, students next study Algebra. Unfortunately, errors are
often made in Algebra. These errors, I believe, are usually the result of an
incomplete mastery of Arithmetic. Algebra is direct follow-on from Arithmetic,
so any gap in one’s training in Arithmetic is likely to cause errors in Algebra.
At this stage, you may wish to look at the two final sections of this book. Try
the Diagnostic Test, and read about Common Errors. These sections can help
you decide whether you still need to study parts of Chapters 1 and 2 carefully,
before going on. Likewise, if any of Chapter 4 (Algebra) is not entirely clear to
you, come back and study Arithmetic meticulously.
Why is this true? The answer is, first that we can write both fractions using a
common denominator:
Second, fractions having the same denominator are added simply by adding the
numerators. Hence the above formula is valid.
We also remember that this argument can be run backwards, as a way of
checking. For example, if asked to add you would get and this can be
checked immediately:
Knowing all this should prevent anyone from making certain types of mistakes.
Here is another question related to understanding a math formula:
– Are there interesting special cases?
In the multiplication formula, suppose C = D. Then, because for sure,
the formula says that
6. Remove brackets and simplify: (a) 3x − 2(x + y); (b) (a − b)(a − b).
7. Divide, and obtain the quotient q and the remainder r: (a) 207 by 2; (b) 71 by
11; (c) 1897 by 222. (Use a calculator.)
8. Show that if n divides m, and m divides p, then n divides p (divides” means
“divides evenly into”). Start by considering an example.
9. Find the prime factorization of (a) 186; (b) 1611 (this is divisible by 9). Use a
calculator if this helps.
10. Specify where the following numbers are located on the number line: (a)
83/6; (b) 105/15.
11. Find the decimal expansion of 4/9 by long division. Check by calculator.
12. Use your calculator to determine which reciprocals 1/n (n = 2, 3,…, 20) have
terminating decimals, and which have repeating decimals. Can you guess the
general rule?
13. Express as a single fraction in simplest terms: (a) 35/8 + 7/12; (b) 20/7 − 13/4.
Solution 2.3 The problem is to calculate m − n, where m > n. The algorithm is:
1. Write m above n, aligning digits on the right, as usual. Draw a line.
2. Start with the units digit.
3. Subtract the lower digit from the upper digit, if this is possible, and write the
result below the line, in the same column. If, however, the lower digit is
bigger than the upper digit, “borrow” 1 from the next upper digit, and add
10 to the current upper digit. Now subtract as before. The “borrowed” 1
decreases the next upper digit by 1 in the next step.
4. Move to the left one column, i.e., to the next digit.
5. Repeat steps 3 and 4 until finished.
Observe the typical algorithmic nature of these steps: starting instruction, a
repeated procedure, and a stopping instruction. See Section 1.2. An example:
Solution 2.6 (a) −13; (b) 3; (c) −3; (d) 0. You get the same answers after changing
the order.
Solution 2.7 (a) −11.6; (b) 3.7; (c) −6.2.
Solution 2.9 (a) −3; −17; 41; 0; (b) −22.80; −7, 128.1; 49.4; 0.67.
Solution 2.10 (a) −1.8−(−7.4−7.6) = −1.8−(−15.0) = −1.8 + 15.0= 13.2. (b) −(4.3
− 6.1) + (2.7 − 7.0) = −(−1.8) + (−4.3) = 1.8 − 4.3= −2.5. (c) 3.5 − (2.6 − (4.2 −
5.8)) = 3.5 − (2.6 − (−1.6)) = 3.5 − (4.2) = −0.7.
Solution 2.15 (a) You could check, by a pen-and-paper calculation, that 563 × 48
= 27, 024. (b) The numbers that divide into 30 are n = 1, 2, 3, 5, 6, 10, 15, and 30.
The corresponding values of q = 30 ÷ n are q = 30, 15, 10, 6, 5, 3, 2, and 1. For 31,
the only values of n are n = 1 and 31, with corresponding q = 31 and 1.
Solution 2.16
Solution 2.17 n divides evenly into m if and only if the remainder r equals zero.
Example: 3 divides evenly into 24, because 24 = 3×8 (zero remainder).
Solution 2.19 We have 896 = 54 × 16 + 32. The crew members each get 16
doubloons, and the captain gets 48.
Solution 2.22
Solution 2.23
Solution 2.25 Factors are: (a) 36 and 8; (b) no factors; (c) no factors; (d) 36 and
(x + y), or you could say 36 and x + y; (e) 36, x and y. (Note: some of these
expressions could be factored further. For example, 36×8= 25 ×32. But the
“explicit” factors are as listed here.)
Solution 2.26 8/53 is larger.
Solution 2.27
Solution 2.28 (a) .833 … (repeating); (b) .22 … (repeating).
Solution 2.29
Solution 2.30
Congratulations if you got this right.
Solution 2.31
Solution 2.32
Solution 2.33
Problem 3.1 Find (a) 31.2 × (16.2 − 2.7). (b) 16 ÷ (21 − 7). (This may be
confusing.)
Avoiding errors
By now you realize how easy it is to make mistakes while using a calculator. And
this can be important. Sometimes I’ve been presented with outrageously wrong
bills by store clerks who made calculator errors. (Usually I don’t consider these
errors to be deliberate. One exception was a transaction involving a foreign
currency – always a confusing situation.) The two most common errors are:
entering a number incorrectly, and doing the wrong calculation (as in some of
the above examples). To avoid the first error, be sure to check each number on
the display, as soon as you’ve entered it and before pressing any operation key.
To avoid doing the wrong calculation altogether, first make sure that your
method is right. If in doubt, try it on a simple similar example, as a check.
Another extremely useful way to check your calculator’s results is to do a
quick, approximate mental calculation (Sec. 1.5). This is how I usually know
when someone gives me a wrong bill. Example: 21.55 ÷ (11.2 + 18) is
approximately 20 ÷ 30, which equals 2/3 or about .7. You’d know for sure that a
(mis) calculated result like 19.92 is dead wrong.
Using memory
Almost every hand calculator has a single-register memory, where you can store
a single number, which can later be recalled. As an example, consider a
calculation like (x + y) × (z + w), where x, y, etc. are certain numbers. There’s no
obvious way to key in this calculation on most cheap calculators. However,
memory comes to the rescue. The algorithm is:
calculate x + y and store in memory
clear the display (this does not clear memory)
find z + w
multiply (×)
recall memory
equals
Try it on (66 + 18) × (39 − 17) = 1, 848. (Here is the calculation as done on
my small TI calculator: First, clear anything in memory by pressing
twice. If the memory is not clear, an ‘M’ is displayed on the LED. Now key in
This stores 66 + 18, or 84, in memory. Next key in
giving the answer, 1,848. Here, the key
stands for ‘Memory Recall,’ which brings the memory value 84 onto the screen
for multiplication.)
You will need to read your owner’s instruction book (if you have one) to
familiarize yourself with using memory. Again, using memory is normally easier
with a more advanced calculator. In fact, if you’re serious about upgrading your
math skills, I recommend not spending too much effort mastering a dime-store
calculator. Instead, get a scientific calculator – but first read Sec. 5.5.
Problem 3.2 Using memory, calculate (a) (70.3 − 47.6) ÷ (18.7+14.6). (b) (2.5 −
16.3) × (16.1 − 5.5). Do quick mental approximations to check.
Calculator accuracy
Try this calculation: ‘1 ÷ 7 = ×7 = ’. You may not get the exact answer, 1.
Calculators can only keep track of a finite number of decimal places, and a
number like 1/7 is a repeating, infinite decimal. My own dime-store calculator
gives (1÷7)×7= 0.9999997, but my HP Scientific calculator gives (1 ÷ 7) × 7=
1.000000000. Nevertheless, the HP calculator also sometimes does given slightly
inaccurate results – it’s inevitable. Anyway, be glad that you’re smarter than your
calculator, in some ways. You know that (1/7) × 7 = 1, but a calculator doesn’t
necessarily know this.
Problem 3.4 Your restaurant check is $62.50. Exactly how much is a 15% tip?
What do you get by quick approximation?
Problem 3.5 You purchased 100 shares of Axiom Corporation. In the first
month the price of Axiom shares went up by 50% of your purchase price. In the
second month the price fell by 50% from it’s one-month level. Did you gain or
lose, overall? How much?
3.3 Areas
The basic formula related to area is the rectangle formula:
where A denotes the area of a rectangle which has length l and width w.
(A rectangle is a four-sided figure, with each corner being a right angle, i.e., a 90
deg angle. Look up ‘right angle’ in Chapter 5 if you are not familiar with this
concept.) Of course, the length and the width are measured using the same units
(the same ruler, in other words). For example, if you actually measure the above
rectangle with a metric ruler, you will see that it has length l = 3 cm andwidth w
= 2 cm. The area is A = 6 square centimeters, which we usually write as A = 6 cm2
[cm2 should be read as ‘centimeters squared,’ or else as ‘square centimeters’].
The next figure shows why we know that A = 6 cm2 in the above example.
The 3 × 2 rectangle can be ‘cut up’ into exactly 3 × 2, or 6 one-by-one squares,
each having an area of 1 square centimeter.
In other words, the problem of finding the area of a rectangle is just an example
of the relation between multiplication and counting, as discussed in Chapter 1.
This argument works perfectly well for rectangles of any size l × w, at least in
the case that l and w are whole numbers. However, the basic formula A = lw is
valid for all lengths l and widths w, not just whole numbers.
Problem 3.6 Make a sketch to illustrate the fact that the area of a square with
sides cm is A = 1/4cm2. Suggestion: start with a ‘unit’ square, having sides 1 cm.
The example in Problem 3.6 can be used to argue that the area formula for
rectangles, Eq. 3.1, is valid for any values of l and w. For example, if a wall is 8
feet high and 14 feet 3 inches long, its area is square feet.
Area of a triangle
We’ll start by having you do an experiment.
Problem 3.7 Using a ruler and pencil, draw a rectangle on paper (draw the
corner angles ‘by eye,’ or by using the end of your ruler). Also draw a diagonal, as
shown.
Now use scissors to cut out the rectangle. Also, cut it along the diagonal.
What can you say about the two triangles you get? If you were careful, the two
triangles are identical.
Now consider a right-angled triangle, as in the experiment.
What is its area A? Just ½ the area of the original rectangle, right? Let us label
the sides of the triangle which form the right angle, as b (for base) and h (for
height). Then the area of the rectangle is bh, so that the formula for the area of a
right-angled triangle is
Equation 3.2 in fact holds for any triangle, not just right-angled triangles. The
‘height’ h means the perpendicular height above the base:
Problem 3.8 Using your ruler, measure the base b and height h for the triangle
shown above. Find the area A.
How can we be sure that Eq. 3.2 holds for an arbitrary triangle? Let us
imagine slicing the given triangle into very thin slices parallel to the base. Next,
slide all the slices to the right so that they butt up against a perpendicular line, as
shown. The new triangle is a right-angled triangle, which has the same base, the
same height, and the same area, as the original triangle. (Pause to make sure you
agree with this.) Since for the right-angled triangle, it must also be true
that for the original triangle.
This idea works also when the original triangle looks like this:
Complete the sketch for this case. How does one specify h, the height of the
original triangle? (Answer: one first extends the base line b to the right.)
Problem 3.9 Another way to show that for any given triangle is to cut
the triangle into pieces, somehow. Can you elaborate?
Problem 3.10 What is the formula for the area of a parallelogram (a four-sided
figure having its opposite sides parallel to each other)? How do you know?
The answer is that the second object weighs 23 = 8 times as much as the first
object. (Recall that 23, which reads as ‘two cubed,’ means 2 × 2 × 2, which is 8.) If
the scale factor is k, rather than 2, then the second object weighs k3 times as
much as the first one. By ‘scale factor’ we mean the ratio of the dimensions of the
two objects.
If you own two dogs, you can check the scale effect:
What would you predict Fido’s weight to be? Answer: 33 lb ×(30/18)3 = 153 lb.
The actual weight could differ slightly from this, of course.
The general rule for the geometric effects of scaling is shown in the next
diagram.
In words, under scale change (with scale factor k), length scales as k, area as
k , and volume as k3. (In these pictures, the new object is a scaled-up version of
2
the old object. This means that k> 1. However, the formulas are correct also for
scaling-down, in which case k< 1.)
Problem 3.11 An elephant is about 4 m tall, and a mouse about 4 cm. If the
mouse weighs 3 g, about how much should the elephant weigh? (See Sec. 3.7 for a
discussion of weights and measures.)
Now for the explanation that volumes increase as k3. First consider a cubic
volume, with side s units. Its volume is s3 cubic units (and this is why we read s3
as ‘s cubed’). For example, a cubic box with side 5 inches has volume 53 = 125
cubic inches. Now consider a second cube, with side ks, where k is the scale
factor. It volume is (ks)3 = k3s3 (see Chapter 4). Thus the ratio of the two volumes
is k3s3/s3 = k3.
When we scale up the smaller object by the factor k, all the tiny cubes [or
squares, in 2 dimensions] get scaled up also by k, so their volumes go up by k3 [k2
in two dimensions]. Therefore the whole volume increases by k3 [the area
increases by k2, for a two-dimensional object].
Example. If a small child falls over and hits it’s head on the ground, he may cry,
but the injury is unlikely to be serious. It can be a different matter if you fall over
and hit your head. The reason is that the energies involved in the impact scale up
by k5,where k is the scale factor (relative heights) of the two persons. For
example, a 6-foot man’s head will hit the ground with 25 = 32 times the energy of
a 3-foot boy, or 35 = 243 times as much as a 2-foot toddler.
Similarly, A = k2A0. Also, if R denotes the area covered by roads, then in case (b)
we have:
The proportion of the city’s total area A covered by roads equals
For example, suppose that originally, roads covered 20% of city’s area, i.e., R0/A0
= 20% = .2. Here is what happens as the city grows:
3.5 Proportionality
Proportionality is a basic concept frequently encountered in mathematics and
science. Some everyday examples are:
1. The amount of gas used on a driving trip is proportional to the distance
traveled.
2. The cost of a bunch of bananas is proportional to their weight.
3. The time needed to mow a lawn is proportional to its area.
for all values of U and V . The constant c remains the same no matter what
values U and V have. In a practical situation, c usually has a meaningful
interpretation. The units for the constant c are given by
This is required to make the units on both sides of the equation U = cV match.
The symbol ∝ is used to denote proportionality:
Thus U ∝ V means that there is some constant of proportionality c such that
U = cV.
For example, consider the cost of bananas. Let Y be the cost of a bunch of
bananas, and let W be weight of the bunch. Then Y is proportional to W , so that
In this case the constant of proportionality c is the price of bananas. For example,
if bananas cost 45¢ per lb, then c = 45¢/lb. To calculate the cost of 5 lb of
bananas, we have Y = 45¢/lb × 5 lb = 225¢, or $2.25.
Every proportionality problem is basically the same as this example.
As a second example, consider the question of gas consumption for a certain
automobile. Then D ∝ G, where D is distance driven and G is gas consumption.
Thus D = mG for some constant m. One could write instead that G ∝ D and G =
cD, but I want the constant of proportionality to have its usual interpretation,
mileage. In American units, D is measured in miles, and G in gallons. Therefore
the units of m are miles/gallons, or miles per gallon.
To determine the mileage for your car, first fill the tank and set the trip meter
to zero. The next time you fill the tank, record the amount of gas you bought, G,
and the distance on the trip meter, D. For example, suppose G = 8.3 gal and D =
225 mi. Then from D = mG we have m = D/G = 225 mi/8.3 gal = 27.1 mi/gal.
Repeating the experiment will probably give a slightly different value for m,
depending on driving conditions.
Problem 3.12 Continuing with the above example, suppose that freeway driving
is 50% more efficient than city driving, in terms of mileage. The figure 27.1
mi/gal applies to city driving. If gas costs $2.35 per gallon, what is the gas cost for
a trip of 1500 mi, using the freeway system?
(Notice especially how the act of writing down the units, mi and gal, helps
ensure the right calculation. More about this topic in Section 3.7).
(In metric countries, the gas-consumption figure is usually given in terms of
liters per 100 kilometers. For example, a consumption rate of 8 liters per 100 km
would be the same as 100 km/8 liters, or 12.5 km/liter. If you find this confusing,
you’re not alone.)
Problem 3.13 You are waiting in the doctor’s office, along with 5 other patients.
You notice that it takes 12 minutes for the first two patients to be called. Estimate
how much longer you will have to wait. Also, express this situation in terms of
proportionality.
Multiple proportionality
Here is a typical proportionality problem from school days. If 2 people take 4
hours to paint 30 feet of fence, how many people are needed to paint 150 feet of
fence in an 8-hour day? You can solve such problems as a kind of fill-in puzzle,
like this:
2 people take 4 hours to paint 30 feet
1 person takes 8 hours to paint 30 feet
5 people take 8 hours to paint 150 feet
Problem 3.14 The cost of painting an 8 ft × 12 ft wall with primer at $5.00 per
quart is $17.50. Find the cost of painting four such walls with paint costing $9.50
a quart.
Problem 3.15 Six cats together eat 35 cans of cat food per month, at $1.69 a can.
How much does it cost a menagerie to feed 100 cats for a year?
With a little practice, you can perhaps solve proportionality problems in your
head, without worrying about equations and constants of proportionality. In
science or math, however basic laws are often expressed as proportionality
equations. Familiarity with the mathematics of proportionality is therefore
important.
Other types of proportionality
Other types of proportionality than those discussed so far are common in
science. Two general examples are:
This is pretty slow walking – normal walking is about 3 mi/hr. Perhaps he was
slowed up at street crossings. The table lists several familiar examples, using both
American and metric systems of units. (The question of converting between the
American and metric systems is discussed in the next section.)
Situation Speed (American) Speed (metric)
Person walking 3 mi/hr 4.8 km/hr
Olympic marathon runner 13 mi/hr 21 km/hr
Car on a freeway 65 mi/hr 104 km/hr
Passenger jet 610 mi/hr 980 km/hr
Speed of sound 1089 ft/sec 332 m/sec
Speed of light 186,000 mi/sec 300,000 km/sec
The first thing to notice about the speeds shown in the table is that speed is
always specified in appropriate physical ‘units,’ such as mi/hr, km/hr, mi/sec, and
so on. The expression mi/hr is read as ‘miles per hour,’ and is sometimes
abbreviated as MPH. Speed limit signs in America often use this abbreviation
(other countries use KPH, kilometers per hour).
Passenger jets do not travel as fast as the speed of sound (unless they are
supersonic, like the Concorde). Military aircraft, however, are often capable of
exceeding the speed of sound. If you read that some new fighter jet flies at Mach
3, this means it flies at 3 times the speed of sound.
Notice how great the speed of light is, 186,000 miles per second. No space
craft will ever fly faster than this, because of the basic principle of physics which
says that nothing can move faster than the speed of light.
Let’s return to the basic equation, 3.4:
Problem 3.16 (a) Suppose you drive at 60 mi/hr for hours. How far have you
driven? (b) The sun is 93 million miles from the earth. How long does it take
light to travel from the sun to the earth?
In each part of Problem 3.16 you are given the values of two of the variables,
S, D, and T , and are asked to calculate the value of the third ‘unknown’ variable.
For example, in part (a) you’re told that speed S = 60 mi/hr and time T = 2.5
hours. How do you find the unknown distance D, using S = D/T ?
To find D we first solve the equation S = D/T for the unknown D:
Do you need to remember all three of these equations? No, just remember
the basic equation S = D/T and realize that the other two equations can be
deduced from this. By the way, if you have trouble memorizing S = D/T , think of
an example, such as 20 miles (D) in one hour (T ) equals 20 miles per hour (S =
D/T ). Also, you can remember that ‘speed = distance over time.’
Now, returning to Problem 3.16 (a): given S = 60 mi/hr and T = 2.5 hr, we
have
In this calculation, notice that the numbers are multipied (60 × 2.5 = 150), and
the units are combined The latter operation of combining
units is extremely useful and important. Most practical applications of
mathematics involve the proper combination of units of measurement.
Returning to Problem 3.16 (b): given D = 93×106 mi and S = 186×103 mi/sec
[speed of light], find T . First, S = D/T implies that T = D/S. Therefore the
solution to Problem 3.16 is
Problem 3.18 A light-year is the distance that light travels in one year. Use your
calculator (and scientific notation) to find the number of miles in one light year.
Average speed
Suppose you commute from home to work. For the first 10 km your speed is 100
km/hr, but then you get into heavy traffic for the next 10 km, where the speed is
only 40 km/hr. What is your average speed for the whole trip? It’s not 70 km/hr,
as one might first think.
To calculate average speed for a trip in which actual speed varies, we still use
the basic formula
where D is the total distance, and T the total time of the trip. For the stated
problem, D = 20 km. But what is T , the total time? To find T we have to consider
the two segments separately. Let’s write T1 and T2 for the two segments, and also
S1,S2 and D1,D2. Then, from Eq. 3.6,
Note that this is considerably slower than the naive (incorrect) guess of 70
km/hr. Slow traffic has a big effect in lowering your average trip speed, as most
commuters are aware.
Problem 3.19 You fly from Dallas to Houston, a trip of 244 miles, in a jet that
flies at 490 mi/hr. You take 30 min getting to the Dallas airport, 45 min waiting
before departure, and 30 min waiting for your luggage at Houston. Counting the
waits at both ends, what is your average speed for the trip? Total time? (This
might be useful knowledge if you want to compare flying with driving.)
For example, 1800 m ≈ 5400 + 540 = 5940 ft (exact value is 5910 ft).
A similar mental algorithm converts masses:
For example, an 8 kg bag of sugar weighs 16 +1.6= 17.6 lb(which is exact).
Temperature
The temperature scale used in most countries (other than the US) is Celsius. The
Celsius temperature C and its Fahrenheit F equivalent are related by the (exact)
formula
For example, the freezing point of water is 0 deg C, or by Eq. 3.8, 32 deg F .
Similarly, the boiling point of water is 100 deg C, or 212 deg F .
Problem 3.20 There is one temperature at which the Celsius and Fahrenheit
numbers are the same. What is it? (Use Eq. 3.8).
For everyday use, an approximate rule is that F equals 2×C plus 30 deg. This
pretty good for normal outside temperatures, though it goes a bit awry for hot
days:
Problem 3.21 (a) Solve Eq. 3.8 for C. (b) Use part (a) to calculate the Celsius
equivalent of normal body temperature, 98.6deg F .
Units and dimensions
You have no doubt heard the statement ‘you can’t add apples and oranges.’ But
what can you add? Well, first we can certainly add (and subtract) quantities
expressed in the same units:
We can also add quantities expressed in different units, provided that the units
can be transformed into one or the other:
because 1 cm = .01 m.
What we cannot add or subtract are quantities that have different
dimensions. For example, 5 yards plus 20 seconds is just nonsense. Some
examples of basic physical dimensions are:
Basic SI Units
Length (L) m (meter)
Mass (M) g (gram)
Time (T) s (second)
There are other basic dimensions, for temperature, electricity, and so on, but
we will not consider these here.
We can add or subtract two physical quantities that both represent length, or
mass, or time, but we cannot add a length to a mass, etc. If the physical quantities
are expressed in different units, we can add them after transforming to a single
unit.
Problem 3.22 Using the conversion table on page 135, find: (a) 5.3 tonnes plus
850 kg; (b) 3 hrs, 25 min, and 18 s (in terms of seconds); (c) 9.5 ft minus 2.7 lb.
Problem 3.23 A bandit is running towards the rear at 10 ft/s, on top of a rail car
travelling west at 40 mph. Find the bandit’s speed relative to the ground.
Mass versus weight
(You can skip this discussion unless this question interests you. But it does
illustrate the idea of physical units, or dimensions.) What is the distinction
between mass and weight, if any? I recently read in a school math text that, to
convert mass to weight you multiply by 9.8. This is utter nonsense – mass and
weight are completely different concepts, at least in Science. (In everyday life, we
often use ‘weight’ when the scientifically correct term would be ‘mass.’ If the
butcher tells you that the meat weighs 450 grams, he should say that its mass is
450 g. Its weight is actually 4.4 Newtons. Tell that to the butcher!)
The mass of an object is the same no matter where it is located. But the
weight depends on circumstances. Astronauts in an orbiting spacecraft are
weightless, but not massless. The weight of a moon rock located on the moon is
not the same as its weight on earth, but its mass is the same.
Weight is actually a force. Your weight as you sit in a chair is equal to the
force that the chair exerts on you to counteract the pull of gravity. (This is the
same force that your body exerts on the chair.)
The great 16th century mathematician Isaac Newton discovered the basic law
of force:
where F is the force, m is mass, and a is acceleration. This equation is one of the
keystones of Science, every bit as important (and famous) as Einstein’s E = mc2.
(The meaning of ‘acceleration’ is discussed below.)
Let’s not discuss the full significance of Eq. 3.9 here. For now let’s just use
this equation to explain weight W , for a mass m at rest on earth. Here we use a =
g, the acceleration of gravity on earth. The value of g is 9.8 m/s2 (see below). By
Eq. 3.9 we have, since weight is a force
where a is the acceleration of the elevator. Likewise, when the elevator slows
down to stop at a floor, your weight goes down, because a becomes negative. Try
the next problem.
Problem 3.24 Here’s how you could measure the acceleration of an elevator (if
you wanted to do that). Take some bathroom scales, and stand on them in the
elevator. When the elevator starts up, your weight increases briefly, according to
Eq. 3.11. For example, suppose the scales first register your ‘weight’ as 80 kg. (a)
This is not correct scientifically – what does it actually mean? (b) Now suppose
that as the elevator starts up, the scale briefly indicates 100 kg. Calculate the
acceleration a of the elevator.
Acceleration
What exactly is acceleration? You probably realize that acceleration has
something to do with a change of speed. Specifically
where ΔS (read as ‘delta S’) is the change in speed, taking place in time ΔT . For
example, it is said that the Mercedes-Benz CLK55 can go from 0 to 100 km/hr in
5 seconds. What is the acceleration?
First, let’s transform 100 km/hr to meters/second. Since 1 km = 1,000 m and
1 hr = 3,600 s, we get
You pay a lot of money, and you don’t even get as much acceleration as gravity,
which is free!
Notice that, with speed measured in meters per second, acceleration (the rate
of change of speed) must have dimensions (meters per second) per second, or
m/s2. The phrase ‘meters per second per second’ is often used, because it reminds
one that acceleration is the rate of change of speed.
This discussion doesn’t quite explain why the number g used in calculating
weights has anything to do with acceleration. The connection is that, if a body is
allowed to fall freely under gravity, it will in fact accelerate at g = 9.8 m/s2 (on
earth). For example, if the body starts from rest, then after one second it will be
falling at 9.8 m/s, and after two seconds at 19.6 m/s, and so on.
2. Write down the formula for the area of a triangle. How is this formula
derived?
3. How would you find the area of a quadrilateral (4-sided figure)?
4. Suppose that a polygon (n-sided figure) is scaled up, using scale factor k. What
happens to its area? Its perimeter? (The perimeter is the length of the
boundary of the polygon.)
5. A golf ball has diameter 1.7 inches, and a baseball 3.0 inches. Find the relative
masses and surface areas of a golf ball and a baseball.
6. The earth rotates around the sun once a year. How fast is the earth moving in
space as a result of this motion? (The length of a circuit around the sun is
about 584 million miles, or 940 million kilometers.)
(b) The gravitational acceleration for a given planet is proportional to its mass
(Newton’s law of gravitation). If Neil Armstrong weighs 98.8% less on the
moon than on earth, what is the ratio of the moon’s mass to the earth’s?
9. The human population of the world was about 7 billion in 2010. The rate of
increase was 1.5% per year. (a) How many extra people will be added to the
world’s population per year? (b) What will be the world population in 2020?
[Don’t consider exponential growth yet – see the next Chapter.]
10. The energy E of a moving object is proportional to its mass m and to the
square of its velocity v. (a) Express the relationship in symbols. (b) The
constant of proportionality is in fact (no units). What units is energy
measured in, using the SI system?
Solution 3.1 (a) 421.2. (b) 1.14 approx. (With a basic calculator, you may have to
do this in steps. First find 21 − 7, which is 14. Remember, or write down 14. Now
calculate 16 ÷ 14. A little later you will learn how to use your calculator’s
memory for problems like this.)
Solution 3.5 If the starting price was p,the one-month price was 1.5p (that is, p +
50% of p), which next became .5 × (1.5p)= .75p. You lost 25%. (A mathematically
naive person might think that if the stock market goes up 5% on Monday, and
down 5% on Tuesday, it balances out. But, in fact this would mean that the
market lost 1/4% over the two days – agree?)
Solution 3.6
Solution 3.7 If you were careful, the two triangles should be exactly the same –
one overlaps the other perfectly. (Such triangles are called ‘congruent’ triangles –
see Chapter 5.)
Solution 3.10 The formula is A = bh, where b is the base and h the height. One
way to prove this is to cut the parallelogram along a diagonal, obtaining two
congruent triangles with base b,height h. Another way is to slice it horizontally,
and slide the slices to form a rectangle.
The cost of gas for the trip is 36.9 gal × $2.35/gal = $86.72.
Solution 3.13 At 6 min per patient, and with 3 patients ahead of you, you can
expect to wait another 18 min. For a proportionality representation, let N be the
number of patients called in T minutes. Then you assume that T = cN . Since T =
12 min when N = 2 patients, we have c = 6 min per patient.
Solution 3.14 Let C = cost of painting area A using paint costing $P per quart.
Then C = kAP (twice the area means twice the cost, as does twice the price). Now
C = $17.50 if A = 96 ft2 and P = $5.00. This gives k = C/AP = $.0036 per square
foot [store this value in memory in your calculator]. With A = 4 × 96 ft2 and P =
$9.50 we get C = kAP = $133.00. Quick check: paint is nearly twice as expensive,
and the area 4 times as large, so the cost should be about 8 × $17.50 = $140.00.
Close.
Solution 3.18 Use D = ST = (186,000 mi/s) ×1 year, and convert 1 year into
seconds:
In words, light travels 5.9 trillion miles per year. Again, notice how the various
units ‘cancel.’
Solution 3.19 The time spent actually flying is Tf = D/S = 244 mi/490 mi/hr =
.498 hr, call it The extra times add up to 105 min, or 1 3/4 hr, so your total
time for the trip is 2 1/4 hr. Therefore the average speed, including the waits, is S
= D/T = 244 mi /2.25 hr = 108.4mi/hr.
Solution 3.20 −40 deg C or F . To get this, set C = F in Eq. 3.8 and solve:
gives F = −40 deg. (Read Section 4.5 if you need help in solving
equations.)
Solution 3.21
(Note again how the units cancel, just is if they were mathematical symbols.)
Therefore the bandit’s net speed is 40 − 6.9 = 33.1 mph, in the westward
direction. (Sorry for the silly ‘real-world’ problem, but the idea was to illustrate
two important concepts, conversion of units, and addition of quantities having
the same units.)
Solution 3.24 (a) The scale is really indicating your mass, 80 kg. Your weight (at
rest) is 80 kg ×9.8 m/s2 = 784 N; (b) Your weight during the acceleration is 100
kg ×9.8 m/s2 = 980 N. According to Eq. 3.11, W = m(g + a). Therefore
Hence a = 12.25 − 9.8= 2.45 m/s2. This is the acceleration of the elevator.
Chapter 4
Algebra
There is admittedly something of a problem in teaching algebra – why should
anyone learn algebra, anyway? Basic algebra doesn’t have many immediate
everyday uses. It all seems quite abstract and meaningless to many students.
Moreover, algebra is often quite finicky. But being adept at algebraic calculations
is essential in later subjects, such as trigonometry and calculus.
Fortunately, algebra involves little more than the use of the basic rules of
arithmetic discussed in Chapters 1 and 2. The main novelty is that we now work
with general expressions, such as ax2 − b, rather than with numbers only. The
letters in an algebraic expression represent numbers, in the sense that the letters
can be replaced by numbers.
Here is an example, which we will discuss in greater detail later:
4.1 Exponents
In Chapter 1 we considered the powers of ten, 10n = 10 × 10 × ···× 10 (n times),
which were so important in the positional notation for expressing numbers in
base 10. Similarly, for any number x we define xn
The expression xn is read as (‘x to the nth’, or ‘x to the power n’), except for n
= 2 or 3; x2 is read as ‘x squared’ and x3 as ‘x cubed.’ In Eq. 4.2 x represents any
number. Thus, for example 34 = 3×3×3×3(which equals 81). Similarly, (−2.7)2 =
(−2.7) × (−2.7) = 7.29. Also, in Eq. 4.2, n represents any whole number, n = 1, 2,
3, etc. (Later we will explain what xn means for other values of n.)
In the expression xn the number n is called the exponent of x. Thus, in x5, the
exponent of x is 5. (The number x is sometimes called the base of the expression
xn, but we won’t use this word here). Similarly, in a8 the exponent of a is 8.
Problem 4.2 If you have a scientific calculator, the yx key can be used for
calculating exponentials. Try it on 3n for n = 2, 3 and 4, and check the results by
hand calculation. Now calculate 3n for n = 10, 20, 30. What do you conclude?
Also calculate (.7)n for n = 10, 20, 30.
Rules of exponents
Errors in applying these rules are often made by math students. As usual, a
little understanding can prevent such errors. You will never forget the rules if
you understand why they are true from the outset. Please read the explanations
carefully.
It’s worth emphasizing again that much of mathematics depends on basic
definitions of terminology and notation. Thus, rules 4.3 – 4.5 are direct
consequences of definition (4.2), as you will see.
To explain rule (4.3), we refer to the basic definition in Eq. 4.2. Thus xm
equals xx ··· x [m factors of x], while xn equals xx ··· x [n factors of x]. Thus xmxn =
(xx ··· x)(xx ··· x) where the first bracket has m factors of x and the second has n
factors of x. Altogether there are m + n factors of x, which means that xmxn =
xm+n.
An example may make this clearer:
The second rule, (4.4), is also easily explained. First xm = xx ··· x [m factors of x],
so that (xm)n = (xx ··· x)(xx ···x) ··· (xx ··· x), with n groups of (xx ···x). Counting
up all the x’s, there are mn of them, which shows that (xm)n = xmn. An example:
Problem 4.3 Simplify (a) x2x6; (b) (x2)6.
Rule 3 also follows directly from the definition, Eq. 4.2. We have
Repeated exponents
What does mean? Try to simplify it. The answer is x8, not x6 as one might
expect. This is a consequence of the following convention:
Negative exponents
Let us look again at the basic definition, Eq. 4.2:
Here x can be any real number, but n has to be a positive integer, n = 1, 2, 3, etc.
Otherwise ‘n factors of x‘ is meaningless. However, we can expand on this
definition so as to make sense of xn for all integers n, whether positive, negative,
or zero. The main point is to organize things so that the laws of exponents, Eqs.
4.3–4.5 always remain valid. Keeping math simple and consistent has always
been a goal of mathematicians.
First, what does x0 mean? We can’t say that x0 = xx ··· x with zero factors of x.
This simply doesn’t make sense. But if Eq. 4.3 has to remain true for n = 0 we
must have
This definition may seem a bit strange at first, but it is necessary for
mathematical consistency. (Don’t worry about the side condition x ≠ 0, i.e., x not
equal to zero. I will explain this later.)
Problem 4.4 Write out the values of 4n for n = 3, 2, 1, and 0. Do the same for 10n.
Next, we can use a similar argument to find the meaning of x−n. First,
remember that we want Eq. 4.3 to be true for both positive and negative integers
(and zero). Let n = −m in Eq. 4.3:
The same applies to the general case, the n factors of x in the denominator
cancel n factors in the numerator, leaving xm−n (assuming that m> n). Equation
4.8 is also true if m < n, as the next example indicates:
Problem 4.5 Express using a single exponent: (a) z8z−4;(b) t2/t6;(c)(w3)2÷ w3w2.
Problem 4.6 (Optional) Prove that (xm)−n = x−mn. Use Eq. 4.7 and Eq. 4.4 for
positive values of m and n.
Similarly, it can be shown that the third rule of exponents, Eq. 4.5, is also true
for a negative exponent:
You should be able to prove this for yourself – try it for an example, such as (xy)
−5
= x−5y−5. Obtain this by using the equations already proved. Thus, first
The first of these equations would suggest that 00 = 0, whereas the second
equation suggests that 00 = 1. Which is it? To avoid any inconsistency, we must
decide that:
(As a check, try 00 on your calculator – it will probably give you an error
message.)
To remind yourself of the basic ideas behind the rules of exponents, I suggest
you do the next problem as a brief review.
Problem 4.7 (a) What is the definition of xn when n is a positive integer? (b)
How does this definition imply the formula xmxn = xm+n? (c) Why is (xm)n = xmn?
(d) Explain why we define x0 as 1. (e) Explain why
Finally, to check your grasp of exponents, try the next problem. Work
carefully.
Problem 4.8 Simplify these expressions: (a) (b−2)−1; (b) (x3)−1x5; (c) (x3y−2)4; (d)
(a2/b)3.
Year, t Value of bond at beginning of the year, Vt Interest for the year, It
0 $1,000.00
$70.00
1 1,070.00
74.90
2 1,144.90
80.14
3 1,225.04
85.75
4 1,310.80
91.76
5 1,402.55
98.18
6 1,500.73
105.05
7 1,605.78
112.40
8 1,718.19
120.27
9 1,838.46
128.69
10 1,967.15
Here, the symbol Vt is read as ‘V sub t,’ or just ‘Vt;’ it is interpreted exactly as
stated in the table. Thus the value of the bond at maturity is $1,967.15, or an
increase of 96.7% over the initial $1,000 investment.
To explain how this table was calculated, note that the interest It in any year t
equals 7% of the bond’s value Vt at the beginning of the year:
This annual interest is added to the current value of the bond, to determine the
next year’s value Vt+1:
This calculation is repeated for each year, resulting in the above table of values.
(Computer spreadsheets are a quick way to do such calculations, but they can
also be done using a hand calculator.)
But do we have to go through the whole calculation, if we only want to know
the bond’s final value V10 after 10 years? The answer is no. To explain this, we
first combine Equations 4.10 and 4.11:
(be sure you see how I got the last line). Next we use this equation, Vt+1 = (1.07)Vt
year by year, by putting t = 0, then t = 1, etc.
and so on.
Try to write down the formula for the general case. It’s Vt = (what?). Can you
see that it is
Problem 4.9 Using Eq. 4.14 and your calculator, find the value of the $1,000.00
bond discussed earlier, at the end of 20 years, and 30 years. Compare these values
with the result of simple interest. (Simple interest is not compounded, and does
not accumulate to earn interest on itself.)
The value V0 is called the present value of the $10,000 ten-year bond. Thus V0
is what you would pay for the bond today (or what you could sell it for today),
given that the interest rate is i. The following table shows V0 for a range of values
of i.
Notice that the present value of the bond, V0, decreases as the interest rate i is
increased. The decrease is not minor. Relatively small changes in the interest rate
cause quite substantial changes in bond prices.
Problem 4.10 Suppose the current Federal Bank interest rate is 6% per annum.
The Bank announces an increase of 50 basis points (100 basis points is 1%). By
what percent will bond prices decrease, for 10-year bonds?
(This simple model overlooks several important aspects of bond prices. For
example, investors may believe that interest rates may change in the future. If so,
the market price will reflect investors’ expectations of future interest rates.)
To conclude this brief introduction to the mathematics of finance let us just
record the general case of Eq. 4.16:
This equation, which follows by solving Eq. 4.14 for V0, expresses the present
value V0 of a future payment Vt which is due in t years time, given that the
annual interest rate is i. This equation is fundamental in Economics and
Business. It is sometimes called the discounting formula, the idea being that the
future value Vt is discounted to produce the present value V0.
Information provided by bond dealers usually specifies the yield for each
bond. The yield is the interest rate i such that Eq. 4.17 holds for the given price
V0 (and the given date t and value Vt at maturity). Any annual dividends are also
included in the calculation of yield. Thus the yield is just the rate of interest that
the bond purchaser will realize.
Problem 4.11 Fir trees are harvested 90 years after planting. An acre of fir trees is
expected to be worth $100,000 in 90 years time. How much would a logging
company evaluate an acre of newly planted fir trees, if the interest rate is 5% per
annum? 10%? (What additional assumptions did you make?)
Population growth
Exponential growth also occurs in biological populations. Suppose, for example,
that a scientist is growing a bacteria culture. She places an initial sample of P0 (P
for ‘Population’) bacteria in a Petri dish with plenty of bacteria food. Every 20
minutes each bacterium divides into two bacteria. How many bacteria will there
be after one hour? One day? One week?
Let Pn represent the number of bacteria after n generations. Then
Problem 4.12 The human population on Earth in 2010 was about 7 billion. The
current rate of increase is about 1.5% per annum. Calculate the population 2000
years in the future, assuming this growth rate is maintained. Is the calculated
population possible?
The general equation of exponential growth is
The larger the growth parameter a, the more rapidly the quantity Xt grows, as
shown in the figure.
In some applications, Eq. 4.19 has a < 1. In this case, Xt decreases over time,
approaching zero – see figure. This process is called exponential decay. Physical
examples include the temperature of a hot object while cooling down, and the
decay of a radioactive sample.
Your problem: how much of the original C14 sample would be left after 10,000
years? 20,000? 100,000?
I hope this section has shown you that quite elementary mathematics can be
extremely useful. The models of exponential growth and decay are used
throughout science and technology. The exponential function (discussed in
Chapter 9), which is closely related to exponential growth, is also widely used.
Remember, any factor that is common to the numerator (top part) and the
denominator (bottom part) of a fraction can be cancelled. In Eq. 4.20, the symbol
a is such a common factor. (Of course, a ≠ 0 is assumed.)
Two examples:
Think carefully about the second example. In fact, you can’t cancel anything here
– why not? You might imagine you could cancel 3, but no – 3 is not a factor of 3
+t. (Look up ‘factor’ in the index if this is confusing.) Improper cancellation is a
hallmark of weak math students, and is a leading cause of failing grades in
college math courses.
One more example. Simplify
This works fine – if you’re careful. But it’s easy to make mistakes this way, and
it’s hard to check, too. I recommend the first method – multiply, then cancel.
You can also combine fractions having unequal denominators, but first you must
rewrite each fraction so that they do have common denominators. For example
Notice that there is a quick way to check the answer in Eq. 4.23. Namely, just
split the result into two parts, and use cancellation:
which equals the original expression. This check is always worth doing (I
invariably do it in my own research), especially since you can do it quickly in
your head. Try it again for yourself in Eq. 4.23. Also, observe that the check
basically just reverses the procedure used to obtain the result in the first place.
Carefully checking algebraic calculations is always worthwhile. Anyone can make
a slip while doing algebra, and immediate checking usually catches any mistake.
Train yourself to do this regularly. (I will give you some helpful hints on good
techniques for checking algebra, as we proceed.)
It is customary to use alphabetic order (when convenient), in expressions
such as 3x2y. However, it would be correct to write 3yx2 instead. What about
writing y3x2? While still correct, this would be quite unorthodox. Most readers
would suspect a misprint.
Problem 4.15 Combine the following fractions, and check your results. (a)
This answer is correct, but it can be reduced to Similarly, the method of Eq.
4.22 can be used in the example
This calculation is simpler and more direct than the previous one.
These formulas were discussed, for numbers, in Chapter 2. (Remember that Eq.
4.25 is often thought of as: ‘to divide by a fraction, invert it and multiply.’) These
formulas are also correct if A, B are any algebraic expressions. Thus, for example
where a is some real number, called the coefficient of xn, and where n is a non-
negative integer (n = 0, 1, 2, etc.), called the degree of the term (or the power of x
in that term). Thus the above polynomial has 4 terms, 2x3, −7x2, x and −12. The
term −12, which doesn’t involve x, is called the constant term in this polynomial.
(Remember that x0 = 1, so that −12 could be written as −12x0. We never really do
this, but it shows us that constant terms are bona fide terms of the form axn, with
n = 0.)
The degree of a polynomial is the highest power of x (or whatever the
variable is) in the polynomial. The above example 2x3−7x2+x−12 therefore has
degree 3. Polynomials of degree 3 are usually called cubic polynomials – because
x3 is called ‘x cubed.’ However a second degree polynomial is never called a
‘square’ polynomial, but instead a quadratic polynomial. For example, 6x2 +4x +
9 is a quadratic polynomial.
A polynomial of degree 1, such as 3x − 2, is called a linear polynomial. The
reason for this term is that the graph of a linear polynomial is a straight line; see
Chapter 6.
To familiarize yourself with these terms, try the next two problems.
Problem 4.19 One of the following three expressions is a polynomial, the others
not. Explain.
Problem 4.20 Consider the polynomial (a) What is its degree? (b)
List the terms, and the coefficient of each term. What is the constant term?
when x = 4? Try this – you should obtain 49. If so, you performed the
calculations like this:
But did you notice that there is a tacit precedence rule involved here? The rule is
Thus 3x2 + 1 actually means (3(x2)) + 1. First we find x2, then multiply by 3,
then add 1.
Without the precedence rule, such an expression would be ambiguous. For
example, 3x2 + 1 might be incorrectly interpreted as (3x)2 + 1. Though few people
actually make this mistake, it is important to realize that there is an accepted
precedence rule behind the correct interpretation.
Addition of polynomials
Again, an example will indicate how two polynomials (in the same variable) can
be added:
This procedure is called ‘collecting like terms’ – here we first collect the x3-
terms, then the x2-terms, and so on.
How can you check your work in such a calculation? The method I use is just
to re-do the problem, but in my head. Consider the above example,
By inspection you can first pick out the highest order terms, x3 +2x3 = 3x3. Then
go to the next highest order terms, −2x2 +3x2 = x2. Next, 4x and finally 5. The
result, 3x3 + x2 +4x + 5 agrees with the original calculation.
With a little practice, you can always add polynomials using the mental
method. In fact, you may be less likely to make mistakes (for example, copying
errors) with the mental approach. But you should always check, by re-doing the
calculation. Try it on:
The answer is 4x4+2. [Remember that the factor 3 for the second polynomial
applies to each term: 3(x4 +2x2 − 1) = 3x4 +6x2 − 3.]
Multiplication of polynomials
Multiplication of polynomials can be done in several ways, including mentally.
There is lots of room for error, so one has to be careful. Unless you consider
yourself really adept at algebra, I suggest you read this section slowly and
carefully!
Here is an example, which we first do longhand:
Be sure you understand each line of the calculation; you might want to re-do
the problem yourself on a new sheet of paper, looking at the above lines to check
your attempt.
On the first line, for example, we use the Distributive Law (Sec. 1.3), which
implies that
for any expression A. Here A is (2x2 − x − 5). Then, on the second line, we use
the Distributive Law again: x2(2x2 − x − 5) = 2x4 − x3 − 5x2 and so on. (We also
use the law of exponents, xmxn = xm+n here.) Finally, we collect like terms.
Problem 4.22 Multiply out longhand. (a) (2x3 − 5x2 + x +1)(x − 2); (b) (x2 − x −
3)(4x2 + 1). If your answers don’t agree with the solution, re-do the calculations
more carefully.
The answer has 4 terms. Each is the product of one term from the first
bracket and one from the second, as indicated. You can just read off the answer
in this way, without any intermediate step. The same works no matter how many
terms there are. For example
Problem 4.23 Read off the answers directly: (a) (U + V )(X + Y + Z); (b) (A + B +
C)(D + E + F ).
The rule is, when multiplying two sums together, choose one term from each
sum and multiply, then add up every possible such product. If there are m terms
in the first sum, and n terms in the second, your final result will consist of mn
terms.
When multiplying two polynomials, there is the additional complication of
collecting like terms. If you do the mental multiplication in a certain order, you
can collect terms ‘on the fly,’ so to speak. Consider this example:
First, decide what powers of x can occur in the answer. Clearly x4 and lower
powers only. To get x4 you have to choose the x2-term from each bracket, getting
2x4:
Next, to get x3-terms, you have to use an x2 and x term, and you can do this in
two ways:
Can you see how many ways there are to get x2-terms? Three! Look for them,
getting
Pause to be sure you follow the last step. (Of course, in actuality you wouldn’t do
this in the above discursive fashion, which is only for explanation. You do it all at
once. But it’s a good idea not to combine the coefficients in the first go through,
however, since this can easily cause an error.)
Completing the calculation, we get
There are two checks you can make. First, there should be 3 × 3 = 9 terms,
before collecting coefficients. Count them: 9, right! At least we haven’t forgotten
a term. Next, check that both sides agree when x = 1. Put x = 1 in the original
problem. You get 2 × 2 = 4. Now put x = 1 in the answer. You get 2 + 7 − 9+7 − 3
= 4. Wow! If you don’t get the same number, your answer is wrong for sure. If
you do get the same number, your answer is very likely right (though not
absolutely certain).
Problem 4.24 Multiply, using both checks (a) (2x +1)(x2 − 3x − 2);(b) (3x3 − 5x −
3)(x4 +2x2 + x − 1).
Problem 4.25 Find the squares. (a) (x − 3)2;(b) (2w + z)2;(c) (x2 +3)2.
Division of polynomials
Division of polynomials closely resembles long division of integers. For example,
long division gives
where N stands for Numerator, D for Denominator, Q for Quotient, and R for
Remainder. In Eq. 4.27, the numerator is sometimes called the dividend, and the
denominator D is the divisor. We must have
Check these symbols, and condition (4.28), against the above example. Question:
what is the remainder R in the case that the denominator D divides exactly into
the numerator N ? (Answer: zero.)
Let me pause to explain the meaning of expressions like N (x), and so on.
First, N (x) isread as “N of x.” It represents some polynomial in x. In any
particular case, N (x) would be specified, for example N (x)= 2x2 +5. This still
contains a variable, x, which itself represents some real number. The notation N
(3) then denotes the value of N (x)for x = 3:
Here we have Q(x) = 2x − 8 and R(x) = 31. (How to obtain this will be explained
shortly.) As before, this can be checked by combining fractions on the right side:
Before I explain my method for dividing polynomials, note that our basic
defining formula, Eq. 4.29, can be rewritten (by multiplying both sides of the
equation by D(x)) as
The problem is, given the dividend N (x) and the divisor D(x), calculate the
quotient Q(x) and the remainder R(x).
The algorithm I recommend for dividing polynomials is called MWTFU –
the Method of Wishful Thinking and Fixing it Up. Here is the example, (2x2 − 1)
÷ (x + 4). First, make sure that both polynomials are written in order of
decreasing powers of x, as they are in this example. Now, start with the first term
in the numerator, 2x2. I ‘wish’ this was 2x(x +4) [so that I could divide it by x +4].
I write
Thus
Thus the quotient is (2x − 8) and the remainder is 31. Note that condition (4.30)
is met – the degree of the remainder (zero) is less than the degree of the
denominator (one). To check that the result is correct, we just multiply out on
the right side, which gives (2x − 8)(x +4)+31 = 2x2 − 1, as required.
Let’s try a slightly more complicated one, (x3 − 3x2 +4) ÷ (x2 − 9):
This is the required answer, using the form of Eq. 4.31. What is Q(x)? R(x)? And
how do we know to stop at the last line? Well, we stop at the last line because it’s
impossible to continue; we can’t hope to write 9x = (?)(x2 − 9) because degree
(9x) < 2. This is exactly the condition (4.30) for the remainder R(x). All this
should become clear when you try the next problem.
Again, we can check the answer by multiplying it out:
Factors of a polynomial
Suppose you divide a polynomial N (x) by another, D(x), as in the above
examples. What must happen for D(x) to ‘divide evenly’ into N (x)? The answer
is that the remainder R(x) must be zero. This is exactly the same as for whole
numbers. For example, 5 divides evenly into 15 because the remainder is zero: 15
= 3 × 5. Similarly, 5 doesn’t divide evenly into 17 because 17 = 3 × 5 + 2, with a
remainder 2.
Here’s an example: does x + 2 divide evenly into x2 + x − 2? Well, we have
for polynomials B(x) and C(x), these polynomials are called factors of A(x). They
are called proper factors of A(x) if their degrees are smaller than the degree of
A(x).
For the above example,
Problem 4.28 (a) Show that 2x + 1 is a (proper) factor of 2x2 +7x +3; (b) For
what value of c is x − 5 a factor of x2 − 2x − c?
Factoring a quadratic
Consider the multiplication (x − 4)(x +1) = ? The answer
will have three terms – an x2-term, an x-term, and a constant term. The x2-term
comes from multiplying the x-term from each factor; here you get x2. The x-term
comes from combining two products, which I picture as the ‘outer’ product x ×
1, and the ‘inner’ product −4 × x. These produce −3x for the x-term. Finally, the
constant term is −4 × 1= −4:
Problem 4.29 Multiply out, using the suggested method. (a) (x+2)(x+4); (b) (x −
3)(x − 4); (c) (x +1)(x − 6); (d) (2x − 3)(4x +1).
Next, can we reverse this procedure? Given the answer, e.g. x2 +6x +8, can we
find the factors? In other words, can we factor a given polynomial?
In general, factoring polynomials is a difficult problem. There is a general
algorithm (only recently discovered) that can be programmed into the computer,
and is now available in software packages for use by scientists and engineers.
Here we only look at quadratic polynomials – it can be useful to be able to factor
these polynomials by inspection. This topic will come up again in later sections
of this chapter.
Consider the example
because the term x2 can only come from x × x. The constants (?) must give 6
when multiplied. What possibilities are there? 6 and 1, or 3 and 2 – any others?
Yes, −6 and −1 or −3 and −2. We use trial-and-error. Do any of these choices
give 5x for the x-term?
Well
which is not right. However,
We’ve done it! (This is why you learned to multiply out in your head.)
The method is this: To factor x2 + Ax + B, when A and B are integers, first
find the possible factors of B. Try each possibility. Example:
which you check by multiplying out. Be sure you follow this example, then do
the next problem.
Problem 4.30 Factor: (a) x2 +7x + 12; (b) y2 − 7y + 12; (c) x2 − 8x + 12; (d) t2 −
11t − 12.
By now you get the idea, I hope. The overall pattern goes like this:
This is true for any real number a and any value of x. This extremely useful
little formula is worth memorizing (but you’ll always remember why it’s true –
the x-terms ax and −ax add to zero). Also, compare this with Eq. 4.1.
means to determine all values of x for which the equation is true. Thus x = 2
happens to be a solution of the above equation, because
Linear equations
A polynomial equation of degree 1, for example
You need to learn to solve linear equations quickly and accurately in your
head. Just go through the above steps, mentally.
Problem 4.35 Solve for x. (Try to solve mentally.) (a) 2x − 6 = 0; (b) −5x +2 = 0;
(c) 3x − 1= x +5.
The step of adding 5 to both sides, in the previous example, is sometimes
called transposing. For example, 4x − 9 = 0 becomes 4x = 9 upon transposing −9
to the right side. You change signs when transposing (because you are really
adding 9 to both sides of the equation).
Look again at Problem 4.35, and think about transposing – this should make
the mental calculation easier. In part (c) you do two transpositions: x from right
to left, and −1 from left to right. Make up some more examples for practice.
To summarize, a linear equation Ax + B = 0 (where A and B are given
numbers) always has the unique solution x = −B/A, provided that But,
better than memorizing this as a formula, just remember the transposition
method:
Linear inequalities
Many students who have no difficulty solving linear equations (Ax + B = 0) seem
to find linear inequalities (for example Ax + B < 0) confusing. Fortunately, the
same solution technique works for both cases, with one additional twist for
inequalities. Here’s a first example:
implies that we can add any number (positive or negative) to both sides of a
given inequality. The new inequality will be true if and only if the original
inequality is true. This is just the same as for equations. Also, we can think of
‘transposing,’ exactly as in the case of equations. Take your pick – add 3 to both
sides, or transpose −3 to the right side, changing sign.
In the same way, the law
implies that we can multiply both sides of an inequality by any positive number.
This is where solving inequalities differs from solving equations. It is extremely
important to understand and use this operation correctly.
Problem 4.36 What does happen if you multiply both sides of the inequality a <
b by a negative number c? (Try an example if that helps.)
Problem 4.37 Solve the inequalities (a) 3x − 8 < x − 2; (b) 7 ≤ 2x − 5. (The laws
for ≤ are the same as for <.) (c) 4 − 3x> −1.
Quadratic equations
Consider the quadratic equation, in general form
Here the coefficients a, b, and c are given numbers, and x is the ‘unknown.’ To
solve this equation means to find all values of x for which the equation is true.
If a = 0 in Eq. 4.33, the equation becomes bx + c = 0, which is a linear
equation. We just learned how to solve linear equations, so we can henceforth
assume that a ≠ 0. It then turns out that the quadratic equation (4.33) has two
solutions x, which can be calculated from the formula
where the symbol ± means that you use + for one solution and − for the other. If
you plan to study college-level math, you will need to memorize this formula,
which is called the quadratic formula. Later I will explain in detail how the
quadratic formula is derived. But first, let us become more familiar with the topic
of quadratic equations.
Here is an example:
Here we are using a very basic and useful rule (see Section 2.4):
Here A and B denote any real numbers (and therefore A and B could be any
algebraic expressions, because such expressions always represent numbers). This
rule, if you think about it, is hardly surprising. If you multiply two non-zero real
numbers A and B together, you either get a positive number AB (if A and B have
the same sign), or a negative number (if they have opposite signs). To get AB = 0
you must have at least one of the numbers A or B equal to zero.
The zero-product rule is used regularly in the solution of equations. For
example, how would you solve 4(x − 1) = 0? There’s a hard way and an easy way.
The easy way is to use the zero-product rule, which tells us that x − 1 = 0
(because, for sure 4 ≠ 0). Thus x = 1 is the solution. (What is the hard way? 4(x −
1) = 4x − 4 = 0; therefore 4x = 4; therefore x = 1.)
In attempting to solve a given polynomial equation P (x) = 0, suppose we are
first able to factor P (x) as
No obvious factoring works here. Using the quadratic formula, we obtain the
roots
If you wish, you can do the algebra to check that this is correct.
This connection between roots and factors is always true: if x1 and x2 are the
roots of the quadratic equation ax2 + bx + c = 0 then (x − x1) and (x − x2) are
factors of this polynomial, so that
Look again at the quadratic formula, Eq. 4.34. This formula contains a
square-root expansion What happens if b2 − 4ac < 0?
For example, consider the example x2 − 2x + 2 = 0. Using the quadratic
formula, we obtain the solution
A simple example that will remind you of this possibility is the equation
This equation obviously has no real solutions, because x2 ≥ 0 for any real number
x. Therefore x2 +1 ≥ 1, so that x2 + 1 = 0 is not possible for any x. What is b2 −
4ac for this example? Answer −4, so the situation described in Eq. 4.35 prevails.
In Volume 2 we will show that, by allowing for ‘complex numbers,’ we do
obtain solutions to the quadratic equation in the case that b2 − 4ac < 0. This
procedure of extending the real number system to the complex number system is
in the same spirit as extending the natural number system to the system of
integers, which allows one to subtract numbers with no restriction.
Problem 4.40 (a) For what values of c does the equation x2 +3x + c = 0 have real
solutions? (b) Show that if a and c are of opposite sign, then ax2 + bx + c does
have real solutions.
Problem 4.41 Calculate the discriminant and determine the number of real
solutions, but don’t solve: (a) x2 − 9x − 1; (b) 1.1x2 +0.8x +2.7; (c) x2 − 10x + 25.
We have discussed examples of each of these cases, but you may wish to
check the further examples
Determine the roots of these equations, and also factor the given
polynomials, where possible. Note that the equations have 2, 1, and 0 roots,
respectively.
In the second case, where the factored form of the polynomial is a(x − x1)2,
we sometimes say that x1 is a double root of the given equation. Whenever a
quadratic equation has only one root, this is a double root. We will discuss this
situation in more detail later.
Note that the first two terms are the same as the first two terms in (x+2)2 = x2
+4x+4. So let’s ‘fix up’ the original quadratic to look more like (x+2)2:
Our given equation now becomes
or
Therefore
or
(Please check that you would get the same result by using the quadratic formula.)
The above method is called ‘completing the square.’ One starts with
(for simplicity we temporarily assume that a = 1). Then one recalls that
Therefore
and this ‘completes the square’ and allows us to solve the quadratic equation x2 +
bx + c.
Try another numerical example.
You should read over this derivation carefully, and make sure that you
understand each step. Then close the book and try to write out the derivation
yourself. Could you do it stranded on a desert island? It is admittedly a little
complicated, but any mathematics student should be able to do it. As a review,
solve the equation 2x2 −x−5 = 0 (a) by completing the square, and (b) by using
the quadratic formula. (By the way, you may wonder why I wrote in
this calculation. If a < 0 we have but the ± sign allows for this
possibility.)
Equations that can be written as quadratic equations
Consider the equation
We have not discussed such equations in general, but perhaps you can see how to
solve this particular example. Think about it for a minute.
Did you realize that the polynomial can be factored?
Next question: how do we solve this equation? Recall the zero product rule. We
must have
We can find the solutions y of this equation, either by factoring, or from the
quadratic formula. Then gives the solutions to the original equation.
Some, or all of these solutions may not be real numbers, however. For example,
what are the solutions of
Problem 4.43 Solve (a) 2/x − 1/(x +5) = 1; (b) (x +3)/(x +2) = x/2; (c) x4 − x2 − 12
= 0.
Extraneous solutions
In solving a given equation, we typically carry out various transformations of the
equation. We may re-arrange and collect terms, multiply both sides by a certain
constant, square both sides, etc. The logic behind the method is that any solution
of the original equation will also be a solution of the transformed equation. By
solving the transformed equation, we therefore solve the original equation. Here
is an example:
Thus x = 2 is the solution, and the only solution, of the given equation. The
final equation x = 2 is equivalent to the original equation 3x − 4= x, in the sense
that x satisfies one equation if and only if it satisfies the other. We know that this
is true because it is true for each step. For example, if Eq. (1) holds for x, then (2)
holds for the same x. Conversely, if Eq. (2) holds for x, then Eq. (1) holds also.
Similarly, Eq. (2) holds for x if and only if Eq. (3) holds for the same x, and so on.
The transformations used in going from one step to the next are reversible.
For some kinds of transformations this reversibility may not hold. For
example, consider the following steps:
If we now try substituting each of these values of x back into the original
equation, we find that x = 1/4 is a solution, but x = 1 is not a solution, of the
given equation. In other words, the original equation and the transformed
equation (for example, (4x − 1)(x − 1) = 0) are not equivalent. What happened?
See if you can discover which of the transformations is not reversible.
The answer is that the transformation from Eq. (2) to (3) is not reversible. It
is true that if Eq. (2) holds for x, then Eq. (3) also holds for the same x, but not
conversely. Indeed, if Eq. (3) holds then we will have (by taking square roots)
This is not the same as Eq. (2). Thus the transformation from Eq. (2) to (3) is not
reversible.
Whenever one performs an irreversible transformation, it is possible to
obtain ‘extraneous’ solutions to the original equation, that is to say, numbers that
are not solutions to the original equation. Therefore, it is necessary to check each
final solution by substitution back into the original equation. In the above
example, the extraneous solution x = 1 was introduced in going from Eq. (2) to
(3), as you should check.
As an example, solve by squaring both sides,
simplifying, and squaring again. What are the solutions? Are any of them
extraneous?
The answer is that there is just one solution x = 5/4 to the final equation, and
it is not extraneous. In this example, no extraneous solutions arise.
Another example of a problematic transformation is multiplying or dividing
both sides of an equation by some expression involving the unknown, say x. For
example, in solving x2 = 2x, students sometimes divide both sides by x, obtaining
x = 2. They then state that the solution is x = 2. Can you see what’s wrong with
this conclusion? The answer is that x = 0 is also a solution of the given equation.
Dividing by x, which might be zero, gets rid of this solution. In this case, the
transformation eliminates one solution, rather than introducing an extraneous
one.
Transformations such as squaring both sides, or multiplying both sides of an
equation by an expression, are useful in solving equations, but care must be
taken regarding extraneous or eliminated solutions.
As an example solve
and check that the solution is correct. (Answer: x = 1 or 4; both check out as
valid solutions. The value x = 2 is not a solution, so multiplying by (x − 2) does
not affect the solution set.)
4.6 Inequalities
The statement a < b means that a lies to the left of b, on the real number axis
(assuming that this axis points to the right):
as in −2.5 < 1.5, or −3.4 < 0. Finally, if a, b are both negative, then the order is
reverse to the magnitude, as in −3.2 < −3.1.
The laws of inequality
The laws of inequality follow from the above real-number-axis characterization
of inequality.
Problem 4.44 Show that if a, b are positive numbers with a < b, then 1/b < 1/a.
Of course, a > b means the same as b < a. We could thus dispense with >, but
it is often a convenience. Next, a ≤ b means just what it says, namely either a < b
or a = b. Inequalities of this kind are useful in various contexts later in this book.
Problem 4.45 Which of the laws of inequality remain valid if < is replaced by ≤
throughout?
Now, a fraction is negative if and only if the numerator and the denominator
have opposite signs. There are two possibilities:
Case 1: −4 − 5x < 0 and x +1 > 0.
Thus x> −4/5 and x> −1. Hence x> −4/5. (Why? See below.)
Case 2: −4 − 5x> 0 and x +1 < 0.
Thus x < −4/5 and x < −1. Hence x < −1.
(In Case 1, note that to say x> −4/5and x> −1 is exactly the same as saying
just that x> −4/5. Be sure you understand this. Also, check case 2 again now.)
The conclusion from these two cases is that the given inequality holds if
This is the solution. (In particular, no number between −1and −4/5satisfies the
inequality. Try an example, x = −9/10. This gives 1/(x + 1) = 10, which is not <
5.)
Most students find such examples a bit confusing, because of the logic. Read
the solution again before trying the next problem.
Problem 4.46 Solve the inequalities (a) −2/(x +1) < 5; (b) 3/(x − 1) < −4.
Absolute values
A particularly useful type of inequality is
Here a and b are given real numbers, and x is a variable. We assume that b > 0.
Recall from Section 2.5 that |x − a| equals the distance between x and a on the
real-number line. For example, |5 − 2| = |3| = 3 is the distance between 2 and 5,
which is the same as |2 − 5| = |− 3| = 3.
Thus inequality |x − a| <b says, in words, that the distance between x and a is
less than b. This means that x lies between a − b and a + b;
Thus the inequality |x − a| <b means that x lies inaninterval of length 2b,
centered at x = a.
For example, let us find the solution of the inequality
This is the desired solution. As a check, note that at the end-points of this
interval, x = −4 or +1, we have |2x +3| = 5 in both cases. This makes sense – the
given inequality |2x +3| < 5 is true for x between these values, but not beyond.
We should expect to get equality right at the ends of the interval. (A sketch of the
graph of y = |2x +3| further confirms this point – see Chapter 7.)
Problem 4.47 (a) Solve the inequality |3x − 6|≤ 8. (b) Find an inequality
involving absolute values, corresponding to the interval −3 <x < 7. What is the
general formula?
Quadratic inequalities
Consider the inequality
Two comments regarding step 2: first, recall that for any real
number Q. Therefore Second, to complete step 2, we
must have B > 0. If this is not the case, then the original inequality has no
solution. Here is an example:
Step 1.
We can see, without further ado, that the latter inequality can never be true. The
given inequality has no solutions x.
Problem 4.48 Solve the inequalities (a) 2x2 ≤ 6x +3; (b) x2 − 6x +10 > 0. In both
cases, check your answer by using the quadratic formula.
What would be the pattern for (a + b)4? See if you can figure it out, before
peeking.
The next pattern will be
Thus
Problem 4.49 Find the coefficient pattern for (a + b)5. Use it to write out the
expansion of (a + b)5.
Finally, end with another ‘1.’ Looking back at how the coefficients 1, 4, 6, 4, 1
were obtained earlier from 1, 3, 3, 1, you can see that Pascal’s triangle does the
same thing, but in a tidy way.
The numbers on line n in Pascal’s triangle are called the binomial
coefficients in the expansion of (a + b)n. The written out expression is often
referred to as the expansion of (a + b)n.
Pascal’s triangle has the disadvantage that, in order to expand (a + b)n one
must figure out all the coefficients for all exponents up to n. Fortunately, there is
a single formula for these coefficients, which I will now explain. Let C(n, k)
denote the kth coefficient in (a + b)n. (You should read C(n, k) as ‘C of n and k,’
or just ‘C n k.’) Thus the entries on line n in Pascal’s triangle are
We have, therefore
In math, 3 dots (··· ) are used to indicate the continuation of an indicated pattern.
For example, 1 + 2 + ··· + n means the sum of the integers from 1 up to n. This
type of notation can only be used when the interpretation is obvious and
unmistakable. In reading examples of this notation, you should pause long
enough to understand exactly what the pattern is. For example, mentally insert
the first missing term.
Here is the formula for C(n, k), as I will explain:
That is, n! is the product of the integers from 1 up to n (n ≥ 1), and 0! is defined
separately by 0! = 1. For example,
Let us check that Eq. 4.41 does give the correct coefficients for n = 5.
Problem 4.51 Check that the C(n, k) values are correct for n = 6 as given by
Pascal’s triangle. You only have to calculate C(n, k) for k = 0, 1, 2, 3 (why?)
Some properties of the binomial coefficients that can be seen either from
Pascal’s triangle, or Eq. 4.41, are:
Problem 4.52 (a) Write out the first 4 terms in the expansion (a + b)10, using Eq.
4.41; (b) Show that Also What is the
general case?
Summation notation
The symbol sum Σ (Greek capital sigma) is used in math to designate
summation, as in
In other words, equals the sum of the values qk, with subscript k going
Problem 4.54 (a) To be sure you understand Eq. 4.44, write it out fully for the
case n = 3; (b) Write down the binomial theorem for (u + v)p. Use x as the
summation index.
Problem 4.55 Expand (x − y)5, using any method you like for the coefficients.
Your calculator will give to three decimals. Now raise this number
5
to the 5th power; you’ll get (1.762) = 17. In other words, is the 5th root of
17.
In general, we define
which means that is the nth root of x. We sometimes write
where the symbol is read as ‘the nth root of’. When n = 2 we get the usual
square root, which is always written without the ‘2.’ The expression
is sometimes called a radical.
Problem 4.57 (a) For what values of x is it true that For what values
4
of x is this not true? (b) Solve for x: x = 16 (find all real solutions).
Problem 4.58 Find by using a calculator. Check.
This is exactly what Eq. 4.47 says. Thus the definition of is designed to keep
the math consistent and simple. (A similar argument was used in defining
see Sec. 4.1.)
Next, for an exponent (where p, q are integers) we define
Once again, this definition is motivated by the laws of exponents. It now turns
out that all three rules of exponents hold for arbitrary exponents. Here are the
rules again:
Here a and b can be any rational numbers (positive, negative, or zero), and x
and y are positive real numbers.
Should you, as an average math student, be able to explain (and remember)
exactly why each of these rules is true in general? I don’t think so. But you should
certainly remember why the rules hold for positive integer exponents (Sec. 4.1),
and then just keep in mind that they also hold for any exponents. This will
ensure that you never make errors in using exponents.
A partial proof of the rules for exponents is discussed at the end of this
section.
Problem 4.59 Use your calculator to check that 30.7 × 31.6 = 32.3.
The following additional laws can be deduced from the above laws of
exponents:
We discussed these laws for integer exponents in Sec. 4.1. The fact that they are
valid for arbitrary exponents is in line with the general consistency of
mathematics.
Simplifying radicals
Consider the number We can simplify this as follows:
A more general example of the same idea is
The perfect-square factor x2 under the sign can be removed from under the
sign, taking its square root x (if x> 0).
A similar calculation applies to nth roots. For example,
Be sure you see how this works. You could also check the result numerically on
your calculator. (Before calculator days, was recognized as being a lot
easier to calculate than ) The calculation is an example of ‘rationalizing the
denominator,’ because is irrational, whereas 2 is rational.
Another useful example is
Again, the final result would be easier to calculate by hand than the original
expression.
For any reader dying of curiosity, here’s the proof of Eq. 4.51. First, take the
case where q is an integer. We have to prove that For q =
2, we would be proving that
See? Math is very tightly organized and logical. Some people love mathematics
for this very reason. Others hate it. You can’t please everyone.
Most math teachers are careful to point out that no number of special cases
are sufficient to prove a general result. However, sometimes a well-chosen special
case can indicate how the general result can be proved. That is true for the
example just discussed, as the next problem shows.
Now, while it is true that special cases do not establish a general theorem, the
study of special cases is nevertheless often worthwhile. Three advantages of
looking at special cases before tackling the general problem are:
1. Being easier to understand, special cases can clarify your ideas and help you
to grasp a more general principle. If you can’t understand the special case,
surely you have little hope of understanding the general result.
2. Mastering special cases can build confidence.
3. In original research, you may not know in advance what the general result
is. By looking at special cases you may be able to eventually discover the
general rule.
For a student working on a difficult problem, or a confusing section of the
text, inventing and solving special cases may be the best way to approach general
understanding. However, it is always important to also master understanding of
the general case.
whereas
The general situation is that (−a)m/n makes sense if n is an odd integer, but not if
n is an even integer. (In my opinion, this fact is an unimportant oddity.
However, it does sometimes occur on math tests.)
in two ways, first using MWTFU, and then using a format that some people
prefer, called ‘long division.’
(1) MWTFU
The quotient is Q(x)= x2 +5x + 15, and the remainder is R(x) = 40.
Synthetic division
Consider the same example as above:
Note that the bottom line contains the coefficients of the quotient Q(x)= x2
+5x + 15, and the remainder R(x) = 40. This is called synthetic division. The
algorithm is as follows.
To divide anxn + an−1xn−1 + ··· + a0 by x − a:
1. Write the coefficients anan−1 ··· a0 (including any zeros) on the top line.
Leaving space for a second line of numbers, complete a half-box, as shown.
Write the value a on the left.
2. Copy an to the third line.
3. Starting with the left-most column, repeat the following steps, up to the
final column on the right:
1. (a) Multiply the number on the third line, current column, by a, and
enter on the second line, next column.
2. (b) Moving to the next column, enter its sum on the third line.
where P (a) is the value of the polynomial P (x) when x = a. The example worked
out above had P (x) = x3 +2x2 − 5 and (x − a) = (x − 3). Thus P (3) = 33 +2 × 32 −
5 = 40, which equals the remainder R. You can check that Eq. 4.56 also holds for
the examples in Problem 4.66.
Equation 4.56 is known as the Remainder Theorem.
To prove the remainder theorem, we use the basic definition of division of
polynomials (see Eqs. 4.27-4.29). Thus division of P (x) by x − a means that
where the remainder R is a constant. (Recall that, in general, deg R(x) < deg D(x).
Here D(x)= x − a, which has degree 1. Therefore deg R(x) < 1, so R(x) =
constant.) Substituting x = a in Eq. 4.57, we obtain P (a)= R, and this proves the
remainder theorem.
Problem 4.67 For the example P (x) = 4x2 − 1, and find R by synthetic
division, and check that P(a) = R. Do the same for a = 1.
Can you ‘see’ a solution in your head? How about x = 1? Yes: P (1) = 0, so x = 1 is
a solution. Now apply the factor theorem, which says that (x−1) must be a factor
of P (x). By synthetic division we find that
Could there be other solutions of the equation P (x) = 0? If so, they must be
solutions of
How does one come up with one root of a given cubic polynomial? We will
show in Chapter 7 that every cubic polynomial has at least one real root, which
can be found by numerical calculation. Hence any cubic equation can be
completely solved. There are at most 3 real roots.
How many roots can a given polynomial P (x) have? Here we consider only
real roots; ‘complex’ roots are considered in Part 2.
First of all, a polynomial may have no real roots. For example, P (x)= x2 + 1
has no real roots, because x2 +1 ≠ 0 for all real numbers x. On the other hand, P
(x)= x2 − 1 has two real roots, x = 1 and x = −1
In general, a polynomial of degree n can have at most n roots (real or
complex). To see this, note by the factor theorem that each root a of P (x)
corresponds to a factor (x − a). Thus, if a is a root, then
and therefore
This process can be continued at most n times, because each step reduces the
degree of the quotient by one. Therefore P (x) can not have more than n roots.
8. Solve for the unknown (a) 3 + y = 4y − 6; (b) (3x +5)/(x − 1) = 2 (first multiply
through by (x − 1).
9. Solve the inequalities: (a) 3x − 2 ≤ 2x +1; (b) 2.9y +4.1 > 7.3y.
14. Solve the inequalities: (a) 2x/(x−1) < 3; (b) |x+2|≤ 5; (c) x2 −4x < 12.
15. Expand by the binomial theorem: (a) (c2 + d2)3; (b) (x − 2)6.
16. Simplify
18. State the Remainder and Factor Theorems, and write out the proofs.
Solution 4.1 First, 12−92 = 1−81 = −80, while (1+9)(1−9) = 10×(−8) = −80.
Second, (−4)2 − 22 = 16 − 4= 12 [remember, (−4)2 = (−4) × (−4) = +16] and
(−4+2)(−4 − 2) = (−2) × (−6) = 12.
Solution 4.2 (Just notice that 3n becomes very large as n increases. On the other
hand (.7)n becomes very small for large n. More about this later.)
Solution 4.3 (a) x2x6 = x8; (b) (x2)6 = x12.
Solution 4.4 First, 43 = 64; 42 = 16; 41 = 4; and 40 = 1. Also, 103 = 1,000; 102 = 100;
101 = 10 and 100 = 1.
Solution 4.5 (a) z4; (b) t−4; (c)w.
Solution 4.6
Solution 4.7 If you’ve forgotten any of these points, go back and re-read the
corresponding discussion.
Solution 4.8 (a) (b−2)−1 = b2 by Eq. 4.4; remember that this equation is valid for
negative exponents; (b) (x3)−1x5 = x−3x5 = x2; (c) (x3y−2)4 = (x3)4(y−2)4 = x12y−8 by
Eqs. 4.5 and 4.4. The answer could also be written as x12/y8. (d) (a2/b)3 = (a2b−1)3 =
a6/b3. In general, (x/y)n = xn/yn;this is proved in the same way as this example.
Solution 4.9 After 20 years the value is V20 = (1.07)20V0 = $3, 869.68. Similarly,
the value after 30 years is V30 = (1.07)30V0 = $7, 612.26. Simple interest is interest
that is not compounded, but equals 7% of $1,000 each year. After 20 years, the
total simple interest would be 20 × .07 × $1, 000 = $1, 400, and the total bond
value $2,400. Similarly, after 30 years the simple-interest bond would be worth
$3,100. (Simple interest would apply to a bond that has annual dividends, in this
case $70.00 per year. The investor would presumably cash in these dividends
every year, perhaps reinvesting them, or perhaps using them in the meantime.)
Solution 4.10 We want to compare V0 for i = .06 with V0 for i = .065 (i.e. 6 ½%).
By Eq. 4.16 we have V0 = $5327.26 for i = .065, compared to V0 = $5583.95 for i =
.06. The decease in value V0 is equal to 4.6%.
Solution 4.11 Use Eq. 4.17 with t = 90, Vt = $100,000, and i = .05, or i = .10. You
get V0 = $1,238.69 and V0 = $18.82 respectively. (This calculation tacitly assumes
that there are no other expenses incurred in looking after the forest for 90 years,
such as taxes or fire insurance.)
This problem indicates that the present value of an asset that will only mature
many years in the future, can be very low. Low present values can be a serious
problem in renewable resource management.
Solution 4.12 The projected population is (1.015)2000×(7×109)= 6×1022. The
radius of the earth is about 6,000 km. Using the formula A = 4πr2 for the surface
area of a sphere of radius r we get A = 4.5 × 108 square km. About 1/4 of this area
is land, or 1.1 × 108 square km. Therefore the population density in 2,000 years
will be about 6 × 1022 ÷ 1.1 × 108 = 5.4 ×1014 people per square km, or (since 1
square km is 106 square meters), 5.4×108 people per square meter. In words,
there would be about 500 million people standing on every square meter of the
earth’s land surface. I don’t think so.
Solution 4.13 For t = 10,000 we get Xt = a10,000X0 = .298X0. Thus 29.8% of the
sample remains after 10,000 years. Similarly, 8.9% remains after 20,000 years,
and .0005% after 100,000 years. [Radio-carbon dating of archaeological items is
based on such calculations. The carbon content of a biological organism is
obtained from atmospheric carbon dioxide (CO2), and is a known mixture of
non-radioactive carbon and radioactive C14. Once the organism dies it no longer
takes up atmospheric carbon. By measuring the proportion of C14 in the
archaeological item, the time since its death can be calculated, as in this problem.
However, items older than about 50,000 years contain too little C14 for accurate
measurement, and other radioactive elements must be used.]
Solution 4.16 The l.c.d. is x2y2, and the answer is (Check this in the
usual way.)
Solution 4.17 (To get this, first add the fractions in the numerator, and
then in the denominator. Then invert, multiply, and cancel.)
Solution 4.18
(This can be simplified to by using Eq. 4.1.)
Solution 4.19 (a) is not a polynomial, because the term does not have
a non-negative exponent; (b) is a polynomial (the coefficients can be any real
numbers); (c) is not a polynomial because the term is not of the form axn for
some non-negative integer n.
Solution 4.20 (a) The degree is 5; (b) The terms are 2q (coefficient 2),
(coefficient ) and 7 (coefficient 7). The constant term is 7.
Solution 4.24 (a) 2x3 − 5x2 − 7x − 2; (b) 3x7 + x5 − 13x3 − 11x2 +2x +3.
Solution 4.25 (a) (x − 3)2 = x2 − 6x +9; (b) (2w + z)2 = 4w2 +4wz + z2; (c) (x2 +3)2
= x4 +6x2 +9.
Solution 4.26 (a) 2x4 −3x2 +5 = (2x2 +1)(x2 −2) +7; (b) x3 −x2 +x−1= (x2 +1)(x −
1).
Solution 4.27
Solution 4.29 (a) x2 +6x +8; (b) x2 − 7x + 12; (c) x2 − 5x − 6; (d) 8x2 − 10x − 3.
Solution 4.30 (a) (x +4)(x + 3); (b) (y − 4)(y − 3); (c) (x − 6)(x − 2); (d) (t − 12)(t
+1).
Solution 4.31
2
(c) x + 16
can’t be factored [Well, it could be factored using ‘imaginary numbers, but that’s
another matter and quite beside the point here.]
Solution 4.32 (a) and (b) Re-read the text to be sure you understand these
words; (c) 5x3 − 9x2 +2x +2; (d) 2x5 +7x4 − x3 − x2 +3x − 2; (e) x3 +3x2 +7 = (x +3)
(x2 − 2) + (2x + 13); (f) x2 − 16 = (x +4)(x − 4). Also x2 +6x − 16 = (x +8)(x − 2).
Solution 4.33
Solution 4.34 Substituting x = 1 gives x3 − 6x2 +2x +12 = 9, which is not 0, i.e., x
= 1 is not a solution.
Solution 4.35 (a) x = 6/2 = 3. (b) x = (−2)/(−5) = 2/5. (c) Use two steps here: first
step 2x = 6; second step x = 3. (Check this answer: 3 × 3 − 1= 8 and 3+5= 8 also.)
Solution 4.36 The inequality reverses direction: ac > bc if c < 0. An example: 2 <
3, but 2 × (−2) > 3 × (−2). Be sure you understand this example.
Solution 4.40 (a) Here b2 − 4ac = 9 − 4c, so the equation has real solutions if and
only if 9 − 4c ≥ 0, i.e. if c ≤ 9/4. (b) If a and c are of opposite sign then ac < 0.
Therefore b2 − 4ac > 0, so the equation does have real solutions, by the quadratic
formula.
Solution 4.41 (a) b2 − 4ac = 85; two real solutions; (b) b2 − 4ac = −11.24; no real
solutions; (c) b2 − 4ac = 0; one solution (namely, x = 5).
Solution 4.45 All the laws remain valid. This can be verified by considering
cases. For example, to check the transitivity law, now written as: If a ≤ b and b ≤
c then a ≤ c, we can consider the cases (i) a < b and b < c, (ii) a = b and b < c, (iii)
a = b and b = c. We get, respectively, a < c, a < c, and a = c, so in all cases we have
a ≤ c. This proves that transivity is valid.
Solution 4.46 (a) x < −7/5 or x > −1; (b) 1/4 < x < 1 (i.e., x is between 1/4 and 1).
(If you think solving inequalities is confusing, you are not alone!)
Solution 4.47 (a) −2/3 ≤ x ≤ 14/3; (b) |x−2| < 5. In general, A < x < B if and only
if |x − (A + B)/2| < (B − A)/2. Note that (A + B)/2 is the midpoint of the interval
from A to B.
Solution 4.49 The coefficient pattern for (a + b)5 is 1, 5, 10, 10, 5, 1. Thus (a + b)5
= a5 +5a4b +10a3b2 +10a2b3 +5ab4 + b5.
Solution 4.50 The 6th and 7th lines are 1, 6, 15, 20, 15, 6, 1 and 1, 7, 21, 35, 35,
21, 7, 1. Therefore
Solution 4.51 The values in Pascal’s triangle are 1, 6, 15, 20, 15, 6, 1, and these
equal C(6,k). (There is always a symmetry in the binomial coefficients, so for
example C(6, 2) = C(6, 4), etc.)
Solution 4.52 (a) (a + b)10 = a10 +10a9b +45a8b2 + 120a7b3 + ··· . (b) The general
case is
Solution 4.53
Solution 4.54 (a) For n = 3 we have, using Eq. 4.44 (remember that b0 = 1)
Solution 4.59 Both equal 12.51 to 2 decimals, for example. Your calculator
should show equal values for 10 or so decimals.
Solution 4.61 (a) 9a2; (b) 81; (c) a+b (if a, b are positive; otherwise |a+b|);
Solution 4.64 (a) Doesn’t exist, because x1/4 does not exist for negative x. (b)
0.702.
Solution 4.65 3x4 − 2x3 + x = (x2 + 2)(3x2 − 2x − 6) + (5x + 12).
From these and several other basic facts we are able to develop much of
geometry and trigonometry. But how do we know that these facts are true?
Neither of the above statements is obvious in any sense. One can imagine two
approaches to establishing the validity of such statements as these, the
experimental method, and the theoretical method.
Experimentally, we could obtain measuring instruments (a protractor to
measure angles, and ruler to measure lengths), and check the truth of the
statements by taking measurements. You may wish to try such measurements of
a few examples, for the statements given above. (You will want a protractor in
this chapter, in any case.) Your measurements will probably verify that the
statements are at least approximately correct.
Mathematics, however, is not generally considered to be an experimental
science. Instead, all statements in math are supposed to be proved – then they
are called theorems. But proved how, and from what? From other basic facts,
perhaps, using accepted principles of logic. Then how are these more basic facts
themselves to be proved – from yet more basic facts? It was the strength of Greek
mathematics to realize that this process had to stop somewhere. One must begin
by accepting some statements as being true without proof. Such statements are
called axioms.
Anyone who has tried to read Euclid’s Elements knows how finicky it is to
deduce, by rigorous logic, all the results of plane geometry. This book takes a
more practical approach, explaining the basic facts of geometry in a convincing,
understandable way that, while based upon Euclidean axiomatics, avoids overly
technical detail.
Line segments
The part of a line lying between two points A and B is called a line segment, and
is denoted by AB. The length of a line segment is defined provided that a unit of
length has been specified. In practice, lengths would be measured in familiar
units such as inches, meters, or light years. In general, however, we can assume
some fixed unit of length without actually specifying what the unit is. The
distance between two points A and B is the same thing as the length of the line
segment AB.
Circles
A circle is defined as the set of all points that are equidistant from a fixed point,
which is called the center of the circle. The distance from the center to a point on
the circle is called the radius of the circle. Notice that, by this definition, a circle
is a curve, not a filled-in area. The filled-in area is called a circular disk.
An arc is a segment (connected piece) of a circle. The properties of circles are
discussed in Section 5.3.
Angles
A ray is a half-line, with an endpoint. An angle is a figure consisting of two rays
joined at their endpoints. The point where the rays meet is called the vertex of
the angle. The rays are called the sides of the angle.
Two given angles are said to be equal angles if one of them can be moved
(without distortion) to coincide exactly with the other.
First, we can name the angle by its vertex, for example ∠V (‘angle V’).
Second, we can use three letters, for example, ∠ABC, (‘angle ABC’), where B is
the vertex, and A and C are points on the sides of the angle. Finally, we can use a
single letter, for example, α (‘alpha,’ the first letter of the Greek alphabet). The
letters used here are examples only; any letters can be used.
A straight angle is an angle whose two sides lie along a straight line:
A straight angle can be partitioned into two equal angles, which are called
right angles. Two lines that meet at right angles are said to be perpendicular to
one another.
(180° is read as ‘one hundred and eighty degrees.’) The sizes of other angles are
determined by their relation to a straight angle. For example, a right angle has
size 90°, because two right angles make up a straight angle. Other useful angles
are 30°, 45°, and 60°. Thus three 60° degree angles make up
a straight angle, and two 45° angles make up a right angle. The 30°, 45°, and 60°
angles have special properties, discussed in Sec. 5.2.
What about other angles? The protractor is a tool used to draw and measure
angles.
Problem 5.2 Draw two or three different right-angled triangles (i.e. having one
90° angle). Measure the other two angles, using a protractor, and write down
their sizes. What do you notice?
Two other terms regarding angles are these: an acute angle is an angle less
than 90°, and an obtuse angle is an angle between 90° and 180°.
If you can’t remember which is which, think of acute as sharp, and obtuse as
dull.
Opposite angles
Consider two lines intersecting at a point P . Angles directly opposite each other
at point P are called opposite angles. The two acute angles indicated in the figure
are opposite angles. So are the obtuse angles that are not marked.
A useful fact is that
You could check this experimentally for a number of examples, by using the
protractor. But keep in mind that experimental evidence is not considered
conclusive in math. We can give two explanations, one intuitive and the other
purely mathematical. The figure on the left indicates the first explanation. We
rotate one of the lines about point P until it coincides with the other line. The
rotation sweeps out the same angle on both sides, so the opposite angles must be
equal to begin with.
Problem 5.3 Make yourself a glossary of terms introduced in this chapter so far.
Keep the list on hand, and add to it as you continue reading.
5.2 Triangles
A triangle is a figure with three line segments as sides. The corners of the
triangle are called its vertices. The vertices are often labeled using capital letters,
like A, B, and C.
In this case the triangle may be referred to as ΔABC (‘triangle ABC‘). The
angles at the vertices could then be called ∠A (‘angle A‘), and so on. Other ways
of naming these angles can also be used.
An acute triangle is one having three acute angles. An obtuse triangle has one
obtuse angle.(No triangle can have more than one obtuse angle, as we will see
shortly.) Finally, a right-angled triangle (a right triangle, for short) has one
right angle.
Parallel lines
We have defined two lines L1 and L2 to be parallel if they never intersect, no
matter how far they are extended in either direction. Another property of parallel
lines is that they have the same direction, in the sense that they make equal
angles with any third line L0 that cuts across the given lines. The line L0 is called a
transversal to L1 and L2 and the indicated angles are called corresponding
angles.
Here the phrase ‘and conversely’ means that if the two corresponding angles
shown in the figure are equal, then the lines L1 and L2 are necessarily parallel (i.e.,
they do not intersect). The boxed statement is an axiom.
Given a line L and point P not on L, we can construct a line L1 through P and
parallel to L, as shown below:
First, draw any line L0 through P , cutting line L. Then draw L1 through P ,
making the same angle with L0 that L does.
Problem 5.4 (Use your space intuition on this problem). Imagine you have two
lines in space. If these lines never intersect, does it follow that they are parallel?
Another pair of equal angles produced by two parallel lines is shown in the
next figure. These are called alternate angles. Can you see why alternate angles
are equal? It’s because the alternate angle is opposite to the corresponding angle
(check this in the figure). Since opposite angles are equal, as are corresponding
angles, the result follows.
From this result we can now prove the following important theorem.
Theorem In any triangle the sum of the interior angles equals 180°.
Proof Let ABC be a triangle, with angles α, β, and γ as shown. Draw a line L
through C and parallel to line AB. Then line AC is a transversal to these two
parallel lines, so that as shown in the figure. Similarly, But
(since these three angles make up a straight angle at C).
Therefore α + β + γ = 180°.
Problem 5.5 Let ABC be a right triangle having ∠C = 90°. Show that ∠A and
∠B are complementary angles.
with angles α, β, γ at the vertices A, B, C respectively. Let CD extend the side AC,
as shown. Show that ∠DCB = α + β. Suggestion: again, use the sum-of-interior-
angles theorem. (Think about this problem for a bit. You may not ‘see’ the
argument at first, but hopefully you will eventually. It might help to label the
angle BCD on the figures as angle X,say.)
Angle DCB is called an exterior angle to the given triangle ABC. The angles α
= A and β = B are called opposing interior angles to this exterior angle DCB.
Thus the statement of Problem 5.6 can be put in words: An exterior angle of any
triangle is equal to the sum of the two opposing interior angles.
By the way, note that Problem 5.5 can be viewed as a special case of Problem
5.6. Explain.
Did you figure out the explanation? If γ = 90°, then ∠DCB = 90° also.
Problem 5.6 says that ∠DCB = α + β. Therefore α + β = 90°, which is what
Problem 5.5 says.
Now for some advice. When reading this book, please read slowly and with
meticulous attention to details. Try to solve the problems yourself, before
looking up the solution. Also, take the time to answer questions. The problems
and questions are there to make you think. Thinking helps you to understand
and remember. Every math teacher knows that students need to think things out
for themselves, slowly and carefully. This is an essential part of learning
mathematics. It’s quite different from learning, say, Spanish, where you just keep
repeating new words until you memorize them.
Even where there are no problems or questions, you can often pose questions
for yourself. Think of some examples. Ask if you understand why a statement is
true. Try to discover interesting special cases.
For example, take the theorem about the sum of the interior angles of a
triangle. Can you think of any special cases, other than a right triangle? Here’s
one: if all three angles are the same, how big is each angle? Or, if the other two
angles in a right triangle are equal, how big are they? Sketch these triangles.
Another question: suppose a person walks around a triangular field, always
keeping the field on her left. When she returns to the starting point, what total
angle has she turned through? Does this have anything to do with Problem 5.6?
If you’re studying this book on your own, you don’t have a teacher to ask
such questions. So be your own teacher – it will enrich and improve your
learning.
Proofs
Why should you bother learning proofs in math? Can’t you just memorize each
theorem and forget all about the proofs? This is a tricky question. I don’t want to
tell you that you MUST understand and remember every proof in math, forever.
Nevertheless, I do believe that taking the time and effort to carefully read and
understand the proofs in this book (there aren’t that many) will be well
worthwhile. Here’s why:
1. Reading and understanding the proof will ensure that you fully understand
all the concepts involved in the statement of the theorem.
2. The mental effort of understanding the proof will indelibly imprint the
theorem in your brain, even if you forget the details of the proof later on.
3. Understanding proofs will establish confidence in your overall mastery of
basic math. The subject will ‘hang together,’ rather than being a collection
of unconnected facts.
4. Understanding a proof can be a mental challenge, and overcoming mental
challenges is an essential aspect of learning mathematics. It’s the difference
between being an active participant and being a passive observer.
5. There can be a big difference between understanding a proof and
remembering it. Even if the proof is later forgotten, you will retain the
assurance when encountering the theorem later that you once understood
the proof.
Regarding some of the Problems in this chapter, you may be asking, ‘Why
should I do proofs? I’m never going to become a mathematician.’ It’s a fair
question; proofs can be hard. Today’s school math seems to downplay proofs,
but this is definitely not the case at the college level, where proofs, or ‘Show that
…’ questions often appear on exams. If you are in a college program or plan to
be in one, you need to learn how to do proofs. In any case, doing your own
proofs can be a powerful aid to learning mathematics.
So how should you tackle a proof, or ‘show that’ type of problem? It often
helps to write out the problem for yourself. What is given, and what is to be
shown? How could this relate to the material just discussed in the book? Even
quite simple proofs often take an inordinate amount of time to think up. If
you’re like me, you may tend to panic if you don’t immediately get the idea.
Well, don’t panic. Stick with it. Get some coffee. Try again, from the beginning. I
once spent a whole week on a Physics problem. The answer finally jumped into
my head while I was waiting for the bus to go to class. It was so obvious. (In fact,
I was the only student in the class to turn in the solution!)
Problem 5.7 What is the sum of the interior angles of any quadrilateral (4-sided
figure)? Suggestion: draw one and try to think up a construction that will reduce
the given problem to a known result.
‘In a right triangle, the square of the hypotenuse equals the sum of the squares
of the other two sides.’
In symbols
Now we need the algebraic formula (a + b)2 = a2 +2ab + b2; see Chapter 4.
Substituting into the preceding equation, we obtain
Problem 5.9 There are many known proofs of Pythagoras’s theorem. Prove the
theorem on the basis of the figure shown here.
Similarity: Congruence:
Translation Translation
Rotation Rotation
Reflection Reflection
Uniform scaling
Note carefully that the concepts of similarity and congruence allow for
reflection. The fact is seldom emphasized in school texts. You might imagine that
a rigid motion in the plane just means ‘sliding’ a figure around, using translation
and rotation. But this is not correct – reflection is also allowed. (Question.
Suppose you reflect a figure in one line, and then reflect it again in a second line.
Is the final figure ‘slideable’ to the original? Try to guess. The mathematical proof
requires advanced technique. The answer is yes.)
Congruent triangles
Congruent figures are identical in all respects (after a rigid motion). In the case
of triangles, if ΔABC and ΔA′B′C′ are congruent, then AB = A′B′, AC = A′C′, and
BC = B′C′. In addition, ∠A = ∠A′, ∠B = ∠B′, and ∠C = ∠C′. Thus, congruent
triangles have three sides equal, and three angles equal, one triangle to the other.
Now let us consider the reverse problem. Given two triangles that have their
sides and angles equal (from one triangle to the other), are these triangles
necessarily identical in the sense of being congruent? The answer is yes, but we
can say much more. We actually only need to know that certain parts of two
triangles are the same, to be able to conclude that the triangles are in fact
congruent.
Let us formulate the question slightly differently. How much information do
we need to have about a triangle in order for this triangle to be uniquely
determined (so that any two such triangles would be congruent)? There are three
cases in which this is so:
We will consider these cases in turn. First let us agree on some conventions. The
triangle will be labeled ABC, going counterclockwise. Side AB is always given,
and will be drawn horizontally.
To get a feeling for Case 1, try the next problem.
Problem 5.11 (a) Using a ruler and compass, construct a triangle having sides 3,
5, and 7 cm. [Use inches if you prefer.] If you don’t have a compass handy, just
show how you would draw the triangle if you did have one. (b) Try doing this
with sides 3,4, and 8 cm. What goes wrong?
The point of this problem is to convince you that only one triangle (or its
reflection) can be drawn, given the three sides a, b, and c. The reason behind this
is that two circles (with centers at points A and B) can only intersect at two
points C and C′. Therefore, the construction produces only two triangles, which
are congruent by reflection. (Possibly the two circles fail to intersect at all, and in
this case there is no triangle having sides a, b, c. See Problem 5.11b.) This takes
care of case 1: Two triangles which have their three sides equal are congruent
because there is only one way to draw such a triangle (except for reflection).
Cases 2 and 3 are also quite easy to understand. (See the previous figure
showing the three cases.)
Problem 5.12 (a) Using a ruler and protractor, construct a triangle having AB =
5 cm, AC = 4 cm, and ∠A = 30°. (b) Construct a triangle having AB = 4.5cm,
∠A = 30°, and ∠B = 100°.
In Problem 5.12, as in the preceding problem, you see that the given
information, whether case 2 or case 3, can only produce one triangle, once side
AB has been drawn. (However, point C could be produced either above or below
AB, depending on how the angles are drawn, up or down. The two triangles that
could be drawn are, as before, congruent by reflection.)
Let’s summarize these facts about congruent triangles as a theorem. The
foregoing discussion should serve as an adequate proof of the theorem.
Theorem Let ABC and A′B′C′ be two triangles, with sides a, b, c and a ,b ,c
respectively. If any one of the following conditions hold, then the triangles are
congruent.
1. (Three sides equal) a = a′, b = b′, and c = c′.
2. (Two sides and the included angle equal) b = b′, c = c′, and ∠A = ∠A′.
3. (One side and two angles equal) c = c′, ∠A = ∠A′and ∠B = ∠B′.
Problem 5.13 Suppose that the three angles ∠A, ∠B, and ∠C of a triangle are
given. Is the triangle completely specified by this information? Why?
Similar triangles
Two triangles are defined to be similar if one is a scaled version of the other
(except for position and orientation). Now scaling does not change angles.
Therefore similar triangles have equal angles, between one triangle and the other.
Thus, in the figure we have ∠A = ∠A′, ∠B = ∠B′, and ∠C = ∠C′. Let us prove
the converse of this.
Theorem If in two triangles ABC and A′B′C′ we have ∠A = ∠A′, ∠B = ∠B′, and
∠C = ∠C′, then these triangles are similar.
Proof Move triangle A′B′C′ so that vertices A and A′ coincide, and side A′B′ lies
along AB. By assumption ∠A = ∠A′, so that side A′C′ lies along AC (possibly
after reflection).
Next, scale triangle A′B′C′ to make side A′B′ the same as side AB. This scaled
triangle has one side and two angles the same as triangle ABC, so these two
triangles are congruent, that is, identical. Thus ΔA′B′C′ can be scaled to agree
with ΔABC, which shows that the triangles are similar, by definition.
To repeat, if two given triangles have the same angles, then they are similar –
one is a scaled version of the other. (Of course, knowing that two of the angles
are the same is sufficient, because the third angles would then also be the same.)
What is the relationship between the sides of similar triangles? If triangles
ABC and A′B′C′ are similar, then the second is a scaled version of the first. If k is
the scale factor, we have a′= ka, b′= kb, and c′= kc. We can write these equations
in the form
Thus, if two triangles are similar, the ratios of their three sides are the same, and
all three ratios equal k, the scale factor. Conversely, if two triangles have their
sides in equal ratio, then they are similar:
Theorem Let ABC and A′B′C′ be two triangles, with sides a, b, c and a′,b′,c′
respectively. If
then these triangles are similar.
Proof The proof is similar to the preceding theorem; see the diagram there. We
first move triangle A′B′C′ so that A′ coincides with A, and A′B′ lies along AB.
Next, we scale A′B′C′ by the (inverse) scale factor k1 = a/a′= b/b′= c/c′. Then the
scaled triangle has sides a, b, c. By the theorem on congruent triangles, this scaled
triangle is congruent to ΔABC. In other words, ΔA′B′C′ can be scaled to coincide
with ΔABC, so that the given triangles are similar.
Problem 5.14 Suppose ABC is a triangle with sides a = 5, b = 12, c = 13 [note that
52 +122 = 132]. The angles are ∠A = 22.6°, ∠B = 67.4°, ∠C = 90°. If ΔA′B′C′is
similar to ΔABC, with a′= 8, find b′, c′and ∠A′, ∠B′, ∠C′.
This is read as ‘a′is to b′is to c′, as a is to b is to c.’ To repeat, this just means that
a′/a = b′/b = c′/c, as in Eq. 5.3. Then the theorem about similar triangles can be
stated as:
This colon notation is seldom used today, but you may encounter it in old
books.
To summarize, if two triangles ΔABC and ΔA′B′C′are similar, meaning that
one is scaled-up (or -down) version of the other, then
1. their angles are the same:
Triangulation
Triangulation is a method of finding a distance without actually measuring it.
The method uses similar triangles. The next problem explains the idea.
Problem 5.15 Given that ∠A = 75°, ∠B = 65°, and c = 8 cm, draw the triangle
ABC carefully, using a protractor. Now suppose ΔA′B′C′is similar to ΔABC with
c′= 53 m. Find a′and b′by first measuring a and b on your diagram, and using
proportionality.
uncross, until adjacent lines overlap. Hold this position for a few seconds. The
pattern should ‘jump’ off the page and appear closer than it really is. Your brain
has used the angle information to re-calculate the distance to the lines. (For
fascinating discussions of mental computations, read How the Mind Works by
Steven Pinker, W.W. Norton, 1997).
Next, how can we determine the three angles of a triangle, if we know its
three sides? This is by no means obvious! The method of drawing the triangle
and measuring its angles can only give limited accuracy. We need an exact
method. It so happens that by using a scientific calculator you can calculate the
angles of a triangle, within the calculator’s accuracy – usually about 10 digits.
Such a feat was humanly almost impossible before the advent of electronic
calculators and computers (but trigonometric tables could be used for 3 or 4-
digit accuracy).
To use a calculator for solving triangles (to ‘solve’ a triangle means to
calculate the unknown sides and angles), you have to know some trigonometry.
A full treatment of trigonometry occurs in Chapter 8; here we will discuss the
important special case of right triangles. Consider a right triangle, as shown. One
angle is labeled θ (Greek letter ‘theta’). The sides of the triangle are called:
These names are an aid to memory only; later we rewrite everything using
standard math symbols.
The trigonometric functions of θ are defined as
Here sin is an abbreviation for ‘sine,’ cos for ‘cosine,’ and tan for ‘tangent.’
(Also sin θ can be read as ‘sine θ,’ or ‘sine of θ.’) These are the functions shown
on the keys of your scientific calculator. You will need a scientific calculator for
the rest of this section. (See Sec. 5.7.) The word-definitions in Eq. 5.4 are an aid
to memory only. Most people seem to remember sin θ, cos θ, and tan θ in this
way. If instead we use symbols as shown here, the definition becomes (check this
carefully).
For example, suppose x = 6, y = 2. Then (to two
decimals), and we have
Also cos θ = x/r = .95 and tan θ = y/x = .33. (Your calculator will give these values
with much greater precision.)
You may be wondering what happens if the triangle shown near Eq. 5.5 is not
of a specific size. All the sides x, y, and r depend on the size of this triangle. Also
sin θ = y/r depends on y and r. Can you explain this possible confusion? What
would happen if two different right triangles (both having angle θ in the position
shown) were used – would two different values for sin θ come out?
The answer is that any two such triangles are similar, so that the ratio y/r is
the same for both triangles. Therefore, in the definition of Eq. 5.5 it is not
necessary to specify any particular right triangle, and any one will do. The
argument applies also to cos θ and tan θ.
Problem 5.19 For a right triangle with y = 1.9 and r = 3.2 find x, sin θ, and cos θ.
In this problem, you calculated sin θ = .59 for the given right triangle. But
how do you calculate the angle θ itself? Your scientific calculator does this.
On most calculators ASIN (or SIN−1) is an ‘alternate’ key – you have to press
some other key first, then ASIN. Each calculator is different – read the manual;
usually there is a color match between the alternator key and ASIN. By the way,
the official name of ASIN is ‘Arc sine,’ and SIN−1 is ‘sine inverse.’ These terms are
explained in Chapter 8.
As an example, enter .59 on your calculator, then activate ASIN (or SIN−1).
You should get 36.16°. Thus, given that sin θ = .59, the angle θ is 36.16°. (If your
calculator does not give this result, check that it has been set in DEG mode.
There will be a key combination that accomplishes this; again, see your manual.)
As a check, now push the SIN key; you should get .59 again.
To repeat,
And similarly for cos θ = z versus Acos z = θ, and so on. The functions sin
and Asin are called mutual ‘inverse functions’ because of this relationship.
Chapters 7 and 8 discuss inverse functions in detail. As a mnemonic device you
can think ‘angle whose sine is’ whenever you encounter Asin. For example, Asin
.5 is the angle whose sine is .5. Find this angle using your calculator. Surprised?
I’ll explain this in a moment.
Problem 5.20 (This problem will familiarize you with using a scientific
calculator to solve triangles.) Consider a right triangle with sides x = 5, y = 3.
Find the angle θ to the accuracy of your calculator, first by using tan θ = y/x and
the Atan key. Next, as a check, use cos θ = x/r and the Acos key. Finally, use sin θ
= y/r and the Asin key. The results should be exactly the same (or almost exactly
the same).
Problem 5.21 Try to calculate Asin 2. You will probably get an error message.
Can you figure out why? Suggestion: review the definition of sin θ, Eq. 5.5, to see
why sin θ = 2 would be impossible.
The result of problem 5.21 shows that, for any acute angle θ (i.e., for 0 ≤ θ ≤
90°) we have
Problem 5.22 Solve the following right triangles, to 3 digits accuracy. Make a
quick sketch of each triangle, to check that your answers are reasonable. (a) y =
2.8, r = 7.5; (b) x = 3.6, θ = 52°.
What about the values of tan θ? Use the ATAN key to find the angle whose
tangent equals 50, then 500. What explains this? You find that
Look at the figure above Box 5.5 and try to explain this. What would tan 90° be?
Now we would have x = 0, so is undefined. And for angles close to
90°,the ratio y/x becomes very large.
This phenomenon is typical for expressions containing x in the denominator.
These expressions are undefined if x = 0, and they usually ‘blow up’ if x is very
small. For example, use your calculator to calculate for x = .1,.01,.001,…, etc.
The reciprocal of a very small number is a very large number (and vice versa).
More on this in Chapter 7.
Please keep in mind that as yet, we have not discussed methods for solving
triangles other than right triangles. This topic is studied in Chapter 8.
Problem 5.23 (This problem involves drawing a graph. Skip it if you forget how
to draw graphs.) Using your calculator, make a table of values of sin θ for θ = 0°,
30°, 60°,…, 180°. Use this data to plot a graph of sin θ for 0 ≤ θ ≤ 180°.
5.5 Isosceles triangles
A triangle having two equal sides is called an isosceles triangle (pronounced eye-
saw-sell-ease). It seems evident from the figure that an isosceles triangle must
also have two equal angles, ∠A = ∠B. We can prove this (and also the converse)
by using the theorem about congruent triangles.
Theorem Let ABC be an isosceles triangle, with a = b (see figure). Then ∠A = ∠B.
Conversely, if ABC is a triangle having ∠A = ∠B, then it is isosceles with a = b.
Proof For the first assertion, we assume that a = b. We construct the line through
vertex C and bisecting the base AB, as shown. (‘Bisect’ AB means to cut AB into
two equal parts). Let D be the point of bisection. The triangles ADC and BDC
have three equal sides: (1) AC = BC by hypothesis (remember AC = b and BC =
a); (2) AD = BD by construction; (3) CD is the same for both triangles. Therefore
ΔADC and ΔBDC are congruent. Since congruent triangles have equal angles, it
follows that A = B, as claimed.
To prove the converse, we now assume that A = B. We now construct line CD to
be perpendicular to the base AB (it happens also to be the bisector of AB, but that
is not proved yet). Triangles ADC and BDC have two equal angles (why?), and
one common side. Applying case 3 of the theorem on congruent triangles, we
conclude that ΔADC and ΔBDC are congruent. Since congruent triangles have
equal sides, it follows that AC = BC, i.e., a = b.
There, that was relatively painless, I hope. You have now mastered four of the
most important theorems of Euclidean geometry. These theorems have
countlessly many uses, some of which we will see soon.
Problem 5.24 (a) Recall from memory the statements of: (i) the theorem on
interior angles of a triangle; (ii) Pythagoras’s theorem; (iii) the theorem on
congruent triangles (3 cases); (iv) the theorem on isosceles triangles. If you have
to look them up first, then try closing the book and writing them down from
memory. (b) Next, recall the proofs. The key to most of these proofs is the
construction that is used, so try to remember this. Ten years from now, you may
have consciously forgotten these proofs, but they will still be in your brain
somewhere - provided you take the trouble to understand them now. In the
future you’ll know that once upon a time you did know these proofs.
Problem 5.25 Prove that an equilateral triangle is equiangular, and vice versa.
(Equilateral means that all three sides are equal; equiangular means that all three
angles are equal.) Suggestion: use the theorem on isosceles triangles. What is the
size of an angle in this case?
Problem 5.26 Let ΔABC be isosceles, with a = b, and let D be a point on the line
AB. Prove that D bisects AB if and only if CD is perpendicular to AB. Suggestion:
think about the preceding proof of the theorem on isosceles triangles.
The line that is perpendicular to and bisects a given line segment AB is called the
perpendicular bisector of AB. Note that if O is the center of a circle that passes
through points A and B, then O must lie on the perpendicular bisector of the line
segment AB. This is because OA = OB (both equal to the radius of the circle), so
triangle OAB is isosceles.
Problem 5.27 Let AB be a chord to a given circle. (A chord is a line joining two
points on a circle.) Show that a radius drawn perpendicular to AB necessarily
bisects AB.
(2) The 30° − 60° right triangle. Consider a right triangle having one of its angles
60°. Thus the remaining angle is 30°. Now this 30° − 60° right triangle is half of
an equilateral triangle. Why? If we let the side of the equilateral triangle be 2
units long, then the sides of our 30° − 60° triangle are x = 1, r = 2, [Be
sure you see this.] From this we obtain
Again, this triangle is worth remembering. It will be used often in the chapter
on Trigonometry.
Problem 5.28 Find sin 30°, cos 30°, and tan 30°. Suggestion: use the 30° − 60°
figure above, and use with θ as the 30° angle in that figure.
(Alternatively, you could redraw the triangle lying on its side.)
To see this, note that in the figure sin(90° − θ) = opposite/hypotenuse = x/r = cos
θ, where ‘opposite’ now means the side opposite to the angle (90° − θ). The
second equation follows in the same way.
Equation 5.8 explains the word ‘cosine.’ The cosine of an angle θ is equal to
the sine of its complement, 90° − θ, and vice versa.
We end this section with a theorem about circles. Remember that a circle is
defined to be the set of all points which are a fixed distance r (the radius) from a
given point O (the center).
Theorem Let AB be a diameter of a given circle. Choose any point C on the circle,
and consider the triangle ABC. Then C = 90°.
Proof First, draw the line OC. (I don’t draw this here, but you should make your
own drawing.) Then ΔAOC is isosceles, because AO = CO (both being equal to
the radius r of the circle). Therefore A = ACO (mark this on your drawing).
Similarly, ΔBOC is isosceles, and therefore B = BCO. Therefore C = ACO + BCO
= A + B.
Now A + B + C = 180° (sum of interior angles). Substituting C = A + B, this
becomes 2( A + B)= 180° or A + B = 90°. Therefore C = 90°.
There are literally hundreds of theorems in Euclidean geometry that could
now be gone into. Most of them are not sufficiently useful to be included in this
book. The theorems that were discussed in this section will be used often later in
the book, however.
Here π (Greek ‘pi’) is a certain real number. Your calculator will display π =
3.141592654 (to 9 decimals).
Equation 5.9 raises some interesting questions:
1. Equation 5.9 says that the circumference C is proportional to the radius r. Is
this obvious from basic principles?
2. How is π calculated? Could the Greeks calculate it?
3. Is π = 22/7?
Let’s start with the ridiculous. I’m always surprised by the number of people who
think they remember that π = 22/7. Especially because it’s wrong! π is not equal
to 22/7. Your calculator will show you that 22/7= 3.1428 … whereas π = 3.1415
…. Thus 22/7 is an approximate value for π, valid to three digits only. This
approximation is good enough for many applications, but is not accurate enough
for every situation in modern science and technology.
Next, consider question (1) – why is C ∝ r? This is a matter of scaling. If we
scale a circle, with scale factor k, we get another circle, with radius and
circumference both scaled by the same scale factor k:
Let the original circle have radius 1 unit and circumference C0 units
(whatever C0 may be). If the new circle has radius r, the scaling factor must be k =
r, so the new circumference is
Area of a circle
If A denotes the area inside a circle of radius r, we know immediately from
scaling that A ∝ r2. Thus there is a constant k such that A = kr2. In fact k = π, the
very same number that comes up in the circumference formula, Eq. 5.9.
The Greeks knew this formula, and were able to prove it from Eq. 5.8. (See
Section 5.8.)
Problem 5.31 What is the area of the annulus shown here? The inner and outer
radii are 8 mm and 25 mm.
Degrees (Radians)
360° 2π
180° π
90° π/2
60° π/3
45° π/4
30° π/6
Arcs of a circle
An arc s is a segment of a circle. An arc is said to subtend an angle θ at the
center of the circle, in the accompanying figure.
The length of an arc of a circle of radius r, subtending an angle θ at the center, is
given by
Here, the angle θ is given in radians. (The simplicity of this formula is the
main reason for using radians in math.)
‘
Equation 5.12 is directly related to the formula C = 2πr for the circumference
of a circle. Namely, if we take θ = 2π, then the arc s is the entire circle. In this case
Eq. 5.12 becomes s = rθ = 2πr, which is correct. Since arc length s is clearly
proportional to the angle θ, Eq. 5.12 must be correct in general.
Problem 5.34 (a) Find the length of a 60° arc in a unit circle. What is the ratio of
this length to the length of the 60° chord? (b) Same problem for a 1° arc and
chord.
Equation 5.12, s = rθ, has three variables, and can be solved for any one
variable in terms of the other two.
Problem 5.35 (a) Solve s = rθ for r, and for θ. (b) An arc of a circle of radius 16
cm has length 8 cm. Find the subtended angle θ, in radians and degrees.
(The diagram assumes that the sun is directly overhead of Alexandria, which
is not the case. However, the directions to the sun from the two cities are parallel,
which still implies that the arc AS subtends an angle of 7° at the center of the
earth, O.)
Area of a sector
The region enclosed by an arc and the sides of the subtended angle θ is called a
sector of the circle. The area of a sector is
(where θ is given in radians). This equation can be deduced from the formula A
= πr2 for the entire circle. Namely, if θ = 2π then Eq. 5.13 becomes A = πr2, which
is correct. Since A is proportional to θ, Equation 5.13 is therefore correct for any
angle θ. (See Problem 5.31.)
Problem 5.36 If the radius of a circle is given in cm, what are the units of arc
length s and area A in Eqs. 5.12 and 5.13?
Problem 5.37 Find the area of the window shown here. The top of the window is
not a semicircle, but an arc of a circle of radius .6 m. Suggestion: first try to figure
out the area of the top part; this requires some thought.
Angular velocity
We now consider circular motion, as in a rotating wheel. The rate of rotation is
called angular velocity, and is often denoted by ω (Greek ‘omega’). Specifically,
where θ = angle turned, and T = time taken. Notice the analogy with speed, S =
D/T , as discussed in Chapter 3.
Angular velocity can be measured in various units, including degrees per
second, radians per second, or RPM (revolutions per minute). For example,
suppose a shaft is rotating at 1200 RPM. How many radians per second is this?
Answer:
i.e., 125.7 radians per second. Note again that radians are dimensionless, and
need not be explicitly mentioned.
Problem 5.38 Two meshed gears have 52 and 20 teeth, respectively. If the larger
gear is rotating at 175 RPM, how fast is the smaller gear rotating? What is the
general formula?
where s is arc length, subtended by angle θ (in radians). Dividing both sides of
this equation by time T ,we obtain
Here, s/T is distance/time, or velocity v (we use the term velocity here, rather
than speed, to avoid confusion between using s to denote speed, or to denote arc
length). Likewise θ/T is angular velocity, ω. Therefore the above equation
becomes
What are the units of measurement in this equation? Suppose, for example,
that r = 36 cm (as in a typical bicycle wheel) and v = 20 km/hr, which equals 20 ×
105/60 cm/min (check this). Then
and this means 926 radians/min. Since 2π radians equals 1 revolution, we can
write
Clearly there are several possibilities for the units in Eq. 5.15, depending on what
units are used for each of the variables v, r and ω.
Problem 5.39 A satellite is located 35,800 km above the equator. At what speed
must it move so as to remain stationary relative to a location on the surface of the
earth? The radius of the earth is approximately 6000 km.
Problem 5.40 A world-class golfer achieves a clubhead speed of about 130 mph.
How long does it take him to complete one swing (ignoring the take-away)?
Assume that the radius of the circle made by the clubhead is 6 feet, and that the
clubhead traces out an arc of 540°.
Now, because the circle is symmetric with respect to the line OA, its tangent
line at A must also be symmetric in this way. Therefore the two angles made by
the radius OA and the line L must be equal to one another. This means that each
angle is 90°, a right angle. We state this fact as a theorem.
If O denotes the center of the circle, then O must lie on the perpendicular
bisector of line AB. This is because, if the circle passes through A and B, its center
must be equidistant from these points. For the same reason, O must lie on the
perpendicular bisector of BC. Theinter-section of these two bisectors determines
O uniquely, and O is indeed the center of the circle through A, B, and C. (Point O
is also automatically on the perpendicular bisector of AC – why?)
Problem 5.43 Explain how to construct the inscribed circle to a given triangle,
this being defined as the circle that is tangent to all three sides. Suggestion: First
prove that points lying on the bisector of a given angle are equidistant from the
sides of the angle, and conversely. (Use the congruent triangles theorem.)
You can quickly learn how to use your calculator by reading through the owner’s
manual and trying each feature. This book provides plenty of practice in using a
scientific calculator. Only a few pointers will be discussed here. Try them out.
Scientific notation. You can switch between regular display of numbers, and
scientific notation. The latter may be shown in Enotation, for example
Most scientific calculators display about 10 digits (plus the exponent). They
actually perform calculations to one or two more digits, resulting in high
accuracy. Nevertheless, minor errors in the last digit displayed inevitably do
occur sometimes, but this is virtually never a problem.
It is possible to control or ‘fix’ the number of digits displayed, and this is
definitely worth doing. The calculator always calculates in its maximum
precision mode, but then rounds off to your fixed display accuracy.
You need to use this carefully, to make sure that it’s what you want to do.
By the way, some scientific calculators use RPN – Reverse Polish Notation.
This is a method of calculation that dispenses completely with all brackets. I find
this vastly easier and more reliable than standard bracket-based calculation. But
it does take some getting used to.
Memory. The scientific calculator provides several memory registers, which are
useful in various ways. First, a complicated calculation can be broken into
simpler parts, with results stored in memory and recalled later. Also, you may
sometimes need to use the same value in several calculations. If so, you can store
this value in memory and keep using it. Most calculators will preserve values in
memory even when the calculator is turned off, which can be useful. (Storing a
new value in memory will erase the old one.)
Trigonometric functions. All the trig functions (sin, cos, tan) and their inverses
are built into the calculator. You have to ensure that the calculator is in the
proper mode, whether degrees or radians. (A third mode, grads, may also be
provided; this is used by engineers.)
Logarithmic and exponential functions. These are also built in; see Chapter 9.
Powers. A very useful feature. It instantly calculates yx for any values of x, y (but y
must be non-negative for most cases). For example, try calculating 73 = 343. The
sequence on my TI calculator is (There are also special keys for x3
and but not for powers higher than 3. To calculate you would
key in
Special keys. Try the keys for x2, 1/x, and π. To check your technique,
calculate π × (1/π) using these keys. The sequence is
Graphics. A graphic calculator draws the graphs of functions that you program in
(see Chapter 7). This could help a lot in math tests.
Other features. Your handbook will describe any other features of the calculator.
Enjoy your calculator – it’s one of the wonders of technology.
Now, this last approximate equation holds no matter how small h is; in fact, the
smaller we take h the closer the approximation becomes. If we take h negligibly
small, we get
3. Express these angles in radians, and sketch them: (a) 3π/4; (b) 2π/3.
4. Three positive numbers a, b, c are given. What is the easiest way to tell whether
these numbers can form the three sides of a triangle? Explain. (Think of
drawing it.)
5. The figure depicts a 70° sector of a circle with radius 20 cm. Find (a) the arc
length s; (b) the area of the sector.
6. A parallelogram has sides of 2 m and 4 m, and one interior angle of 60°. Find
its area.
8. Find the area of an isosceles triangle having two sides of length s, and two
angles of size θ. Show that the answer to Problem 7 is a special case of this
result.
9. Two pulleys, of diameter 6 in and 2 in are joined by a belt. If the larger pulley
is rotating at 100 RPM, how fast is the smaller pulley rotating, and why? What
is the general formula?
10. Triangles ABC and A′B′C′ are similar. If AB = 6 and AC = 10, find the ratio A
′B′/A′C′. What is the general rule?
11. Two triangles are similar, with scale factor k. How are the perimeters related?
The areas? (The perimeter is the total length of the sides of the triangle.)
12. Suppose an arc of a circle is given, but the center is not known. Show how to
locate the center. Suggestion: first show how to draw a line that must pass
through the center.
15. Two radii OA and OB of a circle of radius 6 cm make an angle of 100°. Find
the length of the chord AB, and of the arc AB.
16. What angle above the horizon is the sun at mid-day on Dec. 21, at New York
City? Relevant information: On Dec 21 the earth’s axis is tilted from the sun
the maximum amount, NYC is located at 41° North latitude,
approximately. Suggestion: one way to think about this problem is to first
realize that, if you were located on the Arctic circle, at latitude
the sun would be right on the horizon on Dec. 21. Work
back from there.
17. Show how to calculate the length of the crossover tangent to two circles.
Suggestion: Use a method similar to that of Problem 5.42.
18. A rectangle R has the property that when cut into two equal smaller
rectangles (the cut being parallel to the short sides), the new rectangles are
similar to the original rectangle. What is the ratio of the longer to the shorter
side of R?
19. Rectangle R has length l and width w. If a square of side w is cut off, the
remaining rectangle is similar to the original rectangle R. Find the ratio l/w.
(This value of l/w is called the golden ratio. It is thought to determine the
most aesthetically pleasing shape for rectangles.)
20. Prove that the diagonals of a rhombus are perpendicular bisectors of each
other. (A rhombus is a parallelogram having four equal sides.) Also, state and
prove the converse.
21. The distance from the earth to the moon is about 384,000 km. The moon’s
disk subtends an angle of approximately 0.5° at the earth. Find the diameter of
the moon. Also, compare the volumes of the two bodies (the earth’s diameter
is about 12,000 km).
Solution 5.1 L1 and L3 are parallel to each other (or possibly coincident).
Solution 5.2 You should notice that the sum of the other two angles is always
90°, at least experimentally. Why this is true will be explained in Sec. 5.2.
Solution 5.4 No. Suppose L is a line drawn on the floor, oriented E-W. And
suppose Lis a line on the ceiling, oriented N-S. These lines will never intersect –
they ‘miss each other.’ But they certainly aren’t parallel. You can use your two
index fingers to illustrate the same idea – try it. Also, get two sticks of wood.
Hold one of them at some angle, and ask a friend to hold the other stick parallel
to yours, at a distance. He or she will have no difficulty following instructions;
there is a unique direction determined by your stick. But Euclid’s notion of non-
intersecting lines is not relevant here. Two lines in space are parallel if they ‘have
the same direction.’ The exact definition of ‘direction’ in space will be discussed
in Part 2.
Solution 5.6 We have α+β +γ = 180°. Also γ + ∠DCB = 180° (a straight angle).
Therefore α + β + γ = γ + ∠DCB. Subtract γ from both sides, to conclude that α
+ β = ∠DCB.
Solution 5.7 The construction required is a diagonal of the quadrilateral, which
cuts it into two triangles. The sum of all the interior angles of both triangles is the
same as the sum of the interior angles of the quadrilateral [check this]. Therefore
the quadrilateral angles sum up to 180° + 180° = 360°.
Solution 5.8 (a) From a figure, you see that d2 = (6 cm)2 +(6 cm)2 = 72 cm2, so
that (b) Similarly, here d2 = (3 m)2 + (4 m)2 = 25 m2,
so that d = 5 m. This is the famous ‘3-4-5’ triangle, with 32 +42 = 52.
Solution 5.9 Note that the side of the small square is b −a. Equating areas
therefore gives c2 = (b − a)2 +4 × 1/2ab = b2 − 2ab + a2 +2ab = a2 + b2.
Solution 5.11 (a) First, draw side AB = 7 cm, for example. Next, place the
compass point at A and draw an arc of radius 5 cm. Finally, draw an arc of radius
3 cm, centered at B. These arcs intersect at point C; draw sides AC and AB. (This
triangle would be congruent, by reflection, to the triangle having C below AB.)
(b) You can’t complete this triangle because the two short sides aren’t long
enough.
Solution 5.12 (a) Following the given information you obtain the three vertices
A, B, and C. Joining B to C completes the triangle. (b) In this case, you wind up
with lines starting at A and at B. Since these lines are not parallel, they meet at a
point C, which completes the triangle.
Solution 5.13 No. Consider the two triangles shown. If BC is parallel to B′ C′,
then ∠B = ∠B′and ∠C = ∠C′(corresponding angles). But these triangles are not
identical (congruent).
Solution 5.14 The scale factor is 8/5 = 1.6. Therefore b′= 19.2 and c′= 20.8. The
angles are all the same: ∠A′= 22.6°,etc.
Solution 5.15 You should get a = 10.9 cm, b = 10.2 cm (approximately). The
scale factor between the two triangles is k = c′ /c = 662.5, and therefore a′= ka =
72.2m and b′= kb = 67.6m.
Solution 5.17 The converse is: If ABC is any triangle for which a2+b2 = c2, then
∠C = 90°.
Solution 5.18
Solution 5.21 Equation 5.5 says sin θ = y/r. Now r > y because r =
Therefore sin θ< 1, so that sin θ = 2 is impossible.
Solution 5.22 (a) x = 6.96, θ = 21.9°, φ = 68.1° (where φ, called ‘phi,’ is the
complementary angle to θ); (b) y = 4.61, r = 5.85, φ = 38°. (Did you make the
sketches?)
Solution 5.23 The graph is a smooth curve starting at 0, rising to 1, then falling
back to 0 at θ = 180°. This is the first cycle of the famous ‘sine curve,’ (see
Chapter 8). If your calculator has graphing capability, you can even get it to draw
the graph of the sine curve.
Solution 5.25 First, let ΔABC be equilateral. Then a = b,so A = B by the isosceles
triangle theorem. Similarly, a = c, so A = C. Thus ΔABC is equiangular.
Conversely, if ΔABC is equiangular, then A = B,so that a = b. Also A = C, so a =
c. Thus ABC is equilateral. The size of each angle in an equilateral triangle is 60°
because the three angles must sum to 180°.
Solution 5.26 Proof. First assume that D bisects B. Then ΔADC and ΔBDC are
congruent (three sides equal). Hence the angles at D are equal, so both must be
right angles. This proves that CD is perpendicular to AB. For the converse,
assume that CD is perpendicular to AB. Then ΔADC and ΔBDC have two equal
angles and one side in common, so they are congruent. Therefore AD = BD, i.e.
D bisects AB.
Solution 5.27 First make a drawing, and label the center of the circle, as O, say.
Then ΔABO is isosceles, with a = b (both being equal to the radius of the circle).
Therefore, by Problem 5.26 on perpendicular bisectors of isosceles triangles, the
perpendicular OD bisects AB.
Solution 5.28 Either diagram shows that sin 30° = ½,
Solution 5.31 To get the area of the annulus we subtract the area of the inner
circle from the area of the outer circle. Thus A = π(25)2 − π(8)2 = 1, 762 mm2, or
17.62 cm2 (since 1 cm2 = (10 mm)2 = 100 mm2).
Solution 5.32 Since the area of a circle is πr2, the area of a half-circle must be
(½)πr2. Similarly, a quarter-circle has area (1/4)πr2. The general result is that, if q
is a number between 0 and 1, then the area of a ‘q-circle’ equals qπr2. (See Eq.
5.13.)
Solution 5.33 (a) Using Eq. 5.11 we obtain 1 = 360°/2π = 57.3°. (b) Similarly, 1°
= 2π/360 = .0175.
Solution 5.34
(a) s = θ = π/3 = 1.05. The chord has length 1 (equilateral triangle), so the ratio of
arc to chord is 1.05. (b) s = θ = 2π/360 = .01745329 (see Prob. 5.33). For the
chord, we use some trigonometry: chord length = 2y = 2sin½° = .017453070. The
ratio is 1.000013. (The moral is that, for small angles, arc length and chord length
are nearly equal. The reason is that, for a small angle, the arc hardly curves at all.
It’s nearly straight.) The figure is exaggerated for easier viewing.
Solution 5.35 (a) r = s/θ, and θ = s/r; (b) θ = .5(radians) = .5 × 360°/2π = 28.6°.
Solution 5.36 Arc length s = rθ is in cm (because θ is in radians, which has no
units). Area A = 1/2r2θ is in cm2. Note that θ is always assumed to be given in
radians in both these equations. If not, it must first be transformed to radians.
Solution 5.37 The figure shows the whole semicircle, with center O. The line AC
is the top of the lower part of the window, so that AC = 1 m. Thus BC = .5 m, and
OC = .6 m (radius of the circle). From Pythagoras we get OB = .33 m. If α = BOC
we have sin α = BC/OC. Therefore α = .985. [Set your calculator to radians before
calculating α.] The area of the sector subtended by AOC = 2α is therefore A =
(½)r2(2α)= .355 m2. To get the area of the top part of the window, we have to
subtract the area of triangle AOC, 1/2bh = ½ × 1 × .33 = .165 m2. Hence Atop = .19
m2. The area of the window is 1.3+ .19 = 1.49 m2. (Though this solution may
seem a bit complicated, please note that it uses only basic geometry as discussed
in this chapter.)
Solution 5.38 One revolution of the large gear moves 52 teeth past the point of
contact. Hence 52 teeth of the small gear also move past this point. This results in
52 ÷ 20 = 2.6 revolutions of the smaller gear. Therefore the angular velocity of
the small gear is 2.6 × 175 = 455 RPM. To obtain the general formula, let n1,n2
denote the number of teeth on gear 1 and 2 respectively. Also let ω1,ω2 be the
angular velocities of the two gears. Then the number of teeth passing the contact
point per minute equals n1ω1 for gear 1 (because 1 revolution passes n1 teeth, so
ω1 revolutions pass n1ω1 teeth), and n2ω2 for gear 2. For meshed gears, these are
equal so that
Solution 5.41 AOP is a right triangle with sides 12 cm, 9 cm and 7.94 cm. Thus
∠P = Asin (9/12) = 48.6° and ∠O = 41.4°.
The point of intersection is the zero point on each axis. This point is called
the origin of the coordinate system. The two axes have identical number scales,
meaning that the unit distance on each axis is the same. These coordinate axes
are traditionally called the X-axis and the Y-axis, although any other letters
could also be used. Also traditionally, the X-axis points to the right, so that the x-
values increase to the right. Similarly, the Y-axis points upwards. Again, these
conditions may be changed in some applications.
Once the coordinate axes are specified, any point P in the plane is assigned
coordinates (x, y) determined as follows:
A vertical line (i.e. parallel to the Y-axis) meets the X-axis at position (real
number) x. Likewise, a horizontal line (parallel to the X-axis) meets the Y-axis at
position y. In the above figure we have x = 2.3 and y = 1.4, so that the coordinates
of P are (2.3, 1.4).
In this example, both x and y are positive numbers. Generally x and y can be
any real numbers, positive, negative, or zero. The two coordinate axes divide the
whole plane into four regions, called quadrants, ineachof which the signs of x
and y are given (+ or −). These quadrants are referred to as the first quadrant, the
second quadrant, and so on. Just remember that quadrant I has both coordinates
positive, and the other quadrants proceed counterclockwise.
Quadrant Sign of (x, y)
I (+, +)
II (−, +)
III (−, −)
IV (+, −)
Problem 6.1 (a) Which quadrant is each point in: (−3, −6), (2, −9)? (b) What is
special about the coordinates of a point on the X-axis? On the Y-axis? The
origin?
The terms ‘abscissa’ and ‘ordinate’ are sometimes used to refer to the x
(horizontal) and y (vertical) components of the point (x, y), but these terms are
used infrequently today.
It is important to remember that any point lying on the X-axis has y = 0. For
example, the coordinates of the point located at x = 5 on the X-axis are (5,0).
Similarly, points lying on the Y-axis have coordinates (0,y).
The distance formula
Now consider two points Pi having coordinates (xi,yi). Then x2 − x1 is the
distance between the numbers x1 and x2 on the X-axis. (See Chapter 1 if this is
not familiar to you.)
Similarly, y2 − y1 is the distance between y1 and y2. These distances form the
sides of a right triangle, of which P1P2 is the hypotenuse. If d denotes the distance
between P1 and P2, we can use Pythagoras’s theorem to conclude that
Problem 6.2 Find the distance between the points (−3, 1) and (1, −3). Make a
quick sketch and check.
is the equation of a circle with center at (3, −2) and radius 4. (Where does the −2
come from? Remember that y +2 = y − (−2), so y0 in Eq. 6.2 has the value −2.)
Problem 6.3 Identify and sketch the circle whose equation is x2+(y+3)2 = 9.
Check by substitution that the origin is a point on this circle. What is the lowest
point on the circle? The rightmost point?
Consider again the example (x − 3)2 +(y +2)2 = 16. Let us expand and
simplify:
or
Any circle equation, as in Eq. 6.2, can be expanded out in this way. The
general result will be of the form
(Read ‘Completing the square’ in Sec. 4.5 if this isn’t clear.) Thus the given
equation is the equation of the circle with center (4, −½) and radius .
This calculation can be performed for any equation of the form of Eq. 6.3.
However, something may go wrong. Look at the term 69/4 on the right side of
the above example. What if this had turned out to be a negative number? Then
the radius r would not exist, because r2 must be positive.
What does Eq. 6.3 represent geometrically, in this situation? Nothing! That’s
right, nothing. The final equation (x − x0)2 +(y − y0)2 = −q2 cannot be satisfied for
any numbers x, y, which means that the given equation also has no solutions in
real numbers x, y. This is a perfectly reasonable outcome, for any quadratic
equation (recall Sec. 4.5). A simple, but typical example is x2 + y2 = −1; no real
numbers x, y can possibly satisfy this equation.
Can we tell by inspection whether a given equation of the form 6.3 is the
equation of a circle or not? Problem 6.5 goes into that question.
Problem 6.4 Complete the squares, to identify the circles (if any): (a) x2 + y2 +6x
− 2y +4 = 0; (b) x2 + y2 +6x − 2y +12 = 0.
Problem 6.5 Write down the relationship between the coefficients A, B, and C in
Eq. 6.3 that will ensure that this is actually the equation of a circle. Suggestion:
complete squares, as usual. What happens if you get r2 = 0?
The graph of an equation
We have seen that an equation such as (x − 2)2 +(y +1)2 = 9 corresponds to a
certain circle in the XY-plane. This relationship can be expressed in either of the
following ways:
(a) The equation of the circle with center (2, −1) and radius 3 is (x − 2)2 + (y +1)2
= 9, or
(b) The graph of the equation (x − 2)2 +(y +1)2 = 9 is the circle with center (2, −1)
and radius 3.
Definition The graph of a given equation in x, y is the set of all points (x, y) in
the XY-plane, such that x, y satisfy the equation. (‘Satisfy’ means that when you
plug the particular values of x and y into the equation, it becomes a numerical
equality.)
For example, (2,2) is one point on the graph of (x − 2)2 +(y +1)2 = 9, because
(2 − 2)2 +(2+ 1)2 does equal 9. Likewise, is another point on
the graph, as you could check for yourself.
You may be familiar with the process of ‘plotting’ a graph by first making a
table of x, y values and then plotting and joining up these points. Well, this is one
method of sketching the graph, but it is cumbersome and error-prone. Some
equations have graphs that are common geometric objects – lines, circles, and so
on. By recognizing these equations one can identify the graph directly, without
going through the point-by-point plotting process. However, plotting a few
judiciously chosen points whose coordinates are calculated from the equation, is
often useful. Examples occur throughout this and later chapters.
The important thing to keep in mind from now on is that the graph of an
equation is the set of all points whose coordinates x, y satisfy the equation
numerically.
(Δ is the Greek capital letter ‘delta.’ This letter is often used to denote
changes, or differences. Note that Δx, ‘delta x,’ is a single number, and not the
product of Δ and x).
In figure (a) above Δx and Δy are both positive, so the slope m is positive.
Lines that slope up to the right have positive slope. In figure (b), Δy is negative
(because y2 < y1) while Δx is positive, so m is negative. Lines that slope down to
the right have negative slope.
What about a horizontal line? In this case we have Δy = 0, so m = 0.
Horizontal lines have zero slope. Finally, what if the line L is vertical? Now Δx =
0, so that m in Eq. 6.4 is undefined. In summary:
line, L slope, m
slopes upwards m> 0
slopes downwards m< 0
Horizontal m = 0
Vertical m is undefined
(Sometimes one says that a vertical line has ‘infinite slope,’ but this usage is best
avoided, except in a colorful way. However, it is true that a line that is nearly
vertical has a very large slope, either positive or negative.)
Problem 6.6 Consider various lines L, all going through the point (1, 0). Find the
point P2 = (x2,y2) on L, given that x2 = 3 and (a) m = 1; (b) m = 0; (c) m = −2; (d)
m = 8. Sketch these 4 lines in a single coordinate system. Suggestion: show first
that y2 = 2m in this example.
Next, consider a line L with positive slope m. If θ is the angle between the
positive X-axis and line L, we have
This can be seen from the next figure, in which the Δx, Δy triangle is drawn with
Δx being along the X-axis. Recall that tan θ = opposite/adjacent, so that tan θ =
Δy/Δx = m.
It happens that Eq. 6.5 also applies to negative slopes. In this case, θ is an
angle down from the positive X-axis. Such angles θ are considered to be negative,
and tan θ is also negative. Try finding Atan (−1) on your calculator. You will get
−45°. Negative angles are studied in Chapter 8.
Problem 6.7 (a) A line L has slope m = 1. What is the angle that L makes with the
positive X-axis? (b) Same, with m = 10 (use your calculator for part b). (c) Same
with
Problem 6.8 (a) Use a diagram to show that lines of slope m1 = 1 and m2 = −1 are
perpendicular to each other. (b) Find the slope of L1 if θ1 = 30°, and of L2 if θ2 =
−60° (i.e., L2 slopes down at 60°). Check that m1 = −1/m2.
To put the conditions stated in Eq. 6.6 in words, first, parallel lines have
equal slopes, and second, the slopes of mutually perpendicular lines are negative
reciprocals of each other.
The equation of a line
Let P1(x1,y1) be a given point. We want to find the equation of the line that passes
through the point P1 and has a given slope m. The method is similar to that used
to derive the equation of a circle in the preceding section.
Cross-multiply by x − x1 to obtain
This equation holds for any point (x, y) on the specified line. Conversely, if x
and y are two numbers that satisfy Eq. 6.7, then (x, y)is a point on the line
through (x1,y1) with slope m. In other words, Eq. 6.7 is the equation of the given
line. We call this the point-slope form of the equation of a line, to indicate that
this equation applies when one point (x1,y1), and the slope m are given. Note in
particular that the coordinates (x1,y1) of the given point P1 satisfy Eq. 6.7.
For example, the line through (−2, 5) with slope −3 has the equation y − 5 =
−3(x + 2). This might be rewritten as 3x + y + 1 = 0, although the latter equation
no longer visibly shows the conditions that specified the line.
Problem 6.9 Find the equation of the line through (−4, −2) having slope ½. Then
simplify the equation by algebra. Does the origin lie on the line, and why? Sketch
the line.
Problem 6.10 (a) Review the derivation of the equation for a line, in point-slope
form, in your head. (b) What is the slope of the line passing through (1,1) and
(3,5)? Find its equation.
Problem 6.11 Find the equation of the tangent line to the circle x2 + y2 = 25, at
the point (4,3). Suggestion: first make a sketch.
The point-slope form of the equation of a line, Eq. 6.7, is useful for writing
down the equation of a line given one point and the slope. We next discuss
several other useful forms for writing the equation of a line.
First consider the problem of finding the equation of the line through two
given points (x1,y1) and (x2,y2). This line has slope m given by
This is sometimes called the two-point form for the equation of a straight line.
It’s simpler to remember that this is just a special case of the point-slope form,
Eq. 6.7.
Consider the special case of Eq. 6.7 in which (x1,y1) is the point (0,b). We get
y − b = mx, or
Note that (0,b) is a point on the Y-axis, with y-value b. We say that the line in
Eq. 6.8 has intercept b on the Y-axis, or that b is the Y-intercept of this line.
The slope-intercept form is especially useful for quickly sketching (or
otherwise recognizing) a line whose equation is given to begin with. For example,
consider the equation 2x − y = 6. By algebra, this can be written as
Problem 6.13 Find the equation of the line parallel to x − 4y = 2, and having Y-
intercept 3.
Question: What can be said about the line given by Eq. 6.9 in the event that A
= 0? or B = 0? Answer: If A = 0 the line is horizontal. If B = 0 it is vertical. (If both
A and B are zero, Eq. 6.9 is not the equation of a line. To be absolutely correct,
the proviso that A, B are not both zero should be included in Eq. 6.9.)
Problem 6.14 Find the slope m and Y-intercept b, if possible: (a) 3x − 2y +1 = 0;
(b) 3y +1 = 0; (c) 2x − 7= 0.
Here, a is the X-intercept and b the Y-intercept. To understand why, first let y =
0 in Eq. 6.10; then x/a = 1, or x = a. This says that point (a, 0) lies on this line. In
other words, the line cuts the X-axis (y = 0) at x = a. Similarly, (0,b) lies on the
line 6.10, so the line cuts the Y-axis at y = b. The figure shows an example with a,
b both positive.
Thus a = 4, b = −8/3 are the intercepts. (Notice that you have to get 1 on the
right side, for this to work. The equation has to be exactly as given in Eq. 6.10.)
Problem 6.16 A square has corners at (1, 0), (0, 1), (−1, 0), and (0, −1). Sketch
this square and find the equations of its four sides. What is the area of the
square?
Conditional equation: an equation that is true only for certain specific values
of the symbols.
Problem 6.17 For Eqs. 6.2, 6.4, 6.7, and 6.10 of this chapter, specify (a) the type
of equation, and (b) the variables and parameters.
In this case, adding the equations doesn’t help. Instead, we first multiply the
first equation through by 2:
Now we can eliminate x by subtracting the second equation from the first:
Any system of two linear equations in two variables can be solved in this way.
Operations that are used to carry out the solution are:
Next we multiply the top equation here by −5 and add it to the second equation
(this eliminates x):
Therefore y = −2.
Returning to the equation (*), we now have
Two simultaneous linear equations always have a unique solution, unless the
lines corresponding to these equations are parallel. What happens to the algebra
of the solution, in this case? Here is an example
This false equation means that the original system of simultaneous equations
cannot be solved for x and y. Look back at these equations and observe that the
lines corresponding to these equations both have slope 1/4. In other words, these
lines are parallel, and never intersect. In this case, the given equations are said to
be inconsistent – they do not have any solution.
Analytic versus Euclidean geometry
Analytic plane geometry is much more concrete than Euclidean geometry. Here
are some comparisons:
Concept
Euclidean
Analytic
point undefined an ordered pair (x, y)of real numbers
straight line undefined the solution set of a linear equation Ax + By
+ C = 0
point on a line undefined (x,y) satisfies the given linear equation
distance between two undefined distance formula, Eq. 6.1
points
two points determine a axiom Two-point formula, Eq. 6.7
line
parallel lines exist axiom lines having the same slope m are parallel
two non-parallel lines axiom the meeting point can be calculated by
meet in a single point Gaussian elimination, which fails if the lines
are parallel
We have not discussed things like triangles here, but of course the theorems
of Euclid remain valid in analytic geometry. Next we study circles in analytic
geometry.
6.3 Circles
Recall that the equation
Two circles in the plane may intersect at two points, or at a single point (if the
circles are tangent), or at no points. Given the equations of the two circles, we
can determine whether they intersect by comparing their radii with the distance
between the centers. Consider the example
The distance between the centers is from Eq. 6.1, so the center of the second
circle lies inside the first circle, which has radius 5. The second circle has radius
3. Since is bigger than 5, the circles do intersect. (Make a sketch to check
this.)
How can we find the coordinates of the points of intersection? This takes a
bit of ingenuity. Here are the given equations:
Second, the example illustrates how one might go about solving any two
simultaneous nonlinear equations. The key is to eliminate one of the variables
(by algebra), and then solve the remaining equation for the other variable.
Finally, use substitution to calculate the value(s) of the first variable. (In practice,
this may be difficult, or even impossible.)
Problem 6.20 Find the points of intersection of the line y = 2x with the circle x2
+ y2 = 16. Make a sketch and comment on the symmetry of the problem.
Problem 6.21 Find the equation of the line passing through (5,0) and tangent to
the circle x2 + y2 = 9. Suggestion: see Sec. 5.6 on tangents to circles.
How to solve math problems
How should you set about solving a math problem? There are two kinds of
problems, as far as you are personally concerned. Let’s call them (1) routine
problems, and (2) confusing problems. Routine problems are those that you
quickly see how to solve – you only have to perform routine steps to complete
the solution. An example would be solving simultaneous linear equations, or
finding the unknown sides and angles of a right triangle. Learning math is partly
learning to solve routine problems, but only partly. The real crux comes when
you face confusing problems. Then what do you do?
My advice is ‘play with it.’ Try various ideas, make sketches, write down
vague thoughts. Try to come up with a strategy that could work. In other words,
enter into a mental search mode. Be creative. False starts are better than no starts.
Let’s look at a stream-of-consciousness approach to solving Problem 6.21,
finding the equation of the tangent line to a circle. (1) Make a sketch. (2)
Remember, the tangent is perpendicular to the radius line through the point of
contact. (3) So, it’s a right triangle. What are the sides? Oh, 3 and 5. So the other
side is (oh, yes, the 3-4-5 triangle). (4) Now what? What about
angles? Can I find θ in the sketch? Then what? m = tan θ is the slope of the radius
line. Great – the tangent line is perpendicular to this line. It’ll work! (From here
on, the problem is routine.)
This description is much too neat – no false start, no hesitation, no feelings of
despair. But you get the idea. Your brain is searching through memory for
relevant information, trying to understand the problem. Assuming that you have
thoroughly understood the math you’ve learned up till now, the search should
eventually be successful.
Solving confusing problems in nature is what the human brain has evolved to
do. Math and science, and creativity in general, take advantage of this natural
ability. This is how we differ from computers.
Most of the problems in this book are fairly routine. But every now and then
there’s a confusing one. Don’t expect to obtain the solution to a confusing
problem instantly. You have to work on it. You’ll learn a lot from taking the
effort. Why don’t you try the next problem?
Problem 6.22 What is the radius of the circle inscribed in an equilateral triangle
of side s?
6.4 Transformations
As pointed out at the beginning of this chapter, analytic geometry melds
geometry and algebra. Here we see how this works for transformations such as
translation, rotation, reflection, and scaling. We only look at a few examples to
illustrate the ideas but not to give an exhaustive treatment.
Translation
A translation in the XY-plane shifts every point (x, y) a certain distance to the
right (or left) and a certain distance up (or down).
Let h = horizontal shift and k = vertical shift. Let (x, y) be an arbitrary point
and let (x′, y′) be the translated (shifted) point. Then
For example, if h = 3 and k = −1, each point (x, y) is transformed to the point (x
+3,y − 1), i.e. 3 units to the right and one unit down.
A given geometric object, such as a line, triangle, or circle, is also shifted by h,
k by the translation in Eq. 6.11. The translation leaves the shape of the object
unchanged – the translated version is congruent to the original.
This is probably quite obvious to you, but let’s just check that translation
preserves the distance between two points, for example. Let Pi = (xi,yi) be two
given points with translated positions If d′ denotes the distance
between we have
(because x2 + h − (x1 + h)= x2 − x1,etc. Thus d′ = d: distance is preserved by a
translation. The algebra proves that our intuition is correct.
(As a quick check, the point (0,4) is on L, and gets translated to (1,7), which
satisfies the equation for L′.)
Problem 6.23 (a) Find the equation of the new line L′, obtained by translating
the line L: x − y = 5 by (h, k)= (0, −1); (b) Same for L : y = 5 and (h, k)= (2, 2) [do
part (b) in your head].
The technique used in the above example (and Problem) can be described as
follows. We wish to determine the equation, in x, y-coordinates, of a certain
curve (or line) C. To start with, we let (x, y) denote some point on C. Next we use
the given characterization of C to deduce an equation involving x and y. This is
the equation of C.
To mention a familiar example, suppose C is the circle of radius 5, centered
at the point (3,1). If (x, y) is a point on C, then the distance from (x, y) to (3,1)
equals 5:
This is the equation of the given circle C. We usually write it in the form
Problem 6.23 shows that lines are transformed into new lines, having the
same slope as before, by a translation. This implies in turn that angles are
preserved by a translation, because the slopes of the sides of a given angle remain
the same.
For example, a translation of a triangle would give a new triangle having the
same sides and angles as the original triangle. Translation is an example of a rigid
motion.
Rotation
Just for the record, here are the equations for a rotation about the origin:
For example, the line y = mx + b gets transformed into the line y = −mx + b.
(Use the same argument as before to check this.)
It would be straightforward to show that distances and angles are preserved
by a reflection, which is therefore another instance of rigid motion.
Uniform scaling
The equations of a uniform scale change, with scale factor k (with k > 0), are
Problem 6.24 Show that uniform scaling with scale factor k multiplies distances
by k, but does not change the slope of a line.
Uniform scaling is not a rigid motion, but it does preserve angles, and hence
shapes. Recall from Ch. 5 that uniform scaling is the basis for the concept of
similarity.
What about nonuniform scaling? you might ask. An example would be
where h and k are different. This transformation preserves neither distances nor
angles. But we humans do recognize a certain similarity between an image and a
‘squashed’ version of it. An ellipse is a squashed circle. Aunt Harriet’s face seen
from an angle is a squashed version of the head-on view, but you recognize her
immediately. The brain instantly applies all kinds of transformations to visual
images, because recognition (or its lack) has important consequences But if
you’re like me
To make a quick sketch of this curve, first find the intercepts by inspection.
Namely, for the X-intercept, put y = 0 in the equation, giving x2/a2 = 1 or x = ±a.
Similarly the Y-intercepts are at y = ±b.
Suppose that a> b. Then the line from −a to +a on the X-axis is called the
major axis of the ellipse, and the line from −b to +b on the Y-axis is the minor
axis. Of course if a = b, the ellipse becomes a circle.
Next, consider an ellipse centered at the point (x0,y0), with major and minor
axes as before. Its equation is
You can think of this as the translation by x0,y0 of an ellipse centered at the
origin.
Problem 6.25 Identify the curve whose equation is 4x2 +9y2 − 24x +18y + 9 = 0.
Suggestion: complete the squares.
is the equation of an ellipse, provided that A and C are both positive, and
provided that after rewriting in the usual form by completing the square and
dividing through by the appropriate constant, the number +1 appears on the
right side of the equation. (The other possibility is that −1 winds up on the right
side. In this case the equation isn’t the equation of anything. Why not?)
Hyperbola
Let us make one small change to the ellipse equation (6.13).
This is the equation of a hyperbola. Drawing the graph requires a little ingenuity.
First, for the X-intercepts, set y = 0. This gives x = ±a. For the Y-intercepts set
x = 0, getting y2 = −b2. But this is not possible – no value of y satisfies this
equation. Thus there are no Y-intercepts. The curve never crosses the Y-axis.
Next, we observe that the graph of Eq. 6.14 is symmetric by horizontal
reflection in the Y-axis. If (x, y) is a point on the graph, then (−x, y)is also on the
graph, because the only appearance of x in Eq. 6.14 is in the term x2.
Thus reflection in the Y-axis transforms the graph into itself, so the graph is
symmetric. The same argument shows that the graph is also symmetric by
reflection in the X-axis. (By the way, these symmetric properties also apply to the
ellipse of Eq. 6.13 – see that graph.)
To proceed, we next solve Eq. 6.14 for y in terms of x:
This gives
From this we see that we need x2 > a2 to get a value for y. There are no points on
the graph for x lying between −a and +a. Check the figure for this feature.
Finally, what happens as x becomes large? The figure shows that the
hyperbola gets closer and closer to one of two straight lines. The lines have
equation y = ±bx/a. These lines are called the asymptotes of the hyperbola. How
do we explain this fact? Intuitively, if x is large, then x2 − a2 ≈ x2 (the term a2 is
relatively small). The symbol ≈ is read as ‘is approximately equal to.’ Therefore
[More details on this point are given below.]
A quick way to obtain the equations of the asymptotes to the hyperbola x2/a2
− y2/b2 = 1 is to use the equation x2/a2 − y2/b2 = 0, which has the solutions y =
±bx/a, and these are the asymptotes. Why does this work? Roughly speaking, if x
and y are large, then the constant term 1 is relatively negligible, so x2/a2 − y2/b2 ≈
0.
To summarize, the curve given by Eq. 6.14 is a hyperbola. It has the following
properties:
1. The hyperbola consists of two separate branches, which open up to the right
and left, along the X-axis.
2. The X-intercepts are at x = ±a. The branches lie to the left and right of the
intercepts.
3. The hyperbola is symmetric by reflection in both the X and Y-axes.
4. As x becomes large (positive or negative), the hyperbola approaches one of
the two asymptotes
Problem 6.26 Make careful sketches of the following hyperbolas: (a) (x2/4) − y2 =
1; (b) y2 − x2 = 1. Suggestion for (b): review the above discussion as it would
apply to this slightly different case.
Here is the algebra needed to show conclusively that the hyperbola given by
Eq. 6.15 (with the + sign) does approach the line as x becomes large. For a given
value of x, the difference in the y-values on the line and the curve is
The last line results from multiplying numerator and denominator by
and simplifying. (You should check this.) Now the final
expression approaches zero as x becomes larger and larger. This is what we
wanted to prove.
This is a parabola. To get an idea of what a parabola looks like, do the next
problem.
Problem 6.28 Sketch the graph of y = x2, for −2 ≤ x ≤ 2. First make a table of
values of x and y, for x = 0, 0.5, 1.0, 1.5, and 2.0. Plot the points (x, y) and join
with a curve. Finally, use symmetry to complete the graph.
Problem 6.29 If you own a graphics calculator, try it on the preceding parabola,
and also on (part of the) hyperbola
The foci of an ellipse
We have described an ellipse as a circle that has been stretched in one direction.
Most textbooks define an ellipse as the set of all points in the plane, the sum of
whose distances from two given points is a constant. The given points are called
the foci (singular, focus) of the ellipse. Some rather finicky algebra is required to
establish this fact.
Let the foci be located at x = ±c, on the X-axis, as shown. Then the sum of the
distances from (x, y)to these points (−c, 0) and (c, 0) equals
Suppose the X-intercept of the ellipse is at x = a. Then the sum of the distances of
(a, 0) from the foci equals (a − c)+(a + c)= 2a. Therefore the constant equals 2a,
and our equation is
Taking the square of both sides, and at the same time expanding (x + c)2 etc.
gives
The ratio c/a is called the eccentricity of the ellipse, sometimes denoted by e.
A circle has zero eccentricity; a greatly elongated ellipse has eccentricity nearly
equal to 1. (See the next problem.) We always have e< 1, for any ellipse.
Problem 6.30 (a) A circle is an ellipse in which a = b. Where are the foci of a
circle? What is the eccentricity? (b) Roughly sketch the ellipse (x2/100) + y2 = 1.
Where are the foci? What is the eccentricity? (c) If e denotes eccentricity, express
the ratio of minor to major axis (b/a)in terms of e.
Problem 6.31 The eccentricity of the earth’s orbit is e = .017. Find the ratio of the
largest and smallest distances from the sun to the earth, over a year’s cycle. (The
seasonal change of solar distance affects the earth’s weather, especially the
severity of winters in the Northern hemisphere.)
The foci of a hyperbola
A hyperbola can be defined as the set of points P in the plane such that the
difference in the distances from P to two given points (the foci) is constant. If the
foci are at x = ±c, and the X-intercepts are x = ±a where a < c,this definition
implies that
This equation can be simplified exactly as in the case of the ellipse. In fact, the
calculations are virtually the same; the final equation is exactly the same (because
the minus sign disappears in the second squaring):
However, we now have a < c, so this equation is better written in the form
Problem 6.32 Find the foci, and the eccentricity, for the hyperbolas (a) x2/4 − y2
= 1; (b) x2 − y2/4 = 1. Sketch the hyperbolas.
Where does the parabola fit into this scheme of foci and eccentricity? It
happens that e = 1 for any parabola, so that parabolas are somehow intermediate
between ellipses and hyperbolas.
The parabola can be defined as the set of points P in the plane that are
equidistant from a given point (the focus), and a given line, called the directrix.
To obtain the equation corresponding to this definition, we place the focus at
(0,c) on the Y-axis, and the directrix at y = −c. Then we have
(use a sketch to check this) so that
This reduces to
and this agrees with Equation 6.16 for a parabola, with a = 1/(4c).
Parabolic reflectors are used for both receiving and transmitting rays of many
kinds, including light, radar, TV and radio signals, and also sound. When such
rays are reflected by a surface, the angles of incidence and of reflection are equal.
In the parabolic reflector rays parallel to the axis are all reflected to the focus.
Example Most comets return periodically (like Halley’s comet), and have elliptic
orbits around the sun. The higher the eccentricity of the ellipse, the longer the
period between sightings of a comet. The eccentricity of some comet orbits,
however, is close to 1.0, and some are even greater than 1.0. These comets have
parabolic or hyperbolic orbits, and do not recur. Once they leave the solar
system, they never reappear. (The orbits of comets are actually not perfect
ellipses etc., because of the gravitational influences of the planets, which perturb
the elliptic orbit about the sun.)
Problem 6.33 Find the equation of a curve defined as the set of points in the
plane such that the ratio of the distance from a given line to the distance from a
given point is a constant, say q. Identify the curve, depending on q. Suggestion:
use the same setup as for the parabola.
7. Find the equation of the perpendicular bisector of the line P1P2,where P1 = (1,
4) and P2 = (3, 2).
12. Find all points of intersection of the hyperbola x2 − y2 = 2 with the circle x2 +
y2 = 4. Use a sketch to check the answer.
Solution 6.1 (a) III, IV. (b) A point on the X-axis has y = 0. Thus the point has
coordinates (x, 0). Similarly, a point on the Y-axis has coordinates (0,y). The
coordinates of the origin are (0, 0).
Solution 6.2 This is the hypotenuse of a 45o right triangle with legs of
length 4, determined by the given points.
Solution 6.3 The circle has center (0, −3) and radius 3. The origin (0, 0) lies on
the circle, as we see either from the graph or by substitution: 02 + (0 + 3)2 = 32 = 9
as in the equation. From the graph we see that the lowest point on the circle is (0,
−6), and these coordinates also satisfy the equation: 02 +(−6+3)2 = (−3)2 = 9.
Similarly, the rightmost point is (3, −3), and this also satisfies the equation, as
you can check.
Solution 6.4 (a) Circle with center (−3, 1) and radius (b) Nothing.
Solution 6.5 Completing the squares in Eq. 6.3 leads to (x − A/2)2 +(y − B/2)2 =
(A2 + B2)/4 − C The condition needed to get a bona fide circle is therefore A2 + B2
> 4C. In the case of equality, A2 + B2 = 4C, we get r = 0, i.e. a ‘circle’ of radius
zero. Such a circle is in fact a single point, at the center (A/2,B/2). To summarize,
Eq. 6.3 represents:
Solution 6.8 (a) As shown in Problem 6.7, a line with slope m1 = 1 makes an
angle of 45° with theX-axis. In thesameway, a linewith slope m2 = −1 makes an
angle of −45° with the X-axis. These lines therefore make an angle of 90° with
each other. In other words, they are perpendicular.
(b) By looking at the Δx, Δy triangles for lines L1 and L2, we see that
Therefore
Solution 6.13 The given line has slope 1/4, so the equation of the new line is y =
(1/4)x +3, or x − 4y +12 = 0.
Solution 6.14 (a) m = 3/2, b = ½; (b) m = 0, b = −1/3; (c) m is undefined; the line
is vertical at x = 7/2.
Solution 6.15
intercepts
Solution 6.17
Solution 6.18 x = −2, y = 1. One way to get this is to multiply the first equation
by 2 and add it to the second equation, thereby eliminating y.
Solution 6.19 u = 2, v = −3.
Solution 6.20 Substituting y = 2x into x2 + y2 = 16 gives x2 +4x2 = 16, so
The corresponding y-values are y = 2x,or The line
and circle are both symmetric by reflection in the given line itself, and also in the
perpendicular line y = −x/2.
Solution 6.21 The triangle shown is a 3-4-5 right triangle. The slope of the radius
line is therefore tan θ = 4/3 (opposite/adjacent, remember). Therefore the slope
of the tangent line is −3/4. From the point-slope form, the equation of the
tangent line is y = (−3/4)(x − 5). This can be written more neatly as 3x + 4y = 15.
(Congratulations if you figured this out – there are various ways of doing it. If
you didn’t succeed, how about reviewing all the math principles used in the
solution given here? Each sentence involves one or more principles.) By the way,
the other tangent line has equation 3x − 4y = 15.
Solution 6.22 First, make a sketch. Next, recall from Chapter 5 that the center of
the inscribed circle lies at the intersection of the angle bisectors. Therefore AOD
is a 30 − 60° triangle. Thus Since
AD = s/2, we see that and this is the radius of the inscribed
circle.
This is an ellipse centered at (3, −1) and with semi-axes of length 3 and 2.
Solution 6.26 (a) X-intercepts at x = ±2; asymptotes y = ±(½)x have slope ±½;
(b) Y-intercepts at y = ±1; this hyperbola opens vertically; asymptotes y = ±x.
Plotting these points and joining them up gives the graph for x > 0.(Note that
one does not join the points with little line segments. The parabola is a smooth
curve, not a jagged one.)
The parabola does not have asymptotes – it continues to curve more and more
steeply upwards.
Solution 6.30 (a) By Eq. 6.17, c = 0. Thus the foci of a circle are both at the
center. (The sum of the distances from the foci in this case equals 2r where r is
the radius of the circle.) The eccentricity c/a is zero. (b) Here
Thus the eccentricity is c/a = 9.95/10 = .995 – a highly eccentric ellipse. (c) From
Eq. 6.17, c2 = a2−b2. Therefore c2/a2 = 1−b2/a2, from which we have
Thus b/a = 1 if e = 0 (circle), and b/a is nearly
0 if e is close to 1 (elongated ellipse).
Solution 6.31 From a graph of the ellipse with the sun at a focus, we see that the
ratio of longest to shortest distances equals
7.1 Sets
Modern mathematics makes extensive use of sets. In basic math, for example,
various geometric objects, such as lines and circles, are defined as the set of all
points satisfying a certain condition. What exactly is a set? A set is any collection
of objects. Here we will talk about mathematical objects, such as numbers,
points, and so on, and not about physical objects, such as chairs, or dogs.
Notation
Let A,B,… designate sets of some kind. Then
1. x ∈ A means that x is a member of A, or an element of A. We often read x
∈ A as just “x is in A,” or “x belongs to A.” For example, if A is the set of all
even integers, then 2 ∈ A, but 3 ∉ A (“3 does not belong to A”).
2. A ⊂ B means that set A is contained in set B, or A is a subset of B.
3. A ∪ B (“A union B”), the union of A and B, is the set of objects that belong
to A or B, or both.
4. A ∩ B (“A intersect B”), the intersection of A and B, is the set of objects that
belong to both A and B.
Problem 7.1 Decide which of the following statements are true. A diagram may
be helpful. (a) A ⊂ B and B ⊂ C implies A ⊂ C; (b) A ⊂ (A ∪ B); (c) A ⊂ (A ∩
B); (d) (A ∩ B) ⊂ A; (e) A ⊂ B and x ∈ A implies x ∈ B; (f) A ∪ (B ∪ C)= (A ∪
B) ∪ C.
We also have
5. {x : Px} (“The set of all x such that Px.”) Here Px represents some statement
about x, and {x : Px} is the set of all x for which Px is true.
Problem 7.2 Identify the sets A = {(x, y): x2 + y2 = 16} and B = {x, y): y = x +2}.
(Interpret A and B as sets of points in the XY-plane.)
Note that, to make sense of the expression {x : Px}, the allowable choices for x
must be specified, at least tacitly. For example, in Problem 7.2, the notation
suggests that (x, y) is a point in the XY-plane. If there could be any doubt, the
possible choices for x should be stated explicitly. Thus {x : x is an integer and x ≥
5} is unambiguous, but {x : x ≥ 5} might be interpreted as the set of all real
numbers greater than or equal to 5. What is intended might be clear from the
context, but if not, the meaning should be stated explicitly.
Problem 7.3 The following are true statements. Read them until you understand
them. Then write them in plain English. (⇔ means if and only if.)
(a) For any x, x ∈ A ∪ B ⇔ x∈ A or x ∈ B.
(b) For any
(c) {x : x is a real number and x2 = 1} = {−1, 1}. (The notation {−1, 1} means the
set consisting of two numbers −1 and 1.)
Intervals
A closed interval [a, b] is a subset of the real line given by
Here a and b are two given real numbers, with a < b. The expression a ≤ x ≤ b
means that a ≤ x and x ≤ b; in other words x is between a and b, inclusively.
Problem 7.4 Under what circumstances is [a, b] ∪ [c, d] itself a closed interval?
Suggestion: make a sketch.
Problem 7.5 Same question as in Problem 7.4, but for [a, b] ∩ [c, d].
If you didn’t get either or both of these problems, try them again carefully.
Use sketches.
We use a special symbol ∅ for:
Thus ∅ is the set that has no members. This seems a bit strange at first, but it
is quite useful. For example, case 1 of Problem 7.5 can be expressed as
In this case, the two intervals have no point in common, which means that
their intersection is empty. Exercise: A ∪ ∅ = ? A ∩ ∅ = ? Answer: A, ∅,
respectively. Review the definition of ∪ and ∩ if this is not clear.
Half-infinite intervals are:
and similarly for (a, ∞)and(−∞,a). Note that ∞ and −∞ are not numbers, but
merely indicators that the interval extends indefinitely to right or left. Also
Problem 7.6 Make a list of all new terms introduced in this section, including
their definitions.
7.2 Functions
One of the most important concepts in mathematics is that of a function. The
function concept is so general that at first it may be a little difficult to understand
its significance. We will therefore begin with fairly simple examples. Here is the
basic definition.
This is the way that functions are often encountered, being defined by a single
formula. (It is certainly not the only way, however!) In this example, what would
f(5) mean? Answer: f(5) = 3 × 52 = 75. One just substitutes x = 5 in the defining
formula. To check that you understand this, find f(0), f(−1), f(1/3), and f(u + v).
Answers: 0, 3, 1/3, 3(u + v)2. In all cases you replace x in the formula for f(x) by
the value in between the brackets of f. Note that, as stated in the definition of
function, in this example, f transforms any given object x in A (i.e., any real
number x) into a new object, f(x)= 3x2. The new object is also a real number.
Question: what would f(w) mean, here? Answer 3w2.
Example 2. Let N be the set of positive integers. For any n ∈ N let p(n) be the
number of different prime factors of n (not counting 1 as a prime factor). For
example, p(5) = 1, p(12) = 2, and p(140) = 3 (because 140 = 22 × 5 × 7). This
example shows that a function does not have to be defined by a formula.
Example 3. Let Δ be the set of all triangles. For any t ∈ Δ let A(t) be the area of t.
Is A really a function? Yes: every triangle t has an area. This example illustrates
the generality of the function concept.
Problem 7.7 For the functions given in Examples 1-3, find (a) f(−2); (b) p(64);
(c) A(t1)where t1 is the triangle with vertices (0,0), (0,1), and (2,3).
Problem 7.8 By trying them out, determine the domains of the following
calculator functions: sin, Asin
A constant function
Let f(x) = 2 for all x ∈ (−∞, ∞). Is f a function? Well, does the equation f(x) = 2
tell us how to determine the value of f(x) for every x? Yes it does. Therefore f is
indeed a function. It is called a “constant function.” Of course, a constant
function is not of much use (but they do come up, for example in Calculus).
Nevertheless, a constant function is a bona fide function.
Computer functions
(Skip this subsection if you’re not interested in computers.)
Every computer programming language not only contains built-in functions,
but also allows the programmer to define other functions. Here is an example
using the language C.
Let’s try to figure out what this function is, mathematically. The name of the
function is F. The domain of F is defined as “double,” which specifies double-
precision real numbers (about 17-digit precision). The other “double” on the first
line specifies that the values of F (x) will also be obtained in double precision.
Finally, the body of the function, contained in curly brackets, specifies the value
to “return,” which is 1/(1 +x * x), or 1/(1 +x2). Thus F (x)= 1/(1 + x2) in this case.
This example shows that the term “function” is understood in computers in
the same way as in math. A domain is specified, and a procedure is given to
calculate the function values. (Of course, functions programmed in C can be
much more complicated than this example.)
X y
1 5
2 7
3 3
4 2
5 7
X y
3 4
5 0
1 5
3 2
5 7
Problem 7.9 (a) Consider the set G = {(−1, 2), (1, 3), (4, 2), (7, 1)}. Is G a
function? If so, find G(1). If not, remove pairs that result in ambiguity. (b) Same
for h = {(5, 0), (2, 1), (1, 0), (2, 5)}.
I hope you have not found this discussion overly confusing. It was necessary
to go into these matters, because different traditions of teaching have
emphasized seemingly different definitions of the term “function.” But as we
have seen, these differences are only apparent.
One point remains to be clarified, perhaps. How can a function given by a
certain formula, such as f(x) = 3x2, be considered a set of ordered pairs? Here, x
can be any real number. The set of ordered pairs in this case becomes an infinite
set, consisting of all the pairs (x, y) with y = 3x2, and x any real number. In set-
theoretic notation we have
In practice, we never actually think of the function f(x) = 3x2 in this roundabout
way, of course. (The “look-up table” for this f would have infinitely many entries
—a practical impossibility, but not a conceptual one.)
The look-up procedure works only for finite sets, such as G above. However,
most functions used in math and science involve infinite sets of ordered pairs.
It’s not possible to “look something up” in an infinite set, which is why the
definition given at the beginning of this section insists on there being a
procedure for calculating the values f(x).
We will not use the ordered pair characterization of functions in this book,
but you may encounter it elsewhere.
Other notations
Consider again the function f(x)= 3x2. Another way of writing this function is
sometimes used, namely
What is the geometric interpretation of M? What is its domain? How else could
the definition of M be expressed? , what is the “natural
domain” of f ? (Invent an interpretation of “natural domain.”)
Since a line is a set of points satisfying a certain equation, this could be written
explicitly as
(recall the point-slope form for the equation of a line, Eq. 6.7.) What about the
natural domain of L? The fact that x2 −x1 occurs in the denominator of the
equation of the line, alerts us to the fact that m is undefined if x1 = x2. Actually,
there are two cases here. First, if the given points coincide, so that (x1,y1)= (x2,y2),
then there is no uniquely determined line through these points; L is undefined in
this case. Second, if x1 = x2 but y1 ≠ y2, the two points lie on the vertical line
whose equation is x = x1. Therefore
Thus L is a bona fide function, with domain A = all pairs of distinct points (x1,y1),
(x2,y2), and with values equal to the straight lines whose equations are given
above (two cases).
This example again illustrates the broad scope of the function concept. Any
kind of mathematical objects can constitute the domain of the function, and any
kind of objects can occur as values. The only strict stipulation is that the value of
the function must be unambiguously specified, for each object in the domain.
Problem 7.11 Let C(x0,y0,r) be the circle of radius r, centered at (x0,y0). Express
C(x0,y0,r)in {···} form.
Additional terminology
By definition, to specify a certain function f , we must specify (i) a set D of
mathematical objects (D is called the domain of the function), and (ii) a
procedure for obtaining the values f(x) for any x ∈ D. In practice, the domain D
may not be explicitly stated, in which case some natural domain is taken for
granted.
The set Rf of all possible values f(x) of the function f is called the range of
values (usually just range, for short) of f :
For example, if f(x)= 3x2,with D = (−∞, ∞), the set of all real numbers, then Rf =
[0, ∞), the set of all non-negative real numbers. Why? Because, first, 3x2 ≥ 0 for
any x. Second, any number y ≥ 0 can be obtained as the value f(x)for some x ∈
D. Namely, has f(x)= 3x2 = 3y/3= y. This proves that Rf = [0, ∞).
Problem 7.12 Determine the range of f(x) = 2x +1, (a) with domain (−∞, +∞);
(b) with domain [0,1].
Problem 7.13 Determine the domain and range of the function of Problem 7.9,
G = {(−1, 2), (1, 3), (4, 2), (7, 1)}.
This is a function of two variables, x and y. The natural domain D of f is the set of
all ordered pairs (x, y) of real numbers. The usual method of substitution applies;
for example
(Here the dot “·” is used for multiplication.) What would f(y, x)be, in this
example? Answer: f(y, x) = y2 +4x2 +6yx – you substitute y for x and x for y.
Countlessly many practical examples could be cited. In meteorology, for
example, (x, y) could represent the latitude and longitude of a position on the
earth’s surface. Then T (x, y) could refer to the temperature at this position, and
P (x, y) the barometric pressure there. These functions are published in the daily
media, usually in graphic form. Since temperature and pressure are time
dependent, we should probably write T = T (x, y, t), etc., where t is the time. The
weather report on TV may show a dynamic prediction of T (x, y, t) for your state,
for the next 24 or 48 hours. Functions and graphs are everywhere!
Given a function of several variables, such as T (x, y, t), the input variables x,
y, t are sometimes called the arguments of the function T. This terminology is
used in computer programming for example.
This definition means that the graph of the function f is exactly the same
thing as the graph of the equation y = f(x) (x ∈ A). Recall Section 6.1 on graphs
of equations.
We know that y = 3x2 is the equation of a parabola with vertex at the origin, and
opening upwards. To sketch the graph accurately, we need only plot a few points
(x, y), to help pin down the parabola. Thus (1,3) and (−1, 3) are points on the
graph, because f(1) = 3 and f(−1) = 3. Likewise (0,0) is on the graph.
What should we do if we wished to depict the graph of f(x)= 3x2 for a wider
range of values, say −10 ≤ x ≤ 10? Try it for yourself, first.
Because f(10) = 300, we need to show y-values from 0 to 300. If we use the
same scales on the X-and Y-axes, we get an unrevealing graph, as shown above.
However in depicting the graph of a function f(x), there is no need to use
identical X-and Y-scales. Rather, one should choose scales that are appropriate
for the situation. After all, graphing a function is not the same as doing analytic
geometry, where the scales have to be the same. The second graph above shows
f(x)= 3x2 with appropriate scales.
Graphs of functions are used frequently in science, and then the x-and y-
values typically have physical meaning.
The next graph comes from my own research in modeling the foraging
(feeding) behavior of fish. The graph shows the average feeding rate f, as a
function of visual range r in a lake. The model takes account of various practical
components of foraging, including swim speed v, handling time h, and prey
density d. The function graphed is
where a and b are constants involving the parameters v, h, d. The shape and
position of this curve is useful for understanding the behavior of fish and their
prey species at different times of the day. (Note that this graph uses a
“logarithmic scale” on the horizontal axis. Such graphs are often used in
scientific publications.)
Whether one is considering a given function in math, or in some scientific
situation, displaying a graph of the function is an efficient way of conveying
information about it. What portion of the graph to display depends on the
context. In math, we are usually concerned with displaying the salient features of
the function, and graphic details should be chosen accordingly.
Example 2. g(x)= 2x +1
The graph of this function is the straight line y = 2x +1. Not much more need be
said here.
Let us analyze this carefully. Many students learn only to “make a table of values,
then plot the points and join them up to get the graph.” This unintelligent
approach to graphing is almost worthless in later courses.
What can we say about the graph of y = f(x)= 1/(1+x2)? First, as x gets large, y
becomes small. For example, f(10) = 1/101 ≈ .01. The maximum y-value is f(0) =
1. Also, we see that a negative value of x gives the same y-value as does the
corresponding positive value of x. With these ideas in mind, let’s now make a
table.
x y = f(x)
0 1
±1 .5
±2 .2
±3 .1
±4 .06
Plotting and joining up these points gives the graph shown. But this is a bad
picture! Graphs of algebraic functions are always smooth, without sharp kinks.
What went wrong? I used a computer to draw the above graph – did the
computer screw up? No, I did. I should have told the computer to use many
more x values in its “table.” When I did that, the graph became smooth. You
could have guessed that the earlier sketch was wrong, if you remember that most
simple functions have smooth graphs. To draw a non-smooth graph you have to
know the reason why.
How do we know that the graphs of algebraic functions, such as 1/(1 + x2),
are smooth? This is a fact obtained from calculus. A brief discussion occurs later
in this chapter.
Problem 7.14 Sketch the graph of f(x)= 1/x for x> 0. Be careful about values of x
less than 1. Can you guess what the curve is?
Problem 7.15 If the graph of f(x) is as shown here, find the approximate values
of f(0), f(.5), and f(3). For which x does f(x) have the maximum value, and what is
the maximum value?
Any polynomial function that contains only even powers of x is an even function
(because (−x)n = xn if n = 0, 2, 4,…). Any polynomial function that contains only
odd powers of x is an odd function (because (−x)n = −(x)n if n = 1, 3, 5,…).
Examples are shown above. Most polynomial functions are neither even nor odd.
2. Vertical asymptotes. Consider the function f(x) = 1/x2. Since division by zero
is impossible, f(x) is undefined for x = 0. The clue to graphing such functions is
this: division by zero is impossible, but division by a very small number produces a
very large number. For example 1/10−n = 10n (law of exponents). Here is a table of
values that makes use of this fact, for f(x)= 1/x2.
Since f is even, and f(x) > 0 for all values of x, the graph must look as shown.
We describe the property that f(x) becomes indefinitely large for x near 0, by
saying that this graph has a vertical asymptote at x = 0. The phrase f has an
infinite singularity at x = 0 is also sometimes used.
Vertical asymptotes occur in situations where division by zero would
otherwise be indicated. The explanation is the same as in the example: division
by a small number produces a large number. If the small number is positive, then
the large number is also positive. Or, if the small number is negative, then the
“large number” is actually a large negative number.
Example. f(x) = 1/(x2 − 1). This function is undefined at x = 1 and −1, so there
should be vertical asymptotes. If x > 1 then x2 − 1 > 0, so f(x) > 0. As x
approaches 1 (with x> 1), f(x) approaches +∞. Also, if x < 1, f(x) < 0, and f(x)
approaches −∞ as x approaches 1. Plotting a few points, such as f(0) = −1 and
f(2) = 1/3 pretty well fixes the entire graph as shown.
Problem 7.18 Graph f(x)= x +1/x. Suggestion: Note that, for large x we have f(x)
≈ x (because 1/x becomes small).
Graphing this function shows that |x| is a continuous function (no breaks), but
not a smooth function, because of the corner at x = 0. We say that |x| is smooth
everywhere except at x = 0.
4. Intercepts. The intercepts of a curve in the XY-plane are the points where it
crosses the axes. For the case of a function graph y = f(x), there is only one Y-
intercept, at y = f(0), assuming f(0) is defined. The X-intercepts, if any, are the
solutions of the equation y = 0, i.e. f(x) = 0. Solving this equation may be
somewhat difficult – see Section 7.7 for a computational algorithm. In some
cases, the equation f(x) = 0 can be solved by inspection or by factoring.
Example f(x) = x3 − 4x. This can be factored as f(x)= x(x2 −4), which equals 0 if x
= 0, 2, or −2. These are the X-intercepts. The Y-intercept is at y = f(0) = 0. Thus
the origin is both an X-intercept and a Y-intercept. Next, since f(x) is odd, we can
concentrate on x ≥ 0. If 0 < x < 2, then f(x)= x(x2 − 4) < 0, whereas f(x) > 0 for x>
2. Finally, for large x we have f(x) = x3 − 4x → +∞ as x →∞ (see below). The figure
shows all these features clearly. By the way, students of calculus can quickly find
the coordinates of the minimum point on this graph; they are (1.2, −3.1)
approximately. See Section 7.5 for details.
5. Behavior for large x. Consider a polynomial function f(x)= anxn + an−1xn−1 + ···
+ a0. The behavior of f(x)as x → +∞ or x →−∞ depends entirely on the leading
term anxn (because the other terms are relatively small, compared to this term, for
large x). Now, as x → +∞ we have xn → +∞ also, so anxn → +∞ or −∞, depending
on the sign of an. Next, as x →−∞ we have xn → +∞ if n is even but xn →−∞ if n is
odd. The behavior of anxn as x →−∞ therefore depends on an and n.
Example f(x) = 2x5 − 10x4 + 5. Here f(x) → +∞ as x → +∞, and f(x) →−∞ as x →
−∞. The graph “wiggles” a bit (the wiggles can be figured out using calculus, as
explained in Sec. 7.5), but behaves as described, for x →±∞.
Problem 7.21 Determine the behavior of the following polynomials as x → +∞
and as x →−∞: (a) f1(x)= −x2 +8x + 1056; (b) f2(x)= 9x3 − 15x; (c) f3(x)= (x +2)5.
Problem 7.22 Sketch the graph of f(x)= x4 − 5x2 + 4. Suggestion: first find the
zeros of f(x) by factoring.
Example f(x) = x/(x2 + 1). As x →∞, f(x) ≈ x/x2 = 1/x for large x, because the
denominator term 1 is insignificant relative to x2 for large x. Therefore f(x) → 0 as
x →∞. We also have f(0) = 0, and f(x) > 0 for x > 0. What must happen is that
f(x)atfirst increases with x and then turns around and decreases to zero for large
x. A brief table of values is useful here.
Note also that f(x) is odd.
As you can see, many features of the graphs of algebraic functions can be
figured out from fairly straightforward principles. Fine details require either
numerical computation, or the use of calculus, or both. Section 7.5 discusses
some basic methods of calculus for polynomial functions, but a full course in that
subject is required to master these techniques in general. Of course, it is an
advantage to have a computer with graphical software, such as I used for all the
graphs shown here. But the computer isn’t much use by itself – you have to know
what you’re doing.
Problem 7.23 Sketch the graphs of the following functions. For each graph,
identify any symmetries, asymptotes, and intercepts. (a) f(x) = x3; (b)
Learning to sketch the graphs of functions is a worthwhile skill, even if you have
a graphing calculator, or computer. In many cases, making your own sketch can
be a lot quicker than typing the function into the calculator. In addition, gaining
familiarity with graphs of many kinds is useful in itself. For example, see if you
can think of a function f(x) whose graph looks like this:
One such example is f(x) = ax/(x + b), where b is a positive number. This graph
has asymptote y = a as x →∞, and also has f(0) = 0. By adjusting the value of b,
you can change the shape of the graph.
Transformation of functions
Suppose we have obtained the graph of a certain function f(x). What can we say
about the graphs of related functions, such as f(x) + 2, or f(x +2), and so on? We
will show that such graphs are obtained by transformation of the original graph,
specifically, by translation or scale changes.
Case 1. f1(x) = f(x)+ a. The graph of f1(x) has equation y = f(x) + a. Thus, if (x, y)
is a point on the graph of f(x), then (x, y + a) is a point on the graph of f1(x) = f(x)
+ a. In other words, the graph of f(x)+ a is just the graph of f(x)translated in the
Y-direction by the amount a (up if a > 0, down if a < 0).
For example, the graph of g(x) = x2 + 2 is just the parabola y = x2, translated
upwards 2 units.
Case 2. f2(x) = af(x). If (x, y) is a point on the graph of f(x), then (x, ay) is a point
on the graph of f2(x) = af(x). The figure shows an example with a > 1. (What
would this graph look like for a = ½? Or a = −½?) The graph of f(x) is obtained
from the graph of f(x)by scaling the Y-coordinate by scale factor a. If a < 0, this
also involves a vertical reflection in the X-axis.
Problem 7.24 Let f(x) = 1/(1 + x2) (discussed earlier). (a) Sketch the graph of f(x),
and of f(x) + 1 on the same XY-plane. What asymptotes are there? (b) Same for
f(x) and −2f(x).
Case 3. f3(x)= f(x + a). (Study this case carefully, as many students find it
confusing at first.) Let (x, y) be a point on the graph of f3(x)= f(x+a). Then y = f(x
+ a), which is the value of f at x + a. This means that any point on the original
graph of f(x) is translated left by a units (if a > 0) to obtain a point on the graph
of f(x+a).
For example, the graph of g(x)= (x +2)2 is a parabola with vertex at x = −2.
This graph is congruent to the parabola y = x2. Similarly, the graph of g(x) = (x −
2)2 is the parabola shifted right two units. If you understand and remember these
two examples, it will help you to keep in mind that the graph of f(x + a) is the
graph of f(x) translated a units to the left (if a > 0).
Case 4. f4(x)= f(ax). Let (x, y)be a point on the graph of f4(x) = f(ax). Then y =
f(ax), which is the value of f at ax. This means that the original graph of f(x) is
scaled in the X-direction, by the scale factor 1/a, to obtain the graph of f(ax).
For example, the graph of f(x)= (2x)2 = 4x2 is a parabola, obtained by
“squeezing” the parabola y = x2 by a factor of ½ in the X-direction. Likewise, the
graph of is the parabola obtained by stretching y = x2 by a factor
of 2 in the X-direction.
Problem 7.25 The figure shows the graph of sin x for 0 ≤ x ≤ 2π. (a) Redraw this
graph, and then superimpose the graph of (b) Same for sin (2x). (c)
Same for 2 sin x.
Problem 7.26 Given the graph of f(x), which of the following functions have
graphs that are congruent to that of f(x)? Similar? Explain. (a) f(x)+ a; (b) f(x +
a); (c) f(ax); (d) af(x).
This all seems quite straightforward. However, in trying to solve x = f(y) for y,
two things may go wrong. The first is that there may be more than one solution
y, implying that y = f−1(x) has more than one value. This is not allowed: a
function (here y = f−1(x)) must always have a unique value. If the equation x =
f(y) has more than one solution y for some value of x, then the original function
f(x) does not have an inverse. (The second difficulty, actually solving x = f(y)for
y, will be discussed later.)
Example 2. Let f(x) = x2 for all x ∈ (−∞, +∞). Then y = f−1(x) would mean that x
= f(y), or x = y2. This would imply that an ambiguous (and therefore
incorrect) result.
The problem in this example, as in any such example, is that the given function
f(x) = x2 produces the same value y = f(x) for two different x’s. For example 4 = 22
= (−2)2, so we can’t solve 4 = y2 uniquely for y. Let us look at this example
graphically. The graph of f(x)= x2 is shown in the first figure. The equation we
want to solve for the inverse function is x = y2, shown in the second figure. For
each x> 0 there are two solutions for y, so the inverse function is undefined
(being ambiguous).
The difficulty, in this example, stems from the fact that the original function f(x)
= x2 is not “one-to-one.” A function f(x) is called one-to-one if, for each value of
y in the range of f, there is just one value of x in the domain of f, such that y =
f(x). This is not the case for f(x)= x2, as the upper graph shows. For a given
function f(x) to have an inverse function, f(x) must be one-to-one. Then the
equation x = f(y) has a unique solution y, which equals f−1(x)by definition. To
summarize:
Now, how do we tell if a given function f(x) is one-to-one? The answer is
quite simple: a (continuous) function is one-to-one if and only if it is monotonic.
“Monotonic” means that f(x) is either entirely increasing, or entirely decreasing.
If you stop and think about it, this statement should be clear.
Problem 7.28 Let f(x) = 4x − x2, for 0 ≤ x ≤ 4. Sketch the graph of f(x). Show
how to restrict the domain of f(x) so as to obtain a one-to-one function. Find the
inverse of this one-to-one function, and sketch its graph on the same XY-plane.
Consequently, the graphs of f and f−1 are reflections of each other in the 45° line, y
= x. The figure shows this for the example f(x) = x2 (x ≥ 0). Another example was
shown in the solution of Problem 7.28. Further examples will be encountered
later.
Problem 7.30 Continuing with the preceding problem, show that each of the
following functions is its own inverse, and sketch; (a) f(x)= 1/x (x> 0);
Example 4. Let f(x)= x3 for −∞ <x < ∞. Then also for −∞ <x <
∞. Note that
and also
This is entirely typical: for any one-to-one function f we always have
This is one way to check that you have calculated f−1(x) correctly. Note also
that the domain of f−1(x) equals the range of f(x) and vice versa.
To prove Eq. 7.6 we use the basic definition of an inverse function. Thus, to
prove that f−1(f(x)) = x, let z = f(x). Then x = f−1(z) = f−1(f(x)) as required.
Equation 7.6 emphasizes the fact that the inverse function f−1 of a given function f
reverses the action of f. In the figure, the action of f−1(f(x)) starts with x, then
produces y = f(x), then produces x = f−1(y) again, as the arrows show. (What does
f(f−1(y)) do here?)
Inverse transformations
Consider the equations of a translation in the XY-plane
This translation is in fact a function, but of a somewhat different kind than the
functions f(x) considered above.
In words, the function Ta,b shifts the point (x, y) to a new point (x + a, y+b).
[If you find this example confusing, recall that in general, a function can
transform any kind of mathematical object, into another mathematical object.
Here the objects are points (x, y) in the plane.]
Can you figure out what the inverse function is? Look at the preceding
figure. Keep in mind that the inverse of a given function reverses the action of
the function. What is the reverse operation to translating (x, y) to (x + a, x + b)? I
hope it is clear that
You can also develop your skills at sketching graphs involving other
combinations. For quotients f(x)/g(x), with g(x) > 0, it helps to realize that 1/g(x)
< 1 if g(x) > 1 and vice versa. Also 1/g(x) → 0 if g(x) →∞. The figure shows the
graph of 1/(1 + x4) thought of as 1/g(x), where g(x) = 1+ x4.
Composition of functions
We next discuss another common way of combining functions, called
composition.
To check that you understand this, try to find g(f(x)). (Answer below.)
Definition If f and g are two given functions, the composition f o g [which is
read as “f-oh-g”] is the function defined by
Thus, in Example 1 we have f o g(x)= 1/(1 + x2). Also,
The slope m measures how steeply the line rises (if m > 0) or falls (if m< 0). A
line with zero slope is horizontal.
What would we mean by the slope of a curve, such as y = x2? It should be clear
that the slope changes from one point to another. The slope m is now a function
of x, rather than a constant.
But what is the slope of a curve, exactly? And how can we calculate it? Here’s
the general method. We consider a given point P on the curve y = f(x), and wish
to determine the slope mP of the curve at point P. To calculate mP we consider a
nearby point Q. Let mPQ denote the slope of the line PQ; we can calculate mPQ
from the basic formula, Eq. 7.9.
(This is read as “limit of mPQ as Q approaches P.”) Let us apply this definition
to the parabola y = x2. Let P be the point on the parabola with coordinates (x, x2).
Let Q be a neighboring point (x + h, (x + h)2). Then
You should identify the sides of the right triangle shown in the figure; they
are Δx = h and Δy = f(x + h) − f(x). Therefore equals the expression
above. The expression on the right side of Eq. 7.10 is called the difference
quotient. The limit of mPQ as h → 0 is the slope of the curve y = f(x)at point P =
(x, f(x)). This quantity (the slope) is denoted by f′(x):
Here f′(x) is read as “f prime of x.” The function f′(x) as defined by Eq. 7.11 is
called the derivative of f(x).
To summarize, if a function f(x) is given, then its derivative f′(x) is defined by
Eq. 7.11. The derivative is also a function; it represents the slope of the curve y =
f(x), at each given value of x. For example, if f(x) = x2 we showed above that f′
(x)= 2x.
Problem 7.37 Show that if f(x) = x3 then f′(x) = 3x2. Suggestion: use the basic
definition, Eq. 7.11, and emulate the calculation for y = x2.
where the terms represented by ··· all involve a factor of h, h2,etc. (Please review
the Binomial theorem, Eq. 4.38, if you have forgotten it, or don’t understand the
last statement.) These terms (···) therefore approach zero as h → 0, and this shows
that f′ (x)= nxn−1. We emphasize this result.
For example, the derivative of x2 equals 2x, while the derivative of x3 equals
3x2, and so on. As a review, make a sketch of the graph of f(x)= x3. Is it obvious
that the slope of this graph is ≥ 0 for all x? What is the slope at the origin?
(Answer: 0.)
Problem 7.38 Explain the significance of Eq. 7.12 for the cases n = 0 and n = 1.
and
Formulas 7.13 and 7.14 are among the basic rules of calculus. Here we will
only apply them to polynomials.
Problem 7.39 Find f′(x) given that (a) f(x) = x2 − 4x +2; (b) f(x) = 3x4 +4x3 +8x2 −
16x + 100.
Problem 7.40 (a) Show by counterexample that (fg) ≠ f′g′. Suggestion. Try very
simple functions f and g. (b) The correct formula is (fg) = f′g + f′g. Verify this
formula for f(x)= x2 +2 and g(x)= x.
Example f(x) = x3 − 3x. To sketch the graph of f, we first note that f is odd. Since
f(x) = x(x2 − 3), the graph intersects the X-axis at x = 0 and Also, f(x)
→ +∞ as x → +∞. Since f(1) = −2, the general shape of the graph is as shown. To
locate the minimum point, we calculate f′(x) = 3x2 − 3 = 3(x2 − 1). Thus f′(x)= 0
for x = ±1. The local min is therefore at x = 1,y = f(1) = −2. The point (−1, 2) is a
local max. The figure shows all these features.
Another example, f(x)= x4 − 5x2 + 4, was considered in Problem 7.22. Let us
now find the local max and min points on the graph of f(x). We have
Problem 7.41 Determine the coordinates of the minimum point on the parabola
y = x2 − 4x + 1 (a) by completing the square; (b) by using the derivative.
Problem 7.42 The figure shows the graph of a certain function f(x). The function
is even, and has a horizontal asymptote at y = 2. Make an approximate sketch of f
′(x).
Problem 7.43 (a) Use the definition, Eq. 7.11, to show that the derivative of f(x)
= 1/x is f′(x) = −1/x2. (b) Locate the minimum point on the graph of g(x)= x +
(1/x)for x> 0.
Recall from Eq. 7.12 that (xn)′ = nxn−1 for n = 0, 1, 2,…. This equation is also
true for negative integers, n = −1, −2,… Problem (a) covers the case n = −1: (x−1)′
= (−1)x−2 = −1/x2. You may wish to do the proof for the general case for yourself.
Write n = −m and put xn = 1/xm. Use Eq. 7.11 and some algebra.
Limits
The definition of the derivative uses the idea of a limit – see Eq. 7.11. But what
exactly is meant by the term “limit?” And how are limits calculated?
In general, this can be a bit complicated. (Indeed, nearly 200 years transpired
between the invention of Calculus, by Newton and Leibniz, and the development
of a rigorous theory of limits by 19th century mathematicians.) However, the
basic idea is that the statement
means simply that the values of f(x) approach the limit L, as x approaches x0.
More precisely, f(x) becomes arbitrarily close to L, provided that x is sufficiently
close to x0.
In some cases, the value of L is quite obvious. For example, what do you
imagine is the value of
Answer: 11, just what you get by substituting x = 3 here. This works because
f(x)= x2 + 2 is a continuous function.
Limits don’t always work out this easily. For example, what is
Substituting x = 2 now gives the meaningless result What does this say about
the limit? One possibility is that since 0/0 is meaningless, the limit does not exist.
However, let us consider some values of x near 2:
It looks as if the limit equals −1.0. Can you see how to explain this?
Perhaps you noticed that the denominator can be factored: x2 −5x+6 = (x −
2)(x − 3). Therefore we have
Here, after canceling the factors (x − 2), we get an expression for which the
limit can be found by substitution. This is a common situation.
Note in the above example (and in the Problem) that the function F (x) in
is undefined at x = x0. In these examples, preliminary algebraic
simplification gets rid of the difficulty, after which the limit can be found by
substitution. Exactly the same situation arises necessarily in calculating a limit
involved in finding derivatives. For example, given g(x)= x2, we have
Here we cannot substitute h = 0, yet (why not?). Algebra comes to the rescue,
giving us
Problem 7.45 Let Find w′(x) on the basis of the definition, Eq.
7.11; comment on the “disappearance” of the troublesome h. Suggestion:
rationalize the numerator in the limit expression.
In all the above examples, direct substitution does not work because the
expression whose limit we seek is undefined at the limit value. This may seem
bizarre, but in fact it is the most common and useful case. In the derivative
formula
(Check that these statements are true, by looking at the graph of f(x).) In the
first case, we have determined a zero exactly. In the remaining cases, we have
improved on the accuracy of the approximation by a factor of 1/2. By repeating
the process n times, we can approximate the zero of f to within (½n)(x2 − x1). This
tolerance can be made as small as we wish, by taking n sufficiently large. The
procedure is called successive bisection.
Let us express the process as an algorithm. Let t be the required tolerance, or
accuracy of the approximation. First, we determine (by trial and error) values x1
< x2 such that f(x1) < 0 and f(x2) > 0. [Alternatively it may happen that f(x1) >
0and f(x2) < 0; the algorithm is easily adjusted for this case. Or else you can
replace f(x) by −f(x).]
1. Let and calculate f(x′).
2. If f(x′) = 0, stop: x is a zero of f.
3. Else: if f(x′) < 0 change x1 to x′, leaving x2 unchanged;
Else: if f(x′) > 0 leave x1 unchanged and change x2 to x′.
The new interval (x1,x2) contains a zero of f.
4. If x2 − x1 ≤ t, stop. Else go to step 1.
This algorithm is well suited to computers, or programmable calculators. In
fact, many advanced scientific calculators have built-in equation solving
programs, which may be based on this algorithm, or on some similar method.
Example Solve x3 + x − 1 = 0. Here f(x) = x3 + x − 1 and we see that f(0) = −1 and
f(1) = 1. Therefore there is a zero of f between x = 0 and 1. Also, f′(x) = 3x2 +1 > 0
for all x; this tells us that the graph of f(x) has positive slope at every point. Thus f
can have only one zero.
Here is the result of the algorithm with a tolerance of .01; only two decimal
places are shown.
After 4 steps we know that the solution x of our equation f(x) = 0 is between x =
.63 and x = .69. Two more steps locates the solution between .68 and .69. In other
words, x = .68, with further digits unknown as yet.
By continuing the algorithm, we could calculate the solution x to any desired
accuracy. From an HP Scientific calculator I obtained the approximation x =
0.68232780383, for which f(x) = 0 to 11 decimals. This calculator uses the
successive subdivision algorithm, and takes several seconds to calculate the
result.
Problem 7.48 Explain why the above algorithm works unchanged in the
alternative case f(x1) > 0, f(x2) < 0, provided we replace f(x) by −f(x).
Problem 7.49 Solve cos x = 2x for x> 0 (remember to use radian mode), using a
tolerance of .1.
7.8 Interpolation
Suppose y = f(x) is a given function. We suppose that the values of f(x) are given
only for certain values of x. We wish to find approximate values of f(x)elsewhere.
For example, suppose that a boy’s height H has been measured on his
birthday:
age, A height, H
12 yr 152 cm
13 yr 159 cm
14 yr 168 cm
How tall was he at age 12 yr 3 months? At what age did he reach 165 cm
height? We can answer these questions, approximately, by using interpolation
(more precisely, linear interpolation). This amounts to assuming, for the sake of
calculation, that the boy grew at a steady rate between ages 12 and 13, and at a
(different) steady rate between ages 13 and 14.
Can you do the first calculation in your head? At age 12 1/4 the boy should
have gained about 1/4 of his height increase for the year, which was 7 cm.
Therefore his height at at age 12 yr 3 mo was approximately 152 + (1/4) × 7 =
153.8 cm. To check that you followed this argument, try to calculate the boy’s
height at 13 yr 6 mos, and 13 yr 9 mo. Answer: at 13 yr 6 mo his height was
approximately 159 + (½) × 9 = 163.5 cm, and at 13 yr 9 mos it was approximately
165.8 cm.
A similar calculation applies to the second question. Since 165 cm represents
6/9 of boy’s total height gain between ages 13 and 14, he reached this height at
about 13 + (6/9) = 13 yr 8 months.
To obtain the general formula for linear interpolation, we assume that f(xi)=
yi for two values x1 < x2. Then for x1 < x <x2, the formula is
Notice that if we write y instead of f(x), then Eq. 7.17 is just the equation of
the straight line joining the two points (xi, yi). The above examples of
interpolation used this equation, in a mental calculation. For example, in the
boy’s height problem we have (A1,H1)= (12, 152) and (A2,H2)= (13, 159). With A
= 12 1/4, Eq. 7.17 gives H = 152 + (7 × 1/4), as before.
Before the availability of hand calculators, students and scientists spent many
painful hours interpolating from the values printed in trigonometric and
logarithmic tables. Such tables listed values of these functions to 3 or 4 decimals,
but by interpolating, one could improve on this by one additional decimal.
Fortunately, no one has to do these tedious calculations nowadays. (It was the
practical impossibility of doing by hand the vast calculations needed in designing
the H-bomb, that led to the first “electronic brains” – now called computers.)
Nevertheless, interpolation is still sometimes useful. We end this chapter with
one example.
Recall that the interval bisection algorithm began by calculating f(x), where x
is the midpoint between x1 and x2. This allows one to narrow the interval
containing the solution of f(x) = 0 by one-half in each repetition. A more
efficient algorithm obtains x by interpolation. As the figure suggests, this will
usually yield a closer approximation to the zero of f(x), than the midpoint. The
algorithm then proceeds as before, by altering the interval [x1,x2] according to the
sign of f(x). This calculation is more efficient than interval bisection. Work out
the details for yourself, if interested.
3. Which of the following sets of ordered pairs is not a function? (a) {(2, 1), (3, 1),
(4, 1), (5, 1)};(b) {(1, 2), (1, 3), (1, 4), (1, 5)}.
4. For any finite set A of real numbers, define max (A) to be the largest element
of A. Is max a function? What is its domain?
5. Sketch the graphs, and identify any symmetry, zeros, and asymptotes. (a) f(x)
= 4 − x2; (c) h(x) = x3 − 3x2 + x. (Suggestion: use the
quadratic formula in finding the zeros of h(x).)
8. The function f(x) has its graph as shown (the graph repeats to the right and
left). Sketch the graphs of (a) 2f(x); (b) f(2x).
9. Let f(x) = 1 − 1/x for x ∈ (0,∞). (a) Sketch the graph of f, and identify its
asymptotes. (b) Find the inverse function f−1(x), and sketch its graph. How do
the asymptotes of f−1 relate to those of f ?
10. Let f(x) = x2 +1/x (x > 0). (a) Sketch the graph of f(x) by adding the graphs of
x2 and 1/x. (b) Using calculus, find the local minimum point of f(x).
11. Use the definition in Eq. 7.11 to find f′(x) if f(x) = 1/x2.
12. Same for f(x) = 1/(x2 +2).
Solution 7.1 All are true except (c).
Solution 7.2 A is the circle of radius 4, centered at the origin, in the XY-plane. B
is the line with slope 1 and Y-intercept 2, in the XY-plane.
Solution 7.3 (a) For any x, x belongs to A ∪ B if and only if x belongs to A or x
belongs to B. (b) For any x, x does not belong to A ∪ B if and only if x does not
belong to A and x does not belong to B. (c) This says that the solutions of the
equation x2 = 1 are x = ±1.
Solution 7.4 For simplicity, assume a ≤ c (the opposite case is similar). A sketch
shows that there are 3 possibilities:
Solution 7.7 (a) f(−2) = 12, obtained by substituting x = −2 in the formula f(x) =
3x2. (b) p(64) = 1, because 64 = 26, which has only one prime factor, 2. (c) A(t1)=
1.5, because t1 has base b = 1 and height h = 3.
Solution 7.8 sin and 10x have domain (−∞, ∞), the set of all real numbers
(actually the set of all numbers that the calculator is capable of working with). 1/x
has domain all real numbers except 0. has domain [0, ∞), the set of all non-
negative real numbers. log x has domain (0, ∞), the set of all positive real
numbers. Asin has domain [-1,1], the set of real numbers satisfying −1 ≤ x ≤ 1.
Solution 7.9 (a) Yes: G is a set of ordered pairs, and no two elements of G have
the same first number and different second numbers. Here G(1) = 3, and so on.
(b) No: h is not a function, because h(2) is not uniquely specified.
Solution 7.10 (a) M gives the midpoint of the line segment joining two given
points (x1,y1)and (x2,y2). Its domain is the set of all pairs of points in the XY-
plane. It could be defined by writing
Solution 7.12 (a) Rf = (−∞, ∞), because any real number y equals 2x+1 for some
value of x, specifically x = (y − 1)/2. (b) Rf = [1, 3], because f(0) = 1 and f(1) = 3,
and the values of f(x) increase from 1 to 3 as x increases from 0 to 1.
Solution 7.13 The domain of G is the set {−1, 1, 4, 7} and the range RG is {1, 2, 3}.
Solution 7.14 First, as x becomes large, f(x)= 1/x becomes small. Also f(1) = 1.
Next, as x gets small, f(x)= 1/x grows larger and larger. Here is a representative
table:
The figure shows this graph. Note that the curve is asymptotic to the X-axis as x
approaches +∞. It is also asymptotic to the Y-axis, as x approaches 0. This may
remind you of a hyperbola, which is what this graph is. If you include the graph
for x < 0, you get both branches of the hyperbola. Try it.
Solution 7.15 f(0) = −.4, f(.5) = .2, f(3) = −.1. The maximum of f(x) occurs for x
= 1, with f(1) = .34. (All values are approximate.)
Solution 7.16
Solution 7.17 f(x) is odd; g(x)and h(x) are even; k(x) is neither.
Solution 7.22 We have f(x)= (x2 − 4)(x2−1), so that f(x)= 0for x = ±2 and ±1.
Also, f is even, and f(x) → +∞ as x → +∞ and as x →−∞. (The minimum points on
the graph are at this can be shown using elementary
calculus – see Sec. 7.5.)
Solution 7.23 (a) f(x) = x3 is odd, has X-and Y-intercept at (0,0), and f(x) →∞ as x
→∞. There are no asymptotes. is even. After squaring
2 2
both sides of we get y − x = 1, which is a hyperbola. The graph of
g(x) is the upper half of this hyperbola. It has asymptotes y = ±x. (c) h(x) = x2/(1
+ x2) is even. Also h(x) → 1 as x → ±∞, so the line y = 1 is an asymptote. Both X-
and Y-intercepts are at (0,0). (d) k(x) is undefined if x2 ≤ 1, that is, for x ∈ [−1,
1]. It has vertical asymptotes at x = ±1, and k(x) → 0 as x → −∞. The function is
even.
Solution 7.24 (a) The graph of f(x) + 1 is the graph of f(x) shifted one unit up;
there is now a horizontal asymptote at y = 1; (b) In this case, the asymptote
remains at y = 0. The curve lies below the asymptote, reaching a minimum value
−2.
Solution 7.25 The graph of sin x in fact repeats cyclically for other values of x, so
that these transformed graphs also repeat cyclically. See Chapter 8.)
Solution 7.26 (a) Congruent; because this is the graph of f(x) translated
vertically; (b) congruent; graph translated horizontally; (c) neither congruent nor
similar; graph is scaled in the X-direction only, which changes its shape; (d)
neither congruent nor similar.
Solution 7.30 (a) The solution of x = f(y)= 1/y is y = 1/x, so f−1(x)= 1/x = f(x) (x>
0). The semi-hyperbola y = 1/x (x> 0) is symmetric by reflection in the 45° line.
Case (b) is similar: f−1(x) = f(x). The graphs, shown below, are symmetric, as
before.
Solution 7.31 First, the range of g(x) is [2, ∞] (because x2 + 4 ≥ 4), so that the
domain of g−1(x) is [2, ∞]. Next, solving gives
The graphs are parts of hyperbolas. Finally
and similarly for g(g−1(x)).
Solution 7.32 To prove that f(f−1(x)) = x, let z = f−1(x). Then x = f(z)= f(f−1(x)) as
required.
Solution 7.34
Solution 7.38 For n = 0, Eq. 7.12 says that if f(x) = 1 = x0, then f′(x) = 0. The
graph of f(x) is the horizontal line y = 1, which has slope 0 for all x. For n = 1, Eq.
7.12 says that if f(x)= x, then f′(x)= 1. The graph of f(x) is the line y = x, which has
slope m = 1 for all x. Thus Eq. 7.12 does give the correct, familiar slope in these
special cases.
Solution 7.39 (a) f′(x)= 2x2 − 4; (b) f′(x)= 12x3 +12x2 +16x − 16.
Solution 7.40 (a) For example let f(x) = 1 and g(x)= x. Then (fg)′ = (x)′ = 1
whereas f′g′ = 0. (b) We have (fg)′ = (x3 +2x)′ = 3x2 +2 and f′g′ + f′g′ = 2x · x +(x2
+2) = 3x2 +2.
Solution 7.41 (a) Completing the square gives y = (x − 2)2 − 3, so that the
minimum point is x = 2,y = −3. (b) With f(x) = x2 − 4x +1 we have f′(x)= 2x − 4,
so that f′(x)= 0 for x = 2. The minimum point is at (2,f(2)) = (2, −3).
Solution 7.42 Using the fact that f′(x) equals the slope of the curve y = f(x) at x,
we see that f′ is odd; f′(0) = 0; f′(x) → 0 as x → +∞. Also, f′(x) reaches a maximum
at some point x1, which is where the given curve y = f(x) has maximum slope.
Here, x1 ≈ 0.6.
Here the factor h has been cancelled at Step 3. (Note that the result
fits with the general rule, Eq. 7.12.)
Solution 7.49 Putting f(x)= 2x−cos x, we have f(0) = −1 and f(1) = 1.5, so there is
a solution x to f(x) = 0 between 0 and 1. The algorithm becomes
or 67,000 mph. (If you were surprised to learn that the earth is racing
through space at 67,000 mph, think what Galileo’s contemporaries must have
thought of his claim that the earth rotates around the sun, rather than the
reverse!)
7.
8. (a) By Eq. 3.3, X being proportional to Y means that X = c1Y for some
constant c1. Similarly, Y = c2Z for some constant c2. Therefore X = c1Y = c1c2Z
= c3Z where c3 = c1c2, i.e. X is proportional to Z. (b) We know that
Armstrong’s weight is proportional to g, which is itself proportional to the
mass of the heavenly body in question. If his weight on the moon is 98.8%
less than on the Earth, this implies that the moon’s mass is 98.8% less than
the Earth, so the moon’s mass is 1.2% of that of the earth.
9. (a) 1.5% of 7 × 109 is .105 × 109 = 105 × 106, or 105 million. This is the
approximate increase in population for one year. (b) To a first
approximation, 10 years growth will add 10×105 million, or 1.05 billion
people to the world’s population, which will therefore be a bit over 8 billion
in 2020. Of course, this assumes that the growth rate of 1.5% per year will
continue for the next 10 years, which may not be the case.
4. Three line segments form a triangle if and only if the longest segment is less
than the sum of the other two segments. For example, if a ≥ b and a ≥ c, then
a, b, c form a triangle if and only if a < b + c. To explain this, imagine trying
to construct the triangle by drawing arcs of radius b and c centered at the
endpoints of segment a. These arcs will not intersect if b + c< a, but they will
intersect if b + c> a. (The case b + c = a would give a “collapsed” triangle,
which is not usually considered to be an actual triangle.)
5. (a) s = rθ =20 cm × (70°×2π/360°)= 24.4cm.
6. First, if h is the height of the parallelogram, we have h/2 = sin60°, or h =1.73
m. Therefore A = bh = 6.93 m2.
10. 0.60. The general rule is A′ B′ /A′ C′ = AB/AC, and similarly for other ratios.
The reason for this is that, if k is the scale factor, then A′ B′ = kAB and A′ C′ =
kAC. Therefore A′ B′ /A′ C′ = kAB/kAC = AB/AC.
11. The perimeters scale as k while the areas scale as k2.
12. Draw any chord AB to the arc. Then the perpendicular bisector of this chord
coincides with a diameter of the circle. Using two different chords gives two
different diameters, the intersection of which is the center of the circle.
13. Since 52 +122 = 169 = 132, this is a right triangle. For the smaller angle θ we
have tan θ = 5/12, which implies that θ = 22.6°. The complementary angle is
67.4°.
14. First calculate x from y/x =tan θ, or x = y/ tan θ =8/ tan 38° =10.2. Then
Also B = 90° − 38° = 52°. As a check, A being a
bit less than 45°, we should have BC slightly smaller than AC, which agrees
with the calculation.
15. We have x/r = sin 50°, or x = r sin 50° =4.6 cm. Therefore AB = 9.2 cm. The
length of the arc joining A and B is s =10.5cm.
16. Each degree down from the Arctic circle increases the sun’s angle above the
horizon by one degree. Since NYC is south of the Arctic circle, the sun
will be above the horizon there.
17. Draw a diagram similar to that used in the solution of Problem 5.42, except
that you now draw a circle with radius r1+r2, outside the larger circle. Let AP
be the tangent from this new circle, drawn from the center P of the smaller
circle. The crossover tangent BC has the same length as AP ,namely
[We need to have d > r1 + r2, to have a crossover
tangent.]
18. Let l and w be the length and width of R. Then the dimensions of the smaller
rectangles are w by l/2. Therefore l/w = w/(l/2). Hence l2 =2w2, or
19. We have l/w = w/(l − w). This becomes l2 − lw − w2 = 0. Divide through by w2,
and set x = l/w, obtaining the equation x2 − x − 1= 0. The solution of the
latter equation is (by the quadratic formula)
This is the golden ratio l/w. (The other solution is negative,
and not relevant.)
20. Name the rhombus ABCD as in the diagram, and O as the intersection of the
diagonals. Then ΔDAB ≡ ΔDCB (3 sides equal). Also, both of these triangles
are isosceles. Together, these facts show that the diagonal BD bisects the
rhombus angles B and D. Similarly, AC bisects A and C. Next we see
that ΔDCO ≡ ΔBCO (2 sides and enclosed angle). Hence COB = COD;
since these angles are supplementary, each equals 90°. Also OD = OB. This
means that diagonal AC is the perpendicular bisector of diagonal DB.
Similarly DB is the perpendicular bisector of AC. (There are several
alternative proofs.)
The converse can be stated as follows. Let AC and BD be two line segments,
with each being the perpendicular bisector of the other. Then ABCD is a
rhombus. (Another formulation of the converse would be: if the diagonals of
a parallelogram are perpendicular bisectors of one another, then the
parallelogram is a rhombus.)
To prove this, note that all four triangles shown in the diagram are congruent
(2 sides and included angle). Therefore AB = BC = CD = DA,so that ABCD is
a rhombus.
21. By trigonometry we have d/2 = 384, 000 sin 0.25°, giving d = 3, 350 km for the
diameter of the moon. The ratio of the volumes equals (3, 350/12, 000)3 =
.022, i.e. the volume of the moon is about 2.2% that of the earth.
Chapter 6
1. (a) x − 2y =0; (b) x − 2y =4.
2. x =12/7, y = −11/7.
3. Solve the simultaneous equations (x − 3)2 +(y − 1)2 =4, y =0. The algebra is
. Thus
4. False. Almost any example will disprove this. Take m1 =1, m2 =2. Then θ1
=45° and θ2 =Atan m2 =63.4°.
5. The perpendicular line through (1,1) has equation x +2y = 3. The point of
intersection of this line and the given line 2x − y = 4 is (11/5, 2/5). This is the
required point on the line, closest to (1,1).
6. By subtraction we get 8x +2y +2 = 0, or 4x + y +1 = 0. This is clearly the
equation of a line. To see that this line passes through the points of
intersection of the circles, note that any such intersection point (x, y) satisfies
the two circle equations simultaneously. Therefore (x, y) also satisfies the
equation obtained by subtracting the equations, that is, (x, y) is a point on the
line 8x +2y + 2 = 0. Thus the line passes through both points of intersection
of the circles. (The circles do actually intersect.)
7. The given line segment has slope −1 and midpoint (2,3). Hence the equation
of the perpendicular bisector is y −3= 1(x−2), or y = x+1.
8. After squaring, we have y2 = x. By analogy with y = x2, this is the equation of a
parabola opening to the right. The original equation is the upper
half.
9. Both hyperbolas have the same asymptotes y = ±bx/a. The first hyperbola has
X-intercepts at x = ±a, and opens to the left and right. The second hyperbola
has Y-intercepts at y = ±b, and opens up and down.
10. We imagine a unit circle x2 + y2 =1, with area A = πr2 = π, and we think of it
as being covered by tiny squares. Sketching this figure by factor a in the X-
direction and b in the Y-direction produces the ellipse x2/a2 + y2/b2 = 1, which
is covered by little rectangles having area ab times their original area. Hence
the area of the ellipse is A = πab. Note that this agrees with the formula for
the area of a circle, in the case that a = b.
11. By completing the square and simplifying we obtain the equation (x − 3)2/4 −
y2 = 1. This is the equation of a hyperbola centered at (3,0), with asymptotes
The foci are at distance from the
center, and are therefore located at The eccentricity
12. To solve these equations simultaneously, first add them together, giving
Then from y2 = x2 − 2 we get y = ±1. There are
four points of intersection and this agrees with the figure (note
the symmetry).
Chapter 7
1.
2. (a) all x ≠ 2; (b) all x;(c) all x except 0 <x < 1 (because x(x−1) < 0 if x lies
between 0 and 1).
3. (b) is not a function.
4. Yes, max is a function. Its domain is the family of all finite sets of real
numbers, as stated.
5. (a) The graph of f(x) = 4 − x2 is a parabola, opening downwards. f(x) is an
even function. The X-intercepts are at x = ±2. (b) The graph of
is the upper half of the circle x2 + y2 = 4. (c) First, h(x) =
x(x2 − 3x + 1). Solving the equation x2 − 3x + 1 = 0 by the quadratic formula
gives x =2.6or .4. Therefore the zeros of h(x)are x =0, .4, and 2.6
approximately. h(x)is neither even nor odd. The local max and min points
can be found using calculus. Namely, f′(x) = 3x2 − 6x +1, which equals zero
for x = .2 or 1.8,approximately. The point (.2,h(.2)) = (.2,.5) is a local max,
and (1.8, −2.1) is a local min.
6. (a) We have x2 +2x +3 = (x +1)2 +2. The min is at x = −1,y =2. (b) From
calculus, which equals zero for x = −1. This is
the same result as in (a).
7. We have f′(x)= 3x2 −12x =3x(x−4), which is 0 for x = 0 or 4. Thus f(x) has a
local max or min at (0,9) and (4, −23). Since f(x) →−∞ as x →−∞ and f(x) →
+∞ as x → +∞, the curve y = f(x) must cross the X-axis at least three times.
(Make a rough sketch to explain this.)
8.
TEST
Part I. Arithmetic
Do not use a calculator for this part of the test.
3. Simplify: 1.52 − 3(4.8 − 2.7)
(A) −0.62 (B) −20.98 (C) −15.58 (D) −4.78
4. Simplify: (15 − 3)/(5 − 2)
(A) 2/5 (B) 4 (C) 0 (D) 2
5. Find the decimal expansion of 15/8.
(A) 1.6969 … (B) 1.75 (C) 1.875 (D) 1.88 …
6. Find 13.7 × 1.8 rounded off to one decimal place.
(A) 11.9 (B) 15.5 (C) 24.6 (D) 24.7
7. Find the remainder on dividing 707 by 11.
(A) 0 (B) 3 (C) 7 (D) 64
8. Find 25% of $500.
(A) $125 (B) $12.50 (C) $525 (D) $275
9. Express the binary number 110110 in decimal form.
(A) 26 (B) 40 (C) 54 (D) 202
10. Write in decimal notation: 2.95 × 104.
(A) 2,950,000 (B) 295,000 (C) .000295 (D) 29,500
11. a(b + c)= ab + ac is called the
(A) commutative law (B) multiplication law (C) associative law (D)
distributive law
12. How long does it take a plane flying at 500 mph to travel from Denver to New
York City, a distance of 1575 miles?
(A) 1225 (B) 3 hrs 9 min (C) 3.75 hrs (D) 3 hrs 15 min
5. Expand: (1 − b)3
(A) 1 − b3 (B) 1 − 3b +3b2 − b3 (C) 13 − b3 (D) 1 − b − b2 − b3
SOLUTIONS
Part I
1. C (Sec. 2.3)
2. A (Sec. 2.3)
3. D (Sec. 2.1–2.2)
4. B (Sec. 2.3)
5. C (Sec. 2.3)
6. D (Sec. 1.5)
7. B (Sec. 2.3)
8. A (Sec. 3.2)
9. C (Sec. 1.4)
10. D (Sec. 1.6)
11. D (Sec. 1.7)
12. B (Sec. 3.6)
Part II
1. C (Sec. 4.3)
2. B (Sec. 4.4)
3. A (Sec. 4.4)
4. D (Sec. 4.5)
5. B (Sec. 4.7)
6. D (Sec. 4.3)
7. C (Sec. 4.3)
8. B (Sec. 6.2)
9. A (Sec. 4.7)
10. C (Sec. 4.8)
11. A (Sec. 4.9)
12. D (Sec. 4.5)
Part III
1. C (Sec. 7.2)
2. D (Sec. 7.2)
3. A (Sec. 7.4)
4. C (Sec. 7.2)
5. C (Sec. 7.5)
6. B (Sec. 7.6)
7. C (Sec. 7.3)
8. D (Sec. 7.3, 8.1)
Part IV
1. B (Sec. 5.2)
2. D (Sec. 5.2)
3. B (Sec. 5.6)
4. A (Sec. 5.3)
5. D (Sec. 5.4)
6. C (Sec. 5.6)
7. C (Sec. 5.6)
Common Errors in Elementary
Mathematics
Teachers and professors regularly encounter certain common types of student
error. Students who continue to make these errors cannot hope to succeed in any
science or technology program. Most of these errors result from a lack of
understanding of basic math, together with reliance on faulty memory.
A common symptom of lack of confidence in basic math is the use of bad,
sloppy handwriting. In doing mathematics, it is essential to write down your
work neatly and carefully. This also simplifies the task of checking your
calculations. I always recommend using a pen rather than a pencil. You can
delete a wrong step by drawing a line through it, which is better than erasing – it
sometimes turns out that you discover later that the erased step was correct after
all.
Misuse of the distributive law
The distributive law is
To understand this, you must know that a, b, c represent arbitrary real numbers,
and that juxtaposition ab means multiplication, and also that operations inside
brackets are to be carried out before other operations. Thus, for example, 3 × (5
+ 11) = 3 × 16 = 48. This checks out to be the same as (3 × 5) + (3 × 11), as in Eq.
1.
Common mistakes are: forgetting to do the bracketed calculation first, as in
x(y −2) = xy −2 [False]; or omitting necessary brackets, as in 2(w −3t+5
[Meaningless]. A third mistake is using the apparent form of the distributive law
where it does not apply. Examples of this error:
These errors all look vaguely like the distributive law. However, any student who
remembers why the distributive law is true, at least for whole numbers (see
Section 1.3), will be unlikely to make such mistakes. Question: explain how we
know that the distributive law is true for whole numbers. Hint: think of a
diagram, for a typical example. Read Chapter 1 if you have forgotten this.
Problem 1. For each of the above false equations (a) give a counterexample; (b)
write the correct equation, if there is one.
Misuse of cancellation
The cancellation law is
Here a, b, c represent arbitrary real numbers, with a and c not equal to zero. We
say that the factor a, which is a common factor in the numerator and the
denominator, can be cancelled from both:
This is easy to read and check. Compare this with what happens if you strike
things out. Striking out can and often does result in errors.
Mistakes may also result from misunderstanding the cancellation law as in
the following examples:
Students who make such mistakes seem to be of the impression that any symbol
that occurs in the numerator and denominator of a fraction can be cancelled. Of
course this is incorrect: cancellation applies only to common factors, as in Eq. 2.
The common factor can be a complicated expression, as in
Problem 2. (a) Identify the factors, if any, in the expressions (i) 3(x − 2)(y + 4);
(ii) 3x − 2y + 4. (b) Factor the whole number 28 into its prime factors.
Problem 3. Decide whether the following calculations are correct, and explain:
Operations with fractions
Here are the rules for combining fractions
For example, 3/5+ 8/5 = 11/5. Second, for fractions that have different
denominators, we can first use the cancellation law to rewrite the given fractions
so that their denominators are the same:
This is the best way to remember how to add fractions, much better than
trying to remember Eq. 5 itself. For practice, try adding x/2+2y/5. You should get
(5x +4y)/10.
Furthermore, you can (and should) immediately check any such calculation
by reversing the steps:
Problem 4. Write as a single fraction (a) 3/x − y/4; (b) (8x/3) ÷ (4x2/9). Note: it
is best to write these problems out in the usual vertical manner before
proceeding.
where |x| denotes the absolute value of x. To show that Eq. 6 is correct, we
consider the cases of positive and negative x separately. For x > 0 we have
and |x| = x also, so the equation is correct. Example:
For x< 0 we have x2 > 0 and also |x| = −x in this
case. Example: Thus Eq. 6 is always correct (for x = 0,
as well).
Problem 5. Simplify
Summary
For any reader of this book who tends to make errors in mathematics, I
recommend the careful study of Chapters 1 through 4 before proceeding to the
later chapters. Also, try to adopt the following work habits rigorously:
1. Always write down your calculations carefully and neatly. A few connecting
words can help to make sense of your writing—for example: therefore …,
we have …, by substitution …, etc.
2. Consciously develop and use habits that help to avoid errors, and allow for
easy checking. For example, never use strike-outs to indicate cancellations.
3. Always double check every calculation before going on. Many useful
techniques for checking your work are discussed in this book.
4. Be sure you fully understood each technical term, such as factor,
denominator, radian, and so on. Use the book’s Index to look up such terms
when necessary.
5. Never be satisfied with mere memorization of something that is confusing,
or not fully understood. A firm background in mathematics requires
understanding of every definition, concept, argument, and formula. Exactly
what is meant by “understanding” is a complex psychological question,
which is addressed throughout the book.
6. Always exert the effort to understand the proof, or derivation, or
explanation, of every point in the book, or in class. This can be hard work,
but it will pay off later in terms of confidence in your understanding of
mathematics.
Solution 1. (a) For example (2 + 3)2 = 25, not 22 +32, which is 13.
Counterexamples to the other false equations are also easily found. (b) We have
(a+b)2 = a2+2ab+b2. Also sin(a+b)= sin a cos b+cos a sin b (Chapter 8). The
other false equations do not have correct alternatives – they’re just false.
Solution 2. (a) The factors are 3, x − 2, and y + 4; (ii) There are no factors other
than the entire expression. (b) 28 = 2 × 2 × 7.
Solution 3. (a) Incorrect because p − 2 is not a factor of the numerator; (b) This
is correct.
Solution 4.
Solution 5. (a) |x − 1|, because (x − 1)2 = x2 − 2x + 1; (b) cannot be simplified.
Greek Alphabet
=
equals
equation
+
plus
addition
−
minus
subtraction
×
times
multiplication
÷
divided by
division
/
over
division
a over b division; fraction
—
( ), { }, [ ]
brackets
an a to the nth
exponents
<
less than
inequality
>
greater than
inequality
≤ (≥) less (greater) than or equal to
inequality
|x| absolute value (or magnitude) of x
absolute value
square root
square root
%
per cent
per cent
∝
is proportional to
proportionality
x2 x2, or x sub 2
subscripts
P(x) P of x
function
C(n, k) C of n, k
binomial coefficient
n! n factorial
factorial
sum, k equals 0 to n
summation
angle
angle
°
degrees
angle
end of proof
a′ a prime
Δ
triangle
triangle
perpendicular
perpendicular
sin
sine
sine
cos cos (or cosine)
cosine
tan tan (or tangent) tangent (function)
π
pi
circumference
(x, y)
xy coordinates; ordered pair; interval
Δx delta x
slope
∈
belongs to
sets
⊂
is contained in
sets
∪
union
sets
∩
intersection
sets
{x : …} the set of all x such that …
sets
[a, b] closed interval a, b interval, closed
∞
infinity interval, infinite
→∞
approaches
infinity asymptote
f transforms x to y
function
f -1 f inverse inverse
function
f′(x) f prime of x
derivative
f o g f oh g
composition
limit as x approaches x0
limit
(r, θ) r theta
polar coordinates
base of natural logarithms
loga log to the base a
logarithm
ln
l n
natural logarithm
Index abscissa, 285
absolute value, 51, 99, 196
absolute value function, 352
acceleration, 140
Acos (arc cosine), 252
acute angle, 227
triangle, 229
addition algorithm, 5, 52
decimal point numbers, 24, 51
fractions, 80, 161
polynomials, 167
whole numbers, 10
adjacent side, 249
algorithm, 5
addition, 6, 25, 52
of fractions, 80, 82
division, 65
division, long, 65, 95, 214
multiplication, 19, 58
order, 37
subtraction, 46, 49, 55
alternate angles, 231
angle, 224
acute, 227
alternate, 231
bisection of, 276
complementary, 226
corresponding, 229
exterior, 232
interior, 231
obtuse, 227
opposite, 227
right, 225
size of, 225
straight, 225
supplementary, 226
angles, equal, 224
sum of, 231
angular velocity, 267
approximate calculations, 29
arc cos, 252
arc length, 265
arc of circle, 224, 260, 264
arc sine, 251
arc tan, 253
area, 114
of circle, 263, 278
of sector, 267
of triangle, 116
argument, 341
arithmetic, laws of, 34, 96
Asin (arc sine), 251
associative law, of addition, 10, 54
of multiplication, 14, 59
asymptote, 319, 348
Atan (arc tangent), 252
axes, coordinate, 284
axiom, 222
parallel, 233
binary arithmetic, 20
binocular vision, 247
binomial, 171
coefficients, 201, 202
theorem, 199, 205
bisection of an angle, 276
bisection, method of, 383
bisector, perpendicular, 256, 276
bonds, financial, 150
brackets, nested, 17
rule of, 11
calculator, basic, 109
scientific, 276
calculus, 370
cancellation, 72, 159
errors in, 74
cancellation law, 73, 96, 159
carbon dating, 157
centre of a circle, 224
checking calculations, 162, 168
chord of circle, 257
circle, 224
arc of, 224, 264
area of, 263, 278
circumference, 261
circumscribed, 273
determined by 3 points, 273
diameter of, 259
equation of, 288
inscribed, 273
radius of, 224
tangent to, 270
circles, intersection of, 308
circumference of circle, 261
circumscribed circle, 273
closed interval, 331
coefficient, of polynomial, 165
coefficients, binomial, 201, 202
common denominators, method of, 81, 163
common factors, 175
commutative law, of addition, 9, 54
of multiplication, 14, 59
completing the square, 188
composite number, 68, 103
composition of functions, 369
compound interest, 150, 152
conclusion, 247
congruence, 239
congruent triangles, 240
conic sections, 316
constant term, 165
constructions, ruler and compass, 273
continuity, 351
conventions, for minus sign, 60
converse, 248
coordinate axes, 284
coordinates, rectangular, 283
corresponding angles, 229
cosine, 250
counterexample, 87, 248
counting principle, 10
cusp, 354
decay, exponential, 157
decimal point numbers, 22
full system of, 49
long division, 95
decimal system, 1
decimal, repeating, 79, 100
degree of polynomial, 165
denominator, 70
common, 81
least (lowest) common, 83, 163
rationalizing, 210
derivative of a function, 372
of a polynomial, 374
diameter, of a circle, 259
difference quotient, 372
digit, 2
positional significance, 1
significant, 33
dimensions, 136
directrix, 325
discounting, 155
discriminant, 187
disk, 224
distance, between points, 223
distance formula, 286
distributive law decimal numbers, 59
whole numbers, 15
dividend, 64, 171
division, 63
as reverse of multiplication, 63
by ten, 27
by zero, 88, 94
extended, 77
long, 65, 77, 95, 214
of fractions, 90, 163
of polynomials, 171, 214
of rational numbers, 90
synthetic, 215
with remainder, 64, 171
divisor, 64, 171
domain, 334
natural, 338
earth, size of, 266
eccentricity, ellipse, 323
earth’s orbit, 323
hyperbola, 324
parabola, 325
elimination, 304
Gaussian, 306
ellipse, 316
focus of, 321
empty set, 331
energy, 142
equation, conditional, 302
graph of, 291
linear, 180, 299
meaning of, 301
of circle, 288
of straight line, 292, 296
quadratic, 183
relational, 302
solving, 179, 218
equations inconsistent, 306
numerical solution of, 382
simultaneous, 303
equilateral triangle, 255
Eratosthenes’ method, 266
errors in mathematics, 11, 105, 411
even function, 347
examples, generic, 17
exponential decay, 157
growth, 150, 157
exponents, 144
fractional, 205
laws of, 145, 207, 208
negative, 146
repeated, 146
rules of, 145, 207, 208
extraneous solutions, 192
factor, 13, 67, 101
common, 175
of polynomial, 175
factor theorem, 217
factorial, 201
factoring quadratics, 175
whole numbers, 68, 103
factorization, prime, 68
focus ellipse, 321
hyperbola, 324
parabola, 325
fractions, 69
addition of, 80, 161
decimal representation of, 77
division of, 90, 163
improper, 71
multiplication of, 83, 85, 163
negative, 83
order of, 76
proper, 72
reducing, 73
subtraction, 83, 161
function, 333
argument of, 341
composition of, 369
continuous, 351
derivative of, 372
domain of, 334
even, 347
graph of, 341
inverse, 360
monotonic, 362
natural domain of, 338
odd, 347
one-to-one, 362
range of, 339
smooth, 351
transformation of, 357
Gaussian elimination, 306
gear ratios, 268
generic example, 16
geometric increase, 150
golden ratio, 281
googol, 6
grade, 294
gradient, 294
graph of equation, 291
of function, 341
of inverse function, 363
of sums and products, 368
greater than, 36, 50
growth, exponential, 157
population, 156
hyperbola, 318
focus of, 324
hypotenuse, 236, 249
hypothesis, 247
identity, 302
inconsistent equations, 306
inequalities, 194
laws of, 38, 194
linear, 181
quadratic, 197
solving, 195
infinity, approaching, 350
inscribed circle, 273
integers, 47
intercept, 300, 352
interest compound, 150
interpolation, 385
intersection of circles, 308
of lines, 222
of sets, 329
interval, 331
closed, 331
open, 331
inverse function, 360
graph of, 363
operations, 206
transformation, 366
invert-and-multiply rule, 91
irrational numbers, 80, 100, 104
isosceles triangle, 254
l.c.d., 83, 163
larger & smaller, 36 see also: order, inequalities laws, of arithmetic, 35, 96
of exponents, 145, 207
learning mathematics, see mathematics least (lowest) common denominator, 83,
163
length of a line segment, 223
less than, 36, 50
like terms, collecting, 167
limit, 379
line (straight), 222
equation of, 296
intercept form, 300
point-slope form, 297
slope-intercept form, 298
segment, 223
slope of, 292
linear equation, 180, 299
inequality, 181
lines, parallel, 223, 295
perpendicular, 295
logarithmic growth, 150
logic and proof, 247
long division, 65, 82, 95
of polynomials, 171, 214
look-up table, 335
lowest common denominator, 83, 163
magnitude, 51, 99
orders of, 33
mass vs. weight, 138
mathematical models, 124, 152
mathematics, learning, 38, 233
solving problems in, 310
understanding, 7, 105
maximum, 377
measurement, units of, 134
mental blocks, 40
midpoint, 287
minimum, 377
minus, two minuses make a plus, 45, 56
using the minus sign, 60
mistakes, avoiding, 11, 74 see also errors models, mathematical, 124, 152
monotonic function, 362
multiplication by ten, 17, 27
decimal numbers, 28, 58
fractions, 83, 85, 163
polynomials, 167
whole numbers, 13
MWTFU, 173
natural number, 5
negative number, 47, 83
nested brackets, 17
nines, casting out, 69
notation, scientific, 29
number, composite, 103
decimal point, 22, 49
irrational, 80, 100, 104
natural, 5
negative, 47, 83
prime, 68, 102
rational, 83, 87
real, 100
whole, 5
number line, 36, 47, 57
numerals, 2
numerator, 70, 171
numerical precision, 32
obtuse angle, 227
triangle, 229
odd function, 347
one-to-one function, 362
open interval, 331
opposite side, 249
order, 36, 50
see also inequalities for fractions, 76
of magnitude, 33
ordered pair, 336
ordinate, 285
origin, of number line, 47
of rectangular coordinates, 283
parabola, 320
focus of, 325
parallel axiom, 233
parallel lines, 223, 295
parallelogram, 118
parameter, 153
parentheses, 12
Pascal’s triangle, 200
per cent, 113
perpendicular bisector, 256
lines, 225
pi, 261
planetary orbits, 323
point, 222
polynomial, quadratic, 165
polynomials, 165
addition of, 167
derivative of, 374
division of, 171
linear, 165
long division of, 214
multiplication of, 167
quadratic, 165
roots of, 180
population growth, 156
positional significance of digits, 1
powers of ten, 3, 26
precedence rule, × over +, 18, 94
exponents over multiplication, 166
precision, numerical, 32
present value, 155
prime number, 68, 102
problems, how to solve, 310
product, 13
proof, 234, 247
by example, 16
proportionality, 125
inverse, 129
multiple, 127
protractor, 226
Pythagoras’s theorem, 236
QED, 228
quadrant, 284
quadratic equation, 183
factoring, 176
formula, 183
quotient, 64, 171, 214
radian, 263
radical, 205
simplifying, 209
radius of circle, 224
range, 339
rate of change, 378
rational number, 83, 87
rational number system, 100
rationalizing a denominator, 210
ray, 224
real number system, 100
reciprocal, 92, 296
rectangular coordinates, 283
reducing a fraction, 73
reductio ad absurdum, 104
reflection, 239, 314
reflector parabolic, 325
remainder, 64, 173, 214
remainder theorem, 217
repeating decimals, 79, 100
reversible transformation, 192
right angle, 225
rigid motion, 239
roots double, 188
of negative numbers, 212
of polynomial, 180
nth roots, 205
rotation, 239, 314
rounding off, 33
rule of nine, 69
rule of three, 69
ruler and compass constructions, 273
ruler, number, 23, 36
rules of arithmetic, 34
scale effects, 119
scaling, 239, 315
scientific notation, 29
sector of circle, 267
area of, 267
segment, line, 223
set, 329
empty, 331
SI system of units, 134
significant digits, 33
similar triangles, 243
similarity, 238
simultaneous linear equations, 303
sine, 250
singularity, 349
slash notation, 76
slope of curve, 371
of line, 292
smooth function, 351
solution, extraneous, 192
numerical, 382
of inequalities, 195
of quadratic equation, 183
of polynomial equations, 179, 218
solving triangles, 249
solving math problems, 310
speed, 129
average, 133
square root, 99
straight line, see line (straight) strike-outs, 75
substitution, 144
subtend, 264
subtraction, 43, 55
algorithm, 46
and number line, 57
as reverse to addition, 45
of fractions, 83
summand, 13
summation notation, 204
symmetry, of functions, 347
synthetic division, 215
tangent (function), 250
tangent line, to circle, 270
to curve, 373
temperature, 136
term of polynomial, 165
theorem binomial, 199, 205
factor, 217
if-and-only-if, 218
Pythagoras’s, 236
remainder, 217
traffic congestion, 122
transformation, inverse, 366
of functions, 357
of plane figures, 239, 311
transitivity, 194
translation, 239, 311
transposing terms, 181
transversal, 229
triangle, 229
acute, 229
area of, 116
congruent, 240
equilateral, 255, 275
isosceles, 254
obtuse, 229
Pascal’s, 200
right, 229
similar, 243
solving, 249
triangulation, 246
trigonometric functions, 250
two minuses make a plus, 45, 56
understanding mathematics, 7, 105, 211
union of sets, 329
units of measurement, 134
variable, 165, 288
velocity angular, 267
of rotating object, 269
vertex, angle, 224
triangle, 229
vision, binocular, 247
weight, 138
whole number, 5
zero, 4
division by, 88, 94
of a function, 383
zero product rule, 97, 184