N N 1 N 1 N
N N 1 N 1 N
In (1), when a0 6= 0, we say that the degree of p is n and we call a0 the leading
coefficient. When p is a nonzero constant, we say that the degree of p is zero. When
p is equal to the constant zero, it is customary assign ∞ to be the degree of p. We are
mainly interested in nonconstant polynomials and there is no need to worry about this
convention. By an algebraic equation we mean an equation of the form p(x) = 0. By a
root of the polynomial p, or a solution to the algebraic equation p(x) = 0, we mean
a (complex) number a such that p(a) = 0.
If the degree of p is two, we call p(x) = 0 a quadratic equation, if three, we call a cubic
equation, if four, we call a quatric equation, and if five, we call a quintic equation. Next we
discuss the general quadratic equation p(x) = 0, where p(x) = ax2 + bx + c with a 6= 0.
An important technique for dealing with a polynomial of degree two is called completing
square, which is based on the identity a2 + 2ab + b2 = (a + b)2 .
1
EXAMPLE 3. We are asked to do the “completing square” for the following polynomials:
x2 + 6x + 7, 2x2 + 2x and 3x2 − 2x + 1. Here they are:
x2 + 6x + 7 = x2 + 2 . 3x + 32 − 32 + 7 = (x + 3)2 − 2,
2x2 + 2x = 2(x2 + x) = 2(x2 + 2(1/2)x + (1/2)2 − (1/2)2 ) = 2(x + 1/2)2 − 1/2,
3x2 − 2x + 1 = 3(x2 − 2(1/3)x + (1/3)2 − (1/3)2 ) + 1 = 3(x − 1/3)2 + 2/3.
As you can see, “completing square” is like a tailor’s job of fitting a given expression into
the form (x + a)2 , keeping but without worrying what is left over.
Now we return to the general polynomial of degree two: p(x) = ax2 + bx + c, with
a 6= 0. In the following discussion we assume that a, b, c are real numbers. We perform
“completing square” for this polynomial:
µ ¶
2 2 b
p(x) ≡ ax + bx + c = a x + x + c
a
à ½ ¾2 ½ ¾2 !
b b b
= a x2 + 2 x + − +c
2a 2a 2a
à ½ ¾2 !
2 b b b2
=a x +2 x+ −a 2 +c
2a 2a 4a
Or µ ¶2
b 4ac − b2
p(x) = a x + + . (3)
2a 4a
For convenience, let us write
p(x) = a(x − x0 )2 + m, (4)
QUESTION 5. How do you use (3) or (4) to derive the following formula
√
−b ± b2 − 4ac
x= (5)
2a
2
for finding the roots of ax2 + bx + c = 0?
We return to the general theory of polynomials. You are assumed to know long division
for polynomials. See if you can do the following
EXERCISE 6. Use long division to divide f (x) = 2x4 − 7x3 + 14x + 4 by g(x) = x − 2.
Now we have to be more careful about numbers that we are allowed to use. To avoid
technicality, we restrict ourselves to the following three number systems: rationals, reals
and complex numbers. The standard notation for them is:
In what follows, we use letter F (this is not a standard notation) to stand one of the
above: Q, R or C. We use this letter F here because it is a field according to some
technical definition. When a polynomial all coefficients a0 , a1 , . . . , an of a polynimial
p(x) = a0 xn + a1 xn−1 + · · · + an−1 x + an are in F, then we say that p is a polynomial
over F. Given polynomials f (x) and g(x) over F, we can divide f (x) by g(x), using
long division, to get a quotient q(x) and a remainder r(x), which are also polynomials over
F. The relation between f , g and q, r is given by the identity f (x) = g(x)q(x) + r(x).
Here, the degree of r(x) is strictly less than the degree of g(x), or r(x) is identically
zero. (Note: the degree of a nonzero constant polynomial is 0 but the degree of the zero
polynomial is defined to be infinity.) In case r(x) ≡ 0 so that we have f (x) = g(x)q(x),
we say that g is a factor of f or g divides f . When f has no factors over F other
than the trivial ones, that is, constants or constant multiples of f , we say that f is an
irreducible polynomial over F.
3
which are unique up to multiples of nonzero elements in F. The proof of this is similar
to the unique factorization of integers. A good book on abstract algebra should have a
theory (about something called PID, that is, principal ideal domain) covering the unique
factorization theorem for both polynomials and integers.
One consequence of our discussion here is: If p(x) is a polynomial of degree n, then
p(x) cannot have more than n roots. To see this, suppose that p(x) has more than n
roots, say a1 , a2 , . . . , am with m > n. Then, according to what we have just learned,
f (x) ≡ (x − a1 )(x − a2 ) · · · (x − am ) is a factor of p(x). This cannot happen because the
degree of f (x) is m, which is greater than the degree of p(x), which is n. Indeed, the
degree of a factor of p cannot be greater than the degree of p.
4
q(a), the value of q at a, to see if it is zero. If q(a) 6= 0, the answer is No and in that
case we say that a is a simple root of p. If q(a) = 0, the answer is Yes and in that
case we say that a is a multiple root of p.
EXERCISE 11. Prove that a is a simple root of p if and only if p(a) = 0 but p0 (a) 6= 0;
equivalently, a is a multiple root of a if and only if p(a) = 0 and p0 (a) = 0. (Of course,
p0 here stands for the derivative of p.)
Now we extract all factors of x − a from p. Say, there are m of them so that we can
write p(x) = (x − a)m g(x) for some polynomial g. Now g has no factor of x − a and
hence g(a) 6= 0. The positive integer m is called the multiplicity of the root a. When
m = 1, a is a simple root. When m > 1, a is a multiple root. When we count the number
of roots of a polynomial, we should count their multiplicities as well in order to get correct
answers to many questions.
Any proof of this basic theorem is beyond the level here, although there are over a hundred
of them; (I don’t know the recent record: I guess there are about two hundreds of proofs).
As we know, a real polynomial (that is, a polynomial with real coefficients) does not
necessarily have a real root. For a quadratic polynomial p(x) = ax2 + bx + c with real
coefficients a(6= 0), b, c, the recipe (5) in QUESTION 5 for its roots tells us that p does
not have real roots if its discriminant b2 − 4ac is negative and in that case its roots
are two complex numbers conjugate to each other. In general, we have the following fact
concerning the roots of a real polynomial:
5
of p, then its complex conjugate ω is also a root of p; (3) p can be written as a product
of irreducible factors, either of the form rx + s or ax2 + bx + c, where r, s, a, b, c are
real numbers with r, a 6= 0 and b2 − 4ac < 0.
and hence ω is also a root of p. This proves (2). Now both x − ω and x − ω are
factors of p and they are different (otherwise ω = ω, contradicting the fact that ω is
nonreal). Thus (x − ω)(x − ω) is a factor of p. So we have p(x) = (x − ω)(x − ω)q(x)
for some polynomial q of degree n − 2. Now
(x − ω)(x − ω) = x2 + (ω + ω)x + ωω ≡ x2 + bx + c
Since p and (x−ω)(x−ω) ≡ x2 +bx+c are real polynomials and p(x) = (x2 +bx+c)q(x),
q is also a real polynomial.
We can ask if q has a nonreal root. If the answer is yes, we can get another conjugate
pair of nonreal roots and hence another factor of the form ax2 + bx + c with real a, b, c
such that b2 − 4ac < 0. Pulling out this factor from q (and hence from p), the degree
of f drops again by 2. Continue in this manner, until all nonreal roots are exhausted so
that only real roots of p remains. Now part (3) of the above theorem is clear. Part (1)
is a direct consequence of part (3).
From the above discussion, we see that irreducible polynomials over C are of the form
ax + b, where a 6= 0, and irreducible polynomials over R are of the form rx + s (r 6= 0)
6
or ax2 + bx + c with b2 − 4ac < 0. What about irreducible polynomials over Q, the field
of rational numbers? This is hard, very hard. We can produce many of them by using
some advanced theorems such as Eisenstein’s criterion. Some of them such as cyclotomic
polynomials are very important. A complete description of irreducible polynomials over
rationals seems to be impossible.
EXAMPLE 15. We are asked to factorize the polynomial p(x) = x5 − 1 over the real
field R. Let ω = e2πi/5 . Then 1, ω, ω 2 , ω 3 ≡ ω 2 , z 4 = ω are roots of p. Thus
QUESTION 16. Are you sure the answer in the last example correct? Check the identity
à √ !à √ !
1− 5 1+ 5
x4 + x3 + x2 + x + 1 = x2 + x+1 x2 + x+1 .
2 2
7
In the rest we breifly describe other important aspects of polynomials. First: finding
roots by radicals. Formula (5) in QUESTION 5 is a closed form for roots of a general
quadratic equation in terms of a radical; (roughtly speaking, a radical is an algebraic ex-
pression involving roots, without trigonometric, exponential or other transcendental func-
tions). For solving a general cubic equation x3 + ax2 + bx + c = 0, Cardano published
in 1545 a method described as follows. (This method was originally due to Tartaglia, to
whom Cardano promised earlier to keep it as a secret. Here we cannot give more detail of
this fascinating story.) First step: to eliminate the x2 term by a simple transformation
x = y + k, where k will be determined. Substituting x = y + k to x3 + ax2 + bx + c,
we get (y + k)3 + a(y + k)2 + · · · = y 3 + (3k + a)y 2 + · · · . Letting k = −a/3, the original
equation is converted into
y 3 + py + q = 0. (7)
This step is more or less routine. It works for polynomial of any degree. The next step
is VERY SLICK: let y = u + v. (Here we replace one unknown y by two unknowns
u and v. But so far we only have one equation, namely (7). Therefore we have the
freedom to choose another equation!) Then (6) becomes (u + v)3 + p(u + v) + q = 0, or
u3 + 3u2 v + 3uv 2 + v 3 + p(u + v) + q = 0, that is,
uv = −p/3, (9)
8
where ω = e2πi/3 . Due to the restrain (9), only three combinations of u and v are
possible for y = u + v as the solutions:
√ √ √ √ √ √
A + ω2 B, ω 2
3 3 3 3 3 3
y= A+ B, ω A+ω B.
Quadric equations (algebraic equations of degree four) can be solved by the method of
reduction to cubic equations, which was discovered by Cardano’s student Ferrari. Great
efforts were made to solve quintic equations by radicals, but all of them were doomed to fail:
in early nineteen century Abel proved that this is impossible. In this case, merely using
radicals is not enough. To solve quintic equations one needs elliptic functions (such as the
Weierstrass P function ℘(z)) which are related to elliptic curves, a hot topic in current
mathematical research. The most powerful method of studying algebraic equations is the
Galois theory, which has tremendous impact on many other areas. Galois is a “romantic
revolutionary figure” in mathematics who died in a duel at the age of barely 21. [Another
“romantic figure” in math is Sofya Kowalevskaya, who is one of the two greatest women
in math and whose beauty is legendary. I’m just curious why Hollywood didn’t make a
movie about her.]
In EXERCISE 2 we have considered the relation between the roots and the coefficients
of a quadric. Consider a general polynomial p of degree n with leading coefficient 1:
By multiplying out the right hand side of (11) and then comparing to (10), we have
where X
s1 =x1 + x2 + · · · + xn ≡ xk
k
X
s2 =x1 x2 + x1 x3 + x2 x3 + · · · + xn−1 xn ≡ xj xk
j6=k
(13)
..
.
sn =x1 x2 · · · xn .
9
The expressions s1 , s2 , . . . , sn , considered as functions of x1 , x2 , . . . xn , are called
elementary symmetric functions.
We also mention that, for a real polynomial p, there is an important method due to
Sturm to determine how many real roots of p are lying in a given interval.
10