Linear Algebra Week 5
Linear Algebra Week 5
February 2, 2017 1 / 37
Similarity of matrices
Definition 1 (Similarity)
Let A and B be two n × n matrices. We say A is similar to B if there is an
invertible matrix P such that B = P −1 AP.
Theorem 2
If A and B are similar, then both have the same characteristic polynomials.
Thus they have the same eigenvalues with the same multiplicities.
Proof:
DB (t) = |B − tIn |
= |P −1 AP − tI|
= |P −1 (A − tIn )P| (∵ P −1 IP = I)
|P|−1 (∵ P −1 = |P|−1 )
=
|P|
DA (t) = DA (t).
February 2, 2017 2 / 37
Diagonalizability and diagonalization
Diagonalization problem: Let A be a square matrix. Find an invertible P
such that P −1 AP = D is a diagonal matrix.
Equivalently, we can ask whether or not A is similar to some diagonal
matrix.
Observations: We make some observations to help us attack the
diagonalization problem.
1 First observe is that if D = diag{λ1 , λ2 , ..., λn } then λ1 , λ2 , ..., λn
must be the eigenvalues of A.
(Characterstic polynomials are same.)
2 Next observe that if we write P = [v1 v2 ... vn ] as a row of column
vectors, then {v1 , v2 , ..., vn } must be a basis of Rn .
(Column rank is n.)
3 And finally we make a deeper observation that
v1 , v2 , ..., vn are eigenvectors of A
with eigenvalues λ1 , λ2 , ..., λn respectively.
February 2, 2017 3 / 37
Computations
• Next,
P −1 AP = D =⇒ AP = PD
=⇒ A[v1 v2 ... vn ] = [v1 v2 ... vn ]diag{λ1 , λ2 , ..., λn }
=⇒ [Av1 Av2 ... Avn ] = [λ1 v1 λ2 v2 ... λn vn ]
=⇒ Avj = λj vj (1 ≤ j ≤ n),
as claimed.
The following definition is now natural.
Definition 3 (Eigenbasis)
For a square n × n matrix A, a basis B of Rn is called an A-eigenbasis if
each basis vector is an eigenvector of A.
February 2, 2017 4 / 37
Example
Example 4
1 2 −2
Let A = 2 1 −4. Find the eigenvalues and a basis of eigenvectors
1 −1 −2
which diagonalizes A. Also write down a matrix X such that X −1 AX is
diagonal.
Solution:
DA (λ) = −λ(λ2 − 9) =⇒ λ = 0, ±3.
4 2 −2
λ = −3: Row reduction of A + 3I = 2 4 −4 yields
1 −1 1
1 0 0 0
0 1 −1. Hence v1 = 1 is an eigenvector.
0 0 0 1
February 2, 2017 5 / 37
Example contd.
1 2 −2
λ = 0: Row operations on A produces a matrix 0 1 0 . Hence
0 0 0
2
v2 = 0 is an eigenvector.
1
1 −1 1 1
λ = 3: A − 3I gives rise to 0 0 1 . Hence v 3 = 1 is an
0 0 0 0
eigenvector.
0 2 1
This enables us to write X = [v1 , v2 , v3 ] = 1 0 1 .
1 1 0
−1 1 2 −3 0 0
1
X −1 = 1 −1 1 . Finally, X −1 AX = 0 0 0 .
3
1 2 −2 0 0 3
February 2, 2017 6 / 37
Existence/nonexistence of Eigenbases
Example 5
1 1
Let A = . Find the eigenvalues and eigenvectors.
0 1
Solution: The characteristic polynomial is D(t) = (t − 1)2 . There is only
one eigenvalue t = 1 which is repeated. Solving for the eigenvectors:
0 1
A − I2 = is in REF.
0 0
x
=⇒ y = 0 =⇒ v = , x 6= 0.
0
The span of the eigenvectors is only 1-dimensional, and hence they cannot
produce a basis of R2 .
February 2, 2017 7 / 37
Algebraic/geometric multiplicity, defect
Lesson: When λ is an eigenvalue of A, the nullity of A − λI has to be
positive; but may not be sufficiently positive.
This phenomenon leads to the following definitions:
Definition 6 (Algebraic multiplicity)
Let λ be an eigenvalue of A. The multiplicity of λ as a root of the
characteristic polynomial DA (t) is called the algebraic multiplicity of λ.
We write this number as mλ = mλ (A).
If λ is an eigenvalue then
1 ≤ gλ ≤ mλ .
February 2, 2017 8 / 37
Algebraic/geometric multiplicity, defect
Definition 8 (Defect)
Let λ be an eigenvalue of A. The difference mλ − gλ is known as the
defect of λ and is denoted δλ = δλ (A).
X
The sum δλ may be called the total defect of A. (Not a standard
definition.)
February 2, 2017 9 / 37
Proof of the theorem
Proof:
Denote the distinct eigenvalues: λ1 , λ2 , ..., λk .
Geometric multiplicities: g1 , g2 , ..., gk respectively.
In general,
#B = g1 + g2 + · · · + gk ≤ n
and equality will hold
X
⇐⇒ gj = mj ∀j ⇐⇒ δj = 0 ∀j ⇐⇒ δj = 0,
which is given.
Hence #B = n andtherefore B must be an A-eigenbasis of Rn .
Theorem 10 (Digonalization)
Let A be defect free i.e. each eigenvalue of A is defect-free. Then A is
similar to a diagonal matrix whose diagonal entries are the eigenvalues of
A, each occurring as many times as its multiplicity.
Proof:
Let
renamed
[
B= Bj = {v11 , ..., v1m1 , ..., vk1 , ..., vkmk } = {w1 , w2 , ..., wn }
j
be an A-eigenbasis.
Let
renamed
{λ1 , ..., λ1 , ..., λk , ..., λk } = {λ1 , λ2 , ..., λn }
| {z } | {z }
m1 times mk times
February 2, 2017 11 / 37
Diagonalization
Let
P = [w1 w2 ...wn ]
be the square matrix formed by the columns (eigenvectors) wj .
Then
The defect or the gap in the proofs will be corrected later. [1.0]
February 2, 2017 12 / 37
An application
Before correcting the defect (∗) in the proofs, we give an application
Example 11
Consider the coupled system of equations satisfied by ẏ1 (t), ẏ2 (t), ẏ3 (t):
ẏ1 = 2y1 + y2 + y3
ẏ2 = y1 + 4y2 + y3
ẏ3 = y1 + y2 + 2y3 .
Solution:
2 1 1 y1
In vector form ẏ = Ay, A = 1
4 1 , y = y2 .
1 1 2 y3
The eigenvalues of A are λ = 1, 2, 5.
February 2, 2017 13 / 37
Application contd.
−1 1 1
A choice of the corresponding eigenvectors is 0 −1 2 .
1 1 1
−1 1 1
Let X = 0 −1 2 be the ’diagonalizer’ and make the ’change of
1 1 1
variables’
u(t) = X −1 y(t).
The equivalent de-coupled system is
u̇ = X −1 ẏ = X −1 Ay = X −1 AX (X −1 y) = diag{1, 2, 5}u
which is simply
u̇1 = u1 , u̇2 = 2u2 , u̇3 = 5u3 .
February 2, 2017 14 / 37
Application contd.
C1 e t
u1
The solution of the transformed system is u2 = C2 e 2t .
u3 C3 e 5t
C1 e t
y1 (t) −1 1 1
y2 (t) = 0 −1 2 C2 e 2t =?.
y3 (t) 1 1 1 C3 e 5t
February 2, 2017 15 / 37
Algebraic/geometric multiplicity, defect
Recall:
Definition 12 (Defect)
Let λ be an eigenvalue of A. The difference mλ − gλ is known as the
defect of λ and is denoted δλ = δλ (A).
X
The sum δλ may be called the total defect of A. (Not a standard
definition.)
February 2, 2017 16 / 37
Proof of the theorem
Proof:
Denote the distinct eigenvalues: λ1 , λ2 , ..., λk .
Geometric multiplicities: g1 , g2 , ..., gk respectively.
In general,
#B = g1 + g2 + · · · + gk ≤ n
and equality will hold
X
⇐⇒ gj = mj ∀j ⇐⇒ δj = 0 ∀j ⇐⇒ δj = 0,
which is given.
Hence #B = n andtherefore B must be an A-eigenbasis of Rn .
Proof:
Suppose that
ck1 vk1 + ck2 vk2 + · · · + ckr vkr = 0, (1 ≤ k1 < · · · < kr ≤ k)
February 2, 2017 18 / 37
Proof of l.i. of eigenvectors
February 2, 2017 19 / 37
L.i. of eigenvectors, removing defect from previous proofs
Corollary 15
S
The set B = j Bj in the theorem about A-eigenbasis is L.I.
Explicating, X X X
cv v + cv v + · · · + cv v = 0
v∈B1 v∈B2 v∈Bk
| {z } | {z } | {z }
w1 ∈Eλ1 w2 ∈Eλ2 wk ∈Eλk
February 2, 2017 20 / 37
L.i. of eigenvectors, removing defect from previous proofs
Can rewrite,
w1 + w2 + · · · + wk = 0, wj ∈ Eλj .
By the theorem above, each wj = 0. Therefore,
X X
∀jwj = cv v = 0 =⇒ cv v = 0 ∀j
v∈Bj v∈Bj
=⇒ cv = 0 ∀v ∈ Bj , ∀j
=⇒ cv = 0 ∀v ∈ B.
February 2, 2017 21 / 37
Simple eigenvalues
Corollary 16
If the characteristic polynomial has only simple roots (mλ = 1 for each λ),
then Rn admits a basis of eigenvectors of A viz an A-eigenbasis.
For example if
1 2 3
A = −4 −5 −6 ,
7 8 9
then
trA = 1 − 5 + 9 = 5.
February 2, 2017 23 / 37
Example
Example 19
Let A and B be two n × n matrices, show that tr(AB) = tr(BA).
Similarly,
n n n
" n #
re-order sum
X X X X
tr(BA) = b`j aj` = b`j aj` = tr(AB).
`=1 j=1 j=1 `=1
February 2, 2017 24 / 37
Example
Example 20
Let A be a real n × n. Show that tr(AT A) = tr(AAT ) = kAk2 .
Therefore tr(AAT ) is the sum of the squares of all the entries of A hence
equals kAk2 .
February 2, 2017 25 / 37
Principal sub-minors
The trace of a square matrix is generalized to the notion of the sum of principal
r × r sub-minors.
In particular,
tr1 = tr and trn = det.
February 2, 2017 27 / 37
Examples
Example 24
a b
Let A = . Then
c d
tr1 A = trA = a + d while tr2 A = |A| = ad − bc.
1 2 3
Let B = −4 −5 −6 . Then tr1 B = 5, while
7 8 9
1 2 −5 −6 1 3
tr2 B = + + = −6,
−4 −5 8 9 7 9
1 2 3
tr3 B = −4 −5 −6 = 0.
7 8 9
February 2, 2017 28 / 37
Examples contd.
Example 25
Let B be as in the previous example. Determine the eigenvalues of B and
check for diagonalizability.
Solution:
1 − t 2 3
DB (t) = −4 −5 − t −6 = −t 3 + 5t 2 + 6t = −t(t + 1)(t − 6)
7 8 9 − t
The eigenvalues are simple and hence B is diagonalizable.
February 2, 2017 29 / 37
Connection with characteristic polynomial
Theorem 26
Let A be the a square matrix of order n. Its characterstic polynomial can
be expressed as
Xn
DA (t) = (trn−r A)(−t)r .
r =0
For convenience we have set tr0 ≡ 1 for all square matrices. Proof omitted.
Example 27
In the previous examples,
February 2, 2017 30 / 37
Connection with characteristic polynomial
Corollary 28
For a square matrix A, its r th order trace equals the sum of the produts of
its eigenvalues taken r at a time.
Corollary 29
For a square matrix A, its trace equals the sum of its eigenvalues while the
determinant equals the product of the same.
February 2, 2017 31 / 37
Direct proof of the second corollary
OTO2 H
DA (t) = |A − tI| =⇒ DA (0) = |A|.
n
Y
Next look at p(t) = t n DA (1/t) = (λj t − 1) = |tA − I|.
j=1
February 2, 2017 32 / 37
Direct proof of the corollary
OTO2 H,
−1 0 · · · a1j ··· 0
0 −1 · · · a2j ··· 0
.. .. ..
0 d X . . . 0
p (0) = (|tA − I|) =
dt t=0 0 0 ··· ajj · · · 0
j
.. .. ..
. . . 0
0 0 ··· anj · · · −1
X
= (−1)n−1 ajj (j th term by j th row.)
j
= (−1)n−1 trA.
X
Comparing, trA = λj .
j
February 2, 2017 33 / 37
Computation
A A · · · An 0 = (A1 )0 A2 · · · An + A1 (A2 )0 · · · An
1 2
+ · · · + A1 A2 · · · (An )0 .
0
∴ p 0 (t) = tA1 − e1 , tA2 − e2 , · · · , tAn − en
+ · · · + tA1 − e1 , tA2 − e2 , · · · , An
February 2, 2017 34 / 37
Example
Example 30
1 2 4
Find whether or not A = 2 4 8 is diagonalizable. Use traces to find the
4 8 16
characteristic polynomial and eigenvalues.
Solution: trA = 21, tr2 A = 0 = tr3 A. Hence the char. polynomial is
DA (t) = −t 3 + 21t 2 .
February 2, 2017 35 / 37
Computation of other higher traces trr
See Handout-2
February 2, 2017 36 / 37