Advanced Mathematics
Advanced Mathematics
Chapter 1
Complex Number and
Complex Variables
Objectives
After completing this chapter, you will be able to:
Introduction
Functions of (x, y) that depend only on the combination (x + iy) are called functions of
a complex variable and functions of this kind that can be expanded in power series in
this variable are of particular interest.
This combination (x + iy) is generally called z, and we can define such functions as z n,
exp(z), sin z, and all the standard functions of z as well as of x.
They are defined in exactly the same way the only difference being that they are
actually complex valued functions, that is, they are vectors in this two dimensional
complex number space, each with a real and an imaginary part (or component).
Most of the standard functions we have previously discussed have the property that
their values are real when their arguments are real. The obvious exception is the square
root function, which becomes imaginary for negative arguments.
Since we can multiply z by itself and by any other complex number, we can form any
polynomial in z and any power series as well. We define the exponential and sine
functions of z by their power series expansions which converge everywhere in the
complex plane.
Since all the operations that produce standard functions can be applied to complex
functions we can produce all the standard functions of a complex variable by the same
steps as go to producing standard functions of real variables.
1
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Complex Number
Complex numbers allow solutions to certain equations that have no solutions in real
numbers. For example, the equation
has no real solution, since the square of a real number cannot be negative. Complex
numbers, however, provide a solution to this problem. The idea is to extend the real
numbers with an indeterminate i (sometimes called the imaginary unit) taken to satisfy
the relation i2 = −1, so that solutions to equations like the preceding one can be found.
In this case, the solutions are −1 + 3i and −1 − 3i, as can be verified using the fact
that i2 = −1:
2
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Formally, the complex number system can be defined as the algebraic extension of the
ordinary real numbers by an imaginary number i. This means that complex numbers can
be added, subtracted and multiplied as polynomials in the variable i, under the rule
that i2 = −1. Furthermore, complex numbers can also be divided by nonzero complex
numbers.
This way, a complex number is defined as a polynomial with real coefficients in the
single indeterminate i, for which the relation i2 + 1 = 0 is imposed. Based on this
definition, complex numbers can be added and multiplied, using the addition and
multiplication for polynomials. The relation i2 + 1 = 0 induces the equalities i4k =
1, i4k+1 = i, i4k+2 = −1, and i4k+3 = −i, which hold for all integers k; these allow the
reduction of any polynomial that results from the addition and multiplication of complex
numbers to a linear polynomial in i, again of the form a + bi with real coefficients a, b.
The real number a is called the real part of the complex number a + bi; the real
number b is called its imaginary part. To emphasize, the imaginary part does not
include a factor i; that is, the imaginary part is b, not bi.
The definition of the complex numbers involving two arbitrary real values
immediately suggests the use of Cartesian coordinates in the complex plane. The
horizontal (real) axis is generally used to display the real part, with increasing values to
the right, and the imaginary part marks the vertical (imaginary) axis, with increasing
values upwards.
3
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Notably, the operations of addition and multiplication take on a very natural geometric
character, when complex numbers are viewed as position vectors: addition corresponds
to vector addition, while multiplication corresponds to multiplying their magnitudes and
adding the angles they make with the real axis. Viewed in this way, the multiplication of
a complex number by i corresponds to rotating the position vector counterclockwise by
a quarter turn (90°) about the origin—a fact which can be expressed algebraically as
follows:
4
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
If r is calculated first as above, then this formula for φ may be stated a little more
simply using the standard arccosine function:
If z is a real number (that is, if y = 0), then r = |x|. That is, the absolute value of a
real number equals its absolute value as a complex number.
5
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Together, r and φ give another way of representing complex numbers, the polar
form, as the combination of modulus and argument fully specify the position of a point
on the plane. Recovering the original rectangular co-ordinates from the polar form is
done by the formula called trigonometric form
6
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Equality
Two complex numbers are equal if and only if both their real and imaginary parts are
equal. That is, complex numbers z1 and z2 are equal if and only if Re(z1) =
Re(z2) and Im(z1) = Im(z2). Nonzero complex numbers written in polar form are equal if
and only if they have the same magnitude and their arguments differ by an integer
multiple of 2π.
Ordering
Conjugate
Geometrically, 𝑧 is the "reflection" of z about the real axis. Conjugating twice gives the
original complex number
7
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
The product of a complex number z = x + y and its conjugate is known as the absolute
square. It is always a positive real number and equals the square of the magnitude of
each:
The real and imaginary parts of a complex number z can be extracted using the
conjugation:
Moreover, a complex number is real if and only if it equals its own conjugate.
Two complex numbers a and b are most easily added by separately adding their
real and imaginary parts of the summands. That is to say:
8
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Multiplication
Since the real part, the imaginary part, and the indeterminate i in a complex number are
all considered as numbers in themselves, two complex numbers, given as z = x + yi
and w = u + vi are multiplied under the rules of the distributive property,
the commutative properties and the defining property i2 = -1 in the following way
Formulas for multiplication, division and exponentiation are simpler in polar form
than the corresponding formulas in Cartesian coordinates. Given two complex
numbers z1 = r1(cos φ1 + i sin φ1) and z2 = r2(cos φ2 + i sin φ2), because of the
trigonometric identities
9
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
In other words, the absolute values are multiplied and the arguments are added to yield
the polar form of the product. For example, multiplying by i corresponds to a quarter-
turn counter-clockwise, which gives back i2 = −1. The picture at the right illustrates the
multiplication of
Since the real and imaginary parts of 5 + 5i are equal, the argument of that number is
45 degrees, or π/4 (in radian). On the other hand, it is also the sum of the angles at the
origin of the red and blue triangles are arctan (1/3) and arctan(1/2), respectively. Thus,
the formula
holds. As the arctan function can be approximated highly efficiently, formulas like this –
known as Machin-like formulas – are used for high-precision approximations of π.
Video Links
Introduction to complex numbers
• https://www.youtube.com/watch?v=SP-YJe7Vldo
Complex Variables and Functions
• https://www.youtube.com/watch?v=iUhwCfz18os&lis
t=PLdgVBOaXkb9CNMqbsL9GTWwU542DiRrPB
Complex Variables and Functions
• https://www.youtube.com/watch?v=iUhwCfz18os&lis
t=PLdgVBOaXkb9CNMqbsL9GTWwU542DiRrPB
References
• https://ocw.mit.edu/ans7870/18/18.013a/textbook
/HTML/chapter18/section02.html
• https://en.wikipedia.org/wiki/Complex_number#:
~:text=A%20complex%20number%20is%20a,is%
20called%20an%20imaginary%20number.
10
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Chapter 2
Laplace and Inverse
Laplace Transform
Objectives
Introduction
Laplace's use of generating functions was similar to what is now known as the z-
transform, and he gave little attention to the continuous variable case which was
discussed by Niels Henrik Abel. The theory was further developed in the 19th and early
20th centuries by Mathias Lerch, Oliver Heaviside, and Thomas Bromwich.
The current widespread use of the transform (mainly in engineering) came about during
and soon after World War II, replacing the earlier Heaviside operational calculus. The
advantages of the Laplace transform had been emphasized by Gustav Doetsch to whom
the name Laplace Transform is apparently due.
Laplace Transform
1
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
The Laplace transform of a function f(t), defined for all real numbers t ≥ 0, is the
function F(s), which is a unilateral transform defined by
One can define the Laplace transform of a finite Borel measure μ by the Lebesgue
integral
An important special case is where μ is a probability measure, for example, the Dirac
delta functions. In operational calculus, the Laplace transform of a measure is often
treated as though the measure came from a probability density function f. In that case, to
avoid potential confusion, one often writes
This limit emphasizes that any point mass located at 0 is entirely captured by the Laplace
transform. Although with the Lebesgue integral, it is not necessary to take such a limit, it
does appear more naturally in connection with the Laplace–Stieltjes transform.
2
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
When one says "the Laplace transform" without qualification, the unilateral or one-sided
transform is usually intended. The Laplace transform can be alternatively defined as
the bilateral Laplace transform, or two-sided Laplace transform, by extending the limits of
integration to be the entire real axis. If that is done, the common unilateral transform
simply becomes a special case of the bilateral transform, where the definition of the
function being transformed is multiplied by the Heaviside step function.
Probability theory
By convention, this is referred to as the Laplace transform of the random variable X itself.
Here, replacing s by −t gives the moment generating function of X. The Laplace
transform has applications throughout probability theory, including first passage
times of stochastic processes such as Markov chains, and renewal theory.
3
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
The Laplace transform has a number of properties that make it useful for analyzing
linear dynamical systems. The most significant advantage is that differentiation becomes
multiplication, and integration becomes division, by s (reminiscent of the
way logarithms change multiplication to addition of logarithms).
Because of this property, the Laplace variable s is also known as operator variable in
the L domain: either derivative operator or (for s−1) integration operator. The transform
turns integral equations and differential equations to polynomial equations, which are
much easier to solve. Once solved, use of the inverse Laplace transform reverts to the
original domain.
Given the functions f(t) and g(t), and their respective Laplace transforms F(s) and G(s),
It is often convenient to use the differentiation property of the Laplace transform to find
the transform of a function's derivative. This can be derived from the basic expression
for a Laplace transform as follows:
4
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
The following table provides Laplace transforms for many common functions of a single
variable. For definitions and explanations, see the Explanatory Notes at the end of the
table.
Using this linearity, and various trigonometric, hyperbolic, and complex number (etc.)
properties and/or identities, some Laplace transforms can be obtained from others more
quickly than by using the definition directly.
The unilateral Laplace transform takes as input a function whose time domain is the non-
negative reals, which is why all of the time domain functions in the table below are
multiples of the Heaviside step function, u(t).
The entries of the table that involve a time delay τ are required to be causal (meaning
that τ > 0). A causal system is a system where the impulse response h(t) is zero for all
time t prior to t = 0. In general, the region of convergence for causal systems is not the
same as that of anticausal systems.
5
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
6
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
7
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
It can be proven that, if a function F(s) has the inverse Laplace transform f(t), then f(t) is
uniquely determined (considering functions which differ from each other only on a point
set having Lebesgue measure zero as the same). This result was first proven
by Mathias Lerch in 1903 and is known as Lerch's theorem.
The Laplace transform and the inverse Laplace transform together have a number of
properties that make them useful for analyzing linear dynamical systems.
An integral formula for the inverse Laplace transform, called the Mellin's inverse
formula, the Bromwich integral, or the Fourier–Mellin integral, is given by the line
integral:
where the integration is done along the vertical line Re(s) = γ in the complex plane such
that γ is greater than the real part of all singularities of F(s) and F(s) is bounded on the
line, for example if contour path is in the region of convergence. If all singularities are in
the left half-plane, or F(s) is an entire function, then γ can be set to zero and the above
inverse integral formula becomes identical to the inverse Fourier transform.
In practice, computing the complex integral can be done by using the Cauchy residue
theorem.
Post's inversion formula for Laplace transforms, named after Emil Post, is a simple-
looking but usually impractical formula for evaluating an inverse Laplace transform.
The statement of the formula is as follows: Let f(t) be a continuous function on the
interval [0, ∞) of exponential order, i.e.
8
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
for some real number b. Then for all s > b, the Laplace transform for f(t) exists and is
infinitely differentiable with respect to s. Furthermore, if F(s) is the Laplace transform
of f(t), then the inverse Laplace transform of F(s) is given by
As can be seen from the formula, the need to evaluate derivatives of arbitrarily high
orders renders this formula impractical for most purposes.
With the advent of powerful personal computers, the main efforts to use this formula have
come from dealing with approximations or asymptotic analysis of the Inverse Laplace
transform, using the Grunwald–Letnikov differ integral to evaluate the derivatives.
Post's inversion has attracted interest due to the improvement in computational science
and the fact that it is not necessary to know where the poles of F(s) lie, which make it
possible to calculate the asymptotic behavior for big x using inverse Mellin transforms for
several arithmetical functions related to the Riemann hypothesis.
Video Links
Laplace transform
• https://www.youtube.com/watch?v=OiNh2DswFt4
Inverse Laplace Transform Example
• https://www.youtube.com/watch?v=c6YnYr8KsSo
Inverse Laplace Transform Example
• https://www.youtube.com/watch?v=c6YnYr8KsSo
References
• https://en.wikipedia.org/wiki/Laplace_transform
• https://en.wikipedia.org/wiki/Inverse_Laplace_tra
nsform
• https://web.stanford.edu/~boyd/ee102/laplace.pd
f
9
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Chapter 3
Power Series
Objectives
After completing this chapter, you will be able to:
Introduction
Power Series
In this lesson you will study several power series and discover that on the
intervals where they converge, they are equal to certain well known functions.
1
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Although you cannot enter infinitely many terms of this series in the Y=
Editor, you can graph partial sums of the series because each partial sum is a
polynomial with a finite number of terms.
The power series is a geometric series with first term 1 and common ratio x.
This means that the power series converges when | x | < 1 and converges to
2
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
On the interval (-1,1) the partial sums are close to y=1/(1-x). The interval
(-1,1) is called the interval of convergence for this power series because as the
number of terms in the partial sums increases, the partial sums converge to
y=1/(1-x) on that interval.
Maclaurin Series
In the previous lesson you explored several power series and their
relationships to the functions to which they converge. In this lesson you will start
with a function and find the power series that best converges to that function for
values of x near zero.
Use the fact that f and p are equal at x = 0 to find the value of c.
f(x) = ex f(0) = e0 = 1
p(x) = ax2 + bx + c p(0) = a(0)2 + b(0) + c = c
If c = 1, the function and the polynomial have the same value at x = 0, so p(x)
= ax2 + bx + 1.
Using a similar procedure, set the first derivatives equal when x = 0 and solve for b.
f '(x) = ex f '(0) = e0 = 1
p '(x) = 2ax + b p '(0) = 2a(0) + b = b
3
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
If b = 1, the function and the polynomial will have the same slope at x = 0, so p(x)
= ax2 + x + 1.
f "(x) = ex f "(0) = e0 = 1
p "(x) = 2a p "(0) = 2a
If a = 1/2, the function and the polynomial will have the same concavity at x = 0.
So approximates y = ex near x = 0.
The parabola has the same value, the same slope, and the same concavity
as y = ex when x = 0, and the quadratic polynomial is a good approximation
for y = ex when x is near 0.
Taylor Series
Given a function f that has all its higher order derivatives, the series
, where
is called the Taylor series for f centered at a. The Taylor series is a power series that
approximates the function f near x = a.
4
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
The partial sum is called the nth-order Taylor polynomial for f centered at a.
The second-order Taylor polynomial centered at 1 for the function f(x) = ex can
be found by using a procedure similar to the procedure given.
f(1) = e
f '(1) = e
Graphing the function and the polynomial illustrate that the polynomial is a good
approximation near x = 1.
5
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Video Links
Power Series
• https://www.youtube.com/watch?v=EGni2-m5yxM
• https://www.youtube.com/watch?v=DlBQcj_zQk0
• https://www.youtube.com/watch?v=LDBnS4c7YbA
• https://www.youtube.com/watch?v=3d6DsjIBzJ4
References
• http://education.ti.com/html/t3_free_courses/calcu
lus84_online/mod24/mod24_1.html
• Kreyszig, Erwin (2011). Advanced Engineering
Mathematics, 10th ed. Wiley
• https://blogs.ubc.ca/infiniteseriesmodule/units/unit-
3-power-series/taylor-series/the-taylor-series/
• https://blogs.ubc.ca/infiniteseriesmodule/units/unit-
3-power-series/power-series/the-power-series/
6
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Chapter 4
Fourier Series
Objectives
After completing this chapter, you will be able to:
▪ Define Fourier Series.
▪ Identify Complex-valued Function.
▪ Identify a superposition of an infinite number of sine
and cosine function.
INTRODUCTION
are:
The analysis process determines the weights, indexed by integer n, which is also the
number of cycles of the 𝒏𝒕𝒉 harmonic in the analysis interval. Therefore, the length of a
cycle, in the units of x, is P/n. And the corresponding harmonic frequency is n/P. The
𝑛 𝑛
𝒏𝒕𝒉 harmonic are sin(2𝜋𝑥 𝑃) andcos(2𝜋𝑥 𝑃), and their amplitudes (weights) are found by
integration over the interval of length P.
1
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
In general, integer N is theoretically infinite. Even so, the series might not
converge or exactly equate to s(x) at all values of x in the analysis interval. For the
"well-behaved" functions typical of physical processes, equality is customarily assumed.
2
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
3
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Complex-valued functions
Video Links
Fourier Series
• https://www.youtube.com/watch?v=vA9dfINW4Rg
• https://www.youtube.com/watch?v=KfRE744AFEE
• https://www.khanacademy.org/science/electrical-
engineering/ee-signals/ee-fourier-series/v/ee-
fourier-series-intro
4
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
References
• https://eng.libretexts.org/Bookshelves/Electrical
_Engineering/Book%3A_Electrical_Engineering_
(Johnson)/04%3A_Frequency_Domain/4.02%3A_
Complex_Fourier_Series
• https://en.wikipedia.org/wiki/Fourier_series
• https://eng.libretexts.org/Bookshelves/Electrical
_Engineering/Book%3A_Electrical_Engineering_
(Johnson)/04%3A_Frequency_Domain/4.03%3A_
Classic_Fourier_Series
5
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Chapter 5
Fourier Transform
Objectives
After completing this chapter, you will be able to:
▪ Identify the Fourier
▪ Understand the characteristic of a Fourier Transform
▪ Identify the branches of Fourier
Introduction
We’re about to make the transition from Fourier series to the Fourier transform.
“Transition” is the appropriate word, for in the approach we’ll take the Fourier transform
emerges as we pass from periodic to nonperiodic functions. To make the trip we’ll view
a nonperiodic function (which can be just about anything) as a limiting case of a periodic
function as the period becomes longer and longer. Actually, this process doesn’t
immediately produce the desired result. It takes a little extra tinkering to coax the
Fourier transform out of the Fourier series, but it’s an interesting approach. Fourier was
elected to the Académie des Sciences in 1817. During Fourier's eight remaining years
in Paris, he resumed his mathematical researches, publishing a number of important
articles. Fourier's work triggered later contributions on trigonometric series and the
theory of functions of real variable.
The Fourier transform is crucial to any discussion of time series analysis, and this
chapter discusses the definition of the transform and begins introducing some of the
ways it is useful.
As an aside, I don’t know if this is the best way of motivating the definition of the Fourier
transform, but I don’t know a better way and most sources you’re likely to check will just
present the formula as a done deal. It’s true that, in the end, it’s the formula and what
we can do with it that we want to get to, so if you don’t find the (brief) discussion to
follow to your tastes, I am not offended. called, variously, the top hat function (because
of its graph), the indicator function, or the characteristic function for the interval (−1/2,
1/2). While we have defined Π(±1/2) = 0, other common conventions are either to have
Π(±1/2) = 1 or Π(±1/2) = 1/2. And some people don’t define Π at ±1/2 at all, leaving two
holes in the domain. I don’t want to get dragged into this dispute. It almost never
matters, though for some purposes the choice Π(±1/2) = 1/2 makes the most sense.
We’ll deal with this on an exceptional basis if and when it comes up.
1
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Π(t) is not periodic. It doesn’t have a Fourier series. In problems you experimented a
little with periodizations, and I want to do that with Π but for a specific purpose. As a
periodic version of Π(t) we repeat the nonzero part of the function at regular intervals,
separated by (long) intervals where the function is zero. We can think of such a function
arising when we flip a switch on for a second at a time, and do so repeatedly, and we
keep it off for a long time in between the times it’s on. (One often hears the term duty
cycle associated with this sort of thing.) Here’s a plot of Π(t) periodized to have period
15.
Here are some plots of the Fourier coefficients of periodized rectangle functions with
periods 2, 4, and 16, respectively. Because the function is real and even, in each case
the Fourier coefficients are real, so these are plots of the actual coefficients, not their
square magnitudes.
We see that as the period increases the frequencies are getting closer and closer
together and it looks as though the coefficients are tracking some definite curve. (But
we’ll see that there’s an important issue here of vertical scaling.) We can analyze what’s
going on in this particular example, and combine that with some general statements to
lead us on. Recall that for a general function f(t) of period T the Fourier series has the
form
so that the frequencies are 0, ±1/T, ±2/T, . . .. Points in the spectrum are spaced 1/T
apart and, indeed, in the pictures above the spectrum is getting more tightly packed as
the period T increases. The n-th Fourier coefficient is given by
2
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
We see that as the period increases the frequencies are getting closer and closer
together and it looks as though the coefficients are tracking some definite curve. (But
we’ll see that there’s an important issue here of vertical scaling.) We can analyze what’s
going on in this particular example, and combine that with some general statements to
lead us on.
Recall that for a general function f(t) of period T the Fourier series has the form
so that the frequencies are 0, ±1/T, ±2/T, . . .. Points in the spectrum are spaced 1/T
apart and, indeed, in the pictures above the spectrum is getting more tightly packed as
the period T increases. The n-th Fourier coefficient is given by
Here’s a graph. You can now certainly see the continuous curve that the plots of
the discrete, scaled Fourier coefficients are shadowing.
The function sin πx/πx (written now with a generic variable x) comes up so often
in this subject that it’s given a name, sinc:
3
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
How general is this? We would be led to the same idea — scale the Fourier
coefficients by T — if we had started off periodizing just about any function with the
intention of letting T → ∞. Suppose f(t) is zero outside of |t| ≤ 1/2. (Any interval will do,
we just want to suppose a function is zero outside some interval so we can periodize.)
We periodize f(t) to have period T and compute the Fourier coefficients:
Where
Fourier transform defined There you have it. We now define the Fourier
transform of a function f(t) to be
For now, just take this as a formal definition; we’ll discuss later when such an
integral exists. We assume that f(t) is defined for all real numbers t. For any s ∈ R,
integrating f(t) against e−2πist with respect to t produces a complex valued function of
s, that is, the Fourier transform ˆf(s) is a complex-valued function of s ∈ R. If t has
dimension time then to make st dimensionless in the exponential e−2πist s must have
dimension 1/time.
While the Fourier transform takes flight from the desire to find spectral
information on a nonperiodic function, the extra complications and extra richness of
what results will soon make it seem like we’re in a much different world. The definition
just given is a good one because of the richness and despite the complications. Periodic
functions are great, but there’s more bang than buzz in the world to analyze. The
spectrum of a periodic function is a discrete set of frequencies, possibly an infinite set
(when there’s a corner) but always a discrete set. By contrast, the Fourier transform of a
nonperiodic signal produces a continuous spectrum, or a continuum of frequencies. It
may be that ˆf(s) is identically zero for |s| sufficiently large — an important class of
4
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
signals called bandlimited — or it may be that the nonzero values of ˆf(s) extend to ±∞,
or it may be that ˆf(s) is zero for just a few values of s. The Fourier transform analyzes a
signal into its frequency components. We haven’t yet considered how the corresponding
synthesis goes. How can we recover f(t) in the time domain from ˆf(s) in the frequency
domain?
Recovering f(t) from ˆf(s) We can push the ideas on nonperiodic functions as
limits of periodic functions a little further and discover how we might obtain f(t) from its
transform ˆf(s). Again suppose f(t) is zero outside some interval and periodize it to have
(large) period T. We expand f(t) in a Fourier series,
The inverse Fourier transform defined, and Fourier inversion, too The
integral we’ve just come up with can stand on its own as a “transform”, and so we define
the inverse Fourier transform of a function g(s) to be
Again, we’re treating this formally for the moment, withholding a discussion of
conditions under which the integral makes sense. In the same spirit, we’ve also
produced the Fourier inversion theorem. That is
A quick summary Let’s summarize what we’ve done here, partly as a guide to
what we’d like to do next. There’s so much involved, all of importance, that it’s hard to
avoid saying everything at once. Realize that it will take some time before everything is
in place.
• The domain of the Fourier transform is the set of real numbers s. One says that
ˆf is defined on the frequency domain, and that the original signal f(t) is defined on the
time domain (or the spatial domain, depending on the context). For a (nonperiodic)
signal defined on the whole real line we generally do not have a discrete set of
frequencies, as in the periodic case, but rather a continuum of frequencies.2 (We still do
call them “frequencies”, however.) The set of all frequencies is the spectrum of f(t).
◦ Not all frequencies need occur, i.e., ˆf(s) might be zero for some values of s.
Furthermore, it might be that there aren’t any frequencies outside of a certain range,
i.e.,
5
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
6
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
This is often the most unambiguous notation. Similarly, the operation of taking
the inverse Fourier transform is then denoted by F −1, and so
Finally, a function and its Fourier transform are said to constitute a “Fourier pair”,;
this is concept of ‘duality’ to be explained more precisely later. There have been various
notations devised to indicate this sibling relationship. One is
A warning on definitions Our definition of the Fourier transform is a standard one,
but it’s not the only one. The question is where to put the 2π: in the exponential, as we
have done; or perhaps as a factor out front; or perhaps left out completely. There’s also
a question of which the Fourier transform is and which is the inverse, i.e., which gets the
minus sign in the exponential. All of the various conventions are in day-to-day use in the
professions, and I only mention this now because when you’re talking with a friend over
drinks about the Fourier transform, be sure you both know which conventions are being
followed. I’d hate to see that kind of misunderstanding get in the way of a beautiful
friendship. Following the helpful summary provided by T. W. K¨orner in his book Fourier
analysis, I will summarize the many irritating variations. To be general, let’s write
7
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
For the Fourier transform we compute (using integration by parts, and the factoring trick
for the sine function):
It’s no accident that the Fourier transform of the triangle function turns out to be
the square of the Fourier transform of the rect function. It has to do with convolution, an
operation we have seen for Fourier series and will see anew for Fourier transforms in
the next chapter.
The exponential decay another commonly occurring function is the (one-sided)
exponential decay, defined by where a is a positive constant. This function models a
signal that is zero, switched on, and then decays exponentially. Here are graphs for a =
2, 1.5, 1.0, 0.5, 0.25
Which is which? If you can’t say, see the discussion on scaling the independent
variable at the end of this section. Back to the exponential decay, we can calculate its
Fourier transform directly
8
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Which is which? You’ll soon learn to spot that immediately, relative to the pictures in the
time domain, and it’s an important issue. Also note that |Ff(s)| 2 is an even function of s
even though Ff(s) is not. We’ll see why later. The shape of |Ff(s)| 2 is that of a “bell
curve”, though this is not Gaussian, a function we’ll discuss just below. The curve is
known as a Lorenz profile and comes up in analyzing the transition probabilities and
lifetime of the excited state in atoms.
How does the graph of f(ax) compare with the graph of f(x)? Let me remind you of
some elementary lore on scaling the independent variable in a function and how scaling
affect its graph. The question is how the graph of f(ax) compares with the graph of f(x)
when 0 <a< 1 and when a > 1; I’m talking about any generic function f(x) here. This is
very simple, especially compared to what we’ve done and what we’re going to do, but
you’ll want it at your fingertips and everyone has to think about it for a few seconds.
Here’s how to spend those few seconds.
Consider, for example, the graph of f(2x). The graph of f(2x), compared with the graph
of f(x), is squeezed. Why? Think about what happens when you plot the graph of f(2x)
over, say, −1 ≤ x ≤ 1. When x goes from −1 to 1, 2x goes from −2 to 2, so while you’re
plotting f(2x) over the interval from −1 to 1 you have to compute the values of f(x) from
−2 to 2. That’s more of the function in less space, as it were, so the graph of f(2x) is a
squeezed version of the graph of f(x). Clear? Similar reasoning shows that the graph of
f(x/2) is stretched. If x goes from −1 to 1 then x/2 goes from −1/2 to 1/2, so while you’re
plotting f(x/2) over the interval −1 to 1 you have to compute the values of f(x) from −1/2
to 1/2. That’s less of the function in more space, so the graph of f(x/2) is a stretched
version of the graph of f(x).
For Whom The Bell Curve Tolls
Let’s next consider the Gaussian
function and its Fourier transform. We’ll
need this for many examples and problems.
This function, the famous “bell shaped
curve”, was used by Gauss for various
statistical problems. It has some striking
properties with respect to the Fourier
transform which, on the one hand, give it a
special role within Fourier analysis, and on
the other hand allow Fourier methods to be
applied to other areas where the function
comes up. We’ll see an application to
probability and statistics in this graph.
9
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Now, the function f(x) = e−x2 does not have an elementary antiderivative, so this
integral cannot be found directly by an appeal to the Fundamental Theorem of Calculus.
The fact that it can be evaluated exactly is one of the most famous tricks in
mathematics. It’s due to Euler, and you shouldn’t go through life not having seen it. And
even if you have seen it, it’s worth seeing again; see the discussion following this
section.
Evaluation Of The Gaussian Integral
We want to evaluate Now we make a change of variables, introducing polar
coordinates, (r, θ). First, what about the limits of integration? To let both x and y range
from −∞ to ∞ is to describe the entire plane, and to describe the entire plane in polar
coordinates is to let r go from 0 to ∞ and θ go from 0 to 2π. Next, e−(x2+y2) becomes
e−r2 and the area element dx dy becomes r dr dθ. It’s the extra factor of r in the area
element that makes all the difference. With the change to polar coordinates we have
10
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
This might be a little confusing because you generally want to think of the two variables,
s and t, as somehow associated with separate and different domains, one domain for
the forward transform and one for the inverse transform, one for time and one for
frequency, while in each of these formulas one variable is used in both domains. You
have to get over this kind of confusion, because it’s going to come up again. Think
purely in terms of the math: The transform is an operation on a function that produces a
new function. To write down the formula I have to evaluate the transform at a variable,
but it’s only a variable and it doesn’t matter what I call it as long as I keep its role in the
formula straight. Also be observant what the notation in the formula says and, just as
important, what it doesn’t say. The first formula, for example, says what happens when
you first take the Fourier transform of f and then evaluate it at −s, it’s not a formula for
F(f(−s)) as in “first change s to −s in the formula for f and then take the transform”. I
could have written the first displayed equation as (Ff)(−s) = F −1f(s), with an extra
parentheses around the Ff to emphasize this, but I thought that looked too clumsy. Just
be careful, please.
are sometimes referred to as the “duality” property of the transforms. One also says that
“the Fourier transform pair f and Ff are related by duality”, meaning exactly these
relations. They look like different statements but you can get from one to the other. We’ll
set this up a little differently in the next section.
Duality and reversed signals There’s a slightly different take on duality that I prefer
because it suppresses the variables and so I find it easier to remember. Starting with a
signal f(t) define the reversed signal f – by
Duality and reversed signals There’s a slightly different take on duality that I prefer
because it suppresses the variables and so I find it easier to remember. Starting with a
signal f(t) define the reversed signal f – by
This identity is somewhat interesting in itself, as a variant of Fourier inversion. You can
check it directly from the integral definitions, or from our earlier duality results.6 Of
course then also
11
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
We can refine this if the function f(t) itself has symmetry. For example, combining the
last two results and remembering that a complex number is real if it’s equal to its
conjugate and is purely imaginary if it’s equal to minus its conjugate, we have:
• If f is real valued and even then its Fourier transform is even and real valued.
• If f is real valued and odd function then its Fourier transform is odd and purely
imaginary.
We saw this first point in action for Fourier transform of the rect function Π(t) and for the
triangle function Λ(t). Both functions are even and their Fourier transforms, sinc and
sinc2, respectively, are even and real. Good thing it worked out that way.
Linearity
One of the simplest and most frequently invoked properties of the Fourier transform is
that it is linear (operating on functions). This means:
12
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
If a < 0 the limits of integration are reversed when we make the substitution u = ax, and
so the resulting transform is (−1/a)Ff(s/a). Since −a is positive when a is negative, we
can combine the two cases and present the Stretch Theorem in its full glory:
This is also sometimes called the Similarity Theorem because changing the variable
from x to ax is a change of scale, also known as a similarity
There’s an important observation that goes with the stretch theorem. Let’s take a to be
positive, just to be definite. If a is large (bigger than 1, at least) then the graph of f(at) is
squeezed horizontally compared to f(t). Something different is happening in the
frequency domain, in fact in two ways. The Fourier transform is (1/a)F(s/a). If a is large
then F(s/a) is stretched out compared to F(s), rather than squeezed in. Furthermore,
multiplying by 1/a, since the transform is (1/a)F(a/s), also squashes down the values of
the transform. The opposite happens if a is small (less than 1). In that case the graph of
f(at) is stretched out horizontally compared to f(t), while the Fourier transform is
compressed horizontally and stretched vertically. The phrase that’s often used to
describe this phenomenon is that a signal cannot be localized (meaning concentrated at
a point) in both the time domain and the frequency domain. We will see more precise
formulations of this principle.
To sum up, a function stretched out in the time domain is squeezed in the frequency
domain, and vice versa. This is somewhat analogous to what happens to the spectrum
of a periodic function for long or short periods. Say the period is T, and recall that the
points in the spectrum are spaced 1/T apart, a fact we’ve used several times. If T is
large then it’s fair to think of the function as spread out in the time domain — it goes a
long time before repeating. But then since 1/T is small, the spectrum is squeezed. On
the other hand, if T is small then the function is squeezed in the time domain — it goes
13
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
only a short time before repeating — while the spectrum is spread out, since 1/T is
large.
Video Links
What is the Fourier Transform
• https://www.youtube.com/watch?v=spUNpyF58BY
Fourier Transform
• https://www.youtube.com/watch?v=ykNtIbtCR-8
References
• https://ieeexplore.ieee.org/stamp/stamp.jsp?arn
umber=7389485
• file:///C:/Users/Jhoy/Downloads/Chapter%202_F
ourier%20Transform.pdf
• http://web.ipac.caltech.edu/staff/fmasci/home/as
tro_refs/TheFourierTransform.pdf
• http://www.math.ncku.edu.tw/~rchen/2016%20Te
aching/Chapter%202_Fourier%20Transform.pdf
• http://www0.cs.ucl.ac.uk/teaching/GZ05/03-
fourier.pdf
14
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Chapter 6
Power Series Solution Of
Differential Equations
Objectives
After completing this chapter, you will be able to:
▪ Understand the Power Series
▪ Solve Power Series of Differential Equations
In mathematics, the power series method is used to seek a power series solution to
certain differential equations. In general, such a solution assumes a power series with
unknown coefficients, then substitutes that solution into the differential equation to find
a recurrence relation for the coefficients.
We want to continue the discussion of the previous lecture with the focus now on power
series, i.e., where the terms in the series contain powers of some parameter, typically a
dynamical variable, and we are expanding about the point where that parameter
vanishes. (A special case is the geometric series we discussed in Lecture 2.) The power
series then defines a function of that parameter. The standard examples are the Taylor
and Maclaurin series mentioned at the beginning of the previous lecture corresponding
to expanding in a series about x = 0 or 0 x x = , The coefficients n b are related to the
derivatives of the underlying, at least when the series is convergent, i.e., when the
series actually defines the function. Some specific examples of power series are
These series uniquely define the corresponding functions, whenever (i.e., at values of x
where) the series converges. When we apply the convergence tests of the previous
lecture to these expressions, we will define a range for the variable x within which the
series converges, i.e., a range in which the series expansion makes mathematical (and
physical) sense. This range for x is called the interval of convergence (for the power
1
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
series expansion). For values of x outside of this interval we will need to find different
expressions to define the underlying functions, i.e., the power series serves to define
the function only in the interval of convergence.
So the series absolutely converges for x 2 . At the endpoints, =1, x = 2 , the series
does not absolutely converge since both the p and s parameters of Eq. (2.29) vanish.
For x =−2 all terms in the series are positive and the series diverges ( S S 1 1 (− = − =
2 2 ) ( ) ). At the other endpoint, =1, x = 2 , we must be more careful due to the
alternating signs. We apply test 5) (from Lecture 2) to this series with (2 1 ) ( ) n n n n a
b = = − . Since lim 0 n n → a , the series again diverges. We can see both of these
results explicitly by looking at the series
Hence the function S x 1 ( ) is well defined (by the series, i.e., the series converges)
only on the open interval − 2 2 x (open means excluding the end points). For S x 2 (
) we find from the ratio test that
So the ratio test says that the series absolutely converges for x 1. At the endpoint x
=−1 the signs are all negative and the series diverges ) , just the negative Harmonic
series). At the other endpoint, x =1 , the signs alternate, test 5) is satisfied (and the
series converges conditionally. Hence the function is well defined (by the power series)
only on the semi-open interval. Note again that we must treat the endpoints carefully.
The third series may look like it is missing every other term (the even powers) and
therefore you may be confused about how to apply the ratio test. This is not an issue.
The idea of the ratio test is to always consider 2 contiguous terms in the series.
2
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Alternatively we can (re)write the series as (note that now every value of n contributes
to the sum)
Thus the series absolutely converges for x the open interval x . Since p = 1 2 1, 4 S
diverges at the end points. For x =−1 there are no alternating signs so S4 (−1) also
diverges. At x =−3 we have ( 1 1 ), which satisfies test 5) and the series conditionally
converges. Hence the interval of convergence for S x 4 ( ) is x .
Now that we have verified that we know how to determine the interval of convergence,
we should restate what is important about the interval. For values of x within that
interval of convergence the following statements are all true (and equivalent).
To further develop this discussion we state the following theorems (without proof), which
are true in the interval of convergence where the (infinite) power series can be treated
like a polynomial.
1. We can integrate or differentiate the series term-by-term to find a definition of the
corresponding integral or derivative of the function defined by the original series,
i.e., S x ( ) or S x dx ( ) . The resulting series has the same interval of
convergence as the original series, except perhaps for the behavior at the
endpoints (see below). [This is why series are so useful. Note that the effect of
integrating or taking a derivative is to introduce a factor of 1/(n+1) or n in the term
bn x n±1 , which does not change the interval of convergence (you should
convince yourself of this fact).]
2. If we have two series defining two functions with known intervals of convergence,
we can add, subtract or multiply the two series to define the corresponding
functions, S x S x 1 2 ( ) ( ) or S x S x 1 2 ( ) ( ) . The new series
convergences in the overlap (or common) interval of the original 2 intervals of
convergence. We can also think about the function defined by dividing the two
series, which with some manipulation we can express as a corresponding power
series. This function is well defined in the overlap interval of the original series
except points where the series in the denominator vanishes. At these points the
function may still be defined if the series in the numerator also vanishes
3
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
3. We can substitute one series into another series if the value of the first series is
within the interval of convergence of the second,
So let’s develop power series expansions for the functions we know and love.
We know the general form is given by Eq. (3.1). If we apply the Maclaurin
expansion definition to x e , we find (recall the essential property of the
exponential, x x de dx e = )
Again the interval of convergence is all values of x, x . ASIDE: Note that with
the definition 2 i −1, i −1 , it follows from Eqs. (3.9) and (3.11) that
4
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
This last result is called Euler’s formula (and is extremely useful!). We can obtain
more useful relations by using the four theorems above. Recall our discussion in the last
lecture of the geometric and Harmonic series. We had
Note that the integral changes the behavior at one endpoint from divergent in the
geometric case to conditionally convergent in the harmonic case, an example of the
change at the endpoints noted above in item I).
5
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
The interval of convergence is then given, i.e., the series converges absolutely for 0 2
x and converges conditionally at x = 2. To repeat, the function defined by a power
series expansion is well behaved within the interval of convergence. But clearly it is
important to consider what it means when the series expansion for a function diverges.
In general, the series will diverge where the original function is singular as is the case
for ln x at x = 0 in Eq. (3.20). However, the relationship between the series and the
function does not always work going the other way. A divergent series does not
necessarily mean a singular function. The various possibilities are the following.
1. The series diverges and the function is singular at the same point. This typically
occurs at the boundary of the interval of convergence as in the example of ln x at
x = 0.
2. The series may be divergent, but the function is well behaved. For example ln x
is well behaved for x 2 but the (specific) power series expansion in Eq. (3.20)
diverges. Similarly the function 1 1( + x) is well behaved everywhere except the
single point x =−1 while the power series expansion in Eq. (3.14) diverges for x
1 . The mathematics behind this behavior is most easily understood in the
formalism of complex variables as we will discuss next (where we will develop
the concept of a radius of convergence to replace the interval of convergence). In
any case, it is clear as we have already noted that outside of the interval of
convergence the series is no longer useful to define the function. However, it is
often possible to find a different power series expansion that is useful
(convergent) in a different interval of convergence.
3. The logic of how a power series expansion is used typically runs like the
following. We solve a differential equation by using the equation to solve for the
coefficients in a power series expansion of the solution Within the interval of
convergence of that series we succeed in summing the series and writing the
solution in closed form We use that closed form to define the solution to the
original differential equation over a much larger range in the variable. (The
essential subtext here is the uniqueness of the solution and the power series.)
4. A third and truly devious possibility is that the series looks “OK” but does not
accurately describe the function. This behavior is associated with points where
the function exhibits an essential singularity. Consider the function ( ) 2 1 x f x e−
= . The function and all of its derivatives vanish at x = 0 suggesting a power
series expansion about x = 0 of the (naïve) form f x( ) = 0 . But this is only true at
the isolated point x = 0 and has no finite interval of validity, because the function
has an essential singularity at the origin. A more useful series arises from
expanding about the point at infinity In this form it is clear that special care must
be taken at the point x = 0. If we know that a power series converges, a related
important issue, especially for physicists, is the question of how rapidly the series
is converging, i.e., what is the magnitude of the error if we truncate the series
after N terms? There are several useful relations that address this question,
which we present without derivation.
6
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Consider a Taylor series expansion about the point 0 x x = of the function f x( ) . We can
define a remainder as
where the point x lies somewhere in the interval between 0 x and x, i.e., the final sum in
Eq. (3.21) can be written in terms of the first term in the sum but with the derivative
evaluated at the point x rather than 0 x . While it may be difficult to determine the
precise value of the special point x , this expression provides an easy way to obtain an
approximation to the remainder.
This result is often derived as part of the introductory calculus course, but recent
experience suggests that we should review its derivation here. First note that functions
defined by powers series (as here) are smooth and continuous (and bounded, i.e., not
infinite) within the interval of convergence of the series in the sense that all derivatives
of the function are (well) defined by related power series expansions which converge in
the same interval (except possibly at the endpoints – see point I) on page 4). Next
consider performing some nested integrals of the N +1st derivative of f(x), which is
defined by its Taylor series. We have
7
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
We can evaluate this integral by using another calculus result – the mean value
theorem. Consider a general function g(x) that is continuous and smooth in an interval
that includes x0 to x. Thus in this interval the function g(y) will exhibit (unique) maximum
and minimum values such that g g y g min max ( ) for all y in the interval from x0 to
x. Further g(y) will take on all values in this range ( g g y g min max ( ) ) at least
once as y varies from x0 to x. Then it follows that the mean value for this interval,
defined by
There must at least one value of the variable y within the interval where the
function passes through its mean value. Now we can apply this result to Eq. (3.23) and
establish the existence of a value x such that
8
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
We see clearly that, as we approach the edge of the interval of convergence, the
convergence is slower, i.e., it takes more terms to obtain the same size remainder.
Finally let’s discuss how to use such power series expansions.
1. They are extremely useful in numerical work, e.g., when working on a computer.
Imagine that we wish to evaluate the difference between two very similar
functions, e.g., the difference between The functions are so similar that trying to
take the difference by simply evaluating the functions separately numerically
requires incredible numerical accuracy, while taking the difference first
analytically greatly simplifies the problem. Consider their
2. We can also turn our discussion around and use the power series expansions of
known functions in order to evaluate sums of interest. This application is
straightforward in principle, but requires some creativity. We used this approach
in Lecture 2 to evaluate the Harmonic series with alternating signs to find ln 2 .
Let’s do something more challenging here. Imagine that we want to sum the
series
Clearly we want to consider a function whose power series expansion has similar
coefficients, but at the same time we want a power series that we can sum after,
perhaps, simplifying the series using some simple operation like taking a derivative. In
this case define a new function by the power series
Now we simplify by taking a derivative, then perform the sum, and finally
integrate (by parts) to “undo” the derivative,
9
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Showing that this sum is 1 is not at all straightforward by other means. Similar
manipulations can often be used to sum a power series and obtain a closed expression
for the function defined by the power series. The point is to use manipulations allowed
within the interval of convergence. Consider the function defined by
The ratio test tells us that this series is convergent for x 1 . The explicit form
clearly suggests that integrating will yield a simpler expression, which we can sum, i.e.,
which we can recognize,
10
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Power series are also useful for performing integrals, at least numerically.
Consider the following example,
while the true answer to the same number of significant figures is 1.3780. Of course,
these days one is seldom far from a computer with Mathematica or Maple.
Power series expansions are also useful for evaluating indeterminate
mathematical forms. This application is essentially (a careful application of) L’Hopital’s
rule. Consider the example
Now expand the numerator (and denominator) in power series keeping the first
nonzero terms,
11
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Finally power series are useful in simplifying physics problems. The standard
example is that of a pendulum. With the pendulum’s orientation specified by a polar
angle (measured from the “down” direction) Newton’s equation is
Using just the first term in the power series for sin we obtain the (linearized) Harmonic
Oscillator problem
12
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Video Links
Power Series Solutions
• https://www.youtube.com/watch?v=oY0ItxI9xTk
Introduction to Power Series
• https://www.youtube.com/watch?v=DlBQcj_zQk0
Introduction to Power Series
• https://www.youtube.com/watch?v=DlBQcj_zQk0
References
• https://sites.math.northwestern.edu/~sweng/teachin
g/2018su/math224/notes/lecture12.pdf
• https://www3.nd.edu/~apilking/Math10560/Lectures/
Lecture%2030.pdf
• http://courses.washington.edu/ph227814/227/ellis/P
hys_227_08_Lec_3.pdf
13
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Chapter 7
Simultaneous Linear
and Non Linear
Objectives
After completing this chapter, you will be able to:
▪ Understand Simultaneous Linear and Non Linear
Equation
▪ Differentiate Linear and Non-Linear Equation
▪ Know about some back history of Simultaneous Linear
and Non Linear Equation
▪ Solve Simultaneous Linear and Non Linear Equation
Introduction
Sometimes, one is just not enough, and we need two variables to get the job done.
That is when simultaneous linear equations come into play. Imagine this situation, 2 bars
of Cadbury chocolate and 3 Mars bars cost Rs.270. At the same store, someone else
buys 4 bars of Cadbury chocolate and 2 Mars bars and pays Rs.300. What is the cost of
one bar of Cadbury chocolate and one Mars bar? These are the types of problems that
require you to solve for 2 variables and therefore the need arises for simultaneous
equations.
The training wheels are off from this point on and it’s time to dive into the deep end
of the middle school math pool.
1
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
As you can see, if we treat these equations with two variables as two
independent entities then we can find an infinite number of solutions for both xx and yy.
That is not helpful at all! Therefore, we write them as a system of simultaneous
equations. That means that the variables are the same and can have unique solutions
for both x and y, for example:
Now, we want to find the pair (x,y)(x,y) that satisfies both the equations at the same
time. Basically, what that means is that for all the infinite solutions to equation 1, there will
be a pair (x,y)(x,y) that also satisfies equation 2. That is what we are looking for. We can
visualize a system of simultaneous linear equations by drawing 2 linear graphs and
finding out their intersection point.
The red line represents all the solutions for equation 1, and the blue line,
solutions for equation 2. The intersection is the unique (x,y)(x,y) that we are looking for
which will satisfy both the equations.
• Graphically
• Substitution
• Elimination
2
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Take a look at the example below to understand the Substitution method: you can
observe how the values of the variables are obtained by substituting the variable in one
equation by its solution of the other equation. The key is to represent one variable as an
expression of the other variable.
Take a look at the image below to understand the Elimination method: In this method we
can eliminate one variable by multiplying each of the equations so that when one equation
is subtracted from the other, the resulting equation contains only one variable as the
coefficient of the other variable reduces to zero.
Some useful tips to help you speed up the process of solving simultaneous linear
equations:
• You can use the graphical, substitution or elimination methods to solve a system
of linear equations.
• Simultaneous linear equations may have a unique solution if they intersect at one
point
3
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
• One Unique solution is obtained if the 2 lines intersect at one particular point: this
will give an ordered pair (x,y)(x,y) of solution. If the 2 lines lie on top of each other,
it is simple to see that the 2 equations are just multiples of one another and they
will have an infinite number of ordered pairs (x,y)(x,y) as their solution. In some
cases, you may find that the 2 equations actually represent a set of parallel lines;
this will mean that the system has no solutions as the 2 lines will never intersect
one another.
• It is always recommended that you should substitute the ordered pair of
values (x,y)(x,y) back into the simultaneous linear equations to check whether it is
indeed the correct solution.
Two distinct lines always intersect at exactly one point unless they are parallel (have
the same slope).
The coordinates of the intersection point of the lines is the solution to the simultaneous
linear equations describing the lines. So we would normally expect a pair of
simultaneous equations to have just one solution.
2x + 3y = 7
4x + y = 9
From the graph we see that the point of intersection of the two lines is (2, 1)
4
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
If you solved the two equations using either Gaussian elimination or substitution, you
would obtain the same result.
Thus if a1x + b1y = c1 and a2x + b2y = c2 are parallel lines then .
And
So, if the above equation is true, the lines are parallel, they do not intersect, and the
system of linear equations has no solution.
x - y = -2
x - y = 1.
Using the method of substitution, we subtract the second equation from the first to
obtain: 0 = -3. This is a false statement and the system, therefore, has no solution. If
we look closer at the lines we see that they satisfy the condition
and are therefore parallel (as can be seen below). They do not intersect explaining why
the system of linear equations has no solution.
5
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
x-y=1
2x - 2y = 2
2x - 2y = 2
2x - 2y = 2
Now subtraction gives 0 = 0, which is true no matter what values x and y may have!
This time the two equations represent the same line, since both can be written in the
form y = x - 1.
Any point on this line has coordinates which will satisfy both equations, so there are an
infinite number of solutions!
In general, two equations represent the same line if one equation is a multiple of the
other. That is
There are then three possibilities for a pair of simultaneous linear equations:
(i) Just one solution (the usual situation - both lines are unique and not parallel to each
other)
(iii) Infinitely many solutions (the equations represent the same line, )
6
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
This method for solving a pair of simultaneous linear equations reduces one equation to
one that has only a single variable. Once this has been done, the solution is the same
as that for when one line was vertical or parallel. This method is known as
the Gaussian elimination method.
Example 2.
Equation 1: 2x + 3y = 8
Equation 2: 3x + 2y = 7
Step 1: Multiply each equation by a suitable number so that the two equations have the
same leading coefficient. An easy choice is to multiply Equation 1 by 3, the coefficient
of x in Equation 2, and multiply Equation 2 by 2, the x coefficient in Equation 1:
-(6x + 9y = 24
-(6x + 4y = 14)
5y = 10
y = 10/5 = 2
Step 4: Substitute y = 2 into either Equation 1 or Equation 2 above and solve for x. We'll
use Equation 1.
2x + 3(2) = 8
7
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
x=1
Solution: x = 1, y = 2 or (1,2).
Note that in a nonlinear system, one of your equations can be linear, just not all of them.
In this tutorial, we will be looking at systems that have only two equations and two
unknowns.
Ax + By + C = 0
Any equation that cannot be written in this form in nonlinear. The substitution method we
used for linear systems is the same method we will use for nonlinear systems. We solve
one equation for one variable and then substitute the result into the second equation to
solve for another variable, and so on. There is, however, a variation in the possible
outcomes.
In other words, it is where the two graphs intersect, what they have in
common. So if an ordered pair is a solution to one equation, but not the other,
then it is NOT a solution to the system.
Since we are looking at nonlinear systems, in some cases, there may be more
than one ordered pair that satisfies all equations in the system.
8
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
The equations of a system are dependent if ALL the solutions of one equation
are also solutions of the other equation. In other words, they end up being the
same graph.
There are three possible outcomes that you may encounter when working
with these systems:
Infinite Solutions
If the two graphs end up lying on top of each other, then there is an
infinite number of solutions. In this situation, they would end up being the
same graph, so any solution that would work in one equation is going to work
in the other.
If you get an infinite number of solutions for your final answer, is this system
consistent or inconsistent?
If you said consistent, you are right!
If you get an infinite number of solutions for your final answer, would the
equations be dependent or independent?
If you said dependent, you are correct!
The graph below illustrates a system of two equations and two unknowns
that has an infinite number of solutions:
9
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
It doesn't matter which equation you use or which variable you choose
to solve for.
If you need to solve for a variable, then try to pick one that has a 1 or -
1 as a coefficient. That way when you go to solve for it, you won't
have to divide by a number and run the risk of having to work with a
fraction (yuck!!).
Step 3: Substitute what you get for step 2 into the other equation.
This is why it is called the substitution method. Make sure that you
substitute the expression into the OTHER equation, the one you didn't
use in step 2.
Solve the equation set up in step 3 for the variable that is left.
Most of the equations in this step will end up being either linear or
quadratic. Once in awhile you will run into a different type of equation.
10
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Keep in mind that when you go to solve for this variable that you may
end up with no solution for your answer. For example, you may end
up with your variable equaling the square root of a negative number,
which is not a real number, which means there would be no solution.
If your variable drops out and you have a FALSE statement, that
means your answer is no solution.
If your variable drops out and you have a TRUE statement, that
means your answer is infinite solutions, which would be the
equation of the line.
If it makes at least one of them false, you need to go back and redo
the problem.
There are three possible types of solutions for a system of nonlinear equations involving
a parabola and a line
The graphs below illustrate possible solution sets for a system of equations involving a
parabola and a line.
• No solution. The line will never intersect the parabola.
• One solution. The line is tangent to the parabola and intersects the parabola at
exactly one point.
• Two solutions. The line crosses on the inside of the parabola and intersects the
parabola at two points.
11
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Solve the first equation for x and then substitute the resulting expression into the
second equation.
Solving for y gives y = 2 and y = 1. Next, substitute each value for y into the first
equation to solve for x. Always substitute the value into the linear equation to check for
extraneous solutions.
12
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
The solutions are (1,2) and (0,1), which can be verified by substituting these (x,
y)values into both of the original equations
13
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Video Links
How to - Solve Simultaneous Equation
• https://www.youtube.com/watch?v=YoVY5JwAF9o
Simultaneous equations (linear and non-linear)
• https://www.youtube.com/watch?v=ozP-vf99DK4
Simultaneous Equations
• https://www.youtube.com/watch?v=cWbZqWgsuY8
References
• https://www.cuemath.com/algebra/simultaneous
-linear-equations/
• https://courses.lumenlearning.com/waymakercol
legealgebra/chapter/methods-for-solving-a-
system-of-nonlinear-equations/
• http://mathsfirst.massey.ac.nz/Algebra/Systems
ofLinEq.htm
14
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Chapter 8
Numerical Differentiation
and Integration
Objectives
After completing this chapter, you will be able to:
▪ Understand Differentiation and Integration
▪ Differentiate Differentiation and Integration
▪ Know about some back history of Numerical
Differentiation and Integration
▪ Solve Differentiation and Integration problem
1
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Basic Concepts
This chapter deals with numerical approximations of derivatives. The first questions that
comes up to mind is: why do we need to approximate derivatives at all? After all, we do
know how to analytically differentiate every function. Nevertheless, there are several
reasons as of why we still need to approximate derivatives:
• There are some cases where it may not be obvious that an underlying function
exists and all that we have is a discrete data set. We may still be interested in
studying changes in the data, which are related, of course, to derivatives.
• There are times in which exact formulas are available but they are very
complicated to the point that an exact computation of the derivative requires a lot
of function evaluations. It might be significantly simpler to approximate the
derivative instead of computing its exact value.
where we assume that h > 0. What do we mean when we say that the expression
on the right-hand-side of (5.1) is an approximation of the derivative? For linear functions
(5.1) is actually an exact expression for the derivative. For almost all other functions, (5.1)
is not the exact derivative.
2
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
The second term on the right-hand-side of (5.3) is the error term. Since the
approximation (5.1) can be thought of as being obtained by truncating this term from the
exact formula (5.3), this error is called the truncation error. The small parameter h denotes
the distance between the two points x and x+h. As this distance tends to zero, i.e., h →
0, the two points approach each other and we expect the approximation (5.1) to improve.
This is indeed the case if the truncation error goes to zero, which in turn is the case if f
00(ξ) is well defined in the interval (x, x+h). The “speed” in which the error goes to zero
as h → 0 is called the rate of convergence. When the truncation error is of the order of
O(h), we say that the method is a first order method. We refer to a methods as a p th-
order method if the truncation error is of the order of O(h p ).
It is possible to write more accurate formulas than (5.3) for the first derivative. For
example, a more accurate approximation for the first derivative that is based on the values
of the function at the points f(x−h) and f(x+h) is the centered differencing formula
Let’s verify that this is indeed a more accurate formula than (5.1). Taylor
expansions of the terms on the right-hand-side of (5.4) are
3
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
which means that the expression (5.4) is a second-order approximation of the first
derivative. In a similar way we can approximate the values of higher-order derivatives.
For example, it is easy to verify that the following is a second-order approximation of the
second derivative
where we assume that ξ ∈ (x − h, x + h) and that f(x) has four continuous derivatives in
the interval. Hence, the approximation (5.6) is indeed a second-order approximation of
the derivative, with a truncation error that is given b
4
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Numerical Integration
In analysis, numerical integration comprises a broad family of algorithms for
calculating the numerical value of a definite integral, and by extension, the term is also
sometimes used to describe the numerical solution of differential equations. This article
focuses on calculation of definite integrals. The term numerical quadrature (often
abbreviated to quadrature) is more or less a synonym for numerical integration, especially
as applied to one-dimensional integrals. Some authors refer to numerical integration over
more than one dimension as cubature; others take quadrature to include higher-
dimensional integration.
The basic problem in numerical integration is to compute an approximate solution
to a definite integral
to a given degree of accuracy. If f(x) is a smooth function integrated over a small number
of dimensions, and the domain of integration is bounded, there are many methods for
approximating the integral to the desired precision.
History
The term "numerical integration" first appears in 1915 in the publication A Course
in Interpolation and Numeric Integration for the Mathematical Laboratory by David Gibb.
Quadrature is a historical mathematical term that means calculating area. Quadrature
problems have served as one of the main sources of mathematical
analysis. Mathematicians of Ancient Greece, according to the Pythagorean doctrine,
understood calculation of area as the process of constructing geometrically
a square having the same area (squaring). That is why the process was
named quadrature. For example, a quadrature of the circle, Lune of Hippocrates, The
Quadrature of the Parabola. This construction must be performed only by means
of compass and straightedge.
The ancient Babylonians used the trapezoidal rule to integrate the motion of Jupiter along
the ecliptic
• The area of the surface of a sphere is equal to quadruple the area of a great circle of
this sphere.
• The area of a segment of the parabola cut from it by a straight line is 4/3 the area of
the triangle inscribed in this segment.
For the proof of the results Archimedes used the Method of exhaustion of Eudoxus.
In medieval Europe the quadrature meant calculation of area by any method. More
often the Method of indivisibles was used; it was less rigorous, but more simple and
powerful. With its help Galileo Galilei and Gilles de Roberval found the area of
a cycloid arch, Grégoire de Saint-Vincent investigated the area under a hyperbola (Opus
Geometricum, 1647), and Alphonse Antonio de Sarasa, de Saint-Vincent's pupil and
commentator, noted the relation of this area to logarithms.
John Wallis algebrised this method: he wrote in his Arithmetica Infinitorum (1656) series
that we now call the definite integral, and he calculated their values. Isaac
Barrow and James Gregory made further progress: quadratures for some algebraic
curves and spirals. Christiaan Huygens successfully performed a quadrature of
some Solids of revolution.
The quadrature of the hyperbola by Saint-Vincent and de Sarasa provided a
new function, the natural logarithm, of critical importance.
With the invention of integral calculus came a universal method for area
calculation. In response, the term quadrature has become traditional, and instead the
modern phrase "computation of a univariate definite integral" is more common.
6
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
1. The integrand f(x) may be known only at certain points, such as obtained
by sampling. Some embedded systems and other computer applications may
need numerical integration for this reason.
2. A formula for the integrand may be known, but it may be difficult or impossible to
find an antiderivative that is an elementary function. An example of such an
integrand is f(x) = exp(−x2), the antiderivative of which (the error function, times a
constant) cannot be written in elementary form.
3. It may be possible to find an antiderivative symbolically, but it may be easier to
compute a numerical approximation than to compute the antiderivative. That may
be the case if the antiderivative is given as an infinite series or product, or if its
evaluation requires a special function that is not available.
Methods for one-dimensional integrals
Numerical integration methods can generally be described as combining
evaluations of the integrand to get an approximation to the integral. The integrand is
evaluated at a finite set of points called integration points and a weighted sum of these
values is used to approximate the integral. The integration points and weights depend on
the specific method used and the accuracy required from the approximation.
An important part of the analysis of any numerical integration method is to study the
behavior of the approximation error as a function of the number of integrand evaluations.
A method that yields a small error for a small number of evaluations is usually considered
superior. Reducing the number of evaluations of the integrand reduces the number of
arithmetic operations involved, and therefore reduces the total round-off error. Also, each
evaluation takes time, and the integrand may be arbitrarily complicated.
A 'brute force' kind of numerical integration can be done, if the integrand is reasonably
well-behaved (i.e. piecewise continuous and of bounded variation), by evaluating the
integrand with very small increments.
We have now seen some of the most generally useful methods for discovering
antiderivatives, and there are others. Unfortunately, some functions have no simple
antiderivatives; in such cases if the value of a definite integral is needed it will have to be
approximated. We will see two methods that work reasonably well and yet are fairly
simple; in some cases more sophisticated techniques will be needed.
7
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
The integral of f(x) can then be approximated by the integral of Pn(x), i.e.,
Note that if we want to integrate several different functions, and use their values at
the same points (x0, . . . , xn), the quadrature coefficients (6.8) should be computed only
once, since they do not depend on the function that is being integrated. If we change the
interpolation/integration points, then we must recompute the quadrature coefficients. For
equally spaced points, x0, . . . , xn, a numerical integration formula of the form.
8
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Example
The composite trapezoidal rule is obtained by applying the trapezoidal rule in each
subinterval [xi−1, xi ], i = 1, . . . , n, i.e.,
A particular case is when these points are uniformly spaced, i.e., when all intervals
have an equal length. For example, if
9
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
The notation of a sum with two primes, , means that we sum over all the terms
with the exception of the first and last terms that are being divided by 2. We can also
compute the error term as a function of the distance between neighboring points, h. We
know from (6.11) that in every subinterval the quadrature error is
Clearly
10
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Video Links
• https://www.youtube.com/watch?v=tcqsLqIyjmk
Numerical Integration With Trapezoidal and Simpson's
Rule
• https://www.youtube.com/watch?v=RTX-ik_8i-k
• https://www.youtube.com/watch?v=7EqRRuh-5Lk
References
• https://en.wikipedia.org/wiki/Numerical_differentiatio
n
• http://www2.math.umd.edu/~dlevy/classes/amsc466
/lecture-notes/differentiation-chap.pdf
• https://www.whitman.edu/mathematics/calculus_onli
ne/section14.01.html
• https://mathworld.wolfram.com/NumericalDifferentiat
ion.html
• https://en.wikipedia.org/wiki/Numerical_integration#:
~:text=In%20analysis%2C%20numerical%20integra
tion%20comprises,on%20calculation%20of%20defi
nite%20integrals.
11
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Chapter 9
Ordinary and Partial
Differential Equations
Objectives
After completing this chapter, you will be able to:
▪ Solve Ordinary and Partial Differential Equations
▪ Understand the characteristic of a Ordinary and
Partial Differential Equations
▪ Identify the Classification of Ordinary and Partial
Differential Equations
Introduction
among ordinary differential equations, linear differential equations play a prominent role
for several reasons. Most elementary and special functions that are encountered
in physics and applied mathematics are solutions of linear differential equations
(see Holonomic function). When physical phenomena are modeled with non-linear
equations, they are generally approximated by linear differential equations for an easier
solution. The few non-linear ODEs that can be solved explicitly are generally solved by
transforming the equation into an equivalent linear ODE (see, for example Riccati
equation).
Some ODEs can be solved explicitly in terms of known functions and integrals. When
that is not possible, the equation for computing the Taylor series of the solutions may be
useful. For applied problems, numerical methods for ordinary differential equations can
supply an approximation of the solution.
1
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
2
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
where a i (x) and r (x) are continuous functions of x.The function r(x) is called the source
term, leading to two further important classifications:
Homogeneous
If r(x) = 0, and consequently one "automatic" solution is the trivial solution, y = 0.
The solution of a linear homogeneous equation is a complementary function, denoted
here by yc.
Non-Linear
A differential equation that cannot be written in the form of a linear combination.
System Of ODEs
A number of coupled differential equations form a system of equations. If y is a
vector whose elements are functions; y(x) = [y1(x), y2(x),..., ym(x)], and F is a vector-
valued function of y and its derivatives, then
3
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
For a system of the form , some sources also require that the Jacobian
matrix be non-singular in order to call this an implicit ODE [system]; an implicit ODE
system satisfying this Jacobian non-singularity condition can be transformed into an
explicit ODE system. In the same sources, implicit ODE systems with a singular Jacobian
are termed differential algebraic equations (DAEs). This distinction is not merely one of
terminology; DAEs have fundamentally different characteristics and are generally more
involved to solve than (nonsingular) ODE systems. Presumably for additional derivatives,
the Hessian matrix and so forth are also assumed non-singular according to this scheme,
although note that order, which makes the Jacobian singularity criterion sufficient for this
taxonomy to be comprehensive at all orders.
The behavior of a system of ODEs can be visualized through the use of a phase
portrait.
Singular Solutions
The theory of singular solutions of ordinary and partial differential equations was a
subject of research from the time of Leibniz, but only since the middle of the nineteenth
century has it received special attention. A valuable but little-known work on the subject
is that of Houtain (1854). Darboux (from 1873) was a leader in the theory, and in the
geometric interpretation of these solutions he opened a field worked by various writers,
notably Casorati and Cayley. To the latter is due (1872) the theory of singular solutions
of differential equations of the first order as accepted circa 1900.
Reduction To Quadratures
The primitive attempt in dealing with differential equations had in view a reduction
to quadratures. As it had been the hope of eighteenth-century algebraists to find a method
for solving the general equation of the nth degree, so it was the hope of analysts to find
4
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
a general method for integrating any differential equation. Gauss (1799) showed,
however, that complex differential equations require complex numbers. Hence, analysts
began to substitute the study of functions, thus opening a new and fertile
field. Cauchy was the first to appreciate the importance of this view. Thereafter, the real
question was no longer whether a solution is possible by means of known functions or
their integrals, but whether a given differential equation suffices for the definition of a
function of the independent variable or variables, and, if so, what are the characteristic
properties.
Fuchsian Theory
Two memoirs by Fuchs inspired a novel approach, subsequently elaborated by
Thomé and Frobenius. Collet was a prominent contributor beginning in 1869. His method
for integrating a non-linear system was communicated to Bertrand in
1868. Clebsch (1873) attacked the theory along lines parallel to those in his theory
of Abelian integrals. As the latter can be classified according to the properties of the
fundamental curve that remains unchanged under a rational transformation, Clebsch
proposed to classify the transcendent functions defined by differential equations
according to the invariant properties of the corresponding surfaces f = 0 under rational
one-to-one transformations.
Lie's Theory
From 1870, Sophus Lie's work put the theory of differential equations on a better
foundation. He showed that the integration theories of the older mathematicians can,
using Lie groups, be referred to a common source, and that ordinary differential equations
that admit the same infinitesimal transformations present comparable integration
difficulties. He also emphasized the subject of transformations of contact.
Lie's group theory of differential equations has been certified, namely: (1) that it unifies
the many ad hoc methods known for solving differential equations, and (2) that it provides
powerful new ways to find solutions. The theory has applications to both ordinary and
partial differential equations.
Sturm–Liouville Theory
Sturm–Liouville theory is a theory of a special type of second order linear ordinary
differential equation. Their solutions are based on eigenvalues and
corresponding eigenfunctions of linear operators defined via second-order homogeneous
linear equations. The problems are identified as Sturm-Liouville Problems (SLP) and are
named after J.C.F. Sturm and J. Liouville, who studied them in the mid-1800s. SLPs have
an infinite number of eigenvalues, and the corresponding eigenfunctions form a complete,
orthogonal set, which makes orthogonal expansions possible. This is a key idea in applied
mathematics, physics, and engineering. SLPs are also useful in the analysis of certain
partial differential equations.
Existence And Uniqueness Of Solutions
Local Existence And Uniqueness Theorem Simplified
The theorem can be stated simply as follows. For the equation and initial value
problem:
5
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
in the x-y plane, where a and b are real (symbolically: a, b ∈ ℝ) and × denotes
the cartesian product, square brackets denote closed intervals, then there is an interval
for some h ∈ ℝ where the solution to the above equation and initial value problem can
be found. That is, there is a solution and it is unique. Since there is no restriction on F to
be linear, this applies to non-linear equations that take the form F(x, y), and it can also
be applied to systems of equations.
Global Uniqueness And Maximum Domain Of Solution
When the hypotheses of the Picard–Lindale theorem are satisfied, then local
existence and uniqueness can be extended to a global result. More precisely:
For each initial condition (x0, y0) there exists a unique maximum (possibly infinite) open
interval
such that any solution that satisfies this initial condition is a restriction of the solution
that satisfies this initial condition with domain In the case that , there are exactly two
possibilities.
6
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
7
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
The term exact solution is often used for second- and higher-order nonlinear PDEs to
denote a particular solution (see also Preliminary remarks at Second-Order Partial
Differential Equations).
Partial differential equations are used to mathematically formulate, and thus aid the
solution of, physical and other problems involving functions of several variables, such as
the propagation of heat or sound, fluid flow, elasticity, electrostatics, electrodynamics, etc.
General Form Of First-Order Partial Differential Equation
A first-order partial differential equation with n independent variables has the
general form
8
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
where C1 and C2 are arbitrary constants; such particular solutions are known
as integrals of system. Then the general solution to equation can be written as
where Φ is an arbitrary function of two variables. With equation (4) solved for u2, one
often specifies the general solution in the form u2=Ψ(u1), where Ψ(u) is an arbitrary
function of one variable.
Remark. If h(x,y,w)≡0, then w=C2 can be used as the second integral in (3).
Example. Consider the linear equation
9
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
10
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
References
• https://en.wikipedia.org/wiki/Ordinary_differentia
l_equation#CITEREFSimmons1972
• http://www.scholarpedia.org/article/Partial_differ
ential_equation
• https://www.britannica.com/science/partial-
differential-eq
Video Links
• https://www.youtube.com/watch?v=y3dHKNl-q-k
ODEs: Introduction to Ordinary Differential Equations
• https://www.youtube.com/watch?v=bP2Ug7Mm5RE
• https://www.youtube.com/watch?v=oICjqJwRbqY&li
st=PLYdroRCLMg5MLOnfJMyu4nLxtKOD7gGpz&in
dex=2
11
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Chapter 10
Optimization
Objectives
After completing this chapter, you will be able to:
▪ Understand about Optimization
▪ Identify and understand the Characterize the major
subfields of Optimization
▪ Classification of critical points and extrema
▪ Apply the Computational optimization techniques
and Application of Optimization
Introduction
Optimization is a general term used to describe types of problems and solution
techniques that are concerned with the best (“optimal”) allocation of limited resources in
projects. The problems are called optimization problems and the methods optimization
methods. Typical problems are concerned with planning and making decisions, such as
selecting an optimal production plan. A company has to decide how many units of each
product from a choice of (distinct) products it should make. The objective of the company
may be to maximize overall profit when the different products have different individual
profits.
1
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Linear Programming
Interest in linear programming has also extended to economics. In 1937 the Hungarian-
born mathematician John von Neumann analyzed a steadily expanding economy based
on alternative methods of production and fixed technological coefficients. As far as
mathematical history is concerned, the study of linear inequality systems excited virtually
no interest before 1936. In 1911 a vertex-to-vertex movement along edges of
a polyhedron (as is done in the simplex method) was suggested as a way to solve a
problem that involved optimization, and in 1941 movement along edges was proposed
for a problem involving transportation. Credit for laying much of the mathematical
foundations should probably go to von Neumann. In 1928 he published his famous paper
2
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
on game theory, and his work culminated in 1944 with the publication, in collaboration
with the Austrian economist Oskar Morgenstern, of the classic Theory of Games and
Economic Behaviour. In 1947 von Neumann conjectured the equivalence of linear
programs and matrix games, introduced the important concept of duality, and made
several proposals for the numerical solution of linear programming and game problems.
Serious interest by other mathematicians began in 1948 with the rigorous development
of duality and related matters.
Basic ideas
A simple problem in linear programming is one in which it is necessary to find the
maximum (or minimum) value of a simple function subject to certain constraints. An
example might be that of a factory producing two commodities. In any production run, the
factory produces x1 of the first type and x2 of the second. If the profit on the second type
is twice that on the first, then x1 + 2x2 represents the total profit. The function x1 + 2x2 is
known as the objective function.
The simplex method
To illustrate the simplex method, the example from the preceding section will be
solved again. The problem is first put into canonical form by converting the linear
inequalities into equalities by introducing “slack variables” x3 ≥ 0 (so that x1 + x3 = 8), x4 ≥
0 (so that x2 + x4 = 5), x5 ≥ 0 (so that x1 + x2 + x5 = 10), and the variable x0 for the value
of the objective function (so that x1 + 2x2 − x0 = 0). The problem may then be restated as
that of finding nonnegative quantities x1, …, x5 and the largest possible x0 satisfying the
resulting equations. One obvious solution is to set the objective variables x1 = x2 = 0,
which corresponds to the extreme point at the origin. If one of the objective variables is
increased from zero while the other one is fixed at zero, the objective value x0 will
increase as desired (subject to the slack variables satisfying the equality constraints). The
variable x2 produces the largest increase of x0 per unit change; so it is used first. Its
increase is limited by the nonnegativity requirement on the variables. In particular, if x2 is
increased beyond 5, x4 becomes negative.
Standard formulation
Nonlinear Programming
3
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Although the linear programming model works fine for many situations, some problems
cannot be modeled accurately without including nonlinear components. One example
would be the isoperimetric problem: determine the shape of the closed plane curve having
a given length and enclosing the maximum area. The solution, but not a proof, was known
by Pappus of Alexandria c. 340 CE:
An important early algorithm for solving nonlinear programs was given by the Nobel
Prize-winning Norwegian economist Ragnar Frisch in the mid-1950s. Curiously, his
approach fell out of favour for some decades, reemerging as a viable and competitive
approach only in the 1990s. Other important algorithmic approaches include sequential
quadratic programming, in which an approximate problem with a quadratic objective and
linear constraints is solved to obtain each search step; and penalty methods, including
the “method of multipliers,” in which points that do not satisfy the constraints incur penalty
terms in the objective to discourage algorithms from visiting them.
4
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Feasibility problem
The satisfiability problem, also called the feasibility problem, is just the problem of
finding any feasible solution at all without regard to objective value. This can be regarded
5
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
as the special case of mathematical optimization where the objective value is the same
for every solution, and thus any solution is optimal.
Many optimization algorithms need to start from a feasible point. One way to obtain such
a point is to relax the feasibility conditions using a slack variable; with enough slack, any
starting point is feasible. Then, minimize that slack variable until the slack is null or
negative.
Existence
The extreme value theorem of Karl Weierstrass states that a continuous real-
valued function on a compact set attains its maximum and minimum value. More
generally, a lower semi-continuous function on a compact set attains its minimum; an
upper semi-continuous function on a compact set attains its maximum point or view.
Necessary conditions for optimality
One of Fermat's theorems states that optima of unconstrained problems are found
at stationary points, where the first derivative or the gradient of the objective function is
zero (see first derivative test). More generally, they may be found at critical points, where
the first derivative or gradient of the objective function is zero or is undefined, or on the
boundary of the choice set. An equation (or set of equations) stating that the first
derivative(s) equal(s) zero at an interior optimum is called a 'first-order condition' or a set
of first-order conditions.
Optima of equality-constrained problems can be found by the Lagrange
multiplier method. The optima of problems with equality and/or inequality constraints can
be found using the 'Karush–Kuhn–Tucker conditions'.
Sufficient conditions for optimality
While the first derivative test identifies points that might be extrema, this test does
not distinguish a point that is a minimum from one that is a maximum or one that is neither.
When the objective function is twice differentiable, these cases can be distinguished by
checking the second derivative or the matrix of second derivatives (called the Hessian
matrix) in unconstrained problems, or the matrix of second derivatives of the objective
function and the constraints called the bordered Hessian in constrained problems. The
conditions that distinguish maxima, or minima, from other stationary points are called
'second-order conditions' (see 'Second derivative test'). If a candidate solution satisfies
the first-order conditions, then the satisfaction of the second-order conditions as well is
sufficient to establish at least local optimality.
Sensitivity And Continuity Of Optima
The envelope theorem describes how the value of an optimal solution changes
when an underlying parameter changes. The process of computing this change is
called comparative statics. The maximum theorem of Claude Berge (1963) describes the
continuity of an optimal solution as a function of underlying parameters.
Calculus Of Optimization
Finding the points where the gradient of the objective function is zero (that is, the
stationary points). More generally, a zero subgradient certifies that a local minimum has
been found for minimization problems with convex functions and other locally Lipschitz
6
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
functions. Further, critical points can be classified using the definiteness of the Hessian
matrix: If the Hessian is positive definite at a critical point, then the point is a local
minimum; if the Hessian matrix is negative definite, then the point is a local maximum;
finally, if indefinite, then the point is some kind of saddle point.
Constrained problems can often be transformed into unconstrained problems with the
help of Lagrange multipliers. Lagrangian relaxation can also provide approximate
solutions to difficult constrained problems.
When the objective function is a convex function, then any local minimum will also be a
global minimum. There exist efficient numerical techniques for minimizing convex
Functions, Such As Interior-Point Methods.
Literative Methods
The iterative methods used to solve problems of nonlinear programming differ according
to whether they evaluate Hessians, gradients, or only function values. While evaluating
Hessians (H) and gradients (G) improves the rate of convergence, for functions for which
these quantities exist and vary sufficiently smoothly, such evaluations increase
the computational complexity (or computational cost) of each iteration. In some cases,
the computational complexity may be excessively high.
• Methods that evaluate Hessians (or approximate Hessians, using finite differences):
o Newton's method
o Sequential quadratic programming: A Newton-based method for small-medium
scale constrained problems. Some versions can handle large-dimensional
problems.
o Interior point methods: This is a large class of methods for constrained
optimization. Some interior-point methods use only (sub)gradient information and
others of which require the evaluation of Hessians.
• Methods that evaluate gradients, or approximate gradients in some way (or even
subgradients):
7
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Electrical Engineering
Some common applications of optimization techniques in electrical
engineering include active filter design, stray field reduction in superconducting magnetic
energy storage systems, space mapping design of microwave structures, handset
8
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
9
ADVANCED ENGINEERING MATHEMATICS FOR ECE MODULE
Video Links
Introduction to Optimization
• https://www.youtube.com/watch?v=I1JqGiG_P_w
• https://www.youtube.com/watch?v=AoJQS10Ewn4
References
• https://www.britannica.com/science/optimization
• https://en.wikipedia.org/wiki/Mathematical_optimizati
on
• https://www.macmillanexplorers.com/optimization-
and-linear-programming/17836282
10