Optimization Hansen
Optimization Hansen
Some of the more basic mathematical concepts you will need to review for this class are
the slope and intercept formula for a linear function. How do changes in the formula
affect the shape of the line? Is a slope of –2 steeper or flatter than a slope of –5? What is
the slope of a horizontal line (zero) and a vertical line (infinity).
We will also consider the concept of equilibrium, which involves the solution of a
number of equations simultaneously (i.e. finding values for several variables which solve
several equations at once). The most obvious example of solving for equilibrium in
economics is finding the price and quantity at which supply equals demand.
You will need to be comfortable with basic differentiation rules and procedures. We will
use derivatives to talk about increasing or decreasing functions and for optimizing
(finding the minimum or maximum of a function using the first order and second order
conditions). What follows below is a review of important concepts on optimization.
Optimization:
The major mathematical concept that we will use the most in Intermediate
Microeconomic Theory is the optimum of a function. Optima are either maxima or
minima. Most individual household of firm decisions in economics involve
maximization (of utility, profits) or minimization (of expenditure, cost) and these
decisions also often involve constraints that need to be taken into consideration (certain
amount of income to spend, cost levels, etc.).
When dΠ/dQ>0, Π is an increasing function of Q, Π moves in the same direction with Q.
When dΠ/dQ<0, Π is a decreasing function of Q, Π moves in opposite direction from Q.
When dΠ/dQ=0, the function has a stationary point (either maximum or minimum).
Example: Π = Π (Q)= -2Q2+ 24Q +5
dΠ/dQ= -4Q +24, so this function is increasing for -4Q+24>0 or when Q<6.
The function is decreasing for -4Q +24 <0 or when Q >6.
The function has a stationary point at -4Q +24= 0 or when Q = 6.
To determine whether you have a maximum or a minimum you need to check the second
derivative of the function at that value for which you found the stationary point. If f"<0,
the stationary point is a maximum. If f">0, the stationary point is a minimum. These are
the Second Order Conditions for a maximum and a minimum. (Functions which have
maxima look like hills. Functions which have minima look like valleys).
Most of the problems we will tackle in Intermediate Microeconomic Theory will involve
more complex functions than the ones described above. We will need to use techniques
from multivariate calculus to deal with this type of function. Now, we have y = f(x, z)
instead of y = f(x). (The Utility function is often expressed as a function of quantities of
different goods people can purchase. Output is expressed as a function of the quantities
of different inputs required in the production).
These functions are difficult to draw (they are at least three dimensional) so we use a
mapmaker’s technique. Pick a given level of the dependent variable and then plot
combinations of the independent variables that give you that level. This process is how
we get indifference curves and isoquants. It is like putting a mountain into two
dimensions; the construction will be helpful to us in illustrating optima in a multivariate
world
How to compute the optimum: keep thinking about the mountain analogy. Finding an
optimum requires getting to the top of the mountain and making sure you are at the
highest point in all directions. The point you find must be an optimum on the north/south
axis as well as on the east/west axis. You use partial derivatives to do this.
When taking a partial derivative, you take the derivative of the function with respect to
ONE of the independent variables while holding all of the other independent variables
constant (you treat them as if they were constant). You can take a partial derivative of a
multivariate function for each of the independent variables. Essentially, you can hold one
direction constant and look at what’s happening in the other direction (i.e. find the
optimum) and then hold the other direction constant and find the optimum in the first
direction. The true “top of the mountain” is that point at which you have a maximum in
all directions.
Partial derivatives of a function like y = f(x, z) are written as ∂y/∂x and ∂y/∂z and can be
interpreted as “the derivative of y with respect to x holding everything else constant.”
(This interpretation is just like the economic use of the phrase ceteris paribus). To take a
partial derivative with respect to one of the independent variables, just treat all the other
independent variables as constants (as if they were some fixed number).
You can also take second partial derivatives to see how the first partial change when the
independent variables change. You get more second partials than first partials because
you can take the partial ∂Π/∂Q or ∂Π/∂T with respect to Q and T.
2
= = ΠQQ = Π11 = -4
Q Q Q 2
2
= = ΠTT = Π22 = -6
T T T 2
2
= = ΠQT= Π21 = 2
Q T QT
2
= = ΠTQ= Π12 = 2
T Q TQ
Note that the “cross partials” are the same. In general, fij = fji.
You can use the partials to find optima just like we did before. This time you need to
have a stationary point in all directions at the same time. The first order conditions for an
optimum in this case are that 1 = 0 and 2 = 0 at the same time. (You need to have
values for Q and T that solve both of these equations at the same time.)
-4 (3) + 2 T + 10 = 0 -2 + 2T = 0 so T = 1.
This function has a stationary point at Q = 3, T = 1.
Π11 < 0 and Π22 < 0 slopes are decreasing in both directions from the stationary point.
This is part of the condition for a maximum.
Π11 > 0 and Π22 > 0 slopes are increasing in both directions from the stationary point.
This is part of the condition for a minimum.
The final condition will guarantee that you are not at a saddle point Π11 Π22 - Π122 > 0
CONSTRAINTS: Many of the problems that we will want to solve involve constraints
on the function we want to maximize. (Utility is maximized taking the budget constraint
into account; profits may have to be maximized taking a production cap into account.)
Finding an optimum in a case like this is like dealing with an uncrossable fence on the
mountain. You want to get as high as you can on the mountain except that you can’t
cross the fence.
In our profit function example, suppose that the firm had a production cap imposed by an
outsider like the government so that it could produce no more than Q + T = 2. We
generally rewrite constraints so that they show something equal to zero. Rewritten, the
constraint looks like 2 - Q - T = 0.
We need a “trick” to help us maximize profits subject to the constraint. The problem is
that to find an optimum we need all the first partials to equal zero and we also need the
constraint to equal zero. This process would leave us with 3 equations to solve with only
2 independent variables. (If you know any linear algebra, you should realize that this
won’t work.) The “trick” we will use is called the method of Langrange. We create a
new variable called the Langragean Multiplier and use it to combine the objective
function and the constraint.
∂ L /∂Q = -4Q + 2T + 10 - = 0
∂ L /∂T= 2Q – 6T - = 0
∂ L /∂ = 2 – Q – T = 0
The third FOC just guarantees that the constraint holds. Also, the other partials include
but you can still solve for values of Q and T that solve the equations. The easiest way to
do this is to eliminate by solving for in the FOCs and then setting them equal to each
other. Otherwise, the process is just like what is described above.
Solve for in the first two FOCs: -4Q + 2T + 10 = . Set them equal and get the
expression for Q or T):
-4Q + 2 T + 10 = 2Q – 6T 6Q = 10 + 8T so that Q = 10/6 + 8/6T.
Substitute this expression into the constraint to get a value for T (the other variable), then
plug that value back into the expression for Q (the first variable) to get a value there.
This will give you the values of Q and T for which this function has a stationary point.
(If you do it here, you should find that Q = 13/7 and T = 1/7.)
There are also SOCs that are needed to determine whether the stationary point is a
maximum or a minimum. For the constrained case, we are really testing to see if we have
a “quasi-concave” function (for a maximum) or “quasi-convex” function (for a
minimum). The SOCs are very unpleasant looking. (For a maximum of the function y =
f(x, z) the SOC requires that f11 f22 – 2f12 f1 f2 + f22 f12 > 0. This condition is a condition
on the “bordered Hessian” matrix of first and second order partial derivatives of the
function.) You can assume that the functions you use in constrained optimization
problems satisfy all of the correct SOCs. I will only give you the right kinds of functions
in this course but it is important that you realize that not all functions will “work”.
Finally, you can also solve for a value for in any constrained optimization problem.
The variable represents the effect on Π (in this case) of a slight change in the constraint.
Essentially, tells you how Π changes when you relax the constraint a little (e.g., let the
constant get a little bigger). One could interpret as the marginal profit of an increase in
output here. In the consumer theory case, where you are maximizing utility subject to the
budget constrained, represents the marginal utility of an addition to income.