Optimization

This unit covers the following ideas. In preparation for the quiz and exam, make sure you have a lesson plan containing examples that explain and illustrate the following concepts.

Explain the properties of the gradient, its relation to level curves and level surfaces, and how it can be used to find directional derivatives.
Find equations of tangent planes using the gradient and level surfaces. Use the derivative (tangent planes) to approximate functions, and use this in real world application problems.
Explain the second derivative test in terms of eigenvalues. Use the second derivative test to optimize functions of several variables.
Use Lagrange multipliers to optimize a function subject to constraints. %Explain why Lagrange multipliers works by considering the gradients of the function and the constraint at a maximum.

You'll have a chance to teach your examples to your peers prior to the exam. The following homework problems line up with the topics we will discuss in class. {\noindent \footnotesize

{|l|c|l|l|l|l|}\hline Topic (11th ed.) &Sec &Basic Practice &Good Problems &Thy/App &Comp \\\hline Directional Derivatives and the Gradient & 14.5&1-22 &23-32 &33-36 & \\\hline Tangent Planes and approximation & 14.6&1-22 &23-24, 47-58, 60-63 & 59& \\\hline 2nd Derivative Test (use eigenvalues) & 14.7&1-38 &39-44, 49-52, &45-48, 53-64 &65-70 \\\hline Lagrange Multipliers & 14.8&1-32 &33-40 &41-44 &45-50 \\\hline

%Taylor Polynomials & 14.10& & & & \\\hline

} {\noindent \footnotesize

{|l|c|l|l|l|l|}\hline Topic (12th ed.) &Sec &Basic Practice &Good Problems &Thy/App &Comp \\\hline Directional Derivatives and the Gradient & 14.5&1-24 &25-36 &37-40 & \\\hline Tangent Planes and approximation & 14.6&1-22 &23-24, 31-32, 49-62, 64-67 &63 & \\\hline 2nd Derivative Test (use eigenvalues) & 14.7&1-38 &39-44, 49-60, &45-48, 61-68 &69-74 \\\hline Lagrange Multipliers & 14.8&1-32 &33-40 &41-44 &45-50 \\\hline

%Taylor Polynomials & 14.10& & & & \\\hline

}

The Gradient

See Larson 13.6–13.7.

The derivative $Df$ of a function $f:\mathbb{R}^n\to\mathbb{R}$ (one output dimension) is called the gradient of $f$, and written $\vec \nabla f$, when we want to emphasize that the derivative is a vector field.

You'll want a computer to help you construct the graphs, particularly $h$. Please use the Mathematica introduction in Brainhoney. You could use Wolfram Alpha (use the links in the function chapter if you forgot how to graph).

Consider the functions $f(x,y)=9-x^2-y^2$, $g(x,y)=2x-y$, and $h(x,y)=\sin x\cos y$.

See Sage. You can modify these commands to help in the plots below too.
Compute $\vec \nabla f(x,y)$. Then draw both $\vec \nabla f$ and several level curves of $f$ on the same axes.
Sage
Compute $\vec \nabla g(x,y)$. Then draw both $\vec \nabla g$ and several level curves of $g$ on the same axes.
Sage
Compute $\vec \nabla h(x,y)$. Then draw both $\vec \nabla h$ and several level curves of $h$ on the same axes.
What relationships do you see between the gradient vector field and level curves?

When you present in class, be prepared to provide rough sketches of the level curves and gradients of each function.

The next problem investigates why the gradient vector and the level curve have the relationship you hopefully saw.

Suppose $\vec r(t)$ is a level curve of $f(x,y)$.

Suppose you know that at $t=0$, the value of $f$ at $\vec r(0)$ is 7. What is the value of $f$ at $\vec r(1)$? [What does it mean to be on a level curve?]
As you move along the level curve $\vec r$, how much does $f$ change? Use this to tell the class what $\ds\frac{df}{dt}$ must equal.
At points along the level curve $\vec r$, we have the composite function $f(\vec r(t))$. Compute the derivative $\ds\frac{df}{dt}$ using the chain rule.
Use your work from the previous parts to explain why the gradient always meets the level curve at a 90$^\circ$ angle. We say that the gradient is normal to level curves (i.e., a gradient vector is orthogonal to the tangent vector of the curve).

In the derivative chapter, we extended differential notation from $dy=f' dx$ to $d\vec y = D\vec f d\vec x$. The key idea is that a small change in the output variables is approximated by the product of the derivative and a small change in the input variables. As a quick refresher, if we have the function $z=f(x,y)$, then differential notation states that $$dz = \begin{bmatrix}f_x&f_y\end{bmatrix} \begin{bmatrix}dx\\dy\end{bmatrix}.$$

Suppose the temperature at a point in the plane is given by the function $T(x,y)=x^2-xy-y^2$ degrees Fahrenheit. A particle is at $P=(2,3)$.

Use differentials to estimate the change in temperature if the particle moves 1 unit in the direction of $\vec u=\left(3,4\right)$. [Hint: Find a unit vector in that direction.]
What is the actual change in temperature if the particle moves 1 unit in the direction of $\vec u=\left(3,4\right)$?
Use differentials to estimate the change in temperature if the particle moves about .2 units in the direction of $\vec u=\left(3,4\right)$.

Let's generalize the calculation we just did into a formula for any $f(x,y)$.

Suppose that $z=f(x,y)$ is a differentiable function (so the derivative is the matrix $\begin{bmatrix}f_x&f_y\end{bmatrix}$). Remember to use differential notation in this problem.

If $(dx,dy)=(1,0)$, which means we've moved one unit in the $x$ direction while holding $y$ constant, what is $dz$?
If $(dx,dy)=(0,1)$, which means we've moved one unit in the $y$ direction while holding $x$ constant, what is $dz$?
Consider the direction $\vec u=(2,3)$. Find a unit vector in the direction of $\vec u$. If we move one unit in the direction of $\vec u$, what is $dz$? [It's all right to leave you answer as a dot product.]

The directional derivative of $f$ in the direction of the unit vector $\vec u$ at a point $P$ is defined to be $$D_{\vec u} f(P) = Df(P) \vec u = \vec \nabla f \cdot \vec u.$$ We dot the gradient of $f$ with the direction vector $\vec u$. The partial derivative of $f$ with respect to $x$ is precisely the directional derivative of $f$ in the $(1,0)$ direction. Similarly, the partial derivative of $f$ with respect to $y$ is precisely the directional derivative of $f$ in the $(0,1)$ direction. This definition extends to higher dimensions.

Note that in the definition above, we require the vector $\vec u$ to be a unit vector. If you are asked to find a directional derivative in some direction, make sure you start by finding a unit vector in that direction. We want to deal with unit vectors because when we say something has a slope of $m$ units, we want to say “The function rises $m$ units if we run $1$ unit.”

Sage.

Consider the function $f(x,y) = 9-x^2-y^2$.

Draw several level curves of $f$.
At the point $P=(2,1)$, place a dot on your graph. Then draw a unit vector based at $P$ that points in the direction $\vec u=(3,4)$ [not to the point $(3,4)$, but in the direction $\vec u=(3,4)$]. If you were to move in the direction $(3,4)$, starting from the point $(2,1)$, would the value of $f$ increase or decrease?
Find the slope of $f$ at $P=(2,1)$ in the direction $\vec u=(3,4)$ by finding the directional derivative. This should agree with your previous answer.
If you stand at $Q=(-2,3)$ and move in the direction $\vec v= (1,-1)$, will $f$ increase or decrease? Find the directional derivative of $f$ in the direction $\vec v=(1,-1)$ at the point $Q=(-2,3)$.

Recall that the directional derivative of $\vec f$ in the direction $\vec u$ is the dot product $\vec \nabla f\cdot \vec u = |\vec \nabla f| |\vec u|\cos\theta$. In this problem, you'll explain why the gradient points in the direction of greatest increase.

- What is the angle between the two vectors $\vec \nabla f$ and $\vec u$? [Hint: we learned a formula for the angle between two vectors back in the vectors chapter.]
Why is the directional derivative of $\vec f$ the largest when $\vec u$ points in the exact same direction as $\vec \nabla f$? [Hint: What angle maximizes the cosine function?]
When $\vec u$ points in the same direction as $\vec \nabla f$, show that $D_{\vec u}f = |\vec \nabla f|$. In other words, explain why the length of the gradient is precisely the slope of $f$ in the direction of greatest increase (the slope in the steepest direction).
Which direction points in the direction of greatest decrease?

Suppose you are looking at a topographical map (see Wikipedia for an example). On this topographical map, each contour line represents 100 ft in elevation. You notice in one section of the map that the contour lines are really close together, and they start to form circles around a spot on the graph. You notice in another section of the map that the contour lines are spaced quite far apart. Let $f(x,y)$ be the elevation of the land, so that the topographical map is just a contour plot of $f$.

For example, you can look at a contour plot of $f(x,y) = (x+1)^3-3(x+1)^2-y^2+2$ in Sage.
Where is the slope of the terrain larger, in the section with closely packed contour lines, or the section with contour lines that are spread out. In which section will the gradient be a longer vector?
At the very top of a mountain, or the very bottom of a valley, will the gradient be a long vector or a short vector?

Summarizing our results, we have the following theorem.

Let $f$ be a continuously differentiable function, with $\vec r$ a level curve of the function.

The gradient is always normal to level curves, meaning $\vec \nabla f\cdot \dfrac{d\vec r}{dt}=0$.
The gradient points in the direction of greatest increase.
The directional derivative of $f$ in the direction of the gradient is the length of the gradient. Symbolically, we write $D_{\vec \nabla f}f = |\vec \nabla f|$.
At a maximum or minimum, the gradient is the zero vector.

The next few problems have you practice using differentials, and then obtain tangent lines and planes to curves and surfaces using differentials.

See Sage.

Consider the function $f(x,y)=x^2+y^2$. Consider the level curve $C$ given by $f(x,y)=25$. Our goal is to find an equation of the tangent line to $C$ at $P=(3,-4)$.

Draw $C$. Compute $\vec \nabla f$ and add to your graph the vector $\vec \nabla f(P)$.
We know the point $P=(3,-4)$ is on the tangent line. Let $Q=(x,y)$ represent another point on the tangent line. Add to your graph the point $Q$ and the vector $\vec {PQ} = (x-3,y+4)$.
Why are $\vec \nabla f(P)$ and $\vec{PQ}$ orthogonal? Use this fact to write an equation of the tangent line.
What is a normal vector to the line?

This problem might be better if on part 3 we wrote, “Why does the dot product $\vec \nabla f(P)\cdot\vec{PQ}$ equal zero? Compute this dot product. Why is this an equation of the tangent line?” To many students are correctly giving a vector equation of the tangent line, by reversing the order of the gradient, negating one of the terms, and then using (x,y)=dir vec(t)+point.

The previous problem had you give an equation of the tangent line to a level curve, by using differential notation. The next problems asks you to repeat this idea and give an equation of a tangent plane to a level surface.

See Sage.

Consider the function $f(x,y,z)=x^2+y^2+z^2$. Consider the level surface $S$ given by $f(x,y,z)=9$. Our goal is to find an equation of the tangent plane to $S$ at $P=(1,2,-2)$.

Draw $S$.
Compute $\vec \nabla f$. Add to your graph the vector $\vec \nabla f(P)$, with its base at $P$.
We know the point $P=(1,2,-2)$ is on the tangent plane. Let $Q=(x,y,z)$ be any other point on the tangent plane. What is the component form of the vector $\vec {PQ}$?
Why are $\vec \nabla f(P)$ and $\vec{PQ}$ orthogonal? Use this fact to write an equation of the tangent plane.
What is a normal vector to the plane?

Find an equation of the tangent plane to the hyperboloid of one sheet $1=x^2-y^2+z^2$ at the point $(-3,3,1)$.

%Also give an equation of the normal line (the line that sticks straight up out of the surface). FYI, the normal line is used in computer graphics to know how to shade an object.

The two surfaces $x^2+y^2+z^2=14$ and $3x+4y-z=-1$ intersect in a curve $C$. Draw both surfaces, and show us the curve $C$. Then, at the point $(2,-1,3)$, find an equation of the tangent line to this curve. [Hint: The line is in both tangent planes, so it is orthogonal to both normal vectors. The cross product gets you a vector that is orthogonal to two vectors.]

This is almost exactly like TODO and feels redundant. NOTE: I added this problem because it is a pet favorite of mine. I decided to remove it. I may add it back next semester as an optional problem. It's not exactly easy to see the answer, whereas it is simple to see the solution without differentials on the volume problem.

% Consider an electrical circuit with two parallel resistors, each having resistance {$R_1$} and {$R_2$}. The total resistance $R$ in this circuit is given by $$\frac{1}{R} = \frac{1}{R_1}+\frac{1}{R_2}.$$ %

%
Show that $DR = \begin{bmatrix}\frac{R^2}{R_1^2} & \frac{R^2}{R_2^2} \end{bmatrix}$. Then compute $dR$. %
When you purchase a resistor, it comes with a stated resistance. However, manufacturing defects always occur, and the stated resistance on each resistor is not the actual resistance. This is where differentials come in handy. % If {$ R_1 $} changes from an expected 10 ohms to an actual 9.9 ohms, and {$ R_2 $} changes from 20 to 20.2, would you expect a positive or negative change in the total resistance {$R$}? Use differentials to make your claim, and then back it up by computing the total resistance using (10,20) and then using $(9.9,20.2)$. %
Suppose both $R_1$ and $R_2$ have the same resistance, and each have a tolerance of $dR$. Using differentials, what would you expect as the maximum possible error for the total resistance. You should show that running resistors in parallel reduces the error. (For those of you familiar with electrical systems, running resistors in series causes the errors to add.) %

The Second Derivative Test

We start with a review problems from first-semester calculus.

Let $f(x) = x^3-3x^2$. Find the critical values of $f$ by solving $f'(x)=0$. Determine if each critical value leads to a local maximum or local minimum by computing the second derivative. State the local maxima/minima of $f$. Sketch the function using the information you discovered.

% Consider the functions $f(x)=x^3$, $g(x)=x^4$, and $h(x)=-x^6$. %

%
Show that $x=0$ is the only critical value for $f$, $g$, and $h$. %
What is the value of the second derivative at $x=0$ for each function? %
Draw all 3 functions and determine for each if $x=0$ is the location of a local maximum, local minimum, or neither. %
What conclusions can you draw if the second derivative of a function at a critical point is zero? %

We now generalize the second derivative test to all dimensions. We've already seen that the second derivative of a function such as $z=f(x,y)$ is a square matrix. The second derivative test relied on understanding if a function was concave up or concave down. We need a way to examine the concavity of $f$ as we approach a point $(x,y)$ from any of the infinitely many directions. Such a method exists, and leads to an eigenvalue/eigenvector problem. I'm assuming that most of you have never heard the word “eigenvalue.” We could spend an entire semester just studying eigenvectors. We'd need a few weeks to discover what they are from a problem-based approach. Instead, here is an example of how to find eigenvalues and eigenvectors.

Let $A$ be a square matrix, so in 2D we have $A=\begin{pmatrix} a&b\\c&d \end{pmatrix}$. The identity matrix $I$ is a square matrix with 1's on the diagonal and zeros everywhere else, so in 2D we have $I = \begin{pmatrix} 1&0\\0&1 \end{pmatrix}$. The eigenvalues of $A$ are the solutions $\lambda$ to the equation $|A-\lambda I|=0$. Remember that $|A|$ means, “Compute the determinant of $A$.” So in 2D, we need to find the value $\lambda$ so that $$\left|\begin{pmatrix} a&b\\c&d\end{pmatrix}-\lambda \begin{pmatrix} 1&0\\0&1 \end{pmatrix} \right|=\begin{vmatrix} a-\lambda &b\\c&d-\lambda \end{vmatrix}=0.$$ This definition extends to any square matrix. In 3D, the eigenvalues are the solutions to the equation $$\left|\begin{pmatrix} a&b&c\\d&e&f\\g&h&i\end{pmatrix}-\lambda \begin{pmatrix} 1&0&0\\0&1&0\\0&0&1\end{pmatrix} \right| = \begin{vmatrix} a-\lambda&b&c\\d&e-\lambda&f\\g&h&i-\lambda\end{vmatrix}=0.$$ An eigenvector of $A$ corresponding to $\lambda$ is a nonzero vector $\vec x$ such that $A\vec x=\lambda x$.

As you continue taking more upper level science courses (in physics, engineering, mathematics, chemistry, and more) you'll soon see that eigenvalues and eigenvectors play a huge role. You'll start to see them in most of your classes. For now, we'll use them without proof to apply the second derivative test. In class, make sure you ask me to show you pictures with each problem we do, so we can see how eigenvalues and eigenvectors appear in surfaces.

I would like to put a reference to Ken Kutler's book from Provo for a proof. I would really like to have more references period in the book, so if you want to make a bibtex file, and you ever feel like adding a reference, feel free.

Let $f(x,y)$ be a function so that all the second partial derivatives exist and are continuous. The second derivative of $f$, written $D^2f$ and sometimes called the Hessian of $f$, is a square matrix.

Because the second derivative is always symmetric (why is it?), in a linear algebra course you could prove that the eigenvalues of $D^2f$ must always be real numbers.

Let $\lambda_1$ be the largest eigenvalue of $D^2f$, and $\lambda_2$ be the smallest eigenvalue. Then $\lambda_1$ is the largest possible second derivative obtained in any direction. Similarly, the smallest possible second derivative obtained in any direction is $\lambda_2$. The eigenvectors give the directions in which these extreme second derivatives are obtained. The second derivative test states the following.

Suppose $(a,b)$ is a critical point of $f$, meaning $Df(a,b) = \begin{bmatrix}0&0\end{bmatrix}$.

If all the eigenvalues of $D^2f(a,b)$ are positive, then in every direction the function is concave upwards at $(a,b)$ which means the function has a local minimum at $(a,b)$.
If all the eigenvalues of $D^2f(a,b)$ are negative, then in every direction the function is concave downwards at $(a,b)$. This means the function has a local maximum at $(a,b)$.
If the smallest eigenvalue of $D^2f(a,b)$ is negative, and the largest eigenvalue of $D^2f(a,b)$ is positive, then in one direction the function is concave upwards, and in another the function is concave downwards. The point $(a,b)$ is called a saddle point.
If the largest or smallest eigenvalue of $f$ equals 0, then the second derivative tests yields no information.

Consider the function $f(x,y)=x^2-2x+xy+y^2$. The first and second derivatives are $$Df(x,y)=\begin{bmatrix}2x-2+y,x+2y\end{bmatrix} \quad\text{and}\quad D^2f = \begin{bmatrix}2&1 \\1&2\end{bmatrix}.$$ The first derivative is zero (the zero matrix) when both $2x-2+y=0$ and $x+2y=0$. We need to solve the system of equations $2x+y=2$ and $x+2y=0$. Double the second equation, and then subtract it from the first to obtain $0x-3y=2$, or $y=-2/3$. The second equation says that $x=-2y$, or that $x=4/3$. So the only critical point is $(4/3,-2/3)$.

In this example, the second derivative is constant, so the point $(4/3,-2/3)$ did not change the matrix. In general, the point will affect your matrix. See Sage

%https://sagecell.sagemath.org/?z=eJy1Vl1r5DYUfZ9fIQLFckbjTjLsy4Ieym4KhbaEdBcKxmsUWx6LyLJXkrP2_vpeSfZ47AbaPnQeJqOrq3PPPfeDVHggY0yHL_eH-9thP9yO-_HL_W5X8goJKXtjNbM8t9xYXBFUvTJpKD68I-9igux0OpFTHL_fIfjc3Nz4v0_c9lohhix7lhy1FSq0sKJgEnWtUNYQxMWZK0DoORyYKicDL2yrwdJBcHh_-ojOmnX1wkaoM7I1R0WvXxlE4YhZxFlRb0Iknoj_-uX3x8-f_ggMEarQ4QDIVa8KK1qVIPTZ9EzKEWCFQaZue1miZ8BFACZ9ArMzQaCMULxEUryA-SKfkw30aTXiA2s6yZM5mhMJYXjGemlR0C52FBqhRNM3PveGDf530MPjuBznsKBGaydAuwX08v9LwI5p1nDLNRLqoiLcvy3hzod8-POn3x5_fbjoh5BhZ_4eoDWOIP8o3lxMqhwmWTa3viNIqCr9W49tsToNPMKbzQ3U6Rv2KMub_8gQ2v2WDx3GjuoBqMY_nrau_xfdeVIkdFMOjVe8KG4MPXmj6XXFCp4XrWw1jZ6hhtHi7a2IojTSvIxIuM78PaRFLJ3zJnbK3EJrwLxCU8Az30DeXHlzFczVYs6nNtGigIswkjh1gEG4LKB2ovDzR317nkrYEHDtQR1mTPB4fUJtxwphR3pMTmST4uoU0GuQQzBFQ8hkOuJwaVqpDIXvV45T_lWFetJjmMCvyrV3lYDcpeDK4jgjjn5G3MPeTVReAnn6Sfc8Dl3ua0bTNPrgKICqH-ZReHSjEBEUPVw21nLy2kRZdi2IoWk4C3oMggKpObBjloaBNO63zwSJCpl0yJzh6ckPsEnH6ZhdDR44g9yOO07xCx_J08efMVCCBeAQnQVOE66PlwjLG4PnmrnPPOS5H3JX-quCYxdj8Z1kz5m9eM-V8I5JUTN15jn0_RkDmeUld_1EOChk6BYl8bu-YRBwgKfn2uLl4dxWexq2EDTWmnEiBcweNNQ0HmfNuYKSGPGd07vjguSbAWh4Fk6U76LDV7wSAOgbBerEi8buUwrN_eLNXWdDzs49cb_xN1Hamq6nlgQiy3Cm4of7LE6msYgTWBnKSFgbm0zi1dLytXH7OAfiTS_ZtjSD63gFfbK3t45ReszIGGzjxXZ3VekLopvRC45PCq8iEYQt8VvC-ll9MyGyrKl1_uuAU_321EfYr7V8y9O4UqfeO1vd-5lMWNdxVeJ0y2YtJQmVRtiJEG80EHt6F8Zch39Mpq0-Edjt_mnL17aRiXfCwbXmrOR6WiBXu_0vkdXv9w%3D%3D

to see a graph which shows the eigenvectors in which the largest and smallest second derivatives occur.

%\includegraphics[width=\marginparwidth]{2nd-der-test}

We find the eigenvalues of $D^2 f(4/3,-2/3)$ by solving the equation $$\begin{vmatrix}2-\lambda&1 \\1&2-\lambda\end{vmatrix} = (2-\lambda)(2-\lambda)-1=0.$$ Expanding the left hand side gives us {$4-4\lambda + \lambda^2 -1 = 0$}. Simplifying and factoring gives us $\lambda^2-4\lambda +3 = (\lambda-3)(\lambda -1) = 0$. This means the eigenvalues are $\lambda = 1$ and $\lambda=3$. Since both numbers are positive, the function is concave upwards in every direction. The critical point $(4/3,-2/3)$ corresponds to a local minimum of the function. The local minimum is the output $f(4/3,-2/3) = (4/3)^2-2(4/3)+(4/3)(-2/3)+(-2/3)^2$. A graph of $f$ is provided on the right. The red vector $(1,1)$ points in the direction in which the second derivative is the largest value 3. The red vector $(-1,1)$ points in the direction in which the second derivative is the smallest value 1. These vectors are called eigenvectors, and you can learn much more about them, in particular how to find them, in a linear algebra course. For this course, we just need to be able to find eigenvalues.

See 14.7 for more practice.

Consider the function $f(x,y)=x^2+4xy+y^2$.

Find the critical points of $f$ by finding when $Df(x,y)$ is the zero matrix.
Find the eigenvalues of $D^2f$ at any critical points.
Label each critical point as a local maximum, local minimum, or saddle point, and state the value of $f$ at the critical point.

Consider the function $f(x,y)=x^3-3x+y^2-4y$.

Find the critical points of $f$ by finding when $Df(x,y)$ is the zero matrix.
Find the eigenvalues of $D^2f$ at any critical points. [Hint: First compute $D^2f$. Since there are two critical points, evaluate the second derivative at each point to obtain 2 different matrices. Then find the eigenvalues of each matrix.]
Label each critical point as a local maximum, local minimum, or saddle point, and state the value of $f$ at the critical point.

Consider the function $f(x,y)=x^3 + 3xy +y^3$.

Find the critical points of $f$ by finding when $Df(x,y)$ is the zero matrix.
Find the eigenvalues of $D^2f$ at any critical points.
Label each critical point as a local maximum, local minimum, or saddle point, and state the value of $f$ at the critical point.

You now have the tools needed to find optimal solutions to problems in any dimension. Here's a silly problem that demonstrates how we can use what we've just learned.

\label{optimize box in cake} For my daughter's birthday, she has asked for a Barbie princess cake. I purchased a metal pan that's roughly in the shape of a paraboloid $z=f(x,y)=9-x^2-y^2$ for $z\geq 0$. To surprise her, I want to hide a present inside the cake. The present is a bunch of small candy that can pretty much fill a box of any size. I'd like to know how large (biggest volume) of a rectangular box I can fit under the cake, so that when we start cutting the cake, she'll find her surprise present. The box will start at $z=0$ and the corners of the box (located at $(x,\pm y)$ and $(-x,\pm y)$) will touch the surface of the cake $z=9-x^2-y^2$.

What is the function $V(x,y)$ that we are trying to maximize?
If you find all the critical points of $V$, you'll discover there are 9. However, only one of these critical points makes sense in the context of this problem. Find that critical point.
Use the second derivative test to prove that the critical point yields a maximum volume.
What are the dimensions of the box? What's the volume of the box?

The only thing left for me is to now determine how much candy I should buy to fill the box. I'll take care of that.

When was in Provo, I remember teaching a business calc course. In the text, I recall reading about an aerospace company (maybe Boeing) that tried to build an “optimal” wing. They set up their equations, found derivatives, and build the wing that the critical point suggested they should build. Turns out they build the “worst” wing instead of the “best.” This was just a classic case of why you should ALWAYS check if the critical points lead to maxes or mins. I can't put this in the text until I find the reference. I want to add it to the text someday.

In this problem, we'll derive the version of the second derivative test that is found in most multivariate calculus texts. The test given below only works for functions of the form $f:\mathbb{R}^2\to\mathbb{R}$. The eigenvalue test you have been practicing will work with a function of the form $f:\mathbb{R}^n\to\mathbb{R}$, for any natural number $n$.

Suppose that $f(x,y)$ has a critical point at $(a,b)$.

Find a general formula for the eigenvalues of $D^2f(a,b)$. Your answer will be in terms of the second partials of $f$.
Let $D=f_{xx}f_{yy}-f_{xy}^2$.
- If $D<0$, explain why $f$ has a saddle point at $(a,b)$.
- If $D=0$, explain why the second derivative test fails.
- If $D>0$, explain why $f$ has either a maximum or minimum at $(a,b)$.
- If $D>0$, and $f_x(a,b)>0$, does $f$ have a local max or local min at $(a,b)$. Explain.
The only critical point of $f(x,y) = x^2+3xy+2y^2$ is at $(0,0)$. Does this point correspond to a local maximum, local minimum, or saddle point? Give the eigenvalues (which should come instantly out of part 1). Find $D$, from part 2, to answer the question.

Lagrange Multipliers

The last problem was an example of an optimization problem where we wish to optimize a function (the volume of a box) subject to a constraint (the box has to fit inside a cake). If you are economics student, this section may be the key reason why you were asked to take multivariate calculus. In the business world, we often want to optimize something (profit, revenue, cost, utility, etc.) subject to some constraint (a limited budget, a demand curve, warehouse space, employee hours, etc.). An aerospace engineer will build the best wing that can withstand given forces. Everywhere in the engineering world, we often seek to create the “best” thing possible, subject to some outside constraints. Lagrange discovered an extremely useful method for answering this question, and today we call it “Lagrange Multipliers.” Rather than introduce Cobb-Douglass production functions (from economics) or sheer-stress calculations (from engineering), we'll work with simple examples that illustrate the key points. Sometimes silly examples carry the message across just as well.

Suppose an ant walks around the circle $g(x,y)=x^2+y^2=1$. As the ant walks around the circle, the temperature is $f(x,y) = x^2+y+4$. Our goal is to find the maximum and minimum temperatures reached by the ant as it walks around the circle. We want to optimize $f(x,y)$ subject to the constraint $g(x,y)=1$.

Draw the circle $g(x,y)=1$. Then, on the same set of axes, draw several level curves of $f$. The level curves $f=3, 4, 5, 6$ are a good start. Then add more (maybe at each 1/4th). If you make a careful, accurate graph, it will help a lot below.
Based solely on your graph, where does the minimum temperature occur? What is the minimum temperature?
If the ant is at the point $(0,1)$, and it moves left, will the temperature rise or fall? What if the ant moves right?
On your graph, place a dot(s) where you believe the ant reaches a maximum temperature (it may occur at more than one spot). Explain why you believe this is the spot where the maximum temperature occurs. What about the level curves tells you that these spots should be a maximum.
Draw the gradient of $f$ at the places where the minimum and maximum temperatures occur. Also draw the gradient of $g$ at these spots. How are the gradients of $f$ and $g$ related at these spots?

Suppose $f$ and $g$ are continuously differentiable functions. Suppose that we want to find the maximum and minimum values of $f$ subject to the constraint $g(x,y)=c$ (where $c$ is some constant). Then if a maximum or minimum occurs, it must occur at a spot where the gradient of $f$ and the gradient of $g$ point in the same, or opposite, directions. So the gradient of $g$ must be a multiple of the gradient of $f$. To find the maximum and minimum values (if they exist), we just solve the system of equations that result from $$\vec \nabla f = \lambda \vec \nabla g,\quad \text{and}\quad g(x,y)=c$$ where $\lambda$ is the proportionality constant. The maximum and minimum values will be among the solutions of this system of equations.

Suppose an ant walks around the circle $x^2+y^2=1$. As the ant walks around the circle, the temperature is $T(x,y) = x^2+y+4$. Our goal is to find the maximum and minimum temperatures $T$ reached by the ant as it walks around the circle.

What function $f(x,y)$ do we wish to optimize? What is the constraint $g(x,y)=c$?
%
- Why do you know the ant must have reached both a maximum and minimum temperature?
The most common error on this problem is to divide both sides of an equation by $x$, which could be zero. If you do this, you'll only get 2 ordered pairs.
% Find the gradient of $f$ and the gradient of $g$. Then solve the system of equations that you get from the equations $$\vec \nabla f = \lambda \vec \nabla g, \quad \quad x^2+y^2=1.$$ You should obtain 4 ordered pairs $(x,y)$.
At each ordered pair, find the temperature. What is the maximum temperature obtained? What is the minimum temperature obtained.

\marginparbmw{See 14.8 for more practice.}

Consider the curve $xy^2=54$ (draw it). The distance from each point on this curve to the origin is a function that must have a minimum value. Find a point $(a,b)$ on the curve that is closest to the origin. [The distance to the origin is $d(x,y)=\sqrt{x^2+y^2}.$ This distance is minimized when $f(x,y) = x^2+y^2$ is minimized. So use $f(x,y)=x^2+y^2$ as the function you wish to minimize. What's the constraint $g(x,y)=c$?]

Find the dimensions of the rectangular box with maximum volume that can be inscribed inside the ellipsoid $$\frac{x^2}{2^2}+\frac{y^2}{3^2}+\frac{z^2}{5^2}=1.$$ [What is the function $f$ you wish to optimize? What is the constraint $g=c$? Try solving each equation for $\lambda$ so you can eliminate it from the problem.]

Repeat problem TODO, but this time use Lagrange multipliers. Find the dimensions of the rectangular box of maximum volume that fits underneath the surface $z=f(x,y)=9-x^2-y^2$ for $z\geq 0$. [Hint: Let $f(x,y,z) = (2x)(2x)(z)$ and $g(x,y,z)=z+x^2+y^2=9$. You'll get a system of 4 equations with 4 unknowns ($x,y,z,\lambda$). Try solving each equation for lambda. You know $x,y,z$ can't be zero or negative, so ignore those possible cases.]

This would be an appropriate place to put Taylor series in high dimensions, if your school wants the students to see those ideas. You can introduce it all using matrix derivatives. You could then actually prove the second derivative test, as well as define positive definite matrices.

We finished all the problems in this unit in 4 days of time (really only 3 and a half, as the line integral unit overlapped on the first day.

%%% Local Variables: %%% mode: latex %%% TeX-master: "215-problems" %%% End: