Chapter 8

  CALCULUS OF VARIATIONS

Introduction top

   The calculus of variations and its extensions are devoted to finding the optimum function that gives the best value of the economic model and satisfies the constraints of a system. The need for an optimum function, rather than an optimal point, arises in numerous problems from a wide range of fields in engineering and physics, which include optimal control, transport phenomena, optics, elasticity, vibrations, statics and dynamics of solid bodies and navigation. Two examples are determining the optimal temperatures profile in a catalytic reactor to maximize the conversion and the optimal trajectory for a missile to maximize the satellite payload placed in orbit. The first calculus of variations problem, the Brachistochrone problem, was posed and solved by Johannes Bernoulli in 1696(1). In this problem the optimum curve was determined to minimize the time traveled by a particle sliding without friction between two points.

     This chapter is devoted to a relatively brief discussion of some of the key concepts of this topic. These include the Euler equation and the Euler-Poisson equations for the case of several functions and several independent variables with and without constraints. It begins with a derivation of the Euler equation and extends these concepts to more detailed cases. Examples are given to illustrate this theory.

     The purpose of this chapter is to develop an appreciation for what is required to determine the optimum function for a variational problem. The extensions and applications to optimal control, Pontryagin's maximum principle and continuous dynamic programming are left to books devoted to those topics.

Functions, Functionals and Neighborhoods top

It will be necessary to discuss briefly functionals and neighborhoods before developing the Euler equation for the solution of the simplest problem in the calculus of variations. In mathematical programming the maximum or minimum of a function was determined to be an optimal point or set of points. In the calculus of variations the maximum or minimum value of a functional is determined to be an optimal function. A functional is a function of a function and depends on the entire path of one or more functions rather than a number of discrete variables.

     For the calculus of variations the functional is an integral, and the function that appears in the integrand of the integral is to be selected to maximize or minimize the value of the integral. The texts by Forray(1), Ewing(2), Weinstock(3), Schechter(4), and Sagan(6) elaborate on this concept. However, at this point let us examine an example of the functional given by equation (8-1). The minimum of this functional is a function y(x) that gives the shortest distance between two points [x0,y(x0)] and [x1,y(x1)].

Equ8-1.jpg (4792 bytes)

In this equation y' is the first derivative of y with respect to x. The function that minimizes this integral, a straight line, will be obtained as an illustration of the use of the Euler equation in the next section.

     The concept of a neighborhood is used in the derivation of the Euler equation to convert the problem into one of finding the stationary point of a function of a single variable. A function is said to be in the neighborhood of a function y if | Y1.jpg (966 bytes) | < h (5). This is illustrated in Figure 8-1(a). The concept can be extended for more restrictive conditions such as that shown in Figure 8-1(b) when | Y1.jpg (966 bytes)| < h and | Y2.jpg (1039 bytes)| < h. For this case Y.jpg (749 bytes) is said to be in the neighborhood of first order to y. Consequently, the higher the order of the neighborhood the more nearly the functions will coincide. Extensions of these definitions will lead to what are referred to as strong and weak variations(6).

Euler Equation top

The simplest form of the integral to be optimized by the calculus of variations is the following one.

Equ8-2.jpg (4354 bytes)

In addition, the values of y(x0) and y(x1) are known; and an example of the function F(x, y, y') was given in equation (8-1) as:

F(x, y, y')  =  [1 + (y')2]1/2                                            (8-3)

To obtain the optimal function that minimizes the equation (8-2), it is necessary to solve the Euler equation, which is the following second order ordinary differential equation.

 

Equ8-4a.jpg (4220 bytes)

It is not obvious that equation (8-4) is a second order ordinary differential equation. Also, it probably appears unusual to be partially differentiating the function F with respect to y and y'. In addition, although the term minimize will be used, stationary points are being located, and their character will have to be determined using sufficient conditions. Consequently, it should be beneficial to outline the derivation of the Euler equation.

First, y(x) is specified as the function that minimizes the functional I[y(x)], equation (8-2). (however, the form of y(x) has to be determined.) Then a function Y.jpg (749 bytes)(x) is constructed to be in the neighborhood of y(x) as follows:

 Y.jpg (749 bytes)(x) = y(x) + an(x)                                                       (8-5)

where a is a parameter that can be made arbitrarily small. Also n(x) is a continuously differentiable function defined on the interval x0 < x < x1 with n(x0) = n(x1) = 0 but is arbitrary elsewhere. The results from the derivation using equation (8-5) are described mathematically as weak variations (1), for an(x) and an'(x) are small.

     Now equation (8-2) is written in terms of the function  Y.jpg (749 bytes)(x) as:

Equ8-6.jpg (4533 bytes)

The above equation can be put in terms of the optimal function y(x) and the arbitrary function n(x) using equation (8-5).

Equ8-7.jpg (5419 bytes)

     The mathematical argument(3) is made that all of the possible functions lie in an arbitrarily small neighborhood of y because a can be made arbitrarily small. As such, the integral of equation (8-7) may be regarded as an ordinary function of a, F(a), because would specify the value of the integral knowing F(a = 0) at the minimum from y(x).

Equ8-8.jpg (6110 bytes)

The minimum of F(a) is obtained by sitting the first derivative of F with respect to a equal to zero. The differentiation is indicated as:

 Equ8-9.jpg (5747 bytes)

Leibnitz' rule, equation (8-10), is required to differentiate the integral given in equation (8-9).

Equ8-10.jpg (7937 bytes)

where x0 and x1 correspond to a1 and a2, and a corresponds to t.

The upper and lower limits, x0 and x1, are constants; and the following derivatives in the second and third terms on the right hand side of equation (8-10) are zero for this case.

Equ8-11.jpg (3747 bytes)

Consequently, the order of integration and differentiation is interchanged, and equation (8-9) can be written as:

 Equ8-12.jpg (5745 bytes)

The integrand can be expanded as follows:

Equ8-13.jpg (7195 bytes)

where dx/da = 0, because x is treated as a constant in the mathematical argument of considering changes from curve to curve at constant x.

Substituting equation (8-13) into equation (8-12) gives:

Equ8-14.jpg (7479 bytes)

The following results are needed:

 Equ8-15.jpg (6454 bytes)

Using equation (8-15), we can write equation (8-14) as:

Equ8-16.jpg (6306 bytes)

An integration-by-parts will give a more convenient form for the term involving n' i.e.:

 Equ8-17a.jpg (8586 bytes)

The first term on the right hand side is zero for n(xo) = n(x1) = 0. Combining the results from equation (8-17) with equation (8-16) gives:

 Equ8-17b.jpg (6119 bytes)

At the optimum dF/da = 0, and letting a ® 0 has  Y.jpg (749 bytes)® y and   Y.jpg (749 bytes)'® y'. Therefore, the above equation becomes:

Equ8-18a.jpg (6433 bytes)

To obtain the Euler equation the fundamental lemma of the calculus of variation is used. This lemma can be stated after Weinstock (3) as:

"If x0 and x1 (>x0) are fixed constants and G(x) is a particular continuous function in the interval x0 < x < x1 and if:

Equ8-18b.jpg (3155 bytes)

for every choice of the continuously differentiable function n(x) for which n(x0) = n(x1) = 0 then G(x) = 0 identically in the interval x0 < x < x1."

The proof of this lemma is by contradiction, and is given by Weinstock (3).

     Applying this lemma to equation (8-18) gives the Euler equation.

Equ8-4b.jpg (4097 bytes)

This equation is a second order ordinary differential equation and has boundary conditions y(xo) and y(x1). The solution of this differential equation y(x) optimizes the integral I[y(x)].

     A more convenient form of the Euler equation can be obtained by applying the chain rule to F(x, y, y')/y'.

Equ8-19.jpg (16853 bytes)

Substituting (equation 8-20) into equation (8-4) and rearranging gives a more familiar form for a second order ordinary differential equation.

Equ8-21.jpg (7544 bytes)

A more convenient way to write this equation is:

Equ8-22.jpg (5647 bytes)

The coefficients for the differential equation come from partially differentiating F.

     A special case that sometimes occurs is to have F(y, y'), i.e., F is not a function of x. For this situation it can be shown that:

 Equ8-23.jpg (4230 bytes)

This equation may be integrated once to obtain a form of the Euler equation given below which can be a more convenient starting point for problem solving.

Equ8-24.jpg (4231 bytes)

where the constant is evaluated using one of the boundary conditions.

     At this point it should be noted that the necessary conditions of the classical theory of maxima and minima have been used to locate a stationary point. This point may be a minimum, maximum or saddle point. To determine its character, sufficient conditions must be used; and these will be discussed subsequently. However, before that the following example is used to illustrate an application of the Euler equation.

 

Example 8-1

Determine the function that gives the shortest distance between two given points. Referring to Figure 8-2, we can state the problem as:

Ex8-1a.jpg (3188 bytes)

and from the figure it follows that:

ds = [(dx)2 + (dy)2]½ = [1+ (y')2]½ dx

Substituting for ds in the integral gives:

Ex8-1b.jpg (3395 bytes)

Evaluating the partial derivatives for the Euler equation.

Fy = 0               Fy 'x = 0              Fy 'y = 0               Fy 'y ' = 1 / [ 1 + (y')2]3/2

Substituting, equation 8-22 becomes:

Ex8-1c.jpg (5895 bytes)

simplifying gives

Ex8-1d.jpg (2194 bytes)

Integrating the above equation twice gives:

y = c1 x + c2

This is the equation of a straight line; and the constants, c1 and c2, are evaluated from the boundary conditions.

     Another classic problem of the calculus of variation as mentioned earlier is the Brachistochrone problem(10). The shape of the curve between two points is to be determined to minimize the time of a particle sliding along a wire without frictional resistance. The particle is acted upon only by gravitational forces as it travels between the two points. The approach to the solution is the same as for Example 8-1, and the integral for the time of travel, T, is given by Weinstock(3) as:

 Equ8-25.jpg (5880 bytes)

and the solution is in terms of the following parametric equations.

x = x0 + a[q - sin(q)]                y = y0 + a[1 - cos(q)]                       (8-26)

The details of the solution are given by Weinstock(3). The solution is the equations for a cycloid.

     The method of obtaining the Euler equation is used almost directly to obtain the results for more detailed forms of the integrand of equation (8-2). The next section extends the results for more complex problems.

top