Variations, Calculus Of
VARIATIONS, CALCULUS OF, in mathematics. The calculus of variations arose from the attempts that were made by origin mathematicians in the 17th century to solve problems of the of which the following are typical examples, (i) It Calculus. j s required to determine the form of a chain of given length, hanging from two fixed points, by the condition that its centre of gravity must be as low as possible. This problem of the catenary was attempted without success by Galileo Galilei (1638). (ii) The resistance of a medium to the motion of a body being assumed to be a normal pressure, proportional to the square of the cosine of the angle between the normal to the surface and the direction of motion, it is required to determine the meridian curve of a surface of revolution, about an axis in the direction of motion, so that the resistance shall be the least possible. This problem of the solid of least resistance was solved by Sir Isaac Newton (1687). (iii) It is required to find a curve joining two fixed points, so that the time of descent along this curve from the higher point to the lower may be less than the time along any other curve. This problem of the brachistochrone was proposed by John (Johann) Bernoulli (1696).
The contributions of the Greek geometry to the subject consist of a few theorems discovered by one Zenodorus, of whom little g arf is known. Extracts from his writings have been pre- hlstory served in the writings of Pappus of Alexandria and Theon of Smyrna. He proved that of all curves of given perimeter the circle is that which encloses the largest area. The problems from which the subject grew up have in common the character of being concerned with the maxima and minima of quantities which can be expressed by integrals of the form F(*. y, y^dx.
in which y is an unknown function of x, and F is an assigned function of three variables, viz. x, y, and the differential coefficient of y with respect to x, here denoted by y'; in special cases x or y may not be explicitly present in F, but y' must be. In any such problem it is required to determine y as a function of x, so that the integral may be a maximum or a minimum, either absolutely or subject to the condition that another integral or like form may have a prescribed value. For example, in the problem of the catenary, the integral rx\ Jxo must be a minimum, while the integral has a given value. When, as in this example, the length of the sought curve is given, the problem is described as isoperimelric. At the end of the first memoir by James (Jakob) Bernoulli on the infinitesimal calculus (1690), the problem of determining the form of a flexible chain was proposed. Gottfried Wilhelm Leibnitz gave the solution in 1691, and stated that the centre of gravity is lower for this curve than for any other of the same length joining the same two points. The first step towards a theory of such problems was taken by James Bernoulli (1697) in his solution of the problem of the brachistochrone. He pointed out that if a curve, as a whole, possesses the maximal or minimal property, every part of the curve must itself possess the same property. Beyond the discussion of special problems, nothing was attempted for many years.
The first general theory of such problems was sketched by Leonhard Euler in 1736, and was more fully developed by him in his Euler. treatise Methodus inveniendi . . . published in 1744. He generalized the problems proposed by his predecessors by admitting under the sign of integration differential coefficients of order higher than the first. To express the condition that an integral of the form C Xl F(x,y,y',y",...yW)dx J Xo may be a maximum or minimum, he required that, when y is changed into y+u, where u is a function of x, but is everywhere ' infinitely " small, the integral should be unchanged. Resolving the integral into a sum of elements, he transformed this condition into an equation of the form P flF * 3F-I aSpSy*-- +( ~ :) d^dy^>\ ' and he concluded that the differential equation obtained by equating to zero the expression in the square brackets must be satisfied. This equation is in general of the 2rcth order, and the 2n arbitrary constants which are contained in the complete primitive must be adjusted to satisfy the conditions that y, y', y,... yi- have given values at the limits of integration. If the function y is required also to satisfy the condition that another integral of the same form as the above, but containing a function <j> instead of F, may have a prescribed value, Euler achieved his purpose by replacing F in the differential equation by F+X<*>, and adjusting the constant X so that the condition may be satisfied. This artifice is known as the isoperimelric rule or rule of the undetermined multiplier. Euler illustrated his methods by a large number of examples.
The new theory was provided with a special symbolism by Joseph Louis de la Grange (commonly called Lagrange) in a series of memoirs published in 1760-62. This symbolism , was afterwards adopted by Euler (1764), and Lagrange La Z ra "Z e - is generally regarded as the founder of the calculus of variations. Euler had been under the necessity of resolving an integral into a sum of elements, recording the magnitude of the change produced in each element by a slight change in the unknown function, and thence forming an expression for the total change in the sum under consideration. Lagrange proposed to free the theory from this necessity. Euler had allowed such changes in the position of the curve, along which the integral, to be made a maximum or minimum, is taken, as can be produced by displacement parallel to the axis of ordinates. Lagrange admitted a more general change of position, which was called variation. The points of the curve being specified, by their co-ordinates, x, y, z, and differentiation along the curve being denoted, as usual, by the symbol d, Lagrange considered the change produced in apy quantity Z, which is expressed in terms of *, y, z, dx, dy, dz, d*x, . . . when the co-ordinates x, y, z are changed by " infinitely " small increments. This change he denoted by SZ, and regarded as the variation of Z. He expressed the rules of operation with by the equations SdZ=d6Z, S/Z=/5Z.
By means of these equations fdZ can be transformed by the process of integration by parts into such a form that differentials of variations occur at the limits of integration only, and the _. transformed integral contains no differentials of variations. The terms at the limits and the integrand of s y mbols - the transformed integral must vanish separately, if the variation of the original integral vanishes. The process of freeing the original integral from the differentials of variations results in a differential equation, or a system of differential equations, for the determination of the form of the required curve, and in special terminal conditions, which serve to determine the constants that enter into the solution of the differential equations. Lagrange's method lent itself readily to applications of the generalized principle of virtual velocities to problems of mechanics, and he used it in this way in the Mecanique analytique (1788). The terminology and notation of mechanics are still largely dominated by these ideas of Lagrange, for his methods were powerful and effective but they are rendered obscure by the use of " infinitely " small quantities, of which, in other departments of mathematics, he subsequently became an uncompromising opponent. The same ideas were applied by Lagrange himself, by Euler, and by other mathematicians to various extensions of the cal- E * iea - culus of variations. These include problems concerning sl " s integrals of which the limits are variable in accordance *" , with assigned conditions, the extension of Euler's rule of * ?*/ the multiplier to problems in which the variations are restricted by conditions of various types, the maxima and minima of integrals involving any number of dependent variables, such as are met with in the formulation of the dynamical Principle of Least Action, the maxima and minima of double and multiple integrals. In all these cases Lagrange's methods have been applied successfully to obtain the differential equation, or system of differential equations, which must be satisfied if the integral in question is a maximum or a minimum. This equation, or equations, will be referred to as the principal equation, or principal equations, of the problem.
The problems and method of the calculus admit of more .exact Formulation as follows: We confine our attention to the case where the sought curve is plane, and the function F _ . contains no differential coefficients of order higher than ~ orn "'"*" the first. Then the problem is to determine a curve ^"pL t joining two fixed points (* , yo) and (*,, yO so that the p rnh i fn , line integral P'F(*, y, y')dx J Xo '.aken along the curve may be a maximum or a minimum. When it is said that the integral is a minimum for some curve, it is meant that it must be possible to mark a finite area in the plane of (x, y), so that the curve in question lies entirely within this area, and the integral taken along this curve is less than the integral taken along any other curve, which joins the same two points and lies entirely within the delimited area. There is a similar definition for a maximum. The word extremum is often used to connote both maximum and minimum. The problem thus posed is known as the First Problem of the Calculus of Variations. If we begin v/ith any curve joining the fixed end points, and surround it by an area of finite breadth, any other curve drawn within the area, and joining the same end points, is called a variation of the original curve, or a varied curve. The original curve is defined by specifying y as a function of x. Necessary conditions for the existence of an extremum can be found by choosing special methods of variation.
One method of variation is to replace y by y+tu, where u is a function of x, and e is a constant which may be taken as small as we please. The function u is independent of e. It is differentiable, and its differential coefficient is continuous within the interval of . integration. It must vanish at x = x<> and at x = *i. This method of variation has the property that, when the ordinate of the curve is but slightly changed, the direction tlons. Q f tne tan g en t is but slightly changed. Such variations are called weak variations. By such a variation the integral is changed into f*F(*. y+tu, y'+(u')dx, and the increment, or variation of the integral, is ,y'+tu')-F(x, y,y')\dx.
In order that there may be an extremum it is necessary that _. f the variation should be one-signed. We expand the ex- * ' pression under the sign of integration in powers of t. The vana- jj fgt term o f t jj e ex p ans ; on contributes to the variation '*"' the term This term is called'the first variation. The variation of the integral cannot be one-signed unless the first variation vanishes. On transforming the first variation by integration by parts, and observing that u vanishes at xxo and at x=Xi, we find a necessary condition for an extremum in the form ary curves.
It is a fundamental theorem that this equation cannot hold for all admissible functions u, unless the differential equation 5Jdy'~dy = is satisfied at every point of the curve along which the integral is taken. This is the principal equation for this problem. The Station- curves that are determined by it are called the stationary curves, or the extremals, of the integral. We learn that the integral cannot be an extremum unless it is taken along a stationary curve. A difficulty might arise from the fact that, in the foregoing argument, it is tacitly assumed that y, as a function of x, is one-valued ; and we can have no a priori ground for assuming that this is the case for the sought curve. This difficulty might be met by an appeal to James Bernoulli's principle, according to which every arc of a stationary curve is a stationary curve between the end points of the arc a principle which can be proved readily by adopting such a method of variation that the arc of the curve between two points is displaced, and the rest of the curve is not. But another method of meeting it leads to important developments. This is the method Para- ^ parametric representation, introduced by K. Weiermetrlc strass. According to this method the curve is defined method ^y specifying x and y as one-valued functions of a parameter 6. The integral is then of the form v, y, x, y)d6, where the dots denote differentiation with respect to 0, and Ms a homogeneous function of x, y of the first degree. The mode of dependence of x and y upon 9 is immaterial to the problem, provided that they are one-valued functions of 6. A weak variation is obtained by changing x and y into x+eu, y+ev, where u and v are functions of which have continuous differential coefficients and are independent of 6. It is then found that the principal equations of the problem are M'dT"dx =0 ' 50"ay~ay =0- These equations are equivalent to a single equation, for it can be proved without difficulty that, when / is homogeneous of the first degree in x, y yldOdx where _ dx f i _ _ i_ ay . _ i ay 71 ydx 2 xydxdy x 2 dy 2 ' The stationary curves obtained by this method are identical with those obtained by the previous method.
The formulation of the problem by the parametric method often enables us to simplify the formation and integration of the principal equation. A very simple example is furnished by the problem: Given two points in the plane of (x, y) on the same side of the axis of x, it is required to find a curve .
joining them, so that this curve may generate, by revolu- catenola - tion, about the axis of x, a surface of minimum area. The integral to be made a minimum is f JO and the principal equation is d.
= o, of which the first integral is yx(i?+y*)-l=c, M-+ 1 } 1 ' and the stationary curves are the catenaries y = c cosh{(# a)/c\.
The required minimal surface is the catenoid generated by the revolution of one of these catenaries about its directrix.
The parametric method can be extended without difficulty so as to become applicable to more general classes of problems. A simple example is furnished by the problem of forming the equations of the path of a ray of light in a variable medium. According to Fermat's principle, the integral ftids is a oray. minimum, ds representing the element of arc of a ray, and n the refractive index. Thus the integral to be made a minimum is J 00 The equations are found at once in forms of the type these equations can be written in and, since (i 2 +y L +z")ld8 = c the usual forms of the type The formation of the first variation of an integral by means of a weak variation can be carried out without difficulty in the case of a simple integral involving any number of dependent variables and differential coefficients of arbitrarily high orders, and also in the cases of double and multiple integrals; and the quantities of the type (U, which are used in the process, may be regarded as equivalent to Lagrange's dx, Sy, . . . The same process may not, however, be applied to isoperimetric problems. If the first variation of the integral which is to be made an extremum, Kuleof subject to the condition that another integral has a pre- '* elnu '- scribed value, is formed in this way, and if it vanishes, the "P" er - curve is a stationary curve for this integral. If the prescribed value of the other integral is unaltered, its first variation must vanish; and, if the first variation is formed in this way, the curve is a stationary curve for this integral also. The two integrals do not, however, in general possess the same stationary curves. We can avoid this difficulty by taking the variations to be of the form ii+M2,_ where a and are independent constants; and we can thus obtain a completely satisfactory proof of the rule of the undetermined multiplier. A proof on these lines was first published by P. Du Bois-Reymond (1879). The rule had long been regarded as axiomatic.
The parametric method enables us to deal easily with the problem of variable limits. If, in the First Problem, the terminal point (xi, yO is movable on a given guiding curve <f(xi, yj) =o, the first variation of the integral can be written where (xi+eUi, yi+tfi) is on the curve <t>(xi, yi)=o, and u,, v, denote the values of u, v at (xi, y t ). It follows that the required curve must be a stationary curve, and that the condition _ aj>_ dx dy\ dy i ). The corresponding condition in the case of _ _ _ _ ~' must hold at the integral is found from the equations fx = f Variable limits.
to be , 3F _a/. = 3F dy 7 ' dy' ay 7 Transversals of stationary curves.
This discussion yields an important result, which may be stated as follows: Let two stationary curves of the integral be drawn from the same initial point A to points P, Q, which are near together, and let the line PQ be of length v, and make an angle w with the axis of* (fig. i). The excess of the integral taken along AQ, from A to Q, above the integral taken along AP, from A to P, is expressed, correctly to the first order in v, by the formula v cos a | F(x, y, y') + (tan o> y')^ ? f _In this formula x, y are the co-ordinates of p. * P, and y' has the value belonging to the point P and the stationary curve AP. When the coefficient of v cos w in the formula vanishes, the curve AP is said to be cut transversely by the line PQ, and a curve which cuts a family of stationary curves transversely is described as a transversal of those curves. In the problem of variable limits, when a terminal point moves on a given guiding curve, the integral cannot be an extremum unless the stationary curve along which it is taken is cut transversely by the guiding curve at the terminal point. A simple example is afforded by the shortest line, drawn on a surface, from a point to a given curve, lying on the surface. The required curve must be a geodesic, and it must cut the given curve at right angles. The problem of variable limits may always be treated by a method of which the following is the principle: In the First Problem let the initial point (%, yo) be fixed, and let the terminal point Alterna- ^ y^ m p ve on ^ fixed guiding curve Ci. Now, whatever * . the terminal point may be, the integral cannot be an extremum unless it is taken along a stationary curve. We have then to choose among those stationary curves which are drawn from (#o, yo) to points of Ci that one which makes the integral an extremum. This can be done by expressing the value of the integral taken along a stationary curve from the point (xo, yo) to the point (xi, yi) in terms of the co-ordinates x t , y\, and then making this expression an extremum, in regard to variations of x\, yi, by the methods of the differential calculus, subjecting (xi, yi) to the condition of moving on the curve Ci.
An important example of the first variation of integrals is afforded by the Principle of Least Action in dynamics. The kinetic energy T is a homogeneous function of the second degree in the differential coefficients (ft, q t , . . . q n of the co-ordinates q t , q t , . . . q n with respect to the time /, and the potential energy V is a function of these co-ordinates. The energy equation is of the form Principle T + V = E , where E is a constant. A course of the system is defined when the co-ordinates gare expressed as functionsof a single parameter 6. The action A of the system is defined as the integral C flldl, taken along a course from the initial position (g< 0) ) to the final of least action.
position (?^ 1) ), but fc and It are not fixed. The equations of motion are the principal equations answering to this integral. To obtain them it is most convenient to write *(?) for T, and to express the integral in the form where q' denotes the differential coefficient of a co-ordinate q with respect to 0, and, in accordance with the parametric method, the limits of integration are fixed, and the integrand is a homogeneous function of the q"s of the first degree. There is then no difficulty in deducing the Lagrangian equations of motion of the type cjT dT dV_ dt dq dq + dq~- These equations determine the actual course of the system. Now if the system, in its actual course, passes from a given initial position (<7< >) to a variable final position (q), the action A becomes a function of the q's, and the first method used in the problem of variable limits shows that, for every q 3A = aT dq dq' When the kinetic energy T is expressed as a homogeneous quadratic function of the momenta dT/dq, say and the differential coefficients of A are introduced instead of those of T, the energy equation becomes a non-linear partial differential equation of the first order for the determination of A as a function of the g's. This equation is *A 3A\ , ... _ -) + V = E.
ofvary- action. A complete integral of this equation would yield an expression for A as a function of the q's containing n arbitrary constants, ai, o 2 , ... a,, of which one a is merely additive to A ; and the courses of the system which are compatible with the equations of motion are determined by equations of the form =6 = 6 d A =b where the 6's are new arbitrary constants. It is noteworthy that the differential equations of the second order by which the geodesies on an ellipsoid are determined were first solved by this method (C. G. J. Jacobi, 1839).
It has been proved that every problem of the calculus of variations, in which the integral to be made an extremum contains only one independent variable, admits of a similar transformation; that is to say, the integrals of the principal Pnadple equations can always be obtained, in the way described above, from a complete integral of a partial differential * equation of the first order, and this partial differential "^"" alequation can always be formed by a process of elimination. ? C "J r These results were first proved by A. Clebsch (1858).
Among other analytical developments of the theory of the first variation we may note that the necessary and sufficient condition that an expression of the form F(x, y, y', /-">) Condition should be the differential coefficient of another expression rabillty. of the form is the identical vanishing of the expression aF d aF , d 2 aF . , , jJ ~dy~Txdy''T"dx* ~dy~"~ ' ' +( -~ 1 '" J: The result was first found by Euler (1744). A differential equation <t>(x, y, y', y") =o is the principal equation answering to an integral of the form JF(x,y,y')dx if the equation ~ox ay* = ay is satisfied identically. In the more general case of an equation of the form Condlttom that a entlal equation may arise from a problem of the calculus of varia- <t>(x, y, y', . .y n) ) o tions.
the corresponding condition is that the differential expression obtained by Lagrange's process of variation, viz., a* must be identical with the " adjoint " differential expression d<j> d f d<f> This matter has been very fully investigated by A. Hirsch (i{ , t .
To illustrate the transformation of the first variation of multiple integrals we consider a double integral of the form ff <!>(*, y, f, P, 2- r,s, f)dxdy, taken over that area of the z plane which is bounded by a variation closed curve s'. Here p, q, . . . t denote the partial of a differential coefficients of z with respect to x and y of j oua i e the first and second orders, according to the usual nota- integral. tion. When z'is changed into z+ov, the terms of the first order in e are rr IW i W 9w i W d W VJ ( W +-dp -dx+fq-S^+dr Each term must be transformed so that no differential coefficients of w are left under the sign of double integration. We exemplify the process by taking the term containing dHu/dx 2 . We have d (ty dw\ d (W\ dw ) , , ai (dr -dx) ~Tx (W)"K\ dxdy 3A "I w) J The first two terms are transformed into a line integral taken round the boundary s', and we thus find M \ *-&%) \ d *'+ff^ (?) "* where v denotes the direction of the normal to the edge s' drawn outwards. The double integral on the right-hand side contributes a term to the principal equation, and the line integral contributes terms to the boundary conditions. The line integral admits of further transformation by means of the relations dw dw cos(x, x)cos(y, ~v)f cos(* ( y)cos(y, Ogp- It becomes cos(*.
-cos(Gr.,, ,< In forming the first term within the square brackets we then use the relations jpcos(x, v) = -^,cos(y, v),- d -pcos(y, >) = ^cos(x, v), d d<i , d d<fr , ->ddt a?Tr = ~ cos(y ' y) diH7+ cos ^' 'rgyW' where p' denotes the radius of curvature of the curve s'.
The necessity of freeing the calculus of variations from dependence upon the notion of infinitely small quantities was realized by Lagrange, and the process of discarding such quantities was partially carried out by him in his Th&orie des functions analytiques (!797)- I* 1 accordance with the interpretation of differentials which he made in that treatise, he interpreted the variation of an integral, as expressed by means of his symbol 5, as the first term, or the sum of the terms of the first order, in the development in series of the complete expression for the change that is made in the value of the integral when small finite changes are made in the variables. The quantity which had been regarded as the _. variation of the integral came to be regarded as the first . variation, and the discrimination between maxima and '"variation ram i ma came to be regarded as requiring the investigation ' of the second variation. The first step in this theory had been taken by A. M. Legendre in 1786.
In the case of an integral of the form Legendre defined the second variation as the integral To this expression he added the term I |a(fry) 2 ', which vanishes identically because Sy vanishes at x=x<, and at x=x\. He took a to satisfy the equation d 2 F /a 2 F do and thus transformed the expression for the second variation to where From this investigation Legendre deduced a new condition for the existence of an extremum. It is necessary, not only that the varia- tion should vanish, but also that the second variation Le ~ , should be one-signed. In the case of the First Problem am S Legendre concluded that this cannot happen unless d 2 F/d/ 2 has the same sign at all points of the stationary curve between the end points, and that the sign must be +for a minimum and for a maximum. In the application of the perametric method the function which has been denoted by/i takes the place of d 2 F/d;y' 2 .
The transformation of the second variations of integrals of various types into forms in which their signs can be determined by inspec- tion subsequently became one of the leading problems of the calculus of variations. This result came about chiefly through the publica- ,. tion in 1837 of a memoir by C. G. J. Jacobi. He trans- Jaco . formedLegendre'sequationfortheauxiliaryfunctionainto a linear differential equation of the second order by the substitution d 2 F d 2 F I dw dydy +a ~ dy' 2 w dx' and he pointed out that Legendre's transformation of the second variation cannot be effected if the function w vanishes between the limits of integration. He pointed out further, that if the stationary curves of the integral are given by an equation of the form y = <t>(x, a, b), where a, 6 are arbitrary constants, the complete primitive of the equation for w is of the form . dtt> . D d< zo=A^ r +B-jr, oa ob where A, B are new arbitrary constants. Jacobi stated these propositions without proof, and the proof of them, and the extension of the results to more general problems, became the object of numerous investigations. These investigations were, for the most part, and for a long time, occupied almost exclusively with analytical developments; and the geometrical interpretation which Jacobi had given, and which he afterwards emphasized in his Vorlesungen uber Dynamik, was neglected until rather recent times. According to this interpretation, the stationary curves which start from a point (*o, 3"o) have an envelope; and the integral of F, taken along such a curve, cannot be an extremum if the point (0, 170) where the curve touches the envelope lies on the arc between the end points. Pairs of points such as (*o, yo) and (o, TJO) were afterwards called conjugate points by Weierstrass. The proof that the in- _ tegral canaot be an extremum if the arc of the curve between the fixed end points contains a pair of conjugate /*< points was first published by G. Erdmann (1878).
Examples of conjugate points are afforded by antipodal points on a Sphere, the conjugate foci of geometrical optics, the kinetic foci of analytical dynamics. If the terminal points are a pair of conjugate points, the integral is not in general an extremum; but there is an exceptional case, of which a suitably chosen arc of the equator of an oblate spheroid may serve as an example. In the problem of the catenoid a pair of conjugate points on any of the catenaries, which are the stationary curves of the problem, is such that the tangents to the catenary at the two points A and A' meet on the axis of revolution (fig. 2). When both the end points of the required curve move on fixed guiding curves Co, Ci, a stationary curve C, joining a point Ao of Co to a point Ai of Ci, cannot yield an extremum unless it is cut transversely by Co at Ao and by Ci at Ai. The envelope of stationary curves which set out from Co towards Ci, and are cut transversely by Co at points near Ao, meets C at a point Do; and the envelope of stationary curves which proceed from Co to Ci, and are cut transversely by Ci at points near Ai, meets C at a point DI. The curve C, drawn from Ao to AI, cannot yield an extremum if Do or Dj lies between Ao and AI, or if Do lies between Ai and DI. These results are due to G. A. Bliss (1903). A simple example is afforded by the shortest line on a Sphere drawn from one small circle to another. In fig. 3 Do is that FIG. 2.
FIG. 3.
Sources of Welerstrass's theory.
pole of the small circle AoBo which occurs first on great circles cutting AoBo at right angles, and proceeding towards AiBi; DI is that pole of the small circle AiBi which occurs first on great circles cutting AiBi at right angles, and drawn from points of A Bo towards AiBi. The arc AoAi is the required shortest line, and it is distinguished from BpBi by the above criterion.
Jacobi's introduction of conjugate points is one of the germs from which the modern theory of the calculus of variations has sprung. Another is a remark made by Legendre (1786) in regard to the solution of Newton's problem of the solid of least resistance. This problem requires that a curve be found for which the integral W*(i+y*)-*dy should be a minimum. The stationary curves are given by the equation yy"( !+/")- = const., a. result equivalent to Newton's solution of the problem; but Legendre observed that, if the integral is taken along a broken line, consisting of two straight lines equally inclined to the axis of x in opposite senses, the integral can be made as small as we please by sufficiently diminishing the angle of inclination. Legendre's remark amounts to admitting a variation of Newton's curve, which is not a weak variation. Variations which are not weak are such that, while the points of a curve are but slightly displaced, the tangents undergo large changes of direction. They are distinguished as strong variations. A general theory of strong variations in connexion with the First Problem, and of the conditions which are sufficient to secure that the integral taken along a stationary curve may be an extremum, was given by Weierstrass in lectures. He delivered courses of lectures on the calculus of variations in several years between 1865 and 1889, and his chief discoveries in the subject seem to have been included in the course for 1879. Through these lectures his theory became known to some students and teachers in Europe and America, and there have been published a few treatises and memoirs devoted to the exposition of his ideas.
In the First Problem the following conditions are known to be necessary for an extremum. I. The path of integration must be a stationary curve. II. The expression 3 2 F/d;y' 2 , or the expression denoted by /i in the application of the parametric method, must not change sign at any point of this curve between the end points. III. The arc of the curve between the end points must not contain a pair of conjugate points. All these results are obtained by using weak variations. Additional ' C Field of stationary carves.
results, relating to strong as well as weak variations, are obtained by a method which permits of the expression of the variation of an integral as a line integral taken along the varied curve. Let A, B be the end points, and let the stationary curve AB be drawn. If the end points A, B are not a pair of conjugate points, and if the point conjugate to A does not lie on the arc AB, then we may find a point A', on the backward continuation of the stationary curve BA beyond A, so near to A that the point conjugate to A' lies on the forward continuation of the arc AB beyond B. This being the case, it is possible to delimit an area of finite breadth, so that the arc AB of the stationary curve joining A, B lies entirely within the area, and no two stationary curves drawn through A' intersect within the area. Through any point of such an area it is possible to draw one, and only one, stationary curve which passes through A'. This family of stationary curves is said to constitute afield of stationary curves about the curve AB. We suppose that such a field exists, and that the varied curve AQPB lies entirely within the delimited area. The variation of the integral fF(x, y, y')dx is identical with the line integral of F taken round a contour consisting of the varied curve AQPB and the stationary curve AB, in the sense AQPBA. The line integral may, as usual, be replaced by the sum of line integrals taken round a series of cells, the external boundaries of the set of cells being identical with the given contour, and the internal boundaries of adB jacent cells being traversed twice in opposite senses. We may choose a suitable FIG. 4. set of cells as follows.
Let Q, P be points on the varied curve, and let A'Q, A'P be the stationary curves of the field which pass through Q, P. Let P follow Q in the sense AQPB in which the varied curve is described. Then the contour consisting of the stationary curve A'Q, from A' to Q, the varied curve QP, from Q to P, and the stationary curve A'P, from P to A', is the boundary of a cell (fig. 4). Let us denote the integral of F taken along a stationary curve by round brackets, thus (A'Q), and the integral of F taken along any other curve by square brackets, thus [PQ]. If the varied curve is divided into a number of arcs such as QP we have the result ]-(AB)=Z{(A'Q)-f[QP]-(A'P)!, and the right-hand member can be expressed as a line integral taken along the varied curve AQPB.
To effect this transformation we seek an approximate expression for the term (A'Q) +[QP] - (A'P) when Q, P are near together. Let As denote the arc QP, and $ the angle which the tangent at P to the varied curve, in the sense from A to B, makes with the axis of x (fig. 5). Also let <t> be the angle which the tangent at P to the stationary curve A'P, in the sense from A' to P, makes with the axis of x. We evaluate (A'Q) (A'P) approximately by means of a result which we obtained in connexion with the problem of variable limits. Observing that the angle here denoted by ^ equivalent to the angle formerly General expression for variation of an Integral.
FIG. 5.
Also we have denoted by *+<> (cf. fig. l), while tan < is equivalent to the quantity formerly denoted by y', we obtain the approximate equation (A'Q) -(A'P) = -As.cos * \ F(*,y-#)
which is correct to the first order in As.
] = As . cos ^(x,y, tan correctly to the same order. Hence we find that, correctly to the first order in As, (A'Q)-HQP]-(A'P)=E(x,y, tan *, tan where E(x, y, tan <t>, tan y, tan ^) When the parametric method is used the function E takes the form where X, ^ are the direction cosines of the tangent at P to the curve AQPB, in the sense from A to B, and /, m are the direction cosines ol the tangent at P to the stationary curve A'P, in the sense from A' to P. The function E, here introduced, has been called Weierstrass's excess function. We learn that the variation of the integral, that , . is to say, the excess of the integral of F taken along the strass's varied curve above the integral of F taken along the excess original curve, is expressible as the line integral /Eds function taken along the varied curve. We can therefore state a sufficient (but not necessary) condition for the existence ol an extremum in the form: When the integral is taken along a necessary conditions.
itationary curve, and there is no pair of conjugate points on the arc of the curve terminated by the given end points, the integral is certainly an extremum if the excess function has the same sign at all points of a finite area containing the whole of this arc within t. Further, we may specialize the excess function by identifying A' with A, and calculating the function for a point P on the arc AB of the stationary curve AB, and an arbitrary direction of the tangent at P to the varied curve. This process is equivalent to theintroduction of a particular type of strong variation. We may in fact take, as a varied curve, the arc AQ of a neighbouring stationary curve, the straight line QP drawn from Q to a point of the arc AB, and the arc PB of the stationary curve AB (fig. 6). The sign of the variation is then the same as that of the function JL(x, y, tan <t>, tan ^), where (x, y) is the point P, ^ is the , angle which the straight "~ "" line QP makes with the F IG. 6.
axis of x, and <t> is the angle which the tangent at P to the curve APB makes with the same axis. We thus arrive at a new necessary (but not sufficient) condition for the existence of an extremum of the integral /Fds, viz. the specialized excess function, so calculated, must not change sign between A and B.
The sufficient condition, and the new necessary condition, associated with the excess function, as well as the expression for the variation as /Eds, are due to Weierstrass. In applications to special problems it is generally permissible to identify A' with A, and to regard QP as straight. The direction of QP must be such that the integral of F taken along it is finite and real. We shall describe such directions as admissible. In the statement of the sufficient condition, and the new necessary condition, it is of course understood that the direction specified by $ is admissible. The excess function generally vanishes if ^ = <, but it does not change sign. It can be shown without difficulty that, when ^ is very nearly equal to <#>, the sign of E is the same as that of connected wHh the excess function.
(tan ^ tan <) 2 cos ,. . ,, .
\oy J y' = tan $ and thus the necessary condition as to the sign of the excess function includes Legendre's condition as to the sign of d'F/sy 2 .
Weierstrass's conditions have been obtained by D. Hilbert from the observation that, if p is a function of x and y, the integral taken along i curve joining two fixed points, has the same value for all such curves, provided that there is a field of stationary curves, and that p is the gradient at the point (x, y) of that stationary curve of the field which passes through this point.
An instructive example of the excess function, and the conditions connected with it, is afforded by the integral Example of the orfy*x 3 y The first integral of the principal equation is y ii- = const., and the stationary curves include the axis of x, straight lines parallel to the axis of y, and the family of exponential curves y = ae". A field of stationary curves is expressed by the equation y=y exp {c(*-x )), and, as these have no envelope other than the initial point (x a , yo), there are no conjugate points. The function /i is 6ij/~ 4 , and this is positive for curves going from the initial point in the positive direction of the axis of *. The value of the excess function is y'cos ^(cotV 3 cotV+2 tan ^ cotV).
The directions ^ = o and ^ = ir are inadmissible. On putting i/- = JT we get 2y 2 cot 3 <t>; and on putting ^ = i>ir we get 2}> 2 cot 3 0. Hence the integral taken along AQ'PB is greater than that taken along APB, and the integral taken along AQPB is less than that taken along APB, when Q'Q are sufficiently near to P on the ordinate of P (fig. 7). It follows that the ., integral is neither a maximum ' q /B nor a minimum.
It has been proved by Weierstrass that the excess function cannot be one-signed if the function / of the parametric method is a rational function of x and y. This result includes the above example, and the problem of the solid of least resistance, for which, as Legendre had seen, there can be no solu- FIG. 7.
tion if strong variations are admitted. As another example of the calculation of excess functions, it may be noted that the value o" the excess function in the problem of the catenoid is Field of stationary curvet and transversals.
In general it is not necessary that a field of stationary curves should consist of curves which pass through a fixed point. Any family of stationary curves depending on a single parameter may constitute a field. This remark is of importance in connexion with the adaptation of Weierstrass's results to the problem of variable limits. For the purpose of this adaptation A. Kneser (1900) introduced the family of stationary curves which are cut transversely by an assigned curve. Within the field of these curves we can construct the transversals of the family ; that is to say, there is a finite area of the plane, through any point of which there passes one stationary curve of the field and one curve which cuts alj the stationary curves of the field transversely. These curves provide a system of curvilinear co-ordinates, in terms of which the value of (Fdx, taken along any curve within the area, can be expressed. The Value of the integral is the same for all arcs of stationary curves of the field which are intercepted between any two assigned transversals. In the above discussion of the First Problem it has been assumed that the curve which yields an extremum is an arc of a single curve, which must be a stationary curve. It is conceivable that the required curve might be made up of a finite number of arcs of different stationary curves meeting each other at finite angles. It can be shown that such a broken curve cannot yield an extremum unless both the expressions dF/dy' and F y'(dF/dy') are continuous at the corners. In the parametric method dfjdx and df/dy must be continuous at the corners. This result limits very considerably _. the possibility of such discontinuous solutions, though it does not exclude them. An example is afforded by the solutions Problem of the catenoid. The axis of x and any lines ' parallel to the axis of y satisfy the principal equation; and the conditions here stated show that the only discontinuous solution of the problem is presented by the broken line ACDB (fig. 8). A broken line like AA'B'B is excluded. Discontinuous solutions have generally been supposed to be of special importance in cases where the required curve is restricted by the condition of not crossing the boundary of a certain limited area. In such cases part of the boundary may have to be taken as part of the curve. Problems of this kind were investigated in detail by J. Steiner and I. Todhunter. In recent times tne theory has been much extended by C. Caratheodory.
In any problem of the calculus of variations the first step is the formation of the principal equation or equations; and the second Exlff. ste P ' s the solution of the equation or equations, in accordance with the assigned terminal or boundary conditions. theorem* If this solution cannot be effected, the methods of the calculus fail to answer the question of the existence or nonexistence of a solution which would yield a maximum or minimum of the integral under consideration. On the other hand, if the existence of the extremum could be established independently, the existence of a solution of the principal equation, which would also satisfy the boundary conditions, would be proved. The most famous example of such an existence-theorem is Dirichlet's principle, according to which there exists a function V, which satisfies the equation at all points within a closed surface S, and assumes a given value at each point of S. The differential equation is the principal equation answering to the integral axis of x taken through the volume within the surface S. The theorem of the existence of V is of importance in all those branches of mathematical physics in which use is made of a potential function, satisfying Laplace's equation ; and the two-dimensional form of the theorem is of fundamental importance in the theory of functions of a complex variable. It has been proposed to establish the existence of V by means of the argument that, since I cannot be negative, there must Dlrkh- be - amon the functions which have the prescribed let's ' boundary values, some one which gives to I the smallest principle P os , s , ible value - This unsound argument was first exposed by Weierstrass. He observed that precisely the same argument would apply to the integral fx*y">dx taken along a curve from the point (-1, a) to the point (i, b). On the one hand, the principal equation answering to this integral can be solved, and it can be proved that it cannot be satisfied by any function y at all points of the interval -i <*< i if y has different values at the end points. Un the other hand, the integral can be made as small as we please by a suitable choice of y. Thus the argument fails to distinguish between a minimum and an inferior limit (see FUNCTION). In order to prove Dirichlet's principle it becomes necessary ft) devise a proof that, in the case of the integral I, there cannot be a limit of this kind. This has been effected by Hilbert for the two- dimensional form of the problem.
BIBLIOGRAPHY. The literature of the subject is very extensive, and only a few of the more important works can be cited here. The earlier history can be gathered from M. Cantor's Geschichte d. Math. Bde. 1-3 (Leipzig, 1894-1901). I. Todhunter's History of the Calculus of Variations (London, 1861) gives an account of the various treatises and memoirs published between 1760 and 1860. E. Pascal's Cakplo dette variazioni . . . (Milan, 1897; German translation, Leipzig, 1899) contains a brief but admirable historical summary of the pre-Weierstrassian theory with references to the literature. A general account of the subject, including Weierstrass's theory, is given by A. Kneser, Ency. d. math. Wiss. ii. A 8; and an account of various extensions of Weitrstrass's theory and of Hilbert's work is given by E. Zermelo and H. Hahn, Ency. d. math. Wiss. ii. A 8a (Leipzig, 1904). The following treatises may be mentioned : L. Euler, Methodus inveniendi lineas curvas maximi minimive proprietatr gaudentes . (Lausanne and Geneva, 1744) ; J. H. Jellett, An Elementary Treatise on the Calculus of Variations (Dublin, 1850); E. Moigno and L. Lindelof, " Lecons sur le calc. diff.et int., " Calculdes variations (Paris, 1861), t. iv. ; L. B. Carll, A Treatise on the Calculus of Variations (London, 1885). E. Pascal's book cited above contains a brief systematic treatise on the simpler parts of the subject. A. Kneser, Lehrbuch d. Variationsrechnung (Brunswick, 1900) ; H. Hancock, Lectures on the Calculus of Variations (Cincinnati, 1904); and O. Bolza, Lectures on the Calculus of Variations (Chicago, 1904), give accounts of Weierstrass's theory. Kneser has made various extensions of this theory. Bojza gives an introduction to Hilbert's theories also. The following memoirs and monographs may be mentioned : J. L. Lagrange, ' Essai sur une npuvelle me'thode pour determiner les max. et les min. des formules integrates ind6fimes," Misc. Taur. (1760-62), t. ii., or (Euvres, t. i. (Paris, 1867); A. M. Legendre, " Sur la maniere de distinguer les max. des min. dans le calc. des var.," Mjm. Paris Acad. (1786); C. G. J. Jacobi, " Zur Theorie d. Variationsrechnung . . . ," J. f. Math. (Crelle), Bd. xvii. (1837), or Werke, Bd. iv. (Berlin, 1886); M. Ostrogradsky, " M^m. sur le calc. des var. des integrates multiples," Mem. St Petersburg Acad. (1838); J. Steiner, " Einfache Beweise d. isoperimetrischen Hauptsatze, J. f. Math. (Crelle), Bd. xviii. (1839); O. Hesse, " Uber d. Kriterien d. Max. u. Min. d. einfachen Integrate," J. f. Math. (Crelle), Bd. liv. (1857); A. Clebsch, "Uber diejenigen Probleme d. Variationsrechnung welche nur eine unabhangige Variable enthalten," /. /. Math. (Crelle), Bd. Iv. (1858), and other memoirs in this volume and in Bd. Ivi. (1859); A. Mayer, Beitrdge z. Theorie d. Max. u. Min. einfacher Integrale (Leipzig, 1866), and " Kriterien d. Max. u. Min. . . ," /. /. Math. (Crelle), Bd. Ixix. (1868); I. Todhunter, Researches in the Calc. of Var. (London, 1871); G. Sabinine, " Sur ... les max . . . des integrales multiples," Bull. Si Petersburg Acad. (1870), t. xv., and " Developpements . . . pour ... la discussion de la variation seconde des integrates . . . multiples," Bull. d. sciences math. (1878); G. Frobenius, " Uber adjungirte lineare Differentialausdriicke," J. f. Math. (Crelle), Bd. Ixxxv. (1878); G. Erdmann, " Zur Untersuchung d. zweiten Variation einfacher Integrale," Zeitschr. Math. u. Phys. (1878), Bd. xxiii. ; P. Du Bois-Reymond, " Erlauterungen z. d. Anfangsgriinden d. Variationsrechnung," Math. Ann. (1879), Bd. xv. ; L. Scheeffer, " Max. u. Min. d einfachen Int.," Math. Ann. (1885), Bd. xxv., and " Uber d. Bedeutung d. Begriffe Max. . . ," Math. Ann. (1886), Bd. xxvi. ; A. Hirsch, " Uber e. charakteristische Eigenschaft d. Diff.-Gleichungen d. Variationsrechnung," Math. Ann. (1897), Bd. xlix. The following deal with Weierstrassian and other modern 'developments: H. A. Schwarz, " Uber ein die Flachen kleinsten Flacheninhalts betreffendes Problem d. Variationsrechnung," Festschrift on the occasion of Weierstrass's 7Oth birthday (1885), Werke, Bd. i. (Berlin, 1890); G. Kobb, " Sur les max. et les min. des int. doubles," Ada Math. (1892-93), Bde. xyi., xvii.; E. Zermelo, " Untersuchungen z. Variationsrechnung,"Z)ijerfa/zon (Berlin, 1894) ; W. F. Osgood, " Sufficient Conditions in the Calc. of Var.," Annals of Math. (1901), vol. ii., also, " On the Existence of a Minimum . . . ," and " On a Fundamental Property of a Minimum . . . ," Amer. Math. Soc. Trans. (1901), vol. ii. ; D. Hilbert, " Math. Probleme," Gottingen Nachr. (1900), and "Uber das Dirichlet'sche Prinzip," Gottingen Festschr. (Berlin, 1901); G. A. Bliss, " Jacobi's Criterion when both End Points are variable," Math. Ann. (1903), Bd. Iviii. ; C. Caratheodory, " Uber d. diskontinuirlichen Losungen i. d. Variationsrechnung," Dissertation (Gottingen, 1904); and " Uber d. starken Max . . . ," Math. Ann. (1906), Bd. Ixii. (A. E. H. L.)
Note - this article incorporates content from Encyclopaedia Britannica, Eleventh Edition, (1910-1911)