A Mathematical Journal Vol. 7, No 3, (27 - 37). December 2005. Sufficiency of the maximum principle for time optimality H. O. Fattorini Department of Mathematics, University of California Los Angeles, California 90095-1555 hof@math.ucla.edu ABSTRACT For infinite dimensional linear systems, Pontryagin’s maximum principle is shown to be sufficient for time optimality with conditions on the initial condition and on the target. These conditions cannot be given up and are shown to be best possible by means of counterexamples. RESUMEN Para sistemas lineales en dimensión infinita, el principio del máximo de Pon- tryagin es suficiente para alcanzar optimalidad en el tiempo con condiciones en el valor inicial y el final. Estas condiciones no se pueden relajar y se muestra que son las mejores posibles, por medio de contraejemplos. Key words and phrases: linear control systems in Banach spaces, time optimal problem. Math. Subj. Class.: 93E20, 93E25. 1 Introduction. Consider the time optimal problem of driving the solution y(t) of y′(t) = Ay(t) + u(t) , y(0) = ζ (1.1) 28 H. O. Fattorini 7, 3(2005) from the initial point ζ to a point target, y(T ) = ȳ (1.2) with maximum-norm bound ‖u(t)‖ ≤ 1 a. e. in 0 ≤ t ≤ T (1.3) in minimum time T ; A is the infinitesimal generator of a strongly continuous semi- group S(t) in a Banach space E and the controls u(t) are strongly measurable (so that, in view of (1.3), belong to the unit ball of L∞(0, T ; E)). Solutions or trajectories y(t) = S(t)ζ + ∫ t 0 S(t − σ)u(σ)dσ of (1.1) are named y(t) = y(t, ζ, u) and controls satisfying (1.3) are called admissible. Let Z be an arbitrary linear space with E∗ ⊆ Z. We say that Z is a multiplier space if (i) S(t)∗ is defined in Z,(ii) S(t)∗Z ⊆ E∗ for t > 0. A control ū(t) in the interval 0 ≤ t ≤ T satisfies the weak maximum principle if there exists z in a multiplier space Z such that S(t)∗z is not identically zero in 0 < t ≤ T and 〈S(T − t)∗z, ū(t)〉 = max ‖u‖≤1 〈S(T − t)∗z, u〉 a. e. in 0 ≤ t ≤ T , (1.4) where 〈· , ·〉 is the duality of the space E and the dual E∗. The control satisfies the strong maximum principle if (1.4) holds and∫ T 0 ‖S(t)∗z‖E∗dt < ∞ . (1.5) The space Z(T ) consists of all multipliers that satisfy 1 (1.5). In Hilbert space, (1.4) is equivalent to ū(t) = S(T − t)∗z ‖S(T − t)∗z‖ (1.6) whenever the denominator is not zero.It is known [3] that if the control ū(t) drives ζ ∈ E to ȳ = y(T, ζ, ū) ∈ D(A) then the strong maximum principle (1.4)-(1.5) is a necessary condition for time optimality. It is also known [4] that(1.4)-(1.5) is a sufficient condition if ζ = 0 or ȳ = y(T, ζ, ū) = 0; then ū(t) drives ζ to ȳ time optimally.2 We prove below (Theorem 1.2) that these conditions on the initial and final point of the trajectory can be relaxed to one of the two assumptions ζ ∈ D(A), ‖Aζ‖ ≤ 1 or ȳ ∈ D(A), ‖Aȳ‖ ≤ 1 , (1.7) 1The semigroup S(t)∗ may not be strongly continuous, but in all cases the norm ‖S(t)∗‖ is lower semicontinuous, thus theintegral (1.5) makes sense. The spaces Z(T ) are the same for all T > 0; condition (1.5) only bears on the behavior of ‖S(t)∗‖ near zero. 2The weak maximum principle(1.4) is not a sufficient condition for time optimality; for a coun- terexample, see[6] 7, 3(2005) Sufficiency of the maximum principle for time optimality 29 (the first with an additional condition on the adjoint semigroup). That restrictions on the initial condition ζ or the target ȳ cannot be completely given up is illustrated with several examples, two of which show that conditions (1.7) are the best possible of their kind. We also see (in Example 4.2) that restrictions on ‖ζ‖, ‖ȳ‖ (rather than on ‖Aζ‖, ‖Aȳ‖) do not guarantee sufficiency of the maximum principle for time optimality. Remark 1.1. If S(t) is a group or, more generally, if S(T )E = E (t > 0) then the condition ȳ ∈ D(A) is not required to show that the maximum principle is a necessary condition for time optimality; moreover, Z(T ) = E∗. Sufficiency of the maximum principle, however, requires the same conditions as those in the general case. 2 Sufficiency of the maximum principle. Let R∞(T ) ⊆ E be the space of all elements of the form y = y(T, 0, u) = ∫ T 0 S(T − σ)u(σ)dσ , u(·) ∈ L∞(0, T ; E) . (2.1) The norm ‖y‖R∞(T ) is the infimum of ‖u(·)‖L∞(0,T ;E) for all u(·) that satisfy (2.1); in other words, R∞(T ) is the quotient of L∞(0, T ; E) by the closed subspace charac- terized by y(T, 0, u) = 0. An element z ∈ Z(T ) defines a bounded linear functional ξz in R∞(T ) through the formula 〈〈ξz, y〉〉 = ∫ T 0 〈S(T − σ)∗z, u(σ)〉dσ (2.2) where y and u(·) are related by (2.1) and 〈〈·, ·〉〉 indicates the duality of the space R∞(T ) and its dual R∞(T )?; the norm of ξz satisfies ‖ξz‖R∞(T )? = ∫ T 0 ‖S(t)∗z‖E∗dt . (2.3) Theorem 2.1. Assume that ū(t) satisfies (1.4) - (1.5) and that either (a) ȳ = y(T, ζ, ū) ∈ D(A), ‖Aȳ‖ < 1 , or (b) ζ ∈ D(A), ‖Aζ‖ < 1 , S(t)∗z 6= 0 in 0 ≤ t < T . (2.4) Then ū(t) drives ζ to ȳ time optimally in 0 ≤ t ≤ T. Proof of case (a). Assume ū(·) does not drive ζ to ȳ time optimally. Then there exists δ > 0 and a control ũ(·) ∈ L∞(0, T − δ; E), ‖ũ(·)‖L∞(0,T−δ;E) ≤ 1, that drives ζ to ȳ in time T − δ. The control 30 H. O. Fattorini 7, 3(2005) v(σ) = { ũ(σ) (0 ≤ σ < T − δ) −Aȳ (T − δ ≤ σ ≤ T ) (2.5) satisfies ‖v(·)‖L∞(0,T ;E) ≤ 1 . We have ȳ − S(t − (T − δ))ȳ = − ∫ t−(T−δ) 0 S(σ)Aȳ dσ = − ∫ t−(T−δ) 0 S(t − (T − δ) − σ)Aȳ dσ = ∫ t T−δ S(t − σ)(−Aȳ) dσ (T − δ ≤ t ≤ T ) (2.6) hence the trajectory y(t, ζ, v) starts at ζ, reaches ȳ at time T − δ and stays at ȳ for T − δ ≤ t ≤ T ; y(t, ζ, v) = ȳ (T − δ ≤ t ≤ T ) . (2.7) This can be also be seen noting that if y(t) = ȳ then we have y′(t) − Ay(t) = −Aȳ in T − δ ≤ t ≤ T. The maximum principle (1.4) is equivalent to∫ T 0 〈S(T − σ)∗z, u(σ)〉dσ ≤ ∫ T 0 〈S(T − σ)∗z, ū(σ)〉dσ (u(·) ∈ L∞(0, T ; E), ‖u(·)‖L∞(0,T ;E) ≤ 1) . (2.8) In terms of the linear functional ξz in (2.2), this is〈〈 ξz, ∫ T 0 S(T − σ)u(σ)dσ 〉〉 ≤ 〈〈 ξz, ∫ T 0 S(T − σ)ū(σ)dσ 〉〉 (u(·) ∈ L∞(0, T ; E), ‖u(·)‖L∞(0,T ;E) ≤ 1) . (2.9) We have y(T, ζ, v) = y(T, ζ, ū), thus∫ T 0 S(T − σ)v(σ)dσ = ∫ T 0 S(T − σ)ū(σ)dσ , (2.10) and it follows from (2.9) that〈〈 ξz, ∫ T 0 S(T − σ)u(σ)dσ 〉〉 ≤ 〈〈 ξz, ∫ T 0 S(T − σ)v(σ)dσ 〉〉 (u(·) ∈ L∞(0, T ; E), ‖u(·)‖L∞(0,T ;E) ≤ 1) , (2.11) which, being equivalent to (1.4), gives 〈S(T − t)∗z, v(t)〉 = max ‖u‖≤1 〈S(T − t)∗z, u〉 a. e. in 0 ≤ t ≤ T . (2.12) 7, 3(2005) Sufficiency of the maximum principle for time optimality 31 We have S(T − σ)∗z 6= 0 near3 T, hence (2.12) implies ‖v(σ)‖ = 1 near T. This is a contradiction, since by hypothesis ‖v(σ)‖ = ‖Aȳ‖ < 1. Proof of case (b). This time we define v(σ) = −Aζ (0 ≤ σ ≤ δ) ũ(σ − δ) (δ ≤ σ ≤ T ) . (2.13) As in (2.6) we have ζ − S(t)ζ = ∫ t 0 S(t − σ)(−Aζ)dσ , hence the trajectory y(t, ζ, v) stays at ζ for 0 ≤ t ≤ δ, y(t, ζ, v) = ȳ (0 ≤ t ≤ δ) , and then starts for the target ȳ, which hits at time T. The proof ends in the same way as that of (a) noting that S(T − σ)∗z 6= 0 in 0 ≤ σ ≤ T (in particular, in 0 ≤ σ ≤ δ) hence (2.12) implies ‖v(σ)‖ = 1 in 0 ≤ σ ≤ δ, in contradiction to the fact that ‖v(σ)‖ = ‖Aζ‖ < 1 in 0 ≤ σ ≤ δ. 3 Counterexamples, I. To see that (2.4) cannot be relaxed, we have Example 3.1.Consider the one dimensional system y′(t) = −ay(t) + u(t), y(0) = ζ (3.1) with a > 0. We have S(t) = e−at = S(t)∗, thus controls satisfying (1.4) with z 6= 0 are of one of the two forms ū(t) = { 1 if z > 0 , −1 if z < 0 . (3.2) For the initial condition and target ζ = 1/a, ȳ = 1/a we have∫ T 0 S(T − σ) · 1 dσ = ∫ T 0 e−a(T−σ)dσ = 1 − e−aT a = ȳ − S(T )ζ so that the first control in (3.2) drives ζ to ȳ in any time T ≥ 0; in other words, y(T, ζ, ū) = ȳ for all T ≥ 0. None of these drives is time optimal except forthe one where T = 0. 3The semigroup equation for the adjoint semigroup S(t)∗ implies: if S(T − t)∗z = 0, then S(T − σ)∗z = S(t − σ)∗S(T − t)∗z = 0 for σ ≤ t. Accordingly, unless S(T − t)∗z 6= 0 in an interval (ρ, T ), ρ < T,S(T − t)∗z will be identically zero in 0 < t ≤ T. 32 H. O. Fattorini 7, 3(2005) Example 3.2. For another counterexample (or, rather, family of counterexamples) we use an arbitrary unitary group S(t) in Hilbert space. Here we have Z(T ) = E (see Remark 1.1), S(t)∗ = S(−t) = S(t)−1, ‖S(t)y‖ = ‖y‖. Controls satisfying the maximum principle are given by (1.6) (the denominator satisfies ‖S(T −t)∗z‖ = ‖z‖). Assuming (as we may) that ‖z‖ = 1 we have∫ T 0 S(T − σ) S(T − σ)∗z ‖S(T − σ)∗z‖ dσ = ∫ T 0 S(T − σ)S(T − σ)∗z dσ = T z (3.3) so that the control (1.6) drives ζ to ȳ in time T if and only if T is a solution of the equation T z = ȳ − S(T )ζ . (3.4) This equation implies the scalar equation T = ‖S(T )ζ − ȳ‖ (3.5) and, conversely, if T > 0 is a solution of (3.5) it is clear that (3.4) will holdwith z = ȳ − S(T )ζ ‖ȳ − S(T )ζ‖ . (3.6) Theorem 3.3. Assume (3.5) has only one nonnegative solution T. Then the control (1.6) drives ζ to ȳ in optimal time T. If (3.5) has multiple solutions, only the control (1.6) corresponding to the smallest T drives ζ to ȳ time optimally. Proof. Let T ≥ 0 be the smallest solution of (3.5). If T = 0 we don’t need to drive at all so that T is the optimal time. If T > 0 there exists an admissible control driving from ζ to ȳ, hence the standard existence theorem [2, Theorem 1.2] provides a control ū(t) driving from ζ to ȳ in optimal time T . Since S(t) is a group (Remark 1.1) this control must satisfy the maximum principle (1.4) with a nonzero multiplier z ∈ Z(T ) = E∗ = E. We are in a Hilbert space, which means this control must of the form (1.6) (with ‖z‖ = 1), ū(t) = S(T − t)∗z ‖S(T − t)∗z‖ = S(T − t)∗z (the denominator cannot be zero since ‖S(T − t)∗z‖ = ‖z‖). As in (3.3) we then have∫ T 0 S(T − σ)ū(σ)dσ = T z = ȳ − S(T )ζ hence T is a solution of (3.5) and, as T is the optimal time, we must have T = T. Corollary 3.4. Assume that, either (a) ζ ∈ D(A), ‖Aζ‖ > 1, or (b) ζ /∈ D(A). Then there exists a control of theform (1.6) that drives ζ to ζ in time T > 0, thus is not time optimal. Proof. We write (3.5) for ζ = ȳ as ‖S(t)ζ − ζ‖ t = 1 . (3.7) 7, 3(2005) Sufficiency of the maximum principle for time optimality 33 In case (a) we have lim t→0+ ‖S(t)ζ − ζ‖ t = lim t→0+ ∥∥∥ S(t)ζ − ζ t ∥∥∥ → ‖Aζ‖ > 1 , and we deduce that (3.7) has a positive solution, since the left side tends to 0 as t → ∞. In case (b), lim inf t→0+ ‖S(t)ζ − ζ‖ t = lim inf t→0+ ∥∥∥ S(t)ζ − ζ t ∥∥∥ = ∞ since a finite lim inf implies that ζ ∈ D(A) ([1, Theorem 2.1.2. (c), p.88]). Remark 3.5. Corollary 3.4 has an interesting application. The equivalence S(T )ζ + ∫ T 0 S(T − σ)u(σ)dσ = ȳ ⇐⇒ ∫ T 0 S(T − σ)u(σ)dσ = ȳ − S(T )ζ says that u(t) drives ζ to ȳ in time T ⇐⇒ u(t) drives 0 to ȳ − S(T )ζ in time T. If “drives” is changed to “drives optimally”, the implication =⇒ remains. In fact, if ū(·) does not drive 0 to ȳ −S(T )ζ time optimally then there exists δ > 0 and acontrol u(·) with ‖u(·)‖L∞(0,T−δ;E) ≤ 1 and ∫ T−δ 0 S(T − δ − σ)u(σ)dσ = ȳ − S(T )ζ . Then, if we define v(σ) = { 0 (0 ≤ σ < δ) u(σ − δ) (δ ≤ σ ≤ T ) we have ∫ T 0 S(T − σ)v(σ)dσ = ∫ T−δ 0 S(T − δ − σ)u(σ)dσ = ȳ − S(T )ζ , thus v(t) drives from ζ to ȳ in time T. If this drive were time optimal,the “bang- bang” Theorem 2.2 in [2] would say that ‖v(σ)‖ = 1 a. e., which is not the case since v(σ) = 0 in 0 ≤ σ ≤ δ. Accordingly, the optimal driving time from ζ to ȳ is < T. The implication ⇐= is not true; in the setting of unitary semigroups in Hilbert spaces it suffices to take ȳ = ζ ∈ D(A) with ‖Aζ‖ > 1, and, applying Corollary 3.4 construct a control ū(·) satisfying (1.6) and driving ζ to ζ in time T > 0. The same control drives 0 to ζ − S(T )ζ, but this drive is optimal since the initial condition satisfies (b) in Theorem 2.1. 34 H. O. Fattorini 7, 3(2005) 4 Counterexamples, II. The next example belongs to the family in Example 3.2. Example 4.1. Consider E = IR2, A = [ 0 1 −1 0 ] . (4.1) The semigroup generated by A, S(t) = eAt = [ cos t sin t − sin t cos t ] (4.2) is unitary. In polar coordinates, ζ = (r cos θ, r sin θ),ȳ = (s cos ϕ, s sin ϕ), and ‖S(t)ζ − ȳ‖2 = ∥∥∥∥S(t) [ r cos θ r sin θ ] − [ s cos ϕ s sin ϕ ] ∥∥∥∥2 = (r cos t cos θ + r sin t sin θ − s cos ϕ)2 +(−r cos θ sin t + r cos t sin θ − s sin ϕ)2 = r2 + s2 − 2rs cos(t − θ + ϕ) . We have ‖Ay‖ = ‖y‖. For ζ = ȳ = (1.1, 0) (so that ‖Aζ‖ = ‖Aȳ‖ = 1.1) we have r = s = 1.1,θ = ϕ = 0. Equation (3.5) (Figure 1) has a positive solution T = 1.49797 (4.3) thus we can drive from ζ = (1.1, 0) back to ζ in time T with a control satisfying (1.6), ū(σ) = S(T − σ)z , (4.4) with z given by (3.6), z = (0.68090, 0.73238) . Figure 2 below shows the drive (moving clockwise) whichis obviously not time optimal. 7, 3(2005) Sufficiency of the maximum principle for time optimality 35 For ζ = (4, 0), ȳ = (−4, 0) we have r = s = 4, θ = 0, ϕ = π. Equation (3.5) (Figure 3) has three solutions, T0 = 2.50471 , T1 = 4.26666 , T2 = 7.19061 . (4.5) thus we can drive from (4, 0) to (−4, 0) with three different controls that satisfy (1.6), ūj (σ) = S(Tj − σ)∗zj j = 0 , 1 , 2 , (4.6) where the zj are given by (3.6) for each Tj , z0 = (−0.31308, 0.94972) , z1 = (−0.53333, −0.84590) , z2 = (−0.89882, 0.43830) . Figure 2 shows the three trajectories, each plotted for 0 ≤ t ≤ Tj ; only the first (thicker curve) is time optimal. Remark 4.2. The strong maximum principle (1.4)-(1.5) is a sufficient condition for norm optimality [4] with no conditions on ζ or ȳ so that each of the controls ūj (t), 36 H. O. Fattorini 7, 3(2005) j = 0, 1, 2 in (4.6) is norm optimal in its own interval; this means, if ζ = (4, 0) can be drivento ȳ = (−4, 0) in the interval 0 ≤ t ≤ Tj by means of a control u(t) then ‖u(·)‖L∞(0,Tj ;E) ≥ 1 = ‖ūj (·)‖L∞(0,Tj ;E) . The same observation applies to the control (4.4); it drives ζ back to ζ norm optimally in the interval 0 ≤ t ≤ T. Example 4.3. Example 3.1 can be manipulated into evidence that restrictions on ‖ζ‖ or ‖ȳ‖ such as ‖ζ‖ ≤ � or ȳ ≤ � don’t guarantee sufficiency of the maximum principle for time optimality. To this end we consider thespace E = `2 of all sequences y = {y1, y2, . . . } such that ‖y‖2 = ∑ |yk|2 < ∞, equipped with the norm ‖ · ‖. The operator is Ay = A{yk} = {−nyk} with maximal domain. It generates the semigroup S(t){yk} = {e−ktyk} = S(t)∗ . Let ζn = 1 n {δnk} = ȳn , zn = {δnk} (δnk the Kronecker delta). We have ‖ζn‖ = ‖ȳn‖ = 1 n , ‖Aζn‖ = ‖Aȳn‖ = 1 . If ūn(·) satisfies (1.6) in an interval 0 ≤ t ≤ T with z = zn then ūn(σ) = {δnk} and∫ T 0 S(T − σ)ū(σ)dσ = 1 − e−nT n δnk = ȳn − S(T )ζn for any T > 0. Accordingly, ūn(σ) drives ζn to yn in an arbitrary interval 0 ≤ t ≤ T. The driveis not optimal unless T = 0. Received: April 2004. Revised: May 2004. References [1] P. L. Butzer, H. Berens, Semi-Groups of Operators and Approximation, Springer, Berlin 1967. [2] H. O. Fattorini, Time-optimal control of solutions of operational differential equations, SIAM J. Control 2 (1964) 54-59. 7, 3(2005) Sufficiency of the maximum principle for time optimality 37 [3] H. O. Fattorini, The maximum principle in infinite dimension,Discrete & Continuous Dynamical Systems 6 (2000) 557-574. [4] H. O. Fattorini, Existence of singular extremals and singular functionals in reachable spaces, Jour. Evolution Equations 1 (2001)325-347. [5] H. O. Fattorini, A survey of the time optimal problem and the norm optimal problem in infinite dimension, Cubo Mat. Educacional 3 (2001) 147-169. [6] H. O. Fattorini, Time optimality and the maximum principle in infinite di- mension, Optimization 50 (2001) 361-385.