136x Filetype PDF File size 0.39 MB Source: ww3.math.ucla.edu
Iterative Solution of Algebraic Riccati Equations for Damped Systems Kirsten Morris and Carmeliza Navasca Abstract—Algebraic Riccati equations (ARE) of large dimen- common iterative methods are Chandrasekhar [3], [9] and sion arise when using approximations to design controllers for Newton-Kleinman iterations [14], [17], [28]. Here we use a systems modelled by partial differential equations. We use a modification to the Newton-Kleinman method first proposed modified Newton method to solve the ARE. Since the modified by by Banks and Ito [3] as a refinement for a partial solution Newton method leads to a right-hand side of rank equal to to the Chandraskehar equation. the number of inputs, regardless of the weights, the resulting Lyapunov equation can be more efficiently solved. A low-rank Solution of the Lyapunov equation is a key step in im- Cholesky-ADI algorithm is used to solve the Lyapunov equation plementing either modified or standard Newton-Kleinman. resulting at each step. The algorithm is straightforward to code. The Lyapunov equations arising in the Newton-Kleinman Performance is illustrated with an example of a beam, with method have several special features: (1) the model order different levels of damping. Results indicate that for weakly n is generally much larger than the number of controls m damped problems a low rank solution to the ARE may not exist. Further analysis supports this point. or number of observations p and (2) the matrices are often INTRODUCTION sparse. We use a recently developed method [20], [26] that uses these features, leading to an efficient algorithm. Use Aclassical controller design objective is to find a control of this Lyapunov solver with a Newton-Kleinman method is u(t) so that the objective function described in [27], [6], [7], [8]. ! ∞!x(t),Qx(t)"+u∗(t)Ru(t)dt (1) In the next section we describe the algorithm to solve the Lyapunov equations. We use this Lyapunov solver with both 0 standard and modified Newton-Kleinman to solve a number is minimized where R is a symmetric positive definite matrix of standard control examples, including one with several and Q is symmetric positive semi-definite. It is well-known space variables. Our results indicate that modified Newton- that the solution to this problem is found by solving an Kleinman achieves considerable savings in computation time algebraic Riccati equation over standard Newton-Kleinman. We also found that using A∗P +PA−PBR−1B∗P =−Q. (2) the solution from a lower-order approximation as an ansatz for a higher-order approximation significantly reduced the for a feedback operator computation time. K=−R−1B′P. (3) I. DESCRIPTION OF ALGORITHM If the system is modelled by partial differential or delay One approach to solving large Riccati equations is the differential equations then the state x(t) lies in an infinite- Newton-Kleinman method [17]. The Riccati equation (2) can dimensional space. The theoretical solution to this problem be rewritten as for many infinite-dimensional systems parallels the theory for ∗ ∗ finite-dimensional systems [11], [12], [18], [19]. In practice, (A−BK) P +P(A−BK)=−Q−K RK. (4) the control is calculated through approximation. The matrices We say a matrix A is Hurwitz if σ(A ) ⊂ C . A, B, and C arise in a finite dimensional approximation of o o − Theorem 1.1: [17] Consider a stabilizable pair (A,B) the infinite dimensional system. Let n indicate the order of with a feedback K0 so that A − BK0 is Hurwitz. Define the approximation, m the number of control inputs and p S =A−BK,andsolve the Lyapunov equation i i the number of observations. Thus, A ∈ Rn×n, B ∈ Rn×m S∗P +P S =−Q−K∗RK (5) and C ∈ Rp×n. There have been many papers written i i i i i i describing conditions under which approximations lead to for P and then update the feedback as K =−R−1B∗P. approximating controls that converge to the control for the i i+1 i Then original infinite-dimensional system [4], [12], [15], [18], lim P = P [19]. In this paper we will assume that an approximation i→∞ i has been chosen so that a solution to the Riccati equation (2) with quadratic convergence. exists for sufficiently large n and also that the approximating The key assumption in the above theorem is that an initial feedback operators converge. Unfortunately, particularly in estimate, or ansatz, K , such that A − BK is Hurwitz 0 0 partial differential equation models with more than one space is available. For an arbitrary large Riccati equation, this dimension, many infinite-dimensional control problems lead condition may be difficult to satisfy. However, this condition to Riccati equations of large order. A survey of currently is not restrictive for Riccati equations arising in control of used methods to solve large ARE can be found in [8]. Two infinite-dimensional systems. First, many of these systems are stable even when uncontrolled and so the initial iterate TABLE I K may be chosen as zero. Second, if the approximation COMPLEXITY OF CF-ADI AND ADI [20] 0 procedure is valid then convergence of the feedback gains is obtained with increasing model order. Thus, a gain obtained CF-ADI ADI 2 Sparse A O(Jrn) O(Jrn ) from a lower order approximation, perhaps using a direct 2 3 method, may be used as an ansatz for a higher order Full A O(Jrn ) O(Jrn ) approximation. This technique was used successfully in [14], [24], [28]. TABLE II Amodification to this method was proposed by Banks and CHOLESKY-ADI METHOD Ito [3]. In that paper, they partially solve the Chandrasekhar Given A and D equations and then use the resulting feedback K as a o Choose ADI parameters {p ,...,p } with ℜ(p ) < 0 √ 1 J i ∗ −1 stabilizing initial guess. Instead of the standard Newton- Define z = −2p (A +p I) D 1 1 o 1 and Z =[z ] Kleinman iteration (5) above, Banks and Ito rewrote the 1 1 For i = 2,...,J Riccati equation in the form √−2pi+1 ∗ −1 Define W =( √ )[I − (p −p)(A +p I) ] i −2pi i+1 i o i+1 ∗ ∗ (1) z = W z (A−BK) X +X(A−BK) = −D RD, i i i−1 i i i i i i (2) If $z$ > tol Di = Ki −Ki−1, Ki+1 = Ki−B∗Xi. Z =[Z z ] i i−1 i Solution of a Lyapunov equation is a key step in each Else, stop. iteration of the Newton-Kleinman method, both standard and modified. Consider the Lyapunov equation XAo+A∗X=−DD∗ (6) symmetric A the optimal parameters can be easily calculated. o A heuristic procedure to calculate the parameters in the where Ao ∈ Rn×n is Hurwitz and D ∈ Rn×r. Factor general case is in [26]. If the spectrum of A is complex, Q into C∗C and let nQ indicate the rank of C. In the we estimated the complex ADI parameters as in [10]. As case of standard Newton-Kleinman, r = m + nQ while for the spectrum of A flattens to the real axis the ADI parameters modified Newton-Kleinman, r = m. If Ao is Hurwitz, then are closer to optimal. the Lyapunov equation has a symmetric positive semidefinite If A is sparse, then sparse arithmetic can be used in solution X. As for the Riccati equation, direct methods such o as Bartels-Stewart [5] are only appropriate for low model calculation of Xi. However, full calculation of the dense order and do not take advantage of sparsity in the matrices. iterates Xi is required at each step. By setting X0 = 0, In [3] Smith’s method was used to solve the Lyapunov it can be easily shown that Xi is symmetric and positive −1 semidefinite for all i, and so we can write X = ZZ∗ equations. For p < 0, define U = (A − pI)(A + pI) o o where Z is a Cholesky factor of X [20], [26]. (A Cholesky ∗ −1 ∗ −1 and V = −2p(Ao+pI) DD (Ao+pI) . The Lyapunov factor does not need to be square or be lower triangular.) equation (6) can be rewritten as Substituting Z Z∗ for X , setting X = 0, and defining i i i 0 ∗ ∗ −1 X=U XU+V. M =(A +pI) weobtain the following iterates i o i In Smith’s method [30], the solution X is found by using Z = "−2p M D successive substitutions: X = limi→∞Xi where 1 " 1 1 Z = [ −2pMD, M(A∗−pI)Z ]. Xi = U∗Xi−1U +V (7) i i i i o i i−1 with X0 = 0. Convergence of the iterations can be improved Note that Z ∈ Rn×r, Z ∈ Rn×2r, and Z ∈ Rn×ir. by careful choice of the parameter p e.g. [29, pg. 297]. 1 2 i This method of successive substitition is unconditionally The algorithm is stopped when the Cholesky iterations convergent, but has only linear convergence. converge within some tolerance. In [20] these iterates are The ADI method [21], [32] improves Smith’s method by reformulated in a more efficient form, using the observation using a different parameter p at each step. Two alternating that the order in which the ADI parameters are used is i irrelevant. This leads to the algorithm shown in Table II. linear systems, This solution method results in considerable savings in (A∗ +piI)Xi−1 = −DD∗−Xi−1(Ao−piI) computation time and memory. In calculation of the feedback o 2 ∗ ∗ ∗ ∗ K, the full solution X never needs to be constructed. Also, (A +piI)X = −DD −X 1(Ao−piI) o i i−2 the complexity of this method (CF-ADI) is also considerably are solved recursively starting with X0 = 0 ∈ Rn×n and less than that of standard ADI as shown in Table I. Recall parameters pi < 0. If all parameters pi = p then the above that for standard Newton-Kleinman, r is the sum of the rank equations reduce to Smith’s method. If the ADI parameters of the state weight nQ and controls m. For modified Newton- pi are chosen appropriately, then convergence is obtained Kleinman r is equal to m. The reduction in complexity in J iterations where J ≪ n. The parameter selection of CF-ADI method over ADI is marked for the modified problem has been studied extensively [10], [21], [31]. For Newton-Kleinman method. 10 2 2 2 E 2.68 ×10 N/m where M = EI d2φ+C I d2ψ ∈H2(0,1). I 1.64 ×10−9 m4 bdr d bdr b We use R = 1 and define C by the tip position: ρ 1.02087 kg/m 2 Cv 2 Ns/m Cd 2.5 ×108 Ns/m2 w(1,t) = C[w(x,t) w˙(x,t)]. L 1 m I 121.9748 kg m2 Then Q = C∗C. We also solved the control problem with h d .041 kg−1 Q=I. TABLE III Let H2N ⊂ H be a sequence of finite-dimensional TABLE OF PHYSICAL PARAMETERS. subspaces spanned by the standard cubic B-splines with a uniform partition of [0,1] into N subintervals. This yields an approximation in H2N × H2N [16, e.g.] of dimension II. CONTROL OF EULER-BERNOULLI BEAM n = 4N. This approximation method yields a sequence of solutions to the algebraic Riccati equation that converge In this section, we consider a Euler-Bernoulli beam strongly to the solution to the infinite-dimensonal Riccati clamped at one end (r = 0) and free to vibrate at the other equation corresponding to the original partial differential end (r = 1). Let w(r,t) denote the deflection of the beam equation description [4], [22]. from its rigid body motion at time t and position r. The deflection is controlled by applying a torque u(t) at the III. LOW RANK APPROXIMATIONS TO LYAPUNOV clamped end (r = 0). We assume that the hub inertia I FUNCTION ¨ h is much larger than the beam inertia I so that I θ ≈ u(t). b h Tables IV,V show the effect of varying C on the number Thepartial differential equation model with Kelvin-Voigt and d of iterations required for convergence. Larger values of C viscous damping is d (i.e. smaller values of α) leads to a decreasing number of iterations. Small values of C lead to a large number of ρw (r,t)+C w (r,t)+... d tt v t required iterations in each solution of a Lyapunov equation. ∂2 ρr [C I w (x,t) +EI w (r,t)] = u(t), Recall that the CF-ADI algorithm used here starts with a ∂r2 d b rrt b rr I h rank 1 initial estimate of the Cholesky factor and the rank of with boundary conditions the solution is increased at each step. The efficiency of the w(0,t) = 0 Cholesky-ADI method relies on the existence of a low-rank w (1,t) = 0. approximation of the solution X to the Lyapunov equation. r This is true of many other iterative algorithms to solve large EI w (1,t)+C I w (1,t) = 0 b rr d b rrt Lyapunov equations. ∂ [EI w (r,t)+C I w (r,t)] = 0. Theorem 3.1: For any symmetric, positive semi-definite ∂r b rr d b rrt r=1 matrix X of rank n let µ ≥ µ ... ≥ µ be its eigenvalues. 1 2 n The values of the physical parameters in Table II are as in For any rank k < n matrix Xk, [2]. *X−X * µ Let k 2 ≥ k+1. # $ *X* µ dw 2 1 H= w∈H2(0,1):w(0)= (0) = 0 Proof: See, for example, [13, Thm. 2.5.3]. ! dr Thus, the relative size of the largest and smallest eigen- be the closed linear subspace of the Sobolev space H2(0,1) values determines the existence of a low rank approximation 2 Xk that is close to X, regardless of how this approximation and define the state-space to be X = H×L (0,1) with state is obtained. z(t) = (w(·,t), ∂ w(·,t)). A state-space formulation of the ∂t The ratio µ /µ is plotted for several values of C in above partial differential equation problem is k+1 1 d Figure 1. For larger values of C the solution X is closer to d d a low rank matrix than it is for smaller values of C . x(t) = Ax(t)+Bu(t), d dt A bound on the rank of the solution to a Lyapunov where equation where A is symmetric is given in [25]. A tighter 0 I bound on the error in a low-rank approximation has been A= obtained [1] in terms of the eigenvalues and eigenvectors of EI 4 C I 4 C A. This bound is applicable to all diagonalizable matrices − bd − d b d − v ρ dr4 ρ dr4 ρ A. The bound for the case where the right-hand-side D has and 0 rank one is as follows. B= Theorem 3.2: [1, Thm 3.1] Let V be the matrix of right r eigenvectors of A, and denote its condition number by κ(X). Ih Denote the eigenvalues of A by λ1,...λn. There is a rank with domain k < n approximation to X satisfying d 2 2 dom(A)={(φ,ψ)∈X :ψ∈H, M(L)= M(L)=0} *X−X * ≤(n−k) δ (κ(V)*D* ) (8) dr k 2 k+1 2 TABLE IV where BEAM: : EFFECT OF CHANGING C (STANDARD NEWTON-KLEINMAN) * * d k 2 −1 )*λk+1−λj* δk+1 = * ∗ * . Cv Cd α N.-K. It’s Lyapunov Itn’s CPU time 2Real(λk+1) j=1*λk+1 +λj* 2 1E04 1.5699 – – – 3E05 1.5661 – – – The eigenvalues λi are ordered so that δi are decreasing. 4E05 1.5654 3 1620;1620;1620 63.14 In order to calculate this bound, all the eigenvalues and 1E07 1.5370 3 1316;1316;1316 42.91 eigenvectors of A are needed. 1E08 1.4852 3 744;744;744 18.01 If the condition number κ(V ) is not too large, for instance 5E08 1.3102 3 301;301;301 5.32 if A is normal, then δk+1/δ1 gives a relative error estimate of the error µ /µ in the approximation X . This estimate TABLE V k+1 1 k BEAM: EFFECT OF CHANGING C (MODIFIED NEWTON-KLEINMAN) is observed in [1] to estimate the actual decay rate of the d eigenvalues of X quite well, even in cases where the decay C C α N.-K. It’s Lyapunov It’s CPU time is very slow. v d 2 1E04 1.5699 2 – – Consider a parabolic partial differential equation, with A- 3E05 1.5661 2 – – matrix of the approximation is symmetric and negative defi- 4E05 1.5654 2 1620;1 24.83 nite. Then all the eigenvalues are real. A routine calculation 1E07 1.5370 2 1316;1 16.79 yields that 1E08 1.4852 2 744;1 7.49 5E08 1.3102 2 301;1 2.32 δk+1 ≈ λ1 δ1 λk and so the rate of growth of the eigenvalues determines the [23]. A simple calculation yields that for all k, accuracy of the low rank approximation. The accuracy of the low-rank approximant can be quite different if A is non- k−1 symmetric. The solution X to the Lyapunov equation could δk = ) 1 2 ≈1. 4c δ1 1+ v 2 be, for instance, the identity matrix [1], [25] in which case j=1 (wk−wj) the eigenvalues of X do not decay. This indicates that the eigenvalues of the Lyapunov solution It is observed through several numerical examples in [1] do not decay. that it is not just complex eigenvalues that cause the decay rate to slow down, but the dominance of the imaginary parts IV. CONCLUSIONS of the eigenvalues as compared to the real parts, together with The results indicate a problem with solving the algebraic absence of clustering in the spectrum of A. This effect was Riccati equation for systems with light damping where the observed in the beam equation. Figure 2 shows the change eigenvalues are not decaying rapidly. Although better choice in the spectrum of A as C is varied. Essentially, increasing d of the ADI parameters might help convergence, the fact C increases the angle that the spectrum makes with the d that the low rank approximations to the solution converge imaginary axis. Note that the eigenvalues do not cluster. As very slowly when the damping is small limits the achievable the damping is decreased, the dominance of the imaginary improvement. This may have consequences for control of parts of the eigenvalues of A increases and the decay rate of coupled acoustic-structure problems where the spectra are the eigenvalues of X slows. The Cholesky-ADI calculations closer to those of hyperbolic systems than those of parabolic for the beam with C = 0 did not terminate on a solution d systems. Xk with k < n. These observations are supported by results in the theory of control of partial differential equations. If C > 0 the d -./0123450+637.8+98:+;<=)+>.7?+23:@.1/+A B ! + semigroup for the original partial differential equation is #! parabolic and the solution to the Riccati equation converges !#! uniformly in operator norm [18, chap.4]. However, if C = 0, #! d !$! the partial differential equation is hyperbolic and only strong #! convergence of the solution is obtained [19]. Thus, one might !%! #! expect a greater number of iterations in the Lyapunov loop # ! !&! +,+*#! to be required as C is decreased. Consider the bound (8) ! d !"! for the beam with C = 0, and an eigenfunction basis for #! d the approximation so that the eigenvalues of A are the exact !)! 2 #! eigenvalues. Defining c = C /ρ, c = EI /ρandindicating ' v v b "C#! !(! ' the roots of 1 + cos(a)cosh(a) by a , the eigenvalues are #! #C#! k ( #C#! !'!+ λ =−c /2±iω #! k v k ! " #! #" $! $" %! %" &! * where + 2 2 Fig. 1. Eigenvalue ratio for solution to beam ARE for varying C ωk = ca −c /4 d k v
no reviews yet
Please Login to review.