Thau
Thau
To cite this article: SEKHAR RAGHAVAN & J. KARL HEDRICK (1994): Observer design for a class of nonlinear systems,
International Journal of Control, 59:2, 515-528
This article may be used for research, teaching, and private study purposes. Any substantial or systematic
reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any form to
anyone is expressly forbidden.
The publisher does not give any warranty express or implied or make any representation that the contents
will be complete or accurate or up to date. The accuracy of any instructions, formulae, and drug doses should
be independently verified with primary sources. The publisher shall not be liable for any loss, actions, claims,
proceedings, demand, or costs or damages whatsoever or howsoever caused arising directly or indirectly in
connection with or arising out of the use of this material.
INT. J. CONTROL, 1994, VOL. 59, No.2, 515-528
1. Introduction
The problem of designing nonlinear observers has been studied for a long
time. Much of the effort has resulted in extensions of the linear Luenberger
observer (Kailath 1980): examples of this are the extended Kalman filter,
psuedo linearization techniques etc (see Misawa and Hedrick 1989 for a survey).
These techniques are valid in a small range around the operating point. They
also frequently require heavy real-time computation. Recently, geometric tech-
niques were proposed to build exact observers for a general description of
nonlinear systems (Krener and Isidori 1983, and Keller 1987). However, the
conditions that are required to be satisfied by these observers are extremely
stringent, making the applicable class very small.
This paper represents an effort towards developing viable methods to design
and implement observers for inherently nonlinear systems. We describe a new
computationally-viable design methodology for constructing observers for a class
of nonlinear systems. These systems are driven by nonlinear functions which are
Lipschitz in nature. The underlying theory makes use of the methods developed
for the quadratic stabilization of uncertain systems (Khargonekar et al. 1990,
Peterson 1987 and Peterson and Hollot, 1986). In particular, we develop dual
results applicable to the observer problem at hand.
In this case, y is said to be the Lipschitz constant of [, Here, and in the rest of
this paper, /·1 denotes euclidean norms. 0
If [ is a differentiable function with bounded partial derivatives, then y is
simply the upper bound on the norm of the jacobian matrix for the function, the
upper bound taken over the entire R", However, in general, a Lipschitz function
may not be differentiable.
The systems we consider have the form given by
x = Ax + g(t, U, y) + [(t, u, x)
(2)
y = Cx
where the state x E R"; A E R n x n . y E Rq represents the measurements of the
Downloaded by [North Carolina State University] at 00:58 25 March 2013
system, and the input U E R'", The matrices A and C are constants, and, g:
R x R'" x Rq ~ R"; [: R x R'" x R" ~ R", The nonlinearity [(',', x) is
assumed to be globally Lipschitz in x with a Lipschitz constant y. i.e.
I[(t, U, x) - [(t, u, x)1 < ylx - xl. for all U E R'", t E R (3)
We also assume that the pair (A, C) is detectable.
The assumption that [ is Lipschitz globally may be relaxed to assume that [
is only locally Lipschitz. All the results in the ensuing sections will then be valid
in some local neighbourhood around a nominal point. (Hereafter, when we say
that [ is Lipschitz, we will mean that the inequality (3) is satisfied.)
Systems of type (2) are not uncommon: mechanical systems frequently
contain Lipschitz-type nonlinearities-trignometric nonlinearities which occur in
robotic applications, a nonlinear softening spring etc. Nonlinearities which are
square or cubic in nature are not globally Lipschitz; however, they are locally
so. Moreover, when such functions occur in physical systems, they frequently
have a saturation in their growth rate, making them globally Lipschitz functions.
Frequently, we make the system measurements part of the system state, so
the assumption that the output is linear in the state is justified. The assumption
that the system dynamics are linear in the input is usually true for mechanical
systems because the input is usually a torque or a force which enters the
dynamics linearly due to the form of Newton's laws. In summary, while the class
of systems we are addressing is not exhaustive, it is fairly large from an
engineering point-of-view.
3. A background result
Consider the system given by (2). For this system, we construct an observer
as follows:
£ = Ax + g(t, U, y) + [(t, u, x) + L(y - Cx) (4)
where the observed state is denoted by X, and L E R'":" is the observer gain
matrix. Then, the error in the state estimate, x := x - X, has the following
dynamics:
t = (A - LC)x + [(t, u, x) - [(t, u, x) (5)
The design problem is to choose the observer gain L so that (5) is made stable.
The following result which establishes a condition in this respect was established
by Thau (1973).
Observer design for class of nonlinear systems 517
Theorem 1 (Thau 1973): Given the system given by (2) and its observer (4). If
Y < Amin(Q)/2Am.x(P), where P and Q are positive definite, symmetric matrices
satisfying the Lyapunov equation
(A - LC)Tp + P(A - LC) = -Q (6)
Then limr-oo x = O.
Thus, Theorem 1 provides a procedure to check the stability of the error
dynamics after having picked L. However, it does not suggest a method to
design a stable observer. Stated otherwise, given the Lipschitz constant of f to
be y, it is not apparent how one chooses the observer-gain matrix L so that we
satisfy y < Amin(Q)/2Ama.{P).
Downloaded by [North Carolina State University] at 00:58 25 March 2013
A = [~ ~J (7)
if and only if there exists another positive definite, symmetric matrix Pz such that
(9)
Proof of Lemma 1: We will show the necessity. The sufficiency part is similar.
Suppose that
ATp[ + PIA + rPIPI + 1<0
is satisfied. Now, define Pz := (Plr)-I. Obviously, Pz is a positive-definite and
symmetric matrix. Now, substituting for PI in the above inequality as
PI := (pzr)-I, we have
AT(Pzr)-1 + (Pzr)-IA + yZ(pzr)-I(pzr)-1 + 1<0
Downloaded by [North Carolina State University] at 00:58 25 March 2013
Multiplying the above inequality on the left and right by (y Pz)* and y Pz
respectively (where * denotes complex conjugate transpose), and noting that Pz
is a real matrix, we get
PzAT + APz + I + rPzPz < 0
which is the same as (9). This proves the necessity. Proving sufficiency follows
similar steps. 0
Lemma 2: Given the system (2) and its observer (4). If P is any positive definite
and symmetric matrix, then
2x T P(f(t, u, x) - f(t, u, x» '"
rx T PPX + xTx (10)
Theorem 2: Given the system (2) and its observer (4). If there exists an e > 0
such that the following Algebraic Riccati Equation (ARE) has a symmetric,
positive definite solution P
then, the observer gain L = (1/2e) PC T stabilizes the error dynamics, given (5) for
all f with a Lipschitz constant y.
Observer design for class of nonlinear systems 519
Proof of Theorem 2: Suppose an e > 0 that solves the Riccati equation is found.
Then substituting L = (l/2e)PC T in (11) gives that
(A - LC)P + P(A - LC)T + Y PP + I + eI =0
Since e > 0, it is clear that
(A - LC)P + P(A - LC? + YPp + I <0 (12)
where the inequality above denotes that the matrix sum on the left-hand side is
negative definite. From this, the following is true:
(A - LC)P + P(A - LC? < 0
Then (A - LC)T and, consequently, (A - LC) are stable matrices. Using
Downloaded by [North Carolina State University] at 00:58 25 March 2013
Lemma 1, (12) implies that there exists a positive definite and symmetric matrix
PI satisfying the inequality
(A - LC)TPI + PI(A - LC) + yPIPI + 1<0 (13)
Now, consider the Lyapunov function, V = x PIX. Differentiating V with
T
respect to time along the trajectories of the error dynamics (5), we get
V = xT«A - LC)T PI + PI(A - LC»x + 2x T Pj(f(t, u, x) - f(t, u, x»
Using Lemma 2, we see that
V"" xT«A - LC)TPI + PI(A - LC»x + YXTPIPIX + xTx
From (13), V is clearly negative definite. Hence, x -> 0 as t -> 00. 0
Theorem 2 thus establishes a sufficient condition for stabilizing the error
dynamics (5). We note here that the resulting design is an H observer design, 00
Theorem 3: Given the system (2) and its observer (4). If there exists an e > 0,
and a positive-definite and symmetric matrix Q such that the ARE
and
f(t, u, x) = [1'5Si~(Xl)J
Let us now consider an observer of the type (4) with the gain matrix as
L =DJ
Here, we have not followed any particular method to select L, but simply made
(A - LC) a stable matrix. Indeed, the error dynamics are of the form (5), with
A - LC = [-~ -n
Downloaded by [North Carolina State University] at 00:58 25 March 2013
Notice that the Lipschitz constant Y = 1·5. In order to prove the stability of
the resulting error dynamics, we can check if the condition imposed by Theorem
1 holds; i.e. whether 1/2YmaxCP) > Y where P is solved from
(A - LC)Tp + peA - LC) = -I
It is seen that y = 0·2. Further, when the Lyapunov equation is solved for the
transformed (A - LC) matrix, we get Amax(P) = 1·47. It is immediate, then, to
verify that the stability condition holds in the transformed coordinates.
Thus, we see that the designed observer gives stable error dynamics for the
system at hand. The fact that the stability condition was not verified earlier
merely implies that there is at least one function f( t, U, x) (with a non-zero entry
in its first channel) having a Lipschitz constant y = 1·5, for which the stability of
its error dynamics cannot be established using the Lyapunov function candidate,
iT Pi. Thus, a straightforward calculation does not take into account the special
structure of the nonlinearity under consideration. The transformation used to
prove the stability takes advantage of the fact that one of the channels of f is
zero. If this were not true, the state-transformation may not have resulted in a
much lower value of Y (as it did in this case). Of course, for stability, not only
does the value of y matter but also how exactly the matrix A gets transformed,
in the sense that the structure of the transformed matrix manifests itself in the
form of the new Lyapunov solution P.
522 S. Raghavan and f. K. Hedrick
robot's performance range. With this in view, recent literature has focused on
the control of flexible joint robots, e.g. Spong (1987). In this paper, Spong
proposed the use of an I/O linearizing control law for these robots. This control
however, uses information on all states-which for a single-link flexible-joint
robot are the joint position and velocity, and the link position and velocity. For
physical reasons, while one can easily measure the motor position and velocity,
the measurement of the other states are not trivial. In the present paper, we
present a nonlinear observer that estimates the link position and velocity based
on the inputs of motor position and velocity.
Figure 1 shows the schematic of a laboratory model of a single-link flexible
joint robot. In the figure, f m represents the inertia of the actuator, a dc motor,
and f l represents the inertia of the controlled link. em and e, are angular
rotations of the motor and the link respectively, and, W m and CO) are their
angular velocities. In general, these will be different from each other as
functions of time, due to the torsional compliance, k.
A state-space description of this system is given next.
8m = Wm
The length of the link is given by 2b. B represents the viscous friction in the
motor bearing and the back-e.m.f. effects. The following are the simulation
parameters used in the simulations. They are representative of a laboratory
model that can be used to model a flexible-joint robot.
The system model can be represented by the following equations.
i = Ax + f(x) + g(y)u (14)
with the following values for the matrices.
o
[ -4~'6 ~]. [2~'6]
1
-1·25 48·6
A = 0 o o 1 ' g = 0
o o
Downloaded by [North Carolina State University] at 00:58 25 March 2013
19·5 -19·5 0
and
f ~ ]
=[ -3'33~in(O,)
System (14) is clearly in the form described by (2): y = Xl, and z = Cx, with
C = [b ~ ~ ~J
An attempt to solve the Riccati equation in Theorem 2 using Algorithm 1 to
obtain the observer gain in original coordinates was unsuccessful. However,
noticing that f has a zero entry in three of its channels, we take advantage of
this fact using a transformation of coordinates z = Tx, where
T = diag[l, 1,4,0·1]
Under this transformation, the new Lipschitz constant is ji = 0·083. Algorithm 1
was then employed to solve for the observer-gain [ in the transformed
coordinates. L in the original coordinates was then calculated to be L = r : L.
0.91
0·38 0,38]
6·71
L = 0.89 (15)
[ 1·45
1·02 4·16
The structure of the observer is given by
~ = Ax + g(u) + f(8,) + Ly (16)
where,
and, x = [Om' W m, OJ, W)]T represents the state-vector of the robot. The perform-
ance of the observer is shown in Figs 2 and 3, where the convergence of
estimation errors in 8, and ~ are seen. The input was such that the link position
tracks a sinusoidal trajectory.
A:=(::t=o; c:=(:;L=o
and
B := g(O)
Then the given system can be expanded as
i = Ax + Bu + fleX) + gl(X)U}
(19)
y = Cx + hl(x)
2~'':
...
o :
o I 2 3 4 '5 6
Timc(sec::)
Figure 2. Observer performance: link position (solid curve), and its estimate (broken curve)
in radians.
l:~://
·10 : :'
:::v
0 1 2 3 4 5 6
Time(sec)
Figure 3. Observer performance: link velocity (solid curve), and its estimate (broken curve)
rad s':".
Observer design for class of nonlinear systems 525
One can now find the highest value of y for which an observer design is
possible for the following system.
x = Ax + Bu + h(X)}
(20)
y = Cx
This can be done by iterating with higher and higher values of y in Algorithm 1.
The highest possible value of y directly determines Q. Then, the observer for
the original nonlinear system is implemented as follows.
i = (A - LC)x + Bu + hex) + gl(x)u + Ly (21)
As mentioned before, the observer so designed will be approximate, even
locally, since we have neglected the higher-order terms of g and h. But we can
expect it to be better than a design that neglects all the higher-order terms, and
uses only the linearized version of the system.
9. Conclusions
This paper described a viable computational procedure to design observers
for a class of nonlinear systems. These systems are characterized by nonlinear
functions which are Lipschitz in nature. Nonlinear observer design is still a field
in its infancy, and we hope this paper represents a fruitful step forward. The
suggested methodology is based on the dual of the results from the theory of
stabilization of uncertain systems. We have applied the present methodology to
the design of an observer for the single-link flexible joint robot. This observer
estimates the link variables based on the estimates of the joint variables. We
also offer some insights into exploiting the special structure of the given
nonlinear systems through state transformations.
On some relevant literature, the problem of designing observers for systems
with Lipschitz nonlinearities was first considered by Thau (1973). Related work
can be found in the work of Kou et at. (1975), and Zak (1990). Another paper
by the present authors (Raghavan and Hedrick 1992) discusses the efficient use
of nonlinear observers in designing compensators.
ACKNOWLEDGMENTS
The authors are very grateful to Professor Andy Packard for his invaluable
help and advice. Thanks are also due to the anonymous referee whose
comments helped provide a shorter proof for Lemma 1.
526 S. Raghavan and J. K. Hedrick
Appendix
Proof of Theorem 3: Sufficiency was proved in Theorem 2. We will prove the
necessity here. The proof given below uses the methodology given by Khargone-
kar et af. (1990) and Peterson (1987) for the case of quadratic stabilization of
uncertain systems.
Suppose the error dynamics are quadratically stabilizable by some L E R"?",
For this choice of L, there exists a positive definite matrix P such that (with I
defined as in the Proof of Lemma 2)
xT«A - LC)T P + P(A - LC»x + 2x T e] < 0
for all x'" 0 and for all f obeying (3.3) i.e.
Downloaded by [North Carolina State University] at 00:58 25 March 2013
xT«A - LC)T P + P(A - LC»x < - max {(2x T ph ITI < Ilx!} ,,;;; 0
It follows that
(xT«A - LC)T P + P(A - LC»X)2 > 4 max {(x T pJ)2: ITI < Ilx!}
Then, from Lemma A.l (see below), we get that
(xT«A - LC)T P + P(A - LC»x)2 - 4(y2xT PPX)(xTx) > 0
By Lemma A.2, this means there exists a A> 0 such that
XT(A2y2PP + A«A - LC)T P + P(A - LC» + I)x < 0
Calling P := AP, we have that
xT«(A - LC)Tp + P(A - LC» + Ipp + I)x < 0
Then, by Lemma 1, there exists another positive definite matrix PI such that
xT«(A - LC)P1 + P\(A - LC)T) + Ip\p\ + I)x < 0
i.e.
xT(AP\ + PIAT)x - 2x T LCP\x + IxTp\p\x + xTx < 0
for all x'" O. Thus, we have
xT(AP\ + PIAT)x + IxTp\p\x + xTx < 0
for all x'" 0 and such that CPlx = O. It follows from Lemma A.3 that there
exists a a* > 0 such that
Q* := a*PICTCP\ - API - P\A T - Ip\P1 - I
is positive definite. This implies that
API + P\A T + P\(II - a*CTC)p\ + I + Q* = 0
Calling f* = l/a* gives the required ARE. Also, notice that when o" is replaced
by a> a*, the ARE holds for a different Q > Q*. Calling e = 1/a then proves
the Theorem. 0
LemmaA.l: Given the system (2)-(3) and its observer (4), let 1:= f(t, u, x)-
f(t, 1I, x). If P is a positive definite and symmetric matrix, then
max {(x T PJ)2: FI < Ilxl} = (y2xT PPX)· (xTx)
Observer design for class of nonlinear systems 527
Now, choose vectors V2, .•. , V n and '72, ... , IJn so that
{VI, ... , vn} and {IJI,.··, IJn}
Downloaded by [North Carolina State University] at 00:58 25 March 2013
Therefore,
(x Tpj)2 = (v TyVWTx)2
= (VT YVbL)2
Ivi
= r(vTV)(1/T1/)
= r(x T PPX)(xTx)
REFERENCES
KAILATH,T., 1980, Linear Systems (Englewood Cliffs, NJ: Prentice Hall).
KELLER, H., 1987, Nonlinear observer design by transformations into a generalized observer
canonical form. International Journal of Control, 46, 1915-1930.
528 Observer design for class of nonlinear systems
KHAROONEKAR, P. P., PETERSON, I. R., and ZHOW, K., 1990, Robust stabilization of uncertain
linear systems: quadratic stabilizability and H~ control theory. IEEE Transactions on
Automatic Control, 35, 356-361.
KRENER, A. J., and ISIDORI, A., 1983, Linearization by output injection and non-linear
observers. Systems and Control Letters, 3, 47-52.
Kou, S. R., ELLIOT, D. L., and TARN, T. J., 1975, Exponential observers for nonlinear
dynamic systems. Information and Control, 29, 204-216.
M,SAWA, E. A., and HEDRICK, J. K., 1989, Nonlinear observers: A state-of-the-art survey.
ASME Journal of Dynamic Systems and Measurement, 11,344-352.
NAGPAL, K. M., and KHARODNEKAR, P. P., 1991, Filtering and smoothing in an H oo selling.
IEEE Transactions on Automatic Control, 36, 152-166.
PETERSON, I. R., 1987, A stabilization algorithm for a class of uncertain linear systems.
Systems and Control Letters, 7, 351-357.
PETERSON, I. H., and HOLLOT, C. Y., 1986, A Riccati equation approach to the stabilization
Downloaded by [North Carolina State University] at 00:58 25 March 2013