Nonlinear Programming 20 34
Nonlinear Programming 20 34
The nonlinear
programming problem f
One The nonlinear programming prob-
lem that will concern us has three
The fundamental ingredients: a finite
number of real variables, a finite
Nonlinear number of constraints which the
variables must satisfy, and a func-
Programming tion of the variables which must
Problem, be minimized (or maximized).
Mathematically speaking we can
Preliminary state the problem as follows: Find
specific values (xi, . . . ,xn), if
Concepts, they exist, of the variables
and (£1, • . . ,zn) that will satisfy the
inequality constraints
Notation
the equality constraints
t This refers to the works by Courant, written in 1947, and by Fleming, written in 1965,
as listed in the Bibliography at the back of the book. This system of references will be
used throughout the book with one exception: [Gordan 73] refers to Gordan's paper written
in 1873.
a
Preliminary Concepts and Notations 1.3
3
1.2 Nonlinear Programming
of A. The empty set is the set which contains no elements and is denoted
by 0. We denote a set sometimes by {x,y,z}, if the set is formed by the
elements x, y, z. Sometimes a set is characterized by a property that its
elements must have, in which case we write
[x | x satisfying property P\
For example the set of all nonnegative real numbers can be written as
The set of elements belonging to at least one of the sets of the (finite or
infinite) family of sets (r,),e/ is called the union of the family and is
denoted by U F,. Then
The set of elements belonging to all the sets of the (finite or infinite)
family of sets (F,)ie/, is called the intersection of the family and is denoted
by r! F,. Then
ie/
Two sets F and A are disjoint if they do not intersect, that is, if
r n A = 0.
The difference of the sets A and F is the set of those elements of A
not contained in F and is denoted by A ~ F. We have then
4
Preliminary Concept! and Notations 1.1
then
For example the statement "for each x there exists a y such that 6(x,y) = 1"
can be written as
5
1.8 Nonlinear Programming:
3. Vectors
n-vector
An n-vector or n-dimensional vector x, for any positive integer n,
is an n-tuple (xi, . . . ,xn) of real numbers. The real number x, is
referred to as the ith component or element of the vector x.
Rn
The n-dimensional (real) Euclidean space Rn, for any positive
integer n, is the set of all n-vectors.
The notation x £j Rn means that x is an element of Rn, and hence,
x is an n-vector. Frequently we shall also refer to re as a point in Rn.
R1, or simply R, is then the Euclidean line (the set of all real numbers),
R* is the Euclidean plane (the set of all ordered pairs of real numbers),
and R" = R X R X • • • X R (n times).
Vector addition and multiplication by a real number
Let x,y (E Rn and a G R- The sum x -\- y is defined by
x + y = (xi + 7/1, . . . ,xn + yn)
and the multiplication by a real number ax is defined by
ax = (axi, . . . ,axn)
otherwise they are linearly dependent. (Here and elsewhere 0 denotes the
real number zero or a vector each element of which is zero.)
Linear combination
The vector x £ Rn is a linear combination of x1, . . . , xm G Rn if
x = X1^1 4- • • • + \mxm for some X1, . . . , \m G R
6
Preliminary Concepts and Notations 1.3
Scalar product
The scalar product xy of two vectors x,y E Rn is defined by
xy = xtfji + • • • + xnyn
Norm of a vector
The norm |ja;|| of a vector x E Rn is defined by
Cauchy-Schwarz inequah'ty
Let x,y E Rn. Then
Problem
Establish the fact that Rn is a metric space by showing that
7
1.4 Nonlinear Programming
(triangle inequality)
(Hint: Use the Cauchy-Schwarz inequality to establish the triangle
inequality.)
This definition of angle agrees for n = 2,3 with the one in ana-
lytic geometry. The nonzero vectors x and y are orthogonal if xy = 0
(\l/ = 7T/2); form an acute angle with each other if xy ^ 0 (0 2g ^ f=j ir/2),
a s/nc/ acwte an0/e if xy > 0 (0 ^ ^ < ?r/2), an obtuse angle if xy ^ 0
(7T/2 ^ ^ ^ TT), and a sin'd obtuse angle if xy < 0 (?r/2 < ^ ^ ?r).
4. Matrices
Although our main concern is nonlinear problems, linear systems of the
following type will be encountered very frequently:
AnXj. + • • • + Alnxn - bi
The ith row of the matrix A will be denoted by A» and will be an n-vector.
Hence
The jth column of the matrix A will be denoted by A.> and will be an
w-vector. Hence
Obviously the t'th row of A is equal to the ith column of A', and the jth
column of A is equal to the jth row of A'. Hence
The last equalities of 8 and 9 are to be taken as the definitions of A't and
A'J respectively. Since Ay is the real number in the t'th row of the jth
column of A, then if we define A^t as the real number in thejth row of the
ith column of A', we have
9
1.4 Nonlinear Programming
Here A.j and b are vectors in Rm and x, are real numbers. The represen-
tation 2 can be interpreted as a problem in Rn whereas 11 can be inter-
preted as a problem in Rm. In 2 we are required to find an x £ Rn that
makes the appropriate scalar products 6, (or angles, see 1.3.11} with the
n-vectors Ai} i = 1, . . . , m. In 11, we are given the n + 1 vectors in
Rm> A.j, j'• — 1, . . . , n and b, and we are required to find n weights
xi, . . . , xn such that b is a linear combination of the vectors A.j.
These two dual representations of the same linear system will be used in
interpreting some of the important theorems of the alternative of the next
chapter.
The m X n matrix A of 4 can generate another linear system yA,
defined as follows
then we define the following sub matrices of A (which are matrices with
rows and columns extracted respectively from the rows and columns of A)
10
Preliminary Concepts and Notations 1.8
jth column of
ith row of
and
The range T(X*) of F is the union of the images of all the points of X*,
that is
Function
A function f is a single-valued mapping from a set X into a set Y.
That is for each x £ X, the image set/(x) consists of a single element of F.
The domain of / is X, and we say that / is defined on X. The range of /
is f(X) = U f(x). (For convenience we will write the image of a func-
xex
tion not as a set but as the unique element of that set.)
11
1.& Nonlinear Programming
Numerical function
A numerical function 6 is a function from a set X into #. In other
words a numerical function is a correspondence which associates a real
number with each element x of X.
EXAMPLESS If If
EXAMPLESS X= X R,
= R, then
then is the
d isd the familiar
familiar realsingle-valued
real single-valued func
func
tion of a real variable, such as 0(x) = sin x. If X is the set of positive
integers, then 6 assigns a real number for each positive integer, for exam-
ple 6(x) = l/x\. If X = Rn, then d is the real single-valued function
of n variables.
Vector function
An m-dimensional vector function f is a function from a set X
into Rm. In other words a vector function is a correspondence which
associates a vector from Rm with each element x of Jf. The m compo-
nents of the vector f(x) are denoted by fi(x), . . . , fm(x). Each/< is a
numerical function on X. A vector function / has a certain property (for
example continuity) whenever each of its components/, has that property.
(Conversely, the last two relations could be used to define a linear vector
function on Rn, from which it could be shown that/(a;) = Ax -f 6 [Berge
63, p. 159].)
If m = 1 in the above, then we have a numerical linear function
B on Rn and
12
Preliminary Concepts and Notations 1.6
6. Notation
Vectors and real numbers
In general we shall follow the convention that small Latin letters
will denote vectors such as a, b, c, x, y, z, or vector functions such as/, g, h.
Exceptions will be the letters i, j, k, m, n, and sometimes others, which
will denote integers. Small Greek letters will denote a real number (a
point in R) such as a, ft, 7, £, rj, f, or a numerical function such as 8, </>, \f/.
Subscripts
A small Latin letter with an integer subscript or a small Latin letter
subscript will denote a component of a vector, in general, and on occasion
will denote a vector. For example, if x G Rb, then x$ and zt denote
respectively the third and z'th components of x. On the other hand we
will have occasion to let x\ G Rm, £2 G Rm, etc., in which case this intent
will be made explicit. Small Greek letters with integer or Latin sub-
scripts will occasionally be used to denote real numbers such as Xi, X,. If
x G Rn, K C N = {I, . . . ,n}, and K contains k elements each of
which is distinct, then X^K is a vector in Rk with the components
[xi | i G K-} and is denoted by XK. Thus a small Latin letter with a
capital Latin letter subscript denotes a vector in a space of smaller or
equal dimension to that of the space of the unsubscripted vector.
Superscripts
A small Latin or Greek letter with a superscript or an elevated
symbol will denote a fixed vector or real number, for example x1, xz, x\
x, x, £*, I, etc. Exponentiation on the other hand will be distinguished
by parentheses enclosing the quantity raised to a power, for example (z)2.
Zero
The number 0 will denote either the real number zero or a vector
in Rn all components of which are zero.
Matrices
Matrices will be denoted by capital Latin letters as described in
detail in a previous section, Sec. 1.4.
13
1.6 Nonlinear Programming
Sets
Sets will always be denoted by capital Greek or Latin letters such
as F, A, fl, R, I, X, Y. Capital letters with subscripts, such as TI, T2, I\,
and capital letters with elevated symbols, such as r*, X° will also denote
sets. (See also Sec. 1.2.)
Ordering relations
The following convention for equalities and inequalities will be
used. If x,y £ Rn, then
The set X is called the feasible region, x the minimum solution, and Q(x)
the minimum. All points x in the feasible region X are referred to as
feasible points or simply as feasible.
Another way of writing the same problem which is quite common
in the literature is the following:
subject to
14
Preliminary Concepts and Notations 1.6
We favor the more precise and brief designation 9 of the problem instead
of 10 to 12. Notice that if we let X° = Rn in the above problem, then
we obtain the nonlinear programming problem 1.1.1 to 1.1.3.
If X° = Rn and 6, g, and h are all linear functions on Rn, then
problem 9 becomes a linear programming problem: Find an x, if such
exists, such that
where 6, c, and d are given fixed vectors in Rn, Rm, and Rk respectively,
and A and B are given fixed m X n and k X n matrices respectively.
There exists a vast literature on the subject of linear programming
[Dantzig 63, Gass 64, Hadley 62, Simmonard 66]. It should be remarked
that problem 13 is equivalent to finding an x such that
When B and d are absent from this formulation, 14 becomes the standard
dual form of the linear programming problem [Simmonard 66, p. 95].
16