
to illustrate the applicability of the proposed algorithm. In
Section 5, some conclusions are presented.
Notation. We denote the collections of non-negative inte-
gers, positive integers and real numbers by Z,Z+and R.
Rn×mrepresents the collection of all n×mreal matrices. Rn
is the n-dimensional Euclidean space and | · | denotes its Eu-
clidean norm for vector or matrix of proper size. Zero matrix
(or vector) with appropriate dimension is denoted by O. We
use diag{v}to denote a square diagonal matrix whose main
diagonal is the elements of vector v. The sets of all symmetric
matrices, positive definite matrices and semipositive definite
matrices in Rn×nare represented by Sn,Sn
++ and Sn
+, respec-
tively. w(·)is a one-dimensional standard Brownian motion
defined on a filtered probability space (Ω,F,{Ft}t>0,P) that
satisfies usual conditions. Moreover, we use ⊗to denote the
Kronecker product and for any matrix B∈Rm×n,vec(B)
denotes a vectorization map from the matrix Binto a column
vector of proper size, which stacks the columns of Bon top
of one another, that is, vec(B) = [bT
1, bT
2,· · · , bT
n]T, where
bj∈Rn,j= 1,2,3,· · · , n, are the columns of B. For any
ξ∈Rnand F∈Sn, we define two operators as follows:
vecs :ξ∈Rl→vecs(ξ)∈Rn(n+1)
2,
and vech :F∈Sl→vech(F)∈Rn(n+1)
2,
where
vecs(ξ) = [ξ2
1, ξ1ξ2,· · · , ξ1ξn, x2
2, x2x3,· · · , ξn−1ξn, ξ2
n]T,
vech(F) = [f11,2f12,· · · ,2f1n, f22,2f23,· · · ,2fn−1,n, fnn]T,
and ξj,j= 1,2,· · · , n, is the jth element of ξand fji,
j, i = 1,2,· · · , n, is the (j, i)th element of matrix F. For
simplity, we denote vecs(ξ)by ξin this paper.
2 PROBLEM FORMULATION
This section presents the formulation of our LQS optimal
control problems.
Consider a continuous-time time-invariant stochastic linear
system as follows
dx(s) = [Ax(s) + Bu(s)]ds
+ [Cx(s) + Du(s)]dw(s),
x(0) = x0,
(1)
where x0∈Rnis the initial state. The cost functional is
defined as
J(u(·)) = EZ∞
0
[x(s)TQx(s) + u(s)TRu(s)]ds, (2)
where R > 0,Q≥0and [A, C|Q]is exactly detectable.
Now we give the definition of mean-square stabilizability.
Definition 1. System (1) is called mean-square stabilizable
for any initial state x0, if there exists a matrix K∈Rm×n
such that the solution of
dx(s) = (A+BK)x(s)ds
+ (C+DK)x(s)dw(s),
x(0) = x0
(3)
satisfies lims→∞ E[x(s)Tx(s)] = 0. In this case, the
feedback control u(·) = Kx(·)is called stabilizing and the
constant matrix Kis called a stabilizer of system (1).
Assumption 1. System (1) is mean-square stabilizable.
Under Assumption 1, we define the sets of admissible control
as
Uad ={u(·)∈L2
F(Rm)|u(·)is stabilizing}.(4)
Our continuous-time LQS optimal control problems are
given as follows:
Problem (LQS). For any initial state x0∈Rn, we want to
find an optimal control u∗(·)∈ Uad such that
J(u∗(·)) = inf
u(·)∈Uad
J(u(·)).(5)
Ni and Fang [18] shows that the optimal control of Problem
(LQS) can be obtained by solving the following stochastic
algebraic Riccati equation (SARE)
P A +ATP+CTP C +Q−(P B +CTP D)
×(R+DTP D)−1(BTP+DTP C)=0.(6)
Due to the nonlinear structure of SARE (6), the analytical
solution of (6) is difficult to obtain. To our best knowledge,
there are some iterative algorithms to get the approximation
solution of (6), one of which is the PI method developed
in Ni and Fang [18]. We summarize the method as the
following lemma.
Lemma 1. Assume [A, C|Q]is exactly detectable. For a
given stabilizer K0, let Pi∈Sn
+be the solution of
Pi(A+BKi)+(A+BKi)TPi+Q
+ (C+DKi)TPi(C+DKi) + KT
iRKi= 0,(7)
where Kiis updated by
Ki+1 =−(R+DTPiD)−1(BTPi+DTPiC).(8)
Then Piand Ki,i= 0,1,2,3,· · · can be uniquely deter-
mined at each iteration step, and the following conclusions
hold:
(i) Ki,i= 0,1,2,· · · , are stabilizers.
(ii) limi→∞ Pi=P∗,limi→∞ Ki=K∗, where P∗
is a nonnegative definite solution to SARE (6) and