Filtering and identification of a state space model with linear and bilinear interactions between the states

Al-Mazrooei, A; Al-Mutawa, J; El-Gebeily, M; Agarwal, R

doi:10.1186/1687-1847-2012-176

Filtering and identification of a state space model with linear and bilinear interactions between the states

Research
Open access
Published: 09 October 2012

Volume 2012, article number 176, (2012)
Cite this article

Download PDF

You have full access to this open access article

Advances in Difference Equations Submit manuscript

Filtering and identification of a state space model with linear and bilinear interactions between the states

Download PDF

A Al-Mazrooei¹,
J Al-Mutawa²,
M El-Gebeily² &
…
R Agarwal^2,3

2565 Accesses
1 Citation
Explore all metrics

Abstract

In this paper, we introduce a new bilinear model in the state space form. The evolution of this model is linear-bilinear in the state of the system. The classical Kalman filter and smoother are not applicable to this model, and therefore, we derive a new Kalman filter and smoother for our model. The new algorithm depends on a special linearization of the second-order term by making use of the best available information about the state of the system. We also derive the expectation maximization (EM) algorithm for the parameter identification of the model. A Monte Carlo simulation is included to illustrate the efficiency of the proposed algorithm. An application in which we fit a bilinear model to wind speed data taken from actual measurements is included. We compare our model with a linear fit to illustrate the superiority of the bilinear model.

Maximum Likelihood Least Squares Based Iterative Estimation for a Class of Bilinear Systems Using the Data Filtering Technique

Article 26 December 2019

Extended Gradient-based Iterative Algorithm for Bilinear State-space Systems with Moving Average Noises by Using the Filtering Technique

Article 18 February 2021

Iterative state and parameter estimation algorithms for bilinear state-space systems by using the block matrix inversion and the hierarchical principle

Article 29 September 2021

1 Introduction

Bilinear systems are a special type of nonlinear systems capable of representing a variety of important physical processes. They are used in many applications in real life such as chemistry, biology, robotics, manufacturing, engineering, and economics [1–4] where linear models are ineffective or inadequate. They have also been recently used to analyze and forecast weather conditions [5–10].

Bilinear systems have three main advantages over linear ones: Firstly, they describe a wider class of problems of practical importance. Secondly, they provide more flexible approximations to nonlinear systems than linear systems do. Thirdly, one can make use of their rich geometric and algebraic structures, which promises to be a fruitful field of research for scientists [2] as well as practitioners.

Bilinear models were first introduced in the control theory literature in 1960s [11]. So far, the type of nonlinearity that is extensively treated and analyzed consists of bilinear interaction between the states of the system and the system input [1, 2, 12]. Aside from their practical importance, these systems are easier to handle because they are reducible to linear ones through the use of a certain Kronecker product. In this work, we treat the case where the nonlinearity of the system consists of bilinear interaction between the states of the system themselves. This means that our model will be able to handle evolutions according to the Lotka-Volterra models [6] or the Lorenz weather models [7, 8, 10], thus enabling a wider and more flexible application of such models. To the best of our knowledge, no attempt has been made to treat such systems in the general setting presented here.

The widespread use of bilinear models motivates the need to develop their parameter identification algorithms. A lot of work exists in the literature which presents methods of estimation and parameter identification of linear and nonlinear systems [13–21]. The two most widely used techniques fall under the names of least square estimation and maximum likelihood estimation, respectively.

The maximum likelihood estimation is computed through the well-known EM algorithm [22]. It is an iterative method that tries to improve a current estimate of the system parameters by maximizing the underlying likelihood densities. The algorithm is useful in a variety of incomplete data problems, where algorithms such as the Newton-Raphson method may turn out to be more complicated. It consists of two steps called the Expectation step or the E-step and the Maximization step or the M-step; hence the name of the algorithm. This name was first coined by Dempster, Laird, and Rubin in their fundamental paper [22]. In this paper, we develop the EM algorithm for our bilinear system. This will necessitate also the development of a Kalman filter and smoother suitable for the nonlinear system at hand. The direct development of the recursions for the nonlinear filters is very complicated if not impossible altogether. Instead, we develop our recursions based on a linearization of the quadratic term that uses the most current state estimate available.

The remainder of this article is arranged as follows. In Section 2, the bilinear state space model problem is stated along with underlying assumptions. In Section 3, we derive the bilinear Kalman filter and smoother. Section 4 estimates the unknown parameters in the bilinear state space model via the EM algorithm. Section 5 presents a simulation example that produces very satisfactory results. A real world example is given in Section 6.

2 The bilinear state space model

In this section, we introduce a bilinear state space model and describe a generalization of the Kalman filter and smoother to this model. Our model subsumes the well-known Lorentz-96 model [7] for weather forecast, and the Lotka-Volterra evolution equations appear in many applications in chemistry, biology and control [4, 6]. Other types of bilinear models were investigated in [2, 11], where bilinearity occurs because of the interaction between the input and states of the system.

We will adopt the geometric notation as presented in [17] where the matrix inner product of two random vectors is defined by

〈 x, y 〉 = E (x y^{T}),

and

{∥ x ∥}^{2} = E (x x^{T}) = 〈 x, x 〉 .

We know that

〈 x, y 〉 = cov (x, y) + E (x) E {(y)}^{T} .

Given a sequence $Y = {y_{1}, y_{2}, \dots, y_{t}}$ of random vectors, the conditional expectation $E (x | Y)$ with respect to this inner product is interpreted geometrically as the orthogonal projection of the vector x in the space spanned by the vectors of Y. In particular, if x is uncorrelated with the elements of Y and if it has zero mean, then x is orthogonal to the subspace generated by Y and $E (x | Y) = 0$ . We will also use the projection notation

π_{t} x : = π_{Y} x : = E (x | Y) .

It is characterized by

〈 x - π_{t} x, z 〉 = 0,

for all $z \in M (Y)$ ; the closed subspace of $L^{2}$ of all random vectors z which can be written as measurable functions of the elements of Y [3].

To introduce the model, let us first define the bilinear function $a : R^{n} \times R^{n} \to R^{\frac{n (n + 1)}{2}}$ by

a (x, y) = {(x_{1} y_{1}, x_{1} y_{2}, \dots, x_{1} y_{n}, x_{2} y_{2}, x_{2} y_{3}, \dots, x_{2} y_{n}, \dots, x_{n} y_{n})}^{T},

where $a (\cdot, \cdot)$ is similar to the Kronecker product function except that there is no repetition of the entries. Consider the bilinear state space model given by

(1)

(2)

where $x_{k} \in R^{n}$ is the state vector, $y_{k} \in R^{p}$ is the measurement vector, and $z_{k} = a (x_{k}, x_{k})$ is the bilinear term given by

z_{k} = a (x_{k}, x_{k}) = {(x_{1}^{2}, x_{1} x_{2}, \dots, x_{1} x_{n}, x_{2}^{2}, x_{2} x_{3}, \dots, x_{2} x_{n}, \dots, x_{n}^{2})}^{T} .

The matrices are of appropriate dimensions, i.e., $A \in R^{n \times n}$ , $B \in R^{n \times \frac{n (n + 1)}{2}}$ and $C \in R^{p \times p}$ . The uncorrelated noise corruption signals $w_{k}$ and $v_{k}$ are, as usual, assumed to be white having Gaussian distribution with zero mean and covariances Q and R, respectively, i.e.,

\begin{matrix} w_{k} \sim N (0, Q), v_{k} \sim N (0, R), \\ 〈 w_{k}, w_{l} 〉 = Q δ_{k l}, 〈 v_{k}, v_{l} 〉 = R δ_{k l} \end{matrix}

and

〈 w_{k}, v_{l} 〉 = 0 .

Lemma 1 $〈 w_{k}, z_{l} 〉 = 0$ , $〈 v_{k}, z_{l} 〉 = 0$ , $l \leq k$ .

Proof Let $l \leq k$ . Then since $x_{l}$ , $w_{k}$ are uncorrelated and $E {w_{k}} = 0$ , $w_{k} ⊥ x_{l}$ . This means that $E {w_{k} | x_{l}} = 0$ . Hence,

\begin{array}{rcl} 〈 w_{k}, z_{l} 〉 & = & E {E {w_{k} a (x_{l}, x_{l}) | x_{l}}} \\ = & E {E {w_{k} | x_{l}} a (x_{l}, x_{l})} \\ = & 0 . \end{array}

The second equation can be shown in exactly the same way. □

The Taylor polynomial expansion of the form $a (x, x)$ at the point $x_{0}$ can be written as follows (with $z = a (x, x)$ ):

z = z_{0} + z^{'} (x_{0}) (x - x_{0}) + \frac{1}{2} H (x, x_{0}) (x - x_{0}),

(3)

where $z^{'} (x)$ is the $\frac{n (n + 1)}{2} \times n$ gradient of $a (x, x)$ given by

z^{'} (x) = {[\frac{\partial x_{i} x_{j}}{\partial x_{l}}]}_{i, j, l = 1, 2, \dots, m},

and $H (x, x_{0})$ is given by

H (x, x_{0}) = [\begin{array}{c} {(x - x_{0})}^{T} D_{1} \\ {(x - x_{0})}^{T} D_{2} \\ ⋮ \\ {(x - x_{0})}^{T} D_{m} \end{array}],

with $D_{k}$ being the matrix of second-order derivatives of the entries of $a (x, x)$ and $m = \frac{n (n + 1)}{2}$ . That is,

D_{k} = [\begin{array}{c} \frac{\partial z_{k 1}^{'}}{\partial x_{1}} & \frac{\partial z_{k 2}^{'}}{\partial x_{1}} & \dots & \frac{\partial z_{k n}^{'}}{\partial x_{1}} \\ \frac{\partial z_{k 1}^{'}}{\partial x_{2}} & \frac{\partial z_{k 2}^{'}}{\partial x_{2}} & \dots & \frac{\partial z_{k n}^{'}}{\partial x_{2}} \\ ⋮ & ⋮ & ⋮ & ⋮ \\ \frac{\partial z_{k 1}^{'}}{\partial x_{n}} & \frac{\partial z_{k 2}^{'}}{\partial x_{n}} & \dots & \frac{\partial z_{k n}^{'}}{\partial x_{n}} \end{array}], where k = 1, 2, \dots, m .

To illustrate, suppose $n = 3$ , then

z^{'} (x) = [\begin{array}{c} 2 x_{1} & 0 & 0 \\ x_{2} & x_{1} & 0 \\ x_{3} & 0 & x_{1} \\ 0 & 2 x_{2} & 0 \\ 0 & x_{3} & x_{2} \\ 0 & 0 & 2 x_{3} \end{array}]

and

\begin{matrix} D_{1} = [\begin{array}{c} 2 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{array}], D_{2} = [\begin{array}{c} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{array}], D_{3} = [\begin{array}{c} 0 & 0 & 1 \\ 0 & 0 & 0 \\ 1 & 0 & 0 \end{array}], \\ D_{4} = [\begin{array}{c} 0 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 0 \end{array}], D_{5} = [\begin{array}{c} 0 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{array}], D_{6} = [\begin{array}{c} 0 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 2 \end{array}] . \end{matrix}

Note, for example, that the Lorentz-96 model (with $n = 3$ ) takes the form (1) with $A = - I$ and

B = [\begin{array}{c} 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 1 & 0 \end{array}] .

3 A bilinear Kalman filter and smoother

In this section, we will develop a Kalman filter and smoother for the bilinear system (1) and (2).

3.1 A bilinear Kalman filter

Given a sequence of measurements $Y_{t} = {y_{1}, y_{2}, \dots, y_{t}}$ , let

\begin{matrix} x_{k}^{t} = E (x_{k} | Y_{t}) : = E_{t} (x_{k}), \\ P_{k}^{t} = {∥ x_{k} - x_{k}^{t} ∥}^{2}, \\ z_{k}^{t} = E_{t} (z_{k}), \\ {\dot{P}}_{k}^{t} = 〈 x_{k} - x_{k}^{t}, z_{k} - z_{k}^{t} 〉, \\ {\ddot{P}}_{k}^{t} = {∥ z_{k} - z_{k}^{t} ∥}^{2} . \end{matrix}

When $x = x_{k}$ , equation (3) becomes

z = z_{0} + z^{'} (x_{0}) (x_{k} - x_{0}) + \frac{1}{2} H (x_{k}, x_{0}) (x_{k} - x_{0}) .

(4)

In order to compute equation (4), we approximate the second-degree term $H (x_{k}, x_{0})$ by using the most current available state estimation for $x_{k}$ ; that is,

In the case of prediction, we take
$x_{k} \approx x_{k}^{k - 2}, so H (x_{k}, x_{0}) \approx H (x_{k}^{k - 2}, x_{0}) .$

By setting $x_{0} = x_{k}^{k - 1}$ , equation (4) becomes

z_{k} \approx z_{k}^{k - 1} + z^{'} (x_{k}^{k - 1}) (x_{k} - x_{k}^{k - 1}) + \frac{1}{2} H (x_{k}^{k - 2}, x_{k}^{k - 1}) (x_{k} - x_{k}^{k - 1}) .

In the case of filtering, we take
$x_{k} \approx x_{k}^{k - 1}, so H (x_{k}, x_{0}) \approx H (x_{k}^{k - 1}, x_{0}) .$

By setting $x_{0} = x_{k}^{k}$ , equation (4) becomes

z_{k} \approx z_{k}^{k} + z^{'} (x_{k}^{k}) (x_{k} - x_{k}^{k}) + \frac{1}{2} H (x_{k}^{k - 1}, x_{k}^{k}) (x_{k} - x_{k}^{k}) .

In the case of smoothing, we take
$x_{k} \approx x_{k}^{k + 2}, so H (x_{k}, x_{0}) \approx H (x_{k}^{k + 2}, x_{0}) .$

By setting $x_{0} = x_{k}^{k + 1}$ , equation (4) becomes

z_{k} \approx z_{k}^{k + 1} + z^{'} (x_{k}^{k + 1}) (x_{k} - x_{k}^{k + 1}) + \frac{1}{2} H (x_{k}^{k + 2}, x_{k}^{k + 1}) (x_{k} - x_{k}^{k + 1}) .

In summary, we have the following linearization:

z_{k} \approx z_{k}^{t} + V_{k}^{t} (x_{k} - x_{k}^{t}),

(5)

where

V_{k}^{t} = z^{'} (x_{k}^{t}) + \frac{1}{2} H (x_{k}^{t \pm 1}, x_{k}^{t}) .

We also define

(6)

(7)

(8)

Theorem 2 For the bilinear state space model defined by (1) and (2), we have

(9)

(10)

with

\begin{matrix} x_{k + 1}^{k + 1} = x_{k + 1}^{k} + K_{k + 1} [y_{k} - C x_{k + 1}^{k}], \\ P_{k + 1}^{k + 1} = [I - K_{k + 1} C] P_{k + 1}^{k}, \\ {\dot{P}}_{k + 1}^{k + 1} = P_{k + 1}^{k + 1} {[V_{k + 1}^{k + 1}]}^{T}, \\ {\ddot{P}}_{k + 1}^{k + 1} = V_{k + 1}^{k + 1} {\dot{P}}_{k + 1}^{k + 1}, \\ K_{k + 1} = P_{k + 1}^{k} C^{T} {[C P_{k + 1}^{k} C^{T} + R]}^{- 1}, \end{matrix}

and

V_{k + 1}^{k + 1} = z^{'} (x_{k + 1}^{k + 1}) + \frac{1}{2} H (x_{k + 1}^{k}, x_{k + 1}^{k + 1}), k = 0, \dots, N .

Proof Equation (9) is obtained by applying the conditional expectation $E_{k} (\cdot)$ to (1):

\begin{array}{rcl} x_{k + 1}^{k} & = & E_{k} (x_{k + 1}) \\ = & E_{k} (A x_{k} + B z_{k} + w_{k}) \\ = & A E_{k} (x_{k}) + B E_{k} (z_{k}) + E_{k} (w_{k}) \\ = & A x_{k}^{k} + B z_{k}^{k} . \end{array}

To obtain the error recursion (10), we proceed as follows:

\begin{array}{rcl} P_{k + 1}^{k} & = & {∥ x_{k + 1} - x_{k + 1}^{k} ∥}^{2} = {∥ (I - π_{k}) x_{k + 1} ∥}^{2} \\ = & {∥ (I - π_{k}) (A x_{k} + B z_{k} + w_{k}) ∥}^{2} \\ = & {∥ A (x_{k} - x_{k}^{k}) + B (z_{k} - z_{k}^{k}) + w_{k} ∥}^{2} \\ = & A {∥ x_{k} - x_{k}^{k} ∥}^{2} A^{T} + B {∥ z_{k} - z_{k}^{k} ∥}^{2} B^{T} + A 〈 x_{k} - x_{k}^{k}, z_{k} - z_{k}^{k} 〉 B^{T} \\ + B 〈 z_{k} - z_{k}^{k}, x_{k} - x_{k}^{k} 〉 A^{T} + A 〈 x_{k} - x_{k}^{k}, w_{k} 〉 + 〈 w_{k}, x_{k} - x_{k}^{k} 〉 A^{T} \\ + B 〈 z_{k} - z_{k}^{k}, w_{k} 〉 + 〈 w_{k}, z_{k} - z_{k}^{k} 〉 B^{T} + {∥ w_{k} ∥}^{2} \\ = & A P_{k}^{k} A^{T} + A {\dot{P}}_{k}^{k} B^{T} + B {({\dot{P}}_{k}^{k})}^{T} A^{T} + B {\ddot{P}}_{k}^{k} B^{T} + Q . \end{array}

Now, when $t = k$ , we derive the filtering steps. Let

\begin{array}{rcl} ρ_{k} & = & y_{k} - E_{k - 1} (y_{k}) = (I - π_{k - 1}) y_{k} \\ = & (I - π_{k - 1}) (C x_{k} + v_{k}) = y_{k} - C x_{k}^{k - 1} \\ = & C (x_{k} - x_{k}^{k - 1}) + v_{k}, k = 1, \dots, N . \end{array}

Then, the mean of the innovations is given by

E_{k - 1} (ρ_{k}) = π_{k - 1} (I - π_{k - 1}) y_{k} = 0,

and the variance

\begin{array}{rcl} Σ_{k + 1} & = & {∥ ρ_{k + 1} ∥}^{2} \\ = & {∥ C (x_{k + 1} - x_{k + 1}^{k}) + v_{k + 1} ∥}^{2} \\ = & C {∥ x_{k + 1} - x_{k + 1}^{k} ∥}^{2} C^{T} + {∥ v_{k + 1} ∥}^{2} \\ = & C P_{k + 1}^{k} C^{T} + R . \end{array}

Also,

\begin{array}{rcl} 〈 ρ_{k + 1}, y_{k} 〉 & = & 〈 y_{k + 1} - y_{k + 1}^{k}, y_{k} 〉 \\ = & 〈 (I - π_{k}) y_{k + 1}, y_{k} 〉 = 〈 y_{k + 1}, (I - π_{k}) y_{k} 〉 = 0, \end{array}

which means that the innovations are orthogonal to the past measurements. On the other hand,

\begin{array}{rcl} 〈 x_{k + 1}, ρ_{k + 1} 〉 & = & 〈 x_{k + 1}, C (x_{k + 1} - x_{k + 1}^{k}) + v_{k + 1} 〉 \\ = & 〈 x_{k + 1}, x_{k + 1} - x_{k + 1}^{k} 〉 C^{T} \\ = & 〈 x_{k + 1}, (I - π_{k}) x_{k + 1} 〉 C^{T} \\ = & 〈 (I - π_{k}) x_{k + 1}, (I - π_{k}) x_{k + 1} 〉 C^{T} \\ = & {∥ (I - π_{k}) x_{k + 1} ∥}^{2} C^{T} \\ = & P_{k + 1}^{k} C^{T} . \end{array}

From these results, we conclude that $x_{k + 1}$ and $ρ_{k + 1}$ have a Gaussian joint distribution conditional on $Y_{k}$ . That is,

{(\begin{array}{c} x_{k + 1} \\ ρ_{k + 1} \end{array}) | {y_{t}}_{1}^{k}} \sim N {(\begin{array}{c} x_{k + 1}^{k} \\ 0 \end{array}), (\begin{array}{c} P_{k + 1}^{k} & P_{k + 1}^{k} C^{T} \\ C P_{k + 1}^{k} & Σ_{k + 1} \end{array})} .

Now, since $Y_{k}$ , $ρ_{k + 1}$ are orthogonal,

\begin{array}{rcl} x_{k + 1}^{k + 1} & = & π_{k + 1} (x_{k + 1}) = π_{{Y_{k}, ρ_{k + 1}}} (x_{k + 1}) \\ = & π_{Y_{k}} (x_{k + 1}) + π_{ρ_{k + 1}} (x_{k + 1}) \\ = & E_{k} (x_{k + 1}) + 〈 x_{k + 1}, ρ_{k + 1} 〉 Σ_{k + 1}^{- 1} ρ_{k + 1} \\ = & x_{k + 1}^{k} + P_{k + 1}^{k} C^{T} {[C P_{k + 1}^{k} C^{T} + R]}^{- 1} ρ_{k + 1} \\ = & x_{k + 1}^{k} + K_{k + 1} [y_{k + 1} - C x_{k + 1}^{k}], \end{array}

where

K_{k + 1} = P_{k + 1}^{k} C^{T} {[C P_{k + 1}^{k} C^{T} + R]}^{- 1} = P_{k + 1}^{k} C^{T} Σ_{k + 1}^{- 1}

represents the Kalman gain.

Next, we derive the recursion for $P_{k + 1}^{k + 1}$ . Since $x_{k + 1} - x_{k + 1}^{k} = (x_{k + 1} - x_{k + 1}^{k + 1}) + π_{ρ_{k + 1}} (x_{k + 1} - x_{k + 1}^{k})$ is an orthogonal decomposition,

\begin{array}{rcl} P_{k + 1}^{k + 1} & = & {∥ x_{k + 1} - x_{k + 1}^{k + 1} ∥}^{2} \\ = & {∥ x_{k + 1} - x_{k + 1}^{k} ∥}^{2} - {∥ π_{ρ_{k + 1}} (x_{k + 1} - x_{k + 1}^{k}) ∥}^{2} \\ = & P_{k + 1}^{k} - {∥ 〈 x_{k + 1} - x_{k + 1}^{k}, ρ_{k + 1} 〉 Σ_{k + 1}^{- 1} ρ_{k + 1} ∥}^{2} \\ = & P_{k + 1}^{k} - {∥ P_{k + 1}^{k} C^{T} Σ_{k + 1}^{- 1} ρ_{k + 1} ∥}^{2} \\ = & P_{k + 1}^{k} - P_{k + 1}^{k} C^{T} Σ_{k + 1}^{- 1} {∥ ρ_{k + 1} ∥}^{2} Σ_{k + 1}^{- 1} C P_{k + 1}^{k} \\ = & P_{k + 1}^{k} - P_{k + 1}^{k} C^{T} Σ_{k + 1}^{- 1} C P_{k + 1}^{k} \\ = & P_{k + 1}^{k} - K_{k + 1} C P_{k + 1}^{k} \\ = & [I - K_{k + 1} C] P_{k + 1}^{k} . \end{array}

The equation for ${\dot{P}}_{k + 1}^{k + 1}$ is obtained as follows:

\begin{array}{rcl} {\dot{P}}_{k + 1}^{k + 1} & = & 〈 x_{k + 1} - x_{k + 1}^{k + 1}, z_{k + 1} - z_{k + 1}^{k + 1} 〉 \\ = & 〈 x_{k + 1} - x_{k + 1}^{k + 1}, V_{k + 1}^{k + 1} (x_{k + 1} - x_{k + 1}^{k + 1}) 〉 \\ = & 〈 x_{k + 1} - x_{k + 1}^{k + 1}, x_{k + 1} - x_{k + 1}^{k + 1} 〉 {[V_{k + 1}^{k + 1}]}^{T} \\ = & P_{k + 1}^{k + 1} {[V_{k + 1}^{k + 1}]}^{T} . \end{array}

Finally, for ${\ddot{P}}_{k + 1}^{k + 1}$ we have

\begin{array}{rcl} {\ddot{P}}_{k + 1}^{k + 1} & = & 〈 z_{k + 1} - z_{k + 1}^{k + 1}, z_{k + 1} - z_{k + 1}^{k + 1} 〉 \\ = & V_{k + 1}^{k + 1} 〈 x_{k + 1} - x_{k + 1}^{k + 1}, x_{k + 1} - x_{k + 1}^{k + 1} 〉 {[V_{k + 1}^{k + 1}]}^{T} \\ = & V_{k + 1}^{k + 1} P_{k + 1}^{k + 1} {[V_{k + 1}^{k + 1}]}^{T} \\ = & V_{k + 1}^{k + 1} {\dot{P}}_{k + 1}^{k + 1} . \end{array}

This completes the proof. □

We summarize the bilinear Kalman filter as follows:

(11)

(12)

and

V_{k}^{k} = z^{'} (x_{k}^{k}) + \frac{1}{2} H (x_{k}^{k - 1}, x_{k}^{k}), k = 0, \dots, N .

Also, note that the bilinear Kalman filter algorithm is a generalization of the Kalman filter for the linear case which is given in [17].

3.2 A bilinear Kalman smoother

In this subsection, we will develop a Kalman smoother for the bilinear system (1) and (2). We will use the following notation:

\begin{matrix} P_{k_{1}, k_{2}}^{N} = 〈 x_{k_{1}} - x_{k_{1}}^{N}, x_{k_{2}} - x_{k_{2}}^{N} 〉, \\ {\dot{P}}_{k_{1}, k_{2}}^{N} = 〈 x_{k_{1}} - x_{k_{1}}^{N}, z_{k_{2}} - z_{k_{2}}^{N} 〉, \\ {\ddot{P}}_{k_{1}, k_{2}}^{N} = 〈 z_{k_{1}} - z_{k_{1}}^{N}, z_{k_{2}} - z_{k_{2}}^{N} 〉 . \end{matrix}

Lemma 3 Let

ϵ_{k + 1} = {v_{k + 1}, \dots, v_{N}, w_{k + 2}, \dots, w_{N}} .

(13)

Then for $1 \leq k \leq N - 1$ and with the approximation (5),

L {y_{m}}_{1}^{N} = L {{y_{m}}_{1}^{k}, x_{k + 1} - x_{k + 1}^{k}, ϵ_{k + 1}},

(14)

where $L {\cdot}$ denotes the subspace spanned by ${\cdot}$ .

Proof Recall that

z_{m} = z_{m}^{N} + V_{m}^{N} (x_{m} - x_{m}^{N}),

that is,

(z_{m} - z_{m}^{N}) \in L {x_{m} - x_{m}^{N}} .

Since

Similarly, since

Continuing in this manner, we get (14). □

We state the bilinear Kalman smoother in the following theorem.

Theorem 4 Consider the bilinear state space model (1) and (2) with $x_{N}^{N}$ and $P_{N}^{N}$ as given in (11) and (12). Then for $k = N - 1, \dots, 1$ , we have

(15)

(16)

where

J_{k} = [P_{k}^{k} A^{T} + {\dot{P}}_{k}^{k} B^{T}] {[P_{k + 1}^{k}]}^{- 1} .

Proof Noting the mutual orthogonality of ${y}_{1}^{k}$ , ${x_{k + 1} - x_{k + 1}^{k}}$ and $ϵ_{k + 1}$ and the orthogonality of $x_{k}$ and $ϵ_{k + 1}$ ,

\begin{array}{rcl} x_{k}^{N} & = & π_{N} x_{k} = π_{k} x_{k} + π_{(x_{k + 1} - x_{k + 1}^{k})} x_{k} \\ = & x_{k}^{k} + 〈 x_{k}, x_{k + 1} - x_{k + 1}^{k} 〉 {∥ x_{k + 1} - x_{k + 1}^{k} ∥}^{- 2} (x_{k + 1} - x_{k + 1}^{k}) \\ = & x_{k}^{k} + 〈 x_{k}, x_{k + 1} - x_{k + 1}^{k} 〉 {[P_{k + 1}^{k}]}^{- 1} (x_{k + 1} - x_{k + 1}^{k}) . \end{array}

Now,

\begin{array}{rcl} 〈 x_{k}, x_{k + 1} - x_{k + 1}^{k} 〉 & = & 〈 x_{k}, A x_{k} + B z_{k} + w_{k} - x_{k + 1}^{k} 〉 \\ = & 〈 x_{k}, A (x_{k} - x_{k}^{k}) + B (z_{k} - z_{k}^{k}) 〉 \\ = & 〈 x_{k} - x_{k}^{k}, x_{k} - x_{k}^{k} 〉 A^{T} + 〈 x_{k} - x_{k}^{k}, z_{k} - z_{k}^{k} 〉 B^{T} \\ = & P_{k}^{k} A^{T} + {\dot{P}}_{k}^{k} B^{T} . \end{array}

Thus,

\begin{array}{rcl} x_{k}^{N} & = & x_{k}^{k} + [P_{k}^{k} A^{T} + {\dot{P}}_{k}^{k} B^{T}] {[P_{k + 1}^{k}]}^{- 1} (x_{k + 1} - x_{k + 1}^{k}) \\ = & x_{k}^{k} + J_{k} (x_{k + 1} - x_{k + 1}^{k}) . \end{array}

Equation (15) now follows by taking the projection $π_{N}$ again of both sides and noting that $k \leq N$ . To derive (16), we compute

\begin{array}{rcl} P_{k}^{N} & = & {∥ x_{k} - x_{k}^{N} ∥}^{2} = {∥ x_{k} - x_{k}^{k} - J_{k} (x_{k + 1} - x_{k + 1}^{k}) ∥}^{2} \\ = & {∥ x_{k} - x_{k}^{k} ∥}^{2} - 〈 x_{k} - x_{k}^{k}, x_{k + 1} - x_{k + 1}^{k} 〉 J_{k}^{T} - J_{k} 〈 x_{k + 1} - x_{k + 1}^{k}, x_{k} - x_{k}^{k} 〉 + J_{k} P_{k + 1}^{k} J_{k}^{T} \\ = & P_{k}^{k} - 〈 (1 - π_{k}) x_{k}, x_{k + 1} 〉 J_{k}^{T} - J_{k} 〈 x_{k + 1}, (1 - π_{k}) x_{k} 〉 + J_{k} P_{k + 1}^{k} J_{k}^{T} \\ = & P_{k}^{k} - 〈 (1 - π_{k}) x_{k}, A x_{k} + B z_{k} 〉 J_{k}^{T} - J_{k} 〈 A x_{k} + B z_{k}, (1 - π_{k}) x_{k} 〉 + J_{k} P_{k + 1}^{k} J_{k}^{T} \\ = & P_{k}^{k} - (P_{k}^{k} A^{T} + {\dot{P}}_{k}^{k} B^{T}) J_{k}^{T} - J_{k} (A P_{k}^{k} + B {\dot{P}}_{k}^{k}) + J_{k} P_{k + 1}^{k} J_{k}^{T} \\ = & P_{k}^{k} - J_{k} P_{k + 1}^{k} J_{k}^{T} - J_{k} P_{k + 1}^{k} J_{k}^{T} + J_{k} P_{k + 1}^{k} J_{k}^{T} = P_{k}^{k} - J_{k} P_{k + 1}^{k} J_{k}^{T}, \end{array}

which completes the proof. □

The next theorem states the bilinear lag-one recursions.

Theorem 5 Consider the bilinear state space model (1) and (2). Then

\begin{matrix} P_{k + 1, k}^{N} = A P_{k}^{N} + B {({\dot{P}}_{k}^{N})}^{T}, \\ {\dot{P}}_{k + 1, k}^{N} = P_{k + 1, k}^{N} {[V_{k}^{N}]}^{T} . \end{matrix}

Proof Using the definitions in (6) and (7),

\begin{array}{rcl} P_{k + 1, k}^{N} & = & 〈 x_{k + 1} - x_{k + 1}^{N}, x_{k} - x_{k}^{N} 〉 = 〈 (1 - π_{N}) x_{k + 1}, (1 - π_{N}) x_{k} 〉 \\ = & 〈 x_{k + 1}, (1 - π_{N}) x_{k} 〉 = 〈 A x_{k} + B z_{k} + w_{k}, (1 - π_{N}) x_{k} 〉 \\ = & A 〈 x_{k}, (1 - π_{N}) x_{k} 〉 + B 〈 z_{k}, (1 - π_{N}) x_{k} 〉 \\ = & A P_{k}^{N} + B {({\dot{P}}_{k}^{N})}^{T} . \end{array}

Also,

\begin{array}{rcl} {\dot{P}}_{k + 1, k}^{N} & = & 〈 x_{k + 1} - x_{k + 1}^{N}, z_{k} - z_{k}^{N} 〉 \\ = & 〈 x_{k + 1} - x_{k + 1}^{N}, V_{k}^{N} (x_{k} - x_{k}^{N}) 〉 \\ = & 〈 x_{k + 1} - x_{k + 1}^{N}, x_{k} - x_{k}^{N} 〉 {[V_{k}^{N}]}^{T} \\ = & P_{k + 1, k}^{N} {[V_{k}^{N}]}^{T} . \end{array}

□

4 The bilinear EM algorithm

The unknown parameter set $θ = {A, B, C, Q, R, V, μ}$ is estimated by the EM algorithm that iteratively updates the current estimate $θ (i)$ of θ by maximizing the log-likelihood function

(17)

where

$f_{0} (\cdot)$ represents the n-variate normal density of the initial state $x_{0}$ with mean μ and the covariance matrix V.
$f_{v} (\cdot)$ represents the p-variate normal density with zero mean and the covariance matrix R.
$f_{w} (\cdot)$ represents the n-variate normal density function with zero mean and the covariance matrix Q.

The conditional expectation step (E-step) finds the missing data, i.e., $X_{N}$ , given the observed data and current estimated parameters, and then substitutes these expectations for the missing data. Specifically, let $θ (i - 1)$ be the current estimate of the parameter θ, then the E-step finds the conditional expectation $E {\cdot}$ of the complete-data log-likelihood given $θ (i - 1)$ :

q (θ | θ (i - 1)) = E {log L (θ, X_{N}, Y_{N}) | Y_{N}, θ (i - 1)} .

(18)

The M-step determines $θ (i)$ by maximizing the expected complete-data log-likelihood

q (θ (i) | θ (i - 1)) \geq q (θ | θ (i - 1)), \forall θ .

The following theorem accomplishes the expectation step.

Theorem 6 For the bilinear state space model (1) and (2),

\begin{array}{rcl} q (θ (i) | θ (i - 1)) & = & - \frac{1}{2} log | V | - \frac{1}{2} Tr {V^{- 1} (Δ - {\hat{x}}_{0} μ^{T} - μ {\hat{x}}_{0}^{T} + μ μ^{T})} \\ - \frac{N}{2} log | Q | - \frac{1}{2} Tr {Q^{- 1} (Θ - Ψ A^{T} - Π B^{T} - A Ψ^{T} \\ + A Φ A^{T} - B Π^{T} + B Λ B^{T})} \\ - \frac{N}{2} log | R | - \frac{1}{2} Tr {R^{- 1} (δ - Ω C^{T} - C Ω^{T} + C Φ C^{T})} + const, \end{array}

where

\begin{matrix} Δ = E_{N} (x_{0} x_{0}^{T}), {\hat{x}}_{0} = E_{N} (x_{0}), \\ Θ = \sum_{k = 1}^{N} (x_{k}^{N} {(x_{k}^{N})}^{T} + P_{k}^{N}), Ψ = \sum_{k = 1}^{N} (x_{k}^{N} {(x_{k - 1}^{N})}^{T} + P_{k, k - 1}^{N}), \\ Π = \sum_{k = 1}^{N} (x_{k}^{N} {(z_{k - 1}^{N})}^{T} + {\dot{P}}_{k, k - 1}^{N}), Φ = \sum_{k = 1}^{N} (x_{k - 1}^{N} {(x_{k - 1}^{N})}^{T} + P_{k - 1}^{N}), \\ Γ = \sum_{k = 1}^{N} E_{N} (x_{k - 1} z_{k - 1}^{T}), Λ = \sum_{k = 0}^{N - 1} (z_{k}^{N} {(z_{k}^{N})}^{T} + {\ddot{P}}_{k}^{N}), \\ Ω = \sum_{k = 1}^{N} x_{k}^{N} y_{k}^{T}, δ = \sum_{k = 1}^{N} {∥ y_{k} ∥}^{2} . \end{matrix}

Proof Since the system is Markovian, we may use Bayes’ rule successively to get

p (θ, X_{N}, Y_{N}) = p (y_{1}, \dots, y_{N}, x_{0}, \dots, x_{N})

(19)

= p (x_{0}) \prod_{k = 1}^{N} p (y_{k} | x_{k}) \prod_{k = 0}^{N - 1} p (x_{k + 1} | x_{k}) .

(20)

From the assumptions on $x_{0}$ , $w_{k}$ , and $v_{k}$ , the density functions $p (x_{0})$ , $p (y_{k} | x_{k})$ , and $p (x_{k + 1} | x_{k})$ are given by

\begin{matrix} p (x_{0}) = \frac{1}{{(2 π)}^{\frac{n}{2}} | V |^{\frac{1}{2}}} exp {- \frac{1}{2} {(x_{0} - μ)}^{T} V^{- 1} (x_{0} - μ)}, \\ p (x_{k + 1} | x_{k}) = \frac{1}{{(2 π)}^{\frac{n}{2}} | Q |^{\frac{1}{2}}} exp {- \frac{1}{2} {(x_{k + 1} - A x_{k} - B z_{k})}^{T} Q^{- 1} (x_{k + 1} - A x_{k} - B z_{k})}, \end{matrix}

and

p (y_{k} | x_{k}) = \frac{1}{{(2 π)}^{\frac{n}{2}} | R |^{\frac{1}{2}}} exp {- \frac{1}{2} {(y_{k} - C x_{k})}^{T} R^{- 1} (y_{k} - C x_{k})} .

Now, substituting these densities in (17) and taking the logarithm of both sides, we get

\begin{array}{rcl} L (θ, X_{N}, Y_{N}) & = & - \frac{1}{2} log | V | - \frac{1}{2} {(x_{0} - μ)}^{T} V^{- 1} (x_{0} - μ) \\ - \frac{N}{2} log | Q | - \frac{1}{2} \sum_{k = 0}^{N - 1} {(x_{k + 1} - A x_{k} - B z_{k})}^{T} Q^{- 1} (x_{k + 1} - A x_{k} - B z_{k}) \\ - \frac{N}{2} log | R | - \frac{1}{2} \sum_{k = 1}^{N} {(y_{k} - C x_{k})}^{T} R^{- 1} (y_{k} - C x_{k}) + const . \end{array}

The result follows upon taking the expectation conditional on $Y_{N}$ , making use of

\begin{matrix} E_{N} (x^{T} A y) = Tr [A E_{N} (x y^{T})], \\ E_{N} (x_{k} z_{k}^{T}) = 0, \\ E_{N} (x_{k + 1} z_{k}^{T}) = B E_{N} (z_{k} z_{k}^{T}), \end{matrix}

and simplifying. The middle equality follows from the fact that odd moments of Gaussian random variables vanish. □

The computation of Θ, Φ, Ψ, Π and Λ given a current estimate $θ (i)$ of θ involves the use of the bilinear Kalman filter and smoother introduced in Sections 3.1 and 3.2. For this purpose, we introduce

\begin{matrix} Θ = \sum_{k = 1}^{N} E_{t} (x_{k} x_{k}^{T}) = \sum_{k = 1}^{N} (x_{k}^{t} {(x_{k}^{t})}^{T} + P_{k}^{t}), \\ Φ = \sum_{k = 1}^{N} (x_{k - 1}^{t} {(x_{k - 1}^{t})}^{T} + P_{k - 1}^{t}), Ψ = \sum_{k = 1}^{N} (x_{k}^{t} {(x_{k - 1}^{t})}^{T} + P_{k, k - 1}^{t}), \\ Π = \sum_{k = 1}^{N} (x_{k}^{t} {(z_{k - 1}^{t})}^{T} + {\dot{P}}_{k, k - 1}^{t}), Λ = \sum_{k = 0}^{N - 1} (z_{k}^{t} {(z_{k}^{t})}^{T} + {\ddot{P}}_{k}^{t}) . \end{matrix}

The next step of the EM algorithm is to maximize the function $q (θ (i) | θ (i - 1))$ with respect to θ.

Theorem 7 The maximizer of $q (θ (i) | θ (i - 1))$ is obtained for the parameter vector θ given by

(21)

(22)

(23)

Proof Let

Then

q (θ (i) | θ (i - 1)) = q_{1} (μ, V) + q_{2} (A, B, Q) + q_{3} (C, R) + const,

which means that $q (θ (i), θ (i - 1))$ is maximized by separately minimizing $q_{1}$ , $q_{2}$ , $q_{3}$ . This is done by setting the partial derivative of q with respect to each parameter equal to zero (i.e., $\frac{\partial q}{\partial x} = 0$ ) and solving the resulting system of equations. □

The EM algorithm for a bilinear state space model is summarized as follows.

Bilinear EM algorithm

1.
Initialize the EM algorithm by choosing initial values of $θ (0)$ .
2.
Calculate the incomplete-data likelihood, $log L (Y_{n}; θ)$ .
3.
Execute the E-step by using the bilinear Kalman filter and smoother in (9)-(10) and (15)-(16), respectively.
4.
Execute the M-step using (21)-(23) and update the estimates of θ using (M-step) to obtain $θ (i)$ .
5.
Repeat Steps 2 to 4 until convergence.

5 Simulation results

A $1, 000$ Monte Carlo simulation is performed to illustrate the utility of the bilinear algorithm. The observed data are generated according to the second-order bilinear state space model

(24)

(25)

where $w_{k}$ and $v_{k}$ are independent identically distributed (i.i.d.) Gaussian noises such that

\begin{matrix} w_{k} \sim N (0, 0.01 \times I_{2}), \\ v_{k} \sim N (0, 0.01) . \end{matrix}

In all simulations, the number of iterations for the EM algorithm is fixed and its value set to $J = 100$ .

Figure 1 shows a sample of realizations of the input noise $w_{k}$ , and Figure 2 shows the output noise $v_{k}$ , respectively. Figure 3 compares the observed output signals and the estimated output signal. The average estimates of the parameters are

\begin{matrix} A = [\begin{array}{c} 0.3891 & 0.1015 \\ - 0.09812 & 0.2117 \end{array}], B = [\begin{array}{c} 0.0012 & 1.1016 & 0 \\ 0 & - 0.0345 & 0.9761 \end{array}], \\ C = [\begin{array}{c} - 0.0927 & 1.014 \end{array}] . \end{matrix}

The mean square error (MSE) is defined as

E_{N} = \frac{1}{N} \sum_{k = 1}^{N} {(y_{k} - C x_{k | k - 1})}^{2},

and its value for $1, 000$ run for different values of $cov (R)$ and $cov (Q)$ is kept constant, which is shown in Table 1.

Table 1 Comparison of the mean square errors

Full size table

6 Application to wind speed

In this section, we apply the proposed bilinear algorithm to the daily averaged wind speed data for Arar, a city located in the north eastern region of the kingdom of Saudi Arabia for a period of $16 \frac{1}{2}$ years as shown in Figure 4. It should be noted that all the calculations are carried out on normalized time series data.

To estimate the dimension of the state in the state space model, we apply the stochastic subspace system identification algorithm described in [23]. This is done by constructing the singular value diagram of the block Hankel matrix for the normalized wind speed data as shown in Figure 5. That is, the dimension of the state equals the number of the significant singular values; here $n = 2$ . For clarity, we compare the observed wind speed values with the estimated ones using a linear model [3] with our proposed algorithm for a period of 100 days as shown in Figure 6. The estimated parameters for the linear state space model are

A = [\begin{array}{c} 0.9475 & 0.1991 \\ 0.1968 & 0.0935 \end{array}], C = [\begin{array}{c} 0.5355 & 0.3492 \end{array}],

and for the bilinear state space model, they are

\begin{matrix} A = [\begin{array}{c} 0.9475 & 0.1991 \\ 0.1968 & 0.0935 \end{array}], C = [\begin{array}{c} 0.5355 & 0.3492 \end{array}], \\ B = [\begin{array}{c} 0 & 1.092 & - 1.092 \\ 0.01092 & 0.0557 & 0.1092 \end{array}] . \end{matrix}

The MSE for the estimated wind speed data using the linear EM algorithm is 0.3864, and 0.0197 for the bilinear EM algorithm.

References

Krener AJ: Bilinear and nonlinear realizations of input-output maps. SIAM J. Control 1975, 13(4):827–834. 10.1137/0313049
Article MathSciNet MATH Google Scholar
Pardalos PM, Yatsenko V: Optimization and Control of Bilinear Systems: Theory, Algorithms and Applications. Springer, Berlin; 2008.
Book MATH Google Scholar
Shumway R, Stoffer D: An approach to time series smoothing and forecasting using the EM algorithm. J. Time Ser. Anal. 1982, 3(4):253–264. 10.1111/j.1467-9892.1982.tb00349.x
Article MATH Google Scholar
Strgate SH: Nonlinear Dynamics and Chaos, with Applications to Physics, Biology, Chemistry, and Engineering. Perseus Books, New York; 1994.
Google Scholar
Galanis G, Anadranistakis M: A one-dimension Kalman filter for correction of near surface temperature forecasts. Meteorol. Appl. 2002, 9: 437–441. 10.1017/S1350482702004061
Article Google Scholar
Goel NS: On the Volterra and Other Nonlinear Models of Interacting Populations. Academic Press, San Diego; 1971.
Google Scholar
Lorenz EN, Emanuel KE: Optimal sites for supplementary weather observations: simulations with a small model. J. Atmos. Sci. 1998, 55: 399–414. 10.1175/1520-0469(1998)055<0399:OSFSWO>2.0.CO;2
Article Google Scholar
Lorenz EN: Designing chaotic models. J. Atmos. Sci. 2005, 62: 1574–1588. 10.1175/JAS3430.1
Article MathSciNet Google Scholar
Monbet V, Ailliot P, Prevosto M: Survey of stochastic models for wind and sea state time series. Probab. Eng. Mech. 2007, 22: 113–126. 10.1016/j.probengmech.2006.08.003
Article Google Scholar
Roy D, Musielak ZE: Generalized Lorenz models and their routes to chaos. III. Energy-conserving horizontal and vertical mode truncations. Chaos Solitons Fractals 2007, 33: 1064–1070. 10.1016/j.chaos.2006.05.084
Article MathSciNet MATH Google Scholar
Preistley MB: Non-Linear and Non-Stationary Time Series Analysis. Academic Press, San Diego; 1989.
Google Scholar
Gibson S, Wills A, Ninness B: Maximum-likelihood parameter estimation of bilinear systems. IEEE Trans. Autom. Control 2005, 50: 1581–1596.
Article MathSciNet Google Scholar
Anderson JL, Anderson SL: A Monte Carlo implementation of the nonlinear filtering problem to produce ensemble assimilations and forecasts. Mon. Weather Rev. 1999, 127: 2741–2758. 10.1175/1520-0493(1999)127<2741:AMCIOT>2.0.CO;2
Article Google Scholar
Bendat J: Nonlinear System Analysis and Identification from Random Data. Wiley-Interscience, New York; 1990.
MATH Google Scholar
Daum FE: Exact finite dimensional nonlinear filters. IEEE Trans. Autom. Control 1986, 31(7):616–622. 10.1109/TAC.1986.1104344
Article MathSciNet MATH Google Scholar
Ha QP, Trinh H: State and input simultaneous estimation for a class of nonlinear system. Automatica 2004, 40: 1779–1785. 10.1016/j.automatica.2004.05.012
Article MathSciNet MATH Google Scholar
Kailath T, Sayed A, Hassibi B: Linear Estimation. Prentice Hall, New York; 2000.
MATH Google Scholar
Kerschen G, Worden K, Vakakis AF, Golinval J: Past, present and future of nonlinear system identification in structural dynamics. Mech. Syst. Signal Process. 2006, 20: 505–592. 10.1016/j.ymssp.2005.04.008
Article Google Scholar
Ljung L: System Identification: Theory for the User. 2nd edition. Prentice Hall, Upper Saddle River; 1999.
MATH Google Scholar
Norgaard M, Poulsen NK, Ravn O: New developments in state estimation for nonlinear systems. Automatica 2000, 36: 1627–1638. 10.1016/S0005-1098(00)00089-3
Article MathSciNet MATH Google Scholar
Wiener N: Nonlinear Problems in Random Theory. MIT Press, Boston; 1958.
MATH Google Scholar
Dempster A, Laird N, Rubin D: Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. B 1977, 39: 1–38.
MathSciNet MATH Google Scholar
Tanaka H, Katayama T: A stochastic realization algorithm via block LQ decomposition in Hilbert space. Automatica 2006, 42: 741–746. 10.1016/j.automatica.2005.12.025
Article MathSciNet MATH Google Scholar

Download references

Acknowledgements

The first author was supported by Tayyebah University. The second and third authors would like to thank King Fahd University for the excellent research facilities they provide.

Author information

Authors and Affiliations

Department of Mathematics, Taibah University, Al-Madinah, Kingdom of Saudi Arabia
A Al-Mazrooei
Department of Mahtematics and Statistics, King Fahd University of Petroleum and Minerals, Dhaharan, 31261, Kingdom of Saudi Arabia
J Al-Mutawa, M El-Gebeily & R Agarwal
Department of Mathematics, Texas A&M University-Kingsville, Kingsville, TX, 78363-8202, USA
R Agarwal

Authors

A Al-Mazrooei
View author publications
You can also search for this author in PubMed Google Scholar
J Al-Mutawa
View author publications
You can also search for this author in PubMed Google Scholar
M El-Gebeily
View author publications
You can also search for this author in PubMed Google Scholar
R Agarwal
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to A Al-Mazrooei.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

The authors have achieved equal contributions to each part of this paper. All authors read and approved the final version of the manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Al-Mazrooei, A., Al-Mutawa, J., El-Gebeily, M. et al. Filtering and identification of a state space model with linear and bilinear interactions between the states. Adv Differ Equ 2012, 176 (2012). https://doi.org/10.1186/1687-1847-2012-176

Download citation

Received: 22 April 2012
Accepted: 24 September 2012
Published: 09 October 2012
DOI: https://doi.org/10.1186/1687-1847-2012-176

Filtering and identification of a state space model with linear and bilinear interactions between the states

Abstract

Similar content being viewed by others

Maximum Likelihood Least Squares Based Iterative Estimation for a Class of Bilinear Systems Using the Data Filtering Technique

Extended Gradient-based Iterative Algorithm for Bilinear State-space Systems with Moving Average Noises by Using the Filtering Technique

Iterative state and parameter estimation algorithms for bilinear state-space systems by using the block matrix inversion and the hierarchical principle

1 Introduction

2 The bilinear state space model

3 A bilinear Kalman filter and smoother

3.1 A bilinear Kalman filter

3.2 A bilinear Kalman smoother

4 The bilinear EM algorithm

5 Simulation results

6 Application to wind speed

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Authors’ original submitted files for images

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Filtering and identification of a state space model with linear and bilinear interactions between the states

Abstract

Similar content being viewed by others

1 Introduction

2 The bilinear state space model

3 A bilinear Kalman filter and smoother

3.1 A bilinear Kalman filter

3.2 A bilinear Kalman smoother

4 The bilinear EM algorithm

5 Simulation results

6 Application to wind speed

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation