
International Journal of Control Science and Engineering

p-ISSN: 2168-4952    e-ISSN: 2168-4960

2021;  11(1): 1-8

doi:10.5923/j.control.20211101.01

Received: Aug. 9, 2021; Accepted: Aug. 25, 2021; Published: Sep. 15, 2021

### Stochastic Dynamic Systems’ State Estimation Based on Mean Squared Error Minimizing and Kalman Filtering

Bukhar Kussainov

Institute of Heat Power Engineering and Control Systems, Almaty University of Power Engineering and Telecommunications, Almaty, Republic of Kazakhstan

Correspondence to: Bukhar Kussainov, Institute of Heat Power Engineering and Control Systems, Almaty University of Power Engineering and Telecommunications, Almaty, Republic of Kazakhstan.

Abstract

In automatic control systems, telecommunications, and information systems subject to random disturbances and measurement inaccuracies, the problem arises of estimating the state vector of an observed stochastic system. To address this problem, the state-space model of the system is described and the problem statement is given. The problem is solved with the discrete Kalman filter (KF), a recurrent procedure consisting of a set of difference vector-matrix equations. The paper considers a way of deriving the KF equations from the procedure of minimizing the mean-squared error of estimation, based on the method of least squares. Using this procedure, a discrete analog of the Wiener-Hopf equation is obtained, together with Gaussian and Gauss-Markov estimates of the state vector of a linear stochastic system that attain the minimum mean-squared error of estimation. On the basis of these estimates and the discrete Wiener-Hopf equation, the KF equations are derived, the theorem of the KF with the minimum mean-squared error is formulated, and the sequence of applying the KF equations, which makes up the recursive KF algorithm for computer program realization, is explained.

Keywords: Stochastic systems, State estimation, Kalman filter

Cite this paper: Bukhar Kussainov, Stochastic Dynamic Systems’ State Estimation Based on Mean Squared Error Minimizing and Kalman Filtering, International Journal of Control Science and Engineering, Vol. 11 No. 1, 2021, pp. 1-8. doi: 10.5923/j.control.20211101.01.

### 1. Introduction

Modern automatic control systems, as well as telecommunications and information systems that transmit and process signals, are subject to random perturbations, uncertainties of system parameters, external disturbances, and measurement noise; they can be described by state-space models of non-stationary linear dynamical stochastic systems [1]. In stochastic systems, in order to realize modern control algorithms or to separate a useful signal from its mixture with noise, one faces the problem of estimating the entire state vector of the dynamical system from measured values of the system's output signal [2]. The solution of this problem in real time is a filtering problem, for which the classical and most popular algorithm is the discrete Kalman filter (KF) [3,4], used both in control theory and in the theory of signal transmission and processing [5]. The KF has the same structure as the observed dynamical system: it is a mathematical model consisting of a set of difference vector-matrix equations for calculating estimates of the state of a stochastic system, the error-covariance matrices of those estimates, and the filter gain. The difference between it and the system is that at any given time the filter gain is optimal with respect to the specified statistical properties of the disturbances and measurement errors [2-8]. The computational algorithm of the KF is a recursive procedure that is convenient for program realization in programming languages as well as in MATLAB [3,9-11] and other computer programs for system modeling.
The article presents a mathematical model of a discrete non-stationary linear stochastic dynamical system, the formulation of the problem of estimating the state vector of the system, the derivation of the equations and the formulation of the KF theorem, as well as an algorithm for using the equations of the discrete KF. As is known, the estimation of the state of a dynamical system (the solution of the filtering problem), as well as the derivation of the KF equations, can be carried out using the Bayesian approach, maximum-likelihood estimation, or the least squares method [12]. Here we follow the already known path and consider the filtering problem as a generalization of the Gaussian least squares method, described in detail in [13]. Based on the least squares method and the procedure for minimizing the mean-squared error of estimation, a discrete analog of the Wiener-Hopf equation is obtained, as well as Gaussian and Gauss-Markov estimates (and their error-covariance matrices) of the state vector of the observed system, which are linear, unbiased, and attain the minimum value of the mean-squared error of estimation [13]. The discrete Wiener-Hopf equation and the Gaussian and Gauss-Markov estimates with a minimum mean-squared error are then used to derive the discrete KF equations. These equations form a recurrent procedure: at the extrapolation (prediction) stage, based on the difference equations of the dynamics of the observed system, the estimate of the state vector is calculated for the next moment of time; then, at the correction stage, based on a new measurement of the system output signal and the updated value of the KF gain, the estimate of the state vector calculated at the extrapolation stage is corrected [2-13].
From its first aerospace application, as part of the Apollo onboard guidance system [3], to the present day, the KF has demonstrated its usefulness in many applications in various areas of technology and economics [14-16]. However, it is still not easy for people unfamiliar with estimation theory to understand and implement the vector-matrix equations of the KF. While there is a large amount of excellent introductory literature on the KF, the purpose of this paper is to recall one simple method for deriving the KF equations and to explain the recursive algorithm for using them.

### 2. Notational Preliminaries

All vectors and matrices are time-varying quantities treated at the discrete time instants $k = 0, 1, 2, \ldots$. By convention, the argument of vectors (e.g., $x(k)$, $z(k)$, ...) and matrices (e.g., $\Phi(k)$, $H(k)$, ...) denotes the fact that the values of these variables correspond to the $k$-th step of time. The notation $\hat{x}(k|j)$ designates the value of the estimate of $x$ at the time instant $k$ conditioned on the measurements up to the time instant $j$. If $k > j$, we are estimating a future value of $x$, and we refer to $\hat{x}(k|j)$ as a predicted estimate. The case $k = j$ is referred to as a filtered estimate. Prediction and filtering make up the algorithm of the KF and can be done in real time [6-8].
The list of notations used throughout the paper is summarized in Table 1.
 Table 1. List of notations

### 3. The Basic Model and the Problem of the State Estimation

Consider the basic linear, time-varying (non-stationary), discrete-time state-variable model of dynamical systems [5,6]:

$$x(k+1) = \Phi(k)\,x(k) + G(k)\,u(k) + w(k) \qquad (1)$$

$$z(k) = H(k)\,x(k) + v(k) \qquad (2)$$

where $x(k)$ is the $n \times 1$ state vector; $z(k)$ is the $m \times 1$ measurement vector; $u(k)$ is the $r \times 1$ input vector; $\Phi(k)$ is the $n \times n$ system matrix; $G(k)$ is the $n \times r$ input matrix; $H(k)$ is the $m \times n$ measurement matrix. The matrices $\Phi(k)$, $G(k)$, $H(k)$ and the vector $u(k)$ are known.
Additionally, $w(k)$ is a Gaussian white-noise sequence of model uncertainties and disturbances, and $v(k)$ is a Gaussian white-noise sequence of measurement inaccuracies, i.e.,

$$E[w(k)] = 0, \quad E[w(k)\,w^{T}(j)] = Q(k)\,\delta(k-j) \qquad (3)$$

$$E[v(k)] = 0, \quad E[v(k)\,v^{T}(j)] = R(k)\,\delta(k-j) \qquad (4)$$

respectively, where the superscript $T$ denotes matrix transposition. $Q(k)$, $R(k)$ are $n \times n$ and $m \times m$ covariance matrices, respectively, and $\delta(k-j)$ is the Kronecker delta function, i.e., $\delta(k-j) = 1$ for $k = j$ and $\delta(k-j) = 0$ for $k \neq j$. It is supposed that $w(k)$ and $v(k)$ are mutually uncorrelated, i.e.,

$$E[w(k)\,v^{T}(j)] = 0 \quad \text{for all } k, j \qquad (5)$$
The initial state vector $x(0)$ is zero-mean and has a covariance matrix $P(0)$, i.e.,

$$E[x(0)] = 0, \quad E[x(0)\,x^{T}(0)] = P(0) \qquad (6)$$

The initial state vector $x(0)$ and its covariance matrix $P(0)$ are known, and $x(0)$ is uncorrelated with $w(k)$ and $v(k)$, i.e.,

$$E[x(0)\,w^{T}(k)] = 0, \quad E[x(0)\,v^{T}(k)] = 0 \qquad (7)$$
The objective is to estimate the unknown state vector $x(k)$ at time $k$ from the noisy measurement vectors $z(0), z(1), \ldots, z(j)$, where $j \leq k$.

The estimate $\hat{x}(k|k)$ of the state vector must be: 1) linear, 2) unbiased, i.e., $E[\hat{x}(k|k)] = E[x(k)]$, and 3) it must have a minimum value of the mean of the squared error, i.e.,

$$E[\tilde{x}^{T}(k|k)\,\tilde{x}(k|k)] \rightarrow \min \qquad (8)$$

where $\tilde{x}(k|k) = x(k) - \hat{x}(k|k)$ is the error in the estimate.

Thus, there is the mean-squared estimation problem: given the noisy measurements $z(0), \ldots, z(k)$, determine a linear unbiased estimator $\hat{x}(k|k)$ of the entire state vector such that the conditional mean-squared error in the estimate

$$J = E\big[\tilde{x}^{T}(k|k)\,\tilde{x}(k|k) \,\big|\, z(0), \ldots, z(k)\big] \qquad (9)$$

is minimized [5].

This mean-squared estimator can also be called the minimum-variance estimator, since

$$E[\tilde{x}^{T}(k|k)\,\tilde{x}(k|k)] = \sum_{i=1}^{n} E[\tilde{x}_i^{2}(k|k)] = \operatorname{tr} P(k|k) \qquad (10)$$

Note that here the variances $E[\tilde{x}_i^{2}(k|k)]$ are the diagonal elements of the error-covariance matrix defined by [6]:

$$P(k|k) = E[\tilde{x}(k|k)\,\tilde{x}^{T}(k|k)] \qquad (11)$$
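As an illustration, the model of Eqs. (1)-(2) can be simulated directly. The following sketch uses NumPy with a hypothetical time-invariant second-order system; all numerical values of $\Phi$, $G$, $H$, $Q$, $R$ are assumptions chosen for illustration, not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical time-invariant instances of the model matrices in Eqs. (1)-(2)
Phi = np.array([[1.0, 0.1],
                [0.0, 1.0]])      # system matrix
G = np.array([[0.005], [0.1]])    # input matrix
H = np.array([[1.0, 0.0]])        # measurement matrix
Q = 0.01 * np.eye(2)              # covariance of w(k), Eq. (3)
R = np.array([[0.04]])            # covariance of v(k), Eq. (4)

x = np.zeros(2)                   # zero-mean initial state, Eq. (6)
states, measurements = [], []
for k in range(50):
    u = np.array([1.0])                              # known input u(k)
    w = rng.multivariate_normal(np.zeros(2), Q)      # process noise w(k)
    v = rng.multivariate_normal(np.zeros(1), R)      # measurement noise v(k)
    z = H @ x + v                                    # measurement, Eq. (2)
    x = Phi @ x + G @ u + w                          # state update, Eq. (1)
    states.append(x)
    measurements.append(z)
```

Each entry of `states` and `measurements` corresponds to one time step $k$; the sequences $w(k)$ and $v(k)$ are drawn independently at every step, matching Eqs. (3)-(5).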

### 4. The Method of Minimizing of the Mean-Squared Error of Estimation

To obtain expressions for the estimate of the unknown vector $x(k)$ from the measurement vector $z(k)$ under measurement noise, as in Eq. (2), let us consider the generic linear observation model [5]:

$$z = H\,x + v \qquad (12)$$

where $x$ is an unknown $n \times 1$ vector, $z$ is a known $m \times 1$ measurement vector, $H$ is a known $m \times n$ measurement matrix, and $v$ is an unknown $m \times 1$ vector of measurement errors.

The unknown quantities $x$ and $v$ are random variables with the following expectations and covariance matrices, and they are mutually uncorrelated, i.e.:

$$E[x] = 0, \; E[x\,x^{T}] = C_x, \quad E[v] = 0, \; E[v\,v^{T}] = C_v, \quad E[x\,v^{T}] = 0 \qquad (13)$$
The assumption of linearity leads to the following expression for the estimate $\hat{x}$ of $x$:

$$\hat{x} = a + K\,z \qquad (14)$$

where the vector $a$ and the matrix $K$ must be determined. The requirement of unbiased estimation means that $E[\hat{x}] = E[x] = 0$; hence $a = 0$, since $E[z] = H\,E[x] + E[v] = 0$.

Thus Eq. (14) for $\hat{x}$ becomes:

$$\hat{x} = K\,z \qquad (15)$$
The matrix $K$ will be determined from the condition that the variance of the estimation error is minimal. According to Eq. (15), every component $\hat{x}_i$ of $\hat{x}$ depends on the vector $z$ via the $i$-th row of the matrix $K$, which is denoted as $k_i^{T}$. Thus

$$\hat{x}_i = k_i^{T}\,z \qquad (16)$$

where $k_i^{T}$ is a $1 \times m$ row vector.

The mentioned requirement of a minimum variance of the estimation error signifies that

$$E[(x_i - \hat{x}_i)^2] \rightarrow \min \qquad (17)$$

Hence

$$E[(x_i - k_i^{T} z)^2] = E[x_i^2] - 2\,k_i^{T}\,E[z\,x_i] + k_i^{T}\,E[z\,z^{T}]\,k_i \qquad (18)$$
Thus, the variance of the error is a sum in which the first term does not depend on $k_i$, while the second and third terms are linear and quadratic forms of $k_i$ ($k_i$ is the $m \times 1$ column vector). A necessary condition for the minimum of Eq. (18) is that all its partial derivatives with respect to the components of $k_i$ be equal to zero. In other words, the gradient of Eq. (18) with respect to $k_i$ must be equal to zero, i.e.,

$$\frac{\partial}{\partial k_i}\,E[(x_i - k_i^{T} z)^2] = 0 \qquad (19)$$

Applying the rule of gradient calculation to the right-hand side of Eq. (18) yields [13]:

$$E[z\,z^{T}]\,k_i = E[z\,x_i], \quad i = 1, \ldots, n \qquad (20)$$

Eq. (20) is regarded as the Wiener-Hopf equation in discrete form; let us rewrite it in the compact vector-matrix form:

$$K\,E[z\,z^{T}] = E[x\,z^{T}] \qquad (21)$$
The necessary condition for the validity of the system of linear algebraic equations (21) with the unknown weighting matrix $K$ is that the variance of each estimation error be an extremum. The sufficient condition is the positive definiteness of the matrix formed by the second derivatives of the function (18) with respect to $k_i$. In other words, the Hessian matrix of (18) with respect to $k_i$ must be positive definite for all $i$, i.e.:

$$\frac{\partial^2}{\partial k_i\,\partial k_i^{T}}\,E[(x_i - k_i^{T} z)^2] > 0 \qquad (22)$$

Calculation of the second partial derivatives for the left-hand side of Eq. (20) yields the required Hessian matrix $2\,E[z\,z^{T}]$. Hence the condition above turns into the following [13]:

$$E[z\,z^{T}] > 0 \qquad (23)$$

This condition is sufficient for the extremal values of the variance obtained by using Eq. (21) to be really minima. Thus the requirement of Eq. (23) is the necessary and sufficient condition for Eq. (21) to have exactly one solution for $K$.
To obtain the matrix $K$ we must find the covariance matrices in Eq. (21); taking into account Eqs. (12), (13) we have:

$$E[z\,z^{T}] = H\,C_x\,H^{T} + C_v \qquad (24)$$

$$E[x\,z^{T}] = C_x\,H^{T} \qquad (25)$$

Now rewrite Eq. (21) in the following form:

$$K\,(H\,C_x\,H^{T} + C_v) = C_x\,H^{T} \qquad (26)$$

Here the expression in parentheses is an $m \times m$ matrix. If there are fewer measurements than unknowns ($m < n$), then from Eq. (26) we can find the matrix $K$:

$$K = C_x\,H^{T}\,(H\,C_x\,H^{T} + C_v)^{-1} \qquad (27)$$

Substituting Eq. (27) into Eq. (15), we receive the first form of the linear unbiased estimate with a minimum value of its mean-squared error:

$$\hat{x} = C_x\,H^{T}\,(H\,C_x\,H^{T} + C_v)^{-1}\,z \qquad (28)$$
In order to obtain the covariance matrix of the estimation error, consider the following equations:

$$\tilde{x} = x - \hat{x} \qquad (29)$$

$$P = E[\tilde{x}\,\tilde{x}^{T}] \qquad (30)$$

Taking into account Eqs. (15), (24) and the Wiener-Hopf equation (21), we have from Eq. (30):

$$P = C_x - K\,H\,C_x \qquad (31)$$

Substituting Eq. (27) into Eq. (31), we receive the covariance matrix of the first-form estimate (28):

$$P = C_x - C_x\,H^{T}\,(H\,C_x\,H^{T} + C_v)^{-1}\,H\,C_x \qquad (32)$$
For the case when there are more measurements than unknowns ($m > n$), Eq. (26) can be transformed to the following form [13]:

$$K = (C_x^{-1} + H^{T}\,C_v^{-1}\,H)^{-1}\,H^{T}\,C_v^{-1} \qquad (33)$$

Here the expression in parentheses is an $n \times n$ matrix. Using the matrix $K$ obtained from Eq. (33), the second form of the linear unbiased estimate with a minimum value of the mean-squared estimation error is:

$$\hat{x} = (C_x^{-1} + H^{T}\,C_v^{-1}\,H)^{-1}\,H^{T}\,C_v^{-1}\,z \qquad (34)$$

According to Eq. (31), the covariance matrix of the second-form estimate (34) is given by:

$$P = (C_x^{-1} + H^{T}\,C_v^{-1}\,H)^{-1} \qquad (35)$$

In the estimate (34), if $C_x^{-1} \rightarrow 0$ (that is, nothing is known a priori about $x$) and the rank of the matrix $H$ is $n$ ($m \geq n$), then we have the Gauss-Markov estimate [13]:

$$\hat{x} = (H^{T}\,C_v^{-1}\,H)^{-1}\,H^{T}\,C_v^{-1}\,z \qquad (36)$$

According to Eq. (35), the covariance matrix of the Gauss-Markov estimate is given by:

$$P = (H^{T}\,C_v^{-1}\,H)^{-1} \qquad (37)$$
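The equivalence of the first form (28) and the second form (34), which follows from the matrix inversion lemma, can be checked numerically. A minimal sketch with arbitrary assumed values of $C_x$, $C_v$, $H$ (all hypothetical):

```python
import numpy as np

rng = np.random.default_rng(1)

n, m = 2, 3                         # unknowns and measurements (m > n)
H = rng.standard_normal((m, n))     # measurement matrix of Eq. (12)
Cx = np.diag([2.0, 0.5])            # covariance of x
Cv = 0.1 * np.eye(m)                # covariance of v
z = rng.standard_normal(m)          # a measurement vector

# First form, Eqs. (27)-(28): requires inverting an m x m matrix
K1 = Cx @ H.T @ np.linalg.inv(H @ Cx @ H.T + Cv)
x1 = K1 @ z

# Second form, Eqs. (33)-(34): requires inverting an n x n matrix
M = np.linalg.inv(np.linalg.inv(Cx) + H.T @ np.linalg.inv(Cv) @ H)
x2 = M @ H.T @ np.linalg.inv(Cv) @ z

assert np.allclose(x1, x2)          # both forms give the same estimate

# The error covariances, Eqs. (32) and (35), also coincide
P1 = Cx - Cx @ H.T @ np.linalg.inv(H @ Cx @ H.T + Cv) @ H @ Cx
assert np.allclose(P1, M)
```

The first form inverts an $m \times m$ matrix and the second an $n \times n$ matrix, which is why the text recommends Eq. (27) when $m < n$ and Eq. (33) when $m > n$.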

### 5. Deriving the Equations of the Kalman Filter

The Kalman filter operates in a predict-correct manner [5].

#### 5.1. Prediction

At the initial observation moment $k = 0$, according to Eqs. (2), (6) the following measurement is obtained:

$$z(0) = H(0)\,x(0) + v(0), \quad E[x(0)] = 0, \quad E[x(0)\,x^{T}(0)] = P(0) \qquad (38)$$

Comparing Eq. (38) with Eqs. (12), (13) and substituting the corresponding quantities into Eq. (28) for a linear unbiased estimate with a minimum mean-squared error, we have:

$$\hat{x}(0|0) = K(0)\,z(0) \qquad (39)$$

where

$$K(0) = P(0)\,H^{T}(0)\,[H(0)\,P(0)\,H^{T}(0) + R(0)]^{-1} \qquad (40)$$

The error-covariance matrix according to Eq. (32) is given by:

$$P(0|0) = P(0) - K(0)\,H(0)\,P(0) \qquad (41)$$
The solution for all subsequent moments of time is obtained by moving from $k$ to $k+1$. With this aim, consider the discrete Wiener-Hopf equation (21), which is the necessary and sufficient condition that the estimate have a minimum mean-squared error. Rewritten for $\hat{x}(k|k)$, Eq. (21) takes the shorter (orthogonality) form:

$$E[\tilde{x}(k|k)\,z^{T}(j)] = 0, \quad j = 0, 1, \ldots, k \qquad (42)$$

Suppose the observations $z(0), \ldots, z(k)$ are already done and the estimate $\hat{x}(k|k)$ with the minimum of the mean-squared error is obtained. The latter means that Eq. (42) is satisfied, i.e.:

$$E\{[x(k) - \hat{x}(k|k)]\,z^{T}(j)\} = 0, \quad j = 0, 1, \ldots, k \qquad (43)$$

We point out that this Eq. (43) is already satisfied.
Suppose the quantities $\hat{x}(k|k)$ and $P(k|k)$ at the time $k$ and the input $u(k)$ are known. According to Eq. (1), let us find the predicted value $\hat{x}(k+1|k)$ of $x(k+1)$; here the uncertainties and disturbances $w(k)$ (with zero expectation) are not taken into account:

$$\hat{x}(k+1|k) = \Phi(k)\,\hat{x}(k|k) + G(k)\,u(k) \qquad \text{(I)}$$

This is the first equation of the Kalman filtering procedure.

Let us show that this prediction is optimal. According to Eq. (42), we require:

$$E\{[x(k+1) - \hat{x}(k+1|k)]\,z^{T}(j)\} = 0, \quad j = 0, 1, \ldots, k \qquad (44)$$

Here in Eq. (44) replace $x(k+1)$ with its expression from Eq. (1); the disturbance $w(k)$ need not be taken into account because it is not correlated with $z(j)$, $j \leq k$. We have the following equation:

$$\Phi(k)\,E\{[x(k) - \hat{x}(k|k)]\,z^{T}(j)\} = 0, \quad j = 0, 1, \ldots, k \qquad (45)$$

From Eq. (45), which holds by virtue of Eq. (43), we can see that the optimal prediction corresponds to Eq. (I).
The prediction error is equal to:

$$\tilde{x}(k+1|k) = x(k+1) - \hat{x}(k+1|k) = \Phi(k)\,\tilde{x}(k|k) + w(k) \qquad (46)$$

The error-covariance matrix of the prediction is given by:

$$P(k+1|k) = E[\tilde{x}(k+1|k)\,\tilde{x}^{T}(k+1|k)] = \Phi(k)\,P(k|k)\,\Phi^{T}(k) + \Phi(k)\,E[\tilde{x}(k|k)\,w^{T}(k)] + E[w(k)\,\tilde{x}^{T}(k|k)]\,\Phi^{T}(k) + Q(k);$$

however, the noises $w(k)$ are not correlated with the estimation errors $\tilde{x}(k|k)$, so

$$P(k+1|k) = \Phi(k)\,P(k|k)\,\Phi^{T}(k) + Q(k) \qquad \text{(II)}$$

This is the second equation of the Kalman filtering procedure. With this, the prediction is done.

#### 5.2. Correction

The estimate of the predicted state vector at the time instant $k+1$, $\hat{x}(k+1|k)$, obtained with the available measurements $z(0), \ldots, z(k)$ (Eq. (I)), must be corrected after the next measurement $z(k+1)$ to the value $\hat{x}(k+1|k+1)$.

In order to find $\hat{x}(k+1|k+1)$, let us regard the prediction itself as a measurement of $x(k+1)$: replacing in Eq. (I) $\hat{x}(k|k)$ with $x(k) - \tilde{x}(k|k)$ and substituting, instead of $\Phi(k)\,x(k) + G(k)\,u(k)$, the corresponding expression from Eq. (1):

$$\hat{x}(k+1|k) = x(k+1) - \Phi(k)\,\tilde{x}(k|k) - w(k) = x(k+1) - \tilde{x}(k+1|k) \qquad (47)$$

Taking into account Eq. (2), we can write:

$$\begin{bmatrix} \hat{x}(k+1|k) \\ z(k+1) \end{bmatrix} = \begin{bmatrix} I \\ H(k+1) \end{bmatrix} x(k+1) + \begin{bmatrix} -\tilde{x}(k+1|k) \\ v(k+1) \end{bmatrix} \qquad (48)$$

or

$$z' = H'\,x(k+1) + v' \qquad (49)$$

Here the quantities $z'$, $H'$, $v'$ are defined by comparing Eqs. (48), (49). The covariance matrix of the upper part of the vector $v'$ was earlier denoted as $P(k+1|k)$; the covariance matrix of the lower part is equal to $R(k+1)$. The covariances between $\tilde{x}(k+1|k)$ and $v(k+1)$ and between $w(k)$ and $v(k+1)$ are equal to zero. So, we have:

$$C_{v'} = E[v'\,v'^{T}] = \begin{bmatrix} P(k+1|k) & 0 \\ 0 & R(k+1) \end{bmatrix} \qquad (50)$$
Since the measurement vector $z'$ contains the component $z(k+1)$, we can calculate the Gauss-Markov estimate at the time instant $k+1$ according to Eqs. (36), (37), i.e.:

$$\hat{x}(k+1|k+1) = P(k+1|k+1)\,H'^{T}\,C_{v'}^{-1}\,z' \qquad (51)$$

where

$$P(k+1|k+1) = (H'^{T}\,C_{v'}^{-1}\,H')^{-1} \qquad (52)$$

Here

$$H'^{T}\,C_{v'}^{-1} = \big[\,P^{-1}(k+1|k) \quad H^{T}(k+1)\,R^{-1}(k+1)\,\big] \qquad (53)$$

$$H'^{T}\,C_{v'}^{-1}\,H' = P^{-1}(k+1|k) + H^{T}(k+1)\,R^{-1}(k+1)\,H(k+1) \qquad (54)$$

$$H'^{T}\,C_{v'}^{-1}\,z' = P^{-1}(k+1|k)\,\hat{x}(k+1|k) + H^{T}(k+1)\,R^{-1}(k+1)\,z(k+1) \qquad (55)$$

Substituting the expression (55) into Eq. (51) of the Gauss-Markov estimate:

$$\hat{x}(k+1|k+1) = P(k+1|k+1)\,P^{-1}(k+1|k)\,\hat{x}(k+1|k) + P(k+1|k+1)\,H^{T}(k+1)\,R^{-1}(k+1)\,z(k+1) \qquad (56)$$
In Eq. (56) the factor before $z(k+1)$ is the gain matrix $K(k+1)$:

$$K(k+1) = P(k+1|k+1)\,H^{T}(k+1)\,R^{-1}(k+1) \qquad (57)$$

Using Eq. (54) for $H'^{T}\,C_{v'}^{-1}\,H'$ and taking into account Eq. (52), we receive:

$$P^{-1}(k+1|k+1) = P^{-1}(k+1|k) + H^{T}(k+1)\,R^{-1}(k+1)\,H(k+1) \qquad (58)$$

Multiplying Eq. (58) on the left by $P(k+1|k+1)$ and taking into account Eq. (57), we receive:

$$I = P(k+1|k+1)\,P^{-1}(k+1|k) + K(k+1)\,H(k+1) \qquad (59)$$

Finding from Eq. (59) the expression for $P(k+1|k+1)\,P^{-1}(k+1|k)$ and substituting it into Eq. (56):

$$\hat{x}(k+1|k+1) = \hat{x}(k+1|k) + K(k+1)\,[z(k+1) - H(k+1)\,\hat{x}(k+1|k)] \qquad \text{(III)}$$

This is the third equation of the Kalman filter. It corresponds to the observer model of the observed object (system), with feedback equal to the weighted difference of the measured and predicted output signals [17,18].
Multiplying Eq. (59) on the right by $P(k+1|k)$:

$$P(k+1|k) = P(k+1|k+1) + K(k+1)\,H(k+1)\,P(k+1|k),$$

and from this equation we can receive the error-covariance matrix:

$$P(k+1|k+1) = [I - K(k+1)\,H(k+1)]\,P(k+1|k) \qquad \text{(IV)}$$

This is the fourth equation of the Kalman filter. Multiplying Eq. (IV) on the right by $H^{T}(k+1)\,R^{-1}(k+1)$, so that by Eq. (57) the left-hand side becomes $K(k+1)$, and solving the resulting equation for $K(k+1)$, we can find the gain matrix:

$$K(k+1) = P(k+1|k)\,H^{T}(k+1)\,[H(k+1)\,P(k+1|k)\,H^{T}(k+1) + R(k+1)]^{-1} \qquad \text{(V)}$$

This is the fifth equation of the Kalman filter.
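Equations (I)-(V) can be collected into a prediction routine and a correction routine. The following is a minimal NumPy sketch; the function names `kf_predict` and `kf_correct` are illustrative, not a library API.

```python
import numpy as np

def kf_predict(x_hat, P, Phi, G, u, Q):
    """Prediction stage: Eqs. (I) and (II)."""
    x_pred = Phi @ x_hat + G @ u                 # Eq. (I)
    P_pred = Phi @ P @ Phi.T + Q                 # Eq. (II)
    return x_pred, P_pred

def kf_correct(x_pred, P_pred, z, H, R):
    """Correction stage: Eqs. (V), (III), (IV)."""
    S = H @ P_pred @ H.T + R                     # innovation covariance
    K = P_pred @ H.T @ np.linalg.inv(S)          # Eq. (V)
    x_hat = x_pred + K @ (z - H @ x_pred)        # Eq. (III)
    P = (np.eye(len(x_pred)) - K @ H) @ P_pred   # Eq. (IV)
    return x_hat, P, K
```

At each step the filter calls `kf_predict` with $\hat{x}(k|k)$, $P(k|k)$, then `kf_correct` with the new measurement $z(k+1)$.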

### 6. The Theorem of the Kalman Filter

Eqs. (39)-(41) and (I)-(V) together make up the Kalman filter, which is usually formulated as a theorem. Let us present the formulation of the theorem of the Kalman filter satisfying the minimum mean-squared error of estimation.
Theorem (The Kalman Filter). Let a discrete stochastic system defined by Eqs. (1)-(7) be given and considered at the time instants $k = 0, 1, 2, \ldots$. The linear unbiased estimate with the minimum mean-squared error in the estimation of the state vector of this system at any time instant $k$ is obtained by the recursive equations (I)-(V), the initial state of which at $k = 0$ is determined by Eqs. (39)-(41).
In addition to the derivation considered above, let us check the correctness of Eqs. (IV), (V). With this aim, write the condition (42) for the estimate $\hat{x}(k+1|k+1)$:

$$E[\tilde{x}(k+1|k+1)\,z^{T}(j)] = 0, \quad j = 0, 1, \ldots, k+1 \qquad (60)$$

In this Eq. (60) the estimation error, according to Eq. (III) and taking into account Eq. (2), is equal to:

$$\tilde{x}(k+1|k+1) = [I - K(k+1)\,H(k+1)]\,\tilde{x}(k+1|k) - K(k+1)\,v(k+1) \qquad (61)$$

To check the correctness of Eq. (IV), multiply both parts of Eq. (61) on the right by $\tilde{x}^{T}(k+1|k)$ and calculate the expectation; using Eq. (42) then allows us to obtain Eq. (IV).

To check the correctness of the derivation of Eq. (V), consider first the part of the requirement (60) containing the old observations $z(0)$ to $z(k)$. The new measurement error $v(k+1)$ is uncorrelated with these old observations, and the expectation of the product of the bracketed term $[I - K(k+1)\,H(k+1)]\,\tilde{x}(k+1|k)$ in (61) with them is zero according to Eq. (44). This means that the expression (III) satisfies the part of the requirement (60) with $j = 0, \ldots, k$. The remaining part of the requirement (60), with $j = k+1$, allows us to determine the still undefined gain matrix $K(k+1)$. On the basis of (61), the following equality must be valid:

$$E\big\{\big[(I - K(k+1)\,H(k+1))\,\tilde{x}(k+1|k) - K(k+1)\,v(k+1)\big]\,z^{T}(k+1)\big\} = 0 \qquad (62)$$

The quantities $\tilde{x}(k+1|k)$ and $v(k+1)$ are not correlated with each other, and $v(k+1)$ is not correlated with $x(k+1)$. The rest of the mathematical expectations can be represented in a simpler form: using Eq. (29) with respect to $x(k+1)$ and the orthogonality (42), the covariance between $\tilde{x}(k+1|k)$ and $x(k+1)$ can be replaced by $P(k+1|k)$. As a result, we have:

$$[I - K(k+1)\,H(k+1)]\,P(k+1|k)\,H^{T}(k+1) - K(k+1)\,R(k+1) = 0 \qquad (63)$$

Solving this Eq. (63) with respect to $K(k+1)$, we receive Eq. (V).
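The identities used in this check can be verified numerically: with the gain (V), the orthogonality condition (63) holds exactly, and the covariance (IV) agrees with the information form (58). A sketch with arbitrary assumed positive-definite matrices $P(k+1|k)$ and $R(k+1)$:

```python
import numpy as np

rng = np.random.default_rng(2)

# Assumed arbitrary values for a single filter step
n, m = 3, 2
A = rng.standard_normal((n, n))
Pm = A @ A.T + np.eye(n)                  # P(k+1|k), positive definite
H = rng.standard_normal((m, n))           # H(k+1)
B = rng.standard_normal((m, m))
R = B @ B.T + np.eye(m)                   # R(k+1), positive definite

# Gain from Eq. (V)
K = Pm @ H.T @ np.linalg.inv(H @ Pm @ H.T + R)

# Orthogonality condition (63): (I - K H) P(k+1|k) H^T - K R = 0
residual = (np.eye(n) - K @ H) @ Pm @ H.T - K @ R
assert np.allclose(residual, 0)

# Posterior covariance (IV) agrees with the information form (58)
P_post = (np.eye(n) - K @ H) @ Pm
P_info = np.linalg.inv(np.linalg.inv(Pm) + H.T @ np.linalg.inv(R) @ H)
assert np.allclose(P_post, P_info)
```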

### 7. The Algorithm of Using the Equations of the Kalman Filter

The Kalman filter is a recursive procedure that is convenient for program realization on computers. The algorithm for using Eqs. (39)-(41) and (I)-(V) of the KF is the following:
1) At the initial state $k = 0$, the initial estimate of the state vector and the initial error-covariance matrix are built according to Eqs. (39)-(41):

$$\hat{x}(0|0) = K(0)\,z(0),$$

where

$$K(0) = P(0)\,H^{T}(0)\,[H(0)\,P(0)\,H^{T}(0) + R(0)]^{-1}$$

and

$$P(0|0) = P(0) - K(0)\,H(0)\,P(0).$$

2) The estimate and its error-covariance matrix are extrapolated to the next observation time instant $k+1$ according to Eqs. (I), (II):

$$\hat{x}(k+1|k) = \Phi(k)\,\hat{x}(k|k) + G(k)\,u(k), \qquad P(k+1|k) = \Phi(k)\,P(k|k)\,\Phi^{T}(k) + Q(k).$$

Correction:
3) The optimal gain matrix is calculated according to Eq. (V), and the extrapolated (predicted) estimate is improved to the value $\hat{x}(k+1|k+1)$ according to Eq. (III) using the new measurement $z(k+1)$:

$$K(k+1) = P(k+1|k)\,H^{T}(k+1)\,[H(k+1)\,P(k+1|k)\,H^{T}(k+1) + R(k+1)]^{-1},$$

$$\hat{x}(k+1|k+1) = \hat{x}(k+1|k) + K(k+1)\,[z(k+1) - \hat{z}(k+1|k)],$$

where $z(k+1) - \hat{z}(k+1|k)$ is called the innovation process, and $\hat{z}(k+1|k) = H(k+1)\,\hat{x}(k+1|k)$ is called the predicted value of the new measurement.
4) The error-covariance matrix of the new modified estimate is calculated according to Eq. (IV):

$$P(k+1|k+1) = [I - K(k+1)\,H(k+1)]\,P(k+1|k).$$

5) If the next measurement $z(k+2)$ is available, then the current time instant $k+1$ should be considered as $k$: the estimate of the state calculated at step 3 is now considered as $\hat{x}(k|k)$, the error-covariance matrix calculated at step 4 is now considered as $P(k|k)$, and steps 2, 3 and 4 of the algorithm are carried out again. If there are no further measurements, the procedure is ended.
Therefore, the best estimate of $x(k+1)$ using all observations up to and including $z(k+1)$ is obtained by a predictor step, Eqs. (I)-(II), and a corrector step, Eqs. (III)-(V). The predictor step uses information from the state equation (1). The corrector step uses the new measurement available at $k+1$. The correction is the error (difference) between the new measurement, $z(k+1)$, and its best predicted value, $H(k+1)\,\hat{x}(k+1|k)$, multiplied by the weighting (or gain) factor $K(k+1)$. The factor $K(k+1)$ determines how much we alter the best estimate on the basis of the new observation: 1) if the elements of $K(k+1)$ are small, we have considerable confidence in our model, and 2) if they are large, we have considerable confidence in our measurements. Thus, the KF is a dynamical feedback system; its gain matrix and the predicted- and filtering-error covariance matrices comprise a matrix feedback system operating within the KF [5,6].
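The steps 1)-5) above can be sketched end to end for a hypothetical scalar system with $\Phi = 1$, $H = 1$ and no input (a random walk observed in noise); all numerical values are assumptions chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(3)

# Hypothetical scalar system: x(k+1) = x(k) + w(k), z(k) = x(k) + v(k)
Phi, H, Q, R, P0 = 1.0, 1.0, 0.01, 0.25, 1.0

# Simulate the true state and measurements, Eqs. (1)-(2)
x_true, zs = 0.0, []
for k in range(100):
    zs.append(H * x_true + rng.normal(0.0, np.sqrt(R)))
    x_true = Phi * x_true + rng.normal(0.0, np.sqrt(Q))

# Step 1: initialization, Eqs. (39)-(41)
K0 = P0 * H / (H * P0 * H + R)
x_hat = K0 * zs[0]
P = P0 - K0 * H * P0

# Steps 2-5: recursive prediction and correction
for z in zs[1:]:
    x_pred = Phi * x_hat                    # Eq. (I), no input term
    P_pred = Phi * P * Phi + Q              # Eq. (II)
    K = P_pred * H / (H * P_pred * H + R)   # Eq. (V)
    x_hat = x_pred + K * (z - H * x_pred)   # Eq. (III), innovation in parentheses
    P = (1.0 - K * H) * P_pred              # Eq. (IV)

# The filtered error variance P settles below the raw measurement variance R
assert P < R
```

Because the covariance recursion of Eqs. (II), (IV), (V) does not depend on the measurement values, $P$ converges to a steady-state value (about 0.045 here), well below the measurement variance $R = 0.25$.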

### 8. Conclusions

The discrete Kalman filter, developed by R. Kalman back in 1960 [19], is now a classic result of the theory of control systems and the theory of signal processing, as well as the most popular filtering algorithm used in automatic control systems, telecommunications, and information systems subject to random disturbances and measurement inaccuracies. The Kalman filter is a recursive procedure consisting of difference vector-matrix equations for calculating estimates of the state of a stochastic system, the estimates of the error-covariance matrices, and the filter gain. A common approach to the derivation of the KF equations is the Bayesian approach [11]. This paper has described the simplest way to obtain the KF equations, based on the procedure of minimizing the mean-squared error of estimation, which is a further generalization of the least squares method [12]. As a result of this procedure, we obtained a discrete analog of the Wiener-Hopf equation, as well as Gaussian and Gauss-Markov estimates (and their error-covariance matrices), which are linear, unbiased, and attain the minimum value of the mean-squared error of estimation. Based on the discrete Wiener-Hopf equation and the Gaussian and Gauss-Markov estimates, the KF equations were obtained using simple algebraic transformations and reasoning. The KF theorem, which satisfies the minimum of the mean-squared error of estimation, was formulated, and the algorithm for using the KF equations, which is convenient for program realization, was also explained.

### References

[1] P. R. Kumar, P. Varaiya, Stochastic Systems: Estimation, Identification and Adaptive Control, New Jersey, Prentice Hall, 1986.
[2] K. J. Åström, R. M. Murray, Feedback Systems: An Introduction for Scientists and Engineers, Princeton, Princeton University Press, 2008.
[3] M. S. Grewal, A. P. Andrews, Kalman Filtering: Theory and Practice Using MATLAB, 4th ed., New Jersey, John Wiley & Sons, 2015.
[4] G. Welch, G. Bishop, An Introduction to the Kalman Filter, UNC-Chapel Hill, TR 95-041, July 24, 2006.
[5] J. M. Mendel, Lessons in Estimation Theory for Signal Processing, Communications, and Control, New Jersey, Prentice Hall, 1995.
[6] G. M. Siouris, An Engineering Approach to Optimal Control and Estimation Theory, New York, John Wiley & Sons, 1996.
[7] L. Ljung, System Identification: Theory for the User, 2nd ed., New Jersey, Prentice Hall, 1999.
[8] K. J. Keesman, System Identification: An Introduction, London, Springer-Verlag, 2011.
[9] R. G. Brown, P. Y. C. Hwang, Introduction to Random Signals and Applied Kalman Filtering: with MATLAB Exercises, 4th ed., New Jersey, John Wiley & Sons, 2012.
[10] L. Ljung, System Identification Toolbox™ User's Guide, Natick, The MathWorks, Inc., 2015.
[11] A. Barreto, M. Adjouadi, Intuitive Understanding of Kalman Filtering with MATLAB, CRC Press, 2021.
[12] P. Eykhoff, Bases of Identification of Control Systems: Parameter and State Estimation, Moscow, Mir, 1975. (In Russ.)
[13] K. Brammer, G. Siffling, Kalman-Bucy Filters: Deterministic Observation and Stochastic Filtering (translated from German), Moscow, Nauka, 1984. (In Russ.)
[14] V. Kordic (editor), Kalman Filter, Vukovar, Croatia, InTech, 2010.
[15] B. P. Gibbs, Advanced Kalman Filtering, Least-Squares and Modeling, New Jersey, John Wiley & Sons, 2011.
[16] C. K. Chui, G. Chen, Kalman Filtering with Real-Time Applications, 5th ed., Springer International Publishing, 2017.
[17] R. C. Dorf, R. H. Bishop, Modern Control Systems, 12th ed., New Jersey, Prentice Hall, 2011.
[18] K. Ogata, Modern Control Engineering, 3rd ed., New Jersey, Prentice Hall, 1997.
[19] R. E. Kalman, A new approach to linear filtering and prediction problems, Transactions of the ASME – Journal of Basic Engineering, 82(1), pp. 35-45, March 1960.