International Journal of Statistics and Applications
p-ISSN: 2168-5193 e-ISSN: 2168-5215
2015; 5(6): 293-301
doi:10.5923/j.statistics.20150506.05

Poti Abaja Owili1, Luke Orawo2, Dankit Nassiuma3
1Mathematics and Computer Science Department, Laikipia University, Nyahururu, Kenya
2Mathematics Department, Egerton University, Nakuru, Private Bag, Egerton-Njoro, Kenya
3Mathematics Department, Africa International University, Nairobi, Kenya
Correspondence to: Poti Abaja Owili, Mathematics and Computer Science Department, Laikipia University, Nyahururu, Kenya.
| Email: | ![]() |
Copyright © 2015 Scientific & Academic Publishing. All Rights Reserved.
This work is licensed under the Creative Commons Attribution International License (CC BY).
http://creativecommons.org/licenses/by/4.0/

In this study optimal linear estimates of missing values for pure bilinear time series models whose innovations have a student-t distribution are derived by minimizing the h-steps-ahead dispersion error. Data used in the study was simulated using the R-software where 100 samples of size 500 were generated for simple bilinear models. In each sample, three data positions 48, 293 and 496 were selected at random and artificial missing values created at these points. For comparison purposes, artificial neural network (ANN) and exponential smoothing (EXP) estimates were also computed. The performance criteria used to ascertain the efficiency of these estimates were the mean absolute deviation (MAD) and mean squared error (MSE). The study found that optimal linear estimates were the most efficient for estimating missing values of the pure bilinear time series followed by exponential smoothing estimates. Further, these estimates were equivalent to one-step-ahead forecast. The study recommends the use of optimal linear estimate for estimating missing values in pure bilinear time series data whose innovations have student-t distribution.
Keywords: ANN, Exponential smoothing, MAD, Performance criterion, Simulation
Cite this paper: Poti Abaja Owili, Luke Orawo, Dankit Nassiuma, Estimation of Missing Values for Pure Bilinear Time Series Models with Student-t Innovations, International Journal of Statistics and Applications, Vol. 5 No. 6, 2015, pp. 293-301. doi: 10.5923/j.statistics.20150506.05.
is said to be a bilinear time series model of order BL (p, q, P, Q) if it satisfies the difference equation![]() | (1) |
are constants; the innovation sequence
are i.i.d random process which has a student-t distribution and
=1. For pure bilinear time series, only the bilinear coefficient is not equal to zero. Thus the pure bilinear time series model, BL(0,0,P,Q) is given by ![]() | (2) |
are the coefficients of the model, and
are i.i.d student-t process with zero mean and common finite variance. Some important properties of this model such as invertibility and stationarity have been studied. Invertibility conditions for some particular stationary bilinear models has been derived as in [3] and [4]. [5] has established the conditions under which the general bilinear model is invertible and a condition of invertibility of model (2) can be derived from this result. This condition is expressed as
where
is the correlation coefficient. The notion of invertibility is very useful for statistical applications, such as the prediction of
given its past.A stationary bilinear model can be expressed in a kind of moving average with infinite order as in [6]. This enhances its application in making inferences. Bilinear models have the property that although they involve only a finite number of parameters, they can approximate with arbitrary accuracy ‘reasonable’ nonlinearity. [7] showed that with a large bilinear coefficient bij, a bilinear model can have sudden large amplitude bursts and is suitable for some kind of seismological data such as earthquakes, underground nuclear explosions. The variant of the bilinear process is time dependent. This feature enables bilinear process to be used also for financial data as in [8].Researchers have achieved forecast improvement with simple nonlinear time series models. [9] reported a forecast improvement with bilinear models in forecasting stock prices. The statistical properties of such models have been analyzed in detail as in [4], [10], [11] and [12].Several techniques have been used in estimating missing values. Most of them are nonparametric and includes use of artificial neural network as in [13]; [14]; [15] and exponential smoothing as in [16]. The techniques used for estimating missing values do not consider the distribution of the innovation sequence. They also don not consider nonlinear models. Therefore in this study estimates of missing values for nonlinear bilinear time series models with student-t distribution are derived using an optimal linear interpolation technique based on minimizing the h-steps-ahead dispersion error.
missing out of a set of an arbitrarily large number of n possible observations generated from a time series process
. Let the subspace
be the allowable space of estimators of
based on the observed values
i.e.,
where n, the sample size, is assumed large. The projection of
onto
(denoted
) such that the dispersion error of the estimate (written disp
is a minimum would simply be a minimum dispersion linear interpolator. Direct computation of the projection
onto
is complicated since the subspaces
and
are not independent of each other. We thus consider evaluating the projection onto two disjoint subspaces of
To achieve this, we express
as a direct sum of the subspaces
and another subspace, say
such that
. A possible subspace is
, where
is based on the values
. The existence of the subspaces
is shown in the following lemma.
is a nondeterministic stationary process defined on the probability space
. Then the subspaces
defined in the norm of the
are such that
Proof:Suppose
can be represented as
where
. Clearly the two components on the right hand side of the equality are disjoint and independent and hence the result. The best linear estimator of
can be evaluated as the projection onto the subspaces
and
such that disp
is minimized. i.e.,
But
where the coefficients’ are estimated such that the dispersion error is minimized. The resulting error of the estimate is evaluated as
Now squaring both sides and taking expectations, we obtain the dispersion error as![]() | (1) |
and solving for
) we should obtain the coefficients
which are used in estimating the missing value. The missing value
is estimated as ![]() | (2) |
for the missing observation
is by given
where
is the estimate obtained from the model based on the previous lagged observations of the data before the point m, the missing data point and xm the missing value, the coefficients
are to be estimated by minimizing the dispersion error
given by equation (1).
The estimate of the missing value for this model is given by theorem 4.1.Theorem 4.1The optimal linear estimate for missing observation for BL (0, 0, 1,1) with student errors is given by
ProofThe stationary bilinear model, BL (0, 0, 1,1) is given by
and the h-steps ahead forecast is
Therefore the forecast error is![]() | (5) |
![]() | (6) |
Hence equation (6) can be simplified as
Now differentiating equation (4) with respect to
and equating to zero, we obtain\
Substituting the values of
in equation (1), we obtain best estimator of the missing value as
This shows that the missing value is a one-step-ahead forecast based on the past observations collected before the missing value. This is in agreement with other studies that have estimated missing values using forecasting as in [29] and [1].Theorem 4.2The optimal linear estimate for BL (0,0,2,1) with student–t distribution
ProofThe stationary BL(0,0,2,1) is given by
The h-steps ahead forecast error is expressed as
Therefore the forecast error is
or it can also be represented as
Substituting equation (6) in equation (1), we have![]() | (9) |
Hence equation (7) can be simplified as![]() | (10) |
and equating to zero, we obtain
Therefore the estimate of the missing value for the BL(0,0,2,1) is
![]() | Figure 1. BL(0,0,1,1) with t-distributed innovations |
![]() | Figure 2. BL(0,0,2,1).with t-istributed innovations |
![]() | Table 1. Efficiency Measures obtained for student-tBL (0,0,1,1) |
![]() | Table 2. Efficiency Measures obtained for student-BL(0,0,2,1) |
|