Applied Mathematics
p-ISSN: 2163-1409 e-ISSN: 2163-1425
2013; 3(3): 98-106
doi:10.5923/j.am.20130303.03
Gurprit Grover, Ravi Vajala, Manoj Kumar Varshney
Department of Statistics, University of Delhi, Delhi, 110007, India
Correspondence to: Ravi Vajala, Department of Statistics, University of Delhi, Delhi, 110007, India.
| Email: | ![]() |
Copyright © 2012 Scientific & Academic Publishing. All Rights Reserved.
This paper models the HIV population through a Poisson distribution and obtains the expressions for the estimators of the average number of HIV individuals (Incidence Rate of HIV). Conventional methods for obtaining such estimates have used the Maximum likelihood Principle that does not take into account, any prior information about the parameter. Bayesian perspective accommodates this missing link and hence obtains the estimators where data is refined using the prior information. Three different types of prior distributions including Jeffreys non-informative priors have been considered and the corresponding estimates along with standard errors have been obtained assuming a squared error loss function. However, computational techniques like Markov Chain Monte Carlo (MCMC) have been avoided by using the Empirical Bayes Perspective. These procedures were applied on the state and year-wise data of HIV patients in India and relevant estimates are obtained and compared with actual figures. When year is considered as random variable, M.L.E proved to be better than the Bayes estimates but vice-versa is seen when states were considered as a random variable.
Keywords: Poisson Model, Human Immuno Deficiency Virus (HIV), Informative and Non-Informative Priors, Bayes Estimators, Empirical Bayes Methods
Cite this paper: Gurprit Grover, Ravi Vajala, Manoj Kumar Varshney, On the Estimation of Average HIV Population Using Various Bayesian Techniques, Applied Mathematics, Vol. 3 No. 3, 2013, pp. 98-106. doi: 10.5923/j.am.20130303.03.
![]() | (1) |
or a weighted mean of observations for a sample x1, …, xn of n observations. Moreover, despite satisfying the properties of a good estimator asymptotically, the Maximum likelihood estimator (M.L.E) fails to take into account any additional information available on the parameter λ prior to taking the sample. This additional information may be incorporated into the estimation process by the so-called prior distribution and hence Bayesian approach may be used to evolve a much more refined estimator. Bayesian methods do not require large samples or asymptotics for their validity. They allow for incorporation of expert knowledge through the specification of prior distribution.Classical M.L.E approach to the problem of estimation relies on an estimator which is obtained theoretically and remains the same for whatever may be the data set. However, Bayesian approach obtains a separate set of estimators for every set of prior information and adjusts these estimators for changes in the data set. Such an estimator provides a logical alternative because it not only incorporates the additional information on the parameter, but also relies on the data to a great extent.![]() | (2) |
does not equal zero.When substantial information about the average HIV cases is available, we may look at the (Natural) Conjugate Priors wherein the functional form of the prior and posterior remains same and when no information is available, we may consider the Non-informative Prior (Jeffreys)[10].The subsequent sections develop theory for modeling the HIV incidence λ, using various prior distributions in the population. This prior information is refined to posterior distribution by means of additional information provided by the data and estimates of λ are obtained from the posterior distribution. The estimates are obtained so as to provide minimum risk (which is expected loss) with respect to the posterior distribution. Of course, there is no consensus opinion on defining the loss, although the Quadratic loss is popularly used and found to be sufficient in majority of the situations.
The posterior mean of the distribution is
which provides an estimate of λ with variance
The problem in finding the estimator of intensity λ is that it is based on the data as well as the parameters of the prior distribution α and β (known as hyperparameters). Authors[16, 18] of related studies have, on the basis of their past knowledge, judgment or intuition taken various predetermined values for these hyperparameters and obtained the estimate of the average HIV cases. These estimators were further studied for robustness with respect to the prior parameters. However, we believe that the information about the parameters of interest lies in the data itself and hence the hyperparameters have been estimated using the Empirical Bayesian Procedure[17].Let
denote the conditional mean and variance of the random variable X which denotes the HIV cases in the population. Let
denote the marginal mean and variance of these HIV cases. Assuming that these quantities exist, we have ![]() | (3) |
![]() | (4) |
then,
Therefore the estimates of the hyperparameters when the prior distribution of λ is Gamma(α, β) are obtained as
are the sample mean and variance respectively. These may in turn be used to find the estimate of HIV incidence rate along with its standard error.
is not available and the intention is to use the available clinical data to determine the parameter, Harold Jeffreys[10] approach may be used to obtain the following non-subjective reference prior in terms of the Fisher’s Information matrix:![]() | (5) |
is the Fisher’s Information matrix.Therefore, the prior distribution for the HIV incidence rate (λ) according to Jeffrey’s rule may be taken as
Using the Bayes rule, the posterior distribution is obtained as (λ|x1, …, xn ) ~ Gamma(
). The posterior mean of the distribution is
which provides an estimate of λ with variance
The situation of no prior information about the Incidence rate λ, may be also modeled through the improper prior,
where 0 ≤ λ < ∞ for which the posterior distribution is given by (λ|x1, …, xn ) ~ Gamma
The estimate of the HIV incidence rate is then
with variance
Also, in case of large sample sizes, Brenner et al.[4], Fraser and McDunnough[8] gives the asymptotic posterior distribution for the HIV incidence parameter
on assuming non informative prior.If
is the M.L.E of
and prior density of
is non-informative (or likelihood dominates the prior density), then the posterior density of
is given by
| x1, …, xn ~ Normal
where L(
) is the logarithm of likelihood of (
| x).Using this result, we obtain the posterior distribution of the HIV incidence as ![]() | (6) |
with variance
which is the Maximum Likelihood Estimator of
.The objective of using these non-informative priors is to highlight the 'weak' or 'negligible' information over 'no' prior information as assumed while obtaining M.L.E. Also, this paper assumes an informative prior modeled by Gamma distribution in comparison with weak (Jeffreys), negligible (Improper) and no (M.L.E) prior information. The consideration for these non-informative priors is to bridge the gap between substantial amount of information as compared to no information. Various other informative priors could also have been considered for this purpose but then, these could also be obtained by giving suitable values to the parameters of Gamma distribution being one from the exponential class of distributions. In such situations, the solution would simply be obtained by doing a grid search for the best values of hyperparameters that will yield minimum standard errors of the estimators. However, doing so would dilute the concept of Empirical Bayes procedure which recommends for the estimation of hyperparameters from the sample itself.
|
|
![]() | Figure 1. Bayes estimates of the average HIV persons in various states/UT’s of India |
![]() | Figure 2. Estimated Standard Error for the Bayes estimates of the average HIV persons in various states/UT’s of India |
![]() | Figure 3. Bayes estimates of the average HIV persons in India for the years 2002-2011 |
![]() | Figure 4. Estimated Standard Error for the Bayes estimates of the average HIV persons in India for the years 2002-2011 |
|
![]() | Figure 5. Estimated % HIV with respect to Area(sq. Km) and Population |