American Journal of Mathematics and Statistics
p-ISSN: 2162-948X e-ISSN: 2162-8475
2017; 7(2): 78-88
doi:10.5923/j.ajms.20170702.04

Richard Puurbalanta1, Atinuke O. Adebanji2
1University for Development Studies, Faculty of Mathematical Sciences, Department of Statistics, Navrongo Campus, Ghana
2Kwame Nkrumah University of Science and Technology, Faculty of Physical Sciences, College of Science, Department of Mathematics, Kumasi, Ghana
Correspondence to: Richard Puurbalanta, University for Development Studies, Faculty of Mathematical Sciences, Department of Statistics, Navrongo Campus, Ghana.
| Email: | ![]() |
Copyright © 2017 Scientific & Academic Publishing. All Rights Reserved.
This work is licensed under the Creative Commons Attribution International License (CC BY).
http://creativecommons.org/licenses/by/4.0/

The use of discrete spatial-statistical methods for poverty analyses is important, especially in light of the fact that living standards surveys are generally dominated by categorical observations made at several locations. The proximity of these observations imposes geographical structure on the data. This study presents ordinal geo-statistical models for household poverty analyses that recognize the ordinal nature of poverty severity. For all models, Bayesian inference via Markov Chain Monte Carlo (MCMC) was used. Precision of the models, understood in terms of ease of implementation and accuracy of estimation, is compared. The objective was to quantify spatial associations, given some household features, and produce a map of poverty-severity for Ghana. The Clipped Gaussian Spatial Ordinal Probit (CG-SOP) Model was identified as best for describing spatial poverty. Positive correlation with respect to the distribution of extreme poverty was observed. We see evidence of this in the map of predictions. Significant variables include household size, education, and residency of household head. This approach to poverty analysis is relevant for policy design and the implementation of cost-effective programmes to reduce (category and site)-specific poverty, and monitoring changes in both category and geographical trends thereof. Analysis was based on the Ghana living standards survey (GLSS) 2012 data.
Keywords: Household poverty-Severity, Latent Variables, Ordinal models,Bayesian inference, Markov Chain Monte Carlo, Prediction maps
Cite this paper: Richard Puurbalanta, Atinuke O. Adebanji, Bayesian Spatial Ordinal Models for Regional Household Poverty-Severity, American Journal of Mathematics and Statistics, Vol. 7 No. 2, 2017, pp. 78-88. doi: 10.5923/j.ajms.20170702.04.
![]() | (1) |
is the vector of
responses, and
constitutes the probability of observation
being in category
Here, the order of the
outcomes is not important. To derive the proportional odds cumulative OP GLM from (1), let the responses
be arranged in order of magnitude, and
be the corresponding thresholds associated with the ordering. Further let
be a Gaussian random variable assumed to be latent, and assigning values to
according to the regression function:![]() | (2) |
is
design matrix, and
is a
unknown vector of regression coefficients, and
is the
vector of independently and identically distributed
measurement errors:
Though the values of
cannot be directly observed, the rule that assigns
to
is that if
exceeds a given threshold, then, an observation falls in the
category. This culminates in cumulative multiple binary outcomes: ![]() | (3) |
is the vector of
response,
and
Clearly,
in our application, refers to the Gaussian expenditure line, and is asymptotic of the ordinal variable
when
Our objective is to predict the probability of a household falling in the
category given the observed covariates
This probability is determined by the values of the latent variable
and is given by ![]() | (4) |
is Gaussian, and
the outcome is a probit model, implying that the probability of falling in the
category is:![]() | (5) |
is the cumulative distribution function
for the normal
Thus, the likelihood function for the parameters is ![]() | (6) |
![]() | (7) |
is the data likelihood,
is the prior density of model parameters, and
is the integrated likelihood, with
being the parameter space. Posterior estimation is done by setting up the Gibbs sampler [14], which requires us to derive full conditionals for all parameters. We impose, following [12], non-informative priors on the regression coefficients. Then ![]() | (8) |
![]() | (9) |
![]() | (10) |
is the vector of observed ordinal outcomes, whiles
is latent,
is an indicator variable, leading to: ![]() | (11) |
under geo-statistical assumptions, then the data is defined as
where
and
respectively are observations and response levels. Let
be the vector of associated covariates. Intuitively then, ![]() | (12) |
are
-dimensional covariates,
a
vector of coefficients, and
a
spatially-dependent error. Let
,
, and
, then equation (12) is written in matrix form as: ![]() | (13) |
for
where
is a non-diagonal symmetric and positive definite matrix.![]() | (14) |
is an
design matrix,
is a
matrix of fixed effects coefficients, and
spatially-dependent errors that capture all unobserved errors arising from the influence of common features for observations within certain proximal distances. The
element of the
spatial covariance matrix
is
parameterized by ![]() | (15) |
represents variability of the spatial process, and
is a monotonic correlation function with a correlation decay parameter
measuring the strength of spatial dependence over the Euclidean distance
between locations
and
[22]. By maintaining the ordinal probit form, and assuming, following [22], that the spatial error term
also represents random measurement error in unexplained explanatory variables (the nugget), then ![]() | (16) |
![]() | (17) |
is an indicator function equalling 1 when
and 0 otherwise.![]() | (18) |
is the data vector,
is latent regression function assigning values to observable ordered data,
is distribution of the latent errors incorporated via data augmentation, and
are prior densities for the model parameters. The use of MCMC sampling, especially Gibbs techniques [20], requires us to derive full conditionals for all parameters. We assume following [22] non-informative priors for the regression coefficients and inverse gamma for the spatial parameters. Thus, the full conditionals of all parameters have the following functional forms: ![]() | (19) |
![]() | (20) |
![]() | (21) |
![]() | (22) |
![]() | (23) |
![]() | (24) |
by assuming that the spatially-dependent errors
are latent, but together with some covariates, are assigning values to
according to the multivariate regression function: ![]() | (25) |
element of
is
parameterized by
, where parameters are as previously defined. The link between
and
is seen by assuming that for a vector of thresholds
the ordinal random field,
is obtained by quantizing
at levels
thus, culminating in multiple binary outcomes: ![]() | (26) |
Our objective is to model the probability of an observation
at site
falling at the
category. This probability is determined by 
![]() | (27) |
is n-dimensional, and determined by the density of
Therefore, the likelihood of the data is the integral of an n-dimensional multivariate normal distribution taking values in the interval 
![]() | (28) |
![]() | (29) |
is an indicator variable, and
being the joint prior density, specifies uncertainty on the respective estimated parameters. We complete the specification by imposing non-information priors on the regression parameters and inverse gamma on the spatial parameters [16, 17].The full conditional posterior distributions of each parameter [20] have the following functional forms: ![]() | (30) |
![]() | (31) |
![]() | (32) |
![]() | (33) |
![]() | (34) |
to be the map of predictive ordered responses corresponding to poverty-severity risk at unsampled locations
The predictions are derived from the conditional probabilities that a new location falls in category
given the observed data. In ordered modelling, these conditional probabilities cannot be obtained via direct MCMC simulations; they are obtained by calculating
using MCMC integration and data augmentation, where
is a latent variable at new locations, being equal to
in both OP GLM and SGLMM, and
for the CG-SOP model.Following the Bayesian approach, the predictive posterior distributions
is augmented by
to give: ![]() | (35) |
![]() | (36) |
![]() | (37) |
and likelihood-ratio
statistic. In Bayesian analyses, predictive model selection is checked within the framework of Bayesian decision theory [26, 27, 20], where prediction accuracy is based on the Bayesian expected loss (BEL). The loss function
represented here by the additive loss function estimates the loss incurred in predicting
by
at location; assigning zero loss to correct predictions. For the multi-categorical application, the additive loss function in respect of
new locations is expressed as: ![]() | (38) |
penalizes the loss incurred for mis-predicting category
when the true category is
If
is set to 1, equation (38) translates to the well-known mis-prediction rate (MPR) (defined as the proportion of predictions that are incorrect), and optimal Bayes’ prediction is attained by choosing the value for
that minimizes (38), leading to the expected value of the loss function: ![]() | (39) |
that reports the highest estimated probability. In assessing the models, it is important to note the difference between the estimated BEL
(using the training data) and validation BEL
(using the validation data) for each model. A comparatively large difference shows a model is making incorrect predictions [26, 20]. For the ordered data, it is also possible for the model to suffer greater loss for predictions that are farther from the truth. Thus, we assign loss coefficients
and get the absolute mis-prediction rate (AMPR) defined as:![]() | (40) |
is the true value and
is the prediction. In principle, the AMPR accounts for the size of the mis-prediction. We apply all the fit measures (MPR, AMPR, and BEL) to determine the models’ ability to correctly forecast future values.![]() | Figure 1. Classical Semi-Variogram |
|
to be 0.035, the CG-SOP model estimate of
is 0.041. These disparities are largely due to the differential parameterizations used for each model.We use the
estimate to determine the effective range of spatial correlation, commonly defined as the distance beyond which the correlation reduces to less than 5% of variance. For the exponential correlation function, the effective range is
for the SGLMM, and
for the CG-SOP model [20]. Thus, the estimated effective range for the response surface of the SGLMM is 66 kilometers, suggesting lower spatial variation compared to the CG-SOP model with an effective range of 73 kilometers. Though both models appear to exhibit large ranges relative to the maximum inter-site distance of
the CG-SOP model does better at larger lags (see Figures 2 and 3).![]() | Figure 2. Poverty-Severity Map of Ghana Using the SGLMM |
![]() | Figure 3. Poverty-Severity Map of Ghana using the CG-SOP Model |
estimate in all three models shows a positive link between household size and poverty severity, showing that poverty severity levels tend to increase from non-poor to extremely poor for sites with higher household sizes. Age was found not statistically significant in the OP GLM. Higher education was found to reduce the risk of extreme poverty in all three models. Extreme poverty-risk was also related to the residency of household head, with urbanites being at lower risks of extreme poverty. The impact employment has on poverty-risk in this study is typical; showing lowest risk for wage employed householders in the SGLMM and CG-SOP models. Its effect in the non-spatial OP GLM was however mixed. Unemployed was found not to be statistically significant in this model.Table 2 summarizes and compares results for predictive abilities whiles Table 3 does same for Bayesian expected loss (BEL) for each specification. Clearly, results of Table 2 identified the CG-SOP as the preferred model for prediction. Its predicted values come closest to the truth compared to the aspatial cumulative OP GLM and the SGLMM. Closely following the CG-SOP in predictive performance is the SGLMM. Thus, incorporating spatial dependence in the modelling framework improves estimation and prediction as evidenced by the performance of the two spatial models.
|
and validation
data. A significantly large difference between
and
indicates a model is mis-predicting the outcomes [26]. Table 3 presents results of the analysis, showing the OP-GLM and SGLMM as culprits in this regard. The CG-SOP has the smallest difference between
and 
|
and
show a relatively stronger spatial dependence. Results of the ordinal SGLMM are similarly distributed. However, the SGLMM overestimates
suggesting a lower spatial variation than the CG-SOP model, especially at large lags (Figures 2 and 3). The results thus suggest that the CG-SOP is the preferred model for estimation and prediction. Regarding our application, higher levels of extreme poverty is predicted in most of the northern half of the country, extending to the borders with Togo, Cote d'Ivoire, and Burkina Faso. This region lies in the dry savannah ecological zone with short seasonal rainfalls, rendering agricultural lands unproductive for the predominantly farming populations in the area.Low poverty is predicted in much of the forested south and coastal parts of the country. The Greater Accra region is predicted with the lowest risk of extreme poverty, as expected. The seaports and heavy industrial plants (along with the recently discovered crude oil) are found predominantly across the middle and coastal belts of the country, where both the rich agricultural lands and tropical rain forest coincide. Heavy mining of gold, diamond and other mineral resources over the last half-century have all contributed significantly to the low levels of poverty predicted at various locations in the south of the country. However, pockets of poverty observed at different scales in the southernmost parts of the country reflect urban and peri-urban deprivations. These could be due to disturbances linked to short range environmental factors, reiterating the fact that even over much smaller distances, local disturbances can have a distinct effect on the distribution of the poor.The spatial disparities of poverty-severity revealed by this study conform to expert opinion that in a geographic environment, there can be a dominant non-stochastic relationship between economic wellbeing and the spatial dynamics of a country [28, 29]. Application of the modelling techniques to the socio-econometric problem considered here is novel, and demonstrates our contribution to the wider scope of spatial-statistical methods. We model multi-categorical socio-econometric data in a spatial measurement framework that recognizes the ordinal nature of the variable. In our application, we depart from the auto regressive (AR) approach by directly embedding the ordinal variable within the distributional framework of a latent spatial GRF, and this marks a significant innovation.