International Journal of Statistics and Applications
p-ISSN: 2168-5193 e-ISSN: 2168-5215
2016; 6(4): 223-234
doi:10.5923/j.statistics.20160604.03

Kehinde P. Akinpelu1, Oyindamola B. Yusuf1, Onoja M. Akpa1, Abass O. Gbolahan2
1Department of Epidemiology and Medical Statistics, Faculty of Public Health, College of Medicine, University of Ibadan, Nigeria
2State Ministry of Health, State Secretariat, Nigeria
Correspondence to: Oyindamola B. Yusuf, Department of Epidemiology and Medical Statistics, Faculty of Public Health, College of Medicine, University of Ibadan, Nigeria.
| Email: | ![]() |
Copyright © 2016 Scientific & Academic Publishing. All Rights Reserved.
This work is licensed under the Creative Commons Attribution International License (CC BY).
http://creativecommons.org/licenses/by/4.0/

Monthly reports of malaria cases are usually presented as count data potentially with excess zeros. The standard Poisson and negative binomial regression used for modeling such data cannot account for excess zeros and over-dispersion. Hence, this study was designed to model the annual trends in the occurrence of malaria among under-5 children using the zero inflated negative binomial (ZINB) and zero inflated Poisson regression (ZIP). The study also determined the effects of month, year and geographical location on the occurrence of malaria. Malaria surveillance data were obtained from the Integrated Disease Surveillance and Response (IDSR) of Oyo State Ministry of Health, Nigeria from 2010 – 2014. Descriptive statistics were conducted to check for the presence of over-dispersion. Model comparisons were performed between ZINB and ZIP and the best model was selected using Vuong z-statistic criteria. Incidence rate ratios and 95% CI were determined. There were slight variations in the incidence of malaria cases; 35.81 per 1000 in 2011, 35.64 per 1000 in 2013 and 35.72 per 1000 in 2014. The highest risk of malaria was in the year 2014 (IRR = 3.59, 95% CI: 3.05, 4.23) and lowest in 2012 (IRR =2.56, 95% CI: 2.31, 2.83). The risk of malaria was highest in October (IRR = 1.47, 95% CI: 1.15, 1.88) and lowest in January (IRR = 0.80, 95% CI: 0.69, 0.94). The highest risk of malaria was reported in Saki West (IRR= 4.77, 95% CI: 3.58, 6.35) and lowest in Ogbomoso South (IRR = 0.73, 95% CI: 0.55, 0.97). The Vuong z-statistic for the ZINB and ZIP models was -17.079 (i.e. V < -1.96), indicating that ZINB fits the data better. The zero inflated negative binomial regression is the best model to determine the factors that predict the number of cases of malaria, when there is an indication of over dispersion and excess zeros. Zero inflated negative binomial model is suggested for researchers dealing with similar data.
Keywords: Malaria, Over dispersion, Zero inflated count models
Cite this paper: Kehinde P. Akinpelu, Oyindamola B. Yusuf, Onoja M. Akpa, Abass O. Gbolahan, Zero Inflated Regression Models with Application to Malaria Surveillance Data, International Journal of Statistics and Applications, Vol. 6 No. 4, 2016, pp. 223-234. doi: 10.5923/j.statistics.20160604.03.
be the reported cases (number of children) of malaria at a given time point
Assuming
is the probability that the response
for the
point in time is necessarily 0 and
is the expected value of the count response, then the two component models are described as follows:The probability of observing a zero (0) count is ![]() | (1) |
is ![]() | (2) |
and
as conditional expected values and variance of
respectively [35].Combining equations (1) and (2) above and assuming that
is the matrix of covariates or independent variables in the model, we have a stage equation for the ZIP model as follows:![]() | (3) |
![]() | (4) |
and
can be performed using Vuong test [36], [2]. Briefly, using the maximum likelihood estimation procedures, the Vuong test specifically tests the null hypothesis that the two models fit the dataset equally well and model comparison was based on the predicted probabilities of the two competing models [36].Let
be the predicted probability of an observed count for case
from the model
Given a sample of size
and defined
as: ![]() | (5) |
is given by:![]() | (6) |
![]() | Figure 1. The distribution of number of reported of under-5 malaria |
![]() | Figure 2. Pattern of malaria cases in under-5 children from 2010 to 2014 |
![]() | Table 1. Descriptive statistics of number of cases among under 5 children in 2010 to 2014 |
![]() | Table 2. Parameter Estimates in the Zero Inflated Negative Binomial and Zero Inflated Poisson Regression Model for Malaria Among Under-5 Children |
![]() | Table 3. Comparison Tests for the Under-5 Malaria Models (ZIP nd ZINB) |