International Journal of Statistics and Applications

p-ISSN: 2168-5193    e-ISSN: 2168-5215

2016;  6(6): 376-385

doi:10.5923/j.statistics.20160606.06

 

A Size-Biased Poisson-Amarendra Distribution and Its Applications

Rama Shanker1, Hagos Fesshaye2

1Department of Statistics, Eritrea Institute of Technology, Asmara, Eritrea

2Department of Economics, College of Business and Economics, Halhale, Eritrea

Correspondence to: Rama Shanker, Department of Statistics, Eritrea Institute of Technology, Asmara, Eritrea.

Email:

Copyright © 2016 Scientific & Academic Publishing. All Rights Reserved.

This work is licensed under the Creative Commons Attribution International License (CC BY).
http://creativecommons.org/licenses/by/4.0/

Abstract

In this paper, a size-biased Poisson-Amarendra distribution (SBPAD) has been proposed by size-biasing the Poisson-Amarendra distribution (PAD) of Shanker (2016 b), a Poisson mixture of Amarendra distribution introduced by Shanker (2016 a). The first four moments (about origin) and the central moments (about mean) have been obtained and expressions for coefficient of variation (C.V), skewness, kurtosis and index of dispersion have been given. The estimation of its parameter has been discussed using maximum likelihood estimation and method of moments. Three examples of real data-sets have been presented to test the goodness of fit of SBPAD over size-biased Poisson distribution (SBPD), size-biased Poisson-Lindley distribution (SBPLD) and size-biased Poisson-Sujatha distribution (SBPSD).

Keywords: Amarendra distribution, Poisson-Amarendra distribution, Sujatha distribution, Poisson-Sujatha distribution, Size-biasing, Moments, Estimation of parameter, Goodness of fit

Cite this paper: Rama Shanker, Hagos Fesshaye, A Size-Biased Poisson-Amarendra Distribution and Its Applications, International Journal of Statistics and Applications, Vol. 6 No. 6, 2016, pp. 376-385. doi: 10.5923/j.statistics.20160606.06.

1. Introduction

Size-biased distributions are a particular class of weighted distributions which arise naturally in practice when observations from a sample are recorded with probability proportional to some measure of unit size. In field applications, size-biased distributions can arise either because individuals are sampled with unequal probability by design or because of unequal detection probability. Size-biased distributions come into play when organisms occur in groups, and group size influences the probability of detection. Fisher (1934) firstly introduced these distributions to model ascertainment biases which were later formalized by Rao (1965) in a unifying theory for problems where the observations fall in non-experimental, non-replicated and non-random categories. Size-biased distributions have applications in environmental science, econometrics, social science, biomedical science, human demography, ecology, geology, forestry etc. Van Duesen (1986) has detailed study about the applications of size-biased distributions for fitting distributions of diameter at breast height (DBH) data arising from horizontal point sampling (HPS). Later, Lappi and Bailey (1987) have applied size-biased distributions to analyze HPS diameter increment data. The applications of size-biased distributions to the analysis of data relating to human population and ecology can be found in Patil and Rao (1977, 1978). A number of research have been done relating to size-biased distributions and their applications in different fields of knowledge by different researchers including Scheaffer (1972), Patil and Ord (1976), Singh and Maddala (1976), Patil (1981), McDonald (1984), Gove (2000, 2003), Correa and Wolfson (2007), Drummer and McDonald (1987), Ducey (2009), Alavi and Chinipardaz (2009), Ducey and Gove (2015), are some among others.
Let a random variable has probability distribution . If sample units are weighted or selected with probability proportional to , then the corresponding size-biased distribution of order is given by its probability mass function
where . When , the distribution is known as simple size-biased distribution and is applicable for size-biased sampling and for , the distribution is known as area-biased distribution and is applicable for area-biased sampling.

2. Size-Biased Poisson-Amarendra Distribution (SBPAD)

The Poisson-Amarendra distribution (PAD) having probability mass function (p.m.f.)
(2.1)
has been introduced by Shanker (2016 b) for modeling count data in various fields of knowledge. Its various statistical properties, estimation of parameter and applications have been discussed in details by Shanker (2016 b) and shown that it is better than Poisson distribution (PD), Poisson-Lindley distribution (PLD) of Sankaran (1970) and Poisson-Sujatha distribution (PSD) introduced by Shanker (2016 d). It would be recall that PSD is a Poisson mixture of Sujatha distribution introduced by Shanker (2016 c). Shanker and Hagos (2016 a) have detailed study about the applications of PSD for modeling discrete data from biological sciences.
The PAD arises from the Poisson distribution when its parameter follows Amarendra distribution introduced by Shanker (2016 a) with probability density function (p.d.f.)
(2.2)
The p.m.f. of the size-biased Poisson-Amarendra distribution (SBPAD) with parameter can be obtained as
(2.3)
where is the mean of the PAD with p.m.f. (2.1).
Recall that the p.d.f. (2.3) of SBPAD can also be obtained from the size-biased Poisson distribution (SPBD) with p.m.f.
(2.4)
when its parameter follows the size-biased Amarendra distribution (SBAD) with p.d.f.
(2.5)
We have
(2.6)
which is the p.m.f of SBPAD.
The p.m.f. of size-biased Poisson-Sujatha distribution (SBPSD) given by
(2.7)
has been introduced by Shanker and Hagos (2016 b), which is a size biased version of Poisson-Sujatha distribution (PSD) proposed by Shanker (2016 d).
It would be recall that the p.m.f of size-biased Poisson-Lindley distribution (SBPLD) given by
(2.8)
has been introduced by Ghitany and Mutairi (2008), which is a size-biased version of Poisson-Lindley distribution introduced by Sankaran (1970). The Poisson-Lindley distribution (PSD) is a Poisson mixture of Lindley (1958) distribution. Ghitany and Mutairi (2008) have discussed its various mathematical and statistical properties, estimation of the parameter using maximum likelihood estimation and the method of moments, and goodness of fit. Shanker et al (2015) have critical study on the applications of SBPLD for modeling data on thunderstorms and found that SBPLD is a better model for thunderstorms than size-biased Poisson distribution (SBPD).
The graphs of the probability mass functions of SBPAD, SBPSD and SBPLD for selected values of their parameter θ are shown in the figure 1.
Figure 1. Graphs of pmf of SBPAD, SBPSD and SBPLD for selected values of the parameter θ

3. Moments and Associated Measures

Using (2.6), the rth factorial moment about origin of the SBPAD (2.3) can be obtained as
Taking , we get
Using gamma integral and some small algebraic simplification, the rth factorial moment about origin of SBPAD (2.3) can be obtained as
(3.1)
Taking r = 1,2,3, and 4 in (3.1), the first four factorial moments about origin can be obtained and using the relationship between moments about origin and factorial moments about origin, the first four moments about origin of the SBPAD (2.3) are thus obtained as
Now, using the relationship between central moments and the moments about origin, the central moments of the SBPAD (2.3) are thus obtained as
The coefficient of variation, coefficient of skewness, coefficient of kurtosis and index of dispersion of the SBPAD (2.3) are finally obtained as
The conditions under which SBPAD, SBPSD and SBPLD are over-dispersed , equi-dispersed and under-dispersed are presented in table 3.1.
Table 3.1. Condition for over-dispersion, equi-dispersion, and under-dispersion for SBPAD, SBPSD and SBPLD
     
To study the characteristics of and of SBPAD, SBPSD and SBPLD for varying values of their parameter , the numerical values of these characteristics have been presented in table 3.2.
Table 3.2. Characteristics of
      for SBPAD, SBPSD and SBPLD for selected values of the parameter θ
     
The graphs of coefficient of variation (C.V), coefficient of skewness, coefficient of kurtosis and index of dispersion of SBPAD, SBPSD and SBPLD are shown in figure 2
Figure 2. Graphs of coefficient of variation (C.V), coefficient of skewness , coefficient of kurtosis , and index of dispersion for SBPAD, SBPSD and SBPLD for selected values of their parameter θ

4. Unimodality and Increasing Failure Rate

Using p.m.f. of SBPAD from (2.3), we have
which is a deceasing function of x, is log-concave. Therefore, SBPAD is unimodal, has an increasing failure rate (IFR), and hence increasing failure rate average (IFRA). It is new better than used in expectation (NBUE) and has decreasing mean residual life (DMRL). The definitions, concepts and interrelationship between these aging concepts have been discussed in Barlow and Proschan (1981).

5. Parameter Estimation

5.1. Maximum Likelihood Estimate (MLE): Let be a random sample of size n from the SBPAD (2.3) and let be the observed frequency in the sample corresponding to such that , where k is the largest observed value having non-zero frequency. The likelihood function Lof the SBPAD (2.3) is given by
The log likelihood function can be obtained as
The first derivative of the log likelihood function is thus given by
where is the sample mean.
The maximum likelihood estimate (MLE), of of SBPAD (2.3) is the solution of the equation and is given by the solution of the following non-linear equation
This non-linear equation can be solved by any numerical iteration methods such as Newton- Raphson method, Bisection method, Regula –Falsi method etc. In the present paper, Newton-Raphson method has been used to solve the above non-linear equation to find MLE of the parameter.
5.2. Method of Moment Estimate (MOME): Equating the population mean to the corresponding sample mean, the method of moment estimate (MOME) of of SBPAD (2.3) is the solution of the following fourth degree polynomial equation in
where is the sample mean.

6. Goodness of Fit

In this section the goodness of fit of the SBPAD, SBPSD, SBPLD and SBPD have been presented for three count data- sets. The fitting of these distributions are based on maximum likelihood estimates of the parameter. The first data-set is immunogold assay data of Cullen et al. (1990) regarding the distribution of number of counts of sites with particles from immunogold assay data, the second data-set is animal abundance data of Keith and Meslow (1968) regarding the distribution of snowshoe hares captured over 7 days, and the third data-set is number of counts of pairs of running shoes owned by 60 members of an athletics club, reported by Simonoff (2003).
Table 6.1. Distribution of number of counts of sites with particles from Immunogold data
     
Table 6.2. Distribution of snowshoe hares captured over 7 days
     
Table 6.3. Number of counts of pairs of running shoes owned by 60 members of an athletics club, reported by Simonoff (2003, p. 100)
     

7. Concluding Remarks

A size-biased Poisson mixture of size-biased Amarendra distribution named, “size-biased Poisson-Amarendra distribution (SBPAD)” has been proposed by size-biasing the Poisson-Amarendra distribution (PAD) of Shanker (2016 b). Its moments and other structural properties including coefficient of variation, skewness, kurtosis, index of dispersion, hazard rate and unimodality have been studied. The estimation of its parameter has been discussed using method of maximum likelihood and that of moments. Three examples of real data-sets have been presented to test the goodness of fit of SBPAD over the size-biased Poisson distribution (SBPD), the size-biased Poisson-Lindley distribution (SBPLD) and the size-biased Poisson -Sujatha distribution (SBPSD). The fit of SBPAD over SBPD, SBPLD, and SBPSD shows that SBPAD can be considered as an important distribution for modeling data which structurally excludes zero counts.

ACKNOWLEDGEMENTS

The authors express their gratitude to the editor-in-chief and the anonymous reviewer for his constructive suggestions which improved the quality of the paper.

References

[1]  Alavi, S.M.R. and Chinipardaz, R. (2009): Form-invariance under weighted sampling, Statistics, 43, 81 – 90.
[2]  Barlow, R.E. and Proschan, F. (1981): Statistical Theory of Reliability and Life Testing, Silver Spring, MD.
[3]  Correa, J.A. and Wolfson, D.B. (2007): Length-bias: some Characterizations and applications, Journal of Statistical Computation and Simulation, 64, 209 – 219.
[4]  Drummer, T.D. and MacDonald, L.L. (1987): Size biased in line transect sampling, Biometrics, 43, 13 – 21.
[5]  Cullen, M.J., Walsh, J., Nicholson, L.V., and Harris, J.B. (1990): Ultrastructural localization of dystrophin in human muscle by using gold immunolabelling, Proceedings of the Royal Society of London, 20, 197 - 210.
[6]  Ducey, M.J. (2009): Sampling trees with probability nearly proportional to biomass, For. Ecol. Manage., 258, 2110 - 2116.
[7]  Ducey, M.J. and Gove, J.H. (2015): Size-biased distributions in the generalized beta distribution family, with applications to forestry, Forestry- An International Journal of Forest Research. 88, 143 – 151.
[8]  Fisher, R.A. (1934): The effects of methods of ascertainment upon the estimation of frequencies, Ann. Eugenics, 6, 13 – 25.
[9]  Ghitany, M.E. and Al-Mutairi, D.K. (2008): Size-biased Poisson-Lindley distribution and Its Applications, Metron - International Journal of Statistics, LXVI (3), 299 – 311.
[10]  Gove, J.H. (2000): Some observations on fitting assumed diameter distributions to horizontal point sampling data, Can. J. For. Res., 30, 521 – 533.
[11]  Gove, J.H. (2003): Estimation and applications of size-biased distributions in forestry. In Modeling Forest Systems. A Amaro, D. Reed and P. Soares (Eds), CABI Publishing, pp. 201 – 212.
[12]  Keith, L.B. and Meslow, E.C. (1968): Trap Response by snowshoe hares, Journal of Wildlife Management, 20, 795- 801.
[13]  Lappi, J. and bailey, R.L. (1987): Estimation of diameter increment function or other tree relations using angle-count samples, Forest science, 33, 725 – 739.
[14]  Lindley, D.V. (1958): Fiducial distributions and Bayes theorem, Journal of the Royal Statistical Society, 20 (1), 102 – 107.
[15]  McDonald, J.B. (1984): Some generalized functions for the size distribution of income, Econometrics, 52, 647 – 664.
[16]  Patil, G.P. (1981): Studies in statistical ecology involving weighted distributions. In Applications and New Directions, J.K. Ghosh and J. Roy (eds). Proceeding of Indian Statistical Institute. Golden Jubliee, Statistical Publishing society, pp. 478 – 503.
[17]  Patil, G.P. and Ord, J.K. (1976): On size-biased sampling and related form-invariant weighted distributions, Sankhya Ser. B, 38, 48 – 61.
[18]  Patil, G.P. and Rao, C.R. (1977): The Weighted distributions: A survey and their applications. In applications of Statistics (Ed P.R. Krishnaiah0, 383 – 405, North Holland Publications Co., Amsterdam.
[19]  Patil, G.P. and Rao, C.R. (1978): Weighted distributions and size-biased sampling with applications to wild-life populations and human families, Biometrics, 34, 179 – 189.
[20]  Rao, C.R. (1965): On discrete distributions arising out of methods of ascertainment In: Patil, G.P.(eds) Classical and Contagious Discrete Distributions. Statistical Publishing Society, Calcutta, 320 – 332.
[21]  Sankaran, M. (1970): The discrete Poisson-Lindley distribution, Biometrics, 26, 145- 149.
[22]  Scheaffer, R.L. (1972): Size-biased sampling, Technometrics, 14, 635 – 644.
[23]  Shanker, R. (2016a): Amarendra distribution and Its Applications, American Journal of Mathematics and Statistics, 6(1), 44 – 56.
[24]  Shanker, R. (2016b): The discrete Poisson-Amarendra distribution, to appear in International Journal of Statistical distributions and Applications, 2(2), 14 - 21
[25]  Shanker, R. (2016c): Sujatha distribution and Its Applications, Statistics in Transition-new Series, 17 (3), 1 – 20.
[26]  Shanker, R. (2016d): The discrete Poisson-Sujatha distribution, International Journal of Probability and Statistics, 5(1), 1 – 9
[27]  Shanker, R., Hagos, F. and Abrehe, Y. (2015): On Size –Biased Poisson-Lindley Distribution and Its Applications to Model Thunderstorms, American Journal of Mathematics and Statistics, 5 (6), 354 – 360
[28]  Shanker, R. and Hagos, F. (2016a): On Poisson-Sujatha distribution and its Applications to Model count data from Biological Science, Biometrics and Biostatistics International Journal, 3(4), 1 – 7.
[29]  Shanker, R. and Hagos, F. (2016b): Size-biased Poisson-Sujatha distribution with Applications, American Journal of Mathematics and Statistics, 6(4), 145 – 154.
[30]  Singh, S.K. and Maddala, G.S. (1976): A function for the size distribution of incomes, Econometrica, 44, 963 – 970.
[31]  Simmonoff, J.S. (2003): Analyzing Categorical data, Springer, New york
[32]  Van Deusen, P.C. (1986): Fitting assumed distributions to horizontal point sample diameters, For. Sci., 32, 146 -148.