Applied Mathematics

p-ISSN: 2163-1409    e-ISSN: 2163-1425

2020;  10(1): 12-19

doi:10.5923/j.am.20201001.03

Received: July 21, 2020; Accepted: August 5, 2020; Published: August 29, 2020

 

Joint Influence of Double Sampling and Randomized Response Technique on Estimation Method of Mean

Nadia Mushtaq1, Muhammad Noor-ul-Amin2

1Department of Statistics, Forman Christian College University, Lahore

2COMSATS Institute of Information and Technology Lahore, Pakistan

Correspondence to: Nadia Mushtaq, Department of Statistics, Forman Christian College University, Lahore.

Email:

Copyright © 2020 The Author(s). Published by Scientific & Academic Publishing.

This work is licensed under the Creative Commons Attribution International License (CC BY).
http://creativecommons.org/licenses/by/4.0/

Abstract

In this article, the problem of estimation under two-phase random sampling using randomized response technique is considered. In two-phase (double) sampling, the expression of bias and mean square error (MSE) up to the first-order approximations are derived for the proposed estimators. Simulation studies and real data are presented to demonstrate the performance of proposed estimators.

Keywords: Double Sampling, Sensitive Variable, Randomized Response Technique

Cite this paper: Nadia Mushtaq, Muhammad Noor-ul-Amin, Joint Influence of Double Sampling and Randomized Response Technique on Estimation Method of Mean, Applied Mathematics, Vol. 10 No. 1, 2020, pp. 12-19. doi: 10.5923/j.am.20201001.03.

1. Introduction

It is common practice in sample survey we obtain the information about auxiliary variable(s) from a larger sample at first phase and relatively small sample from the second phase by using two-phase sampling procedure. Many authors worked on two-phase random sampling such as: Sukhatme (1962), Singh and Vishwakarma (2007), Sahoo et al. (2010) Noor-ul-amin and Hanif (2012), Sanaullah et al. (2014), etc.
In survey sampling information on sensitive variable would be collected by using randomized response technique introduced by Warner (1965), because direct reliable information on variable of interest is sometime may not possible. Several authors have worked on randomized response techniques on estimation of mean, including Eichhorn and Hayre (1983), Gupta et al. (2002), Chang et al. (2005), Huang (2008). Sousa et al. (2010) introduced ratio estimators by using non-sensitive auxiliary information. Gupta et al. (2014) presented ratio and regression estimator using optional scrambling. Mushtaq et al. (2017) presented a family of estimators of a sensitive variable using auxiliary information in stratified random sampling. Noor-ul-amin et al. (2018) proposed estimation of mean using generalized optional scrambled responses in the presence of non-sensitive auxiliary variable. Saleem I. et al (2019) presented estimation of mean of a sensitive quantitative variable in complex survey: improved estimator and scrambled randomized response model. Partha Parichha et al. (2020) discuss the development of estimation procedure of population mean in Two-Phase Stratified Sampling.
Encourage the above work, we have suggested a generalized class of estimators in two-phase sampling using randomized response technique. The main purpose is to suggest a strategy of two-phase (double) sampling in randomized response technique and proposed general family of estimators for estimating the finite population mean of a sensitive variable with non-sensitive auxiliary variable based on RRT in two- phase (double) random sampling.

2. Sampling Strategy

2.1. Notations & Scheme of Selection of Sample

We consider the finite population in which be the sensitive study variable, be non-sensitive auxiliary variable which is correlated with and be scrambling variable independent of and The reported response of the respondents is , and is the number of units in the first sample whereas n is the number of units in the second sample. Only in the second sample both study and auxiliary variables are observed, in the first sample only auxiliary variable is observed because study variable is expensive.
The two-phase sampling strategy is given below:
1. The first phase, a large sample of a fixed size is drawn from N to observe only or auxiliary variable.
2. The second phase sample, a sub-sample of fixed size is drawn from to observe and , so that .
Let define the following notations:

2.2. Discussion on Estimators in Double Sampling Strategy in RRT

Firstly, we introduce some existing estimators in double sampling using RRT.
The mean and variance of the usual mean estimator in RRT is given by
(2.1)
(2.2)
Where and .
A ratio estimator in two-phase sampling in RRT is given as:
(2.3)
(2.4)
And regression estimator in two-phase sampling given as
(2.5)
Where
(2.6)

3. Proposed Estimators

We propose the following a class of generalized estimators in two-phase sampling:
(3.1)
Where and are weights whose values are to be determined, and are the parameters of the auxiliary variables.
From for , we obtain the following estimators
From for we obtain the following estimators
Expanding (3.1), we have
(3.2)
Where
(3.3)
Using (3.3), the and of are given by
(3.4)
(3.5)
And optimum values of and , respectively, are found as,
Substituting these optimum values in (3.5), the minimum of is given by
(3.6)
By using (3.6), for different values of and or , we can get the minimum of .

4. Simulation Study & Efficiency Comparison of Proposed Estimators

We use the simulation studies for efficiency comparison by empirically and theoretically. Two populations for simulation studies of size 1000 each from bivariate normal populations for (Y, X), with different covariance matrices are used. The Scrambling variable and .
Population 1 Mean of given as
Population 2 Mean of given as
For all populations, we consider four sample sizes: and respectively.
The empirical and theoretical MSE’s for various sensitive mean estimators are given in Tables 1-2. We estimate the empirical MSE using 10000 samples of different sizes selected from each population.

5. Real Data Set Application of Two-Phase or Double Sampling in RRT

For this analysis, we consider the real population used in Gupta et al. (2012). Let be the monthly salaries amount in 2010, is the number of employees available from business data register and . For this population, we have:
, (in thousands of) and and respectively.
Numerical results of empirical and theoretical MSE based on population data is given in Table 3.
The following expression is used to obtain percent relative efficiency (PRE) of different estimators with respect to :
Where
In Table 3 we present the empirical and theoretical results of MSE estimates and PRE of the various estimators in the stratified sample.

6. Conclusions

In this study, we suggested the idea of two-phase sampling in randomized response technique. We consider a general class of estimators for mean of sensitive variable based on randomized response technique in two-phase sampling. In Tables 1-3, we present the results of the theoretical and empirical MSE and PRE of the estimators in two-phase sampling using randomized response technique. These results are computed with a simulation studies and using a real data set.

Appendix

Table 1. Empirical and Theoretical MSE, PRE for the estimators relative to RRT mean estimator for Population 1 in Two Phase Sampling
     
Table 2. Empirical and Theoretical MSE, PRE for the estimators relative to RRT mean estimator for Population 2 in Two Phase Sampling
     
Table 3. Empirical and Theoretical MSE, PRE for the estimators relative to RRT mean estimator for Real data in Two Phase Sampling
     

References

[1]  Chang, H. J., Huang, K. C., & WU, C. H. (2005). Constructing indirect randomized response techniques using symmetry of response. Journal of Information & Optimization Sciences, 26(3), 549-557.
[2]  Eichhorn, B.H. and Hayre, L. S. (1983). Scrambled randomized response models for obtaining sensitive quantitative data. Journal of Statistical Planning and Inference, 7, 307-316.
[3]  Gupta, S., Gupta, B. and Singh, S. (2002). Estimation of sensitivity level of personal interview survey questions. Journal of Statistical Planning and Inference, 100, 239-247.
[4]  Gupta, S., Kalucha, G., Shabbir, J. and Dass, B.K. (2014). Estimation of finite population mean using optional models in the presence of non-sensitive auxiliary information. American Journal of Mathematical and Management Sciences, 33(2), 147-159.
[5]  Huang, K.C. (2008). Estimation for sensitive characteristics using optional randomized response technique. Quality and Quantity, 42(5), 679-686.
[6]  Saleem, I., Sanaullah, A., Koyuncu, N., Hanif, M., (2019). Estimation of Mean of a Sensitive Quantitative Variable in Complex Survey: Improved Estimator and Scrambled Randomized Response Model, Journal of Science, Ghazi University, 32(3): 1021-1043.
[7]  Mushtaq, N., Noor-ul-Amin, M. and Hanif, M. (2017). A Family of Estimators of a Sensitive Variable Using Auxiliary Information in Stratified Random Sampling, Pakistan Journal of Statistics and operation research. 13(1), 141-155.
[8]  Noor-ul-Amin, M., Mushtaq, N., and Hanif, M. (2018). Estimation of mean using generalized optional scrambled responses in the presence of nonsensitive auxiliary variable, Journal of Statistics and Management Systems. 21(2): 287-304.
[9]  Noor-ul-Amin, M. and Hanif, M. (2012). Some exponential estimators in survey sampling. Pak. J. Statist. 28(3), 367-374.
[10]  Partha Parichha & Dr. Kajla Basu & Arnab Bandyopadhyay, 2020. "Development of Estimation Procedure of Population Mean in Two-Phase Stratified Sampling," Chapters, in: Jan Peter Hessling (ed.), Statistical Methodologies, Intech Open.
[11]  Sahoo, L., Mishra, G., and Nayak, S. (2010). On two different classes of estimators in two-phase sampling using multi-auxiliary variables. Model Assisted Statistics and Applications, 5(1): 61-68.
[12]  Sanaullah, A., Ali, H. A., Noor-ul-Amin, M. and Hanif, M., (2014). Generalized Exponential chain ratio estimators under stratified two-phase random sampling. Applied Mathematics and Computation. 226, 541-547.
[13]  Singh, P. and Vishwakarma, K. (2007). Modified exponential ratio and product estimators for finite population mean in Double sampling. Austral. J. Statist. 36, 217-225.
[14]  Sousa, R., Shabbir, J., Real, P. C., and Gupta, S. (2010). Ratio estimation of the mean of a sensitive variable in the presence of auxiliary information. Journal of Statistical Theory and Practice, 4(3), 495-507.
[15]  Sukhatme, B.V. (1962). Some ratio-type estimators in two-phase sampling, J. Amer. Statist. Assoc. 57, 628-632.
[16]  Warner, S. L. (1965). Randomized response: A survey technique for eliminating evasive answer bias. Journal of the American Statistical Association, 60, 63-69.