Abstract
Singh and Kumar (A family of estimators of population variance using information on auxiliary attribute. Studies in sampling techniques and time series analysis, 2011) and Singh and Malik (Appl Math Comput 235:43–49, 2014) suggested some estimators for estimating the population variance using an auxiliary attribute. This paper suggests a generalized class of estimators based on the adaption of the estimator presented by Koyuncu (Appl Math Comput 218:10900–10905, 2012) for population variance using information on an auxiliary attribute in simple random sampling. The properties of the suggested class of estimators are derived and asymptotic optimum estimator identified with its properties. The large numbers of known estimators are member of the suggested generalized class and it has been shown that proposed generalized class of estimators are more efficient than usual unbiased estimator, ratio, exponential ratio and regression estimator, estimators due to Singh and Malik (Appl Math Comput 235:43–49, 2014) and Singh and Kumar (A family of estimators of population variance using information on auxiliary attribute. Studies in sampling techniques and time series analysis, 2011) using information on auxiliary attribute. In addition, theoretical results are supported by an empirical study and findings are encouraging and support the soundness of present study.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
In the sampling literature, it is well established that efficiency of the estimator of population parameters of interest can be increased by the use of auxiliary information related auxiliary variable x, which is highly correlated with study variable y. In literature of survey sampling many authors have suggested estimators based on auxiliary information. However in many situations of practical importance, instead of an auxiliary variable x there exist an attribute (say, \({\upvarphi }\)) which is highly correlated with study variable y. In these situations by taking the advantage of point bi-serial (see [1]) correlation between the study variable y and the auxiliary attribute \({\upvarphi }\), the efficient estimators of population parameter of interest can be constructed. Several authors including [2–10] have paid their attention towards the improved estimation using auxiliary attribute.
Let us consider a sample of size n is drawn by SRSWOR from a population of size N. Further let \(\hbox {y}_\mathrm{i}\) and \({\upphi }_\mathrm{i}\) denote the observations on variable y and \({\upphi }\) respectively for the ith unit (i = 1,2,3,...,N). It is assumed that attribute \({\upphi }\) takes only the two values 0 and 1 according as \({\upphi } =1\), if ith unit of the population possesses attribute \({\upphi }=0\), if otherwise. The variance of the usual unbiased estimator \(\hbox {S}_\mathrm{y}^2 \) is given by
where
In this paper a family of estimator have been proposed for the population variance \(\hbox {S}_\mathrm{y}^2\) when the auxiliary information is available in the form of attribute. For main results we confine ourselves to sampling scheme SRSWOR ignoring the finite population correction.
Estimators in Literature
In order to have an estimate of the study variable y, assuming the knowledge of the population proportion P, [11] proposed the following estimators.
The MSE expression of the estimator \(\hbox {t}_1\) and variance of \(\hbox {t}_2\) are given, respectively, by
On differentiating (6) with respect to \(\hbox {b}_{\upphi } \) and equating to zero we obtain
Substituting the optimum value of \(\hbox {b}_{\upphi }\) in (6), we get the optimum variance of estimator \(\hbox {t}_2\), as
The MSE expression of the estimator \(\hbox {t}_3\) is given by
Following [12, 13] proposed the following variance estimator using known values of some population parameters
where \(\hbox {s}_\mathrm{y}^2\) and \(\hbox {s}_{\upphi }^2 \) are unbiased estimator of population variances \(\hbox {S}_\mathrm{y}^2\) and \(\hbox {S}_{\upphi }^2\), respectively.
To obtain the bias and MSE, we write-
\(\hbox {s}_\mathrm{y}^2 =\hbox {S}_\mathrm{y}^2 \left( {1+\hbox {e}_0 } \right) \) and \(\hbox {s}_{\upphi }^2 =\hbox {S}_{\upphi }^2 \left( {1+\hbox {e}_1} \right) .\)
Such that \(\hbox {E}\left( {\hbox {e}_0} \right) =\hbox {E}\left( {\hbox {e}_1} \right) =0\)
and \(\hbox {E}\left( {\hbox {e}_0^2 } \right) =\frac{\left( {{\uplambda } _{40} -1} \right) }{\hbox {n}},\, \hbox {E}\left( {\hbox {e}_1^2 } \right) =\frac{\left( {{\uplambda }_{04} -1} \right) }{\hbox {n}}\), \(\hbox {E}\left( {\hbox {e}_0 \hbox {e}_1 } \right) =\frac{\left( {{\uplambda }_{22} -1} \right) }{\hbox {n}}\)
and \({\upkappa }_{\mathrm{pb}} ={\uprho }_{\mathrm{pb}} \frac{\hbox {C}_\mathrm{y}}{\hbox {C}_\mathrm{p}}.\)
The MSE expression of \(\hbox {t}_{\mathrm{KC}_\mathrm{i}}\) (i = 1,2,3,4) to the first order of approximation are respectively given by
where
Following [5], Singh and Malik proposed the following variance estimator.
where \(\hbox {n}_1,\,\hbox {n}_2\) are either real numbers or the functions of the known parameters of attribute such as \(\hbox {C}_\mathrm{p},{\uprho }_{\mathrm{pb}},\, {\upbeta } _{2{\upphi }}\) and \({\upkappa }_{\mathrm{pb}}\).
The MSE expression of \(\hbox {t}_\mathrm{s}\) to the first order of approximation are respectively given by
where \(\hbox {A}_1 =\frac{\hbox {n}_1 \hbox {S}_{\upphi }^2}{\hbox {n}_1 \hbox {S}_{\upphi }^2 +\hbox {n}_2}.\)
The minimum MSE of \(\hbox {t}_\mathrm{s}\) is observed at \(\hbox {n}_1 ={\uprho }_{\mathrm{pb}}\) and \(\hbox {n}_2 ={\upbeta }_{2{\upphi }}\).
Following [5], Singh and Malik proposed another improve ratio type estimator \(\hbox {t}_{\mathrm{rs}}\) for the population variance as
where \({\upeta } ,\hbox {v}\) are either real numbers or the functions of the known parameters of attributes such as \(\hbox {C}_\mathrm{p} ,{\upbeta }_{2{\upphi } } ,{\uprho }_{\mathrm{pb}} \) and \({\upkappa }_{\mathrm{pb}}\).
Up to the first order approximation, the minimum MSE of \(\hbox {t}_{\mathrm{rs}}\) is given by,
where \({\upalpha }_0 =\frac{({\uplambda }_{22} -1)}{\hbox {A}_2 ({\uplambda }_{04} -1)}\) and \(\hbox {A}_2 =\frac{{\upeta } \hbox {S}_{\upphi }^2 }{({\upeta } \hbox {S}_{\upphi }^2 -\hbox {v})}.\)
Another improved class of estimator suggested by [13] is as follows
where \({\upgamma }\) and \({\upmu }\) are either real numbers or function of known parameters of the auxiliary attribute \({\upphi } \) such as \(\hbox {C}_\mathrm{p} ,{\upbeta }_{2{\upphi } } ,{\uprho }_{\mathrm{pb}}\) and \({\upkappa }_{\mathrm{pb}}\). The scalar \({\upgamma }\) takes value -1 and +1 for ratio and product type estimators, respectively.
The min MSE of estimator \(\hbox {t}_\mathrm{n}\) up to the first order of approximation is given by,
where,
The Suggested Class of Estimators
Motivated by [14] we propose generalized class of estimators \(\hbox {t}_\mathrm{M}\) for estimating the population variance \(\hbox {S}_\mathrm{y}^2 \), as
where \({\upomega }_1\) and \({\upomega }_2\,({\upomega }_1 + {\upomega }_2 \ne 1)\) are suitable constants to be determined such that MSE of \(\hbox {t}_\mathrm{M}\) is minimum, \({\upeta } ,\, {\upgamma }\) and \({\uplambda } \) are either real numbers or the functions of the known parameter associated with auxiliary attribute (See [15]).
\({\upgamma } \) and \({\upeta } \) are chosen in such a way that they generate ratio type and product type estimators for variance estimators for particular values as +1 and \(-\)1.
A set of new estimators generated from (21) using suitable values of \({\upomega }_1 ,{\upomega }_2 ,{\upgamma } ,{\upeta }\) and \({\uplambda }\) are listed in Table 1.
Expanding Eq. (21) in terms of e’s up to the first order of approximation, we have,
where, \(\hbox {e}_0 =\frac{\hbox {s}_\mathrm{y}^2 -\hbox {S}_\mathrm{y}^2}{\hbox {S}_\mathrm{y}^2 }, \hbox {e}_1 =\frac{\hbox {s}_{\upphi }^2 -\hbox {S}_{\upphi }^2 }{\hbox {S}_{\upphi }^2}\) and \(\hbox {v}=\frac{{\upeta } \hbox {S}_{\upphi }^2 }{{\upeta } \hbox {S}_{\upphi }^2 +{\uplambda }}.\)
To obtain the bias and MSE of the estimator \(\hbox {t}_{\mathrm{M}}\) to the first degree of approximation, we write
Such that \(\hbox {E}(\hbox {e}_0 )=\hbox {E}(\hbox {e}_1)=0\)
Also \(\hbox {E}(\hbox {e}_0 )=\frac{{\uplambda }_{04} -1}{\hbox {n}}, \quad \hbox {E}(\hbox {e}_1 )=\frac{{\uplambda }_{40} -1}{\hbox {n}}\) and \(\hbox {E}(\hbox {e}_0 \hbox {e}_1)=\frac{{\uplambda } _{22} -1}{\hbox {n}}\)
Taking expectation both sides of Eq. (22), we get the bias expression of estimator \(\hbox {t}_\mathrm{M}\) as
Squaring both sides of Eq. (23) and taking expectation we get the MSE expression of estimator \(\hbox {t}_\mathrm{M} \) as
where
Partially differentiating Eq. (24) with respect to \({\upomega }_1\) and \({\upomega }_2\) and equating to zero, we get the optimum value of \({\upomega }_1\) and \({\upomega }_1\) as
Substituting the optimal values of \({\upomega }_i\) (i = 1,2) we obtain the minimum MSE associated with \(\hbox {t}_\mathrm{M} \),
Efficiency Comparisons
We compare the efficiency of the proposed estimator \(\hbox {t}_\mathrm{M}\) under optimum condition with the usual unbiased estimator, ratio estimator, exponential ratio estimator, regression estimator for variance estimation using an auxiliary attribute:
From Eqs. (26) to (33), we conclude that the proposed estimator \(\hbox {t}_\mathrm{M}\) under aforesaid conditions performs better than the other existing estimator for the same scenario discussed in this paper.
Empirical Study
In this section we compare the performance of different estimators considered in this paper using two population data sets. The description of population data sets are as follows.
Population I [Source: [16], p. 256].
-
y = Number of villages in the circle.
-
\({\upphi } =\) A circle consisting more than five villages.
-
N = 89, n = 23, \(\hbox {S}_\mathrm{y}^2 =4.074, \hbox {S}_{\upphi }^2 =0.11, \hbox {C}_\mathrm{y} =0.601, \hbox {C}_\mathrm{p} =2.678, {\uprho }_{\mathrm{pb}} =0.766\), \({\upbeta }_{2{\upphi }} =6.162, {\uplambda }_{22} =3.996, {\uplambda }_{40} =3.811\), \({\uplambda }_{04} =6.162\).
Population II [ Source: [17], p. 203].
-
y = Household size in each household of village.
-
\({\upphi } =\) Household consisting size more than five.
-
N=35, n=15, \(\hbox {S}_\mathrm{y}^2 =4.232, \hbox {S}_{\upphi }^2 =0.252, \hbox {C}_\mathrm{y} =0.346\), \(\hbox {C}_\mathrm{p} =0.879, {\uprho }_{\mathrm{pb}} =0.773,\, {\upbeta }_{2{\upphi } } =1.052, {\uplambda }_{22} =0.952,\, {\uplambda }_{40} =4.977, {\uplambda }_{04} =1.052\).
Table 2 exhibits that the PRE’s of the proposed estimators including the different members of the proposed class along with the PRE’s of the existing estimators with respect to \(\hbox {S}_\mathrm{y}^{2}\) for two real population data sets. Estimators \(\hbox {t}_{\mathrm{M}_\mathrm{i}}\) (i = 1,2, ..., 10) are obtained from class of estimators \(\hbox {t}_\mathrm{M} \) by taking different values of \({\upeta } \) and \({\upgamma }\) and percent relative efficiency shown in the table. The highest PRE is obtained for \({\upgamma } =0,\, {\upeta } =1\) and \({\uplambda } =\hbox {C}_\mathrm{p}\) of estimator \(\hbox {t}_\mathrm{M}\). It has also been observed that the suggested class of estimators \(\hbox {t}_\mathrm{M}\) under optimum condition is more efficient than the usual unbiased estimator, ratio estimator, regression estimator, [11, 13] estimator and other estimators discussed in this paper. Hence for observed choice of parameters the proposed estimator \(\hbox {t}_\mathrm{M}\) is best among the entire estimators considered in this paper.
Conclusion
In this article we have suggested a generalized class of estimators for the population variance of study variable y when information is available on an auxiliary attribute in simple random sampling without replacement (SRSWOR). In addition, some known estimators of population variance such as usual unbiased estimator, ratio, and exponential ratio type estimators are found to be members of the proposed generalized class of estimators. Some new members are also generated from the proposed generalized class of estimators. We have determined the biases and mean square errors of the proposed class of estimators up to the first order of approximation. The proposed generalized class of estimators is advantageous in the sense that the properties of the estimators, which are members of the proposed class of estimators, can be easily obtained from the properties of the proposed generalized class. Thus the study unifies properties of several estimators for population variance using information on an auxiliary attribute. In theoretical and empirical efficiency comparisons, it has been shown that almost all the members of the proposed generalized class of estimators are more efficient than the usual unbiased estimator, ratio, exponential ratio, regression estimator, estimators due to [11, 13] and all other estimators considered here using information on an auxiliary attribute, scrupulously, estimator \(\hbox {t}_{\mathrm{M7}}\) is best among all the members of generalized class in the sense of having least mean square error.
References
Kendall, M.G., Stuart, A.: The Advanced Theory of Statistics, 2nd edn. Charles Griffin and Company Limited, London (1967)
Naik, V.D., Gupta, P.C.: A note on estimation of mean with known population proportion of an auxiliary character. J. Ind. Soc. Agric. Stat. 48(2), 151–158 (1996)
Jhajj, H.S., Sharma, M.K., Grover, L.K.: A family of estimators of population mean using information on auxiliary attribute. Pak. J. Stat. 22(1), 43–50 (2006)
Shabbir, J., Gupta, S.: On estimating the finite population mean with known population proportion of an auxiliary variable. Pak. J. Stat. 23, 1–9 (2007)
Singh, R., Cauhan, P., Sawan, N., Smarandache, F.: Auxiliary Information and a Priori Values in Construction of Improved Estimators. Renaissance High Press, Ames (2007)
Singh, R., Cauhan, P., Sawan, N., Smarandache, F.: Ratio estimators in simple random sampling using information on auxiliary attribute. Pak. J. Stat. Oper. Res. 4(1), 47–53 (2008)
Abd-Elfattah, A.M., Sherpeny, E.A., Mohamed, S.M., Abdou, O.F.: Improvement estimating the population mean in simple random sampling using information on auxiliary attribute. Appl. Math. and Comput. 215, 4198–4202 (2010)
Sharma, P., Verma, H., Sanaullah, A., Singh, R.: A study of some exponential ratio-product type estimators using information on auxiliary attributes under second order approximation. IJSE 12(3), 58–66 (2013a)
Sharma, P., Singh, R., Kim, J.M.: Study of some improved ratio type estimators using information on auxiliary attributes under second order approximation. J. Sci. Res. 57, 138–146 (2013b)
Sharma, P., Singh, R.: Improved estimators in simple random sampling when study variable is an attribute. J. Stat. Appl. Probab. Lett. 2(1), 1–8 (2015)
Singh, R., Kumar, M.: A Family of Estimators of Population Variance Using Information on Auxiliary Attribute. Studies in Sampling Techniques and Time Series Analysis. Zip Publishing, Columbus (2011)
Kadilar, C., Cingi, H.: Ratio estimators in simple random sampling. Appl. Math. Comput. 151, 893–902 (2004)
Singh, R., Malik, S.: Improved estimation of population variance using information on auxiliary attribute in simple random sampling. Appl. Math. Comput. 235, 43–49 (2014)
Koyuncu, N.: Efficient estimators of population mean using auxiliary attributes. Appl. Math. Comput. 218, 10900–10905 (2012)
Sharma, P., Singh, R.: Efficient estimators of population mean in stratified random sampling using auxiliary attribute. World Appl. Sci. J. 27(12), 1786–1791, ISSN: 1818–4952 (2013)
Sukhatme, P.V., Sukhatme, B.V.: Sampling Theory of Surveys with Applications. Iowa State University Press, Ames (1970)
Mukhopadhyay, P.: Theory and Methods of Survey Sampling, 2nd edn. PHI Learning Pvt. Ltd, New Delhi (1998)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Adichwal, N.K., Sharma, P., Verma, H.K. et al. Generalized Class of Estimators for Population Variance Using Auxiliary Attribute. Int. J. Appl. Comput. Math 2, 499–508 (2016). https://doi.org/10.1007/s40819-015-0073-3
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40819-015-0073-3