Abstract
There are several reasons why robust regression techniques are useful tools in sampling design. First of all, when stratified samples are considered, one needs to deal with three main issues: the sample size, the strata bounds determination and the sample allocation in the strata. Since the target variable y, objective of the survey, is unknown, it is used some auxiliary information x known for the entire population from which the sample is drawn. Such information is helpful as it is strongly correlated with the target y, but of course some discrepancies between them may arise. The use of auxiliary information, combined with the choice of the appropriate statistical model to estimate the relationship with the variable of interest y, is crucial for the determination of the strata bounds, the size of the sample and the sampling rates according to a chosen precision level of the estimates, as it has been shown by Rivest (2002). Nevertheless, this regression-based approach is highly sensitive to the presence of contaminated data. Indeed, the influence of outlying observations in both y and x has an explosive impact on the variances with the effect of strong departures from the optimum sample allocation. Therefore, we expect increasing sample sizes in the strata, wrong allocation of sampling units in the strata and some errors in the strata bounds determination. Since the key tool for stratified sampling is the measure of scale of y conditional to the knowledge of some auxiliary x, a robust approach based on S-estimator of regression is proposed in this paper. The aim is to allow for robust sample size and strata bounds determination, together with the optimal sample allocation. To show the advantages of the proposed method, an empirical illustration is provided for Belgian business surveys in the sector of Construction. It is considered a skewed population framework, which is typical for businesses, with a stratified design with one take-all stratum and L − 1 strata. Simulation results are also provided.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Rivest, L.P.: A generalization of Lavallée and Hidiroglou algorithm for stratification in business surveys. Techniques d’enquêtes 28, 207–214 (2002)
Rousseeuw, P.J., Yohai, V.J.: Robust regression by means of S-estimators. In: Franke, J., Hardle, W., Martin Robust, D. (eds.) Nonlinear Time Series. Lecture Notes in Statistics. vol. 26, pp. 256–272. Springer, Berlin (1984)
Tillé, Y.: Théorie des sondages. Dunod, Paris (2001)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Bramati, M.C. (2014). Response Burden Reduction Through the Use of Administrative Data and Robust Sampling. In: Crescenzi, F., Mignani, S. (eds) Statistical Methods and Applications from a Historical Perspective. Studies in Theoretical and Applied Statistics(). Springer, Cham. https://doi.org/10.1007/978-3-319-05552-7_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-05552-7_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05551-0
Online ISBN: 978-3-319-05552-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)