Using ‘found’ data to augment a probability sample: Procedure and case study

Mc Overton, J. C.; Young, T. C.; Overton, W. S.

doi:10.1007/BF00555062

Using ‘found’ data to augment a probability sample: Procedure and case study

Published: May 1993

Volume 26, pages 65–83, (1993)
Cite this article

Environmental Monitoring and Assessment Aims and scope Submit manuscript

J. C. Mc Overton¹,
T. C. Young² &
W. S. Overton³

148 Accesses
37 Citations
3 Altmetric
Explore all metrics

Abstract

While probability sampling has the advantage of permitting unbiased population estimates, many past and existing monitoring schemes do not employ probability sampling. We describe and demonstrate a general procedure for augmenting an existing probability sample with data from nonprobability-based surveys (‘found’ data). The procedure, first proposed by Overton (1990), uses sampling frame attributes to group the probability and found samples into similar subsets. Subsequently, this similarity is assumed to reflect the representativeness of the found sample for the matching subpopulation. Two methods of establishing similarity and producing estimates are described: pseudo-random and calibration. The pseudo-random method is used when the found sample can contribute additional information on variables already measured for the probability sample, thus increasing the effective sample size. The calibration method is used when the found sample contributes information that is unique to the found observations. For either approach, the found sample data yield observations that are treated as a probability sample, and population estimates are made according to a probability estimation protocol. To demonstrate these approaches, we applied them to found and probability samples of stream discharge data for the southeastern US.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Resampling of spatially correlated data with preferential sampling for the estimation of frequency distributions and semivariograms

Article 09 July 2016

Spatially balanced sampling designs for environmental surveys

Article 30 July 2019

Spatially Clustered Survey Designs

Article Open access 28 July 2023

References

Messer, J.J., Ariss, C.W., Baker, J.R., Drouse, S.K., Eshleman, K.N., Kaufmann, P.R., Linthurst, R.A., Omernik, J.M., Overton, W.S., Sale, M.J., Schonbrod, R.D., Stambaugh, S.M. and Tuschall, J.R. Jr.: 1986, ‘National Surface Water Survey: National Stream Survey Phase I — Pilot Survey’, EPA/600/4-86/026. U.S. Environmental Protection Agency, Office of Research and Development, Washington DC.
Google Scholar
Overton, W.S.: 1987, ‘A Sampling and Analysis Plan for Streams, in the National Surface Water Survey Conducted by the EPA’, Technical Report 117, Department of Statistics, Oregon State University, Corvallis.
Google Scholar
Overton, W.S.: 1989, ‘Calibration Methodology for the Double Sample Structure of the National Lake Survey Phase II Sample’, Technical Report 130, Department of Statistics, Oregon State University, Corvallis.
Google Scholar
Overton, W.S.: 1990, ‘A Strategy for Use of Found Samples in a Rigorous Monitoring Design’, Technical Report 139, Department of Statistics, Oregon STate University, Corvallis.
Google Scholar
Smith, B.G.: 1987, ‘CLUSB, Version 3, Recording for Microcomputer and Manual Revision’, Unpublished manuscript.
Young, T.C., DePinto, J.V. and Heidtke, T.M.: 1988, ‘Some Factors Affecting Fluvial Load Estimation Efficiency’,Water Resources Research 24, 1535–1540.
Google Scholar

Download references

Author information

Authors and Affiliations

Biology Department, UCLA, 90024, Los Angeles, CA, USA
J. C. Mc Overton
Department of Civil and Environmental Engineering, Clarkson University, 13676, Potsdam, NY, USA
T. C. Young
Department of Statistics, Oregon State University, 97331-4606, Corvallis, OR, USA
W. S. Overton

Authors

J. C. Mc Overton
View author publications
You can also search for this author in PubMed Google Scholar
T. C. Young
View author publications
You can also search for this author in PubMed Google Scholar
W. S. Overton
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Mc Overton, J.C., Young, T.C. & Overton, W.S. Using ‘found’ data to augment a probability sample: Procedure and case study. Environ Monit Assess 26, 65–83 (1993). https://doi.org/10.1007/BF00555062

Download citation

Received: 15 December 1991
Revised: 15 July 1992
Issue Date: May 1993
DOI: https://doi.org/10.1007/BF00555062

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Using ‘found’ data to augment a probability sample: Procedure and case study

Abstract

Access this article

Similar content being viewed by others

Resampling of spatially correlated data with preferential sampling for the estimation of frequency distributions and semivariograms

Spatially balanced sampling designs for environmental surveys

Spatially Clustered Survey Designs

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Using ‘found’ data to augment a probability sample: Procedure and case study

Abstract

Access this article

Similar content being viewed by others

Resampling of spatially correlated data with preferential sampling for the estimation of frequency distributions and semivariograms

Spatially balanced sampling designs for environmental surveys

Spatially Clustered Survey Designs

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation