Misclassification errors in prevalence estimation: Bayesian handling with care

Speybroeck, Niko; Devleesschauwer, Brecht; Joseph, Lawrence; Berkvens, Dirk

doi:10.1007/s00038-012-0439-9

Misclassification errors in prevalence estimation: Bayesian handling with care

Hints & Kinks
Published: 20 December 2012

Volume 58, pages 791–795, (2013)
Cite this article

International Journal of Public Health

Niko Speybroeck¹,
Brecht Devleesschauwer^1,2,
Lawrence Joseph³ &
…
Dirk Berkvens⁴

939 Accesses
57 Citations
2 Altmetric
Explore all metrics

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Berkvens D, Speybroeck N, Praet N, Adel A, Lesaffre E (2006) Estimating disease prevalence in a Bayesian framework using probabilistic constraints. Epidemiology 17:145–153
Article PubMed Google Scholar
Bouwknegt M, Engel B, Herremans MMPT, Widdowson MA, Worm HC, Koopmans MPG, Frankena K, De Roda Husman AM, De Jong MCM, Van Der Poel WHM (2008) Bayesian estimation of hepatitis E virus seroprevalence for populations with different exposure levels to swine in The Netherlands. Epidemiol Infect 136:567–576
Article PubMed CAS Google Scholar
Branscum AJ, Gardner IA, Johnson WO (2005) Estimation of diagnostic-test sensitivity and specificity through Bayesian modeling. Prev Vet Med 68:145–163
Article PubMed CAS Google Scholar
Engel B, Swildens B, Stegeman A, Buist W, De Jong M (2006) Estimation of sensitivity and specificity of three conditionally dependent diagnostic tests in the absence of a gold standard. J Agric Biol Environ Stat 11:360–380
Article Google Scholar
Joseph L, Gyorkos TW, Coupal L (1995) Bayesian estimation of disease prevalence and the parameters of diagnostic tests in the absence of a gold standard. Am J Epidemiol 141(3):263–272
PubMed CAS Google Scholar
Liu P, Xiao S, Shi ZX, Bi XX, Yang HT, Jin H (2011) Bayesian evaluation of the human immunodeficiency virus antibody screening strategy of duplicate enzyme-linked immunosorbent assay in Xuzhou blood center, China. Transfusion 51:793–798
Article PubMed Google Scholar
Lunn DJ, Thomas A, Best N, Spiegelhalter D (2000) WinBUGS—A Bayesian modelling framework: concepts, structure, and extensibility. Stat Comput 10:325–337
Article Google Scholar
Lunn D, Spiegelhalter D, Thomas A, Best N (2009) The BUGS project: evolution, critique and future directions (with discussion). Stat Med 28:3049–3082
Article PubMed Google Scholar
Rogan WJ, Gladen B (1978) Estimating prevalence from the results of a screening test. Am J Epidemiol 107:71–76
PubMed CAS Google Scholar
Speybroeck N, Praet N, Claes F, Van Hong N, Torres K, Mao S, Van den Eede P, Thi Thinh T, Gamboa D, Sochantha T, Thang ND, Coosemans M, Büscher P, D’Alessandro U, Berkvens D, Erhart A (2011) True versus apparent malaria infection prevalence: the contribution of a Bayesian approach. PLoS ONE 6(2):e16705
Article PubMed CAS Google Scholar
Speybroeck N, Williams CJ, Lafia KB, Devleesschauwer B, Berkvens D (2012) Estimating the prevalence of infections in vector populations using pools of samples. Med Vet Entomol 26:361–371
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Health and Society, Université catholique de Louvain, Brussels, Belgium
Niko Speybroeck & Brecht Devleesschauwer
Department of Virology, Parasitology and Immunology, Faculty of Veterinary Medicine, Ghent University, Merelbeke, Belgium
Brecht Devleesschauwer
Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, QC, Canada
Lawrence Joseph
Department of Biomedical Sciences, Institute of Tropical Medicine, Antwerp, Belgium
Dirk Berkvens

Authors

Niko Speybroeck
View author publications
You can also search for this author in PubMed Google Scholar
Brecht Devleesschauwer
View author publications
You can also search for this author in PubMed Google Scholar
Lawrence Joseph
View author publications
You can also search for this author in PubMed Google Scholar
Dirk Berkvens
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Niko Speybroeck.

Appendix 1: Bayesian approach for estimating “true” prevalence from “apparent” prevalence

To introduce the Bayesian approach for estimating “true” prevalence (pi) from “apparent” prevalence (p), we build on the illustrations from the main text. In these examples, we had assumed a test sensitivity (SE) of 80 % and a test specificity (SP) of 90 %. We then estimated “true” prevalence from an “apparent” prevalence of 24 and 5 %, using the Rogan–Gladen estimator (Eq. (2)). This resulted in estimates of, respectively, 20 and −48 %.

To parameterize the Bayesian model, we need information on the sample size (n), the number of positive test results (x), and an a priori probability distribution for “true” prevalence (pi). As a first illustration, we will continue to assume that SE and SP are known, fixed values (Model 1). This assumption will be relaxed in a further illustration (Model 2).

For simplicity, we will assume that our sample size was 500. An “apparent” prevalence of 24 % would thus have resulted from observing 120 positive test results. Likewise, an “apparent” prevalence of 5 % would have resulted from observing 25 positive results. As prior probability distribution for “true” prevalence, we will apply a uniform distribution ranging from 0 to 100 %. This is a common choice, as it expresses our belief before having observed any data, that the “true” prevalence can take any possible value and that each possible value is equally likely.

This information can now be used to establish the following Bayesian model:

Model 1 Bayesian estimation of true prevalence from apparent prevalence, based on fixed sensitivity and specificity; comments, denoted by the “#” symbol, translate the code in words

We fitted the model in WinBUGS using two chains, each containing 6,000 samples, of which the first 1,000 were discarded as “burn-in”. Typical output is presented in Table 3. As expected, the Bayesian model succeeded in yielding only positive values, thus providing a useful “true” prevalence estimate for our second case.

Table 3 “True” prevalence estimation results from Bayesian model 1

Full size table

The Bayesian approach makes it possible to flexibly account for uncertainty in the values for SE and SP. Instead of assuming fixed values, we assume in Bayesian model 2 that SE can take any possible value between 70 and 90 %, and that SP can take any possible value between 85 and 95 %. For the sake of comparability, these values were chosen so that their means would correspond to the fixed values applied before.

Model 2 Bayesian estimation of true prevalence from apparent prevalence, based on stochastic sensitivity and specificity; comments, denoted by the “#” symbol, translate the code in words

Again, the model was fitted in WinBUGS using two chains, each containing 1,000 “burn-in” samples and 5,000 retained samples. Typical (numerical and graphical) output is presented in Table 4. The results are similar to those obtained by Model 1, but show wider credibility intervals, owing to the additional uncertainty introduced by defining SE and SP as stochastic nodes.

Table 4 “True” prevalence estimation results from Bayesian model 2

Full size table

Rights and permissions

Reprints and permissions

About this article

Cite this article

Speybroeck, N., Devleesschauwer, B., Joseph, L. et al. Misclassification errors in prevalence estimation: Bayesian handling with care. Int J Public Health 58, 791–795 (2013). https://doi.org/10.1007/s00038-012-0439-9

Download citation

Received: 28 September 2012
Accepted: 03 December 2012
Published: 20 December 2012
Issue Date: October 2013
DOI: https://doi.org/10.1007/s00038-012-0439-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Misclassification errors in prevalence estimation: Bayesian handling with care

Access this article

References

Author information

Authors and Affiliations

Corresponding author

Appendix 1: Bayesian approach for estimating “true” prevalence from “apparent” prevalence

Appendix 1: Bayesian approach for estimating “true” prevalence from “apparent” prevalence

Model 1 Bayesian estimation of true prevalence from apparent prevalence, based on fixed sensitivity and specificity; comments, denoted by the “#” symbol, translate the code in words

Model 2 Bayesian estimation of true prevalence from apparent prevalence, based on stochastic sensitivity and specificity; comments, denoted by the “#” symbol, translate the code in words

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation