Skip to main content

Data Smoothing, Extrapolation, and Triangulation

  • Chapter
  • First Online:
Methods in Epidemiology

Part of the book series: Advances in Experimental Medicine and Biology ((AEMB,volume 1333))

  • 615 Accesses

Abstract

While population size data at the national and subnational levels are critical for planning and resource allocation, most PSE studies are conducted in a few selected cities and towns due to resource and time constraints. Extrapolation methods to estimate the population size for locations not included in the PSE exercise are critical to providing data for all subnational jurisdictions and the entire country as a whole. The decision on how to select cities or towns that reflect best the national picture is a critical first step. You may need to include cities or towns from areas with low, medium, or high prevalence of your target population(s). Having some data from all these (three) areas increase the power of your study and helps to better extrapolate the results to unobserved areas, and to the national level. Through this process, data smoothing of subnational data is often needed due to relatively small samples size problem. In this chapter, we describe methods that are required for smoothing subnational data, model subnational data to extrapolate estimates to regional and national levels, and triangulate findings from different size estimation methods by a Bayesian framework to produce credible estimates.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Concato J, Peduzzi P, Holford TR, Feinstein ARJJoce (1995) Importance of events per independent variable in proportional hazards analysis I. Background, Goals, and General Strategy 48(12):1495–1501

    Google Scholar 

  • Epidemiology Bureau (2016) Department of health, size estimation of key affected populations in the Philippines https://www.aidsdatahub.org/sites/default/files/resource/2015-size-estimation-key-affected-populations-philippines.pdf  

  • Geyer C (1992) Practical Markov chain Monte Carlo. Statistical Science 7(4):473–483 

    Google Scholar 

  • Hastie T, Qian J,  Tay K (2021) An introduction to glmnet. https://glmnet.stanford.edu/articles/glmnet.html

  • Heisterkamp SH, Doornbos G, Gankema M (1993) Disease mapping using empirical Bayes and Bayes methods on mortality statistics in The Netherlands. Stat Med 12(19–20):1895–1913. https://doi.org/10.1002/sim.4780121915. PMID: 8272669

  • Jalali M, Nikfarjam A, Haghdoost AA, Memaryan N, Tarjoman T, Baneshi MR (2013) Social hidden groups size analyzing: application of count regression models for excess zeros. J Res Health Sci 13(2)

    Google Scholar 

  • Johnston LG, Prybylski D, Raymond HF, Mirzazadeh A, Manopaiboon C, McFarland W (2013) Incorporating the service multiplier method in respondent-driven sampling surveys to estimate the size of hidden and hard-to-reach populations: case studies from around the world. Sex Transm Dis 40(4):304–310. Epub 2013/03/15. https://doi.org/10.1097/olq.0b013e31827fd650. PubMed PMID: 23486495

  • Johnston LG, McLaughlin KR, El Rhilani H, Latifi A, Toufik A, Bennani A et al (2015) Estimating the size of hidden populations using respondent-driven sampling data: case examples from morocco. Epidemiology (Cambridge, Mass) 26(6):846–852. Epub 2015/08/11. https://doi.org/10.1097/ede.0000000000000362. PubMed PMID: 26258908; PubMed Central PMCID: PMCPMC4586393

  • Johnston LG, McLaughlin KR, Rouhani SA, Bartels SA (2017) Measuring a hidden population: a novel technique to estimate the population size of women with sexual violence-related pregnancies in South Kivu Province, Democratic Republic of Congo. J Epidemiol Global Health 7(1):45–53. Epub 2016/09/25. https://doi.org/10.1016/j.jegh.2016.08.003. PubMed PMID: 27663900

  • Law J, Haining R, Maheswaran R, Pearson TJGA (2006) Analyzing the relationship between smoking and coronary heart disease at the small area level: a Bayesian approach to spatial modeling. 38(2):140–159

    Google Scholar 

  • Legendre PJE (1993) Spatial autocorrelation: trouble or new paradigm? 74(6):1659–1673

    Google Scholar 

  • Mercer L, Wakefield J, Chen C, Lumley T (2014) A comparison of spatial smoothing methods for small area estimation with sampling weights. 8:69–85

    Google Scholar 

  • Mukhopadhyay S, Sahu SK (2018) A Bayesian spatiotemporal model to estimate long-term exposure to outdoor air pollution at coarser administrative geographies in England and Wales. J R Stat Soc A 181:465–486. https://doi.org/10.1111/rssa.12299

  • National AIDS/STD Control Programme (NASP) (2016) Mapping study and size estimation of key populations in Bangladesh for HIV programs 2015–2016. https://www.aidsdatahub.org/resource/mapping-study-size-estimation-key-populations-bangladesh-hiv-programs-2015-2016

  • Okal J, Geibel S, Muraguri N, Musyoki H, Tun W, Broz D, Kuria D, Kim A, Oluoch T, Raymond HF (2013) Estimates of the size of key populations at risk for HIV infection: men who have sex with men, female sex workers and injecting drug users in Nairobi, Kenya. Sex Transm Infections 89(5):366–371. https://doi.org/10.1136/sextrans-2013-051071

  • Reddy A (2010) Estimating the size of populations at high risk of HIV in Bangladesh using a Bayesian hierarchical model

    Google Scholar 

  • Schlüter BS, Masquelier B (2021) Space-time smoothing of mortality estimates in children aged 5–14 in Sub-Saharan Africa. PLOS ONE 16(1):e0245596. https://doi.org/10.1371/journal.pone.0245596

  • Sharifi H, Karamouzian M, Baneshi MR, Shokoohi M, Haghdoost A, McFarland W et al (2017) Population size estimation of female sex workers in Iran: synthesis of methods and results. PLoS ONE 12(8): https://doi.org/10.1371/journal.pone.0182755

  • Sulaberidze L, Mirzazadeh A, Chikovani I, Shengelia N, Tsereteli N, Gotsadze G (2016) Population size estimation of men who have sex with men in Tbilisi, Georgia; Multiple methods and triangulation of findings. PloS One 11(2):e0147413

    Google Scholar 

  • Tibshirani R (1996) Regression shrinkage and selection via the Lasso. J Royal Stat Soc Series B (Methodological) 58:267–288. https://doi.org/10.1111/j.2517-6161.1996.tb02080.x

  • Ward MD, Gleditsch KSJMahwfwemLvA (2007) An introduction to spatial regression models in the social sciences 8:2007

    Google Scholar 

  • Wesson PD, Mirzazadeh A, McFarland W (2018) A Bayesian approach to synthesize estimates of the size of hidden populations: the anchored multiplier. Int J Epidemiol 47(5):1636–1644. https://doi.org/10.1093/ije/dyy132. PMID: 29931067; PMCID: PMC6208278

  • Wesson PD, McFarland W, Qin CC, Mirzazadeh A (2019) Software application profile: the anchored multiplier calculator-a Bayesian tool to synthesize population size estimates. Int J Epidemiol 48(6):1744–1749. https://doi.org/10.1093/ije/dyz101. PMID: 31106350; PMCID: PMC6929553

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ali Mirzazadeh .

Editor information

Editors and Affiliations

4.1 Electronic Supplementary Material

Below is the link to the electronic supplementary material.

470757_1_En_4_MOESM1_ESM.xlsx

Supplementary material 1

470757_1_En_4_MOESM2_ESM.docx

Supplementary material 2

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Mirzazadeh, A., Baneshi, M.R. (2021). Data Smoothing, Extrapolation, and Triangulation. In: Rutherford, G. (eds) Methods in Epidemiology. Advances in Experimental Medicine and Biology, vol 1333. Springer, Cham. https://doi.org/10.1007/978-3-030-75464-8_4

Download citation

Publish with us

Policies and ethics