Abstract
While population size data at the national and subnational levels are critical for planning and resource allocation, most PSE studies are conducted in a few selected cities and towns due to resource and time constraints. Extrapolation methods to estimate the population size for locations not included in the PSE exercise are critical to providing data for all subnational jurisdictions and the entire country as a whole. The decision on how to select cities or towns that reflect best the national picture is a critical first step. You may need to include cities or towns from areas with low, medium, or high prevalence of your target population(s). Having some data from all these (three) areas increase the power of your study and helps to better extrapolate the results to unobserved areas, and to the national level. Through this process, data smoothing of subnational data is often needed due to relatively small samples size problem. In this chapter, we describe methods that are required for smoothing subnational data, model subnational data to extrapolate estimates to regional and national levels, and triangulate findings from different size estimation methods by a Bayesian framework to produce credible estimates.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Concato J, Peduzzi P, Holford TR, Feinstein ARJJoce (1995) Importance of events per independent variable in proportional hazards analysis I. Background, Goals, and General Strategy 48(12):1495–1501
Epidemiology Bureau (2016) Department of health, size estimation of key affected populations in the Philippines https://www.aidsdatahub.org/sites/default/files/resource/2015-size-estimation-key-affected-populations-philippines.pdf
Geyer C (1992) Practical Markov chain Monte Carlo. Statistical Science 7(4):473–483
Hastie T, Qian J, Tay K (2021) An introduction to glmnet. https://glmnet.stanford.edu/articles/glmnet.html
Heisterkamp SH, Doornbos G, Gankema M (1993) Disease mapping using empirical Bayes and Bayes methods on mortality statistics in The Netherlands. Stat Med 12(19–20):1895–1913. https://doi.org/10.1002/sim.4780121915. PMID: 8272669
Jalali M, Nikfarjam A, Haghdoost AA, Memaryan N, Tarjoman T, Baneshi MR (2013) Social hidden groups size analyzing: application of count regression models for excess zeros. J Res Health Sci 13(2)
Johnston LG, Prybylski D, Raymond HF, Mirzazadeh A, Manopaiboon C, McFarland W (2013) Incorporating the service multiplier method in respondent-driven sampling surveys to estimate the size of hidden and hard-to-reach populations: case studies from around the world. Sex Transm Dis 40(4):304–310. Epub 2013/03/15. https://doi.org/10.1097/olq.0b013e31827fd650. PubMed PMID: 23486495
Johnston LG, McLaughlin KR, El Rhilani H, Latifi A, Toufik A, Bennani A et al (2015) Estimating the size of hidden populations using respondent-driven sampling data: case examples from morocco. Epidemiology (Cambridge, Mass) 26(6):846–852. Epub 2015/08/11. https://doi.org/10.1097/ede.0000000000000362. PubMed PMID: 26258908; PubMed Central PMCID: PMCPMC4586393
Johnston LG, McLaughlin KR, Rouhani SA, Bartels SA (2017) Measuring a hidden population: a novel technique to estimate the population size of women with sexual violence-related pregnancies in South Kivu Province, Democratic Republic of Congo. J Epidemiol Global Health 7(1):45–53. Epub 2016/09/25. https://doi.org/10.1016/j.jegh.2016.08.003. PubMed PMID: 27663900
Law J, Haining R, Maheswaran R, Pearson TJGA (2006) Analyzing the relationship between smoking and coronary heart disease at the small area level: a Bayesian approach to spatial modeling. 38(2):140–159
Legendre PJE (1993) Spatial autocorrelation: trouble or new paradigm? 74(6):1659–1673
Mercer L, Wakefield J, Chen C, Lumley T (2014) A comparison of spatial smoothing methods for small area estimation with sampling weights. 8:69–85
Mukhopadhyay S, Sahu SK (2018) A Bayesian spatiotemporal model to estimate long-term exposure to outdoor air pollution at coarser administrative geographies in England and Wales. J R Stat Soc A 181:465–486. https://doi.org/10.1111/rssa.12299
National AIDS/STD Control Programme (NASP) (2016) Mapping study and size estimation of key populations in Bangladesh for HIV programs 2015–2016. https://www.aidsdatahub.org/resource/mapping-study-size-estimation-key-populations-bangladesh-hiv-programs-2015-2016
Okal J, Geibel S, Muraguri N, Musyoki H, Tun W, Broz D, Kuria D, Kim A, Oluoch T, Raymond HF (2013) Estimates of the size of key populations at risk for HIV infection: men who have sex with men, female sex workers and injecting drug users in Nairobi, Kenya. Sex Transm Infections 89(5):366–371. https://doi.org/10.1136/sextrans-2013-051071
Reddy A (2010) Estimating the size of populations at high risk of HIV in Bangladesh using a Bayesian hierarchical model
Schlüter BS, Masquelier B (2021) Space-time smoothing of mortality estimates in children aged 5–14 in Sub-Saharan Africa. PLOS ONE 16(1):e0245596. https://doi.org/10.1371/journal.pone.0245596
Sharifi H, Karamouzian M, Baneshi MR, Shokoohi M, Haghdoost A, McFarland W et al (2017) Population size estimation of female sex workers in Iran: synthesis of methods and results. PLoS ONE 12(8): https://doi.org/10.1371/journal.pone.0182755
Sulaberidze L, Mirzazadeh A, Chikovani I, Shengelia N, Tsereteli N, Gotsadze G (2016) Population size estimation of men who have sex with men in Tbilisi, Georgia; Multiple methods and triangulation of findings. PloS One 11(2):e0147413
Tibshirani R (1996) Regression shrinkage and selection via the Lasso. J Royal Stat Soc Series B (Methodological) 58:267–288. https://doi.org/10.1111/j.2517-6161.1996.tb02080.x
Ward MD, Gleditsch KSJMahwfwemLvA (2007) An introduction to spatial regression models in the social sciences 8:2007
Wesson PD, Mirzazadeh A, McFarland W (2018) A Bayesian approach to synthesize estimates of the size of hidden populations: the anchored multiplier. Int J Epidemiol 47(5):1636–1644. https://doi.org/10.1093/ije/dyy132. PMID: 29931067; PMCID: PMC6208278
Wesson PD, McFarland W, Qin CC, Mirzazadeh A (2019) Software application profile: the anchored multiplier calculator-a Bayesian tool to synthesize population size estimates. Int J Epidemiol 48(6):1744–1749. https://doi.org/10.1093/ije/dyz101. PMID: 31106350; PMCID: PMC6929553
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
4.1 Electronic Supplementary Material
Below is the link to the electronic supplementary material.
470757_1_En_4_MOESM1_ESM.xlsx
Supplementary material 1
470757_1_En_4_MOESM2_ESM.docx
Supplementary material 2
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Mirzazadeh, A., Baneshi, M.R. (2021). Data Smoothing, Extrapolation, and Triangulation. In: Rutherford, G. (eds) Methods in Epidemiology. Advances in Experimental Medicine and Biology, vol 1333. Springer, Cham. https://doi.org/10.1007/978-3-030-75464-8_4
Download citation
DOI: https://doi.org/10.1007/978-3-030-75464-8_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-75463-1
Online ISBN: 978-3-030-75464-8
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)