Abstract
Population attributable fraction (PAF) refers to the proportion of all cases with a particular outcome in a population that could be prevented by eliminating a specific exposure. The authors of a recent paper evaluated the prevalence and estimated the PAFs for risk factors of TB among elderly people in China [Inf Dis Poverty. 2019;8:7]. Confounding is inevitable in observational studies and Levin’s formula is of limited use in practice for unbiasedly estimating PAF. In a complex survey design, an unbiased estimation of the PAF can be calculated using a sampleweighted version of the Miettinen formula or a sample weighed parametric gformula. With respect to causal interpretation of PAF in public health setting, computation of PAF is logical and practical when the exposure is amenable to intervention.
Multilingual abstracts
Please see Additional file 1 for translations of the abstract into the five official working languages of the United Nations.
To the Editor.
We read with great interest a recent article titled [1]: “Prevalence and risk factors of active pulmonary tuberculosis among elderly people in China: a population based crosssectional study”. The authors evaluated the prevalence and identify the risk factors of TB among elderly people in China using a crosssectional study. However, there are several concerns in the analysis.

i).
In the statistical analysis section it was indicated that population attributable fraction (PAF) of each adjusted risk factor was estimated using Levin’s formula where RR is the risk ratio and p_{e} means proportion of population exposed to risk factors [2].
In the study, the adjusted odds ratio (OR) was used in place of RR.
PAF refers to the proportion of all cases with a particular outcome in a population that could be prevented by eliminating a specific exposure [3]. Formula 1 is unbiased in the absence of confounding and effect modification [3, 4]. Observational studies are subject to confounding which will lead to bias if Levin’s formula is inappropriately applied to estimate PAFs [3]. The Levin’s formula is valid only for unadjusted risk ratio [3,4,5]. The bias from this error will depend on the degree of confounding [6]. For a dichotomous exposure an unbiased estimation of PAF can be calculated using the Miettinen’s formula [7].
Where RR_{adj} is the adjusted risk ratio and p_{c} is the prevalence of exposure among the cases. This produces valid estimate in the presence of confounding, assuming exposure status and confounders are accurately measured and adjusted for.
As an example, in this study [1] the adjusted OR and prevalence of diabetes in the active TB cases was reported 1.83 (1.08–3.10) and 16/193, respectively. The PAF using formula 2 is 3.76% which is less than the reported value (5.52%).

ii).
The term “attributable” refers to a causal interpretation [3]. One of the main assumptions underlying the PAF is no bias in study design. Therefore, the application of formula 2 in cohort design is acceptable but for casecontrol and crosssectional studies, it needs more considerations. In a crosssectional study, reverse causality and prevalenceincidence bias are the main concerns for assessing the effect of the exposure on the outcome.

iii).
Another potential source of bias in the study is failure to adjust observed estimates of the prevalence of TB and exposure to risk factors for the complex sampling design employed. With such a design, the population prevalence should be adjusted using inverse probability weighting (IPW) so that the reported prevalence is appropriately adjusted for multistage and disproportionate sampling [8]. Further, the authors do not mention whether or how clustering was taken account of in the multivariable logistic regression modeling.

iv).
For complex survey designs, it is necessary to adjust PAFs for the complex sampling design [9, 10]. PAF can be computed as a sampleweighted version of the Miettinen or Bruzzi formula (formula 2 in reference [9] or formula 3 in reference [10] or sampleweighted modelbased standardization, also known as parametric gformula (formula 3 in reference [9] or formula 4 in reference [10].

v).
As PAF is a function of the prevalence of exposure, selfreported measurement of exposures in this study can be lead to bias in the estimation of PAF. As reported in this study, the selfreported and local health documentation search of diabetes was not sufficient to estimate the real distribution.

vi).
With respect to causal interpretation of PAF in a public health setting, computation of PAF is logical and practical when the exposure is amenable to intervention [6]. Therefore, it is less apparent why the attributable fraction for unmodifiable risk factors such as age and sex may be of use.
In sum, unbiased estimation of PAF requires several assumptions which are often ignored in practice. We recommend using sampleweighted version of Miettinen formula or sample weighed parametric gformula [3, 11].
Availability of data and materials
Not applicable.
Abbreviations
 PAF:

Population attributable fraction
References
Zhang CY, Zhao F, Xia YY, Yu YL, Shen X, Lu W, et al. Prevalence and risk factors of active pulmonary tuberculosis among elderly people in China: a population based crosssectional study. Infect Dis Poverty. 2019;8(1):7.
Darrow LA. Commentary: errors in estimating adjusted attributable fractions. Epidemiology. 2014;25(6):917–8.
Mansournia MA, Altman DG. Population attributable fraction. BMJ. 2018;360:k757.
Darrow LA, Steenland NK. Confounding and bias in the attributable fraction. Epidemiology. 2011;22(1):53–8.
Flegal KM. Bias in calculation of attributable fractions using relative risks from nonsmokers only. Epidemiology. 2014;25(6):913–6.
Rockhill B, Newman B, Weinberg C. Use and misuse of population attributable fractions. Am J Public Health. 1998;88(1):15–9.
Miettinen OS. Proportion of disease caused or prevented by a given exposure, trait or intervention. Am J Epidemiol. 1974;99(5):325–32.
Mansournia MA, Altman DG. Inverse probability weighting. BMJ. 2016;352:i189.
Heeringa SG, Berglund PA, West BT, Mellipilan ER, Portier K. Attributable fraction estimation from complex sample survey data. Ann Epidemiol. 2015;25(3):174–8.
Graubard BI, Fears TR. Standard errors for attributable risk for simple and complex sample designs. Biometrics. 2005;61(3):847–55.
Mansournia MA, Etminan M, Danaei G, Kaufman JS, Collins G. Handling time varying confounding in observational research. BMJ. 2017;359:j4587.
Acknowledgements
The authors are also grateful to the reviewer for her comments which greatly improved this paper.
Funding
Not applicable.
Author information
Authors and Affiliations
Contributions
A.Kh wrote the paper and MA. M revised the paper. All authors approved the final version of the paper.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests or financial disclosure about this publication.
Additional file
Additional file 1:
Multilingual abstracts in the five official working languages of the United Nations. (PDF 485 kb)
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Khosravi, A., Mansournia, M.A. Recommendation on unbiased estimation of population attributable fraction calculated in “prevalence and risk factors of active pulmonary tuberculosis among elderly people in China: a population based crosssectional study”. Infect Dis Poverty 8, 75 (2019). https://doi.org/10.1186/s4024901905878
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s4024901905878
Keywords
 Population attributable fraction
 Confounding
 Sampleweighted parametric gformula