Construction and case study of a novel lung cancer risk index

Faghani, Ali; Guo, Lei; Wright, Margaret E.; Hughes, M. Courtney; Vaezi, Mahdi

doi:10.1186/s12885-022-10370-4

Construction and case study of a novel lung cancer risk index

Research
Open access
Published: 06 December 2022

Volume 22, article number 1275, (2022)
Cite this article

Download PDF

You have full access to this open access article

BMC Cancer Aims and scope Submit manuscript

Construction and case study of a novel lung cancer risk index

Download PDF

1707 Accesses
1 Citation
Explore all metrics

Abstract

Purpose

This study constructs a lung cancer risk index (LCRI) that incorporates many modifiable risk factors using an easily reproducible and adaptable method that relies on publicly available data.

Methods

We used meta-analysis followed by Analytic Hierarchy Process (AHP) to generate a lung cancer risk index (LCRI) that incorporates seven modifiable risk factors (active smoking, indoor air pollution, occupational exposure, alcohol consumption, secondhand smoke exposure, outdoor air pollution, and radon exposure) for lung cancer. Using county-level population data, we then performed a case study in which we tailored the LCRI for use in the state of Illinois (LCRI_IL).

Results

For both the LCRI and the LCRI_IL, active smoking had the highest weights (46.1% and 70%, respectively), whereas radon had the lowest weights (3.0% and 5.7%, respectively). The weights for alcohol consumption were 7.8% and 14.7% for the LCRI and the LCRI_IL, respectively, and were 3.8% and 0.95% for outdoor air pollution. Three variables were only included in the LCRI: indoor air pollution (18.5%), occupational exposure (13.2%), and secondhand smoke exposure (7.6%). The Consistency Ratio (CR) was well below the 0.1 cut point. The LCRI_IL was moderate though significantly correlated with age-adjusted lung cancer incidence (r = 0.449, P < 0.05) and mortality rates (r = 0.495, P < 0.05).

Conclusion

This study presents an index that incorporates multiple modifiable risk factors for lung cancer into one composite score. Since the LCRI allows data comprising the composite score to vary based on the location of interest, this measurement tool can be used for any geographic location where population-based data for individual risk factors exist. Researchers, policymakers, and public health professionals may utilize this framework to determine areas that are most in need of lung cancer-related interventions and resources.

View this article's peer review reports

Lung cancer mortality attributable to residential radon: a systematic scoping review

Article 28 December 2022

Bridging the etiologic and prognostic outlooks in individualized assessment of absolute risk of an illness: application in lung cancer

Article 11 July 2016

Meta-analysis of case–control studies on the relationship between lung cancer and indoor radon exposure

Article 08 December 2018

Introduction

Cancer is the second leading cause of death in the US, with lung cancer accounting for almost one-quarter of these deaths. The American Cancer Society estimates that 236,740 new lung cancers will be diagnosed in 2022, and this disease will claim the lives of more than 130,000 men and women [1]. Numerous studies have examined risk factors for lung cancer, with smoking being the single largest contributor to the disease [2,3,4,5,6,7,8,9,10,11]. Other established risk factors include age [12], secondhand smoke exposure [13], environmental exposures (radon [14], indoor and outdoor air pollution [15, 16]), occupational exposures [17], diet [18], alcohol consumption [19], genetic predisposition [20], previous lung disease [21], and arsenic exposure [22]. Many of these risk factors are modifiable, including active smoking and secondhand smoke exposure, environmental exposures, occupational exposures, alcohol consumption, and diet [23].

Although many studies have investigated associations between individual risk factors and lung cancer risk or mortality [20,21,22,23,24,25,26,27,28,29,30,31,32], less is known about how these factors interact to influence the development and progression of the disease. Some studies have examined interactions between smoking and one other risk factor, such as radon, alcohol consumption, family history, previous lung disease, or some component of diet [33]. To our knowledge, there are few, if any, studies that simultaneously investigated the contribution of more than two modifiable risk factors for lung cancer. This may be because epidemiologic studies are often limited in their ability to consider multiple factors simultaneously, given limited sample sizes and ranges of exposures within their study populations [34].

To address this gap, we constructed a Lung Cancer Risk Index (LCRI) that incorporates several modifiable risk factors using Meta-Analytic Hierarchy Process (Meta-AHP). While this approach has been used in the soil science field [35], it has not been commonly employed in the health sciences. Meta-AHP may be superior to a traditional principal component analysis approach because Meta-AHP can effectively extract essential variables and assign weights more precisely. We tailored this index for use in a case study of the state of Illinois; the LCRI_IL was created using publicly available county-level data for all 102 Illinois counties. We then evaluated the correlation between the LCRI_IL and reported lung cancer incidence and mortality rates. We provide researchers with an easily reproducible and adaptable method that uses publicly available data to generate a composite measure that integrates multiple modifiable risk factors for lung cancer. This measure can be tailored for any geographic area and is potentially widely applicable. Public health officials and policymakers may consider using this measure when making decisions regarding lung cancer-related interventions and resource allocation in their communities.

Methods

Figure 1 shows the process that we used to generate the lung cancer risk index (LCRI). Each step in the figure is explained in detail below.

Step 1: identify relevant articles: search strategy and article selection

Using the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [36], we conducted searches of PubMed (including MEDLINE) and Google Scholar for full-length articles that were published between January 1990 and April 2021. We utilized the following keyword strings to capture relevant studies: “lung cancer” in conjunction with one of the following—“smoking,” “passive smoking,” “secondhand smoke,” “radon,” “occupation,” “air pollution,” “alcohol consumption,” or “risk factors.” We did not include diet in our index because the World Cancer Research Fund (WCRF) and American Institute for Cancer Research (AICR) consider there to be “limited evidence” that diet is a risk factor for lung cancer [37]. We chose to exclude arsenic exposure from our index because the US public water supply levels are kept below 50 µg/L [38, 39], which is far below concentrations associated with increased lung cancer risk [22, 40]. Nevertheless, researchers in other countries should consider adding arsenic to an LCRI adapted for use in their locations. We assessed the quality of the articles included in the present study using appraisal checklists and criteria of quality recommended by JBI (formerly known as "Joanna Briggs Institute"), an international organization focused on improving evidence as it relates to the feasibility, appropriateness, meaningfulness, and effectiveness of healthcare interventions [41].

As shown in Fig. 2, the initial literature search yielded 1197 articles. We removed 268 articles that were duplicates, not peer-reviewed prior to publication, or written in languages other than English. We then reviewed the abstracts of the 929 remaining articles and applied the study inclusion criteria: (1) randomized controlled trial, prospective cohort study, retrospective cohort study, case-cohort study, case–control study, or nested case–control study; (2) reported the relative risk (RR) or odds ratio (OR) associated with increased risk (i.e., RR or OR > 1, which is a requirement of the Analytic Hierarchy Process (AHP) model); and (3) reported 95% confidence intervals (CIs). After excluding 877 articles that did not meet the inclusion criteria specified above, at least two researchers reviewed the full text of the remaining 52 manuscripts [42].

Steps 2a / 2b: meta-analysis

The second step in creating our index was to extract the adjusted OR and RR from all 52 articles for each lung cancer risk factor examined (Additional file Table 1). Next, a weighted average of study-specific estimates using inverse variance weights was derived for each risk factor [43] to increase the accuracy of outcomes [44, 45]. The potential for publication bias was evaluated by funnel plots and the methods described by Egger et al. [46] and Begg et al. [47]. Using a random-effects model, we analyzed the studies and considered heterogeneity and within-study variance [48]. We evaluated heterogeneity using Cochrane’s Q-statistic [49] and the I² inconsistency statistical tests [50].

We considered the OR to be a good approximation of the RR for our analysis, which is reasonable when the outcome is rare [51]. We used the OR and the logOR and calculated standard errors (SEs) as data points for the meta-analysis. All statistical manipulations were conducted using the meta-analysis package for R (metaphor Version 2, MA, USA).

Steps 3a-3c: Analytic Hierarchy Process (AHP)

The third step in creating our index was to use the results of our meta-analysis as inputs for the AHP analysis and to generate weights for each risk factor. AHP is one of the most widely used Multi-Criterion Decision Making (MCDM) methods [52] and has been increasingly implemented in health care, including cancer research [53,54,55,56,57]. AHP can quantitatively prioritize risk factors by producing weights for each factor, making it an ideal method to apply in this study. For each included modifiable risk factor, we used the OR derived from our meta-analysis as input variables in the AHP. Using the values from meta-analysis and the assessment matrix, we created the pair-wise comparison matrix (i.e., a matrix to compare risk factors in pairs to evaluate their relative importance). We created an assessment matrix with numbers that pair with different importance levels. For example, 1, 3, 5, 7, and 9 pair with equal, weak, obvious, intense, and extreme importance, while 2, 4, 6, and 8 pair with intermediate importance, respectively [58] (Additional file Table 1).

The relative importance of smoking versus all other included risk factors was assigned considering the assessment matrix. This step was then repeated for all other remaining risk factors. Next, an n by n matrix was created where n represented the number of modifiable risk factors. Next, we solved the linear system, where A is the coefficient matrix using Eq. 1:

$$AX=\lambda X or \left(A-\lambda {I}_{n}\right)X=0$$

(1)

where A is the comparison matrix of order n, and λ is one of its eigenvalues. X represents the eigenvector of A associated with λ, and A-λI_n represents the matrix coefficient. We used MATLAB (MathWorks, Massachusetts, USA) to calculate the eigenvalues and eigenvectors of the matrix [59]. Then we used the derived eigenvector to specify the weights of each risk factor where the eigenvector represented the index coefficient. Next, we estimated the contribution of each risk factor to lung cancer. We then calculated the z score and considered the z score as the corresponding value in the index. Finally, z-scores were converted to percentiles for mapping purposes.

We used the Consistency Ratio (CR) to verify the reliability of our results. To do this, we first calculated the Consistency Index (CI₁) using the following equation:

$${CI}_{1}= (\lambda max-n)/(n-1)$$

(2)

where $\lambda max$ was the maximum eigenvalue and n represented the order of the matrix. Accordingly, the CR was calculated by dividing the CI₁ by the index for the corresponding Random Index (RI) using the following equation:

$${CI}_{1}= {CI}_{1}/RI$$

(3)

Saaty [60] has presented the values for RI considering the matrix size. Also, Saaty [60] suggested that the CR needs to be less than 0.1 to produce consistent results.

Results

As shown in Table 1, the process that we used to create the LCRI yielded the highest weight for active smoking (46.1%) and the lowest weight for radon exposure (3.0%). The CR of the AHP analysis for the present study was 0.07, well below the 0.1 cut point that demonstrates consistency of the analysis.

Table 1 Overall effect size and final weights for modifiable risk factors included in the Lung Cancer Risk Index (LCRI)

Full size table

We used the weights in Table 1 to produce the LCRI:

$$LCRI=0.461{A}_{1}+0.185{A}_{2}+0.132{A}_{3}+0.078{A}_{4}+0.076{A}_{5}+0.038{A}_{6}+0.030{A}_{7}$$

(4)

where A₁ to A₇ represent each included modifiable risk factor, as listed in Table 1. It should be noted that A₁ to A₇ can be values of 0 or 1, where 0 indicates the corresponding risk factor was not in effect and 1 indicates the corresponding risk factor was in effect (i.e., 0 = no exposure and 1 = exposure / risk exists). We calculated the corresponding z score for each geographical area (e.g., if the emitted air pollution for a county is X tons/year, the corresponding value for A₆ would be the corresponding z score which is dependent on the average and variance of emitted air pollution for that specific county compared to all other counties in any state). Developed countries such as the US do not rely on major sources of household air pollution—kerosene, wood, or coal—to generate heat [61, 62], so A₂ is assigned a value of 0 for individuals living in these countries. The $\mathrm{LCRI}$ can take any value between 0 and 1: an LCRI value of 0 means no predicted lung cancer risk (A₁ to A₇ all equal 0), and an LCRI value of 1 represents the highest possible predicted risk of lung cancer.

Case study

We test the adaptability and utility of the LCRI in a case study performed using data for our home state of Illinois. In this case study, we constructed the LCRI_IL – a version of the LCRI that reflects the available population-level data in our state. IL is comprised of 102 counties, some of which are urban and many of which are rural. Forty percent of the state’s population resides in Cook County – home to the City of Chicago. Cook County is the second-most populous county in the nation, with more than 5.2 million racially and ethnically diverse residents [63].

Our first step in creating the LCRI_IL was to collect all necessary risk factor data from publicly available data sources. For all counties, we extracted data for 2014–2019 for active smoking (percentage of adults who are current smokers), radon exposure (pCi/L), outdoor air pollution (concentration of fine particulate matter (PM2.5)), and alcohol consumption (percentage of adults reporting binge or heavy drinking in past 30 days) [64, 65]. There were no publicly available county-level data for secondhand smoke exposure or occupational exposures, so those risk factors were dropped from the LCRI_IL.

The second step in creating the LCRI_IL was to generate weights for each available risk factor using the previously described methods (see Methods, Steps 3a-3c). The weights used in the LCRI_IL were 0.70 for active smoking, 0.14 for alcohol consumption, 0.095 for outdoor air pollution, and 0.057 for radon exposure. The corresponding equation to derive the LCRI_IL is:

$${LCRI}_{IL}=0.701{B}_{1}+0.147{B}_{2}+0.095{B}_{3}+0.057{B}_{4}$$

(5)

where B₁ to B₄ represent active smoking, alcohol consumption, outdoor air pollution, and radon exposure, respectively. The CR of the AHP analysis for the case study was 0.04, which indicated the consistency of the analysis.

Figure 3 shows the prevalence of each individual risk factor that was included in the LCRI_IL, as well as lung cancer outcomes [66], by county across Illinois. There is substantial heterogeneity for each risk factor across the state. Among the top 28 counties that have the highest lung cancer incidence and / or mortality rates, eight are also among the top 20 LCRI_IL counties. These eight counties are predominantly located in rural areas (as defined by the US census, [63]) of Southern and Southeastern Illinois, though one is an urban county located on the east side of the state. Notably, Cook County had the highest LCRI_IL score but among the lowest lung cancer incidence and mortality rates.

Table 2 presents Pearson correlation coefficients between the LCRI_IL z scores, active smoking, and lung cancer incidence and mortality rates. The correlation coefficients between the LCRI_IL and lung cancer incidence and mortality were 0.45 and 0.50, respectively, with both p-values < 0.05. The correlation coefficient between the LCRI_IL and active smoking was high at 0.87, which was expected given that this risk factor had the highest assigned weight in the index.

Table 2 Results of Pearson correlation test between LCRI_IL, percentage of active smokers, age-adjusted lung cancer incidence rate, and age-adjusted lung cancer mortality rate

Full size table

In sensitivity analyses, we examined the magnitude of the correlation coefficient for each component of the LCRI_IL in relation to lung cancer incidence and mortality rates. The correlation coefficient was only statistically significant for active smoking, and the magnitude and significance were similar to that of the LCRI_IL (Table 3). In an additional sensitivity analysis, alcohol consumption was dropped from the LCIR_IL – since it is so highly correlated with active smoking – and the resulting index showed similar correlation with lung cancer incidence and mortality rates (0.496 and 0.545, respectively) as compared to the original index.

Table 3 Sensitivity analysis of individual components of the LCRI_IL in relation to lung cancer outcomes in Illinois

Full size table

Discussion

We created a novel lung cancer risk index (LCRI) that integrates multiple modifiable risk factors into one measure. Active smoking is the predominant risk factor for lung cancer and is linked with 80–90% of lung cancer deaths [25]. As expected, smoking received the highest weight in both our original index (LCRI: 46.1%) and the one that we adapted for use in the state of Illinois (LCRI_IL: 70.1%). Conversely, radon exposure had the lowest weight in each index (LCRI: 3%, LCRI_IL: 5.7%).

Previous studies have largely focused on associations between individual risk factors and lung cancer outcomes [11, 13, 25, 29]. However, there are laboratory, animal, and human data showing that risk factors interact with each other to affect cancer outcomes [67,68,69]. For example, Wu et al. [67] reviewed and highlighted the evidence that cancer causation is multifactorial and suggested that researchers consider the contributions of individual factors and their joint effects on cancer burden. Li et al. showed that gene-smoking interactions play important roles in the etiology of lung cancer 68]. Our index represents an attempt to address these known interactions by using population-based data to capture the combined impact of multiple risk factors for lung cancer into one measure.

Hot spots identified by our index share similar distribution patterns of risk factors from the geospatial analysis. Interestingly, Cook County has the highest LCRI_IL despite low adjusted lung cancer incidence and mortality rates. Although Cook County has moderate to high levels of alcohol consumption, Fine Particulate Matter 2.5, and air pollution, it also has a high ratio of primary care physicians to the population (1050:1, ranked 8th in IL), suggesting greater availability of healthcare resources. This may explain the discordance between Cook County’s LCRI_IL and lung cancer incidence and mortality rates. Counties with high LCRI_IL and high lung cancer incidence or mortality rates are mostly in the rural area of the state with fewer available healthcare resources [70]. This echoes findings from recent studies that cancer mortality rates associated with modifiable risks were higher in rural compared with urban populations [71, 72].

Cancer is a heterogeneous disease [73] with many risk factors at individual and social levels. Our model included the factors studied in the literature where the studies met the criteria for inclusion (e.g., being a modifiable risk factor, having an OR or RR, etc.); however, it is important to note that other non-modifiable factors such as age, gender, and race have been shown to also be strongly associated with lung cancer’s incidence and mortality rates [74]. Nevertheless, the study offers a useful framework that health policymakers and researchers can use to identify and examine potential lung cancer risk factors for their geographical areas.

Our study has several strengths. First, to our knowledge, ours is the first study to use meta-analysis in combination with AHP to create a composite risk index for a specific cancer. Second, our model summarized complex and multi-dimensional factors to provide a tool for use by healthcare decision-makers. Our index includes several major and minor modifiable risk factors rather than a single biomedical factor. Third, our study presents a new approach where researchers and policymakers can utilize databases (e.g., U.S. Centers for Disease Control & Prevention’s Behavioral Risk Factor Surveillance System, U.S. Environmental Protection Agency’s Office of Air Quality Planning and Standards, etc.) at multiple geographic levels to identify areas that may benefit from resource allocation and public health interventions. Additionally, a Meta-AHP approach could potentially be combined with machine learning and deep learning models [75, 76] to analyze risk factors and predict health outcomes more accurately.

There were several limitations to this study. First, the AHP approach only allows for the inclusion of risk estimates greater than 1. As a result, we could not include protective behaviors such as fruit and vegetable consumption in our index. Second, AHP relies directly and exclusively on the magnitude of a single risk estimate generated from the meta-analysis, which is likely an underestimate because the model does not allow for variation in exposure prevalence by region. As an example, radon is widely considered to be the second leading cause of lung cancer, behind cigarette smoking [77]. However, as shown in Table 1, this risk factor received the lowest weight in the index because the risk estimate from the meta-analysis was only 1.24–the smallest magnitude of any factor examined. Third, we could not include secondhand smoke and occupational exposures in our tailored LCRI_IL index because county-level data in Illinois are not publicly available for these two factors. We also did not include non-modifiable risk factors such as age, gender, and race. Fourth, because alcohol consumption and tobacco smoking are strongly correlated, the confounding effect of smoking may impact the weight of alcohol consumption in the LCRI. However, when we removed alcohol consumption from LCRI_IL in a sensitivity test, the resulting index showed similar correlation to lung cancer outcomes. Future research is needed to examine the effect the strong correlation between smoking and alcohol has on the LCRI. Fifth, we imposed a single cut point for each risk factor in our models, while, in actuality, some risk factors may exhibit curvilinear or other types of relationships with cancer outcomes. Finally, the meta-analysis was limited to literature published in 1990 and beyond, and therefore did not capture earlier studies.

Conclusion

We generated a lung cancer risk index that incorporated several modifiable risk factors into one composite score. The index was driven heavily by active smoking, as expected. In addition, the index was modestly correlated with lung cancer outcomes in a case study conducted in Illinois, demonstrating its adaptability and potential utility in numerous geographic locations and potentially in many different fields. Future refinements to the index could include adding other modifiable risk factors, examining the impact of non-modifiable risk factors such as age, gender, and race / ethnicity in the LCRI, performing geographical cluster analysis, and incorporating other health behavior factors in AHP-based cancer risk factor models for lung cancer or other health outcomes.

Availability of data and materials

All data generated or analyzed during this study are included in this published article and its supplementary information files.

References

American Cancer Society.; https://www.cancer.org/. [Cited 20 Apr 2021]. Available from
Malhotra J, et al. (2016) Risk factors for lung cancer worldwide. Eur Respir J. 2016;48(3):889–902.
Article Google Scholar
Wang Q, Gümüş ZH, Colarossi C, Memeo L, Wang X, Kong CY, Boffetta P. Small Cell Lung Cancer: Epidemiology, Risk Factors, Genetic Susceptibility, Molecular Pathology, Screening and Early Detection. Journal of thoracic oncology : official publication of the International Association for the Study of Lung Cancer. 2022;S1556–0864(22)01851–2. Advance online publication.
Gudenkauf FJ, Thrift AP. Preventable causes of cancer in Texas by race/ethnicity: major modifiable risk factors in the population. PLoS ONE. 2022;17(10):e0274905.
Article CAS Google Scholar
Brenner DR, et al. Lung cancer risk in never-smokers: a population-based case-control study of epidemiologic risk factors. BMC Cancer. 2020;10(1):1–9.
Google Scholar
Ridge CA, McErlean AM, Ginsberg MS. Epidemiology of lung cancer. in Seminars in interventional radiology Thieme Medical Publishers. 2013.
Couraud S, et al. Lung cancer in never smokers–a review. Eur J Cancer. 2012;48(9):1299–311.
Article CAS Google Scholar
Weiderpass E. Lifestyle and cancer risk. J Prev Med Public Health. 2010;43(6):459–71.
Article Google Scholar
Dresler C. The changing epidemic of lung cancer and occupational and environmental risk factors. Thorac Cardiovasc Surg. 2013;23(2):113–22.
Google Scholar
Samet JM, et al. Lung cancer in never smokers: clinical epidemiology and environmental risk factors. Clin Cancer Res. 2009;15(18):5626–45.
Article Google Scholar
O’Keeffe LM. Smoking as a risk factor for lung cancer in women and men: a systematic review and meta-analysis. BMJ Open. 2018;8(10): e021611.
Article Google Scholar
Torre LA, Siegel RL, Jemal A. Lung cancer statistics. lung cancer and personalized medicine. Adv Exp Med Biol. 2016;893:1–19.
Article Google Scholar
Taylor R, Najafi F, Dobson A. Meta-analysis of studies of passive smoking and lung cancer: effects of study type and continent. Int J Epidemiol. 2007;36(5):1048–59.
Article Google Scholar
Gawełek E, Drozdzowska B, Fuchs A. Radon as a risk factor of lung cancer. Przegl Epidemiol. 2017;71(1):90–8.
Google Scholar
VENA, J.E. Air pollution as a risk factor in lung cancer. Am J Epidemiol. 1982;116(1):42–56.
Article Google Scholar
Behera D, Balamugesh T. Indoor air pollution as a risk factor for lung cancer in women. JAPI. 2005;53:190–2.
CAS Google Scholar
Gustavsson P, et al. Occupational exposure and lung cancer risk: a population-based case-referent study in Sweden. Am J Epidemiol. 2000;152(1):32–40.
Article CAS Google Scholar
Brennan P, et al. A multicenter case–control study of diet and lung cancer among non-smokers. Cancer Causes Control. 2020;11(1):49–58.
Article Google Scholar
Bandera EV, Freudenheim JL, Vena JE. Alcohol consumption and lung cancer: a review of the epidemiologic evidence. Cancer Epidemiol Biomarkers Prev. 2001;10(8):813–21.
CAS Google Scholar
Kanwal M, Ding XJ, Cao Y. Familial risk for lung cancer. Oncol Lett. 2017;13(2):535–42.
Article CAS Google Scholar
Wu AH, et al. Previous lung disease and risk of lung cancer among lifetime nonsmoking women in the United States. Am J Epidemiol. 1995;141(11):1023–32.
Article CAS Google Scholar
Celik I, et al. Arsenic in drinking water and lung cancer: a systematic review. Environ Res. 2008;108(1):48–55.
Article CAS Google Scholar
Świątkowska B. Modifiable risk factors for the prevention of lung cancer. Rep Prac Oncol Radiother. 2007;12(2):119–24.
Article Google Scholar
Turner MC, et al. Radon and lung cancer in the American Cancer Society cohort. Cancer Epidemiol Prev Biomark. 2011;20(3):438–48.
Article CAS Google Scholar
Yun YH, et al. Relative and absolute risks of cigarette smoking on major histologic types of lung cancer in Korean men. Cancer Epidemiol Prev Biomark. 2005;14(9):2125–30.
Article CAS Google Scholar
Nikić D, Stanković AM. Air pollution as a risk factor for lung cancer. Arch Oncol. 2005;13(2):79–82.
Article Google Scholar
Brenner DR, et al. Alcohol consumption and lung cancer risk: A pooled analysis from the International Lung Cancer Consortium and the SYNERGY study. Cancer Epidemiol. 2019;58:25–32.
Article Google Scholar
Kreienbrock L, et al. Case-control study on lung cancer and residential radon in western Germany. Am J Epidemiol. 2001;153(1):42–52.
Article CAS Google Scholar
Pesch B, et al. Cigarette smoking and lung cancer—relative risk estimates for the major histological types from a pooled analysis of case–control studies. Int J Cancer. 2012;131(5):1210–9.
Article CAS Google Scholar
Lee T, Gany F. Cooking oil fumes and lung cancer: a review of the literature in the context of the US population. J Immigr Minor Health. 2013;15(3):646–52.
Article Google Scholar
Stockwell HG, et al. Enviromental tobacco smoke and lung cancer risk in nonsmoking women. J Natl Cancer Ins. 1992;84(18):1417–22.
Article CAS Google Scholar
Miller AB, et al. Fruits and vegetables and lung cancer: findings from the European prospective investigation into cancer and nutrition. Int J Cancer. 2004;108(2):269–76.
Article CAS Google Scholar
Menvielle G, et al. The role of smoking and diet in explaining educational inequalities in lung cancer incidence. J Natl Cancer Inst. 2009;101(5):321–30.
Article Google Scholar
Spitz MR, et al. A risk model for prediction of lung cancer. J Natl Cancer Inst. 2007;99(9):715–26.
Article Google Scholar
Xue R, Wang C, Liu M, Zhang D, Li K, Li N. A new method for soil health assessment based on Analytic Hierarchy Process and meta-analysis. Sci Total Environ. 2019;650:2771–7.
Article CAS Google Scholar
Liberati A, et al. The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate health care interventions: explanation and elaboration. J Clin Epidemiol. 2009;62(10):e1–34.
Article Google Scholar
World Cancer Research Fund 2021; Lung Cancer. https://www.wcrf.org/dietandcancer/lung-cancer/. [Cited 1 Jun 2021]. Available from
Pontius FW. Appendix G: Listing of Drinking Water Federal Register Notices. Drinking Water Regulation and Health. 2003:53–969.
Federal Register. National Primary Drinking Water Regulations; Arsenic and Clarifications to Compliance and New Source Contaminants Monitoring]. https://www.federalregister.gov/documents/2001/01/22/01-1668/national-primary-drinking-water-regulations-arsenic-and-clarifications-to-compliance-and-new-source. [Cited 1 Jun 2021]. Available from
Boffetta, and C. Borron,. Low-level exposure to arsenic in drinking water and risk of lung and bladder cancer: a systematic review and dose–response meta-analysis. Dose-Response. 2019;17(3):1559325819863634.
Google Scholar
JBI’s Critical Appraisal Tools. 2021. [Cited 1 Mar 2021]. Available from https://jbi.global/critical-appraisal-tools.
Page MJ, et al. (2021) PRISMA 2020 explanation and elaboration: updated guidance and exemplars for reporting systematic reviews. BMJ. 2021;372:160.
Article Google Scholar
Kleinbaum D, Kupper L, Morgenstern H. Principles and quantitative methods. Epidemiologic research. New York: Van Nostrand Reinhold Company; 1982. p. 427-50.
Borenstein M. Introduction to meta-analysis. John Wiley & Sons; 2021.
Book Google Scholar
Hunter JE, Schmidt FL. Methods of meta-analysis: Correcting error and bias in research findings. Sage; 2004.
Egger M, et al. Bias in meta-analysis detected by a simple, graphical test. BMJ. 1997;315(7109):629–34.
Article CAS Google Scholar
Begg CB, Mazumdar M. Operating characteristics of a rank correlation test for publication bias. Biometrics. 1994;50(4):088–1101.
Article Google Scholar
DerSimonian R, Laird N. Meta-analysis in clinical trials. Control Clin Trials. 1986;7(3):177–88.
Article CAS Google Scholar
Cochran WG. The combination of estimates from different experiments. Biometrics. 1954;10(1):101–29.
Article Google Scholar
Higgins JP, et al. Measuring inconsistency in meta-analyses. BMJ. 2003;327(7414):557–60.
Article Google Scholar
Matakidou A, Eisen T, Houlston R. Systematic review of the relationship between family history and lung cancer risk. Br J Cancer. 2005;93(7):825–33.
Article CAS Google Scholar
Mu E, Pereyra-Rojas M. Understanding the analytic hierarchy process, in Practical decision making Springer. 2017;7–22.
Schmidt K, et al. Applying the Analytic Hierarchy Process in healthcare research: a systematic literature review and evaluation of reporting. BMC Med Inform Decis Mak. 2015;15(1):1–27.
Article Google Scholar
Hummel JM, Bridges J, IJzerman, M. J. Group decision making with the analytic hierarchy process in benefit-risk assessment: a tutorial. The Patient. 2014;7(2):129–40.
Article Google Scholar
Sakti CY, Sungkono KR, Sarno R. International Seminar on Application for Technology of Information and Communication (iSemantic). Determination of Hospital Rank by Using Analytic Hierarchy Process (AHP) and Multi Objective Optimization on the Basis of Ratio Analysis (MOORA). 2019:178–183.
Schmidt K, Aumann I, Hollander I, Damm K, Schulenburg J-MG. Applying the Analytic Hierarchy Process in healthcare research: a systematic literature review and evaluation of reporting. BMC Med Inform Decis Mak. 2015;1:234. https://doi.org/10.1186/s12911-015-0234-7.).
Article Google Scholar
Pauer F, Schmidt K, Babac A, Damm K, Frank M, von der Schulenburg JM. Comparison of different approaches applied in Analytic Hierarchy Process - an example of information needs of patients with rare diseases. BMC Med Inform Decis Mak. 2016;16(1):117. https://doi.org/10.1186/s12911-016-0346-8.
Article Google Scholar
Saaty TL. Decision making with the analytic hierarchy process. Int J Serv Sci. 2008;1(1):83–98.
Google Scholar
Moler CB. Numerical computing with MATLAB. SIAM; 2004.
Saaty, T.L. (1989) Group decision making and the AHP, in The analytic hierarchy process.Springer. pp, 59–67.
Bruce N, Perez-Padilla R, Albalak R. Indoor air pollution in developing countries: a major environmental and public health challenge. Bull World Health Organ. 2000;78(9):1078–92.
CAS Google Scholar
WEO-2017 Special Report: Energy Access Outlook, International Energy Agency, 2017 https://webstore.iea.org/weo-2017-special-report-energy-access-outlook. Available from
2020 United States census
Illinois Department Of Public Health Division Of Behavioral Risk Factor Surveillance System. http://www.idph.state.il.us/brfss/. [Cited Aug 2020]. Available from
Illinois Department of Public Health. http://www.idph.illinois.gov. Avaialbe from
United States Cancer Statistics 2021. Centers for Disease Control and Prevention;]. https://gis.cdc.gov/cancer/uscs/#/StateCounty/. [Cited 1 Aug2021]. Available from
Wu S, Zhu W, Thompson P, Hannun YA. Evaluating intrinsic and non-intrinsic cancer risk factors. Nat Commun. 2018;9(1):3490. https://doi.org/10.1038/s41467-018-05467-z.
Article CAS Google Scholar
Li Y, Xiao X, Han Y, Gorlova O, Qian D, Leighl N, Johansen JS, Barnett M, Chen C, Goodman G, Cox A, Taylor F, Woll P, Wichmann HE, Manz J, Muley T, Risch A, Rosenberger A, Arnold SM, Haura EB, Amos CI. Genome-wide interaction study of smoking behavior and non-small cell lung cancer risk in Caucasian population. Carcinogenesis. 2018;39(3):336–46. https://doi.org/10.1093/carcin/bgx113.
Article CAS Google Scholar
Biesalski HK, de MesquitaBuenoChesson BA, Chytil F, Grimble R, Hermus RJ, Köhrle J, Lotan R, Norpoth K, Pastorino U, Thurnham D. European consensus statement on lung cancer: risk factors and prevention lung cancer panel. CA Cancer J Clin. 1998;48(3):167–166.
Article CAS Google Scholar
Health Resources and Administration (HRSA). Rural Access to Health Care Services Request for Information, available from https://www.hrsa.gov/rural-health/rfi-rural-health-care-access
Fogleman AJ, Mueller GS, Jenkins WD. Does where you live play an important role in cancer incidence in the U.S.? Am J Cancer Res. 2015;5(7):2314–9.
Google Scholar
Moss JL, Pinto CN, Srinivasan S, Cronin KA, Croyle RT. Persistent poverty and cancer mortality rates: an analysis of county-level poverty designations. Cancer Epidemiol Biomarkers Prev. 2020;29(10):1949–54. https://doi.org/10.1158/1055-9965.EPI-20-0007.
Article Google Scholar
Hiatt RA, Breen N. The social determinants of cancer: a challenge for transdisciplinary science. Am J Prev Med. 2008;35(2 Suppl):S141–50. https://doi.org/10.1016/j.amepre.2008.05.006.
Article Google Scholar
Stram DO, Park SL, Haiman CA, Murphy SE, Patel Y, Hecht SS, Le Marchand L. Racial/ethnic differences in lung cancer incidence in the multiethnic cohort study: an update. J Natl Cancer Inst. 2019;111(8):811–9.
Article Google Scholar
Afshar P, Mohammadi A, Tyrrell N, Cheung P, Sigiuk A, Plataniotis KN, Nguyen ET, Oikonomou A. [Formula: see text]: deep learning-based radiomics for the time-to-event outcome prediction in lung cancer. Sci Rep. 2020;10(1):12366. https://doi.org/10.1038/s41598-020-69106-8.
Article CAS Google Scholar
Lima E, Gorski E, Loures EFR, Portela SEA, Deschamps F, & 9th IFAC Conference on Manufacturing Modelling, Management and Control, MIM 2019. (2019). Applying machine learning to AHP multicriteria decision making method to assets prioritization in the context of industrial maintenance 4.0. Ifac-papersonline, 52, 13, 2152–2157
Health risk of Radon. https://www.epa.gov/radon/health-risk-radon. Available from

Download references

Acknowledgements

Not applicable.

Funding

The authors declare that no funds, grants, or other support were received during the preparation of this manuscript.

Author information

Ali Faghani and Lei Guo contributed equally to this work.

Authors and Affiliations

College of Engineering and Engineering Technology, Northern Illinois University, DeKalb, IL, USA
Ali Faghani & Mahdi Vaezi
School of Interdisciplinary Health Professions, Northern Illinois University, DeKalb, IL, USA
Lei Guo
University of Illinois Cancer Center, Chicago, IL, USA
Margaret E. Wright
School of Health Studies, Northern Illinois University, DeKalb, IL, USA
M. Courtney Hughes

Authors

Ali Faghani
View author publications
You can also search for this author in PubMed Google Scholar
Lei Guo
View author publications
You can also search for this author in PubMed Google Scholar
Margaret E. Wright
View author publications
You can also search for this author in PubMed Google Scholar
M. Courtney Hughes
View author publications
You can also search for this author in PubMed Google Scholar
Mahdi Vaezi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

MCH, MV, MW, and AF conceptualized the study. All authors contributed to the study design. Material preparation, data collection and analysis were performed by AF and LG. The first draft of the manuscript was written by AF and LG and all authors commented on subsequent versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Mahdi Vaezi.

Ethics declarations

Ethics approval and consent to participate

Not applicable. This is a secondary analysis using publicly available dataset.

Consent for publication

Not applicable.

Competing interests

The authors have no conflicts of interest to disclose.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Faghani, A., Guo, L., Wright, M.E. et al. Construction and case study of a novel lung cancer risk index. BMC Cancer 22, 1275 (2022). https://doi.org/10.1186/s12885-022-10370-4

Download citation

Received: 30 May 2022
Accepted: 25 November 2022
Published: 06 December 2022
DOI: https://doi.org/10.1186/s12885-022-10370-4

Construction and case study of a novel lung cancer risk index