FormalPara Key Summary Points

Why carry out this study?

Lung cancer ranks highest in mortality of all cancers in the United States.

Pneumonitis is a potentially serious side effect of non-small–cell lung cancer (NSCLC) treatment that may lead to treatment discontinuation.

The objective was to estimate the cumulative incidence and incidence rates, and identify predictors of treatment-related pneumonitis in NSCLC patients.

What was learned from the study?

Nineteen percent of patients developed pneumonitis over the course of their NSCLC treatment, with male gender, history of pneumonitis, use of other targeted therapies, use of immunomodulatory drugs, and history of radiation therapy all predicting pneumonitis.

Awareness of the pneumonitis predictors identified in this study may help clinicians devise strategies to mitigate the impact of pneumonitis, enhance treatment adherence, and improve outcomes.

Digital Features

This article is published with digital features, including a summary slide and plain language summary, to facilitate understanding of the article. To view digital features for this article, go to https://doi.org/10.6084/m9.figshare.14333708.

Introduction

Cancer of the lung ranks second among all cancer types in the United States in terms of incidence and first in terms of attributable deaths, with estimates indicating that this cancer type will account for 235,760 new cases and 131,880 deaths in 2021 [1]. An estimated 57% of patients with lung cancer have metastases at diagnosis, with a 5-year relative survival rate in such patients of only 5% [2]. A majority (85%) of all lung cancer cases are of the non-small cell lung cancer (NSCLC) histologic group, with squamous cell carcinoma and adenocarcinoma being the predominant histologic subtypes of NSCLC [3].

Treatment of NSCLC has evolved significantly in recent years. The approval of targeted therapies including third-generation tyrosine kinase inhibitors (TKIs) and immune checkpoint inhibitors (ICIs) has led to improved long-term survival, relegating conventional chemotherapy to a secondary role [4, 5]. However, these targeted therapies are associated with a potentially fatal treatment-induced adverse event (AE), pneumonitis, which has been documented in patients treated with programmed death 1 (PD-1)/programmed death ligand 1 (PD-L1) ICIs [6,7,8,9], epidermal growth factor receptor-tyrosine kinase inhibitors (EGFR-TKIs) [10,11,12], and anaplastic lymphoma kinase (ALK) inhibitors [13]. Symptoms typically associated with pneumonitis resulting from the use of targeted NSCLC therapies include cough, dyspnea, fever, chest pain, and hypoxia, accompanied by pulmonary infiltrates that are evident in chest computer tomography (CT) images [14, 15]. Although patients usually respond to oral corticosteroids, some may develop significant dyspnea requiring the use of supplemental oxygen, discontinuation of NSCLC therapy, or intravenous corticosteroids and additional immunosuppressive agents (e.g., infliximab, cyclophosphamide, or mycophenolate mofetil) [14,15,16]. While rare, high-grade (grade 3/4) pneumonitis is associated with significant morbidity and mortality in a small proportion (1%) of those affected [14, 15, 17]. Guidelines issued by the American Society of Clinical Oncology (ASCO) recommend permanent discontinuation of NSCLC therapy in patients with severe pneumonitis (grades 3 and 4) [18], leading to cessation of potentially beneficial treatment.

To date, studies using real-world data on NSCLC patients have focused on subgroups of interest such as those receiving a PD-L1 inhibitor [19,20,21,21], patients with stage III NSCLC [22], or, more recently, efforts evaluating the reliability of real-world endpoints [23,24,25]. We designed a study to examine an important safety concern among individuals diagnosed with NSCLC, which may compromise a patient’s ability to complete the prescribed treatment. Here, we estimate the cumulative incidence and incidence rates, and identify predictors of treatment-related pneumonitis in NSCLC patients across all disease stages who had received currently approved therapies from a large and representative real-world data set in the United States.

Methods

Data Source

The data analyzed were obtained from the Optum® electronic health records (EHR) database [26]. This database contains records from approximately 140,000 physicians at over 700 hospitals, and 7000 clinics across more than 80 integrated delivery networks (IDNs) in the United States [27]. De-identified information on demographic and socioeconomic categories, coded diagnoses and procedures, prescribed medications, laboratory results, and clinical administrative data is available for > 80 million patients from diverse settings (inpatient, outpatient, and ambulatory) across all census regions in the United States [27]. The information does not include any identifiable information as defined by the Health Insurance Portability and Accountability Act (HIPAA) of 1996 [28], eliminating the need for institutional review board (IRB) approval or waiver [29].

Optum’s® proprietary natural language processing (NLP)-based data are used to identify concepts that may not be captured by International Classification of Diseases (ICD) or procedure codes, and complements conventional data elements captured in EHR data relating to diagnosis, drugs, procedures, laboratory test results, and patient characteristics. The NLP concepts are identified and created based on broad topics such as Medications, Signs, Disease and Symptoms (SDS), Measurements, and Observations, and are harvested from the notes fields within the electronic medical records. The data used for development of each NLP concept is de-identified and accuracy is verified through a series of quality assurance steps prior to release for use. Each NLP concept included in the data is associated with a unique subject record and a date of observation, allowing longitudinal tracking of concepts such as “non-small cell lung cancer” or “pneumonitis” over time.

Cohort Assembly

The study cohort was identified using ≥ 1 primary lung cancer diagnosis code (ICD-9-CM: 162, ICD-10-CM: C33, C34), and no mention of small cell lung cancer (SCLC) in SDS data, ≥ 1 mention of NSCLC in SDS data or core NSCLC drug use, first active date ≥ 12 months prior to index lung cancer diagnosis date (“index date”), index date after January 1, 2008, no lung surgery or other cancer diagnosis during the 12-month period prior to the index date, and age ≥ 18 years on the index date. Core NSCLC drugs considered for the inclusion criteria were platinum- and non-platinum–based chemotherapies, ICIs (alone or in combination), EGFR-TKIs, and other targeted therapies (Table 1).

Table 1 Components of the lines of treatment and definitions of treatment categories for the competing risks regression analyses

Lines of Treatment

Lines of therapy (LOT) for each patient were established using business rules centered on (1) identifying continuous periods of drug use, (2) establishing concurrent use of individual drugs and concatenating such drugs into a treatment regimen, and (3) earmarking periods of use of distinct regimens as LOTs. The drugs considered for this analysis are listed in Table 1 and comprise both chemotherapies and targeted agents (including EGFR-TKIs and ICIs).

Analyses Performed

Pneumonitis occurrences were identified using ICD-9 codes (495.0–495.9, 506.0, 507.0, 507.1, 507.8, 508.0, 508.8, 516.32, 516.33, 516.35, 518.3, and 997.39), ICD-10 codes (J67.0–J68.0, J69.0, J69.1, J69.8, J70.0, J70.2, J82, J84.113, J84.114, J84.2, J95.4), and SDS terms (“allergic interstitial pneumonitis,” “chemical pneumonitis,” “cryptogenic organizing pneumonitis,” “desquamative interstitial pneumonitis,” “interstitial pneumonitis,” “organizing pneumonitis,” “pneumonitis,” “radiation pneumonitis”). These data were then used to perform the analyses described below.

Cumulative Incidence

The cumulative incidence of pneumonitis was defined as the percentage of patients with a diagnosis of pneumonitis during (1) the entire follow-up period and (2) in each respective LOT period. Cumulative incidence estimates were stratified by the presence/absence of a history of pneumonitis prior to the start of the evaluation period, LOT, and biomarker subgroup (EGFR-positive/negative and PD-L1-positive/negative). A patient was deemed to be an EGFR mutant if identified as EGFR-positive from the SDS data set or had received osimertinib, erlotinib, afatinib, gefitinib, or dacomitinib (monotherapy or in combination with chemotherapy drugs) in LOT1.

Incidence Rates and Relative Risk

Incidence rates, expressed as cases per 1000 person-days, were calculated as the number of patients diagnosed with at least one occurrence of pneumonitis between the start and end of follow-up for each of the two respective evaluation periods, divided by the sum of the duration from the start of the evaluation period to the first occurrence of pneumonitis for those patients with at least one diagnosis of pneumonitis in the evaluation period and the sum of the duration from the start to the end of the evaluation period (or end of follow-up) for patients without a pneumonitis event in the evaluation period. The entire follow-up period was defined as the time from index date to the end of follow-up. Each LOT evaluation period was defined as the start of a LOT to 30 days after the end of that LOT or one day prior to the start of the next LOT, whichever was earlier, or (where there was no next LOT) up to the end of follow-up for or the end of the last LOT plus 30 days, whichever was earlier. These incidence rates were used to carry out a univariate analysis of 16 covariates (Table 4) to determine the relative risk (RR) of each covariate. The RR was calculated as IR1/IR2, where IR1 is the incidence rate for patients having the condition reported in terms of incidents/1000 person-days and IR2 is the incidence rate for patients who do not have the condition reported as incidents/1000 person-days. P values to test the relative risk of developing pneumonitis were derived from a generalized linear model after accounting for differential patient follow-up times. Patients with unknown histology were excluded from this analysis. The covariates included demographic characteristics, histology, biomarker status, prior history of comorbidities/adverse events, and treatment groups. There were six treatment groups defined: any ICI monotherapy, any ICI combination therapy, EGFR-TKI therapy, other targeted therapy, platinum-based chemotherapy, any non-platinum-based chemotherapy (Table 1).

Competing Risk Regression

A multivariable competing risk regression (30) was used in order to account for the competing risk of death prior to the development of pneumonitis. We used 16 covariates (Table 5) used to identify significant predictors of pneumonitis. The significance level was 0.05.

Results

Selection of the Study Cohort

The study cohort included a total of 81,628 patients identified as depicted in Fig. 1. Almost half (49%) of the cohort were male, 46.9% resided in the Midwest followed by 28.4% in the South, and 85% were Caucasian (Table 2). The mean age of patients at diagnosis was 69 years (SD = 10.1), and patients with Eastern Cooperative Oncology Group (ECOG) score ≥ 2 constituted 30% of patients with a valid score. Sixty percent of patients had localized disease, while 38.3% were EGFR-positive. The mean Charlson Comorbidity Index (CCI) was 1.8 (SD = 2.56).

Fig. 1
figure 1

Selection of the analysis cohort. NSCLC, non-small cell lung cancer; SCLC, small cell lung cancer; SDS, signs, diseases, and symptoms

Table 2 Demographic and clinical characteristics of the analytical cohort by histology

Drug-Treated Patients

A total of 21.7% of patients were treated with medications for ≥ 1 LOT and had a median follow-up (MFU) of 401 days from their index diagnosis, with 8.8% having ≥ 2 LOTs (MFU: 566 days) and 3.8% having ≥ 3 LOTs (MFU: 742) (Table 3).

Table 3 Distribution of eligible patients with LOT(s) with their average length of follow-up

Pneumonitis Cumulative Incidence

Overall, 19.0% (95% confidence intervals [CI]: 18.5–19.6%) of treated patients developed pneumonitis during any LOT, while 26.2% (95% CI: 24.3–28.1%) of those with a prior history of pneumonitis in any time prior to LOT 1 and 17.0% (95% CI: 16.4–17.6%) of those without a previous history of pneumonitis developed the condition during any LOT. Regardless of biomarker subgroup, histologic category, or LOT, in general, a higher proportion of patients with a prior history of pneumonitis developed the condition than patients without a prior history (Fig. 2). Except for PD-L1+ non-squamous cell carcinoma patients, an increase in the cumulative incidence of pneumonitis in each subsequent LOT was observed across EGFR and PD-L1 status, by squamous/non-squamous histology. Overall, there was an increase in cases of pneumonitis of 2.7% from LOT 1 to LOT 3 (data not shown). Between the various subgroups, the largest increase between LOTs 1 and 3 was seen for PD-L1− patients with a non-squamous histology, with cumulative incidence going up from 13.6% to 22.7%. The smallest increase was seen in EGFR+/non-squamous patients (9.7–12.1%). PD-L1+/non-squamous patients was the only exception to this pattern going from 17.9% in LOT 1 to 13.8% in LOT3.

Fig. 2
figure 2

Cumulative incidence of pneumonitis by biomarker subgroup, LOT, histology, and prior history of pneumonitis. EGFR epidermal growth factor receptor; LOT line of therapy; PD-L1 programmed death-ligand 1

Of the 16 variables that were evaluated as risk factors for the development of pneumonitis in univariate analysis, several were shown to be significantly (P < 0.05) associated with a higher risk for developing the condition in at least two LOTs (Table 4). Some common factors included: male gender, squamous histology, EGFR-negative status, history of pneumonitis, history of diabetes, monotherapy treatment with an immunomodulatory drug, and history of radiation therapy. Patients treated with other targeted therapies and non-platinum-based chemotherapy showed a lower risk for developing pneumonitis in at least two LOTs.

Table 4 Relative risk of developing pneumonitis reported in NSCLC while on treatmenta by LOT

The multivariable competing risk regression model to identify predictors of pneumonitis for each LOT showed history of pneumonitis to be significant across all three LOTs. Additionally, there were a number of significant predictors across at least two LOTs. Specifically, male gender (LOTs 1–2), EGFR-negative status (LOTs 1–2), and history of radiation therapy (LOTs 1 and 3) were positively associated (P < 0.05) with the development of pneumonitis (Table 5). Treatment with other targeted therapy (LOTs 1–2) was negatively associated with developing pneumonitis.

Table 5 Fine and Gray competing risk model for time to pneumonitis while on treatment by LOTa

Discussion

Pneumonitis is a significant and serious AE associated with drugs used to treat NSCLC. Our study estimated the risk of pneumonitis, among patients diagnosed with NSCLC who received drug interventions, across clinical and treatment characteristics of interest including prior history of pneumonitis, regimen, LOT, histology, and biomarker status. The incidence of pneumonitis in NSCLC has previously been studied mostly in primary and meta-analyses of clinical trials [6, [7, 11], and in some retrospective chart reviews from select hospitals (8, 9). This has resulted in a wide range of estimates from 3–5% in the clinical trial setting [7, 31] to 19–21% from additional trials or hospital data [14, 32]. This discrepancy may be explained in part by the increased awareness of this AE in recent years and partly to enhanced pharmacovigilance following the administration of targeted agents [14].

In our analysis of EHR data from hospital clinics across the country, the cumulative incidence of pneumonitis among NSCLC patients was estimated to be 19.0% during any LOT, and 33.7% among those with a prior history of pneumonitis and 17.0% for those without a previous history of pneumonitis. Competing risk regression revealed various factors to be positively associated with the development of pneumonitis over multiple LOTs. Predictors of increased pneumonitis risk included a previous history of pneumonitis, male gender, history of radiation therapy, and EGFR-negative status.

To our knowledge, only two other studies have examined rates of pneumonitis in large real-world data sets. The first study used OptumLabs administrative claims data and examined frequencies of all immune-related adverse events (irAEs) in NSCLC patients receiving PD-L1 inhibitors [19]. Pneumonitis was reported in 2.5% of 3164 patients within a month of receipt of a PD-L1 inhibitor, increasing to 14.3% after 9 months. The second study, a retrospective analysis of the Symphony Health administrative claims data, estimated incidence, and timing of radiation-induced pneumonitis following chemoradiotherapy in patients with stage III NSCLC [22]. The cumulative incidence of treatment-related pneumonitis was reported to be 12.4%, with the annual incidence ranging from 5.5% to 18.1%. The higher rates in our study are perhaps explained by the inclusion of more stage IV patients and the effect of more patients having been treated with PD-L1 inhibitors.

Limitations associated with using real-world data need to be recognized. The data for this study were not recorded for research purposes; as such there may be coding errors that could affect the treatment patterns and predictive factors associated with pneumonitis. While this is a large multi-source database, it may not be nationally representative of all NSCLC patients. A final limitation is with the use of NLP for identifying patients of non-small cell cancer histology and partly for diagnosis of pneumonitis. We rely on the data vendor’s NLP algorithm for this and cannot know how well the NLP extracts the information from the physician notes.

Future research building on this study could include using another US-based EHR data source that is focused on community and academic based hematology-oncology clinics or using a non-US real-world data source.

Pneumonitis remains a significant risk in patients diagnosed with NSCLC. This study identified independent factors that may predispose individuals to pneumonitis risk such as previous history of pneumonitis, male gender, EGFR-negative status, ICI therapy, other targeted therapies, or history of radiation. Awareness and monitoring of these factors may help mitigate the risk of pneumonitis for these patients.

Conclusion

Pneumonitis is a significant side effect of medicines developed to treat NSCLC. Recognition of this fact and awareness of the different factors predisposing patients to its development will help physicians proactively tailor treatment regimens to reduce the likelihood of its onset. Patients may consequently be able to better adhere to treatment regimens, leading to positive clinical outcomes and improved quality of life.