Introduction

At present, screening for lung cancer is only recommended in three high-income countries (United States of America, Canada, and Japan), focusing mainly on high-risk groups such as long-term smokers [1]. Today, approximately 50% of patients with lung cancer in Western countries are diagnosed with late-stage cancer [2]. Because pneumonia can be an early symptom and is the most common differential diagnosis [3, 4], it is of interest to examine patterns of antibiotic use prior to the diagnosis of lung cancer.

The few studies to date that have examined histories of antibiotic use in patients with lung cancer have mainly focused on possible causal associations, i.e., antibiotics as a risk factor or risk modifier [5, 6]. In these studies, information on antibiotic use in the period directly before the date of lung cancer diagnosis has generally been excluded to avoid the influence of reversed causality.

Using routinely collected data, we aimed to compare patterns of recent fillings of prescriptions of antibiotics (i.e., within three years prior to the diagnosis of lung cancer) recommended for the treatment of pneumonia between patients with lung cancer and control individuals free of lung cancer. We also explored possible trends in the magnitude of the effect size for the association between a history of repeated pre-diagnostic fillings of antibiotic prescriptions and lung cancer by stage as a possible indicator of diagnostic delays.

Materials and methods

Study design and study population

We performed a population-based study with routinely collected data to examine possible associations between a diagnosis of lung cancer and a recent history of filled prescriptions of antibiotics recommended for the treatment of pneumonia (Supplementary Table S1). We identified individuals diagnosed with lung cancer (International Classification of Diseases for Oncology third edition [ICD–O–3] code: C34 or ICD–7 code: 1,621) between 2009 and 2016 (cases), as recorded in the Swedish National Lung Cancer Register (NLCR). The NLCR includes 97% of all individuals registered with a diagnosis of lung cancer in the Swedish Cancer Register (SCR) to which reporting is mandated [7, 8]. Using a risk set sampling approach, each case was individually matched with up to five randomly selected individuals free of lung cancer (controls), at the date of the diagnosis for the case, from the Swedish Population Register [9]. The matching factors were the year of birth, sex, and place of residence at the date of diagnosis for the case. The index date was defined as the date of the first lung cancer diagnosis recorded in the NLCR or the SCR, and the corresponding date for the matched controls.

We excluded cases with unclassified, unknown, or missing lung cancer histopathology, cases with more than a six-month difference between the recorded date of diagnosis in the NLCR and the SCR, and controls with a record of a lung cancer diagnosis in the SCR before the index date.

Data sources

Data for the present study were retrieved from the Lung Cancer Database Sweden (LCBaSe) [2, 10], a register-based research database generated by record linkages between the NLCR, the SCR, the National Patient Register (NPR), the Prescribed Drug Register (PDR), the Cause of Death Register (CDR), the Multi-generation Register (MGR), the Swedish Population Register, and the LISA-database (a database containing sociodemographic data). The record linkages were made possible by the use of the personal identity number, a unique personal identifier given to all residents of Sweden [9]. In this study, we used data in the LCBaSe from the NLCR, the SCR, the NPR, the PDR, the Swedish Population Register, and the LISA-database [8, 9, 11,12,13,14].

Definition of exposure

We used four different measures to assess the history of filled prescriptions of antibiotics recommended for treatment of pneumonia (Supplementary Table S1) as recorded in the PDR within three years prior to the index date:

  1. (1)

    at least one filled prescription,

  2. (2)

    repeated fillings (≥ 2 fillings),

  3. (3)

    the number of filled prescriptions (categorized as 0, 1, 2, 3, and ≥ 4), and

  4. (4)

    the number of different types of antibiotics (unique anatomical therapeutic chemical codes at the 5th level, categorized as 0, 1, 2, and ≥ 3).

Statistical analysis

Descriptive analyses were used to describe the clinical characteristics and demographics of cases and controls. Continuous variables were summarized using the median and the first and third quartiles (q1 and q3). Categorical variables were summarized with counts and per cent.

We used conditional logistic regression models, with the matched individuals as the clusters, to estimate odds ratios (OR) and 95% confidence intervals (CI) for possible associations between a diagnosis of lung cancer (non-small cell lung cancer [NSCLC] overall, squamous cell carcinoma [SCC], adenocarcinoma, and small cell lung cancer [SCLC]) and a history of recent exposure. The effect of being a patient with lung cancer was estimated for the different histopathological subtypes overall and, within each histology, separately by sex, age, educational level, summarized cancer stage according to the tumor–node–metastasis [TNM] system from the American Joint Committee on Cancer [15], and separately for the descriptors that make up the stage. In the adjusted model, the highest attained educational level the year before the index date (categorized by years of formal education: ≤ 9, 10–12, and ≥ 13; baseline level: “ ≤ 9”) was used as an indicator of socioeconomic status. Other covariates included in the adjusted models were a record in the NPR of a chronic obstructive pulmonary disease (COPD) diagnosis (ICD–10: J44, baseline level: “no”) in the five years prior to the index date, at least one filled prescription of pre-defined antibiotics at any time prior to the start of the observation period (a minimum of six months) (baseline level: “no”), a history of any cancer (any record of a diagnosis in the SCR prior to the index date) (baseline level: “no”), and the other TNM-descriptors in the descriptor specific subanalyses (e.g., if estimating the effect of exposure in subgroups of the T-descriptor then the estimate was adjusted for the N- and M-descriptors; baseline levels: “T1” and “N1”).

We performed several sets of sensitivity analyses. First for different time intervals of the observation period, and second for those with an index date of the 1st July 2009 or after and without records of filled prescriptions of a pre-defined antibiotic within 12 months prior to the start of the observation period (washout period of 12 months) (Supplementary Figure S1). Third, where filled prescriptions in the three months before the index date were not considered, and fourth, following the exclusion of individuals with a history of COPD.

All data management and statistical analyses were performed using R statistical packages version 3.5.3 (R Development Core Team) and Stata version 15 (StataCorp LP), or later versions.

Results

We identified a total of 29,890 patients with a diagnosis of lung cancer between 2009 and 2016 (Supplementary Figure S2). Of these, 2,873 cases (and their matched controls) were excluded because of unclassified, unknown or missing histopathology (n = 2275) or more than a six-month difference between the recorded date of diagnosis in the NLCR and the SCR (n = 598). An additional 90 controls were excluded because of a record of diagnosis of lung cancer in the SCR before their index date. Hence, the final study population encompassed a total of 27,017 patients with lung cancer and 12,9355 controls.

Characteristics of the study population

A high educational level (i.e., post-upper secondary degree) was less common among cases (16%) compared to controls (25%) (Table 1). Compared to cases, a lower proportion of controls had a history of COPD (13% and 3%, respectively).

Table 1 Baseline characteristics of patients with a diagnosis of lung cancer (cases) and individuals free of lung cancer (controls), Lung Cancer DataBase Sweden, 2009–2016

Patterns and odds of fillings

Half of the cases and one-third of the controls had a recent history of at least one filled prescription of antibiotics (Table 1). The proportion of cases and controls exposed to antibiotics did not vary substantially by age. For cases, the proportion varied between 47% (≥ 80 years) and 52% (60–69 years), and for controls, between 30% (< 50 years) and 34% (60–69 years). Among patients with lung cancer, the percentage with at least one filled prescription started to increase around three to four months prior to the diagnosis, while it remained relatively stable for individuals free of lung cancer, this was independent of a history of COPD (Fig. 1). In patients with lung cancer, ever-smokers had more often filled at least one prescription compared to never-smokers, an observation that was present throughout the observation period (Supplementary Figure S3). The proportion of individuals with repeated fillings was 25% among patients with lung cancer and 13% among individuals free of lung cancer (Table 1), and the maximum treatment cycles were 63 and 78, respectively. In a comparison between individuals with and without a history of COPD, both cases and controls with COPD had more often filled repeated prescriptions of antibiotics (Supplementary Figure S4).

Fig. 1
figure 1

The proportion of cases (individuals with lung cancer) and controls (individuals free of lung cancer) with at least one filled prescription of antibiotics recommended for the treatment of pneumonia. Presented for separate months of the observation period, overall and by history of chronic obstructive pulmonary disease. Cases and controls were matched by sex, year of birth and place of residence. Lung Cancer DataBase Sweden, 2009–2016

Compared to controls, the likelihood of a recent history of at least one filled prescription was elevated in patients with lung cancer: NSCLC overall (OR: 1.83, 95% CI 1.77–1.88), SCC (OR: 2.18, 95% CI 2.05–2.32), adenocarcinoma (OR: 1.70, 95% CI 1.64–1.77), and SCLC (OR: 1.92, 95% CI 1.78–2.06) (Table 2). The magnitudes of these effect sizes became more pronounced with an increasing number of filled prescriptions (Fig. 2) and with proximity to the diagnosis of lung cancer (Fig. 3, Supplementary Table S2). While the magnitudes of the effect sizes did not differ by sex or educational level, they became attenuated with increasing age (Table 2). Compared to controls, patients with lung cancer had between two and three times higher odds for a recent history of four or more filled prescriptions in three years prior to the diagnosis. A similar pattern was observed for the association with the number of different types of antibiotics from filled prescriptions (Supplementary Table S3).

Table 2 Odds ratios and 95% confidence intervals for the association between a diagnosis of lung cancer and a recent history of at least one filled prescription of antibiotics recommended for the treatment of pneumonia, Lung Cancer DataBase Sweden, 2009–2016
Fig. 2
figure 2

Odds ratios (solid line) and 95% confidence intervals (shaded area) for the association between a diagnosis of lung cancer and the number of recently filled prescriptions of antibiotics recommended for the treatment of pneumonia. Cases and controls were matched by sex, year of birth and place of residence. The odds ratios were adjusted for sex, year of birth, place of residence, highest attained education, previous chronic obstructive pulmonary disease diagnosis, previous use of antibiotics recommended for the treatment of pneumonia, and history of any cancer. Lung Cancer DataBase Sweden, 2009–2016

Fig. 3
figure 3

Odds ratios for the association between a diagnosis of lung cancer and a recent history of at least one filled prescription of antibiotics recommended for the treatment of pneumonia in different time intervals before diagnosis. Cases and controls were matched by sex, year of birth and place of residence. The odds ratios were adjusted for sex, year of birth, place of residence, highest attained education, previous chronic obstructive pulmonary disease diagnosis, previous use of antibiotics recommended for the treatment of pneumonia, and history of any cancer. Lung Cancer DataBase Sweden, 2009–2016

For repeated fillings, we did not find a trend in the magnitude of the effect size by cancer stage (Table 3). However, there was an indication of a slightly pronounced magnitude of the effect size for stage III compared to stage IV disease. In separate assessments, there was no trend in the effect size by the T- and N-descriptors.

Table 3 Odds ratios and 95% confidence intervals for the association between a diagnosis of lung cancer and a recent history of repeated fillings (≥ 2 fillings) of antibiotics recommended for the treatment of pneumonia, by cancer stage at diagnosis, Lung Cancer DataBase Sweden, 2009–2016

When assessing the exposure in the six months prior to the diagnosis only, the magnitudes of the ORs for repeated fillings became substantially pronounced compared to those unexposed and those who filled one prescription only, a pattern also present when assessing the association for different types of antibiotics from filled prescriptions (Supplementary Table S4, Supplementary Table S5). When we applied a 12-month washout period, the results remained essentially unchanged with an estimated two-fold increase in odds for a history of at least one filled prescription: NSCLC overall (OR: 1.84, 95% CI 1.77–1.90), SCC (OR: 2.11, 95% CI 1.96–2.27), adenocarcinoma (OR: 1.73, 95% CI 1.66–1.81), and SCLC (OR: 1.95, 95% CI 1.78–2.13) (Supplementary Table S6 When excluding the three months closest before the index date, the magnitudes of the ORs decreased (Supplementary Table S7). However, an indication of more pronounced ORs with an increasing number of fillings remained. Following the exclusion of individuals with a history of COPD, the estimated ORs remained virtually unchanged (Supplementary Table S8).

Discussion

Main findings

To the best of our knowledge, this is the first population-based study using routinely collected data that has investigated patterns of pre-diagnostic use of antibiotics as a potential early indicator of lung cancer.

Compared to individuals free of lung cancer, a higher proportion of patients with lung cancer had filled at least one prescription of antibiotics within three years prior to diagnosis, and, additionally more often had a history of repeated fillings. The odds of a recent history of at least one filled prescription was approximately two times higher among patients with lung cancer compared to individuals free of lung cancer, and the magnitude of the effect size became more pronounced with increasing numbers of filled prescriptions and with proximity to the diagnosis. Also, the magnitude of the effect size was more pronounced among younger compared to older individuals, and among patients with SCC and SCLC compared to those diagnosed with adenocarcinoma. We found no evidence in support of a trend in the magnitude of the effect size by cancer stage.

Interpretation and comparison with other studies

Our findings of more frequent use of antibiotics prior to diagnosis in patients with lung cancer are for several reasons not surprising. First, pulmonary infections are not uncommon in the area of tumor growth and can present as the first symptom of a malignancy [3, 4]. Second, tumors and infected loci can initially be indistinguishable on radiographic evaluation [16]. Third, COPD and upper respiratory infections are more common in smokers at increased risk of lung cancer and individuals with COPD are often prescribed antibiotics for exacerbations [17, 18]. Taken together, our findings do not necessarily reflect an inappropriate prescribing of antibiotics. However, with as many as 7% of the patients with lung cancer in the present study having four or more treatment cycles (the maximum being 63) of antibiotics within three years prior to the diagnosis, our findings indicate improper clinical follow-up and reassessment for some patients, contrary to long-standing guidelines [19].

Our findings that the proportion of cases with filled antibiotic prescriptions started to increase three to four months before diagnosis as well as the pronounced effect size for being exposed in the last period prior to diagnosis indicate that early signs and symptoms of lung malignancy are commonly starting to present during this time window. Our results also corroborate the findings by Ewing et al. of an increasing frequency of primary care consultations approximately 80–100 days before confirmation of a diagnosis of lung cancer [20].

The reasons for the attenuation of the magnitude of the OR with increasing age are unclear, but may reflect a lower diagnostic intensity and underreporting of lung cancer among older individuals [21]. Also, the baseline occurrence of lung cancer is higher among older individuals, hence, a higher increase in absolute numbers of patients with lung cancer is required to yield the same relative change as for younger individuals. The more pronounced magnitudes of the effect sizes observed for SCC and SCLC compared to adenocarcinoma may reflect that these subtypes are generally more centrally located compared to adenocarcinomas and that they are associated with severe symptoms of pneumonia [22,23,24]. It may also reflect differences in smoking history: a higher proportion of patients with adenocarcinoma are never-smokers compared to those with SCC and SCLC [2, 25].

The indication of slightly attenuated magnitude of the OR for the associations between lung cancer and a history of repeated fillings for stage IV compared to stage III cancer may reflect characteristics in patients with stage III disease associated with an increased risk for infections or a presence of infection-like symptoms [26]. Stage III lung cancer is characterized by lymph node involvement or a T4-status indicating (independent of tumor size) spread to an ipsilateral lobe or invasion of heart or central parts of the lung (e.g., the carina) [27]. Patients with stage IV disease may have a small and more favorably located primary tumor without any involvement of lymph nodes, not causing infections to the same extent, and consequently, have a lower likelihood of being prescribed antibiotics before diagnosis. The descriptor specific estimates, with no trends in association for the T- and N-descriptors, bring no clarity to the observed differences between stage III and stage IV. Studies investigating the causes of diagnostic delays of lung cancer have identified factors such as inconclusive chest X-ray readings, the presence of comorbid conditions, waiting times for chest X-ray and the absence of symptoms [16, 28]. Taken together, the findings of our study do not provide evidence that a history of repeated treatment cycles of antibiotics is related to diagnostic delays, at least not as reflected in an increased likelihood of being diagnosed with more advanced disease.

Alternative explanations for our findings include a role of antibiotics as a risk factor, possibly via an influence on the immune system or host–microbiota composition [5, 6, 29]. However, for several reasons our findings are unlikely to be explained by a risk modifying effect of exposure to antibiotics. First, the latency period for the initiation and development of a malignancy is generally long. Second, the estimates did not change following the exclusion of individuals with filled prescriptions within the washout period of 12 months before the start of the observation period. However, we could not assess exposure before 2005, when the PDR was launched.

Strengths and limitations

The strengths of our study included the use of data from nationwide Swedish population-based registers of high completeness and quality that minimized the risk of selection bias and misclassification bias of exposures and outcomes. The completeness and validity of prescription data in the PDR is very high, mainly because the transfer of information from community pharmacies is almost exclusively automated [13].

Several limitations need mentioning. No information was available for the indication of the prescribed antibiotics. While we restricted the analyses to antibiotics recommended for the treatment of pneumonia, it cannot be ruled out that some prescriptions were for other infections. However, because respiratory infections are one of the most common indications for antibiotic use [30], this is likely to have resulted in no or only a minor misclassification of exposure. Also, assuming a higher risk of respiratory infections in patients with lung cancer (for which the majority are current or former smokers) compared to the general population, any misclassification is likely to be more common in controls and would have biased the results toward the null. The absence of data of actual use by the patient was not a concern because the aim of this study was not to examine exposure to antibiotics as a risk factor per se. Any non-differential misclassification of the exposure or the outcome would have biased the ORs toward the null. Although most pneumonia diagnosis are made in a primary care setting, such information based on ICD coding is not available in the NPR or in the other data sources used for the purpose of the present study. However, all prescriptions made in primary care are included in the PDR, once a prescription is filled.

Because information on smoking history was available for lung cancer patients only, smoking could not be included in the adjusted models. However, information on COPD diagnosis retrieved from the NPR was included in the logistic regression models.

Our findings are likely to be generalisable to settings with similar health care systems and guidelines for the management of pneumonia and lung cancer.

Conclusions

We found that a diagnosis of lung cancer was associated with an increased likelihood of recent pre-diagnostic fillings of antibiotic prescriptions. The likelihood became more pronounced with a greater number of fillings and with proximity to the diagnosis, further supporting the notion that infection represents an early sign of lung cancer. We found no evidence that repeated treatment cycles were associated with a diagnostic delay as reflected by cancer stage at diagnosis. Our findings further underscore the importance to rule out lung cancer following pneumonia treatment, especially in patients with a history of repeated treatment cycles.