Introduction

Asthma and chronic obstructive pulmonary disease (COPD) are heterogeneous chronic lung diseases, characterized by the presence of airway obstruction and airway inflammation [1, 2]. In asthma, airway obstruction is typically completely or nearly completely reversible [2], while irreversible airway obstruction is typical for COPD [1]. Although both diseases have overlapping clinical features, acknowledged in guidelines [3], they are generally regarded as different disorders; each requiring their own diagnostic and management strategies. For example, for the pharmacological management of asthma, inhaled corticosteroids (ICSs) are the most efficacious drugs currently available [2]. In COPD, inhaled bronchodilators are fundamental for treatment, whereas addition of ICS is only recommended for COPD patients suffering from severe disease (FEV1 < 50 % predicted) and/or a history of recurrent exacerbations [1]. In order to determine appropriate treatment, prognosis and follow-up, international guidelines have emphasized the importance of differentiating asthma from COPD.

In general, careful history taking, physical examination, and lung function testing often lead to a clear diagnosis [4]. However, it is frequently difficult if not impossible to achieve an accurate diagnosis of either asthma or COPD in older patients [5--8]. Although asthma generally affects children and young adults, it is not uncommon that asthma starts later in life [9]. In approximately 4 to 8 % of asthmatic cases, the first asthma symptoms are present in late adulthood (late-onset asthma) or even after 65 years of age. With increasing age, a proportion of patients with asthma may develop persistent irreversible airflow limitation, particularly in the presence of risk factors such as smoking [5], blood eosinophilia, chronic mucus hypersecretion, and a low level of FEV1 [10]. This type of asthma is clinically indistinguishable from COPD, which the guidelines called ACOS or asthma-COPD overlap syndrome [3], and medical history, physical examination, and lung function tests may become insufficient to distinguish asthma from COPD to allow the most adequate therapy. The lack of a diagnostic standard to identify asthma at older age, together with poor perception of symptoms such as dyspnea, may further hamper the recognition of asthma in the elderly [11].

In case of doubt, clinicians may attempt to achieve a best possible diagnosis of asthma or COPD by taking bronchial biopsies for histopathological examination, although this is not a common practice. It has been suggested that pathological examination of bronchial tissue, taking features such as denudation of the epithelium in asthma and epithelial hyperplasia in COPD into account, might contribute to resolving the diagnostic difficulty. Despite clear morphological differences between asthma and COPD, some morphological characteristics can be found in both diseases (in particular in chronic or severe cases), which impairs their diagnostic value in an individual case [12]. Bourdin et al. demonstrated that the diagnostic value of histological examination of endobronchial biopsies from subjects with asthma or COPD is limited, sensitivity and specificity ranging between 36–48 and 56–79 % respectively [13]. However, the latter study included mostly young, never-smoking, non-steroid-using asthma patients with normal lung function and high bronchodilator reversibility. Furthermore, asthma and COPD patients were not matched for age, airway obstruction, ICS use, and smoking habits. Since these factors modulate histological features of airway inflammation and remodeling, they therefore may have confounded the results.

In the current study, we aimed to identify the most important histopathological features to differentiate between asthma and COPD in bronchial biopsies. We hypothesized that the accuracy of the pathological diagnosis would improve when taking into account modulating factors such as smoking, age, and ICS use.

Materials and methods

Patients and matching of biopsies

Biopsies from 24 asthma and 24 matched (see below) COPD patients were included. Subjects were only included when there was no uncertainty in the diagnosis and when subjects met all criteria for either asthma or COPD according to international guidelines [1, 2]. Subjects were selected from several asthma and COPD cohort studies performed in our institute [14--17]. We created three groups (A–C) of asthma patients (n = 8 each group), carefully matched with three groups of COPD patients (n = 8 each group).

  1. A.

    The first asthma and COPD groups included subjects who did not use ICS, were >45 years old, had a post-bronchodilator (BD) FEV1/FVC <70 %, and had smoked >10 pack-years.

  2. B.

    The second asthma and COPD groups included subjects with the same criteria, but subjects had used ICS during the last 30 months.

  3. C.

    The third group included asthma patients without ICS use, and with post-BD FEV1 > 90 % predicted, age < 45 years, 0 pack-years smoking, and atopy (Phadiatop > 1.0). This was contrasted with COPD patients without ICS use, with post-BD FEV1 < 50 % predicted, age > 45 years, current smoking with >10 pack-years, and without atopy.

Groups A and B included patients with a clinically difficult differential diagnosis between asthma and COPD, whereas control group C included so-called classical cases easy to differentiate.

Table 1 shows the group characteristics (A–C) and Table S1 individual characteristics. Details of the selection process are depicted in the Fig. S1.

Table 1 Characteristics of patients with asthma and COPD

Medical records, lung function data, and paraffin-embedded endobronchial biopsies (EBB) were available from all selected patients.

EBB staining, virtual microscopy, and interactive website

EBB slides were stained with hematoxylin and eosin (H&E). An experienced pulmonary pathologist (WT) checked the quality of the slides for each patient and selected the best specimen based on size, intactness, and presence of mucosal and submucosal layers. This quality check was performed without knowing the diagnosis or paying attention to the possible diagnosis. Entire biopsies were scanned using Aperio ScanScope Digital Slide Scanner at ×40 magnification. Afterwards, the images were uploaded to a specially designed interactive website, allowing to view the slides at different magnifications and to navigate into different areas of the bronchial biopsy like with a normal microscope (Fig. 1). Quality of our web-based virtual microscopy was checked by an experienced independent pulmonary pathologist who compared all 48 slide images with series of biopsies from the original glass slides by microscope.

Fig. 1
figure 1

Example slide on interactive website. Screenshot of the interactive website, showing a slide with a representative bronchial biopsy at ×20 magnification. The small image in the lower part of the picture is the overview window, showing the current position and size of the large window. The website allowed to view the slides at different magnifications and to navigate into different areas of the bronchial biopsy like a normal microscope

Study protocol

Ten pathologists from different countries (Netherlands, Brazil, Canada, Austria, Switzerland), five of them specialized in pulmonary pathology, participated in the study and used our interactive website following a strict protocol (Table 2). The pathologists were informed about the design of the study but had no clinical information (age, sex, ICS use, smoking, lung function). In phases 1 and 4, the pathologists were offered 48 slides, i.e., 24 pairs of matched asthma and COPD patients, and were asked to indicate per pair which one was asthma and which one COPD. In phases 2 and 3, all 48 slides were offered in a randomly mixed order and the pathologists were asked to choose for either asthma or COPD. In phase 3, the pathologists were additionally asked to indicate the presence or absence of a pathological criterion using the criteria list of Bourdin’s study [13], with small modifications (Box 1). In addition, they were asked how sure they felt about their diagnosis using a 0–10 visual analog scale (VAS) score (0 not sure, 10 very sure) and to rank what they considered the three most relevant features (Fig. 2).

Table 2 Design of the study
Fig. 2
figure 2

Example slide on interactive website including the list with pathological criteria. Screenshot of the interactive website, showing a slide (phase 3a) with a representative bronchial biopsy at ×10 magnification. The small image in the lower part of the picture is the overview window, showing the current position and size of the large window. At the right side of the slide the pathologists may record the observed abnormal presence of 12 pathological criteria (yes/no), the diagnosis of asthma or COPD, how sure they feel about the diagnosis, and rank the 3 most relevant features

Box 1

figure a

Statistical analyses

Data from the interactive website were automatically and anonymously saved to an Excel file (MS Excel 2010) and transferred to SPSS software (version 19.0; SPSS Inc., Chicago, IL). The individual data was computed to a concordant or discordant diagnosis of asthma or COPD (i.e., concordant between pathological and clinical diagnosis), and results were expressed as the mean percentage of the concordant diagnosis. In phase 3, the reported presence of each pathological criterion was compared between slides of asthma and COPD using the Mann-Whitney test. Sensitivity, specificity, and accuracy of each pathological criterion for pathologists making a concordant diagnosis of asthma were calculated. A high sensitivity for asthma indicated automatically a high specificity for COPD and vice versa. Pathological criteria with a p value <0.2 in the univariate analysis were entered in a logistic regression analysis on the presence of asthma. Pathological criteria which contributed independently to a concordant diagnosis of asthma or COPD were combined to find higher accuracy rates for the concordant diagnosis. Selection of combined criteria was based on the highest Wald value in the regression analyses.

Analyses were performed in the total group of patients, within patient groups (A–C), and between pathologists (general, specialized). Intra- (between phases 3a and 3b, see Box 1) and inter-observer (within phase 3a) agreements were assessed with Cohen’s kappa test and Fleiss’ kappa, respectively. The significance level was set at 0.05.

Results

Concordant diagnoses (pathology concordant with clinical diagnosis)

Table 3 shows the percentage of concordant diagnoses per phase, for each disease group (A–C) per pathologist group. The percentages of concordant diagnoses, per pathologist, per phase, are shown in Fig. S2. The percentage of concordant diagnoses of asthma or COPD, per pathologist group, in phase 3a is shown in Table S2. Overall, the highest number of concordant diagnoses was observed in phase 4, particularly by pulmonary pathologists. The highest score for asthma was observed in phase 3a, in the classical asthma/COPD group (C), by pulmonary pathologists, 91.4 %. The highest score for COPD was also observed in phase 3a, in the non-classical asthma/COPD group of ICS users (B), with no difference between pulmonary and general pathologists. Feeling sure about the diagnosis of asthma (in asthma and COPD cases together) was rated on a VAS scale from 1 to 10. In groups A–C, the mean (SD) VAS score for asthma was 5.6 (2.5), 5.2 (2.0), and 6.1 (2.4), respectively. For COPD, this was 5.9 (2.4), 5.8 (2.1), and 5.0 (2.6).

Table 3 Percentage of concordant diagnoses in different phases of the study

Pathological criteria in phase 3

Table 4 shows the reported presence of pathological criteria in the airway wall biopsies in asthma or COPD per disease group (4A–C). Criteria that differed significantly between asthma and COPD were not comparable between the three groups. Eosinophilia was significantly more frequently reported in asthma in groups B + C but not in A (subjects with asthma and COPD in B used ICS, in contrast to subjects in A + C). Submucosal inflammation was significantly more frequently reported in asthma in groups B + C than in group A, yet more frequently in COPD in group A. Table S3 and Box 2 show which pathological criteria significantly contribute to a concordant diagnosis of asthma or COPD in a multiple regression model. Significant criteria present in all groups were goblet cells, inflammatory infiltrate, and glands. Significant criteria present in two groups were eosinophils (group B + C), and in one group squamous metaplasia (group A), BM thickening (group B), hyperplasia (group C), smooth muscle (group C), and submucosal fibrosis (group C). Significant (p < 0.05) Wald values of these models are also shown in Table 4.

Table 4 Characteristics of the pathological criteria for the diagnosis of asthma or COPD in group A (non-ICS users), group B (ICS users), and group C (classical group)

Sensitivity, specificity, and accuracy of criteria for a concordant diagnosis

Table 4 shows the sensitivity, specificity, and accuracy for asthma/COPD diagnosis for groups A–C. Pathological criteria with high accuracy differed importantly between groups A–C. In group A (no ICS use), glands, goblet cells, squamous metaplasia, and submucosal infiltrate showed the highest accuracy. In group B (with ICS use), the highest accuracy was provided by submucosal inflammation, basement membrane thickening, eosinophilia, and glands. In group C (classical asthma and COPD), the highest accuracy was shown by submucosal fibrosis, epithelial hyperplasia, eosinophilia, and glands. Combinations of relevant pathological criteria did improve sensitivity or specificity, but in general not accuracy (Table S4).

Agreement between and repeatability within pathologists

Agreement between pathologists for a concordant diagnosis of asthma or COPD in groups A–C was 0.40, 0.45, and 0.48, respectively. This was 0.33, 0.45, and 0.72 for asthma and 0.60, 0.57, and 0.27 for COPD, respectively. Repeatability within pathologists for a concordant diagnosis in groups A–C was 0.75, 0.78, and 0.80 respectively. This was 0.75, 0.70, and 0.80 for asthma and 0.70, 0.85, and 0.80 for COPD, respectively. Agreement between and repeatability within pathologists for the reported presence of pathological criteria in groups A–C is presented in Table 4. Overall agreement in groups A–C varied between 0.61 and 0.92. Overall repeatability in groups A–C varied between 0.68 and 0.95.

Discussion

Asthma and COPD are obstructive airway diseases with clear differences in etiology and pathophysiology, yet at older age, they frequently are difficult to discriminate, the currently so-called asthma-COPD overlap syndrome (ACOS) [3]. In the current study, there was no discussion about the original underlying disease, as we had historical data from well-characterized cohort studies. Therefore, we had the opportunity to carefully match bronchial biopsies from 24 asthma and 24 COPD patients taking age, FEV1, ICS use, and smoking habits into account. Ten pathologists, not informed about the individual clinical background of the patients, but knowing the study design, were asked to diagnose asthma or COPD. The important outcome of this study is that histological examination of bronchial biopsies alone does not allow differentiating between asthma and COPD. However, as recognition of the specific histological criteria was good, diagnostic value can be expected to improve when selected pathological criteria are applied and adequate clinical information is provided, in which in particular, knowledge about the use of ICS is essential.

In this study, we aimed for high-quality matching of asthma and COPD patients, considering age, smoking, ICS use, and airway obstruction as important modulators of airway inflammation and remodeling. This matching is important because it fits with the original question of the study: whether it is possible for pathologists to discriminate between older asthma patients with fixed airway obstruction and COPD patients, using bronchial biopsies. We designed our study into four phases, gradually increasing the level of difficulty and anticipated that a head-to-head comparison of two paired slides, one from an asthma patient and one from a COPD patient, would help to discriminate between the two diseases. Interestingly, this was not the case. Success rates of scoring of the paired slides were not higher than those of randomly mixed slides. Furthermore, we anticipated that systematic scoring of textbook pathological criteria for asthma or COPD would help to set the correct (=concordant) diagnosis; however, this was not the case.

The overall percentages of a diagnosis of asthma or COPD concordant with the clinical diagnosis were low, taking into account a 50 % concordance by chance. We expected the highest percentage of concordant diagnoses in young asthmatic patients without ICS use and with a normal lung function, assuming that the recognition of the underlying disease is easier when inflammation is not treated with ICS or changed by age- and smoking-related remodeling processes. Indeed, the highest percentage of a concordant diagnosis for asthma in the randomly mixed slides was observed in the group of classical cases (group C, without ICS) and the lowest in the group of asthma subjects who used ICS clinically which are difficult to diagnose (group B). Interestingly, this contrasted with the findings for COPD, where the highest percentage of concordant diagnoses occurred in the group of ICS users. It is not clear at first sight why ICS use reduced the percentage of concordant diagnosis for asthma, yet improved this for COPD. This may well be because corticosteroids reduce eosinophil survival but prolong neutrophil survival; hence, ICS use may have reduced eosinophilia in asthma and increased neutrophilia in COPD [18], eosinophilia being considered a hallmark for asthma and neutrophilia for COPD. This contrasting effect of ICS use on giving a concordant diagnosis of asthma or COPD is an important finding of our study, which—it goes without saying—emphasizes the need for effective transfer of adequate clinical data from clinicians to pathologists.

The reported presence of pathological criteria was significantly different between slides from asthma and COPD patients, in striking contrast with the low percentage of concordant diagnoses of asthma and COPD based on biopsies alone. Additionally, the agreement between the 10 pathologists (inter-observer variability) and repeatability within pathologists (intra-observer variability) was good to excellent for many pathological criteria, independent of a concordant diagnosis. Thus, all expertise to allow a concordant diagnosis of asthma or COPD was present, yet the pathologists appeared to need additional information to be able to apply this expertise successfully, considering the relatively low percentage of concordant diagnoses. A priory, one might expect that the diagnosis of pathologists based upon histology only might be more accurate because it allows integration of all relevant information from all parts of a biopsy. Apparently, this is a difficult task, which is in particular complicated when clinical characteristics like age, smoking, ICS use, and airway obstruction, that affect accuracy and significance of potentially useful pathological criteria, are not known, as shown in the present study.

Next, we tried to identify pathologists with a higher percentage of concordant diagnoses in order to learn and copy their strategy. Beforehand, we expected that pulmonary pathologists would do better than general pathologists. Indeed, at a group level, the mean scores of the pulmonary pathologist group tended to be higher in every phase of the study, and the pathologists with the highest percentages of concordant diagnoses were almost always the pulmonary pathologists. For example, one of the pulmonary pathologists (pathologist B in Table S2) had high accuracy rates, but importantly, in the subgroup of patients who did use ICS (group B), this pathologist had poor results, again demonstrating the importance of knowledge of the clinical background of patients. The pathological criteria that were most frequently ranked as important for the diagnosis by the best scoring pathologists were squamous metaplasia in group A (without ICS use), basement membrane thickening in group B (with ICS use), and submucosal fibrosis in group C (with “classical” asthma and COPD). This is not unexpected as these pathological criteria demonstrated also high accuracy rates at a group level. Finally, we tried to improve accuracy rates by combining pathological criteria that independently associated with the clinical diagnosis of asthma or COPD. Whereas it was not difficult to find combinations with a very high sensitivity or specificity rates, the accuracy rates of these combinations were generally not better than the individual pathological criteria.

We hypothesize that pathologists may improve their ability to differentiate asthma from COPD if they use selected pathological criteria, i.e., those with a high accuracy rate for concordance, provided that the relevant clinical information is known. In non ICS users with fixed airway obstruction the abnormal presence of goblet cells directs towards an asthma diagnosis, whereas glands, squamous metaplasia, and submucosal infiltrate direct towards COPD (Box 2). In ICS users with fixed airway obstruction, the abnormal presence of submucosal infiltrate, basement membrane thickening, eosinophils, and glands direct towards asthma diagnosis. In classical cases, the diagnosis is already known on the basis of clinical characteristics. Nevertheless, the clinical diagnosis can be confirmed by the observation of submucosal infiltrate and eosinophils indicating asthma, whereas submucosal fibrosis, hyperplasia, and glands indicate COPD.

Box 2

figure b

This study has a few limitations. First, one can argue whether or not a physician’s diagnosis of asthma or COPD is an adequate gold standard. Importantly, patient charts of many years back showed that even patients with asthma and fixed airway obstruction at current investigation had reversible airway obstruction at a younger age. After accepting that our physician’s diagnosis is reliable, one might still argue that the selected cases are not the ones that normally would need a pathological examination, because the clinical diagnosis was already established with certainty. Unfortunately, we have no biopsies available to compare cases with certain versus uncertain clinical diagnosis. Our aim however was to establish clues at the microscopical level with strong relation with either asthma or COPD, to be used as clues in difficult biopsies to direct towards a likely diagnosis. Secondly, one could argue that biopsies were not examined by microscope but on screen, which reflects but is not identical to the real-life situation in daily diagnostic practice. This indeed should be validated as is indicated by a recent guideline [19]. Nevertheless, in our approach, all pathologists were exposed to identical images, which improved standardization and allowed pathologists to scroll easily through complete high-resolution slides and varying magnification as in classical microscopy. Third, we did not include slides from healthy controls or from subjects with another lung disease. Consequently, the terms sensitivity and specificity only refer to asthma and COPD, compatible with our study question whether a pathologist is able to give a concordant diagnosis of asthma or COPD. We therefore consider our study design appropriate.

In conclusion, we show that the differentiation between asthma and COPD, based on histopathological examination only of a bronchial biopsy, without adequate clinical information, is difficult. This contrasts with the high percentage of concordant diagnosis observed for a number of reported pathological criteria. We postulate that the diagnostic value is likely to improve when selected pathological criteria (Box 2) are applied and adequate information is provided with respect to the use of ICS. Prospective studies incorporating medical decision techniques may validate algorithms that take these issues into account.