AI-Driven Model for Automatic Emphysema Detection in Low-Dose Computed Tomography Using Disease-Specific Augmentation

Nagaraj, Yeshaswini; Wisselink, Hendrik Joost; Rook, Mieneke; Cai, Jiali; Nagaraj, Sunil Belur; Sidorenkov, Grigory; Veldhuis, Raymond; Oudkerk, Matthijs; Vliegenthart, Rozemarijn; van Ooijen, Peter

doi:10.1007/s10278-022-00599-7

AI-Driven Model for Automatic Emphysema Detection in Low-Dose Computed Tomography Using Disease-Specific Augmentation

Open access
Published: 18 February 2022

Volume 35, pages 538–550, (2022)
Cite this article

Download PDF

You have full access to this open access article

Journal of Digital Imaging Aims and scope Submit manuscript

AI-Driven Model for Automatic Emphysema Detection in Low-Dose Computed Tomography Using Disease-Specific Augmentation

Download PDF

Yeshaswini Nagaraj ORCID: orcid.org/0000-0001-9288-9537^1,2,
Hendrik Joost Wisselink³,
Mieneke Rook^3,4,
Jiali Cai⁵,
Sunil Belur Nagaraj⁶,
Grigory Sidorenkov⁵,
Raymond Veldhuis⁷,
Matthijs Oudkerk^8,9,
Rozemarijn Vliegenthart³ &
…
Peter van Ooijen^1,2

2621 Accesses
4 Citations
5 Altmetric
Explore all metrics

Abstract

The objective of this study is to evaluate the feasibility of a disease-specific deep learning (DL) model based on minimum intensity projection (minIP) for automated emphysema detection in low-dose computed tomography (LDCT) scans. LDCT scans of 240 individuals from a population-based cohort in the Netherlands (ImaLife study, mean age ± SD = 57 ± 6 years) were retrospectively chosen for training and internal validation of the DL model. For independent testing, LDCT scans of 125 individuals from a lung cancer screening cohort in the USA (NLST study, mean age ± SD = 64 ± 5 years) were used. Dichotomous emphysema diagnosis based on radiologists’ annotation was used to develop the model. The automated model included minIP processing (slab thickness range: 1 mm to 11 mm), classification, and detection maps generation. The data-split for the pipeline evaluation involved class-balanced and imbalanced settings. The proposed DL pipeline showed the highest performance (area under receiver operating characteristics curve) for 11 mm slab thickness in both the balanced (ImaLife = 0.90 ± 0.05) and the imbalanced dataset (NLST = 0.77 ± 0.06). For ImaLife subcohort, the variation in minIP slab thickness from 1 to 11 mm increased the DL model’s sensitivity from 75 to 88% and decreased the number of false-negative predictions from 10 to 5. The minIP-based DL model can automatically detect emphysema in LDCTs. The performance of thicker minIP slabs was better than that of thinner slabs. LDCT can be leveraged for emphysema detection by applying disease specific augmentation.

Value of [18F]AlF-NOTA-FAPI-04 PET/CT for differential diagnosis of malignant and various inflammatory lung lesions: comparison with [18F]FDG PET/CT

Article 05 September 2023

Lung cancer detection from thoracic CT scans using an ensemble of deep learning models

Article 21 November 2023

Deep learning for lung Cancer detection and classification

Article 02 January 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Chronic obstructive pulmonary disease (COPD) is among the leading causes of early deaths worldwide [1], and 70% of COPD is estimated to be under-diagnosed [2, 3]. Emphysema is a key component of COPD that is characterized by the destruction of lung parenchyma [4]. Usually, emphysema is diagnosed at the later stages of the disease’s progression and is itself an independent risk factor for lung cancer [5]. Therefore, early detection of emphysema is important.

Low-dose computed tomography (LDCT) has been shown to be capable of detecting lung cancer and also provides an opportunity to detect comorbidities like emphysema in early stages [6]. However, LDCT contains inherent noise, and screening asymptomatic participants means there are more normal scans than abnormal ones, making emphysema detection labor-intensive [7, 8]. On CT imaging, emphysematous lung regions with reduced tissue density appear as areas of low attenuation. For quantitatively assessing emphysema in CT, the low attenuation areas (LAA) under a specific cut-off threshold value are computed, for example, less than −950 HU. Although this method is widely used, it is prone to measurement variation and lacks consensus on an optimal cut-off threshold, leading to uncertainty in the diagnosis [9]. Past studies have proposed automatic emphysema detection using deep learning (DL) algorithms as a solution to bypass these issues and reduce the burden on radiologists while leveraging the lung cancer screening LDCT dataset [10, 11].

The existing supervised machine learning algorithms [12, 13] or DL algorithms for automatic emphysema detection like 3D convolutional neural networks (CNNs) [14], deep-CNNs with long short-term memory [15], and transfer learning models like 3D ResNet [16] require disease localized annotations, which are difficult to obtain for large datasets, or they are primarily developed using HRCT [17, 18]. This motivated us to develop an unsupervised model for emphysema in screening studies.

Data augmentations have been suggested as an aid to task-specific unsupervised DL models [19]. Typically, data augmentation consists of techniques such as geometric transformation, kernel filters, and feature augmentation that enhance the size and quality of the training dataset for the task-specific models [20].

In emphysema diagnostics, minimum intensity projection (minIP) is used as a visualization technique to detect low-density structures (low attenuation areas) in a given computed tomography (CT) volume and emphasize the subtle features of trace and mild emphysema [21, 22]. The purpose of this study is to test the feasibility of applying minIP as a disease-specific augmentation to the proposed unsupervised DL algorithm for automatic emphysema detection in LDCT.

Studies have indicated that varying minIP slab thickness can affect the qualitative assessment of emphysema [23, 24]. However, its effects on DL models remain unknown, and so, along with the development of a minIP-based DL algorithm, we investigated the effects of different slab thicknesses on the DL algorithm for emphysema.

The notable contributions of this study are as follows: (1) we used an unsupervised DL algorithm to address the annotation-less and class-imbalanced scenarios that normally characterize lung cancer screening LDCTs; (2) we tested the feasibility of using clinical domain knowledge such as minIP to emphasize the minimal differences of emphysema regions in LDCT for unsupervised learning; (3) we explored the effects of different slab thickness of minIP on DL algorithm; (4) we generated detection maps to interpret the model predictions and to serve as a quality check; and (5) we validated our model on lung cancer screening data to check the efficacy of the proposed DL algorithm in a real use-case.

Materials and Method

Study Population

Medical ethics committee approval was obtained prior to the study. The population for this retrospective study was chosen from two different cohorts. The first was a general population study in the Netherlands (Imaging in Lifelines or ImaLife) designed to find early imaging biomarkers for the “big-three” thoracic diseases, COPD, coronary artery disease, and lung cancer. The second was the National Lung Cancer Screening Trial (NLST) which was carried in the USA. It is one of the biggest collections of lung cancer screening LDCT data and contains case-specific datasets with annotations. The details regarding the eligibility criteria for the participants of ImaLife and NLST have been previously described [25, 26].

Acquisition Protocol

The CT acquisition protocol for the ImaLife study participants included a low-dose chest CT examination with a third-generation dual-source CT scanner (SOMATOM Force, Siemens). All the scans were reconstructed in the axial plane with filtered back projection using a soft kernel (Br40) with a single slice thickness of 1 mm and 0.7-mm increments [25]. The NLST subcohort contained LDCT scans from various hospitals where multi-detector scanners from GE medical systems, Siemens, Toshiba, and Philips with varying slice-thicknesses were used to produce the scans [27]. A brief overview of CT acquisition and reconstruction protocols used in the subcohorts is shown in Table 1. All the scans used in this study were acquired during end-inspiration breath-hold and without the administration of any contrast media.

Table 1 The CT acquisition and reconstruction protocol for the dataset from ImaLife and NLST

Full size table

Visual Scoring

Each scan from ImaLife was visually scored by one of three trained medical professionals (two radiologists with more than 10 years of experience and one trained technical physician). The three readers followed a standard annotation protocol based on the Fleischner criteria [28]. For this project, the visual scoring was consolidated to dichotomous emphysema diagnosis per scan. The scans with no emphysema annotation were considered normal, and the severity categories trace, mild, moderate, confluent, and advanced destructive emphysema were considered abnormal. For NLST subcohort, radiologists from various hospitals participating in the NLST screening trial had annotated the scans with a yes or no diagnosis for emphysema, and this information was directly used for the external validation [27].

Quantitative CT Measurements of Emphysema

To measure the quantitative characteristics of the subcohorts and evaluate the potential of DL algorithm trained on visual scoring over traditional quantitative analysis, we performed quantitative CT measurements of emphysema using the routinely used automatic densitometry tool (version 4.4.13, Aquarius iNtuition, TeraRecon). The lung region was semi-automatically segmented, and emphysema was quantified as the percentage of all lung area (voxels) having attenuation lower than −950HU (%LAA). Participants with %LAA were divided into two subgroups for analysis, with %LAA ≤ 5% categorized as non-emphysema and %LAA > 5% categorized as emphysema [29].

Minimum Intensity Projection (minIP)

MinIP is a volume rendering technique where the voxels with lowest attenuation in an image slice are projected to form a bidimensional slab. The slab thickness can be varied based on the number of slices used. First, the acquired scans were re-sampled and normalized to compensate for kernel differences [30]. Then the voxels with lowest Hounsfield units on each slice of a patient scan were projected to form varying slab-thickness. In this study, we generated minIP slabs with thicknesses ranging from 1 up to 11 mm in 2-mm increments for the comparative evaluation. In routine evaluation for emphysema, radiologists use 5 to 10 mm minIP slabs on every slices to carefully check for low attenuation areas [21, 31]. Our study evaluated a wider range to assess the effectiveness of the DL algorithm for each slab thickness separately. Illustration of emphysema case with different MinIP settings with slab thicknesses varying between 1 and 100 mm is shown in Fig. A of Online Resource 1. The algorithm for minIP is implemented in Python and will be made available upon request.

Training and Testing Split

For the development of the DL model, the subcohorts were divided into three datasets. The first was a dataset containing non-emphysema scans for adversarial auto-encoder training, the second was a class-balanced dataset for internal validation, and the third was a class-imbalanced (higher number of non-emphysema scans) dataset for external validation. A total of 160 (80%) out of 200 non-emphysema scans from the ImaLife subcohort were used as the training dataset. The internal validation dataset contained the remaining 40 (20%) normal scans and 40 emphysema scans from the ImaLife subcohort. The complete NLST subcohort was used as the external validation dataset. The consort diagram illustrating the data streams for training and internal and external validation is shown in Fig. 1. It is important to note that the above procedure was followed separately for each minIP slab thickness.

Deep Learning Algorithm

To build a prototype for automatic classification of emphysema in an annotation-less, class-imbalanced environment, we used an unsupervised anomaly detection scheme. The model takes minIPs of certain slab thickness as inputs with a dimension of (512 × 512) × n sampled evenly over the height of the lungs, where n represents the number of axial slices for each participant. The predictions are then concatenated in a sequence of axial slices for the final participant score. The model performs classification on the participant level, so that for a participant to be classified as negative, all slices considered from each participant scan had to be classified as non-emphysema. Otherwise, the participant was classified as positive. The complete DL algorithm was implemented on PyTorch (version 5.1, Python 3.7.1, CUDA 10.1) and executed on a NVIDIA Titan XP GPU with 16 GB memory.

Model Architecture

The classification network was built using adversarial auto-encoders, which can be divided into a generator block and a discriminator block (Fig. 2). The first part of the generator block consists of an encoder with fully convolutional layers capable of encoding the high-dimensional image-representation into a low-dimensional latent-representation. The second part of the generator architecture is a decoder that can decode the low-dimensional latent-representation back into a high-dimensional image-representation (X’) [32]. The generator architecture includes sixteen fully convolutional downsampling and upsampling layers with a kernel size of 4 × 4 and stride 2, where each layer is followed by batch normalization and Leaky ReLU activation functions. The details of architectural components and the loss functions have been defined earlier [33]. The encoder’s feature maps are forwarded to the decoder via skip connections with dropout regularization to translate intrinsic image information between blocks without overfitting. This feature maps from the decoder are fed to the discriminator block and the prediction outputs are obtained as anomaly score for each participant scan. The discriminator block has a similar architecture as the generator’s encoder, consisting of eight fully convolutional downsampling layers with a kernel size of 4 × 4 and stride 2, where each layer is followed by batch normalization and Leaky ReLU activation function. We used the discriminative image features from the last convolutional layer of the discriminator as a priori to decide the candidate/anomaly regions in detection maps. Additionally, to train the model with a limited dataset and avoid overfitting, we incorporated discriminator heuristics augmentation [34].

Model Training

During training, the model was fed with axial slices of given minIP slab thickness in which emphysema was not present so that the generator and the discriminator combination could learn to map the intrinsic properties of non-emphysema lungs. The training was based on minimizing the three loss functions, namely, contextual loss based on L1 distance (or generator’s loss), latent loss (L2 distance measure), and adversarial loss (or discriminator’s loss). The definitions of these three loss functions have been previously published [32].

The trained network in the inference phase classifies the abnormal emphysema scans as anomalies and calculates a prediction score based on generator loss and latent loss functions. The prediction score is the prediction probability provided by the discriminator as a continuous variable ranging from 0 to 1, with higher scores corresponding to emphysema detected.

Detection Maps

Along with the prediction scores, the modified network was designed to provide binary masks or residual images for every test image. This was done by subtracting the input image (X) and the generated image (X’) from the generator block [35]. Afterwards, these residual images were post-processed using a lung lobe segmentation algorithm to automatically remove anything apart from the candidate regions generating detection maps. We used an available pre-trained deep learning algorithm trained on the Lung Tissue Research Consortium dataset to perform the post-processing task [36]. The detection maps were then superimposed on the input image to serve as a quality check.

Model Evaluation and Statistics

Our model evaluation was performed in three different stages:

Stage 1: Internal–external validation: The validation findings for each minIP slab thickness (1, 3, 5, 7, 9, and 11 mm) were analyzed using the model’s area under the receiver operating curve (AUC), sensitivity, and specificity. Bootstrapping with 1,000 iterations was performed on AUC to find the confidence interval (CI). Since the external validation dataset was chosen to be a class-imbalanced dataset, we incorporated the F1 score as one of the performance indicators [37]. The optimal minIP slab thickness must show high sensitivity and AUC, with a low number of false negatives. To compare the performances of the various minIP slab thickness, we used McNemar’s test in the IBM SPSS Statistics tool (version 22) [38].
Stage 2: The inter-rater reliability between visual scoring and %LAA was assessed using Kappa. In addition, to test the performance of the visual scoring-based DL algorithm on %LAA-based categorization, we used our optimal minIP slab thickness.
Stage 3: To enhance the explainability of the DL algorithm, randomly selected 2D detection maps from the DL algorithm were compared to the bounding box annotations of the radiologists.

Results

Population Characteristics

The ImaLife subcohort used for training and internal validation consisted of 240 participants (male = 116 [48.4%] and female = 124 [51.6%]); the mean age ± SD at enrollment was 57 ± 6 years. Visual scoring by radiologists indicated 40 (17%) individuals with emphysema in the subcohort. The quantitative CT measurement of emphysema (%LAA −950) in the ImaLife subcohort showed a median of 4% with an interquartile range of 1.8 to 7.8%. Out of 240 individuals, 136 (56.6%) had %LAA ≤ 5%, and 104 (43.4%) had %LAA > 5%.

The NLST subcohort contained 125 (male = 79 [63.2%] and female = 46 [36.8%]) patients with mean age ± SD of 64 ± 5 years at enrollment. The emphysema annotation of the NLST subcohort indicated that 33% individuals had emphysema. The quantitative CT measurement subcohort had median of 15.1% (interquartile range, 5.3 to 28.3%) of emphysema. Out of 125 patients, 25 (20%) had %LAA ≤ 5% and 100 (80%) had %LAA > 5%.

The population characteristics of the subcohorts are shown in Table 2.

Table 2 Population characteristics of ImaLife and NLST subcohorts

Full size table

Model Evaluation

Internal Validation

The minIP-based DL model automatically detected emphysema in the ImaLife subcohort with a sensitivity of 88% in a class-balanced setting. The internal validation results are shown in Fig. 3a and Table 3. For the slab thicknesses from 1 to 11 mm, there was a positive effect on the DL pipeline performance, that is, increasing the slab thickness resulted in an increase in area under receiver operating characteristics curve (AUC). The classifier’s false-negative predictions decreased by 50% (from 10 to 5) when the slab thickness was increased (1 to 11 mm). Of all the minIP slab thicknesses that were tested, the DL pipeline showed the highest performance for 9 and 11 mm, both in terms of AUC (0.90 ± 0.05 and 0.88 ± 0.05) and false negatives (5/40 and 6/40). There was a small but statistically significant difference of 2% in AUC between 9 and 11 mm (p-value = 0.0348).

Table 3 Performance metrics of the DL model for different minIP slab-thicknesses on the ImaLife subcohort

Full size table

In Fig. 4, examples of non-emphysema and emphysema scans before and after applying minIP are shown. It can be seen that apart from helping to visualize low-density regions, or emphysema regions, applying minIP also reduces the complexity of the image by suppressing the high-contrast regions. Our DL algorithm was run without any minIP, which clearly demonstrated that adding greater than 1 mm minIP slab thickness to 1 mm slice thickness LDCT scans can improve the algorithm’s performance. This is shown using the class separation plots obtained from the DL algorithm for emphysema and non-emphysema scans in Fig. B of Online Resource 1 (electronic supplementary material), where an increase in separation between the classes is observed as the minIP slab thickness increases.

External Validation

In the NLST subcohort out of 33% emphysema scans, 79% were detected by the DL pipeline. In a class-imbalanced setting, the 11 mm slab thickness was found to be most sensitive, with a performance of AUC = 0.77 ± 0.06 (Fig. 3b) and nine false negatives being the least among all the minIP slabs (Table 4). The model’s F1 scores were highest for 11 mm (0.70) and 9 mm (0.67) slab thickness, indicating an acceptable performance in a real-world setting.

Table 4 Performance of the DL model for different minIP slab-thicknesses on the NLST subcohort

Full size table

Quantitative CT-Based Evaluation

In the ImaLife subcohort, there was fair agreement on emphysema and non-emphysema scans between our center’s radiologists and quantitative CT analysis (63% concordance, kappa 0.241). In the NLST subcohort, a slight agreement between procured annotation and quantitative CT analysis with 40% concordance (kappa 0.104) was observed. This indicated that the label noise in the NLST subcohort was higher than that of the ImaLife subcohort.

Our optimal minIP model (minIP slab thickness 11 mm), when tested on %LAA-based categorization, yielded an AUC of 0.79 ± 0.02 in the ImaLife subcohort and an AUC of 0.70 ± 0.04 in the NLST subcohort.

Detection Maps

The quality check of the detection maps from the DL model by the radiologists revealed that the prediction regions in detection maps were accurate in highlighting emphysema regions in 97% (34 out of 35) and 91% (30 out of 33) of the predicted cases in internal and external-validation datasets, respectively. An example of a detection map is illustrated in Fig. 5.

Discussion

This novel study aimed to evaluate the feasibility of a minIP-based DL algorithm for automatic emphysema detection in LDCT. Our results show that the proposed DL model trained on LDCT accurately detects the presence of emphysema with a sensitivity of 88% and aids emphysema detection in lung cancer screening. The multi-scale assessment of the DL model revealed that 11 mm minIP slab thickness was optimal in both general population (90%) and lung cancer screening (77%) datasets. The application of minIP enabled the unsupervised DL model to learn faster (in fewer epochs) by reducing the complexity of the image (suppressing the vessels and high-intensity phenotypes) and enabling the DL model to focus on disease-specific features. MinIP also helped collapse 3D information into a more efficient 2D representation, thereby reducing the computational burden. In our study, the performance of thicker minIP slabs (7 to 11 mm) was better than that of thinner slabs (1 to 3 mm). This is similar to those used in the routine clinical evaluation. Lan et al. also found thick slab minIPs (5 to 10 mm) to be more effective, with negligible differences between these minIP slabs [22].

The external validation of the DL model on the class-imbalanced NLST subcohort achieved a sensitivity of 79%, and there was a drop in overall model performance compared to the internal validation. This might be for two reasons. The first is the label noise: in the NLST dataset, the presence of emphysema on imaging is only recorded as yes or no and does not indicate any additional information on the evaluation method or protocol used, making meaningful interpretation difficult. The second reason is that the model is sensitive to reconstruction parameters (especially slice thickness and slice increment) and the NLST dataset contained varying reconstruction parameters, which could have influenced the model’s performance. Although the variation in slice thickness in the external validation dataset could have been compensated for, the scope of the study was to test the model’s constraints and check its creditability in a real-world setting. Examples of false negative cases that the model failed to classify are shown in Fig. 6.

In the clinical setting, lung densitometry (%LAA) is predominantly used to assess emphysema in CT. However, visual emphysema scoring is less sensitive to image noise and can more precisely discriminate between the presence or absence of emphysema [39] and so, our model was developed on visual scoring instead of CT densitometry based scoring. Concurrently, we found the kappa agreement between visual scoring and CT densitometry to be fair and slight for ImaLife and NLST subcohorts, respectively. A similar level of agreement was observed between visual assessment and quantitative CT in the COPDGene study for emphysema detection [39].

Tang et al. developed a transfer learning DL pipeline to classify COPD patients in lung cancer screening LDCTs. Although the authors do not address emphysema specifically, the reported AUC of the model on percentage low attenuation areas (%LAA) was 74% [11]. Therefore, we tested our DL model on %LAA-based categorization and found comparable results.

Furthermore, Humphries et al. performed automatic classification of emphysema using a CNN-LSTM network with 79% accuracy on COPDGene datasets [15], and Hatt et al. used a dataset from a similar cohort using a 3D-Resnet-CNN model and achieved an accuracy of 79.8% [16]. Although these studies have shown that it is possible to use DL models for emphysema, they vary in terms of model architecture (supervised) and inclusion protocol of participants (based on CT dose, GOLD criteria, or scoring), and so directly comparing them with our model is not possible. Moreover, none of the studies addresses the data imbalance that exists in the real-world scenario, while our model with adversarial training is well-suited to class-imbalanced tasks. Previously Nagaraj et al. compared the same adversarial network with RESNET in class-imbalance settings and adversarial model outperformed the pre-trained model showing the potential of adversarial network [33].

In this study, detection maps for anomalies were compared to the visual identification of emphysema by radiologists, and clinically acceptable performance was observed. Although the pixel-wise emphysema localization via detection maps can be used to verify the model’s predictions, the detection maps are only 2D axial sections, and they cannot be considered for 3D emphysema quantification. However, they may be used as an annotation tool for emphysema.

For our future work, we intend to combine objective measurements such as pre- and post-bronchodilator spirometry combined with visual scoring to validate the classifier’s performance. By utilizing our model, the radiologists’ confidence in identifying emphysema can be increased by providing a comparison methodology like our model that is capable of classifying the overall scans into emphysema and non-emphysema categories and pinpointing the emphysema regions in the scans. Additionally, there is no threshold to toggle as in the HU-based method, which is usually subject to bias.

The main limitation of our model is that it does not classify emphysema based on severity levels. Our immediate future work will focus on training the minIP model with a multi-protocol diverse dataset. Additionally, the experiments will be designed to evaluate the effects of combining multiple minIPs to determine whether this can compensate for protocol variations and validate the model on a large scale lung cancer screening dataset. Combining this DL model with automatic nodule exclusion may aid comprehensive lung disease and mortality evaluation in LDCT lung cancer screening.

Conclusions

We developed an automatic minIP-based DL model for the classification and detection of emphysema in LDCT. Using minIP as a disease-specific augmentation technique, the unsupervised DL algorithm becomes a robust model to address the annotation-less and class-imbalanced scenarios that normally characterize lung cancer screening LDCTs. The DL model is sensitive to scan slice thickness and needs to be validated on multi-cohort datasets. When deployed in screening setup, our model can assist large-scale emphysema classification and provide the detection maps that can act as priori to increase the confidence in the decision.

Availability of Data and Material

NLST data is available on request.

Code Availability

Available on request.

Abbreviations

minIP:: Minimum intensity projection
DL:: Deep learning
ImaLife:: Imaging in lifelines
NLST:: National lung cancer screening trail
LDCT:: Low-dose computer tomography
HU:: Hounsfield unit
LAA:: Low attenuation areas
AUC:: Area under the receiver operating curve
CI:: Confidence interval

References

Foreman KJ, Marquez N, Dolgert A, Fukutaki K, Fullman N, McGaughey M, et al: Forecasting life expectancy years of life lost and all-cause and cause-specific mortality for 250 causes of death: reference and alternative scenarios for 2016–40 for 195 countries and territories. Lancet 392(10159):2052–2090, 2018
Article PubMed PubMed Central Google Scholar
Diab N, Gershon AS, Sin DD, Tan WC, Bourbeau J, Boulet L-P, et al: Underdiagnosis and overdiagnosis of chronic obstructive pulmonary disease. Am J Respir Crit Care Med 198(9):1130–1139, 2018
Article PubMed Google Scholar
Ruparel M, Quaife SL, Dickson JL, Horst C, Tisi S, Hall H, et al: Prevalence Symptom Burden and Underdiagnosis of Chronic Obstructive Pulmonary Disease in a Lung Cancer Screening Cohort. Ann Am Thorac Soc 17(7):869–878, 2020
Article PubMed PubMed Central Google Scholar
Hoidal JR, Niewoehner DE: Pathogenesis of emphysema. Chest 83(4):679–685, 1983
Article CAS PubMed Google Scholar
Gonzalez J, Marín M, Sánchez-Salcedo P, Zulueta JJ: Lung cancer screening in patients with chronic obstructive pulmonary disease. Ann Transl Med 4(8), 2016. http://atm.amegroups.com/article/view/9852
Durawa A, Dziadziuszko K, Jelitto-Górska M, Szurowska E: Emphysema–The review of radiological presentation and its clinical impact in the LDCT screening era. Clin Imaging 64:85–91, 2020
Article PubMed Google Scholar
Schilham AMR, Van Ginneken B, Gietema H, Prokop M: Local noise weighted filtering for emphysema scoring of low-dose CT images. IEEE Trans Med Imaging 25(4):451–463, 2006
Article PubMed Google Scholar
Aberle DR, Adams AM, Berg CD, Black WC, Clapp JD, Fagerstrom RM, et al: Reduced Lung Cancer Mortality with Low-Dose Computed Tomographic Screening. N Engl J Med 365(5):395–409, 2011
Article PubMed Google Scholar
Mascalchi M, Camiciottoli G, Diciotti S: Lung densitometry: why how and when. J Thorac Dis 9(9):3319–3345, 2017
Article PubMed PubMed Central Google Scholar
Ardila D, Kiraly AP, Bharadwaj S, Choi B, Reicher JJ, Peng L, et al: End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography. Nat Med 25(6):954–961, 2019
Article CAS PubMed Google Scholar
Tang LYW, Coxson HO, Lam S, Leipsic J, Tam RC, Sin DD: Towards large-scale case-finding: training and validation of residual networks for detection of chronic obstructive pulmonary disease using low-dose CT. Lancet Digit Heal 2(5):259–267, 2020
Article Google Scholar
Gangeh MJ, Sørensen L, Shaker SB, Kamel MS, De Bruijne M, Loog M: A texton-based approach for the classification of lung parenchyma in CT images. In: Jiang T, Navab N, Pluim JPW, Viergever MA Eds. Medical Image Computing and Computer-Assisted Intervention – MICCAI 2010. Heidelberg: Springer Berlin, 2010, pp 595–602
Chapter Google Scholar
Peng L, Lin L, Hu H, Ling X, Wang D, Han X, et al: Joint weber-based rotation invariant uniform local ternary pattern for classification of pulmonary emphysema in CT images. In: Proceedings - International Conference on Image Processing. ICIP, 2018, pp 2050–2054
Ahmed J, Vesal S, Durlak F, Kaergel R, Ravikumar N, Rémy-Jardin M, et al: COPD classification in CT images using a 3D convolutional neural network. In: Tolxdorff T, Deserno T, Handels H, Maier A, Maier-Hein K, Palm C Eds. Informatik aktuell. Wiesbaden: Springer Vieweg, 2020, pp 39–45
Google Scholar
Humphries SM, Notary AM, Centeno JP, Strand MJ, Crapo JD, Silverman EK, et al: Deep learning enables automatic classification of emphysema pattern at CT. Radiology 294(2):434–444, 2020
Article PubMed Google Scholar
Hatt C, Galban C, Labaki W, Kazerooni E, Lynch D, Han M: Convolutional neural network based COPD and emphysema classifications are predictive of lung cancer diagnosis. In: Stoyanov D, et al Eds. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Cham: Springer, 2018, pp 302–309
Google Scholar
Pino Peña I, Cheplygina V, Paschaloudi S, Vuust M, Carl J, Weinreich UM, et al: Automatic emphysema detection using weakly labeled HRCT lung images. PLoS One 13(10):e0205397, 2018
Article PubMed PubMed Central Google Scholar
Karabulut EM, Ibrikci T: Emphysema discrimination from raw HRCT images by convolutional neural networks. In: ELECO 2015 - 9th International Conference on Electrical and Electronics Engineering, 2016, pp 705–708
Abdollahi B, Tomita N, Hassanpour S. Data Augmentation in Training Deep Learning Models for Medical Image Analysis. In: Nanni L, Brahnam S, Brattin R, Ghidoni S, Jain LC Eds. Deep Learners and Deep Learner Descriptors for Medical Applications. Cham: Springer International Publishing, 2020, pp 167–180
Chapter Google Scholar
Shorten C, Khoshgoftaar TM: A survey on Image Data Augmentation for Deep Learning. J Big Data 6(1):60, 2019
Article Google Scholar
Ghonge NP, Chowdhury V: Minimum-intensity projection images in high-resolution computed tomography lung: Technology update. Lung India Off Organ Indian Chest Soc 35(5):439, 2018
Article Google Scholar
Lan H, Nishitani H, Nishihara S, Ueno J, Takao S, Iwamoto S, et al: Using the MDCT thick slab MinIP method for the follow-up of pulmonary emphysema. J Med Invest 58(3–4):175–179, 2011
Article PubMed Google Scholar
Remy-Jardin M, Remy J, Gosselin B, Copin MC, Wurtz A, Duhamel A: Sliding thin slab minimum intensity projection technique in the diagnosis of emphysema: Histopathologic-CT correlation. Radiology 200(3):665–671, 1996
Article CAS PubMed Google Scholar
Satoh S, Ohdama S, Shibuya H: Sliding thin slab minimum intensity projection imaging for objective analysis of emphysema. Radiat Med 24(6):415–421, 2006
Article PubMed Google Scholar
Xia C, Rook M, Pelgrim GJ, Sidorenkov G, Wisselink HJ, van Bolhuis JN, et al: Early imaging biomarkers of lung cancer COPD and coronary artery disease in the general population: rationale and design of the ImaLife (Imaging in Lifelines) Study. Eur J Epidemiol 35(1):75–86, 2020
Article PubMed Google Scholar
Gatsonis CA, Aberle DR, Berg CD, Black WC, Church TR, Fagerstrom RM, et al: The national lung screening trial: Overview and study design. Radiology 258(1):243–253, 2011
Article PubMed Google Scholar
Aberle DR, Adams AM, Black WC, Clapp JD, Fagerstrom RM, Gareen IF, Gatsonis C, Marcus PM, Sicks JD: Reduced Lung-Cancer Mortality with Low-Dose Computed Tomographic Screening. N Engl J Med 365(5):395–409, 2011
Article PubMed Google Scholar
Lynch DA, Austin JHM, Hogg JC, Grenier PA, Kauczor H-U, Bankier AA, et al: CT-Definable subtypes of chronic Obstructive Pulmonary Disease: A Statement of the Fleischner Society 1. Radiology 277(1):192–205, 2015
Article PubMed Google Scholar
Lynch DA, Moore CM, Wilson C, Nevrekar D, Jennermann T, Humphries SM, et al: CT-based visual classification of emphysema: Association with mortality in the COPDGene study. Radiology 288(3):859–866, 2018
Article PubMed Google Scholar
Gallardo-Estrella L, Lynch DA, Prokop M, Stinson D, Zach J, Judy PF, et al: Normalizing computed tomography data reconstructed with different filter kernels: effect on emphysema quantification. Eur Radiol 26(2):478–486, 2016
Article PubMed Google Scholar
Satoh S, Kitazume Y, Taura S, Kimula Y, Shirai T, Ohdama S: Pulmonary emphysema: Histopathologic correlation with minimum intensity projection imaging, high-resolution computed tomography and pulmonary function test results. J Comput Assist Tomogr 32(4):576–582, 2008
Article PubMed Google Scholar
Akcay S, Atapour-Abarghouei A, Breckon TP: Skip-GANomaly: Skip Connected and Adversarially Trained Encoder-Decoder Anomaly Detection. In: 2019 International Joint Conference on Neural Networks (IJCNN), 2019, pp 1–8
Nagaraj Y, Cornelissen L, Cai J, Wisselink HJ, Rook M, Vliegenthart R, Veldhuis RNJ, Oudkerk M van Ooijen P: An unsupervised anomaly detection model to classify emphysema in low-dose chest computed tomography. techrxiv, 2020. https://doi.org/10.36227/techrxiv.16670899.v1
Karras T, Aittala M, Hellsten J, Laine S, Lehtinen J, Aila T: Training Generative Adversarial Networks with Limited Data. 2020, http://arxiv.org/abs/2006.06676
Baur C, Wiestler B, Albarqouni S, Navab N: Deep autoencoding models for unsupervised anomaly segmentation in brain MR images. In: Crimi A, Bakas S, Kuijf H, Keyvan F, Reyes M, van Walsum T Eds. Brainlesion: Glioma Multiple Sclerosis Stroke and Traumatic Brain Injuries. Springer, Cham, 2019, pp 161–169
Chapter Google Scholar
Hofmanninger J, Prayer F, Pan J, Rohrich S, Prosch H, Langs G: Automatic lung segmentation in routine imaging is a data diversity problem not a methodology problem. Eur Radiol Exp 4(1):1–13, 2020
Article Google Scholar
Powers DMW: Evaluation: from precision recall and F-measure to ROC informedness markedness and correlation. Mach Learn Technol 2:37–63, 2020
Google Scholar
Fagerland MW, Lydersen S, Laake P: The McNemar test for binary matched-pairs data: mid-p and asymptotic are better than exact conditional. BMC Med Res Methodol 13(1):91, 2013
Article PubMed PubMed Central Google Scholar
Amaza IP, O’shea AMJ, Fortis S, Comellas AP: Discordant quantitative and visual ct assessments in the diagnosis of emphysema. Int J Chron Obstruct Pulmon Dis 16:1231, 2021
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Titan XP GPU used for this research.

Funding

Part of this work was realized within the Imaging in Lifelines (ImaLife) study supported by the institutional research grant from the Siemens Healthineers and by the Ministry of Economic Affairs and Climate Policy (EZK) by means of the public–private partnership (PPP) allowance made available by the Top Sector Life Sciences and Health to stimulate public–private partnerships. Part of this work was realized within the Deep Learning Algorithms for Medical Image Evaluation (DAME) project, funded by the INTERREG V. A Germany-Netherlands program with resources from the European Regional Development Fund and co-funded by the Netherlands Ministry of Economic Affairs and Climate Policy (EZK), the Province of Groningen and the Niedersächsisches Ministerium für Bundes-und Europaangelegenheiten und Regionale Entwicklung.

Author information

Authors and Affiliations

Department of Radiation Oncology, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
Yeshaswini Nagaraj & Peter van Ooijen
DASH, Machine Learning Lab, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
Yeshaswini Nagaraj & Peter van Ooijen
Department of Radiology, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
Hendrik Joost Wisselink, Mieneke Rook & Rozemarijn Vliegenthart
Department of Radiology, Martini Hospital Groningen, Groningen, The Netherlands
Mieneke Rook
Department of Epidemiology, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
Jiali Cai & Grigory Sidorenkov
Department of Clinical Pharmacy and Pharmacology, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
Sunil Belur Nagaraj
Faculty of Electrical Engineering, Mathematics Computer Science (EWI), Data Management Biometrics (DMB), University of Twente, Enschede, The Netherlands
Raymond Veldhuis
Faculty of Medical Sciences, University of Groningen, Groningen, The Netherlands
Matthijs Oudkerk
Institute for DiagNostic Accuracy Research B.V., Groningen, The Netherlands
Matthijs Oudkerk

Authors

Yeshaswini Nagaraj
View author publications
You can also search for this author in PubMed Google Scholar
Hendrik Joost Wisselink
View author publications
You can also search for this author in PubMed Google Scholar
Mieneke Rook
View author publications
You can also search for this author in PubMed Google Scholar
Jiali Cai
View author publications
You can also search for this author in PubMed Google Scholar
Sunil Belur Nagaraj
View author publications
You can also search for this author in PubMed Google Scholar
Grigory Sidorenkov
View author publications
You can also search for this author in PubMed Google Scholar
Raymond Veldhuis
View author publications
You can also search for this author in PubMed Google Scholar
Matthijs Oudkerk
View author publications
You can also search for this author in PubMed Google Scholar
Rozemarijn Vliegenthart
View author publications
You can also search for this author in PubMed Google Scholar
Peter van Ooijen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yeshaswini Nagaraj.

Ethics declarations

Ethics Approval

Approved by Institutional Review Board–the Medical Ethical Committee (METc) of the University Medical Center Groningen.

Informed Consent

Written informed consent was obtained from all subjects of the ImaLife and NLST study.

Study Subjects or Cohorts Overlap

The details of eligibility criteria for the participants of ImaLife and National Lung Cancer screening (NLST) have been previously described. There was no-overlap of study subjects or cohorts.

Statistics and Biometry

One of the authors has significant statistical expertise.

Conflict of Interest

Prof. Dr. Rozemarijn Vliegenthart reports an institutional research grant from Siemens Healthineers. There are no other interests to disclose. Rest of the authors of this manuscript declare no relationships with any companies, whose products or services may be related to the subject matter of the article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Key Points

• MinIP as a disease-specific augmentation applied to LDCT aids an unsupervised DL model in emphysema detection

• The minIP-based DL model is tested in both class-balanced and class-imbalanced settings

• The detection maps from the proposed model allowed for precise pinpointing of emphysema regions in the LDCT scans

• Automatic emphysema detection in LDCT can leverage the lung cancer screening LDCT for early stage emphysema diagnosis

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (PDF 251 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Nagaraj, Y., Wisselink, H., Rook, M. et al. AI-Driven Model for Automatic Emphysema Detection in Low-Dose Computed Tomography Using Disease-Specific Augmentation. J Digit Imaging 35, 538–550 (2022). https://doi.org/10.1007/s10278-022-00599-7

Download citation

Received: 29 June 2021
Revised: 03 January 2022
Accepted: 29 January 2022
Published: 18 February 2022
Issue Date: June 2022
DOI: https://doi.org/10.1007/s10278-022-00599-7

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

AI-Driven Model for Automatic Emphysema Detection in Low-Dose Computed Tomography Using Disease-Specific Augmentation

Abstract

Similar content being viewed by others

Value of [18F]AlF-NOTA-FAPI-04 PET/CT for differential diagnosis of malignant and various inflammatory lung lesions: comparison with [18F]FDG PET/CT

Lung cancer detection from thoracic CT scans using an ensemble of deep learning models

Deep learning for lung Cancer detection and classification

Introduction

Materials and Method

Study Population

Acquisition Protocol

Visual Scoring

Quantitative CT Measurements of Emphysema

Minimum Intensity Projection (minIP)

Training and Testing Split

Deep Learning Algorithm

Model Architecture

Model Training

Detection Maps

Model Evaluation and Statistics

Results

Population Characteristics

Model Evaluation

Internal Validation

External Validation

Quantitative CT-Based Evaluation

Detection Maps

Discussion

Conclusions

Availability of Data and Material

Code Availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethics Approval

Informed Consent

Study Subjects or Cohorts Overlap

Statistics and Biometry

Conflict of Interest

Additional information

Publisher's Note

Key Points

Supplementary Information

Supplementary file1 (PDF 251 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation