A multistage framework for respiratory disease detection and assessing severity in chest X-ray images

Sahoo, Pranab; Sharma, Saksham Kumar; Saha, Sriparna; Jain, Deepak; Mondal, Samrat

doi:10.1038/s41598-024-60861-6

A multistage framework for respiratory disease detection and assessing severity in chest X-ray images

Article
Open access
Published: 29 May 2024

Volume 14, article number 12380, (2024)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

A multistage framework for respiratory disease detection and assessing severity in chest X-ray images

Download PDF

Pranab Sahoo¹,
Saksham Kumar Sharma²,
Sriparna Saha¹,
Deepak Jain³ &
…
Samrat Mondal¹

527 Accesses
Explore all metrics

Abstract

Chest Radiography is a non-invasive imaging modality for diagnosing and managing chronic lung disorders, encompassing conditions such as pneumonia, tuberculosis, and COVID-19. While it is crucial for disease localization and severity assessment, existing computer-aided diagnosis (CAD) systems primarily focus on classification tasks, often overlooking these aspects. Additionally, prevalent approaches rely on class activation or saliency maps, providing only a rough localization. This research endeavors to address these limitations by proposing a comprehensive multi-stage framework. Initially, the framework identifies relevant lung areas by filtering out extraneous regions. Subsequently, an advanced fuzzy-based ensemble approach is employed to categorize images into specific classes. In the final stage, the framework identifies infected areas and quantifies the extent of infection in COVID-19 cases, assigning severity scores ranging from 0 to 3 based on the infection’s severity. Specifically, COVID-19 images are classified into distinct severity levels, such as mild, moderate, severe, and critical, determined by the modified RALE scoring system. The study utilizes publicly available datasets, surpassing previous state-of-the-art works. Incorporating lung segmentation into the proposed ensemble-based classification approach enhances the overall classification process. This solution can be a valuable alternative for clinicians and radiologists, serving as a secondary reader for chest X-rays, reducing reporting turnaround times, aiding clinical decision-making, and alleviating the workload on hospital staff.

WE-Net: An Ensemble Deep Learning Model for Covid-19 Detection in Chest X-ray Images Using Segmentation and Classification

High-dimensional multinomial multiclass severity scoring of COVID-19 pneumonia using CT radiomics features and machine learning algorithms

Article Open access 01 September 2022

Detection of Covid-19 in Chest X-Ray Images Using Percolation Features and Hermite Polynomial Classification

Find the latest articles, discoveries, and news in related topics.

Introduction

Respiratory diseases, including tuberculosis (TB)¹, pneumonia², and COVID-19³, are widespread and pose substantial health risks. Timely detection and screening are crucial to halting disease progression and enhancing the effectiveness of treatment. Chest X-rays (CXRs) play a pivotal role in the initial assessment of these conditions, where expert radiologists analyze images to pinpoint regions of infection. However, the resemblance in CXR findings among COVID-19, pneumonia, and TB patients complicates the interpretation process, making it labor-intensive, time-consuming, and susceptible to errors. WHO reports a shortage of doctors, with fewer than one available for every 1000 people in over 45% of countries globally. The onset of the COVID-19 pandemic in December 2019 further exacerbated this situation, affecting millions and disrupting the patient-doctor ratio. This makes each medical professional examine a huge number of radiography images, increasing the likelihood of human error with subjective differences¹. In recent years, the advancement of Deep Learning (DL) has been instrumental in various applications. Convolutional Neural Networks (CNNs) have shown remarkable success in speech recognition, object identification, image recognition, and machine translation tasks. CNNs excel at learning hierarchical visual features when provided with ample training data. However, acquiring extensive datasets in the medical domain remains a challenge. Researchers utilized the concept of transfer learning, where models are trained on a large-scale dataset like the ImageNet⁴, and then pre-trained weights are considered for a small target dataset. In order to support clinical decisions, many Computer-Aided Diagnosis (CAD) systems have been developed. These systems are becoming increasingly common in various radiology contexts, including workflow optimization, abnormality detection, and disease progression monitoring^1,5,6,7. Additionally, CAD-based solutions can offer doctors additional insights into the issue that might not be visible to the human eye. These solutions can prove invaluable in underserved areas and communities with limited access to trained radiologists.

Related work

Detecting the abnormality and finding the infected regions in CXR images is vital to adopt a suitable treatment plan. However, it is difficult to pinpoint the infection’s exact location since various abnormalities share the same kinds of patterns. To address this issue, the authors in⁸ introduced a new dataset containing 112,120 CXR images of 14 distinct abnormalities. They utilized a weakly supervised approach to generate a likelihood of infection map. However, the Grad-CAM generated infection map does align accurately with the actual bounding box indicated in the ground truth. A related study by Rajpurkar et al.⁵ used DenseNet-121 to develop novel architecture and reported a better performance than the prior baseline method presented in⁸. Several other works^9,10 also reported better performance using different advanced CNN models. After the outbreak of the recent pandemic in 2019, research efforts have redirected toward computer-aided detection (CAD) of respiratory diseases, including COVID-19^11,12,13,14. The authors in³ developed a new dataset for COVID-19 detection from pneumonia and normal CXR images and presented a novel CNN-based model. Nishio et al.¹⁵ incorporated one private and two public CXR datasets and utilized transfer learning using EfficientNet B5 for classifying COVID-19, non-COVID-19, and healthy cases using CXR images. These approaches primarily use whole CXR images for model training, collected from diverse institutions employing different imaging devices and settings. This variation introduces discrepancies in radiographic attributes such as lighting, contrast, and resolution, ultimately resulting in poor generalization. To address this, few studies^1,14,16,17 have incorporated lung segmentation before classification. The methods mentioned above employed one or a few segmentation models, choosing the best-performing model for final predictions. However, this approach has a significant risk of information loss, as selecting any single ’best’ model may only partially capture the complexity and diversity of the data. Another drawback of these works is that they mainly used data from the Montgomery¹⁸ and Shenzhen¹⁸ datasets, encompassing a range of CXR images with varying quality from both normal and Tuberculosis (TB) patients. Consequently, the segmentation performance tends to be suboptimal when dealing with severe COVID-19 cases exhibiting bilateral consolidation or fluid accumulation. This limitation significantly affects the system’s ability to classify such cases accurately. Moreover, earlier works have shown that a specific model can provide better results under particular conditions, highlighting the difficulty in achieving generalization^13,14. In response, previous research efforts^19,20,21,22 have utilized basic ensemble methods such as majority voting and weighted averaging.

The authors in²³ presented a framework utilizing three pre-trained models and utilized an ensemble learning for COVID-19 detection from CXR images. However, these basic ensemble methods do not utilize the confidence scores of the classifiers during the final prediction computation, resulting in lower performance. In prior research on COVID-19 classification, few advanced fuzzy ensemble approaches have been explored^24,25,26,27. Kundu et al.²⁸ proposed an ensemble technique using the Sugeno fuzzy integral (SFI) and Dey et al.²⁹ utilized Choquet fuzzy ensemble and achieved an accuracy of 99.02% on COVID-19 dataset. Additionally, the authors in¹² presented a rank-based ensemble technique by employing two non-linear functions and reported 98.05% accuracy on a three-class classification problem. Beyond disease classification, identifying the infection areas and assessing the severity level is crucial for understanding a patient’s condition and guiding suitable interventions. This information assists in monitoring disease advancement, optimizing the allocation of scarce medical resources, and strategizing clinical treatments effectively. However, studies addressing severity scoring in CXR images are limited^30,31,32. For instance, Mercaldo et al.³³ introduced a deep learning based technique for COVID-19 detection from CT-scan images and automatically highlights symptomatic areas in the CT images, enhancing diagnostic precision. Cohen et al.³⁴ utilized a severity rating system introduced in a previous work by Warren et al.³², assigning scores from 0-4 based on infection severity. Another researcher³⁰ developed an array-based approach for severity assessment. This method divided the lung regions into several subdivisions and assigned scores ranging from 0-3. Another author in³⁵ created and validated a severity assessment model encompassing COVID-19, influenza, and novel influenza, utilizing CT images from a diverse multi-center dataset and reported that the developed model also applies to patients with other types of viral pneumonia. In the proposed architecture, we have employed the modified the RALE scoring system introduced^12,32 to assess COVID-19 CXR image severity. A literature review summary is presented in the Supplementary File.

Contributions

To address the challenges outlined previously, we present a multi-stage architecture for disease classification (Pneumonia, TB, Normal, and COVID-19), identification of infection areas, and evaluating severity levels in COVID-19 patients. This research significantly advances the current state-of-the-art automated CXR image analysis, providing crucial support to medical professionals in diagnosing respiratory-related conditions.

We have proposed a multi-stage pipeline designed explicitly for lung segmentation, disease classification, infection region identification, and severity assessment from COVID-19 CXR images.
We employed several segmentation models to extract pertinent lung regions. The outputs from the top three models are combined and fed into the subsequent classification models.
Most studies have only used two or three disease categories: normal, pneumonia, and COVID-19. However, tuberculosis, which shares similarities with these respiratory diseases, can sometimes coexist with COVID-19. We have considered both experiments in the proposed work.
This study chooses a robust approach by utilizing three distinct CNN models rather than depending on a single classification model followed by the Gamma function for the ensemble.
In the third stage of our proposed network, we incorporated an infection segmentation module to pinpoint the specific regions affected by the infection.
Finally, we employed a severity assessment method utilizing the modified RALE scoring system. This approach allows for a detailed classification of COVID-19 CXR samples into distinct severity levels, including mild, moderate, severe, and critical.

Problem formalization

In our proposed method, we have developed a multi-stage architecture to tackle the challenges encountered in CAD-based systems. In the first phase, the segmentation models identify the pertinent lung areas. In the subsequent step, classification models categorize these identified lung sections into distinct classes, followed by a Gamma-based fuzzy ensemble (GFE). Considering the dataset $D = {{\{D^i\}}_{i=1}^N}$, N is the total samples. Segmentation masks corresponding to each image $M = {\{M^i\}}_{i=1}^N,$ where $M^i$ is the individual segmented image. The output from the three classification models is ensembled using the Gamma function and mathematically formulated as X = GFE $[\{C_1(M^i)\},\{C_2(M^i)\},\{C_3(M^i)\}]$. Here, $C_1$, $C_2$, and $C_3$ are three pre-trained classifiers, and X is the predicted class.

Dataset and methods

Dataset statistics

Normal and pneumonia CXR images are collected from the Kaggle repository ”Chest X-Ray Images (Pneumonia)”³⁶. The COVID-19 images are accumulated from another Kaggle repository³⁷. It is noteworthy that the images in this repository originated from diverse sources, including some low-quality images with questionable labels. To ensure data reliability, we curated 4966 COVID-19 CXR images. This curation involved the removal of low-resolution images where COVID-19 patterns were not clearly visible and excluding CXR images with multiple lung diseases (multi-label) by an experienced radiologist. 700 TB CXR images are collected from the Kaggle repository¹, a collection of two publicly available datasets: the Montgomery and Shenzhen¹⁸. For lung segmentation, we have employed the COVID-QU-Ex dataset¹⁷, the most extensive publicly accessible dataset related to COVID-19, along with the Montgomery and Shenzhen datasets. Furthermore, infection localization is pivotal for evaluating a patient’s condition and determining appropriate treatment strategies³⁸. We utilized 2951 COVID-19 infection segmentation data from the study³⁹, which included annotated ground-truth infection segmentation masks. Table 1 specifies detailed resources of the datasets.

Table 1 Summary of dataset resources.

Full size table

Methods

To develop a robust end-to-end architecture, we hypothesized the following:

Hypothesis 1

The Gamma-based ensembles may serve as multi-expert recommendations and lower the likelihood of incorrect diagnoses.

Hypothesis 2

The performance of the classification model may be enhanced by segmenting the lung areas in the initial stage.

Hypothesis 3

The proposed architecture’s infection segmentation module could improve the model’s interpretability rather than the Grad-CAM-based approach.

We proposed a multi-stage architecture to evaluate the hypothetical assumptions, as shown in Fig. 1. The suggested approach comprises three main steps: lung segmentation, classification, and infection region segmentation with severity assessment. The following sections provide explanations for each level.

Stage-I: segmentation

The proposed network’s first stage is extracting the relevant lung regions from the CXR images. CXR images acquired through different techniques often undergo varying compression methods, and different patients have different heart sizes, bone structures, cardio-thoracic ratios, and lung borders. These variations can introduce variability in the training process. The primary purpose of lung segmentation in the initial stage is to filter out irrelevant features, ensuring a more precise interpretation and generalization of the model in the subsequent classification phase. In our proposed network, we have utilized the UNet model⁴⁰ along with six distinct backbone models: Attention, Inception-V3, DenseNet-201, ResNet-50, VGG-16, and DenseNet-121. After evaluating the performance (refer to Table 2), we have selected UNet-DenseNet121, UNet, and UNet-VGG16 for the final ensemble based on Intersection over Union (IoU) and Dice scores. All the input images are resized to 256$\times $256 as CXR images often come in varying sizes. Resizing the images to a specific resolution ensures uniformity and reduces computational complexity during processing. Next, we have normalized the image pixel values to a [0, 1] range, which enhances the model’s stability during training by preventing gradient explosion or vanishing during backpropagation. We also have utilized data augmentation techniques such as rotation, flipping, scaling, and translation to increase the diversity of the training dataset. Rotation involved randomly rotating the images within a angle range, flipping included horizontal and vertical flips, scaling encompassed random scaling of the images, and translation involved random shifts in the image position. Refer to the Supplementary File for the input CXR samples, the corresponding ground truth, the segmented masks, and the ensemble lung masks.

Stage-II: classification models

The second stage classifies the segmented CXR images into specific classes based on the relevant lung regions. We have performed two classification experiments: one with a three-class (COVID-19, Normal, and Pneumonia) and another with a four-class (COVID-19, Normal, Pneumonia, and TB). We have adopted Inception-V3⁴¹ and SqueezeNet⁴² with Squeeze and Excitation (SE) block⁴³ and DenseNet-201⁴⁴ based on initial experiments. The reason for selecting SE-Inception-V3 is its depth and ability to capture complex image patterns. It uses a combination of convolutional layers with different kernel sizes, allowing it to capture features at various scales. SE-SqueezeNet is chosen for its lightweight architecture. It achieves good accuracy with significantly fewer parameters compared to traditional architectures. DenseNet-201 is chosen for its densely connected layers, facilitating feature reuse across layers. This architecture has shown to be very effective in learning intricate patterns from data, making it suitable for tasks where detailed feature extraction is crucial.

Proposed ensemble approach for final classification

Ensemble techniques are one of the most powerful techniques for decision-making that fuse several base classifiers’ confidence scores to arrive at the final results. It improves the entire model’s performance and removes the bias introduced by the base models. Recently, there has been an increase in the integration of ensemble learning with DL-based architecture^20,27,28 like simple averaging, majority voting, and weighted average. However, these approaches have disadvantages like giving equal weight to each classifier irrespective of the performance, assigning a higher weight to classifiers with higher accuracy, and ignoring a group of classifiers working together⁴⁵. In this study, we develop a rank-based ensemble approach utilizing the Gamma function to overcome these restrictions and solve the classification problem. This technique has the advantage of using adaptive weights, which prioritize the scores of the individual base classifiers while constructing the ensemble to produce the final confidence score (prediction) for each input sample.

Here, we provide an overview of the fuzzy Gamma-based ensemble techniques used to estimate confidence scores from the base learners. The Gamma function is a mathematical equation introduced by Leonard Euler⁴⁶ and is defined as an extension of a factorial function for non-integral numbers. The Gamma function for a positive integer n and a complex number with a positive real part is defined as Eqs. (1) and (2), respectively.

$$\begin{aligned} \Gamma (n)&= (n-1)! \end{aligned}$$

(1)

$$\begin{aligned} \Gamma (\alpha )&= \int _{0}^{\infty } t^{\alpha -1}e^{-t} \,dt \end{aligned}$$

(2)

In this context, $\alpha $ is a parameter defined for all complex numbers except non-positive integers, and t is a variable representing the values over which the integral is performed. We calculate the rank of input images belonging to class k of the $i^{th}$ classifier’s confidence score (CS) as defined in Eq. 3.

$$\begin{aligned} R^i_k= \Gamma (CS^i_k), \forall i, k; i = 1,2,...,M; k = 1,2,...,K \end{aligned}$$

(3)

We have utilized three CNN models, so M = 3 and k = 3 or 4, depending on the number of classes. We calculate the fuzzy rank sum (FRS) and the complement of confidence factor sum (CCFS) for class k using Eqs. (4) and (5). We calculate the final decision score (FDS) for a class k using the below Eq. (6).

$$\begin{aligned} FRS_k= \sum _{i=1}^{M} R^i_k \end{aligned}$$

(4)

$$\begin{aligned} CCFS_k&= \frac{1}{M}\sum (1-CS_k)^i \end{aligned}$$

(5)

$$\begin{aligned} FDS_k&= FRS_k \times CCFS_k \end{aligned}$$

(6)

The final predicted class for an image I is determined by Eq. 7, by computing the minimum FDS values. To enhance comprehension of the ensemble method, a hypothetical example is illustrated in Fig. 2.

$$\begin{aligned} class(I)=argmin_k(FDS_k) \end{aligned}$$

(7)

Stage-III: infection segmentation and severity assessment

Identifying infection regions and assessing severity in real-time is paramount in clinical settings, offering vital insights into disease progression and guiding appropriate medical interventions. Upon categorizing the input image as COVID-19, it undergoes infection segmentation. Infection segmentation masks for COVID-19 images are only publicly available and utilized to train the COVID-19 infection segmentation models. We employed the UNet model⁴⁰ with six different backbone networks similar to those used in lung segmentation. The infected regions are determined by combining the output of the top-performing models, namely UNet-DenseNet-201, UNet-ResNet50, and UNet-VGG16, selected based on IoU and Dice scores. The severity of lung infections is determined by calculating the percentage of infected pixels in the lungs. We have employed modified RALE scoring and devised a scale from 0 to 3 based on the extent of infection^12,32. In the initial step, after lung segmentation, we identified the largest contour with OpenCV and overlaid it onto the original images. Subsequently, we segmented infection regions, detected contours using OpenCV, and superimposed them onto the lung mask contour. The infection percentage was then computed by dividing the pixels representing the overlap areas by the total lung mask pixels and multiplying by 100. For instance, if the infection involves less than 25% of the whole lung, it is classified as ”mild” (score 0). A score of 1 suggests involvement of less than 50% of the lung, indicating a ”moderate” severity. A score of 2 represents infection covering up to 75%, categorized as ”severe”. A score of 3 suggests extensive infection, from 76% to 100%, categorizing it as ”critical”. This scoring system provides a clinical grading for COVID-19 patients, enabling differentiation into distinct severity levels.

Results and analysis

The outcomes of lung segmentation are presented in Table 2, and it is evident that no single model exhibits superior performance. To mitigate information loss during segmentation, we merged the mask outputs from the top three models (shown in bold). Quantitative analysis demonstrates a substantial enhancement in the ensemble lung mask, with a 3.13% increase in IoU and a 1.75% in the Dice score compared to the best-performing Dense121-UNet. The Adam optimizer with an initial learning rate $=0.0001$ is utilized, and an early stopping strategy, limiting the training of each model to a maximum of 100 epochs with a batch size of 8.

To show the effectiveness of the classification models, we have performed several experiments: 3-class and 4-class classification (with and without lung segmentation). Table 3 represents the outcome of 3-class classification models. SE-Inception-V3 demonstrated a superior performance among the three models in the initial experiment conducted without lung segmentation and attained an accuracy of 97.08%, outperforming DenseNet-201 and SE-SqueezeNet, which achieved 96.48% and 95.70%, respectively. The proposed ensemble technique outperformed individual models, yielding an accuracy of 97.45%. In the next experiment, we used lung-segmented images instead of the whole CXR image to train the models. Following lung segmentation, all the models exhibited improvement as more relevant features were incorporated for classification. Additionally, the final ensemble model outperformed all the individual base models, demonstrating an accuracy of 98.15% and an F1-score of 97.89%. This outcome affirms the validity of Hypothesis 1. Sensitivity is a pivotal metric in disease classification, and the final ensemble model demonstrates a significant sensitivity level of 98.10%. Next, we performed four class classifications by adding the TB datasets with the existing three classes. Table 4 represents the outcome of 4-class classification models with and without segmentation. The outcomes indicate that initial segmentation has achieved 1.04% higher accuracy than without segmentation, which validates Hypothesis 2. We have utilized the Adam optimizer paired with categorical cross-entropy loss function and learning rate = 0.00001. The classification models were trained in mini-batches of 32 for a maximum of 100 epochs, employing an early stopping technique. Table 2 presents the outcome of infection segmentation models. UNet with VGG-16 encoder performs remarkably well, achieving a Dice score of 81.36%. Additionally, UNet with DenseNet-201 and ResNet-50 encoder produces competitive Dice scores of 80.44% and 81.07%, respectively. We combined the predicted masks from the top three individual networks and attained an outstanding Dice score of 96.75% and an IoU score of 93.71%. This significant enhancement validates the effectiveness of the ensemble approach. We have utilized the Adam optimizer with an initial learning rate of 0.0001 and trained the models in mini-batches of 8, continuing for a maximum of 100 epochs. All the above experiments are carried out on a local computer operating on Linux with 32GB of RAM and NVIDIA Quadro RTX 4000 GPU with 40GB of memory. Lung segmentation models required about 4 hours of training for each model, while classification models, including DenseNet 201 and SE-Inception-V3, took approximately 3 hours. SE-SqueezeNet models, on the other hand, required around 4 hours for training. Infection segmentation models were trained in approximately 2 hours.

Table 2 Lung Segmentation and Infection Segmentation results.

Full size table

Table 3 Experiment 1: 3-class classification.

Full size table

Table 4 Experiment 2: 4-class classification.

Full size table

Infection visualization and severity assessment

As mentioned, our proposed architecture utilizes the modified RALE score to assign a severity level according to the extent of infection in lung regions. These levels include mild, moderate, severe, and critical, as illustrated in Fig. 3. This approach aids medical professionals in prioritizing COVID-19-infected patients and devising effective treatment plans, particularly in resource-constrained environments. In Fig. 3, the first image represents a COVID-19 CXR image with a 6.42% lung infection, leading to classification as ”mildly infected.” Such patients may not require immediate admission. Conversely, the second image depicts a patient with an 85.39% infection, categorized as a ’critical patient’ necessitating urgent admission for treatment. This method significantly enhances clinical decision-making by accurately classifying COVID-19 patients from other respiratory diseases while also providing insight into the disease’s severity level. For a more comprehensive overview, please refer to the Supplementary File regarding Hypothesis 3.

Comparative analysis

We rigorously assessed the proposed model by comparing its performance against recent state-of-the-art methods mentioned in the literature survey. To ensure an equitable comparison, we exclusively considered models for which detailed implementations are available in the publications. We have trained these models following the specifications mentioned in their respective papers with our datasets. Table 5 presents the proposed model’s results with six baseline models. Baseline-1 fine-tuned pre-trained CNNs and applied the Sugeno fuzzy ensemble to combine the outcomes. However, this method manually calculates fuzzy measure values, which is time-consuming. Baseline-2 enhanced the previous work by introducing a novel way of calculating fuzzy measure values from validation accuracy. Baseline-3 employed the Choquet fuzzy ensemble to combine the CNN outcomes. Baseline-4 adopted the weighted average ensemble technique to combine decisions from pre-trained classifiers, which require careful weight selection and assume independence among models, making them sensitive, complex, and potentially limited in handling diverse or non-numeric predictions. Baseline-5 utilized a vision transformer as a base model with federated training, and the authors in Baseline-6 utilized CNN models with one added LSTM layer with the Sugeno fuzzy ensemble. Both methods attain an accuracy of 97.14% and 97.42%, respectively, on our dataset. Baseline-7 utilized a rank-based ensemble utilizing two non-linear functions and reported an 98.05% accuracy on a three class classification problem. Contrastingly, our proposed model achieved a substantially higher accuracy of 98.15% for three-class classification and 98.13% for four-class classification, surpassing existing baseline models. The distinct advantage of our proposed gamma-based ensemble lies in its reduced reliance on individual model choices and employs a sophisticated decision-making process.

Table 5 Comparative analysis of our method with the existing state-of-the-art methods in the literature.

Full size table

Additional experiment on unseen dataset

The assessment of an end-to-end model must be generalizable to different datasets with different patient groups. Developing a reliable and clinically effective model is crucial for any robust model development. In this study, we evaluated the proposed architecture using a different test set comprising diverse patient groups, sourced from the widely-used QaTa-COV19 dataset³⁹, which includes 2913 CXR samples of COVID-19 cases. The proposed architecture achieves 97.89% accuracy with an F1-score of 97.48%. The model’s outcomes remain consistent despite variations in the dataset because it concentrates explicitly on pertinent lung regions. This indicates the model’s ability to perform effectively on new and diverse datasets, even with limited training data.

Discussion

Similar findings of common respiratory diseases in CXR images make it challenging and error-prone for medical professionals. This research presents a comprehensive, multi-stage architecture to support better clinical decision-making for patient outcomes and alleviate hospital staff workloads. The proposed architecture consists of a three-stage process involving a three-stage process encompassing lung segmentation, disease classification, infection localization, and severity assessment. In order to reduce information loss during lung segmentation, we tested different UNet variants and combined the output masks from the three models that performed the best. Three pre-trained CNN models are utilized for classification, followed by the proposed ensemble model that generates the fuzzy ranks of the classifiers using the gamma function. By using the gamma function, the proposed method is able to more effectively combine the predictions of different models, leading to improved accuracy and performance compared to other ensemble methods. Moreover, when validated with an extra dataset, the suggested technique proves its reliability, remaining resilient against the diverse nature of different datasets. This robustness stems from the model’s focus on pertinent lung regions during decision-making. During the last phase, the model detects infected regions, assigns a score, and determines the corresponding infection level whenever the input CXR image is COVID-19. We calculate the score function by modifying the RALE scoring method. The optimistic outcomes of the suggested approach can aid radiologists in understanding the disease progression and devising more effective treatment strategies in the early stages. In conclusion, while the outcomes of our study are promising, there are a few challenges. Firstly, the complexity of the ensemble model is higher than that of the standalone architectures, leading to a higher computational complexity. Further research on adopting lightweight pre-trained models in the ensemble process could mitigate the computational complexity. Second, the reliance on additional training data for pre-trained models poses a limitation, prompting further investigation into the model’s generalizability in few-shot dataset settings. Lastly, while we focused on three prevalent lung diseases in our proposed work, extending the research to encompass other common respiratory diseases is essential for practical diagnostic applicability.

Data availability

The source code can be accessed at https://github.com/Pranabiitp/Boosting-Interpretability-and-Generalization-of-Respiratory-Disease-from-CXR-image. The curated COVID-19 dataset is available at our github https://github.com/Pranabiitp/COVID-19-CXR-Dataset. All the other datasets are publicly available.

References

Rahman, T. et al. Reliable tuberculosis detection using chest x-ray with deep learning, segmentation and visualization. IEEE Access 8, 191586–191601 (2020).
Article Google Scholar
Varshni, D., Thakral, K., Agarwal, L., Nijhawan, R. & Mittal, A. Pneumonia detection using cnn based feature extraction. In 2019 IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT) (ed. Varshni, D.) 1–7 (IEEE, 2019).
Google Scholar
Wang, L., Lin, Z. Q. & Wong, A. Covid-net: A tailored deep convolutional neural network design for detection of covid-19 cases from chest x-ray images. Sci. Rep. 10, 1–12 (2020).
CAS Google Scholar
Deng, J. et al. Imagenet: A large-scale hierarchical image database. In 2009 IEEE Conference on Computer Vision and Pattern Recognition (ed. Deng, J.) 248–255 (IEEE, 2009).
Chapter Google Scholar
Rajpurkar, P. et al. Chexnet: Radiologist-level pneumonia detection on chest x-rays with deep learning. Preprint at arXiv:1711.05225 (2017).
Sahoo, P., Saha, S., Mondal, S. & Gowda, S. Vision transformer based covid-19 detection using chest ct-scan images. In 2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI) (ed. Sahoo, P.) 01–04 (IEEE, 2022).
Google Scholar
Sahoo, P., Saha, S., Mondal, S., Chowdhury, S. & Gowda, S. Vision transformer-based federated learning for covid-19 detection using chest x-ray. In International Conference on Neural Information Processing (ed. Sahoo, P.) 77–88 (Springer, 2022).
Google Scholar
Wang, X. et al. Chestx-ray8: Hospital-scale chest x-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases. In: Proc. IEEE conference on computer vision and pattern recognition, 2097–2106 (2017).
Li, Z. et al. Thoracic disease identification and localization with limited supervision. In: Proc. IEEE conference on computer vision and pattern recognition, 8290–8299 (2018).
Wang, S.-H., Satapathy, S. C., Zhou, Q., Zhang, X. & Zhang, Y.-D. Secondary pulmonary tuberculosis identification via pseudo-zernike moment and deep stacked sparse autoencoder. J. Grid Comput. 20, 1–16 (2022).
Article PubMed Google Scholar
Horry, M. J. et al. Covid-19 detection through transfer learning using multimodal imaging data. IEEE Access 8, 149808–149824 (2020).
Article PubMed Google Scholar
Sahoo, P., Saha, S., Sharma, S. K., Mondal, S. & Gowda, S. A multi-stage framework for covid-19 detection and severity assessment from chest radiography images using advanced fuzzy ensemble technique. Expert Syst. Appl. 121724 (2023).
Afshar, P. et al. Covid-caps: A capsule network-based framework for identification of covid-19 cases from x-ray images. Pattern Recogn. Lett. 138, 638–643 (2020).
Article ADS Google Scholar
Xu, Y., Lam, H.-K. & Jia, G. Manet: A two-stage deep learning method for classification of covid-19 from chest x-ray images. Neurocomputing 443, 96–105 (2021).
Article PubMed PubMed Central Google Scholar
Nishio, M. et al. Deep learning model for the automatic classification of covid-19 pneumonia, non-covid-19 pneumonia, and the healthy: A multi-center retrospective study. Sci. Rep. 12, 8214 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Sharma, A. & Mishra, P. K. Covid-manet: Multi-task attention network for explainable diagnosis and severity assessment of covid-19 from cxr images. Pattern Recogn. 108826 (2022).
Tahir, A. M. et al. Covid-19 infection localization and severity grading from chest x-ray images. Comput. Biol. Med. 139, 105002 (2021).
Article CAS PubMed PubMed Central Google Scholar
Jaeger, S. et al. Two public chest x-ray datasets for computer-aided screening of pulmonary diseases. Quant. Imaging Med. Surg. 4, 475 (2014).
ADS PubMed PubMed Central Google Scholar
Win, K. Y., Maneerat, N., Sreng, S. & Hamamoto, K. Ensemble deep learning for the detection of covid-19 in unbalanced chest x-ray dataset. Appl. Sci. 11, 10528 (2021).
Article CAS Google Scholar
Das, A. K. et al. Automatic covid-19 detection from x-ray images using ensemble learning with convolutional neural network. Pattern Anal. Appl. 1–14 (2021).
Gifani, P., Shalbaf, A. & Vafaeezadeh, M. Automated detection of covid-19 using ensemble of transfer learning with deep convolutional neural network based on ct scans. Int. J. Comput. Assist. Radiol. Surg. 16, 115–123 (2021).
Article PubMed Google Scholar
Chandra, T. B., Verma, K., Singh, B. K., Jain, D. & Netam, S. S. Coronavirus disease (covid-19) detection in chest x-ray images using majority voting based classifier ensemble. Expert Syst. Appl. 165, 113909 (2021).
Article PubMed Google Scholar
Kaleem, S., Sohail, A., Tariq, M. U., Babar, M. & Qureshi, B. Ensemble learning for multi-class covid-19 detection from big data. PLoS ONE 18, e0292587 (2023).
Article CAS PubMed PubMed Central Google Scholar
Sugeno, M. & Murofushi, T. Pseudo-additive measures and integrals. J. Math. Anal. Appl. 122, 197–222 (1987).
Article MathSciNet Google Scholar
Sahoo, P., Saha, S., Mondal, S., Chowdhury, S. & Gowda, S. Computer-aided covid-19 screening from chest ct-scan using a fuzzy ensemble-based technique. In 2022 International Joint Conference on Neural Networks (IJCNN) (ed. Sahoo, P.) 1–8 (IEEE, 2022).
Google Scholar
Majumdar, S., Pramanik, P. & Sarkar, R. Gamma function based ensemble of cnn models for breast cancer detection in histopathology images. Expert Syst. Appl. 213, 119022 (2023).
Article Google Scholar
Sahoo, P., Saha, S., Mondal, S. & Sharma, N. Covid-19 detection from lung ultrasound images using a fuzzy ensemble-based transfer learning technique. In 2022 26th International Conference on Pattern Recognition (ICPR) (ed. Sahoo, P.) 5170–5176 (IEEE, 2022).
Chapter Google Scholar
Kundu, R., Singh, P. K., Mirjalili, S. & Sarkar, R. Covid-19 detection from lung ct-scans using a fuzzy integral-based cnn ensemble. Comput. Biol. Med. 138, 104895 (2021).
Article CAS PubMed PubMed Central Google Scholar
Dey, S., Bhattacharya, R., Malakar, S., Mirjalili, S. & Sarkar, R. Choquet fuzzy integral-based classifier ensemble technique for covid-19 detection. Comput. Biol. Med. 135, 104585 (2021).
Article CAS PubMed PubMed Central Google Scholar
Borghesi, A. & Maroldi, R. Covid-19 outbreak in italy: Experimental chest x-ray scoring system for quantifying and monitoring disease progression. Radiol. Med. (Torino) 125, 509–513 (2020).
Article PubMed Google Scholar
Toussie, D. et al. Clinical and chest radiography features determine patient outcomes in young and middle-aged adults with covid-19. Radiology 297, E197 (2020).
Article PubMed Google Scholar
Warren, M. A. et al. Severity scoring of lung oedema on the chest radiograph is associated with clinical outcomes in ards. Thorax 73, 840–846 (2018).
Article PubMed Google Scholar
Mercaldo, F., Belfiore, M. P., Reginelli, A., Brunese, L. & Santone, A. Coronavirus covid-19 detection by means of explainable deep learning. Sci. Rep. 13, 462 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Cohen, J. P. et al. Predicting covid-19 pneumonia severity on chest x-ray with deep learning. Cureus 12 (2020).
Park, D. et al. Development and validation of a hybrid deep learning-machine learning approach for severity assessment of covid-19 and other pneumonias. Sci. Rep. 13, 13420 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Mooney, P. Chest x-ray images (pneumonia). kaggle, Marzo (2018).
Zhao, A. et al. Covidx cxr-2: Chest x-ray images for the detection of covid-19 (2021).
Shi, F. et al. Large-scale screening of covid-19 from community acquired pneumonia using infection size-aware classification (2020). Preprint at arXiv:2003.09860 (2003).
Degerli, A. et al. Covid-19 infection map generation and detection from chest x-ray images. Health Inf. Sci. Syst. 9, 1–16 (2021).
Article Google Scholar
Ronneberger, O., Fischer, P. & Brox, T. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention (ed. Ronneberger, O.) 234–241 (Springer, 2015).
Google Scholar
Szegedy, C., Ioffe, S., Vanhoucke, V. & Alemi, A. A. Inception-v4, inception-resnet and the impact of residual connections on learning. In Thirty-first AAAI conference on artificial intelligence (2017).
Iandola, F. N. et al. Squeezenet: Alexnet-level accuracy with 50x fewer parameters and< 0.5 mb model size. Preprint at arXiv:1602.07360 (2016).
Hu, J., Shen, L. & Sun, G. Squeeze-and-excitation networks. In: Proc. IEEE conference on computer vision and pattern recognition, 7132–7141 (2018).
Huang, G., Liu, Z., Van Der Maaten, L. & Weinberger, K. Q. Densely connected convolutional networks. In: Proc. IEEE conference on computer vision and pattern recognition, 4700–4708 (2017).
Ganaie, M. A., Hu, M., Malik, A., Tanveer, M. & Suganthan, P. Ensemble deep learning: A review. Eng. Appl. Artif. Intell. 115, 105151 (2022).
Article Google Scholar
Davis, P. J. Leonhard euler’s integral: A historical profile of the gamma function: In memoriam: Milton abramowitz. Am. Math. Mon. 66, 849–869 (1959).
Google Scholar
Dey, S., Bhattacharya, R., Malakar, S., Schwenker, F. & Sarkar, R. Covidconvlstm: A fuzzy ensemble model for covid-19 detection from chest x-rays. Expert Syst. Appl. 206, 117812 (2022).
Article PubMed PubMed Central Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science & Engineering, Indian Institute of Technology Patna, Patna, 801106, India
Pranab Sahoo, Sriparna Saha & Samrat Mondal
Maharaja Surajmal Institute of Technology, Delhi, India
Saksham Kumar Sharma
Mount Sinai Hospital and Icahn School of Medicine, New York, USA
Deepak Jain

Authors

Pranab Sahoo
View author publications
You can also search for this author in PubMed Google Scholar
Saksham Kumar Sharma
View author publications
You can also search for this author in PubMed Google Scholar
Sriparna Saha
View author publications
You can also search for this author in PubMed Google Scholar
Deepak Jain
View author publications
You can also search for this author in PubMed Google Scholar
Samrat Mondal
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.S. conceived the idea, P.S. and S.K.S. conducted the experiment(s), P.S., S.S. analyzed the results. P.S. wrote the manuscript with valuable input from S.S. and S.M. Data curation by D.J. All authors reviewed the manuscript.

Corresponding author

Correspondence to Pranab Sahoo.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sahoo, P., Sharma, S.K., Saha, S. et al. A multistage framework for respiratory disease detection and assessing severity in chest X-ray images. Sci Rep 14, 12380 (2024). https://doi.org/10.1038/s41598-024-60861-6

Download citation

Received: 29 April 2023
Accepted: 29 April 2024
Published: 29 May 2024
DOI: https://doi.org/10.1038/s41598-024-60861-6
Springer Nature Limited

A multistage framework for respiratory disease detection and assessing severity in chest X-ray images

Abstract

Similar content being viewed by others

WE-Net: An Ensemble Deep Learning Model for Covid-19 Detection in Chest X-ray Images Using Segmentation and Classification

High-dimensional multinomial multiclass severity scoring of COVID-19 pneumonia using CT radiomics features and machine learning algorithms

Detection of Covid-19 in Chest X-Ray Images Using Percolation Features and Hermite Polynomial Classification

Explore related subjects

Introduction

Related work

Contributions

Problem formalization

Dataset and methods

Dataset statistics

Methods

Hypothesis 1

Hypothesis 2

Hypothesis 3

Stage-I: segmentation

Stage-II: classification models

Proposed ensemble approach for final classification

Stage-III: infection segmentation and severity assessment

Results and analysis

Infection visualization and severity assessment

Comparative analysis

Additional experiment on unseen dataset

Discussion

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation