A novel lightweight CNN for chest X-ray-based lung disease identification on heterogeneous embedded system

Sanida, Theodora; Dasygenis, Minas

doi:10.1007/s10489-024-05420-2


Open access
Published: 04 April 2024

Volume 54, pages 4756–4780, (2024)

Applied Intelligence


Abstract

The global spread of epidemic lung diseases, including COVID-19, underscores the need for efficient diagnostic methods. Addressing this, we developed and tested a computer-aided, lightweight Convolutional Neural Network (CNN) for rapid and accurate identification of lung diseases from 29,131 aggregated Chest X-ray (CXR) images representing seven disease categories. Employing the five-fold cross-validation method to ensure the robustness of our results, our CNN model, optimized for heterogeneous embedded devices, demonstrated superior diagnostic performance. It achieved a 98.56% accuracy, outperforming established networks like ResNet50, NASNetMobile, Xception, MobileNetV2, DenseNet121, and ViT-B/16 across precision, recall, F1-score, and AUC metrics. Notably, our model requires significantly less computational power and only 55 minutes of average training time per fold, making it highly suitable for resource-constrained environments. This study contributes to developing efficient, lightweight networks in medical image analysis, underscoring their potential to enhance point-of-care diagnostic processes.


1 Introduction

Epidemic lung diseases, such as lung opacity, viral pneumonia, fibrosis, bacterial pneumonia, COVID-19, and tuberculosis, have emerged as significant global health concerns, leading to substantial morbidity and mortality worldwide. Infected individuals experience various symptoms, including mild to moderate manifestations like fever, coughing, and shortness of breath. However, some individuals develop severe pulmonary conditions that can result in fatality [1, 2]. The outbreak of COVID-19 has been particularly alarming, with a substantial number of cases exhibiting intense chest congestion and a significant decline in oxygen levels, leading to severe cardiovascular complications [3,4,5]. Pneumonia, in turn, is a lung disease characterized by inflammation of the microscopic air sacs within the lungs. It can be caused by various factors, including viral infections (such as influenza, COVID-19, or respiratory syncytial virus), common cold viruses, and bacterial infections [6,7,8]. Rapid prognosis and appropriate treatment are critical for addressing these epidemic lung diseases and minimizing their impact on public health [9, 10].

Timely identification and accurate diagnosis of lung diseases are paramount in ensuring proper and effective treatment. Healthcare professionals play a vital role in this process as they are equipped to administer targeted therapies based on the specific underlying cause of the infection [11]. These therapies may include antibiotics for bacterial infections, antiviral medications for viral infections, or other interventions tailored to the particular disease. The primary objective of treatment is to resolve the infection, alleviate symptoms, and prevent complications that may arise from the condition [12]. Complications can range from respiratory failure and pneumonia to long-term lung damage. By promptly identifying lung disease and providing appropriate treatment, healthcare professionals can minimize the risk of these complications and significantly reduce the mortality rate associated with these conditions [13, 14].

Chest X-ray (CXR) imaging is a widely employed diagnostic tool that plays a critical role in evaluating the condition of the lungs and detecting abnormalities. It provides valuable insight into the presence and extent of various lung diseases and abnormalities, allowing clinicians to make informed decisions regarding patient care [15]. However, accurately distinguishing between different epidemic lung diseases based solely on visual inspection of CXR images can be difficult. It requires specialized expertise and in-depth knowledge of each disease’s radiological features and patterns [16]. Unfortunately, these features and patterns often overlap across diseases, making it hard to differentiate them accurately using visual examination alone. This can lead to diagnostic errors and delays in initiating appropriate treatments [17, 18].

Machine learning (ML) [19, 20], deep learning (DL) [21,22,23], convolutional neural networks (CNNs) [24] and artificial intelligence (AI) [25, 26] have made significant advancements in medical image analysis, including the analysis of complex medical images such as CXR. These techniques have shown great potential in extracting meaningful patterns and features from medical images that may not be readily apparent to human observers, leading healthcare professionals to more accurate and objective diagnoses and reducing diagnostic errors [27,28,29]. They can also assist healthcare professionals by highlighting regions of interest, providing quantitative measurements, and suggesting potential diagnoses based on learned patterns. Thus, using them in medical image analysis can improve the efficiency of the diagnostic process, allowing for faster interpretation and decision-making by healthcare professionals. It is vital to mention that these technologies are intended to augment the expertise of healthcare professionals rather than replace them. Combining human expertise and advanced algorithms can lead to more accurate diagnoses and better patient care in the context of epidemic lung diseases [30].

Today, there is a particular emphasis on point-of-care medicine, which involves providing medical care and diagnostic testing at or near the location where the patient is being treated [31, 32]. Portable radiographic machines or handheld devices can be used to acquire images, eliminating the need for patient transportation to centralized radiology departments. This saves time and reduces the burden on patients, especially those who may be critically ill or have limited mobility. This approach aims to deliver immediate healthcare services and faster access to imaging, as it reduces the time between image acquisition and interpretation [33, 34]. This is particularly valuable in critical situations or emergencies where a prompt diagnosis is crucial for making appropriate diagnostic decisions. Such a point-of-care approach can improve healthcare systems’ efficiency and is particularly beneficial in environments with limited resources and in remote or underserved areas [35, 36].

This research makes the following contributions to the field of medical imaging and disease diagnosis:

We merged multiple datasets to create a more diverse and comprehensive collection of CXR images, effectively forming a seven-category classification problem that enhances the robustness and generalizability of our approach.
We developed a lightweight Convolutional Neural Network (CNN) model suitable for embedded applications, making it particularly appealing in point-of-care environments. The model’s lightweight architecture ensures it can be efficiently utilized on resource-constrained systems such as embedded devices or portable diagnostic tools without compromising its diagnostic efficacy.
We address the prevalent issue of category imbalance in medical image identification and adopt the enhanced focal loss (FE) as the pivotal loss function during the training phase to mitigate the bias towards a particular category. The enhanced FE emphasizes difficult-to-categorize examples, reducing the bias towards well-represented categories. This approach improves the model’s efficacy in accurately identifying rare or underrepresented diseases.
We compared the performance of our model against well-known transfer learning architectures widely used in medical image analysis, such as ResNet50, NASNetMobile, Xception, MobileNetV2, DenseNet121, and ViT-B/16. We employed a five-fold cross-validation (CV) technique to ensure the robustness and reliability of our results. The results of our experiments demonstrated that our model outperforms existing methods in disease categorization from CXR images across critical metrics such as accuracy, precision, and recall, thereby underscoring its potential to significantly aid healthcare professionals in delivering timely and precise diagnoses and treatments.
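
To illustrate why a focal-style loss reduces the influence of easy, well-represented examples, the following is a minimal sketch. It uses the standard focal loss, -alpha * (1 - p_t)^gamma * log(p_t), as a stand-in, since the exact form of the enhanced variant is not given in this section. The values gamma = 2 and alpha = 1, the function names, and the two probability vectors are illustrative assumptions, not values taken from this work.

```python
import math


def cross_entropy(probs, target):
    """Categorical cross-entropy for one example: -log(p_t)."""
    return -math.log(probs[target])


def focal_loss(probs, target, gamma=2.0, alpha=1.0):
    """Standard focal loss for one example (assumed stand-in, not the
    enhanced variant): -alpha * (1 - p_t)**gamma * log(p_t)."""
    p_t = probs[target]
    return -alpha * (1.0 - p_t) ** gamma * math.log(p_t)


# Hypothetical softmax outputs over seven categories; the true category is 0.
easy = [0.90, 0.02, 0.02, 0.02, 0.02, 0.01, 0.01]  # confidently correct
hard = [0.20, 0.40, 0.10, 0.10, 0.10, 0.05, 0.05]  # mostly wrong

for name, probs in (("easy", easy), ("hard", hard)):
    ce = cross_entropy(probs, 0)
    fl = focal_loss(probs, 0)
    # The ratio fl / ce equals the modulating factor (1 - p_t)**gamma.
    print(f"{name}: CE={ce:.4f} focal={fl:.4f} ratio={fl / ce:.4f}")
```

With gamma = 2, the modulating factor is (1 - 0.9)^2 = 0.01 for the easy example and (1 - 0.2)^2 = 0.64 for the hard one, so the easy example's loss is scaled down far more strongly. Setting gamma = 0 recovers the plain categorical cross-entropy.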

The remainder of the paper is organized as follows: Section 2 examines related works, while details of the materials are presented in Section 3. Section 4 explains the methodology of the system design implementation, and Section 5 shows the experimental outcomes. In Section 6, we discuss the outcomes of our work. Finally, the article concludes with a proposal for future work in Section 7.
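
As a concrete illustration of the five-fold cross-validation protocol named among the contributions, the sketch below partitions a labelled collection into five folds. It assumes the split is stratified by category, which this section does not state. The helper name `stratified_k_fold`, the seed, and the toy label counts are hypothetical.

```python
import random
from collections import defaultdict


def stratified_k_fold(labels, k=5, seed=0):
    """Yield (train_idx, val_idx) pairs, keeping per-category
    proportions roughly equal in every fold."""
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)
        # Deal each category's samples round-robin across the k folds.
        for pos, idx in enumerate(indices):
            folds[pos % k].append(idx)

    for i in range(k):
        val_idx = sorted(folds[i])
        train_idx = sorted(
            idx for j, fold in enumerate(folds) if j != i for idx in fold
        )
        yield train_idx, val_idx


# Hypothetical imbalanced collection: three categories with 50, 20, and 10 images.
labels = [0] * 50 + [1] * 20 + [2] * 10
splits = list(stratified_k_fold(labels, k=5))
for fold_no, (train_idx, val_idx) in enumerate(splits, start=1):
    print(fold_no, len(train_idx), len(val_idx))
```

Each image serves as validation data in exactly one fold and as training data in the other four, so a metric averaged over the five folds reflects the whole collection rather than a single split.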

2 Related work

In recent years, the prognosis and diagnosis of lung diseases have been the subject of extensive research and development efforts by medical experts, researchers, and scientists worldwide. Over the years, medical imaging technologies, ML, AI, and CNN advancements have improved lung disease categorization accuracy, speed, and efficiency [37]. However, it is essential to note that research in this area continues to evolve, with ongoing efforts to develop even more sophisticated and practical diagnostic solutions.

The authors in [38] proposed a Conditional Generative Adversarial Network (cGAN) with a ResNet-50 fine-tuned deep transfer learning model designed to identify six distinct categories: COVID-Mild, COVID-Medium, COVID-Severe, normal, pneumonia, and tuberculosis. The dataset utilized in their experiments comprised 40 COVID-Mild, 42 COVID-Medium, 36 COVID-Severe, 348 normal, 500 pneumonia, and 263 tuberculosis images derived from CXR images. The authors trained their model for 30 epochs using the categorical cross-entropy (CE) loss function. The model reached an accuracy score of 93.67%. In [39], the authors suggested a custom CNN model named DWTMBConvNet. This model aims to detect five categories: normal, tuberculosis, COVID-19, bacterial pneumonia, and viral pneumonia. The dataset employed in their experiments comprised 3593 normal, 1837 tuberculosis, 2098 COVID-19, 2786 bacterial pneumonia, and 1505 viral pneumonia images derived from chest radiographs. The model was trained for 500 epochs with a categorical CE loss function and achieved an accuracy score of 95.50%.

In [40], the authors presented the FC-DenseNet103 model to identify five distinct categories: normal, bacterial pneumonia, viral pneumonia, COVID-19, and tuberculosis. The dataset employed in their investigations contained 191 normal, 54 bacterial pneumonia, 20 viral pneumonia, 180 COVID-19, and 57 tuberculosis images derived from CXR images. The authors trained their model for 100 epochs using the categorical CE loss function. As a result, the model attained an accuracy rate of 88.90%. Additionally, the authors proposed a ResNet-18 model to identify four distinct categories: normal, COVID-19, viral pneumonia, and tuberculosis. Specifically, the dataset contained 180 COVID-19 images, 191 normal images, 54 bacterial pneumonia images, and 57 tuberculosis images. The ResNet-18 model was trained for 100 epochs with a categorical CE loss function. The accuracy rate achieved by their trained model was reported to be 91.90%.

Wang et al. [41] introduced a custom CNN model named CoroDet designed explicitly to identify four categories: COVID-19, normal, viral pneumonia, and bacterial pneumonia. The dataset employed in their experiments comprised 500 COVID-19 images, 800 normal images, 400 bacterial pneumonia images, and 400 viral pneumonia images derived from chest radiographs. The authors trained their CoroDet model for 50 epochs, and it achieved an accuracy score of 91.20%. In [42], the authors presented a DCNN model called CoroNet based on the Xception model architecture. CoroNet was designed to identify four different categories: COVID-19, normal, viral pneumonia, and bacterial pneumonia. The model consisted of 33 million parameters. For their experiments, the authors utilized a dataset comprising a total of 284 COVID-19 images, 310 normal images, 330 bacterial pneumonia images, and 327 viral pneumonia images. The authors trained the CoroNet model for 80 epochs, and it reached an accuracy rate of 89.60%.

Karthik et al. [43] presented a custom CNN model to identify four distinct categories: COVID-19, normal, viral pneumonia, and bacterial pneumonia. The dataset used in their investigations contained a total of 558 COVID-19 images, 1583 normal images, 2780 bacterial pneumonia images, and 1493 viral pneumonia images derived from CXR images. The authors trained their custom CNN model for 70 epochs and reached an accuracy rate of 97.94%. In [44], the authors proposed a modified VGG19 model to classify images into four categories: COVID-19, normal, pneumonia, and lung cancer. Specifically, the dataset included 4320 COVID-19 images, 3500 normal images, 5856 pneumonia images, and 20,000 lung cancer images. The model was trained for 500 epochs using the categorical CE loss function. The accuracy rate reached by their trained model was reported to be 98.05%.

Ibrahim et al. [45] proposed a CNN model based on the pre-trained AlexNet model. The model was designed to identify four categories: COVID-19, normal, viral pneumonia, and bacterial pneumonia. The authors employed a dataset comprising 371 COVID-19 images, 2882 normal images, 4237 viral pneumonia images, and 4078 bacterial pneumonia images. The model was trained for 20 epochs using the categorical CE loss function and achieved an accuracy rate of 93.42%. Similarly, the authors in [46] suggested a CNN model named CoviXNet, developed to identify three categories: normal, COVID-19, and pneumonia. For their investigations, the authors employed a dataset comprising a total of 1281 COVID-19 images, 3270 normal images, and 1656 pneumonia images. The CoviXNet model achieved an accuracy rate of 96.61%. In [47], the authors proposed a Convolutional CapsNet model for multi-category identification (COVID-19, no-findings, and pneumonia). They used a dataset comprising 231 COVID-19 images, which were increased to 1050 through data augmentation, 1050 no-findings images, and 1050 pneumonia images. The model was trained for 50 epochs and achieved an accuracy of 84.22%.

Gupta et al. [48] evaluated four pre-trained CNN models (VGG-19, InceptionV3, MobileNetV2, and DenseNet). In addition to these individual models, the researchers developed four hybrid models by combining different CNN architectures: VID (VGG-19, Inception, and DenseNet), VMI (VGG-19, MobileNet, and Inception), VMD (VGG-19, MobileNet, and DenseNet), and IMD (Inception, MobileNet, and DenseNet). To evaluate the performance of these models, the researchers used a dataset consisting of 1500 images of pneumonia, 1500 images of COVID-19, and 1500 images of normal cases. Each CNN model was trained with the same hyperparameters for 10 epochs using the categorical CE loss function. The VMD hybrid model performed best among all the models considered, achieving an overall testing accuracy of 97.30%. In [49], the authors introduced a custom CNN model named DarkCovidNet for multi-category identification (COVID-19, no-findings, and pneumonia). They used a dataset comprising 127 COVID-19 images, 500 no-findings images, and 500 pneumonia images. The model was trained for 100 epochs and achieved an accuracy of 87.02%.

Table 1 A summary of state-of-the-art literature using CNN methods to identify lung diseases using CXR images
