Abstract
COVID-19, a highly infectious respiratory disease a used by SARS virus, has killed millions of people across many countries. To enhance quick and accurate diagnosis of COVID-19, chest X-ray (CXR) imaging methods were commonly utilized. Identifying the infection manually by radio imaging, on the other hand, was considered, extremely difficult due to the time commitment and significant risk of human error. Emerging artificial intelligence (AI) techniques promised exploration in the development of precise and as well as automated COVID-19 detection tools. Convolution neural networks (CNN), a well performing deep learning strategy tends to gain substantial favors among AI approaches for COVID-19 classification. The preprints and published studies to diagnose COVID-19 with CXR pictures using CNN and other deep learning methodologies are reviewed and critically assessed in this research. This study focused on the methodology, algorithms, and preprocessing techniques used in various deep learning architectures, as well as datasets and performance studies of several deep learning architectures used in prediction and diagnosis. Our research concludes with a list of future research directions in COVID-19 imaging categorization.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Corona virus disease (COVID-19) is a viral infection caused by the SARS-CoV-2 (Severe Acute Respiratory Syndrome Coronavirus 2) virus, which was initially discovered in Wuhan, China, in December 2019 and quickly spread around the world (Singhal 2020; He et al. 2020). As per the statistics given by World Health Organization, more than 500 million people have been infected worldwide, with roughly 6 million confirmed mortality cases (WHO) (WHO 2022). COVID-19 is classified as a respiratory disease as its symptoms includes high fever, myalgia, and sore throat with dry cough, headache, and chest pain. COVID-19 can spread to other people through minute liquid particles from an infected person's lips or nose when they cough, sneeze, speak, sing, or breathe. The majority of those infected with the virus will have mild to moderate respiratory symptoms and will recover without the need for medical attention. However, some people will become extremely unwell and need medical help. People can be immunized against the virus through vaccination, although they are still susceptible to infection. Furthermore, due to the vaccine's limited supply and the population's geographical distribution, vaccinating the whole global population is a time consuming process. Despite the quick development of vaccines, the disease has spread to over 200 nations and locations.
The most extensively utilized approach for diagnosing COVID-19 is reverse transcription-polymerase chain reaction (RT-PCR) (Corman et al. 2020). Sick people may transmit the virus to close contacts if insufficient resources are deployed to isolate positive patients from other suspected cases while waiting for SARS-CoV-2 coronavirus confirmation by RT-PCR. In clinical practice, CXR radiography and CT (Kassania et al. 2021) (or response to treatment) examined routinely by radiologists are used to detect COVID-19, describe its severity, and track its prognosis. Despite the fact that CT has higher detection sensitivity, chest X-ray radiography is more widely utilized in clinical practice due to its benefits, which include low cost, low radiation dose, ease of use, and widespread availability in general or community hospitals. Figure 1 shows some samples of CXR pictures of COVID and non-COVID instances as an example.
Diagnosing this disease manually takes time and is also prone to human error, and it importantly requires an assistance of skilled radiologists. As anomalies seen early in COVID-19 may resemble those seen in some other pulmonary syndromes such as SARS-CoV-2 or Viral Pneumonia (VP) as shown in Fig. 2, an expert radiologist is essential. Because of the disease's recent origins and similarities to other respiratory disorders like pneumonia, effective interpretation of results through images presents a number of difficulties. CXR images are indicated for a variety of pulmonary disorders; therefore, any automated system created to detect COVID-19 should also examine other respiratory illnesses in order to provide a more comprehensive and robust diagnostic system. COVID-19, the disease caused by the novel coronavirus, can infect the lungs and cause pneumonia. Pneumonia is a potentially fatal lung illness. It can sometimes be dangerous in certain people, particularly the elderly and those with respiratory illnesses. Figure 2 represent various types of Pneumonia.
Machine learning (ML) is becoming more popular in medical imaging applications such computer-aided diagnosis (CAD) (Nemoto et al. 2016), radiomics (Leger et al. 2017), and medical image analysis. One of the key advantages of ML in medical imaging is its ability to automatically extract relevant features and patterns from large datasets, which can help improve diagnostic accuracy and reduce the time and cost of analysis. For example, in CAD systems, ML algorithms can be trained on large datasets of medical images and associated clinical data to identify patterns and predict the presence of disease or abnormalities. Similarly, in radiomics, ML techniques can be used to extract quantitative features from medical images, such as texture, shape, and intensity, and use these features to develop predictive models for disease diagnosis, prognosis, and treatment response. Finally, ML algorithms are also being applied to medical image analysis tasks such as segmentation, registration, and classification, which can help clinicians more accurately, identify and quantify the location and extent of disease within medical images. Overall, the use of ML in medical imaging holds great promise for improving diagnostic accuracy, reducing healthcare costs, and advancing our understanding of disease.
Deep learning is a subset of machine learning that uses artificial neural networks with multiple layers to learn complex representations of data. DL has gained prominence in various industries, including the field of medical diagnosis. In the field of medical diagnosis, deep learning methodologies are used to drastically improve image processing (Litjens et al. 2017; Altaf et al. 2019). Some of the practical applications of DL include image registration with localization, recognition of skeletal and cellular structures and computer-aided disease prognosis in medical imaging and diagnosis (Shen et al. 2017; Chen et al. 2022). In DL, the CNN technique is commonly used for medical imaging (Shin et al. 2016; Anwar et al. 2018), and it comes in a variety of flavors.CNN also used to differentiate COVID image from others. CNN has been successfully applied to a variety of problems, including skin cancer detection (Estava et al. 2017), arrhythmia classification (Yoon et al. 2020), brain disease prediction (Wang et al. 2019), breast cancer finding (Talo et al. 2019), fundus image decomposition (Tan et al. 2017), pneumonitis with X-ray image detection (Rajpurkar et al. 2017), lung sectionalization (Gaál et al. 2020), and White Blood Cells Classification (Gothai et al. 2022). Each layer of CNN extracts COVID-19-related information from chest X-ray images, which can be used to differentiate the COVID and Normal images. Because of the automatic feature learning capabilities provided with CNN, COVID-19 classification based on deep neural networks is becoming more popular (Babu et al. 2022; Ganesh Babu et al. 2022; Kavinkumar and Meeradevi 2021).
The detection of COVID-19 from chest X-rays is crucial in controlling the spread of the pandemic, but accurately interpreting these images can be challenging due to the subtle and complex patterns associated with the virus. To address this issue, researchers have turned to optimization algorithms, which are mathematical methods used to find the best solution to a given problem. Optimization algorithms can be used to extract features from the images that are indicative of the virus, such as ground-glass opacities or consolidation. By optimizing these features, the algorithm's ability to identify COVID-19 in chest X-rays can be improved, leading to earlier diagnosis and treatment. The use of optimization algorithms in the detection of COVID-19 from chest X-rays has become increasingly important in the context of the pandemic, as chest X-rays are a common diagnostic tool for COVID-19. Overall, the use of optimization algorithms represents an important advancement in the fight against the pandemic, as it can improve the accuracy and efficiency of COVID-19 detection, ultimately saving lives (Devi and Maheswaran 2018; Rakkiannan et al. 2023).
The research contributions were explored by tabulating the available DL technologies, emphasizing the obstacles, and suggesting the necessary future investigations. This research looks at and analyzes preprints and published studies on COVID-19 diagnosis using CXR images that become available from 2020 (Batcha et al. 2023; Bennet et al. 2023; Kathamuthu et al. 2023). Among other research databases, the papers were found in Science Direct, Springer, Springer Nature, MDPI, Hindawi, and IEEE. We searched through the abstracts to see which studies utilized traditional machine learning techniques and which investigated deep learning for chest X-ray pictures. The study's primary contributions are as follows:
-
It provides an overview of the several common DL-based methodologies that have been used in related research;
-
It describes the commonly used COVID datasets that are available publicly;
-
It provided an overview of preprocessing techniques and image augmentation utilized in DL approaches.
-
It depicts a high-level overview of several optimization techniques for fine-tuning various hyper parameters.
-
It details the performance of the different DL models.
The following is a breakdown of the structure of this review study. Section 2 gives a comprehensive examination of DL strategies for COVID input image analysis, comprising preprocessing approaches, methodologies, and CXR dataset repository. Section 3 reviews the approach and compares the conduct of several deep learning modeling applications. Section 4 delves into the findings of this investigation. Finally, Section V brings the research to a close.
2 Literature survey
We examined 44 studies that used DL strategies to analyze CXR images containing the symptoms of SARS-CoV-2 viral infections. Major research works used datasets of CNN architectures trained on the ImageNet to implement transfer learning. Some of the work, however, went apart from using this and used unique designs. We provide a general description of the CNN approach for COVID-19 detection and datasets used were examined in this review in the subsections below.
2.1 CNN-based model for COVID-19 prediction
Preprocessing and classifying using pre-trained CNN architectures to categorize a given chest X-ray image was modeled. A thorough understanding of the problem, data obtained, and production environment is required to determine the best preprocessing and augmentation processes for improving model performance. The following sections go over each stage in detail. Figure 3 shows a COVID-19 diagnosis system based on DL, which includes the steps detailed further below.
2.1.1 Preprocessing
Preprocessing is an important step in enhancing image quality and consequently model performance. It entails resizing, normalizing, and, on occasion, grayscale conversion (Rahman et al. 2021). The steps taken to format images before they are used in model training and inference are referred to as image preprocessing. Images in training and testing would be subjected to this preprocessing phase. To clean picture data for model input, preprocessing is necessary. Convolutional neural networks, for example, demand that all images be the same size arrays. Model training time and model inference speed may both be reduced by image preprocessing. If the input images are very huge, shrinking them will cut model training time in half without sacrificing model performance. This section describes the procedures employed during the preprocessing stage in detail which is employed in our studies. Table 1 gives the different preprocessing techniques used in our studies.
Normalization and image resizing is a crucial step in CNN systems that preserve image stability while also improving model performance. If normalization is used, a CNN model learns faster and the gradient descent tends to be more stable. The process of normalization changes the range of pixel values (Islam et al. 2020). As a result, input image pixels in this study were normalized between 0 and 1 (Saha et al. 2021; Haque and Abdelgawad 2020). Image resizing is the process of scaling images. Image processing and machine learning applications both benefit from resizing. The majority of deep learning model architectures assume that all input images are the same size (Asnaoui and Chawki 2021; Showkat and Qureshi 2022; Jia et al. 2021). Almost a third of the people in our study resize the image to 224 × 224 pixels. Resizing allows you to reduce the amount of pixels in an image, which has a number of benefits such as it can reduce the amount of time it takes to train a neural network since the more pixels in an image; the more input nodes there are, increasing the model's complexity (Islam et al. 2020; Li et al. 2021).Also, By normalizing input features to a similar range, the model can become more stable, converge faster, and increase efficiency while reducing the impact of outliers (Chollet , 2018).
2.1.1.1 Augmentation technique
Image augmentation builds training images using a variety of processing techniques or a mix of techniques. Another image processing technique is histogram equalization, which uses the image intensity histogram to increase the global contrast of an image (Heidari et al. 2020; Mostafiz et al. 2020). Images can be flipped horizontally and vertically using flipping (Sousa et al. 2022). Vertical flips are not supported by all frameworks. A vertical flip, on the other hand, is the same as rotating an image 180 degrees and then flipping it horizontally. The image dimensions may not be kept after rotation. If your image is square, rotating it at right angles will keep the image size the same. If it's a rectangle, turning it 180 degrees will keep it the same size. The final image size will alter as the image is rotated at finer angles (Karakanis and Leontidis 2021; Aslan et al. 2022).Cropping is just taking a random piece of an image and cropping it. We then restore the original image size to this part. Random cropping is the common name for this method (Reshi et al. 2021).
Shear is an image distortion technique used to produce or correct perspective angles, typically to enhance images for computer vision applications as in Khan et al. (2022), Asnaoui and Chawki (2021). In addition to shear, various other techniques are used in image augmentation to increase the diversity of the training data, such as zoom augmentation, height shift range function, translation, and Gaussian filtering. Zoom augmentation randomly zooms in or adds pixels around the image to enlarge it as in Asnaoui and Chawki (2021), while height shift range function shifts each pixel vertically to the top or bottom at random (Sharifrazi et al. 2021). Translation involves moving the image in the X or Y direction (or both) during the augmentation process, which enables the convolutional neural network to search in all directions (as described in Abbas et al. 2021; Kiziloluk and Sert 2022). Finally, Gaussian filtering is often used to reduce noise and blur image regions, as seen in some studies (e.g., (Jain et al. 2020; Shankar and Perumal 2021)).
Data augmentation with Generative Adversarial Networks (GANs) has been used to improve CNN training by producing fresh data without any predetermined augmentation procedure (Rasheed et al. 2021).Conditional GANs can transform an image from one domain to an image to another domain. The sole disadvantage of this procedure is that the result is more artistic than practical (Karakanis and Leontidis 2021).Anisotropic diffusion is a technique for improving image quality. It is a nonlinear method that "extracts" the key visual information by removing noise and extraneous details while preserving the edges (Mostafiz et al. 2020).Image data augmentation is used to increase the size of the training dataset in order to improve the model's performance and generalization capacity (Hasan et al. 2021). The DARI method generates synthetic chest X-ray images using GAN and generic data augmentation techniques, which are then integrated with the remaining original radiograph images to create a strong training dataset (Sakib et al. 2020).
2.1.2 Traditional CNN architecture
Nowadays, CNN architectures are good in attaining expert-level performance as human in a variety of complicated visual tasks, such as medical image analysis and pathology detection. Since the first successful CNN in 1998, a plethora of CNN architectures have been proposed in the literature. It was widely used for the application (handwritten digit recognition) and was known as LeNet. It was developed by Yann LeCun. LeNet has three convolutional, two averages pooling, and two fully connected layers, making it a shallow design in comparison to current models. CNN is used for feature extraction. Figure 3 shows a schematic depiction of a typical CNN for COVID-19 prediction, which is detailed further below.
-
1.
Convolution layer Learnable filters (also called as kernels) are convolved with the input images to produce the convolutional layer. It executes an element-wise dot product and sum to create a number as a feature map element. Local connectivity, in which filter weights are doubled to a small area of the input picture at a time, and Weight Sharing, where the same filter weights are duplicated to every grid point of the input image, is two important aspects of convolution. The initial layer retrieves low-level features, whereas the subsequent layers recover high-level features.
-
2.
Pooling layer Along with standard convolutional layers, convolutional networks can have local and/or global pooling layers. By merging the outputs of neuron clusters from one layer into a single neuron in the next layer, pooling layers reduce the number of dimensions in data. Small clusters are combined using local pooling, which generally uses tiling sizes of 2 × 2. The feature map's neurons are affected by global pooling. Maximum and average pooling are the two most used types of pooling. The maximum value of each local cluster of neurons in the feature map is used for max pooling, while the average value is used for average pooling.
-
3.
Fully connected layer It is commonly used to perform classification tasks. This layer, also known as dense layers, take the output of the previous layer, which is a high-dimensional feature map, and convert it into a probability vector indicating the likelihood of the input belonging to each class. The number of neurons in the fully connected layers is typically set to match the number of classes in the dataset. During training, the network learns the optimal weights for each neuron through backpropagation, minimizing the loss function. Once trained, the CNN can classify new data by passing it through the network and determining the predicted class based on the highest probability value in the output vector.
2.1.3 Pre-trained CNN and methodologies
A pre-trained model is one that has been trained on a big benchmark dataset to solve a problem that is comparable to the one we're trying to solve. As a result of the high computational cost of training these models, it is usual practice to import and employ models from the literature. In such situations, the usage of pre-trained models based on the notion of transfer learning (TL) can be beneficial. In TL, the information gained by a DL model trained on a large dataset is applied to a task with a smaller dataset. This reduces the need for a big dataset and a longer learning period, which are both requirements of DL algorithms that are taught from scratch. For classification of COVID-19 from normal cases/Viral Pneumonia/Bacterial Pneumonia, several pre-trained models were utilized in this study: AlexNet, VGG-16, ResNet-101, MobileNet, Se-ResNeXt-50, Densenet-161, SqueezeNet, and Inception-V3. Table 2 depicts the numerous approaches used in our research.
AlexNet was one of the first convolutional networks to solve large-scale image classification problems, paving the way for deep learning applications. Its architecture comprised 8 layers, which included 5 convolutional layers and 3 fully connected layers. During operation, the input images were processed by the convolutional layers, which utilized pooling and filter sizes to extract image features, while the fully connected layers subsequently classified the images based on these extracted features. A notable advancement of AlexNet was the use of ReLU activation functions, which helped address the issue of vanishing gradients that had previously limited neural network depth. Abbas et al. (Abbas et al. 2021) presented the DeTraC system, which uses AlexNet for class decomposition to detect COVID-19 X-ray images from normal and severe respiratory disease cases. AlexNet was used by Shukla et al. (2021) to provide a framework for diagnosing COVID-19 patients using chest X-ray images.
Oxford University's Visual Geometry Group is known as VGG. This model is simple in design yet quite effective in terms of performance. The VGG16 and VGG19 architectures, respectively, include 16 and 19 convolutional layers. VGGNet contains a cascade of five convolutional blocks with fixed kernel sizes of 3 × 3, with the first two blocks each containing two convolutional operations and the last three blocks each containing three convolutional operations. It is important to note that a new convolution block, as well as process enhancement techniques (batch normalization and dropout), can be readily added to the conventional model, allowing for the learning of finer features and enhanced learning speed/stability. A few studies used the VGG16 pre-trained model for COVID-19 classification with CXR for three-class classification (Heidari et al. 2020; Hasan et al. 2021). Some research added or fine-tuned a few layers to the VGG16 pre-trained model for COVID-19 CXR image classification (2 classes (Shibly et al. 2020); 3 classes (Bayoudh et al. 2020; Das et al. 2021)). Asnaoui and Chawki (2021) also made use of VGG19 to classify the given CXR images.
ResNet is the most well-known pre-trained model for COVID-19 classification, and it has been frequently used. The result of each convolutional block is added to the output of the convolution blocks of the deeper stages of ResNet, which is made up of many residual blocks. With the help of X-ray imaging, Jain et al. (2020) were able to detect COVID-19 instances while distinguishing it from bacterial pneumonia, viral pneumonia, and healthy normal persons. Several researchers have utilized ResNet50 to detect COVID-19 using CXR images (Mostafiz et al. 2020; Das et al. 2021; Asnaoui and Chawki 2021). For COVID-19 picture categorization, the authors in Ref. Showkat and Qureshi (2022), Hossain et al. (2022) used customized versions of ResNet. Table 2 provides additional information on these sources.
In contrast to standard residual models, which use extended representations in the input and output, the MobileNetV2 architecture is built on an inverted residual structure, where the input and output of the residual block are thin bottleneck layers. MobileNetV2 filters features in the intermediate expansion layer with lightweight depth wise convolutions. MobileNetV2 was used in a few studies to classify CXR images into three categories: COVID, Other Pneumonia, Normal (Kiziloluk and Sert 2022; Asnaoui and Chawki 2021).To build Se-ResNeXt, a Squeeze and Excitation (SE) block was added to the ResNet. SE blocks allow a network to execute dynamic channel wise feature recalibration, which increases its representational capacity. Hira et al. (2021) used AlexNet, GoogleNet, ResNet-50, Se-ResNet-50, DenseNet121, Inception V4, ResNet V2, ResNeXt-50, and Se-ResNeXt-50 to examine the performance of COVID-19 classifications through CXR images. Se-ResNet-50 outperformed the others in this model.
DenseNet is a ResNet50 modification in which each layer receives additional input from all preceding levels instead of a single previous layer's skip connection. For concatenation, it sends its output to all of the following convolutional layers. As a result, each convolutional layer is said to receive "collective knowledge" from the ones before it. DenseNet has showed good performance in a few studies (Kiziloluk and Sert 2022; Ortiz et al. 2022; Alhudhaif et al. 2021). SqueezeNet is a convolutional neural network that utilizes design tactics to minimize the number of parameters, particularly through the use of fire modules, which "squeeze" parameters using 1 × 1 convolutions. SqueezeNet was used by Gupta et al. (2021b) to extract features during COVID-19 prediction using CXR images. To classify the provided Chest X-ray pictures into COVID-19 Pneumonia and Other Pneumonia classes, SqueezeNet was utilized as the training architecture (Alhudhaif et al. 2021).
For greater model adaption, the Inception V3 model uses numerous strategies to optimize the network. It has a more extensive network than the Inception V1 and V2 models, but its speed is unaffected. It is less computationally costly. As regularizers, it employs auxiliary classifiers. In Ref. Gupta et al. (2021b), the author used the Inception V3 model to extract features and classify input photos into COVID and non-COVID images. Other frameworks used in Covid-19 detection include Capsule Network-based frameworks like COVID-wideNet (Gupta et al. 2022) and COVID-CAPS (Afshar et al. 2020).
2.1.4 Classification task
COVID-19 prediction was achieved by categorizing given CXR pictures into two (binary) or multi classes, as illustrated in Fig. 3. Normal, bacterial, viral, and COVID-19 are among the designations seen in each class. COVID-19 label and non COVID label are the two labels that make up binary categorization. The three labels in the three-class prediction process are: (1) COVID-19, (2) Normal, and (3) pneumonia. The four sorts of labels in the 4 class prediction technique are: (1) COVID-19, (2) Normal, (3) bacterial, and (4) viral pneumonia. The vast majority of works forecast 2 or 3 classes. Table 3 shows the number of publications reviewed divided by the number of classification labels used. One of the review work (Jia et al. 2021), there are five classifications: "COVID-19," "Tuberculosis," "Viral Pneumonia," "Bacterial Pneumonia," and "Normal." This study will also assess whether the CXR image contains "Tuberculosis" disease.
2.1.5 Dataset details
Different datasets were utilized in the peer-reviewed studies. Table 4 provides a summary of these datasets. The reference No., dataset description/URL, and No. of images utilized for each work, listed in each row. Some of these files feature COVID-19 CXR images, while others comprise images of healthy people and people with various pulmonary illnesses. CXR images are frequently used as a first-line imaging technique for COVID-19 patients and studied in several COVID-19 diagnosis studies. This technique is quite inexpensive when compared to other medical imaging approach and poses a lower danger to human health because it is a low-radiation technique. Table 4 also summarizes the most recent and relevant studies in this subject, as well as the dataset’s quantitative parameters, such as the number of records.
3 Performance evaluation
Evaluation indicators are used to evaluate the overall pipeline's performance. For the experiment, the data is usually partitioned into training and testing sets. The training data is used to construct a specific model, while the adequacy of the training and the model is evaluated by simultaneously monitoring, overfitting and under fitting on the validation data. Finally, the generated model's performance is evaluated using previously unseen test data. The accuracy of the classification can be measured by dividing the number of accurately predicted images by the total number of predictions made. If there are two classes of images, for example, each class should have an equal amount of images; only then will the training accuracy work well. The Accuracy can be described as
The test's sensitivity (sometimes called the detection rate in a clinical setting) is the percentage of people who test positive for the condition among those who have it. This can be written as: The chance of a positive test, conditioned on it being actually positive, is known as sensitivity (True Positive Rate). This is expressed mathematically as
The probability of a negative test, conditioned on being actually negative, is known as specificity (True Negative Rate). The fraction of people who do not have the condition but test negative for it is known as test specificity. This can also be written mathematically as:
The precision can be calculated by dividing the individuals correctly identified by total number of correctly identified individuals and incorrectly labeled individuals.
The F-score can be used as a single indicator of positive class test performance. The harmonic mean of precision and recall is the F-score. F-Score is written in mathematical notation as:
If the precision and sensitivity values are high, the F1 score for the particular model is also high. If the precision and sensitivity values are low, the F1 score for that model is also low. The model has a medium F1 score if one of the precision and sensitivity values is low and the other is high. The F1 score solely reflects the model's performance.
It was unable to compare the studies included in this study due to discrepancies in the size of the testing sets and the lack of uniform performance evaluations, making the identification of the most efficient DL models for recognizing COVID-19 from CXR pictures even more difficult. Most writers evaluated the DL models using the accuracy, sensitivity, and specificity criteria. However, when non-standard metrics and data from several sources are used, comparing alternative techniques becomes more difficult. As a result, a public COVID-19 dataset that is both complete and accessible to researchers is necessary. Performance standards for prediction models must also be established. Table 5 illustrates the outcomes of the articles that were read in terms of classification metrics like accuracy, precisions, recall, sensitivity, specificity, and F1-score.
Accuracy is the parameter which measures the overall performance of a model. It is calculated as the percentage of the correctly classified data samples by the model. By integrating CNN with LSTM (Long Short Term Memory) as a classifier, Md. Zabirul Islam et al. (He et al. 2020) achieved accuracy of 99.40%. Using Adam Optimizer, Hira et al. (WHO 2022) achieves a 99.32% with Se-ResNeXt-50. Using Adam as an optimizer, Gupta et al. (Erickson et al. 2017) suggested InstaCovNet-19, with this author achieving accuracy of 99.08% for Multi-class and 99.53% for Binary Class. For classification, Reshi et al. (Altaf et al. 2019) employed deep CNN, which achieved a 99.5%. The Adam optimization technique is employed in this study. With the help of CNN and DWT optimized features, Mostafiz et al. (Shin et al. 2016) achieves a binary classification accuracy of 99.45%. Shukla et al. (Hira et al. 2021) created a framework called COVID-19 that uses a Multiobjective Genetic Algorithm and a Convolutional Neural Network to achieve 99.15%.
Modified Mobile Net and ResNet were used by Jia et al. (Rahman et al. 2021), with average accuracy of 99.6% and 99.3%, respectively. SqueezeNet + ReLu Activation function by Gupta et al. (Das et al. 2021) achieves 99.4%, while Inception V3 + Sigmoid achieve 99.5%. Transfer learning with fine-tuned deep CNN ResNet50 model for classification by Hossain et al. (Sakib et al. 2020) achieves accuracy of 99.95%. Adam Optimizer is used in the above model. Sharifrazi et al. (Gayathri et al. 2022) created a model that combines a convolutional neural network, a support vector machine, and a Sobel filter to reach a 99.02%.Fig. 4 gives the overview of various optimization technique used in the review work.
In this study, various optimization algorithms have been used in deep learning, including stochastic gradient descent optimizer (SGD) (Khan et al. 2022), adaptive learning rate optimization algorithm (Adam) (Heidari et al. 2020; Hira et al. 2021; Nour et al. 2020; Karakanis and Leontidis 2021; Gupta et al. 2021a; Khan et al. 2022; Reshi et al. 2021; Bayoudh et al. 2020; Sousa et al. 2022), Adagrad (Sakib et al. 2020), and RMSprop (Ortiz et al. 2022). These algorithms have been compared based on their convergence speed, ability to escape local minima, and robustness to hyper parameter settings. While SGD is a commonly used optimization algorithm, it can be slow and sensitive to the choice of learning rate. Adam is a popular choice due to its fast convergence and adaptive learning rate, but it may struggle with high-dimensional problems. Adagrad is effective at handling sparse data and can adapt to different learning rates, but it can converge too quickly and be less effective on non-convex problems. RMSprop is similar to Adagrad but addresses its fast convergence issue, making it a good choice for non-convex optimization problems. The choice of optimization algorithm depends on the problem being addressed and the resources available for training.
Additionally, the literature survey found that the batch sizes used in training ranged from 4 to 128, while the number of epochs ranged from 10 to 5000. Some models used convolution layers, max pooling layers, FC layers, and activation function layers, while others used LSTM and GRU layers. Overfitting is a common problem that authors encounter when training their models. To address this issue, authors typically use techniques such as early stopping, regularization, and data augmentation. Early stopping involves monitoring the loss function during training and stopping the training process once the validation error stops improving. Regularization techniques, such as L1 and L2 regularization (Kiziloluk and Sert 2022), penalize large weights and help prevent overfitting. Data augmentation involves generating additional training examples by applying various transformations to the original data. Alternative optimization algorithms have also been used in hyper parameter optimization for deep learning models, such as the Modified Competitive Swarm Optimizer (MCSO) (Jalali et al. 2022), the Salp Swarm Algorithm (Ahmadian et al. 2021), and the modified deer hunting optimization algorithm (M-DHOA) (Kuzhali and Pushpa 2022). The hyper parameters considered in these studies include convolution filter size, number of filters, number of convolutional layers, activation function type, dropout rate, max-pooling size, learning rate, momentum rate, optimizer type, number of epochs, and batch size. These optimization algorithms were found to be effective in improving model performance, demonstrating their potential in hyper parameter optimization for deep learning.
4 Conclusion
This article explores different deep learning (DL) techniques that can be utilized to identify COVID-19 through chest X-ray images, including the current state of research in this area. The paper explains pre-trained CNN models and various datasets used in prior studies. Although DL approaches show promise for automatic COVID-19 diagnosis, cooperation between medical experts and computer scientists is necessary to create more reliable and effective DL models. Despite the excellent outcomes, there is still a lot of room for improvement. In order to enhance model performance, it is crucial to establish datasets that are public, broad, diverse, validated and labeled by experts with related lung disease lesions. Combining sign detection with categorization output could improve both the forecast accuracy and model performance. Techniques such as cross-validation, data augmentation, and transfer learning have been identified to enhance the adaptability and generalization of DL models. It is also crucial to train COVID-19 detection models on a substantial amount of real-world data. Nevertheless, there have been limited studies on multi-class classification (i.e., specifically more than 3 classes), and future research can evaluate the effectiveness of proposed models for this issue. Moreover, incorporating optimization algorithms with DL models can lead to the development of more dependable models.
Data availability
The data used to support the findings of this study are available from the corresponding author upon request.
References
Abbas A, Abdelsamea MM, Gaber MM (2021) Classification of COVID-19 in chest X-ray images using DeTraC deep convolutional neural network. Appl Intell 51(2):854–864
Afshar P, Heidarian S, Naderkhani F, Oikonomou A, Plataniotis KN, Mohammadi A (2020) Covid-caps: a capsule network-based framework for identification of covid-19 cases from x-ray images. Pattern Recogn Lett 138:638–643
Ahmadian S, Jalali SMJ, Islam SMS, Khosravi A, Fazli E, Nahavandi S (2021) A novel deep neuroevolution-based image classification method to diagnose coronavirus disease (COVID-19). Comput Biol Med 139:104994
Ahsan MM, Alam TE, Trafalis T, Huebner P (2020) Deep MLP-CNN model using mixed-data to distinguish between COVID-19 and non-COVID-19 patients. Symmetry 12(9):1526
Alhudhaif A, Polat K, Karaman O (2021) Determination of COVID-19 pneumonia based on generalized convolutional neural network model from chest X-ray images. Expert Syst Appl 180:115141
Altaf F, Islam SM, Akhtar N, Janjua NK (2019) Going deep in medical image analysis: concepts, methods, challenges, and future directions. IEEE Access 7:99540–99572
Anwar SM, Majid M, Qayyum A, Awais M, Alnowami M, Khan MK (2018) Medical image analysis using convolutional neural networks: a review. J Med Syst 42(11):1–13
Aslan MF, Sabanci K, Durdu A, Unlersen MF (2022) COVID-19 diagnosis using state-of-the-art CNN architecture features and Bayesian optimization. Comput Biol Med 142:105244
Babu RG, Hemanand D, Kumar KK, Kanniyappan N, Vinotha V (2022) A survey of satellite images in fast learning method using CNN classification techniques. In: Proceedings of the international conference on cognitive and intelligent computing: ICCIC 2021, vol 1. Springer, Singapore, pp 277–284
Batcha BBC, Singaravelu R, Ramachandran M, Muthusamy S, Panchal H, Thangaraj K, Ravindaran A (2023) A novel security algorithm RPBB31 for securing the social media analyzed data using machine learning algorithms. Wirel Pers Commun 1–28
Bayoudh K, Hamdaoui F, Mtibaa A (2020) Hybrid-COVID: a novel hybrid 2D/3D CNN based on cross-domain adaptation approach for COVID-19 screening from chest X-ray images. Phys Eng Sci Med 43(4):1415–1431
Bennet MA, Mishra OP, Muthusamy S (2023) Modeling of upper limb and prediction of various yoga postures using artificial neural networks. In: 2023 international conference on sustainable computing and data communication systems (ICSCDS). IEEE, pp 503–508
Chen X, Wang X, Zhang K, Fung KM, Thai TC, Moore K, Mannel RS, Liu H, Zheng B, Qiu Y (2022) Recent advances and clinical applications of deep learning in medical image analysis. Med Image Anal 79:102444
Chollet F (2018) Deep learning with Python. Manning Publications, New York
Corman VM, Landt O, Kaiser M, Molenkamp R, Meijer A, Chu DK, Drosten C (2020) Detection of 2019 novel coronavirus (2019-nCoV) by real-time RT-PCR. Eurosurveillance 25(3):2000045
Das AK, Kalam S, Kumar C, Sinha D (2021) TLCoV-An automated Covid-19 screening model using transfer learning from chest X-ray images. Chaos Solitons Fractals 144:110713
De Sousa PM, Carneiro PC, Oliveira MM, Pereira GM, da Costa Junior CA, de Moura LV, Mattjie C, da Silva AMM, Patrocinio AC (2022) COVID-19 classification in X-ray chest images using a new convolutional neural network: CNN-COVID. Res Biomed Eng 38(1):87–97
Devi M, Maheswaran S (2018) An efficient method for brain tumor detection using texture features and SVM classifier in MR images. Asian Pac J Cancer Prev APJCP 19(10):2789
El Asnaoui K, Chawki Y (2021) Using X-ray images and deep learning for automated detection of coronavirus disease. J Biomol Struct Dyn 39(10):3615–3626
Erickson BJ, Korfiatis P, Akkus Z, Kline TL (2017) Machine learning for medical imaging. Radiographics 37(2):505–515
Estava A, Kuprel B, Novoa R, Ko J, Swetter SM, Blau HM, Thrun S (2017) Dermatologist level classification of skin cancer with deep neural networks. Nature 542(7639):115–118
Gaál G, Maga B, Lukács A (2020) Attention u-net based adversarial architectures for chest x-ray lung segmentation. arXiv preprint arXiv:2003.10304
Ganesh Babu R, Bino J, Kavin Kumar K, Prasanna P (2022) Analysis of efficient security using machine learning methods. In: ICCCE 2021: proceedings of the 4th international conference on communications and cyber physical engineering. Springer, Singapore, pp 305–310
Gayathri JL, Abraham B, Sujarani MS, Nair MS (2022) A computer-aided diagnosis system for the classification of COVID-19 and non-COVID-19 pneumonia on chest X-ray images by integrating CNN with sparse autoencoder and feed forward neural network. Comput Biol Med 141:105134
Gothai E, Natesan P, Rajalaxmi RR, Kumar NN, Prasad TN, Jhothith KG (2022) Image classification of white blood cells with canonical correlation. In: 2022 international conference on computer communication and informatics (ICCCI), pp. 01–08. https://doi.org/10.1109/ICCCI54379.2022.9740870
Gupta A, Gupta S, Katarya R (2021a) InstaCovNet-19: A deep learning classification model for the detection of COVID-19 patients using Chest X-ray. Appl Soft Comput 99:106859
Gupta A, Gupta R, Garg N (2021b) An efficient approach for classifying chest X-ray images using different embedder with different activation functions in CNN. J Interdiscip Math 24(2):285–297
Gupta PK, Siddiqui MK, Huang X, Morales-Menendez R, Pawar H, Terashima-Marin H, Wajid MS (2022) COVID-WideNet—a capsule network for COVID-19 detection. Appl Soft Comput 122:108780
Haque KF, Abdelgawad A (2020) A deep learning approach to detect COVID-19 patients from chest X-ray images. AI 1(3):418–435
Hasan MD, Ahmed S, Abdullah ZM, Monirujjaman Khan M, Anand D, Singh A, AlZain M, Masud M (2021) Deep learning approaches for detecting pneumonia in COVID-19 patients by analyzing chest X-ray images. Math Probl Eng 2021:1–8
He F, Deng Y, Li W (2020) Coronavirus disease 2019: What we know? J Med Virol 92(7):719–725
Heidari M, Mirniaharikandehei S, Khuzani AZ, Danala G, Qiu Y, Zheng B (2020) Improving the performance of CNN to predict the likelihood of COVID-19 using chest X-ray images with preprocessing algorithms. Int J Med Inform 144:104284
Hira S, Bai A, Hira S (2021) An automatic approach based on CNN architecture to detect Covid-19 disease from chest X-ray images. Appl Intell 51(5):2864–2889
Hossain MB, Iqbal SHS, Islam MM, Akhtar MN, Sarker IH (2022) Transfer learning with fine-tuned deep CNN ResNet50 model for classifying COVID-19 from chest X-ray images. Inform Med Unlocked 30:100916
Islam MZ, Islam MM, Asraf A (2020) A combined deep CNN-LSTM network for the detection of novel coronavirus (COVID-19) using X-ray images. Inform Med Unlocked 20:100412
Jain G, Mittal D, Thakur D, Mittal MK (2020) A deep learning approach to detect Covid-19 coronavirus with X-ray images. Biocybern Biomed Eng 40(4):1391–1405
Jalali SMJ, Ahmadian M, Ahmadian S, Hedjam R, Khosravi A, Nahavandi S (2022) X-ray image based COVID-19 detection using evolutionary deep learning approach. Expert Syst Appl 201:116942
Jia G, Lam HK, Xu Y (2021) Classification of COVID-19 chest X-Ray and CT images using a type of dynamic CNN modification method. Comput Biol Med 134:104425
Karakanis S, Leontidis G (2021) Lightweight deep learning models for detecting COVID-19 from chest X-ray images. Comput Biol Med 130:104181
Kassania SH, Kassanib PH, Wesolowskic MJ, Schneidera KA, Detersa R (2021) Automatic detection of coronavirus disease (COVID-19) in X-ray and CT images: a machine learning based approach. Biocybern Biomed Eng 41(3):867–879
Kathamuthu ND, Subramaniam S, Le QH, Muthusamy S, Panchal H, Sundararajan SCM, Alrubaie AJ, Zahra MMA (2023) A deep transfer learning-based convolution neural network model for COVID-19 detection using computed tomography scan images for medical applications. Adv Eng Softw 175:103317
Kavinkumar K, Meeradevi T (2021) Classification of tumor of MRI brain image using hybrid feature extraction method and support vector machine classifier. J Med Imaging Health Inform 11(10):2558–2565
Khan SH, Sohail A, Khan A, Lee YS (2022) COVID-19 detection in chest X-ray images using a new channel boosted CNN. Diagnostics 12(2):267
Kiziloluk S, Sert E (2022) COVID-CCD-Net: COVID-19 and colon cancer diagnosis system with optimized CNN hyperparameters using gradient-based optimizer. Med Biol Eng Comput 60:1–18
Leger S, Zwanenburg A, Pilz K, Lohaus F, Linge A, Zöphel K, Kotzerke J, Schreiber A, Tinhofer I, Budach V, Sak A, Stuschke M, Balermpas P, Rödel C, Ganswindt U, Belka C, Pigorsch S, Combs SE, Mönnich D, Zips D, Krause M, Baumann M, Troost EGC, Löck S, Richter C (2017) A comparative study of machine learning methods for time-to-event survival data for radiomics risk modelling. Sci Rep 7(1):1–11
Li X, Zhai M, Sun J (2021) DDCNNC: Dilated and depthwise separable convolutional neural network for diagnosis COVID-19 via chest X-ray images. Int J Cogn Comput Eng 2:71–82
Litjens G, Kooi T, Bejnordi BE, Setio AAA, Ciompi F, Ghafoorian M, van der Laak JAWM, van Ginneken B, Sánchez CI (2017) A survey on deep learning in medical image analysis. Med Image Anal 42:60–88
Mostafiz R, Uddin MS, Reza MM, Rahman MM (2020) Covid-19 detection in chest X-ray through random forest classifier using a hybridization of deep CNN and DWT optimized features. J King Saud Univ Comput Inf Sci 34:3226–3235
Nemoto M, Masutani Y, Nomura Y, Hanaoka S, Miki S, Yoshikawa T, Hayashi N, Ootomo K (2016) Machine learning for computer-aided diagnosis. Igaku Butsuri: Nihon Igaku Butsuri Gakkai Kikanshi Jpn J Med Phys 36(1):29–34
Nour M, Cömert Z, Polat K (2020) A novel medical diagnosis model for COVID-19 infection detection based on deep features and Bayesian optimization. Appl Soft Comput 97:106580
Ortiz S, Rojas F, Valenzuela O, Herrera LJ, Rojas I (2022) Determination of the severity and percentage of COVID-19 infection through a hierarchical deep learning system. J Pers Med 12(4):535
Ouchicha C, Ammor O, Meknassi M (2020) CVDNet: a novel deep learning architecture for detection of coronavirus (Covid-19) from chest x-ray images. Chaos Solitons Fractals 140:110245
Kuzhali SE, Pushpa MK (2022) Adaptive deep learning for deep COVID-19 diagnosis. J Eng Des Tech 32
Rahman MM, Nooruddin S, Hasan KM, Dey NK (2021) HOG+ CNN Net: diagnosing COVID-19 and pneumonia by deep neural network from chest X-Ray images. Sn Comput Sci 2(5):1–15
Rajpurkar P, Irvin J, Zhu K, Yang B, Mehta H, Duan T, Ding D, Bagul A, Langlotz C, Shpanskaya K, Lungren MP, Ng AY (2017) Chexnet: radiologist-level pneumonia detection on chest x-rays with deep learning. arXiv preprint arXiv:1711.05225
Rakkiannan T, Ekambaram G, Palanisamy N, Ramasamy RR, Muthusamy S, Loganathan AK, Panchal H, Thangaraj K, Ravindaran A (2023) An automated network slicing at edge with software defined networking and network function virtualization: a federated learning approach. Wirel Pers Commun 1–20
Rasheed J, Hameed AA, Djeddi C, Jamil A, Al-Turjman F (2021) A machine learning-based framework for diagnosis of COVID-19 from chest X-ray images. Interdiscip Sci Comput Life Sci 13(1):103–117
Reshi AA, Rustam F, Mehmood A, Alhossan A, Alrabiah Z, Ahmad A, Choi GS (2021) An efficient CNN model for COVID-19 disease detection based on X-ray image classification. Complexity 2021:1–12
Saha P, Sadi MS, Islam MM (2021) EMCNet: automated COVID-19 diagnosis from X-ray images using convolutional neural network and ensemble of machine learning classifiers. Inform Med Unlocked 22:100505
Sakib S, Tazrin T, Fouda MM, Fadlullah ZM, Guizani M (2020) DL-CRC: deep learning-based chest radiograph classification for COVID-19 detection: a novel approach. IEEE Access 8:171575–171589
Shah PM, Ullah F, Shah D, Gani A, Maple C, Wang Y, Shahid MA, Islam SU (2021) Deep GRU-CNN model for COVID-19 detection from chest X-rays data. IEEE Access 10:35094–35105
Shankar K, Perumal E (2021) A novel hand-crafted with deep learning features based fusion model for COVID-19 diagnosis and classification using chest X-ray images. Complex Intell Syst 7(3):1277–1293
Sharifrazi D, Alizadehsani R, Roshanzamir M, Joloudari JH, Shoeibi A, Jafari M, Hussain S, Sani ZA, Hasanzadeh F, Khozeimeh F, Khosravi A, Nahavandi S, Panahiazar M, Zare A, Islam SMS, Acharya UR (2021) Fusion of convolution neural network, support vector machine and Sobel filter for accurate detection of COVID-19 patients using X-ray images. Biomed Signal Process Control 68:102622
Shen D, Wu G, Suk HI (2017) Deep learning in medical image analysis. Annu Rev Biomed Eng 19:221–248
Shibly KH, Dey SK, Islam MTU, Rahman MM (2020) COVID faster R-CNN: a novel framework to diagnose novel coronavirus disease (COVID-19) in X-ray images. Inform Med Unlocked 20:100405
Shin HC, Roth HR, Gao M, Lu L, Xu Z, Nogues I, Yao J, Mollura D, Summers RM (2016) Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning. IEEE Trans Med Imaging 35(5):1285–1298
Showkat S, Qureshi S (2022) Efficacy of transfer learning-based ResNet models in chest X-ray image classification for detecting COVID-19 pneumonia. Chemom Intell Lab Syst 224:104534
Shukla PK, Sandhu JK, Ahirwar A, Ghai D, Maheshwary P, Shukla PK (2021) Multiobjective genetic algorithm and convolutional neural network based COVID-19 identification in chest X-ray images. Math Probl Eng 2021:1–9
Singhal T (2020) A review of coronavirus disease-2019 (COVID-19). Indian J Pediatr 87(4):281–286
Talo M, Yildirim O, Baloglu UB, Aydin G, Acharya UR (2019) Convolutional neural networks for multi-class brain disease detection using MRI images. Comput Med Imaging Graph 78:101673
Tan JH, Fujita H, Sivaprasad S, Bhandary SV, Rao AK, Chua KC, Acharya UR (2017) Automated segmentation of exudates, haemorrhages, microaneurysms using single convolutional neural network. Inf Sci 420:66–76
Wang Z, Li M, Wang H, Jiang H, Yao Y, Zhang H, Xin J (2019) Breast cancer detection using extreme learning machine based on feature fusion with CNN deep features. IEEE Access 7:105146–105158
WHO, Coronavirus disease (COVID-19) Dashboard-2022. https://covid19.who.int/table
Yoon SH, Lee KH, Kim JY, Lee YK, Ko H, Kim KH, Park CM, Kim YH (2020) Chest radiographic and CT findings of the 2019 novel coronavirus disease (COVID-19): analysis of nine patients treated in Korea. Korean J Radiol 21(4):494–500
Funding
There was no financial support received from any organization for carrying out this work.
Author information
Authors and Affiliations
Contributions
All authors contributed equally for the preparation of the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Consent to participate
I have been informed of the risks and benefits involved, and all my questions have been answered to my satisfaction. Furthermore, I have been assured that any future questions I may have will also be answered by a member of the research team. I voluntarily agree to take part in this study.
Consent to publish
Individuals may consent to participate in a study, but object to having their data published in a journal article.
Ethical approval
This material is the authors' own original work, which has not been previously published elsewhere. The paper is not currently being considered for publication elsewhere. The paper reflects the authors' own research and analysis in a truthful and complete manner.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Subramaniam, K., Palanisamy, N., Sinnaswamy, R.A. et al. A comprehensive review of analyzing the chest X-ray images to detect COVID-19 infections using deep learning techniques. Soft Comput 27, 14219–14240 (2023). https://doi.org/10.1007/s00500-023-08561-7
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-023-08561-7