1 Introduction

Coronavirus Disease 2019 (COVID-19) is highly contagious and has rapidly spread globally, infecting almost all countries with millions of positive cases and more than 0.4 million deaths [1], and the numbers continue to rise; see Table 1 and Figs. 1 and 2. Early testing and diagnosis is the key to limiting this pandemic. However, due to its pandemic nature, quick collection and testing of samples from suspected patients is a challenging issue for clinical management. Early detection is possible with Nucleic Acid Amplification Tests (NAAT), such as reverse transcription polymerase chain reaction (RT-PCR) [2], which must be interpreted by trained clinical laboratory personnel [3]. The initial symptoms of COVID-19 are fever, fatigue and dry cough, and it predominantly affects the lungs. The affected lobes, showing ground-glass changes and/or consolidations, can be recorded in chest radiology images [4, 5]. This is why clinicians worldwide are using chest X-rays (CXR) and computed tomography (CT) images as an alternative and fast method for the screening and diagnosis of COVID-19, especially since the RT-PCR method may take several hours to complete [6].

Due to the rise of this pandemic, thousands of people undergo CXR and CT scans daily for COVID-19 screening. This has overburdened radiologists, decreasing their productivity [7] in the proper detection of suspicious abnormalities [8]. In the case of contagious diseases, such a backlog of radiological studies cannot be afforded, whereas for chronic and slowly progressing diseases such studies may be delayed. For a detailed analysis of errors and discrepancies in radiological diagnosis, interested readers are referred to [9,10,11,12,13,14,15,16,17]. Artificial intelligence (AI) and computer vision systems play an important role in classifying the complex structures found in medical images [18,19,20,21] and can be used in computer-aided diagnosis tools. AI has therefore long been researched for radiological diagnosis and has proven quite successful in breast screening [22], diagnosis and quantification of emphysema severity [23], tuberculosis detection [24], and diagnosing idiopathic pulmonary fibrosis with accuracy similar to that of a human reader [25].

During the past year, several imaging-based COVID-19 diagnosis techniques backed by AI and machine learning have been presented, along with their correlation with RT-PCR [26, 27]. CT and CXR images are processed to detect pneumonia-like imaging features using AI techniques. In [28], CXR images and deep convolutional neural networks (CNNs) are used to diagnose COVID-19. The models used are ResNet 50, Inception V3 and a hybrid approach based on Inception-ResNetV2, claiming accuracies of \(98\%\), \(97\%\) and \(87\%\), respectively. In another approach [29], COVID-19 and bacterial and viral pneumonia are diagnosed and classified as negative or positive using X-ray radiographs. The approach makes use of GoogleNet as a deep transfer model, claiming \(100\%\) accuracy. In [30], the authors claim \(86.7\%\) accuracy in diagnosing the early signs of COVID-19 from chest CT images with deep learning. The authors in [31] propose a combination of two-dimensional (2D) and three-dimensional (3D) deep learning models and claim an accuracy of \(98.2\%\) and specificity of \(92.2\%\) with chest CT images. In [32], the authors combine an Inception CNN with the Marine Predators algorithm to select the most relevant features from COVID-19 X-ray images, achieving an accuracy of 98.7%. In another study, deep learning methods are used to extract COVID-19 graphical features and provide a clinical diagnosis well ahead of the pathogenic test, helping in timely control of the spread; the authors claim a rather constrained accuracy of \(85.2\%\) [10].

On the one hand, it is widely accepted that diagnoses based on CXR are not as reliable as those based on CT scans [11]; on the other hand, the accuracies reported in the literature for the former surprisingly exceed those for CT scans. Furthermore, these systems normally give a binary decision of either negative or positive and do not incorporate any qualitative analysis based on the recommendations of the Radiological Society of North America (RSNA) [33, 34]. Hence, it may be concluded that most of the available methods are either not sufficiently reliable or achieve a constrained diagnosis efficiency. Especially for contagious diseases such as COVID-19, there is still room for a thorough framework that addresses the aforementioned discrepancies.

Table 1 Covid-19 statistics: Retrieved on May 19, 2020 from www.worldometer.info
Fig. 1

Confirmed COVID-19 cases in 10 selected countries: Retrieved on May 19, 2020 from www.worldometer.info

Fig. 2

Confirmed deaths in 10 selected countries: Retrieved on May 19, 2020 from www.worldometer.info

In this research work, CT scan and CXR images are processed for the detection of radiological signs of COVID-19 using computer vision and AI techniques, and these images are classified as per the RSNA recommendations using a novel machine learning approach. The proposed technique is capable of minimizing inter-observer variability in image interpretation among radiologists, and hence the subjectivity arising from differences in experience and qualitative analysis. The system is also capable of picking up very subtle or early findings that can be missed by a radiologist. The solution is a combination of preprocessing stages, especially designed to extract the information using a set of selected feature extraction techniques. The main contributions of the proposed framework are summarized as follows:

  1. A novel entropy-based fitness optimizer function is implemented, which selects the chromosomes with maximum information. Only the chromosome with the maximum fitness value is retained, so that a sub-optimal solution is reached in the minimum number of iterations.

  2. To conserve maximum information and to obliterate redundant features at the initial level, a preliminary selection process is initiated on each feature set using the entropy-controlled fitness optimizer.

  3. To exploit the complementary strength of all features, a feature fusion approach is utilized which combines all the competing features to generate a resultant feature vector.

The rest of the manuscript is organized as follows: Sect. 2 presents the two imaging modalities commonly used for COVID-19 analysis. Datasets and their collection are given in Sect. 3, followed by a detailed description of the proposed framework in Sect. 4. The results and statistical analysis are presented in Sects. 5 and 6, respectively. We conclude the manuscript in Sect. 7.

2 CT and CXR imaging analysis for COVID-19

COVID-19 can be detected with Nucleic Acid Amplification Tests, such as RT-PCR, at a very early stage [3], and this remains the gold standard. Some studies have argued that imaging should be discouraged as a primary screening tool because it may suffer from selection bias (arising from inter-observer variability among radiologists), with claims that it is ten times less sensitive and less specific than RT-PCR [35]. This implies that imaging can be negative in the early stages of the disease, and imaging features can overlap with many other infectious and noninfectious disease processes. However, in China, chest CT has proven to have relatively higher sensitivity for COVID-19 than the initial RT-PCR from swab samples [27], possibly because CT images can reveal lung lesions even before RT-PCR turns positive [36, 37]. As RT-PCR takes more time, at least 6 h, imaging was a much faster and readily available screening tool during the surge of patients in the pandemic, especially where RT-PCR was not available; it has therefore played an important role in risk stratification and screening for COVID-19.

Chest radiographs are less sensitive than chest CT and can give a negative result in cases of early or mild infection, but they can be used as a first-line imaging modality [38]. On CXR, the findings may be airspace opacities or Ground Glass Opacities (GGO), mostly distributed bilaterally, peripherally and in the lower zones [38, 39]. In the early stages of the disease, CT images may be either negative or show GGO only, while at the progressive stage increased GGO and crazy paving can appear [35]. Representative CXR and CT scan images of COVID-19 are shown in Fig. 3. In Fig. 3a, the CXR shows bilateral ground-glass alveolar consolidation with peripheral distribution, which is very clear and can be seen easily, but this may not always be the case, specifically in the early stages of infection. In Fig. 3b, c, two CT images are given, showing airspace consolidation and GGO. The changes are much more clearly visible in Fig. 3c, while in Fig. 3b the GGO can be misinterpreted as motion blur. This is important to note, as it can affect the efficient implementation of an intelligent diagnosis system.

It is also important to note that many other conditions, such as influenza, organizing pneumonia and connective tissue disorders, can cause pneumonia-like changes on both chest radiographs and CT similar to those of COVID-19; their proper interpretation and differentiation from COVID-19 is therefore a challenging issue [40, 41]. To address such ambiguities, RSNA has published recommended statements on reporting CXR and CT findings related to COVID-19 [33, 34]. Following these recommendations is important to avoid misinterpretation.

Fig. 3

CXR and CT images of COVID-19 patients

3 Dataset collection

We collected pneumonia chest CT scans of 35 subjects diagnosed positive for COVID-19 from the Radiopaedia image database [2]. Links to some public domain case pages are also given for verification, as follows:

  1. Case 1—https://radiopaedia.org/cases/covid-19-pneumonia-2

  2. Case 3—https://radiopaedia.org/cases/covid-19-pneumonia-3

  3. Case 4—https://radiopaedia.org/cases/covid-19-pneumonia-4

  4. Case 5—https://radiopaedia.org/cases/covid-19-pneumonia-7

  5. Case 8—https://radiopaedia.org/cases/covid-19-pneumonia-8

  6. Case 15—https://radiopaedia.org/cases/covid-19-pneumonia-23

  7. Case 18—https://radiopaedia.org/cases/covid-19-pneumonia-10

  8. Case 20—https://radiopaedia.org/cases/covid-19-pneumonia-27

All patients tested positive for COVID-19 by RT-PCR and had COVID-19 pneumonia. The patients' clinical and detailed travel history is also provided on the given website. For the healthy scans, we approached [29], so that the model utilized is efficiently trained (Fig. 3a). This figure portrays healthy/normal scans in the top row, while the COVID-19 pneumonia scans are provided in the bottom row. To train the classifier, features extracted from a total of 3500 COVID-19 pneumonia and 2400 normal chest CT scans from the provided link are utilized, after resizing to \(512\times 512\) resolution.
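For concreteness, a minimal preprocessing sketch is given below. The folder layout and the use of OpenCV are illustrative assumptions; only the \(512\times 512\) target resolution comes from the text above.

```python
# Minimal preprocessing sketch (illustrative only): load a chest CT slice as a
# grayscale matrix and resize it to the 512x512 resolution used in this work.
# The file layout and the choice of OpenCV are assumptions, not part of the paper.
import glob
import cv2

def load_and_resize(path, size=(512, 512)):
    img = cv2.imread(path, cv2.IMREAD_GRAYSCALE)
    if img is None:
        raise FileNotFoundError(path)
    return cv2.resize(img, size, interpolation=cv2.INTER_AREA)

# Hypothetical usage: build the COVID-19 and normal pools from image folders.
# covid_scans  = [load_and_resize(p) for p in glob.glob("covid_ct/*.png")]
# normal_scans = [load_and_resize(p) for p in glob.glob("normal_ct/*.png")]
```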

4 Proposed methodology

Early machine learning methods utilize either single or hybrid approaches for feature extraction. Although both have advantages and drawbacks, a fused feature space generally has more capacity to retain discriminative features. Due to this flexibility, hybrid approaches have gained much popularity among researchers working in computer vision. However, selecting the most appropriate feature extraction techniques is quite a sensitive task that needs to be handled carefully; otherwise, it may result in feature redundancy and, therefore, increased correlation. In this work, we utilize four different techniques belonging to two categories, statistical and texture. Two feature families, color and shape, are not considered because of their limited impact and significance in this application. The proposed framework (Fig. 4) is discussed in the following subsections.

Fig. 4

Proposed framework for COVID-19 prediction using CT scans

4.1 Discrete wavelet transform features

The rationale behind selecting the discrete wavelet transform (DWT) for texture feature extraction is its invariance to translation, scaling and rotation. Further, in DWT, the contours can be recovered from the coarsest to the finest scale, enabling the formulation to handle noise effectively. The 2D wavelet decomposition for images is similar to the 1D decomposition, in which the 2D wavelet basis \(\Psi _{l,m}(t)\) and scaling basis \(\phi _{m}(t)\) are obtained by taking tensor products of 1D wavelets and scaling functions. For a 2D image, the DWT performs critical subsampling along both rows and columns, and the resulting subband information is utilized in the next decomposition level. The approach uses filter banks, described as:

$$\begin{aligned} \alpha _l(k)&= \sum _{u}h_0(u-2k)\alpha _{l+1}(u) \end{aligned}$$
(1)
$$\begin{aligned} \varsigma _l(k)&= \sum _{u}h_1(u-2k)\alpha _{l+1}(u) \end{aligned}$$
(2)

The lowpass or averaged coefficients \(\alpha _l(k)\) are produced by the half-band lowpass filter \(h_0\), whereas the highpass or detailed coefficients \(\varsigma _l(k)\) are produced by the half-band highpass filter \(h_1\). From the equations, it can be observed that filtering \(\alpha _{l+1}\) with \(h_0\) and \(h_1\), followed by decimation by a factor of 2, produces \(\alpha _l(k)\) and \(\varsigma _l(k)\).

To compute DWT coefficients for two levels, two-stage filter banks are required. At the initial scale, \(l+1\), \(\alpha _{l+1}\) is the original signal, which after one level of decomposition produces the highpass coefficients \(\varsigma _l\) and the lowpass coefficients \(\alpha _l\). A batch of COVID-19 chest CT scans is represented by \( \{x_1, x_2, x_3,\dots , x_n\} \subseteq {X(m,n)} \in {\mathbb {R}}^{m\times n}\), where m and n are the rows and columns, respectively. Initially, both filters \(H_0(\omega )\) and \(H_1(\omega )\) are applied along the rows of \(x_1(m,n)\) to generate a pair of images containing low and high frequencies. Afterward, the filtered images are sub-sampled by a factor of 2 and forwarded to the next series of filters along the columns. The decimation by a factor of 2 is again carried out after the column filtering. A single level of decomposition thus generates four subband images, \(\{LL, LH, HL, HH\}\), each of size \(\frac{M}{2}\times \frac{N}{2} \). The whole computation is performed to generate the set of features:

$$\begin{aligned} LL^l(u,v)&= \sum _{m}\sum _{n}~h_0(m-2u)~h_0(n-2v)~.~LL^{l+1}(m,n), \end{aligned}$$
(3)
$$\begin{aligned} HL^l(u,v)&= \sum _{m}\sum _{n}~h_0(m-2u)~h_1(n-2v)~.~LL^{l+1}(m,n), \end{aligned}$$
(4)
$$\begin{aligned} LH^l(u,v)&= \sum _{m}\sum _{n}~h_1(m-2u)~h_0(n-2v)~.~LL^{l+1}(m,n), \end{aligned}$$
(5)
$$\begin{aligned} HH^l(u,v)&= \sum _{m}\sum _{n}~h_1(m-2u)~h_1(n-2v)~.~LL^{l+1}(m,n), \end{aligned}$$
(6)

The letters L and H in the subband labels indicate lowpass and highpass filtering, respectively.
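To make the subband computation of Eqs. (3)-(6) concrete, the following is a minimal sketch of a two-level 2D DWT using PyWavelets. The Haar wavelet and the per-subband summary statistics are illustrative assumptions rather than the exact feature set used in the experiments.

```python
# Sketch of DWT texture features (illustrative): two-level 2D decomposition of a
# chest CT slice into {LL, LH, HL, HH} subbands, each summarized by simple
# statistics. The Haar wavelet and the chosen statistics are assumptions.
import numpy as np
import pywt

def dwt_features(image, wavelet="haar", levels=2):
    feats = []
    ll = np.asarray(image, dtype=np.float64)
    for _ in range(levels):
        # Row/column filtering with h0/h1 followed by decimation by 2 (Eqs. 3-6).
        ll, (lh, hl, hh) = pywt.dwt2(ll, wavelet)
        for band in (ll, lh, hl, hh):
            feats.extend([band.mean(), band.std(), np.abs(band).sum()])
    return np.array(feats)
```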

4.2 Extended segmentation-based fractal texture analysis (ESFTA)

As discussed earlier, texture features play a much more significant role in the recognition process compared to other sets of features, including shape and color. Therefore, in this work, we employ our existing work [42] to extract the texture features of COVID-19 chest CT scans. In this technique, fractal dimensions are computed from a stack of binary images. The technique works in two steps: (1) partitioning the image into a stack of binary images using pair threshold binary decomposition (PTBD), and (2) fractal analysis of each binary image based on boundaries, pixel count and mean gray level.

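The listing below is a simplified, SFTA-style sketch of the two-step procedure described above: threshold decomposition into a stack of binary images, followed by a box-counting fractal estimate, pixel count and mean gray level per binary image. Otsu multithresholding and plain box counting are assumptions standing in for the PTBD step of [42].

```python
# Simplified SFTA-style texture sketch (illustrative): threshold decomposition into
# binary images, then a box-counting fractal estimate of the region boundary, the
# pixel count and the mean gray level per binary image. Otsu multithresholding and
# plain box counting are assumptions standing in for the PTBD step of [42].
import numpy as np
from skimage.filters import threshold_multiotsu

def box_counting_dimension(binary, sizes=(2, 4, 8, 16, 32)):
    counts = []
    for s in sizes:
        h = (binary.shape[0] // s) * s
        w = (binary.shape[1] // s) * s
        blocks = binary[:h, :w].reshape(h // s, s, w // s, s)
        counts.append(blocks.any(axis=(1, 3)).sum())   # boxes containing foreground
    counts = np.maximum(counts, 1)
    slope, _ = np.polyfit(np.log(1.0 / np.array(sizes)), np.log(counts), 1)
    return slope

def esfta_features(image, classes=4):
    thresholds = threshold_multiotsu(image, classes=classes)
    lows = np.r_[image.min(), thresholds]
    highs = np.r_[thresholds, image.max()]
    feats = []
    for lo, hi in zip(lows, highs):
        binary = (image > lo) & (image <= hi)            # one slice of the binary stack
        boundary = binary ^ np.roll(binary, 1, axis=0)   # crude boundary map
        feats.extend([box_counting_dimension(boundary),
                      binary.sum(),                                        # pixel count
                      image[binary].mean() if binary.any() else 0.0])      # mean gray level
    return np.array(feats)
```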

4.3 Statistical features

Generally, image data can be modeled with a normal distribution, which describes how the values of a variable are spread and is characterized by two fundamental parameters, the mean and the standard deviation. Consider a chest CT scan observed under M modalities, with the assumption that all the images are spatially registered, i.e., the healthy images are alike and pixels correspond to the same anatomical location. From a pool of images \(x_i\), where \(i = \{1,2,\dots , n\} \in X\), each healthy image follows a Gaussian distribution \(\varPi (\lambda , \sigma )\). A noticeable change in this distribution is observed for an infected COVID-19 chest CT scan. Following this underlying concept, two statistical parameters, entropy and skewness, are selected based on their wide application in machine learning.

4.3.1 Skewness

Skewness measures the asymmetry of a probability distribution about its mean. The skewness value can be negative, zero or positive. If the value is negative, the distribution curve spreads out more to the left of the mean, whereas a positive value indicates that it leans toward the right. Skewness is computed using the relation:

$$\begin{aligned} \varsigma = \frac{E(x-\mu )^3}{\sigma ^3} \end{aligned}$$
(7)

where \(\mu \) and \(\sigma \) are the mean and standard deviation of a random variable x, and \(E(\cdot )\) denotes the expectation operator.

4.3.2 Entropy

Entropy quantifies the randomness in a signal by capturing the system's disorder. Due to this property, entropy offers, in the current perspective, useful information that can be utilized in the feature representation. In this framework, the Shannon entropy is utilized, which significantly improves the overall accuracy by embedding the most relevant feature information. Let the COVID-19 chest CT scans \(x_1, x_2, \dots , x_N \subseteq X(m,n) \in {\mathbb {R}}^{m\times n}\) contain N samples. Assuming the image space has measure \(\beta \) with \(\beta (X) = 1\), the Shannon entropy is calculated as:

$$\begin{aligned} {\mathbb {E}}(X) = -\sum _{k=1}^{N}x_k \log \zeta (x_k) \end{aligned}$$
(8)

where \(\zeta (x_k)\) is the observation probability for a particular pixel matrix/vector of X. This concept allows us to identify the most dominant pixels, i.e., those with larger variation and least correlation.
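As a concrete, standard reading of Eqs. (7) and (8), the sketch below computes the skewness of the pixel intensities and the Shannon entropy of the normalized gray-level histogram; the 256-bin histogram used to estimate \(\zeta (x_k)\) is an assumption.

```python
# Sketch of the statistical features (illustrative): skewness of pixel intensities
# (Eq. 7) and Shannon entropy of the normalized gray-level histogram (Eq. 8).
# The 256-bin histogram used to estimate the probabilities is an assumption.
import numpy as np
from scipy.stats import skew

def statistical_features(image):
    x = np.asarray(image, dtype=np.float64).ravel()
    skewness = skew(x)                    # E[(x - mu)^3] / sigma^3
    counts, _ = np.histogram(x, bins=256)
    p = counts / counts.sum()
    p = p[p > 0]                          # drop empty bins to avoid log(0)
    entropy = -np.sum(p * np.log(p))      # Shannon entropy
    return np.array([skewness, entropy])
```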

4.4 Feature selection framework

The genetic algorithm (GA) belongs to a class of stochastic search algorithms which, on the principle of survival of the fittest, finds a sub-optimal solution from a pool of solutions. In the GA framework, the population is formed by a set of chromosomes, where each chromosome constitutes a possible solution. In the proposed scenario, each extracted feature set is independently plugged into the GA block. The most discriminant chromosome/solution is later selected using the proposed entropy-based fitness optimizer.

Both texture and statistical features are used to generate two pairs of chromosomes, where each chromosome represents a feature type.

$$\begin{aligned} \varPhi ^k({\mathbb {X}}) =\left\{ \Psi _{WL}^1(X), \Psi _{\mathrm{ESFA}}^2(X), \varPhi _{\varsigma }^3(X), \varPhi _{{\mathbb {E}}}^4(X)\right\} ; k \in {\mathbb {Z}}, \end{aligned}$$
(9)

where \({\mathbb {Z}} = \{1, \dots , 4\}\) is the set of bounded integers indexing the feature chromosomes. The entire population is generated from each chromosome c, a continuous-valued vector with genes \(G_j\). The continuous domain offers more convergence possibilities and also reduces the probability of a generation getting stuck in a local minimum.

$$\begin{aligned} C_j^k = \left\{ G_1^k, G_2^k, G_3^k, \dots , G_m^k\right\} \end{aligned}$$
(10)

where k is the chromosome index in a population, and m is the length of a chromosome. In what follows, we present the genetic operators, including the proposed crossover, mutation and selection operators, for a thorough technical analysis.

4.5 Median-replacer crossover

A pair of chromosomes, \(C_j^1 = \{(G_1^1, G_2^1, G_3^1, \dots , G_m^1),(G_1^2, G_2^2, G_3^2, \dots , G_m^2)\}\), where \(j = \{1,2,\dots ,m\}\), is selected for the median-replacer crossover operation. Two offsprings, \(O_l^1\) and \(O_l^2\), are generated as \(O_l^1 =\lambda C_j^1+(1-\lambda )C_j^2\) and \(O_l^2 =\lambda C_j^2+(1-\lambda )C_j^1\). The median of each offspring is then replaced using the min/max value extracted from the parent chromosomes. A max value is encoded as 1 (\({\mathrm{max}}\rightarrow 1\)), whereas the min value is encoded as 0 (\({\mathrm{min}}\rightarrow 0\)). A binary random sequence is generated to select the min/max value from the first selected chromosome \(C_j^1\), and the same procedure applies to the second selected chromosome. Based on the generated binary random sequence, the median values of both offsprings are updated. An inversion mutation is applied on the selected number of chromosomes, whereas, for selection, both healthy and non-healthy parents are retained for the next generation on the basis of the entropy-based fitness optimizer [43].
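One possible reading of the median-replacer crossover is sketched below: arithmetic blending of two real-valued parents, after which the gene closest to each offspring's median is overwritten by the corresponding parent's max or min value according to a random binary choice. This is our interpretation of the description above, not a verified reimplementation.

```python
# Sketch of the median-replacer crossover (our interpretation, not a verified
# reimplementation): arithmetic blending of two real-valued parent chromosomes,
# then the gene closest to each offspring's median is replaced by the parent's
# max (bit 1) or min (bit 0) value, chosen by a random binary sequence.
import numpy as np

def median_replacer_crossover(c1, c2, lam=0.5, rng=None):
    rng = rng if rng is not None else np.random.default_rng()
    o1 = lam * c1 + (1.0 - lam) * c2
    o2 = lam * c2 + (1.0 - lam) * c1
    for off, parent in ((o1, c1), (o2, c2)):
        idx = np.argmin(np.abs(off - np.median(off)))    # position of the median gene
        off[idx] = parent.max() if rng.integers(2) else parent.min()
    return o1, o2
```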

4.6 Entropy-based fitness optimizer

To select the next-generation offsprings, the fitness of each chromosome needs to be evaluated. In this work, we develop a novel entropy-based fitness optimizer. The idea revolves around the fundamental property of feature randomness, as measured by the entropy function: the higher the entropy value, the greater the chance of a healthy chromosome. Here, the entropy calculator identifies the maximum randomness by controlling the uncertainty. For a real-valued chromosome vector \(C_j^1 = \{(G_1^1, G_2^1, G_3^1, \dots , G_m^1)\}\), the Shannon entropy is calculated using the relation:

$$\begin{aligned} {\mathbb {E}}_\varsigma = - \sum _{q=0}^{m-1}\left( G_q^0/ \Delta G\right) ~\log _2\beta \left( G_q^0/ \Delta G\right) \end{aligned}$$
(11)

where \(G_q^0\) denotes gene q of the selected chromosome and \(\Delta G\) normalizes the gene values.
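A minimal sketch of the entropy-based fitness of Eq. (11) follows; normalizing the genes by their sum (our reading of \(\Delta G\)) so that they behave as a discrete distribution is an assumption.

```python
# Sketch of the entropy-based fitness (Eq. 11): genes are normalized by their sum
# (our reading of Delta G) and the Shannon entropy of the resulting distribution is
# returned; the chromosome with the highest value is retained as the healthiest.
import numpy as np

def entropy_fitness(chromosome):
    g = np.abs(np.asarray(chromosome, dtype=np.float64))
    p = g / g.sum()                      # G_q / Delta G
    p = p[p > 0]                         # avoid log2(0)
    return -np.sum(p * np.log2(p))

def select_fittest(population):
    # population: iterable of real-valued chromosome vectors
    return max(population, key=entropy_fitness)
```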

4.7 Feature fusion

Feature fusion is a robust strategy pursued by several researchers in the field of machine learning. The original feature space, in most cases, does not contain sufficient information compared to the fused feature space. Therefore, in this work, we opt for a feature fusion strategy to generate a resultant feature vector with enriched information. All the down-sampled features from the GA block are fused following a cascaded design. The horizontally concatenated feature vector is then forwarded to the classification block for final labeling using the Naive Bayes classifier [44].
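To make the fusion and labeling step concrete, a minimal sketch using scikit-learn is given below; the Gaussian variant of Naive Bayes and the placeholder variable names are assumptions.

```python
# Sketch of the cascaded fusion and final labeling: the GA-selected feature vectors
# are horizontally concatenated and passed to a Naive Bayes classifier. The Gaussian
# variant of Naive Bayes and the placeholder variable names are assumptions.
import numpy as np
from sklearn.naive_bayes import GaussianNB

def fuse(feature_sets):
    # feature_sets: list of (n_samples, n_i) arrays, e.g. DWT, ESFTA, skewness, entropy
    return np.hstack(feature_sets)

# Hypothetical usage:
# X_train = fuse([dwt_sel, esfta_sel, stat_sel])
# clf = GaussianNB().fit(X_train, y_train)
# y_pred = clf.predict(fuse([dwt_test, esfta_test, stat_test]))
```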

5 Results and discussion

The proposed framework for COVID-19 pneumonia is evaluated in this section with both empirical and graphical results. For validation, 35 COVID-19-positive subjects are considered, with details provided in Sect. 3. A 70:30 training/testing split is followed, with 70% of the data used for training and the remainder for testing. To generalize the empirical results and to obtain reliable statistics, tenfold cross-validation is exercised. For the final classification, a Naive Bayes classifier is selected based on its improved performance. A fair comparison is also provided with existing state-of-the-art classifiers, including fine KNN (F-KNN) [45], linear support vector machine (L-SVM) [46], ensemble bagged tree (EBT) [47] and fine tree (F-Tree) [48]. To validate the proposed method, several performance measures deemed necessary are chosen, including sensitivity (SEN), precision (PR), specificity (SPE), area under the curve (AUC) and accuracy (ACC). The mathematical form of the aforementioned measures is provided in the following equations.

$$\begin{aligned} {\mathrm{SEN}}&= \frac{\delta _{\mathrm{TP}}}{\delta _{\mathrm{TP}} + \delta _{\mathrm{FN}}} \times 100\% \end{aligned}$$
(12)
$$\begin{aligned} {\mathrm{SPE}}&= \frac{\delta _{\mathrm{TN}}}{\delta _{\mathrm{TN}} + \delta _{\mathrm{FP}}} \times 100\% \end{aligned}$$
(13)
$$\begin{aligned} {\mathrm{PR}}&= \frac{\delta _{\mathrm{TP}}}{\delta _{\mathrm{TP}} + \delta _{\mathrm{FP}}} \times 100\% \end{aligned}$$
(14)
$$\begin{aligned} {\mathrm{ACC}}&= \frac{\delta _{\mathrm{TP}}+ \delta _{\mathrm{TN}}}{\delta _{\mathrm{TP}} + \delta _{\mathrm{TN}}+\delta _{\mathrm{FP}} + \delta _{\mathrm{FN}}} \times 100\% \end{aligned}$$
(15)

where \(\delta _{\mathrm{TP}}\) denotes the true positives, \(\delta _{\mathrm{TN}}\) the true negatives, \(\delta _{\mathrm{FP}}\) the false positives, and \(\delta _{\mathrm{FN}}\) the false negatives. A few sample results are demonstrated in Fig. 5, where one can observe the binary labeling: corona positive and normal.
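The sketch below evaluates Eqs. (12)-(15) directly from the confusion-matrix counts.

```python
# Direct evaluation of Eqs. (12)-(15) from the confusion-matrix counts.
def metrics(tp, tn, fp, fn):
    sen = 100.0 * tp / (tp + fn)                    # sensitivity (Eq. 12)
    spe = 100.0 * tn / (tn + fp)                    # specificity (Eq. 13)
    pr  = 100.0 * tp / (tp + fp)                    # precision   (Eq. 14)
    acc = 100.0 * (tp + tn) / (tp + tn + fp + fn)   # accuracy    (Eq. 15)
    return sen, spe, pr, acc
```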

Fig. 5

Proposed prediction results. a Original images; b proposed predicted labeled image

Table 2 Accuracy calculated using different feature extraction techniques

The results are compiled considering three different scenarios: (1) accuracy achieved using independent features, (2) accuracy achieved after employing GA for feature selection, and (3) accuracy achieved with the proposed feature selection and fusion method.

Fig. 6

Accuracy comparison of different feature extraction techniques using bar plots

Table 3 Accuracy comparison of different feature extraction techniques after applying GA-based feature selection

Case 1 considers the features extracted independently from each technique for the classification; the results are tabulated in Table 2. For a fair comparison, different classifiers are tested against each technique. The final results are as expected: the Naive Bayes classifier outperforms the other state-of-the-art classifiers with an average accuracy of 80.83%, while the second-best average accuracy is achieved with F-KNN (79.52%). The accuracy comparison of the different feature extraction techniques with the proposed method is provided in Fig. 6. It is apparent from the bar plot that the independent features are no match for the fused features in this application. One can also observe that, with DWT features, most of the classifiers perform well compared to other features. This clearly shows that DWT features would be an appropriate choice for this application, whether used alone or in fused form.

In case 2, the features selected by the GA are forwarded to the classifier for final labeling (Table 3). The same trend is followed after the feature selection step: the Naive Bayes classifier works exceptionally well for almost all kinds of features, achieving an average accuracy of \(85.55\%\), while F-KNN ranks second with an average accuracy of \(84\%\). Once again, with DWT features, the classification results are exceptional with almost all the classifiers. The average classification accuracy achieved using GA-DWT features by all the selected classifiers is 84.86%, compared to 83.85% using G-SFTA. Figure 7 demonstrates that the average accuracy achieved after GA selection increases compared to stand-alone features. The bars also indicate that the accuracy margin between the proposed method and GA selection alone is still considerable, which strengthens the positive significance of feature fusion.

Fig. 7

Accuracy comparison of different feature extraction techniques using bar plots after applying GA-based feature selection

Table 4 A comparison of state-of-the-art classifiers using proposed GA controlled feature selection and fusion method

Using the proposed framework, the accuracy achieved with the Naive Bayes classifier is \(92.6\%\), while the other top classifiers (EBT, L-SVM and F-KNN) also perform well, achieving average accuracies of \(92.2\%\), \(92.1\%\) and \(92.0\%\), respectively. The effectiveness of the proposed framework is further validated by the selected performance parameters, including sensitivity (\(92.5\%\)), specificity (\(92.0\%\)), precision (\(92.5\%\)) and AUC (0.99); see Table 4. The sensitivity and specificity values make it clear that the proposed framework achieves high true positive and true negative rates by correctly classifying the actual positive and actual negative samples. To further describe the performance of the classifier on the test data, a confusion matrix is provided in Table 5. From these statistics, one can see that, of the total test samples, \(93\%\) are correctly labeled as COVID-19 infected, whereas around \(7\%\) are misclassified as normal.

Table 5 Confusion matrix of Naive Bayes classifier after applying optimized GA for features selection

In addition, we compare the proposed method on different training/testing splits, such as 70:30, 60:40 and 90:10. The results are plotted in Fig. 8, which shows that the 90:10 split gives good results; however, if the standard validation process is considered, the 70:30 split results are more meaningful.

Fig. 8

Comparison of proposed results on different training and testing sets

6 Statistical significance

The objective of the statistical analysis is to gain a high level of confidence in the proposed method. The results are statistically significant if they are unlikely to be caused by chance. We employ analysis of variance (ANOVA) to determine whether the results are statistically significant. In this work, we consider the proposed scenario for three different classifiers (Naive Bayes, EBT, L-SVM), selected on the basis of their improved performance compared to the rest. A Shapiro–Wilk test is performed to check the normality assumption, while Bartlett's test is used for homogeneity of variance, with a significance level \(\alpha = 0.01\). The means \(\bar{x}_1\), \(\bar{x}_2\) and \(\bar{x}_3\) are calculated from the overall accuracies of the three classifiers. The null hypothesis \(H_0\) states that \(\bar{x}_1 = \bar{x}_2 = \bar{x}_3\), while the alternative hypothesis \(H_a\) states that at least one of the means differs. We compute the p value and test the null hypothesis \(H_0\); if it is rejected (\(p < \alpha \)), the Bonferroni post hoc test is applied.
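The testing pipeline described above can be reproduced with SciPy as sketched below; the per-fold accuracy arrays for the three classifiers are hypothetical placeholders.

```python
# Sketch of the significance-testing pipeline: Shapiro-Wilk for normality, Bartlett
# for homogeneity of variance, and one-way ANOVA across the three classifiers at
# alpha = 0.01. The per-fold accuracy arrays acc_nb, acc_ebt, acc_svm are placeholders.
from scipy import stats

ALPHA = 0.01

def significance(acc_nb, acc_ebt, acc_svm):
    groups = (acc_nb, acc_ebt, acc_svm)
    normal = all(stats.shapiro(g).pvalue > ALPHA for g in groups)
    equal_var = stats.bartlett(*groups).pvalue > ALPHA
    f_stat, p_anova = stats.f_oneway(*groups)
    # If p_anova < ALPHA, follow up with Bonferroni-corrected pairwise comparisons.
    return normal, equal_var, f_stat, p_anova
```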

Fig. 9
figure 9

Box-plot of accuracy values on the selected classifiers (1: SVM-C, and 2: SVM-Q)

For the proposed entropy-controlled GA method (E-GA) with the selected classifiers (Naive Bayes, EBT and L-SVM), the Shapiro–Wilk test generates p values \(p_u = 0.8002\), \(p_v = 0.9152\) and \(p_l = 0.6878\). Following Bartlett's test, the associated Chi-squared probabilities are \(p_u = 0.371\), \(p_v = 0.339\) and \(p_l = 0.410\). The calculated p values are significantly greater than \(\alpha \); therefore, from these tests (normality and equality of variances), we fail to reject the null hypothesis \(H_0\) and confirm that the data are distributed normally and their variances are homogeneous. The ANOVA test, including five different parameters (degrees of freedom (df), sum of squared deviations (SS), mean squared error (MSE), F-statistic and p value), is shown in Table 6. The performance range of the three selected classifiers based on the proposed method is shown in Fig. 9.

Table 6 ANOVA test on the selected classifiers based on the proposed method

The results are also validated with the Bonferroni post hoc test, which is the most common approach whenever there is a chance of a significant difference between the means of multiple distributions. The test confirmed that the proposed method performs better than several existing methods.

7 Conclusion

A computerized technique is proposed in this work for the prediction of COVID-19 from CT scans. Textural and statistical features are extracted from raw CT images, and only the best features are then selected using an optimized genetic algorithm. The selected features are serially concatenated and classified using the Naive Bayes classifier. The experimental process is performed on the collected COVID-19 positive and healthy samples and shows the proposed method to be effective. The main contribution of this work is an optimized genetic algorithm for best feature selection. Using this algorithm, the accuracy of each individual feature type is improved, and when all selected features are combined, a significant improvement in accuracy is observed. Based on the performance of this algorithm, we conclude that selecting the most relevant features improves the accuracy, but, on the other hand, there is a risk of missing important features that contribute to the improvement of the prediction accuracy. This problem may also arise when more patient data become available for final testing. Therefore, in future studies, we will focus on the reduction of these features.