A Competent Deep Learning Model to Detect COVID-19 Using Chest CT Images

Chakraborty, Somenath; Murali, Beddhu

doi:10.1007/978-3-031-37649-8_7

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 710))

Included in the following conference series:

Machine Intelligence and Digital Interaction Conference

1231 Accesses
1 Altmetric

Abstract

The ongoing terrifying wave of COVID-19 which is caused by the Severe Acute Respiratory Syndrome Coronavirus 2 or SARS-CoV-2 continues to affect human life and disrupt healthcare systems. Chest computed tomography (CT) is an effective clinical tool for estimating the patient’s severity levels and deciding appropriate treatment regimes. In this paper, we use a deep learning method for detecting COVID-19 using chest CT images with the more advanced balanced dataset. We used a dataset of 8054 real patient CT scans, of which 5427 had COVID-19 and 4223 were Non-COVID-19 patient images. Our model had an average detection accuracy of 91.96% on the test dataset. In conclusion, Automated Deep Learning (DL) methodologies allow for speedy evaluation of CT images to detect COVID-19.

You have full access to this open access chapter, Download conference paper PDF

Keywords

1 Introduction

The ubiquitous COVID-19 pandemic is the distinguishing tragedy of the present century thus far. To combat this battle, clinicians and radiologists work together and invented a couple of different diagnostic tools. COVID-19 reportedly started in Wuhan City of China in December 2019 [1,2,3]. According to the Centers for Disease Control and Prevention (CDC), the total number of cases and deaths as of 8th May 2022 is 517 million and 6.25 million, respectively worldwide. That is why this research has a significant value from the social perspective as it helps faster the processes and automate the tasks rapidly and build more confidence in the medical fraternity. The numbers continue to increase with the spread of the Omicron coronavirus variant (B.1.1.529). COVID-19 is a disease that initially affects the respiratory organs, like the lungs [1]. As it is a highly transmittable viral infection without a cure, people are dying all over the world and many severely suffering patients do not find hospital accommodation due to overloading with the existing COVID-19 patients. The Critical Care facility is also overloaded and the situation is overwhelming.

The diagnosis of COVID-19 is one of the crucial aspects of the fight against this disease. In the early days of the pandemic, the detection was very limited and error-prone due to a lack of understanding of the virus genome, but researchers came up with the Reverse Transcription Polymerase Chain Reaction (RT-PCR) [4] pathological method, which is still a gold standard for the detection of COVID-19. However, many challenges still remain in the COVID-19 detection research. The RT-PCR sensitivity rate is 60%–70%, which is very low. Rapid antigen testing is also a fast method to detect COVID-19 patients but due to the different types of variants coming up every now and then it is also not so effective as lots of True negatives coming in the test due to the mutation of the virus genome.

Recently, many automated machine learning approaches have been introduced in the literature for Covid-19 detection such as Chest CXR, Chest CT, etc.

In this paper, we present a very efficient deep learning model for analyzing CT images.

2 Literature Review

As the situation in early 2020 had already started deteriorating and new cases were growing exponentially, The World Health Organization (WHO), declared it a pandemic on 11th March 2020 [5]. WHO distinguishes the variants of coronavirus into two categories, one is a ‘variant of interest’ and the other one is a ‘variant of concern’. According to the transmissibility and detrimental effects, the variant of concern is more dangerous compared to the variant of interest.

He, X. et al. [6] present a deep learning diagnosis method using chest CT images. As their work was reported in early April 2020, their COVID-19 chest CT image dataset was limited. Zhang, K. et al. [7] proposed an AI System for diagnosis, quantitative measurements, and prognosis of COVID-19 pneumonia using computed tomography. As their research was published in early June of 2020 they used the idea of image slices to increase the size of sample data as only 752 COVID-19 positive patient data was included in their work. Gunraj, H. et al. [8] proposed a tailored deep convolutional neural network (CNN) using chest CT images. They generated a dataset of 104,009 chest CT slices across 1,489 patient cases. Harmon, S. A. et al. [9] present a deep neural network using a multinational dataset. M. Khushi et al. [10] and Maillo, Triguero and Herrera [11] show that having limited positive samples in a dataset could lead to data inconsistency and data imbalance issues. Panwar, H. et al. [12] proposes a grad-CAM-based color visualization in their deep learning approach using CT images.

3 Our Proposed Methodology

Deep Convolutional Neural Network (CNN) is one of the most widely used neural networks for image classification problems [13]. CNN in a deep learning setting has been widely applied in the literature, for example, [14,15,16,17,18]. With the availability of different deep learning networks such as ResNet18, Res-Net50 [19], AlexNet [20], DenseNet [21], VGG16 [22], EfficientNet [24], etc., one can design many automated machine learning models that can detect COVID-19 from CXR, CT images, or a combination of both. In our approach, we designed our own classification layer but leveraged the advantages of EfficientNet in multi-feature generation layers through the transfer learning technique. It helps to generate lower-level and higher-level features in a rapid way. Iterative multiple training helps to optimize parameters and hyperparameters with rapid succession due to this integrated deep multi-layer approach.

3.1 Dataset Description and Preprocessing

The data consisted of chest CT images divided into two categories. COVID-19 infected patients and non-COVID-19 patients. We used multiple sources for data acquisition [9, 12,13,14,15] as the opensource COVID-19 data increases over time. These images are labeled by licensed radiologists and freely available in open-source platforms like GITHUB repositories and Kaggle platforms. Once we obtained the CT images, our next step was to preprocess those images with proper annotation and labeling, including normalization and augmentation using machine learning models. We use Pytorch vison models [23], to flip, rotate and include those images to increase the size of the dataset, which basically helps the deep learning model to be more robust and accurate. The deep neural network models derives feature information in a very different way they human radiologist visually inspect the images. Initially, some sort of histogram equalizing and contrast-enhancing.

Our proposed model used a dataset of 8054 real patient CT scans, of which 5427 had COVID-19 and 4223 were Non-COVID-19 patient images. The dataset is further divided into train, validation, and test sets with the ratio of 7:2:1. Figure 1 shows the chest CT images after preprocessing. Our dataset is very well balanced thus does not incurred data imbalance issues. Data imbalance is a very common problem, especially happened in medical data and image analysis if the model used fewer number of positive sample cases than the negative sample cases.

3.2 Methodology

The Fig. 2. It shows the block diagram representation of our proposed model. EfficientNet [24], pretrained weightages helps to reduce the time of our training process as due to transfer learning technique of EfficientNet we used the hyperparameter values in our model.

After preprocessing the images we used EfficientNet [24], a pretrained Model, for the feature generation tasks and then we add a deep classification layer to classify the CT images into COVID-19 position images and COVID-19 negative images, i.e. Non-COVID-19 images. After execution of the feature generation tasks, the total number of parameters of our dataset is 18,830,011, out of which Trainable parameters are 1,281,395 and Non-trainable parameters are 17,548,616.

The architecture of EfficientNet baseline Neural Network is shown in Fig. 3.

EfficientNet is a very efficient high-performance convolutional neural network architecture and scaling method that uniformly scales all dimensions of depth, width and resolution using a compound coefficient. Where benchmark deep neural models arbitrarily scale these factors, the EfficientNet scaling method uniformly scales network width, depth, and resolution with a set of fixed scaling coefficients. The effectiveness of model scaling is based on the baseline network which is playing a key role in high performance. The baseline network builds up with a deep neural architecture search using the AutoML MNAS framework [25], which optimizes both accuracy and efficiency. After leveraging the potential of transfer learning by using the deep learning model EfficientNet in the feature generation architecture we design our deep learning classification architecture. The elements of the architecture is given in the Fig. 4. We use piecewise linear function rectified linear unit in our model as an activation function instead of Softmax or other activation function as we are interested in getting probabilistic value from 0 to 1.

The classification layers provide the percentage of the score for COVID-19 patients and Non-COVID-19 patients.

4 Results

The proposed deep learning model had an average detection accuracy of 91.96% on the test CT images. Sensitivity is 92.24, Specificity is 93.01 and F1 Score is 0.90. Figure 5 shows the Loss Curve. We used three fold cross validation technique and after the 20 epoch as the values converge to the optimum values so no further training of the model needed, We heuristically checked this is the optimum level of the loss in our experiment while running with many epoch heuristically to understand the optimum level. Hence, we only included the optimum value figures upto 20 epoch. From the graph, in Fig. 5a, it is evident the loss decreased over the epoch. Figure 5.b shows the accuracy curve. From the graph, it is evident that the accuracy improves over the epoch.

Figure 6a, b, c are the results coming from an unknown test sample and then matching their value for the predicted score obtained using our proposed model. It is clearly evident from the Fig. 6 that the predicted score clearly confirmed the type of the class with very high accuracy.

The following Table 1 presents the comparison of the proposed model with other existing models using our dataset with the model they described in their papers.

Table 1. Comparison Analysis with Existing Models

Full size table

5 Future Work and Conclusion

The Deep learning Model is not intended to replace the RT-PCR or other bio-chemist methods rather it would help to give extra confidence to the doctors to detect COVID-19 with more accuracy, hence helps to reduce wrong diagnosis. Many earlier models reported in the literature utilized a limited number of COVID-19 positive images due to the lack of availability of such data during the primary phases of the pandemic. In the present work, 8054 CT images were used with balanced positive and negative cases. The results show that this work which is based on EfficientNet is capable of detecting COVID-19 based on CT images with high accuracy. This adds to the growing body of evidence that deep learning techniques may be used in a clinical setting with confidence for detecting COVID-19. One of the recommendations of WHO is to increase testing for COVID-19. Usage of deep learning-based methods would bring down the cost and accelerate the diagnosis process which will enable rapid testing on a global scale. In our future work, we would investigate vision transformer models with this dataset. Transformer architecture performs very well in Natural Language Processing (NLP) tasks and is widely used in NLP domains. But recently many research works comes up with vision applications as well, though Deep Convolutional Neural Network is mostly preferred by most of the researchers in computer vision applications, especially in image processing domains, object detection domains, etc. We also like to extend our work on our classification layers with more variety of activation functions and loss functions.

References

Huang, C., et al.: Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 395(10223), 497–506 (2020)
Article Google Scholar
Wu, F., et al.: A new coronavirus associated with human respiratory disease in China. Nature 579(7798), 265–269 (2020)
Article Google Scholar
McIntosh, K.: Coronavirus Disease 2019 (COVID-19): Epidemiology, Virology, Clinical Features, Diagnosis, and Prevention (2020)
Google Scholar
Wang, W., et al.: Detection of SARS-cov-2 in different types of clinical specimens. JAMA 323(18), 1843–1844 (2020)
Google Scholar
World Health Organization: WHO Director-General’s Opening Remarks at the Media Briefing on COVID-19, 11 March 2020 (2020)
Google Scholar
He, X., et al.: Sample-efficient deep learning for COVID-19 diagnosis based on CT scans. Health Informatics (2020, Preprint). https://doi.org/10.1101/2020.04.13.20063941
Zhang, K., et al.: Clinically applicable AI system for accurate diagnosis, quantitative measurements, and prognosis of COVID-19 pneumonia using computed tomography. Cell 181, 1423–1433 (2020)
Article Google Scholar
Gunraj, H., Wang, L., Wong, A.: COVIDNet-CT: a tailored deep convolutional neural network design for detection of COVID-19 cases from chest CT images. Front. Med. (Lausanne) 7, 608525 (2020). https://doi.org/10.3389/fmed.2020.608525
Harmon, S.A., et al.: Artificial intelligence for the detection of COVID-19 pneumonia on chest CT using multinational datasets. Nat. Commun. 11, 1–7 (2020)
Article Google Scholar
Khushi, M., et al.: A comparative performance analysis of data resampling methods on imbalance medical data. IEEE Access 9, 109960–109975 (2021). https://doi.org/10.1109/ACCESS.2021.3102399
Article Google Scholar
Maillo, J., Triguero, I., Herrera, F.: Redundancy and complexity metrics for big data classification: towards smart data. IEEE Access 8, 87918–87928 (2020). https://doi.org/10.1109/ACCESS.2020.2991800
Article Google Scholar
Panwar, H., et al.: A deep learning and grad-CAM based color visualization approach for fast detection of COVID-19 cases using chest X-ray and CT-Scan images. Chaos Solitons Fractals 140, 110190 (2020). https://doi.org/10.1016/j.chaos.2020.110190
Article MathSciNet Google Scholar
LeCun, Y., Kavukcuoglu, K., Farabet, C.: Convolutional networks and applications in vision. In: ISCAS 2010 - 2010 IEEE International Symposium on Circuits and Systems: Nano-Bio Circuit Fabrics and Systems, pp. 253–256. [5537907] (ISCAS 2010 - 2010 IEEE International Symposium on Circuits and Systems: Nano-Bio Circuit Fabrics and Systems) (2010). https://doi.org/10.1109/ISCAS.2010.5537907
Chakraborty, S., Zhang, C.: Survival prediction model of renal transplantation using deep neural network. In: 2020 IEEE 1st International Conference for Convergence in Engineering (ICCE), pp. 180–183 (2020). https://doi.org/10.1109/ICCE50343.2020.9290695
Chakraborty, S., Murali, B.: A novel medical prognosis system for breast cancer. In: Mandal, J.K., Buyya, R., De, D. (eds.) Proceedings of International Conference on Advanced Computing Applications. AISC, vol. 1406, pp. 403–413. Springer, Singapore (2022). https://doi.org/10.1007/978-981-16-5207-3_34
Chapter Google Scholar
Chakraborty, S.: Category identification technique by a semantic feature generation algorithm. In: Deep Learning for Internet of Things Infrastructure, pp. 129–144. CRC Press
Google Scholar
Chakraborty, S., Bandyopadhyay, S.K.: Scene text detection using modified histogram of oriented gradient approach. IJAR 2(7), 795–798 (2016)
Google Scholar
Chakraborty, S., Murali, B.: Investigate the correlation of breast cancer dataset using different clustering technique. ArXiv abs/2109.01538, n. pag. (2021)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep Residual Learning for Pattern Recognition (CVPR), pp. 770–778 (2016). https://doi.org/10.1109/CVPR.2016.90
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017). https://doi.org/10.1145/3065386
Article Google Scholar
Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2261–2269 (2017). https://doi.org/10.1109/CVPR.2017.243
Zhang, X., Zou, J., He, K., Sun, J.: Accelerating very deep convolutional networks for classification and detection. IEEE Trans. Pattern Anal. Mach. Intell. 38(10), 1943–1955 (2016). https://doi.org/10.1109/TPAMI.2015.2502579
Article Google Scholar
PyTorch Vision Models. https://pytorch.org/vision/stable/models.html
Tan, M., Le, Q.V.: EfficientNet: rethinking model scaling for convolutional neural networks. ArXiv, abs/1905.11946 (2019)
Google Scholar
Tan, M., Chen, B., Pang, R., Vasudevan, V., Le, Q.V.: MnasNet: platform-aware neural architecture search for mobile. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2815–2823 (2019)
Google Scholar
Chakraborty, S., Murali, B., Mitra, A.K.: An efficient deep learning model to detect COVID-19 using chest X-ray images. Int. J. Environ. Res. Public Health 19, 2013 (2022). https://doi.org/10.3390/ijerph19042013
Article Google Scholar
Chakraborty, S., Ali, D., Murali, B.: A novel distributed database architectural model for mobile cloud computing. In: Mandal, J.K., Hsiung, P.A., Sankar Dhar, R. (eds.) ICCTA 2021. LNNS, vol. 426, pp. 155–161. Springer, Singapore (2022). https://doi.org/10.1007/978-981-19-0745-6_17

Download references

Author information

Authors and Affiliations

West Virginia University Institute of Technology, Beckley, WV, 25801, USA
Somenath Chakraborty
University of North Texas, Denton, TX, 76203, USA
Beddhu Murali

Authors

Somenath Chakraborty
View author publications
You can also search for this author in PubMed Google Scholar
Beddhu Murali
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Somenath Chakraborty .

Editor information

Editors and Affiliations

National Research Institute, National Information Processing Institut, Warszaw, Poland
Cezary Biele
Polish Academy of Sciences, Systems Research Institute, Warsaw, Poland
Janusz Kacprzyk
Polish-Japanese Academy of Information T, Warsaw, Poland
Wiesław Kopeć
Polish Academy of Sciences, Systems Research Institute, Warsaw, Poland
Jan W. Owsiński
Institute of Applied Computer Science, Łódż University of Technology, Łódź, Poland
Andrzej Romanowski
Department of Informatics in Management, Faculty of Management and Economics, Gdańsk University of Technology, Gdańsk, Poland
Marcin Sikorski

Rights and permissions

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chakraborty, S., Murali, B. (2023). A Competent Deep Learning Model to Detect COVID-19 Using Chest CT Images. In: Biele, C., Kacprzyk, J., Kopeć, W., Owsiński, J.W., Romanowski, A., Sikorski, M. (eds) Digital Interaction and Machine Intelligence. MIDI 2022. Lecture Notes in Networks and Systems, vol 710. Springer, Cham. https://doi.org/10.1007/978-3-031-37649-8_7

Download citation

DOI: https://doi.org/10.1007/978-3-031-37649-8_7
Published: 25 July 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-37648-1
Online ISBN: 978-3-031-37649-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics