RETRACTED ARTICLE: A deep learning model and machine learning methods for the classification of potential coronavirus treatments on a single human cell

Khalifa, Nour Eldeen M.; Taha, Mohamed Hamed N.; Manogaran, Gunasekaran; Loey, Mohamed

doi:10.1007/s11051-020-05041-z


Research paper
Published: 17 October 2020

Volume 22, article number 313, (2020)

Journal of Nanoparticle Research


Nour Eldeen M. Khalifa¹,
Mohamed Hamed N. Taha¹,
Gunasekaran Manogaran²,³ &
…
Mohamed Loey ORCID: orcid.org/0000-0002-3849-4566⁴


This article was retracted on 16 August 2021

This article has been updated

Abstract

The coronavirus pandemic is straining healthcare systems around the world to the limit of their capacity. There is an urgent need to find a treatment for this virus as early as possible. Computer algorithms and deep learning can contribute by helping to find a potential treatment for SARS-CoV-2. In this paper, a deep learning model and machine learning methods for the classification of potential coronavirus treatments on a single human cell are presented. The dataset used in this work is a subset of the publicly available datasets on RxRx.ai. The objective of this research is to automatically classify a single human cell according to the treatment type and the treatment concentration level. A DCNN model and a methodology are proposed. The core idea is to convert the numerical features from the original dataset to the image domain and then feed them into a DCNN model. The proposed DCNN model consists of three convolutional layers, three ReLU layers, three pooling layers, and two fully connected layers. The experimental results show that the proposed DCNN model for treatment classification (32 classes) achieved 98.05% testing accuracy, outperforming classical machine learning methods such as support vector machine, decision tree, and ensemble. In treatment concentration level prediction, the classical machine learning (ensemble) algorithm achieved 98.5% testing accuracy while the proposed DCNN model achieved 98.2%. The performance metrics support the results obtained from the experiments for both treatment classification and treatment concentration level prediction.


Introduction

The SARS virus spread around the world and caused widespread panic at the end of February 2003 (Chang et al. 2020; Chamola et al. 2020). This raised the alarm about viruses and their devastating impact in the new century. The latest coronavirus, which emerged in 2019, was named 2019-nCoV (COVID-19) by the World Health Organization (WHO) (Singhal 2020; Loey et al. 2020a). It was identified as SARS-CoV-2 by the International Committee on Taxonomy of Viruses (ICTV) in 2020 (Lai et al. 2020; Li et al. 2020; Sharfstein et al. 2020). By the time this article was published, the SARS-CoV-2 outbreak had affected 213 countries and territories and caused more than 500,000 fatalities (Worldometer 2020). Person-to-person transmission of the coronavirus spread rapidly, for example in Italy (Giovanetti et al. 2020), the US (Holshue et al. 2020), India (Khattar et al. 2020), and Germany (Rothe et al. 2020). As of 10 July 2020, there were more than 12 million confirmed SARS-CoV-2 cases, 6 million recoveries, and 550,000 deaths. Figure 1 shows some statistics on recovered and death cases of COVID-19 (Coronavirus (COVID-19) map 2020).

Most publications focus on the classification and detection of COVID-19 in X-ray and CT images (Civit-Masot et al. 2020; Waheed et al. 2020; Narayan Das et al. 2020; Ardakani et al. 2020). In this research, our focus is on identifying drugs that may help in treating COVID-19 and on studying the morphological effect of COVID-19. Today, deep learning (DL) is quickly becoming a crucial technology in image/video classification and detection (Loey et al. 2020b, c; Khalifa et al. 2019a). In this paper, a deep learning model and machine learning methods for the classification of potential coronavirus treatments on a single human cell are presented. The objective of this research is to automatically classify a single human cell according to the treatment type and the treatment concentration level. The novelty of this research lies in the proposed classification model, based on deep learning and machine learning, for COVID-19 treatments. The remainder of the paper is structured as follows. “Datasets characteristics” summarizes the dataset characteristics. “The proposed model” provides a detailed description of the proposed model. “Experimental results” reports and evaluates the experimental findings, and conclusions and potential future research are presented in “Conclusion and future works”.

Datasets characteristics

This research conducted its experiments on the dataset presented by Heiser et al. (2020). The dataset attributes are described in detail in Table 1. The data are publicly available at RxRx.ai under the name “RxRx19a Dataset”. It is a high-dimensional dataset that analyzes more than 1660 FDA-approved drugs in a human cellular model of SARS-CoV-2 infection and includes more than 300,000 recorded experiments. Although the data come from an in vitro screen of only a single human cell type, the dataset is likely broadly applicable to other primary human cell models.

Table 1 RxRx19a dataset attributes description


In this research, a subset of the data is used in the experiments. The subset includes VERO cells, a continuous cell lineage derived from kidney epithelial cells of an African green monkey, and human renal cortical epithelial (HRCE) cells. Both cell types were selected at treatment concentration levels of 10, 30, and 100 with active SARS-CoV-2. This subset includes 32 treatments and three treatment concentration levels with two classes of cell type. Only 3750 cell records are included in the experiments carried out in this research.

The proposed model

The introduced model consists of three phases. The first phase is the preprocessing phase that converts the numerical values of the 1024 cell features to a digital image. The second phase is the training phase based on machine learning algorithms for numerical features and deep convolutional neural networks for the converted image features. The third phase is the testing phase and the evaluation of proposed model accuracy for treatment classification and treatment concentration level prediction. Figure 2 presents the proposed model structure.

Preprocessing phase

The preprocessing phase includes (1) loading the 1024 cell features into computer memory, (2) rescaling the cell features from their original numerical range [−0.00046466477, 4.508815065] to the image range [0, 255] according to equation (1), and (3) constructing an image by converting the data vector of 1024 cell features into a 32 × 32 pixel image according to the pseudocode presented in Algorithm 1. The result of this phase is 3750 images. Figure 3 illustrates a set of images after the preprocessing phase.

$$ \mathrm{Pixel}\ \mathrm{value}=\mathrm{Round}\left(\frac{\left(\mathrm{feature}\ \mathrm{cell}\ \mathrm{value}-\left(-0.00046466477\right)\ \right)}{4.508815065}\times 255\right) $$

(1)

where − 0.00046466477 is the minimum cell value and 4.508815065 is the maximum cell value in the 1024 features of cell data and 255 is the maximum value of the image domain.

Algorithm 1: Constructing image from 1024 features of the cell data vector
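
The body of Algorithm 1 is not reproduced in this copy of the article. The preprocessing steps described above can be sketched in Python as follows. This is only an illustration under stated assumptions: the row-by-row fill order of the 32 × 32 image is an assumption, and the function names `to_pixel` and `features_to_image` are hypothetical rather than taken from the authors' implementation.

```python
# Constants quoted in the text for the selected RxRx19a subset.
FEATURE_MIN = -0.00046466477
FEATURE_MAX = 4.508815065
SIDE = 32  # 32 x 32 = 1024 features


def to_pixel(value):
    """Scale one feature value to an integer pixel, following equation (1)."""
    pixel = round((value - FEATURE_MIN) / FEATURE_MAX * 255)
    # Clamp defensively; values outside the quoted range are not expected.
    return max(0, min(255, pixel))


def features_to_image(features):
    """Reshape a 1024-element feature vector into a 32 x 32 grid of pixels.

    Row-major fill order is an assumption, not taken from the article.
    """
    if len(features) != SIDE * SIDE:
        raise ValueError("expected 1024 features")
    pixels = [to_pixel(v) for v in features]
    return [pixels[r * SIDE:(r + 1) * SIDE] for r in range(SIDE)]
```

Because equation (1) divides by the maximum value rather than by the range, the largest feature maps to slightly above 255 before rounding, which the rounding step brings back to 255.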

Training phase

The training phase follows two methodologies. The first uses machine learning algorithms such as support vector machine, decision trees, and ensemble algorithms. The second relies on deep convolutional neural networks.

Support vector machine

SVM is one of the most widely used machine learning techniques for classification and regression. Its per-sample term is shown in equation (2), where l is the label from 0 to 1, w · a − q is the output, w and q are the coefficients of the linear classifier, and a is the input vector. Equation (3) gives the loss function to be minimized (Çayir et al. 2018; Jogin et al. 2018).

$$ SV{M}_{{\mathrm{h}}_{\mathrm{k}}}=\max \left(0,1-{l}_{\mathrm{k}}\left(w\cdot {a}_{\mathrm{k}}-q\right)\right) $$

(2)

$$ SV{M}_{\mathrm{loss}}=\frac{1}{m}\ \sum \limits_{t=1}^m\max \left(0,{h}_{\mathrm{t}}\right) $$

(3)
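
Equations (2) and (3) can be illustrated with a short Python sketch. It assumes labels coded as −1/+1, which is the usual convention for the hinge loss (the text above describes the label as ranging from 0 to 1), and the function names are hypothetical.

```python
def hinge_term(label, w, a, q):
    """Equation (2): max(0, 1 - l * (w . a - q)) for one sample."""
    score = sum(wi * ai for wi, ai in zip(w, a)) - q
    return max(0.0, 1.0 - label * score)


def svm_loss(labels, w, samples, q):
    """Equation (3): mean of the per-sample hinge terms."""
    terms = [hinge_term(l, w, a, q) for l, a in zip(labels, samples)]
    return sum(terms) / len(terms)
```

A sample on the correct side of the margin contributes zero, while a misclassified sample contributes a term that grows linearly with its distance from the margin.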

Decision tree

The decision tree is a classification paradigm based on entropy and knowledge acquisition (information gain). Entropy measures the amount of uncertainty in the data, as shown in equation (4), where CD is the data, b is the class output, and p(b) is the proportion of samples carrying that class label. Knowledge acquisition (KA) is the reduction in entropy obtained by splitting the data, as illustrated in equation (5), where x is a subset of the data and p(x) is the proportion of the data that falls in x (Navada et al. 2011; Tu and Chung 1992).

$$ \mathrm{Entropy}\ (CD)=\sum \limits_{i=1}^n-p\left({b}_{\mathrm{i}}\right)\log \left(p\left({b}_{\mathrm{i}}\right)\right) $$

(4)

$$ KA=\mathrm{Entropy}\ (CD)-{\sum}_{\mathrm{x}\in \mathrm{D}}p(x)\mathrm{Entropy}\ (x) $$

(5)
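
A small Python sketch of equations (4) and (5) follows. The logarithm base is not stated in the article, so base 2 is an assumption here, and the function names are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Equation (4): -sum p(b_i) * log2 p(b_i) over the classes present."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def knowledge_acquisition(labels, subsets):
    """Equation (5): parent entropy minus the weighted entropy of each subset."""
    n = len(labels)
    weighted = sum(len(s) / n * entropy(s) for s in subsets)
    return entropy(labels) - weighted
```

A split that separates the classes perfectly recovers the full parent entropy, whereas a split that leaves the class mix unchanged yields a gain of zero.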

Ensemble methods

Ensemble methods are machine learning algorithms that build several classifiers and classify new cases by combining their individual decisions, typically through weighted or unweighted voting (Polikar 2012). The methods used are linear regression (Naseem et al. 2010), logistic regression (Kleinbaum and Klein 2002), and the k-nearest neighbors algorithm (k-NN) (Mangalova and Agafonov 2014). We combine the ensemble members according to equation (6) to achieve the best outcomes (Xiao et al. 2018).

$$ \overline{y}=\sum \limits_{k=1}^h{\alpha}_{\mathrm{k}}{y}_{\mathrm{k}} $$

(6)
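
The article does not define the symbols in equation (6). One plausible reading, assumed here, is that y_k is the output of the k-th of h classifiers (a vector of per-class scores) and α_k is its weight. Under that assumption, a hedged Python sketch with hypothetical function names:

```python
def weighted_ensemble(outputs, weights):
    """Equation (6): combine h model outputs y_k with weights alpha_k.

    outputs: list of h score vectors, one per model (one score per class).
    weights: list of h weights alpha_k.
    """
    n_classes = len(outputs[0])
    return [
        sum(alpha * y[c] for alpha, y in zip(weights, outputs))
        for c in range(n_classes)
    ]


def predict(outputs, weights):
    """Return the index of the class with the highest combined score."""
    combined = weighted_ensemble(outputs, weights)
    return max(range(len(combined)), key=combined.__getitem__)
```

With equal weights this reduces to unweighted averaging of the member outputs.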

Deep convolutional neural networks

The structure of the proposed deep convolutional neural network (DCNN) is presented in Fig. 4. The proposed DCNN consists of three main convolutional layers with a window size of 3 × 3 pixels, three ReLU layers, and three pooling layers. These layers act as feature extractors, while two fully connected layers act as classification layers. The proposed DCNN architecture is the result of extensive tuning based on the work presented in (Khalifa et al. 2018; Khalifa et al. 2019b; Khalifa et al. 2020; Loey et al. 2020d).
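
To give a feel for how a 32 × 32 input shrinks through three convolution, ReLU, and pooling blocks, the sketch below tracks the spatial size of the feature maps. The stride, padding, and pooling window are not given in this text, so stride-1 convolutions and non-overlapping 2 × 2 pooling are assumptions, and `feature_map_sizes` is a hypothetical helper rather than part of the authors' model.

```python
def feature_map_sizes(side=32, n_blocks=3, kernel=3, pool=2, same_padding=True):
    """Track the spatial size through conv -> ReLU -> pool blocks.

    Assumptions (not from the article): stride-1 convolutions and
    non-overlapping pooling with a `pool` x `pool` window.
    """
    sizes = [side]
    for _ in range(n_blocks):
        if not same_padding:
            side = side - kernel + 1  # an unpadded convolution shrinks the map
        side = side // pool           # pooling downsamples; ReLU keeps the size
        sizes.append(side)
    return sizes
```

Under these assumptions the feature maps entering the fully connected layers are 4 × 4 with padded convolutions and 2 × 2 without padding.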

One problem that DCNNs face is overfitting. Overfitting can be mitigated by data augmentation (Shorten and Khoshgoftaar 2019; El-Sawy et al. 2017a, b). Data augmentation increases the number of images used for training by applying label-preserving transformations. It is applied to the training set to make the resulting model more invariant to image transformations. In this work, each image in the training dataset is transformed as follows:

Reflection around X-axis.
Reflection around Y-axis.
Reflection around the X-Y axis.

The augmentation process raises the number of images from 3750 to 15,000, four times the size of the original dataset (each original image plus its three reflections). This leads to a significant improvement in the neural network training phase. Additionally, it makes the proposed DCNN less prone to memorizing the data and more robust.
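
The three reflections can be sketched in Python on images stored as nested lists. Reading "reflection around the X-Y axis" as a reflection around both axes is an assumption, and the function names are hypothetical.

```python
def flip_x(image):
    """Reflect around the X-axis: reverse the row order (top <-> bottom)."""
    return [row[:] for row in reversed(image)]


def flip_y(image):
    """Reflect around the Y-axis: reverse each row (left <-> right)."""
    return [row[::-1] for row in image]


def flip_xy(image):
    """Reflect around both axes, equivalent to a 180-degree rotation."""
    return flip_y(flip_x(image))


def augment(images):
    """Return each original image followed by its three reflections."""
    out = []
    for img in images:
        out.extend([img, flip_x(img), flip_y(img), flip_xy(img)])
    return out
```

Applied to 3750 images, `augment` returns 15,000 images, matching the count quoted above.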

Testing phase

The testing phase is the phase where the proposed model proves its performance and efficiency. The main goals of the proposed model are correctly classifying the treatments based on numerical features by using machine learning algorithms and correctly classifying the treatment images of the features based on DCNN. Also, the prediction of the treatment concentration on every cell is based on numerical features and image features using both machine learning and DCNN.

For machine learning, the performance evaluation includes testing accuracy along with the receiver operating characteristic (ROC) curve under 5-fold cross-validation. For DCNN, testing accuracy, precision, recall, and F1 score (Goutte and Gaussier 2010) are calculated from the confusion matrix. The performance metrics are presented in equations (7) to (10).

$$ \mathrm{Testing}\ \mathrm{Accuracy}=\frac{\mathrm{TruePos}+\mathrm{TrueNeg}}{\left(\mathrm{TruePos}+\mathrm{FalsePos}\right)+\left(\mathrm{TrueNeg}+\mathrm{FalseNeg}\right)} $$

(7)

$$ \mathrm{Precision}=\frac{\mathrm{TruePos}}{\left(\mathrm{TruePos}+\mathrm{FalsePos}\right)} $$

(8)

$$ \mathrm{Recall}=\frac{\mathrm{TruePos}}{\left(\mathrm{TruePos}+\mathrm{FalseNeg}\right)} $$

(9)

$$ \mathrm{F}1\ \mathrm{Score}=2\ast \frac{\mathrm{Precision}\times \mathrm{Recall}}{\left(\mathrm{Precision}+\mathrm{Recall}\right)} $$

(10)

where TruePos is the count of true positive samples, TrueNeg is the count of true negative samples, FalsePos is the count of false positive samples, and FalseNeg is the count of false negative samples from a confusion matrix.
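
Equations (7) to (10) translate directly into code. The sketch below also shows one way to obtain one-vs-rest counts for a class from a multi-class confusion matrix. The layout (rows as true classes, columns as predicted classes) is an assumption, as the article does not say how per-class values were aggregated, and the function names are hypothetical.

```python
def binary_metrics(tp, tn, fp, fn):
    """Equations (7)-(10); assumes no denominator is zero."""
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, f1


def counts_for_class(confusion, k):
    """One-vs-rest counts for class k.

    Assumed layout: confusion[true_class][predicted_class].
    """
    n = len(confusion)
    tp = confusion[k][k]
    fn = sum(confusion[k]) - tp
    fp = sum(confusion[r][k] for r in range(n)) - tp
    tn = sum(map(sum, confusion)) - tp - fn - fp
    return tp, tn, fp, fn
```

For example, `binary_metrics(*counts_for_class(confusion, 0))` returns the four metrics for class 0.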

Experimental results

The experiments are implemented using MATLAB software on a computer server with 96 GB of RAM and Intel Xeon processor (2 GHz). The following specifications are selected during the experiments:

For machine learning algorithms
- Three classifiers are tested (support vector machine, decision trees, and ensemble).
- Two problems (treatment classification and treatment concentration prediction).
- Dataset is in numerical format.
- 5-fold cross-validation is selected.
- Testing accuracy along with receiver operating characteristic (ROC) and area under curve (AUC) are selected as performance metrics.
For DCNN
- Using the proposed DCNN in “Training phase”.
- Two problems (treatment classification and treatment concentration prediction).
- Dataset is in digital image format.
- Dataset was divided into two sections (70% of the data for the training process and 30% for the testing process).
- Data augmentation is applied for treatment classification problems.
- Testing accuracy, precision, recall, and F1 score are selected as performance metrics.
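
The two evaluation protocols listed above (5-fold cross-validation for the machine learning algorithms and a 70%/30% split for the DCNN) can be sketched in Python as index generators. The experiments themselves were run in MATLAB, so this is only an illustration; shuffling with a fixed seed and the function names are assumptions.

```python
import random


def train_test_split(n_samples, train_fraction=0.7, seed=0):
    """Shuffle sample indices and split them into train and test parts."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    cut = round(n_samples * train_fraction)
    return indices[:cut], indices[cut:]


def k_fold_indices(n_samples, k=5, seed=0):
    """Yield (train, test) index lists; each fold is the test set once."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [j for f in folds[:i] + folds[i + 1:] for j in f]
        yield train, test
```

With the 3750 records used here, the 70%/30% split gives 2625 training and 1125 testing samples, and each of the five folds holds out 750 samples.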

Treatment classification results

There are 32 classes of treatment in the subset selected from the original dataset, and they are presented in Table 2. Treatment classification is evaluated using machine learning on the numerical format and DCNN on the digital image format.

Table 2 Treatment classes according to the selected dataset


The first results recorded are for classical machine learning. Three classical machine learning algorithms are selected: DT, SVM, and ensemble. Table 3 presents the average testing accuracy for the selected machine learning algorithms using 5-fold cross-validation.

Table 3 Testing accuracy using different machine learning algorithms


The ROC curve is one of the performance metrics for the machine learning algorithms. An ROC curve is a graph showing the performance of a classification model at all classification thresholds, plotting the true positive rate against the false positive rate. Figure 5 presents a set of ROC curves for the different machine learning algorithms for one treatment, oseltamivir-carboxylate. The AUC provides an aggregate measure of performance across all possible classification thresholds. For oseltamivir-carboxylate, the AUC was 73% using DT, 84% using SVM, and 86% using ensemble. About 96 ROC curves can be produced from the experimental trials, but there is no need to repeat the figures for the other treatments, as the testing accuracy is a good indicator of the quality of the machine learning algorithm.
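
For a single one-vs-rest problem, the AUC can be computed without drawing the curve, using the equivalent interpretation that it is the probability that a randomly chosen positive sample receives a higher score than a randomly chosen negative one. A minimal Python sketch, with a hypothetical function name and labels assumed to be coded as 1 (positive) and 0 (negative):

```python
def roc_auc(scores, labels):
    """AUC as the fraction of positive/negative pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

This pairwise form is quadratic in the number of samples, which is acceptable for a few thousand records.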

With the deep learning architecture, the achieved results are better than those of the machine learning algorithms in terms of testing accuracy and performance metrics. The proposed DCNN model, together with the conversion to the image domain and augmentation, helped achieve better results. The achieved testing accuracy was 98.05%, with a recall of 95.03%, a precision of 96.52%, and an F1 score of 95.97%. The confusion matrix is presented in Fig. 6. Using a deep learning model with the conversion of features to the image domain improved the testing accuracy by 25.35 percentage points over the ensemble algorithm, which achieved 72.7% testing accuracy.

The progress of the training phase of the proposed deep learning model is presented in Fig. 7, which shows how accuracy improved during training. The model was configured to stop training early if no better accuracy was achieved within 10 iterations. The batch size was 32 with a learning rate of 0.0001. Examples of testing accuracy along with treatment classification are presented in Fig. 8.
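
The early-stopping rule described above can be sketched as a small Python function. It assumes that one accuracy value is recorded per iteration and that "no better accuracy" means no strict improvement over the best value seen so far; the function name is hypothetical.

```python
def early_stopping_epoch(accuracies, patience=10):
    """Return the index at which training would stop, or None if it never does.

    accuracies: accuracy recorded after each iteration.
    Training stops once `patience` consecutive iterations fail to beat the best.
    """
    best = float("-inf")
    since_best = 0
    for i, acc in enumerate(accuracies):
        if acc > best:
            best, since_best = acc, 0
        else:
            since_best += 1
            if since_best >= patience:
                return i
    return None
```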

Treatment concentration prediction results

Another goal of the proposed model is to predict the concentration of the treatment on the cell. The first direction is to use machine learning algorithms to predict the concentration level of the treatment. Three concentration levels are investigated: 10, 30, and 100%. Table 4 presents the testing accuracy of treatment concentration prediction using DT, SVM, and ensemble algorithms with 5-fold cross-validation.

Table 4 Testing accuracy using different machine learning algorithms


ROC curves and AUC are additional indicators of classifier quality. Figure 9 presents the ROC curves for the different machine learning algorithms for the three treatment concentration levels (10, 30, and 100). The SVM and the ensemble algorithms achieved an AUC of 100%, which is a good indicator of classifier quality. Also, according to Table 4, SVM and ensemble achieved testing accuracies of 97.3% and 98.5%, respectively, for this three-class problem.

The second direction is to use deep learning, applying the same proposed DCNN model to the digital images of the features without augmentation. There was no need for augmentation, as the proposed model achieved a good testing accuracy of 98.2%. Figure 10 presents the confusion matrix for the concentration level of the potential treatment. Alongside the 98.2% testing accuracy, the performance metrics were as follows: recall 87.42%, precision 99.36%, and F1 score 93.01%.

For the 10% concentration level, the achieved accuracy was 98.1%; for the 30% concentration level, it was 100%; and for the 100% concentration level, it was also 100%. The accuracy achieved for every class reflects the performance of the proposed DCNN model.

Result discussion

For treatment classification, which includes 32 classes, the proposed DCNN achieved a superior result compared with the machine learning algorithms in terms of testing accuracy. The proposed DCNN achieved 98.05%, while the classical machine learning algorithms DT, SVM, and ensemble achieved 57.7%, 71.5%, and 72.7%, respectively. The performance metrics supported the results obtained for the proposed DCNN with feature-to-image conversion.

In treatment concentration level prediction, the classical machine learning algorithms DT and SVM achieved results close to those of the proposed DCNN. DT and SVM achieved 96.4% and 97.3%, respectively, while the DCNN achieved 98.2% testing accuracy. The ensemble algorithm achieved 98.5%, a higher testing accuracy than the DCNN. As a general observation, classical machine learning algorithms perform well on simple classification problems such as treatment concentration level prediction, which includes three classes. In multiclass classification such as treatment classification, which includes 32 classes, the deep learning model proved its performance and efficiency compared with classical machine learning.

Conclusion and future works

The coronavirus pandemic is putting healthcare systems around the world into a critical situation. Until now, there is no cure for this virus. One of the methods that can help to defeat this virus is trying approved treatments on human cells as a primary step to shorten the gap between treatments and finding an actual cure. Computer algorithms and deep learning can close that gap and help in finding a cure. In this paper, a deep learning model and machine learning methods for the classification of potential coronavirus treatments on a single human cell were presented. The dataset used in this work is a subset of the publicly available dataset on RxRx.ai. The objective of this research is to automatically classify the human cell according to treatment and treatment concentration level. The proposed DCNN model and methodology are based on converting the numerical features from the original dataset to the image domain. The proposed model consists of three convolutional layers, three ReLU layers, three pooling layers, and two fully connected layers. The experimental results showed that the proposed DCNN model for treatment classification (32 classes) achieved 98.05% testing accuracy, outperforming classical machine learning methods such as support vector machine, decision tree, and ensemble. In treatment concentration level prediction, the classical machine learning (ensemble) algorithm achieved 98.5% testing accuracy while the proposed DCNN model achieved 98.2%. One potential direction for future work is to perform the same experiments with deep transfer models such as AlexNet and ResNet50, or even deeper neural networks, to investigate their performance on the dataset used in this research.

Change history

16 August 2021
A Correction to this paper has been published: https://doi.org/10.1007/s11051-021-05266-6

References

Ardakani AA, Kanafi AR, Acharya UR, Khadem N, Mohammadi A (Jun. 2020) Application of deep learning technique to manage COVID-19 in routine clinical practice using CT images: results of 10 convolutional neural networks. Comput Biol Med 121:103795. https://doi.org/10.1016/j.compbiomed.2020.103795
Bagasta AR, Rustam Z, Pandelaki J, Nugroho WA (2019) Comparison of cubic SVM with Gaussian SVM: classification of infarction for detecting ischemic stroke, in IOP Conference Series: Materials Science and Engineering, vol. 546, no. 5, p. 052016
Banfield RE, Hall LO, Bowyer KW, Kegelmeyer WP (2006) A comparison of decision tree ensemble creation techniques. IEEE Trans Pattern Anal Mach Intell 29(1):173–180
Çayir A, Yenidoğan I, Dağ H (2018) Feature extraction based on deep learning for some traditional machine learning methods, in 2018 3rd International Conference on Computer Science and Engineering (UBMK), 2018, pp. 494–497, https://doi.org/10.1109/UBMK.2018.8566383
Chamola V, Hassija V, Gupta V, Guizani M (2020) A comprehensive review of the COVID-19 pandemic and the role of IoT, drones, AI, blockchain, and 5G in managing its impact. IEEE Access 8:90225–90265. https://doi.org/10.1109/ACCESS.2020.2992341
Chang Y-W, Lin C-J (2008) Feature ranking using linear SVM. In: Causation and prediction challenge, pp 53–64
Chang L, Yan Y, Wang L (2020) Coronavirus disease 2019: coronaviruses and blood safety. Transfus Med Rev. https://doi.org/10.1016/j.tmrv.2020.02.003
Civit-Masot J, Luna-Perejón F, Domínguez Morales M, Civit A (2020) Deep learning system for COVID-19 diagnosis aid using X-ray pulmonary images. Appl Sci 10(13):13. https://doi.org/10.3390/app10134640
Coronavirus (COVID-19) map (2020). https://www.google.com/covid19-map/ (accessed Apr. 26, 2020)
Damrongsakmethee T, Neagoe V-E (2019) Principal component analysis and relieff cascaded with decision tree for credit scoring, in Computer Science On-line Conference, pp. 85–95
El-Sawy A, Loey M, EL-Bakry H (2017a) Arabic handwritten characters recognition using convolutional neural network. WSEAS Trans Comput Res 5 Accessed: Apr. 01, 2020. [Online]. Available: http://www.wseas.org/multimedia/journals/computerresearch/2017/a045818-075.php
El-Sawy A, El-Bakry H, Loey M (2017b) CNN for handwritten Arabic digits recognition based on LeNet-5. In: Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2016, Cham, pp. 566–575
Giovanetti M, Benvenuto D, Angeletti S, Ciccozzi M (May 2020) The first two cases of 2019-nCoV in Italy: where they come from? J Med Virol 92(5):518–521. https://doi.org/10.1002/jmv.25699
Goutte C, Gaussier E (2010) A probabilistic interpretation of precision, recall and F-score, with implication for evaluation
Hang R, Liu Q, Song H, Sun Y (2015) Matrix-based discriminant subspace ensemble for hyperspectral image spatial–spectral feature fusion. IEEE Trans Geosci Remote Sens 54(2):783–794
Heiser K et al (2020) Identification of potential treatments for COVID-19 through artificial intelligence-enabled phenomic analysis of human cells infected with SARS-CoV-2. bioRxiv
Holshue ML, DeBolt C, Lindquist S, Lofy KH, Wiesman J, Bruce H, Spitters C, Ericson K, Wilkerson S, Tural A, Diaz G, Cohn A, Fox L, Patel A, Gerber SI, Kim L, Tong S, Lu X, Lindstrom S, Pallansch MA, Weldon WC, Biggs HM, Uyeki TM, Pillai SK, Washington State 2019-nCoV Case Investigation Team (2020) First case of 2019 novel coronavirus in the United States. N Engl J Med 382(10):929–936. https://doi.org/10.1056/NEJMoa2001191
Jogin M, Mohana, Madhulika MS, Divya GD, Meghana RK, Apoorva S (2018) Feature extraction using convolution neural networks (CNN) and deep learning, in 2018 3rd IEEE International Conference on Recent Trends in Electronics, Information Communication Technology (RTEICT), pp. 2319–2323, https://doi.org/10.1109/RTEICT42901.2018.9012507
Khalifa NEM, Taha MHN, Hassanien AE (2018) Aquarium family fish species identification system using deep neural networks. In: International Conference on Advanced Intelligent Systems and Informatics, pp 347–356
Khalifa N, Loey M, Taha M, Mohamed H (2019a) Deep transfer learning models for medical diabetic retinopathy detection. Acta Inform Med 27(5):327. https://doi.org/10.5455/aim.2019.27.327-332
Khalifa NEM, Taha MHN, Hassanien AE, Hemedan AA (2019b) Deep bacteria: robust deep learning data augmentation design for limited bacterial colony dataset. Int J Reason Based Intell Syst 11(3):256–264
Khalifa NEM, Taha MHN, Ali DE, Slowik A, Hassanien AE (2020) Artificial intelligence technique for gene expression by tumor RNA-Seq data: a novel optimized deep learning approach. IEEE Access 8:22874–22883
Khattar A, Jain PR, Quadri SMK (2020) Effects of the disastrous pandemic COVID 19 on learning styles, activities and mental health of young Indian students - a machine learning approach, in 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS), pp. 1190–1195, https://doi.org/10.1109/ICICCS48265.2020.9120955
Kleinbaum DG, Klein M (2002) Logistic regression: a self-learning text, 2nd edn. Springer-Verlag, New York
Lai C-C, Shih T-P, Ko W-C, Tang H-J, Hsueh P-R (2020) Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and coronavirus disease-2019 (COVID-19): the epidemic and the challenges. Int J Antimicrob Agents 55(3):105924. https://doi.org/10.1016/j.ijantimicag.2020.105924
Li J, Li J(J), Xie X, Cai X, Huang J, Tian X, Zhu H (Mar. 2020) Game consumption and the 2019 novel coronavirus. Lancet Infect Dis 20(3):275–276. https://doi.org/10.1016/S1473-3099(20)30063-3
Loey M, Smarandache F, Khalifa NEM (2020a) Within the lack of chest COVID-19 X-ray dataset: a novel detection model based on GAN and deep transfer learning. Symmetry 12(4):4. https://doi.org/10.3390/sym12040651
Loey M, ElSawy A, Afify M (2020b) Deep learning in plant diseases detection for agricultural crops: a survey. Int J Serv Sci Manag Eng Technol (IJSSMET) www.igi-global.com/article/deep-learning-in-plant-diseases-detection-for-agricultural-crops/248499 (accessed Apr. 11, 2020)
Loey M, Naman MR, Zayed HH (2020c) A survey on blood image diseases detection using deep learning. Int J Serv Sci Manag Eng Technol (IJSSMET) www.igi-global.com/article/a-survey-on-blood-image-diseases-detection-using-deep-learning/256653 (accessed Jun. 17, 2020)
Loey M, Naman M, Zayed H (2020d) Deep transfer learning in diagnosing leukemia in blood cells. Computers 9(2):2. https://doi.org/10.3390/computers9020029
Mangalova E, Agafonov E (Apr. 2014) Wind power forecasting using the k-nearest neighbors algorithm. Int J Forecast 30(2):402–406. https://doi.org/10.1016/j.ijforecast.2013.07.008
Narayan Das N, Kumar N, Kaur M, Kumar V, Singh D (2020) Automated deep transfer learning-based approach for detection of COVID-19 infection in chest X-rays. IRBM. https://doi.org/10.1016/j.irbm.2020.07.001
Naseem I, Togneri R, Bennamoun M (Nov. 2010) Linear regression for face recognition. IEEE Trans Pattern Anal Mach Intell 32(11):2106–2112. https://doi.org/10.1109/TPAMI.2010.128
Navada A, Ansari AN, Patil S, Sonkamble BA (2011) Overview of use of decision tree algorithms in machine learning, in 2011 IEEE Control and System Graduate Research Colloquium, pp. 37–42, https://doi.org/10.1109/ICSGRC.2011.5991826
Polikar R (2012) Ensemble learning. In: Zhang C, Ma Y (eds) Ensemble machine learning: methods and applications. Springer US, Boston, MA, pp 1–34
Rothe C, Schunk M, Sothmann P, Bretzel G, Froeschl G, Wallrauch C, Zimmer T, Thiel V, Janke C, Guggemos W, Seilmaier M, Drosten C, Vollmar P, Zwirglmaier K, Zange S, Wölfel R, Hoelscher M (2020) Transmission of 2019-nCoV infection from an asymptomatic contact in Germany. N Engl J Med 382(10):970–971. https://doi.org/10.1056/NEJMc2001468
Sharfstein JM, Becker SJ, Mello MM (2020) Diagnostic testing for the novel coronavirus. JAMA. https://doi.org/10.1001/jama.2020.3864
Shorten C, Khoshgoftaar TM (2019) A survey on image data augmentation for deep learning. J Big Data 6(1):60
Singhal T (2020) A review of coronavirus disease-2019 (COVID-19). Indian J Pediatr 87(4):281–286. https://doi.org/10.1007/s12098-020-03263-6
Tu P-L, Chung J-Y (1992) A new decision-tree classification algorithm for machine learning, in Proceedings Fourth International Conference on Tools with Artificial Intelligence TAI ‘92, pp. 370–377, https://doi.org/10.1109/TAI.1992.246431
Waheed A, Goyal M, Gupta D, Khanna A, Al-Turjman F, Pinheiro PR (2020) CovidGAN: data augmentation using auxiliary classifier GAN for improved Covid-19 detection. IEEE Access 8:91916–91923. https://doi.org/10.1109/ACCESS.2020.2994762
Worldometer (2020) Countries where Coronavirus has spread – Worldometer. https://www.worldometers.info/coronavirus/countries-where-coronavirus-has-spread/ (accessed Jul. 10, 2020)
Xiao Y, Wu J, Lin Z, Zhao X (2018) A deep learning-based multi-model ensemble method for cancer prediction. Comput Methods Prog Biomed 153:1–9. https://doi.org/10.1016/j.cmpb.2017.09.005


Funding

This research received no external funding.

Author information

Authors and Affiliations

Department of Information Technology, Faculty of Computers & Artificial Intelligence, Cairo University, Cairo, 12613, Egypt
Nour Eldeen M. Khalifa & Mohamed Hamed N. Taha
University of California, Davis, USA
Gunasekaran Manogaran
College of Information and Electrical Engineering, Asia University, Taichung, Taiwan
Gunasekaran Manogaran
Department of Computer Science, Faculty of Computers and Artificial Intelligence, Benha University, Benha, 13518, Egypt
Mohamed Loey


Corresponding author

Correspondence to Mohamed Loey.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the topical collection: Role of Nanotechnology and Internet of Things in Healthcare

This article has been retracted. Please see the retraction notice for more detail: https://doi.org/10.1007/s11051-021-05266-6

About this article

Cite this article

Khalifa, N.E.M., Taha, M.H.N., Manogaran, G. et al. RETRACTED ARTICLE: A deep learning model and machine learning methods for the classification of potential coronavirus treatments on a single human cell. J Nanopart Res 22, 313 (2020). https://doi.org/10.1007/s11051-020-05041-z


Received: 27 July 2020
Accepted: 06 October 2020
Published: 17 October 2020
DOI: https://doi.org/10.1007/s11051-020-05041-z
