Abstract
Human populations are constantly inundated with viruses, some of which are responsible for various deadly diseases. Molecular biology approaches have been employed extensively to identify pathogenic viruses despite the limitations of the approaches. Nevertheless, recent advances in the next generation sequencing technologies have led to a surge in viral genome sequence databases with potentials for Bioinformatics based virus identification. In this study, we have utilised the Gaussian radial basis function neural network to identify pathogenic viruses. To validate the neural network model, samples of sequences of four different pathogenic viruses were extracted from the ViPR corpus. Electron-ion interaction pseudopotential scheme was used to encode the extracted sample sequences while cepstral analysis technique was applied to the encoded sequences to obtain a new set of genomic features, here called Genomic Cepstral Coefficients (GCCs). Experiments were performed to determine the potency of the GCCs to discriminate between different pathogenic viruses. Results show that GCCs are highly discriminating and gave good results when applied to identify some selected pathogenic viruses.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Wang, D., Urisman, A., Liu, Y.T., Springer, M., Ksiazek, T.G., Erdman, D.D., DeRisi, J.L.: Viral discovery and sequence recovery using DNA microarrays. PLoS Biol. 1(2), 257–260 (2003)
Mabrouk, M.S.: A study of the potential of EIIP mapping method in exon prediction using the frequency domain techniques. Am. J. Biomed. Eng. 2(2), 17–22 (2012)
Sathish Kumar, S., Duraipandian, N.: An effective identification of species from DNA sequence: a classification technique by integrating DM and ANN. Int. J. Adv. Comput. Sci. Appl. 3(8), 104–114 (2012)
Karthika, V., Nair, V.V., Gopinath, D.P.: Classification of organisms using frequency-chaos game representation of genomic sequences and ANN. In: Proceedings of the 10th National Conference on Technological Trends (NCTT09), pp. 243–247, 6–7 Nov 2009
Sandberg, R., Winberg, G., Branden, C.I., Kaske, A., Ernberg, I., Coster, J.: Capturing Whole—Genome characteristics in short sequences using a naive Bayesian classifier. Genome Res. 11, 1404–1409 (2001)
Zanoguera, F., De Francesco, M.: Protein classification into domains of life using Markov chain models. In: Proceeding of the Computational Systems Bioinformatics Conference, pp. 517–519 (2004)
Song, C., Shi, F.: Prediction of protein subcellular localization based on Hilbert-Huang transform. Wuhan Univ. J. Nat. Sci. 17(1), 48–54 (2012)
Adetiba, E., Olugbara, O.O.: Lung cancer prediction using neural network ensemble with histogram of oriented gradient genomic features. Sci. World J. 2015, id. 786013, 1–17 (2015)
Voss, R.F.: Evolution of long-range fractal correlations and 1/f noise in DNA base sequences. Phys. Rev. Lett. 68(25), 3805 (1992)
Anastassiou, D.: Frequency-domain analysis of biomelecular sequences. Bioinformatics 16(12), 1073–1081 (2000)
Anastassiou, D.: Genomic signal processing. IEEE Signal Process. Mag. 18(4), 8–20 (2001)
Nair, A.S., Sreenadhan, S.P.: A coding measure scheme employing electron-ion interaction pseudopotential (EIIP). Bioinformation 1(6), 197–202 (2006)
Pirogova, E., Simon, G.P., Cosic, I.: Investigation of the applicability of dielectric relaxation properties of amino acid solutions within the resonant recognition model. IEEE Trans. Nanobiosci. 2, 63–69 (2003)
Akay, M.: Biomedical Signal Processing, pp. 113–135. Academic Press (2012)
Oppenheim, A.V., Schafer, R.W.: Digital Signal Processing. Englewood Cliffs, Prentice-Hall (1975)
Thakur, S., Adetiba, E., Olugbara, O.O., Millham, R.: Experimentation using short-term spectral features for secure mobile internet voting authentication. Math. Prob. Eng. 2015, id. 564904, 1– 21 (2015)
Adetiba, E., Ekeh, J.C., Matthews, V.O., Daramola, S.A., Eleanya, M.E.U.: Estimating an optimal backpropagation algorithm for training an ANN with the EGFR Exon 19 nucleotide sequence: an electronic diagnostic basis for non-small cell lung cancer (NSCLC). J. Emerg. Trends Eng. Appl. Sci. 2(1), 74–78 (2011)
Kurban, T., Beşdok, E.: A comparison of RBF neural network training algorithms for inertial sensor based terrain classification. Sensors 9(8), 6312–6329 (2009)
Oyang, Y.J., Hwang, S.C., Ou, Y.Y., Chen, C.Y., Chen, Z.W.: Data classification with radial basis function networks based on a novel kernel density estimation algorithm. IEEE Trans. Neural Netw. 16, 225–236 (2005)
Lee, C.C., Chung, P.C., Tsai, J.R., Chang, C.I.: Robust radial basis function neural networks. IEEE Trans. Syst. Man Cybern. Part B: Cybern. 29(6), 674–685 (1999)
Derks, E.P.P.A., Pastor, M.S., Buydens, L.M.C.: Robustness analysis of radial base function and multi-layered feed-forward neural network models. Chemometr. Intell. Lab. Syst. 28(1), 49–60 (1995)
Pickett, B.E., Greer, D.S., Zhang, Y.: Virus pathogen database and analysis resource (ViPR): a comprehensive bioinformatics database and analysis resource for the coronavirus research community. Viruses 4, 3209–3226 (2012)
Zou, K.H., O’Malley, A.J., Mauri, L.: Receiver-operating characteristic analysis for evaluating diagnostic tests and predictive models. Circulation 115(5), 654–657 (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Adetiba, E., Olugbara, O.O., Taiwo, T.B. (2016). Identification of Pathogenic Viruses Using Genomic Cepstral Coefficients with Radial Basis Function Neural Network. In: Pillay, N., Engelbrecht, A., Abraham, A., du Plessis, M., Snášel, V., Muda, A. (eds) Advances in Nature and Biologically Inspired Computing. Advances in Intelligent Systems and Computing, vol 419. Springer, Cham. https://doi.org/10.1007/978-3-319-27400-3_25
Download citation
DOI: https://doi.org/10.1007/978-3-319-27400-3_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27399-0
Online ISBN: 978-3-319-27400-3
eBook Packages: Computer ScienceComputer Science (R0)