Abstract
This paper presents a statistical genomic analysis of the three major coronaviruses that have caused outbreaks in history, the SARS-CoV, MERS-CoV and SARS-CoV-2. Fasta files of the isolated genomes of these viruses were used to perform a global pairwise alignment in Biopython in order to establish sequence identity and the possibility of similar origins. These were then translated into functional proteins and tested for statistically significant differences in aromaticities, instability indices and isoelectric points using ANOVAs and Tukey-HSD post-hoc tests. This research will help future researchers in understanding the characteristics of the SARS-CoV-2 (the new virus strain that has caused the COVID-19 pandemic) and the properties that distinguish it from the MERS-CoV and SARS-CoV.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Wu, D., Wu, T., Liu, Q., Yang, Z.: The SARS-CoV-2 outbreak: what we know. Int. J. Infect. Dis. (2020)
World Health Organisation: https://www.who.int/news-room/fact-sheets/detail/middle-east-respiratory-syndrome-coronavirus-(mers-cov. Accessed May 2020
World Health Organisation: https://www.who.int/ith/diseases/sars/en/. Accessed May 2020
Andersen, K.G., Rambaut, A., Lipkin, W.I., Holmes, E.C., Garry, R.F.: The proximal origin of SARS-CoV-2. Nat. Med. (2020)
Chin, A.W.H., Chu, J.T.S., Perera, M.R.A., Hui, K.P.Y., Yen, H.-L., Chan, M.C.W., Peiris, M., Poon, L.L.M.: Stability of SARS-CoV-2 in different environmental conditions. Lancet Microbe (2020)
Radenkovic, D.: Kaggle. https://www.kaggle.com/radenkovic/coronavirus-accession-sars-mers-cov2 (2020)
Lu, R., Zhao, X., Li, J., Niu, P., Yang, B., Wu, H., Wang, W., Song, H., Huang, B., Zhu, N., Bi, Y., Ma, X., Zhan, F., Wang, L., Hu, T., Zhou, H., Hu, Z., Zhou, W., Zhao, L., Chen, J., Meng, Y., Wang, J., Lin, Y., Yuan, J., Xie, Z., Ma, J., Liu, W.J., Wang, D., Xu, W., Holmes, E.C., Gao, G.F., Wu, G., Chen, W., Shi, W., Tan, W.: Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. Lancet (2020)
Lobry, J.R., Gautier, C.: Hydrophobicity, expressivity and aromaticity are the major trends of amino-acid usage in 999 Escherichia coli chromosome-encoded genes. Nucleic Acids Res. 22, 3174–3180 (1994)
Bjellqvist, B., Hughes, G.J., Pasquali, C., Paquet, N., Ravier, F., Sanchez, J.-C., Frutiger, S., Hochstrasser, D.: The focusing positions of polypeptides in immobilized pH gradients can be predicted from their amino acid sequences. Electrophoresis 14, 1023–1031 (1993)
Guruprasad, K., Reddy, B.V.B., Pandit, M.W.: Correlation between stability of a protein and its dipeptide composition: a novel approach for predicting in vivo stability of a protein from its primary sequence. Protein Eng. Design Sel. 4, 155–161 (1990)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sharma, M. (2021). Statistical Genomic Analysis of the SARS-CoV, MERS-CoV and SARS-CoV-2. In: Hassanien, A.E., Bhattacharyya, S., Chakrabati, S., Bhattacharya, A., Dutta, S. (eds) Emerging Technologies in Data Mining and Information Security. Advances in Intelligent Systems and Computing, vol 1300. Springer, Singapore. https://doi.org/10.1007/978-981-33-4367-2_52
Download citation
DOI: https://doi.org/10.1007/978-981-33-4367-2_52
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-33-4366-5
Online ISBN: 978-981-33-4367-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)