Abstract
Following Petoukhov and his collaborators, we use two length n zero-one sequences, α and β, to represent a length n genetic sequence \({\alpha\choose\beta}\) so that the columns of \({\alpha\choose\beta}\) have the following correspondence with the nucleotides: \(C\sim{0\choose0}\) , \(U\sim{1\choose0}\) , \(G\sim{1\choose1}\) , \(A\sim{0\choose1}\) . Using the Gray code ordering to arrange α and β, we build a 2n×2n matrix C n including all the 4n length n genetic sequences. Furthermore, we use the Hamming distance of α and β to construct a 2n×2n matrix D n . We explore structures of these matrices, refine the results in earlier papers, and propose new directions for further research.
Similar content being viewed by others
References
Alff-Steinberger, C., 1969. The genetic code and error transmission. Proc. Natl. Acad. Sci. USA 64, 584–591.
Fickett, J., Burks, C., 1989. Development of a database for nucleotide sequences. In: Waterman, M.S. (Ed.), Mathematical Methods in DNA Sequences, pp. 1–34. CRC Press, Boca Raton.
Freeland, S.J., Wu, T., Keulmann, N., 2003. The case for an error minimizing genetic code. Orig. Life Evol. Biosph. 33, 457–477.
Gates, M.A., 1986. A simple way to look at DNA. J. Theor. Biol. 119, 319328.
Goldberg, A.L., Wittes, R.E., 1966. Genetic code: aspects of organisation. Science 153, 420–424.
Haig, D., Hurst, L.D., 1991. A quantitative measure of error minimization in the genetic code. J. Mol. Evol. 22, 412–417.
He, M.X., 2004. Genetic code, attributive mappings and stochastic matrices. Bull. Math. Biol. 66, 965–973.
He, M.X., Petoukhov, S.V., 2007. Harmony of living nature, symmetries of genetic systems and matrix genetics. Int. J. Integr. Biol. 1, 41–43.
He, M.X., Petoukhov, S.V., Ricci, P.E., 2004. Genetic code, Hamming distance and stochastic matrices. Bull. Math. Biol. 66, 1405–1421.
Horn, R.A., Johnson, C.R., 1985. Matrix Analysis. Cambridge University Press, Cambridge.
Leong, P.M., Morgenthaler, S., 1995. Random walk and gap plots of DNA sequences. Comput. Appl. Biosci. 11, 503507.
Nandy, A., 1994. A new graphical representation and analysis of DNA sequence structure: I. Methodology and application to globin genes. Curr. Sci. 66, 309314.
Petoukhov, S., 2008. Matrix genetics, algebras of the genetic code, noise-immunity. Moscow, RCD. http://www.geocities.com/symmetrion/Matrix_genetics/matrix_genetics.html.
Reif, J.H., 1998. Alternative Computational Models: A Comparison of Biomolecular and Quantum Computation, an invited paper at the 18th International Conference on Foundations of Software Technology and Theoretical Computer Science (FST&TCS98). Preprint can be found at http://www.cs.duke.edu/~reif/paper/altcomp.ps.
Sonneborn, T.M., 1965. Degeneracy of the Genetic Code: Extent, Nature and Genetic Implications. Academic Press, San Diego.
Steel, M., Penny, D., 2000. Parsimony, likelihood, and the role of models in molecular phylogenetics. Mol. Biol. Evol. 17, 839–850.
Swanson, R., 1984. A unifying concept for the amino acid code. Bull. Math. Biol. 46, 187–203.
Tucker, A., 2007. Applied Combinatorics, 5th edn. Wiley, New York.
Yang, Z., 1996. Phylogenetic analysis using parsimony and likelihood methods. J. Mol. Evol. 42, 294–307.
Author information
Authors and Affiliations
Corresponding author
Additional information
Research of T. Crowder was partially supported by the NSF CSUMS and NSF UBM undergraduate research grants at William and Mary. His current address is: US Navel Research Laboratory, 4555 Overlook Ave. S.W., Washington, DC 20375.
C.-K. Li is an honorary professor of the University of Hong Kong. His research was partially supported by NSF and the William and Mary Plumeri Award.
Rights and permissions
About this article
Cite this article
Crowder, T., Li, CK. Studying Genetic Code by a Matrix Approach. Bull. Math. Biol. 72, 953–972 (2010). https://doi.org/10.1007/s11538-009-9478-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11538-009-9478-7