Skip to main content
Log in

Geometric Approach to Phylogeographic Analysis Molecular Genetic Sequences: Principal Components and Dendrograms

  • BIOINFORMATICS AND SYSTEMS COMPUTER BIOLOGY
  • Published:
Molecular Biology Aims and scope Submit manuscript

Abstract

Currently, the search for manifestations of selection under the influence of the environment in molecular sequences is usually carried out within closely related species or at the intraspecific level. It is believed that at high taxonomic levels this is unpromising due to phylogenetic relationship. Cytochrome b amino acid sequences of 67 rodent and lagomorph species with known geographic coordinates were digitized using the AAindex database. Based on more than 200 thousand characters, the principal components were obtained. A well-known statistical method, which has not been previously used for such problems, was used, which makes it possible to orthogonally decompose multidimensional variability into intra- and intertaxon variability and analyze them separately. The subfamily level was selected. For the second principal component (17.05% of intertaxon variability), a correlation with latitude was found (r = 0.561; n = 67; p < E–5). The clear division into two groups, revealed by the first principal component (39.48% of intertaxon variability), which does not coincide with the taxonomic one, indicates a possible physicochemical underlying cause for the differences between them. This requires further research.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1.
Fig. 2.

Similar content being viewed by others

REFERENCES

  1. NCBI Resource Coordinators. 2015. Database resources of the national center for biotechnology information. Nucleic Acids Res. 43, D6−D17.

    Article  Google Scholar 

  2. Tamura K., Stecher G., Kumar S. 2021. MEGA11: molecular evolutionary genetics analysis version 11. Mol. Biol. Evol. 38, 3022−3027.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Kawashima S., Pokarowski P., Pokarowska M., Kolinski A., Katayama T., Kanehisa M. 2008. AAindex: amino acid index database progress report 2008. Nucleic Acids Res. 36, D202−D205.

    Article  CAS  PubMed  Google Scholar 

  4. Gower J.C. 1966. Some distance properties of latent root and vector methods used in multivariate analysis. Biometrika. 53, 325−338.

    Article  Google Scholar 

  5. Fisher R.A. 1919. XV. The correlation between relatives on the supposition of Mendelian inheritance. Earth Environ. Sci. Trans. R. Soc. Edinb. 52, 399−433.

    Article  Google Scholar 

  6. Fisher R.A. 1936). The use of multiple measurements in taxonomic problems. Ann. Eugenics. 7, 179−188.

    Article  Google Scholar 

  7. Hammer Ø., Harper D.A.T., Ryan P.D. 2001. PAST: paleontological statistics software package for education and data analysis. Palaeontol. Electron. 4, 1−9.

    Google Scholar 

  8. Polunin D., Shtaiger I., Efimov V. 2019. JACOBI4 software for multivariate analysis of biological data. bioRxiv. 803684.

  9. Da Fonseca R.R., Johnson W.E., O’Brien S.J., Ramos M.J., Antunes A. 2008. The adaptive evolution of the mammalian mitochondrial genome. BMC Genomics. 9, 1−22.

    Article  Google Scholar 

  10. Abramson N.I., Bodrov S.Y., Bondareva O.V., Genelt-Yanovskiy E.A., Petrova T.V. 2021. A mitochondrial genome phylogeny of voles and lemmings (Rodentia: Arvicolinae): evolutionary and taxonomic implications. PLoS One. 16, e0248198.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Bondareva O., Genelt-Yanovskiy E., Petrova T., Bodrov S., Smorkatcheva A., Abramson N. 2021. Signatures of adaptation in mitochondrial genomes of Palearctic subterranean voles (Arvicolinae Rodentia). Genes. 12, 1945.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Mori S., Matsunami M. 2018. Signature of positive selection in mitochondrial DNA in Cetartiodactyla. Genes Genet. Systems. 17-00015.

Download references

Funding

This study was performed within the framework of the budgetary project of the Institute of Cytology and Genetics, Siberian Branch, Russian Academy of Sciences no. FWNR-2022-0020 “Systems Biology and Bioinformatics: Reconstruction, Analysis, and Modeling of the Structural-Functional Organization and Evolution of Human, Animal, Plant, and Microorganism Gene Networks.”

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to V. M. Efimov.

Ethics declarations

The authors declare that they have no conflicts of interest. This article does not contain any studies involving animals or human participants performed by any of the authors.

Additional information

Translated by M. Batrukova

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Efimov, V.M., Efimov, K.V. & Kovaleva, V.Y. Geometric Approach to Phylogeographic Analysis Molecular Genetic Sequences: Principal Components and Dendrograms. Mol Biol 57, 176–181 (2023). https://doi.org/10.1134/S002689332302005X

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1134/S002689332302005X

Keywords:

Navigation