Abstract
Currently, the search for manifestations of selection under the influence of the environment in molecular sequences is usually carried out within closely related species or at the intraspecific level. It is believed that at high taxonomic levels this is unpromising due to phylogenetic relationship. Cytochrome b amino acid sequences of 67 rodent and lagomorph species with known geographic coordinates were digitized using the AAindex database. Based on more than 200 thousand characters, the principal components were obtained. A well-known statistical method, which has not been previously used for such problems, was used, which makes it possible to orthogonally decompose multidimensional variability into intra- and intertaxon variability and analyze them separately. The subfamily level was selected. For the second principal component (17.05% of intertaxon variability), a correlation with latitude was found (r = 0.561; n = 67; p < E–5). The clear division into two groups, revealed by the first principal component (39.48% of intertaxon variability), which does not coincide with the taxonomic one, indicates a possible physicochemical underlying cause for the differences between them. This requires further research.
Similar content being viewed by others
REFERENCES
NCBI Resource Coordinators. 2015. Database resources of the national center for biotechnology information. Nucleic Acids Res. 43, D6−D17.
Tamura K., Stecher G., Kumar S. 2021. MEGA11: molecular evolutionary genetics analysis version 11. Mol. Biol. Evol. 38, 3022−3027.
Kawashima S., Pokarowski P., Pokarowska M., Kolinski A., Katayama T., Kanehisa M. 2008. AAindex: amino acid index database progress report 2008. Nucleic Acids Res. 36, D202−D205.
Gower J.C. 1966. Some distance properties of latent root and vector methods used in multivariate analysis. Biometrika. 53, 325−338.
Fisher R.A. 1919. XV. The correlation between relatives on the supposition of Mendelian inheritance. Earth Environ. Sci. Trans. R. Soc. Edinb. 52, 399−433.
Fisher R.A. 1936). The use of multiple measurements in taxonomic problems. Ann. Eugenics. 7, 179−188.
Hammer Ø., Harper D.A.T., Ryan P.D. 2001. PAST: paleontological statistics software package for education and data analysis. Palaeontol. Electron. 4, 1−9.
Polunin D., Shtaiger I., Efimov V. 2019. JACOBI4 software for multivariate analysis of biological data. bioRxiv. 803684.
Da Fonseca R.R., Johnson W.E., O’Brien S.J., Ramos M.J., Antunes A. 2008. The adaptive evolution of the mammalian mitochondrial genome. BMC Genomics. 9, 1−22.
Abramson N.I., Bodrov S.Y., Bondareva O.V., Genelt-Yanovskiy E.A., Petrova T.V. 2021. A mitochondrial genome phylogeny of voles and lemmings (Rodentia: Arvicolinae): evolutionary and taxonomic implications. PLoS One. 16, e0248198.
Bondareva O., Genelt-Yanovskiy E., Petrova T., Bodrov S., Smorkatcheva A., Abramson N. 2021. Signatures of adaptation in mitochondrial genomes of Palearctic subterranean voles (Arvicolinae Rodentia). Genes. 12, 1945.
Mori S., Matsunami M. 2018. Signature of positive selection in mitochondrial DNA in Cetartiodactyla. Genes Genet. Systems. 17-00015.
Funding
This study was performed within the framework of the budgetary project of the Institute of Cytology and Genetics, Siberian Branch, Russian Academy of Sciences no. FWNR-2022-0020 “Systems Biology and Bioinformatics: Reconstruction, Analysis, and Modeling of the Structural-Functional Organization and Evolution of Human, Animal, Plant, and Microorganism Gene Networks.”
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
The authors declare that they have no conflicts of interest. This article does not contain any studies involving animals or human participants performed by any of the authors.
Additional information
Translated by M. Batrukova
Rights and permissions
About this article
Cite this article
Efimov, V.M., Efimov, K.V. & Kovaleva, V.Y. Geometric Approach to Phylogeographic Analysis Molecular Genetic Sequences: Principal Components and Dendrograms. Mol Biol 57, 176–181 (2023). https://doi.org/10.1134/S002689332302005X
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1134/S002689332302005X