Interrelatedness of 5S RNA sequences investigated by correspondence analysis
Correspondence analysis (a form of multivariate statistics) applied to 74 5S ribosomal RNA sequences indicates that the sequences are interrelated in a systematic, nonrandom fashion. Aligned sequences are represented as vectors in a 5N-dimensional space, where N is the number of base positions in the 5S RNA molecule. Mutually orthogonal directions (called factor axes) along which intersequence variance is greatest are defined in this hyperspace. Projection of the sequences onto planes defined by these factorial directions reveals clustering of species that is suggestive of phylogenetic relationships. For each factorial direction, correspondence analysis points to regions of “importance”, i.e., those base positions at which the systematic changes occur that define that particular direction. In effect, the technique provides a rapid determination of group-specific signatures. In several instances, similarities between sequences are indicated that have only recently been inferred from visual base-to-base comparisons. These results suggest that correspondence analysis may provide a valuable starting point from which to uncover the patterns of change underlying the evolution of a macromolecule, such as 5S RNA.
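The encoding and factor-axis steps described above can be sketched in a few lines of NumPy. This is only an illustrative sketch, not the authors' procedure: it assumes the five categories per base position are A, C, G, U and an alignment gap (the abstract does not say what the five are), it uses the standard SVD formulation of correspondence analysis, and the six short sequences are hypothetical toy data rather than real 5S RNA. The names `encode`, `correspondence_analysis`, and `ALPHABET` are introduced here for illustration.

```python
import numpy as np

ALPHABET = "ACGU-"  # assumption: four bases plus an alignment gap


def encode(seqs):
    """Represent each aligned sequence as a 0/1 vector of length 5N."""
    n = len(seqs[0])
    if any(len(s) != n for s in seqs):
        raise ValueError("sequences must be aligned to the same length")
    k = len(ALPHABET)
    X = np.zeros((len(seqs), k * n))
    for i, s in enumerate(seqs):
        for j, ch in enumerate(s):
            X[i, j * k + ALPHABET.index(ch)] = 1.0
    return X


def correspondence_analysis(X, n_positions):
    """Return sequence coordinates, inertia per axis, and per-position importance."""
    keep = X.sum(axis=0) > 0              # drop symbols never observed at a position
    P = X[:, keep] / X.sum()              # correspondence matrix
    r = P.sum(axis=1)                     # row (sequence) masses
    c = P.sum(axis=0)                     # column (position/base) masses
    expected = np.outer(r, c)
    S = (P - expected) / np.sqrt(expected)    # standardized residuals
    U, sv, Vt = np.linalg.svd(S, full_matrices=False)
    rows = (U * sv) / np.sqrt(r)[:, None]     # principal coordinates of sequences
    inertia = sv ** 2                         # variance carried by each factor axis
    # Contribution of each base position to the first axis: squared loadings
    # of its columns, summed over the symbols observed at that position.
    positions = np.flatnonzero(keep) // len(ALPHABET)
    importance = np.bincount(positions, weights=Vt[0] ** 2, minlength=n_positions)
    return rows, inertia, importance


# Hypothetical toy alignment: two groups of three sequences each.
seqs = ["GCCUA", "GCCUA", "GCGUA", "AUCGA", "AUCGA", "AUGGA"]
rows, inertia, importance = correspondence_analysis(encode(seqs), len(seqs[0]))

print("coordinates on axis 1:", np.round(rows[:, 0], 3))
print("share of inertia on axis 1:", round(float(inertia[0] / inertia.sum()), 3))
print("position importance for axis 1:", np.round(importance, 3))
```

In this toy case the first axis separates the two groups, and the importance vector concentrates on the positions where the groups differ, which is the sense in which a factor axis points to a group-specific signature.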
Key words: 5S RNA, Correspondence analysis, Multivariate statistics, Evolution, Phylogeny