Abstract
Well-known dynamic programming algorithms exist for comparing two finite sequences inO(N 2) time and storage, whereN is the common sequence length. Extensions to the comparison ofM finite sequences requireO((2N) M) time and storage, making such algorithms difficult even forM=3. A simple generalization of the sequences makes it possible to obtain some results about the geometry of sequence alignments. These ideas suggest heuristic approaches to problems of comparing several sequences. IfM sequences are known to be related by a binary tree, they can be aligned inO(MN 2) time andO(N 2+NM) storage.
Similar content being viewed by others
Literature
Busemann, H. 1955.The Geometry of the Geodesics. Academic Press, New York.
Fitch, W. M. 1971. “Towards Defining the Course of Evolution: Minimum Change for a Specific Tree Topology.”Syst. Zool.,20, 406–416.
Kruskal, J. B. and M. Liberman. 1983. InTime Warps, String Edits, and Macromolecules: the Theory and Practice of Sequence Comparison, Ed. D. Sankoff and J. B. Kruskal, pp. 125–160. Addison-Wesley, London.
Rabiner, L. R. and J. G. Wilpon. 1979. “Considerations in Applying Clustering Techniques to Speaker-independent Word Recognition.”J. Acoustic. Soc. Am. 66, 663–673.
— and—. 1980. “A Simplified, Robust Training Procedure for Speaker-trained, Isolated-word Recognition Systems.”J. Acoustic. Soc. Am. 68, 1271–1276.
Sankoff, D. 1972. “Matching Sequences under Deletion-Insertion Constraints.”Proc. natn. Acad. Sci. U.S.A. 69, 4–6.
— 1975. “Minimal Mutation Trees of Sequences.”SIAM J. appl. Math. 28, 35–42.
— and J. B. Kruskal. 1983.Time Warps, String Edits, and Macromolecules: the Theory and Practice of Sequence Comparison. Addison-Wesley, London.
Sellers, P. H. 1974. “On the Theory and Computation of Evolutionary Distances.”SIAM J. appl. Math. 26, 787–793.
Ulam, S. M. 1972. InApplications of Number Theory to Numerical Analysis, Ed. S. K. Zaremba, pp. 1–10. Academic Press, New York.
Wagner, R. A. and M. J. Fischer. 1974. “The String-to-String Correction Problem.”J. Ass. comp. Mach. 21, 169–173.
Waterman, M. S., T. F. Smith and W. A. Beyer. 1976. “Some Biological Sequence Metrics.”Adv. Math. 20, 367–387.
Woese, C. R., R. Gutell, R. Gupta and H. F. Noller. 1983. “Detailed Analysis of the Higher-order Structure of 16S-like Ribosonal Ribonucleic Acids.”Microbiol. Rev. 47, 621–669.
Author information
Authors and Affiliations
Additional information
This work was supported by a grant from the System Development Foundation.
Rights and permissions
About this article
Cite this article
Waterman, M.S., Perlwitz, M.D. Line geometries for sequence comparisons. Bltn Mathcal Biology 46, 567–577 (1984). https://doi.org/10.1007/BF02459504
Issue Date:
DOI: https://doi.org/10.1007/BF02459504