Abstract
Much of recent work to determine primary structures of nucleic acids and proteins employs the “fragmentation” or “overlap” stratagem. Typically, a preparation of a given polymer with unknown sequence is purified and then subjected to an enzyme known to cut the polymer at certain specific sites. The quantities and sequences of the resulting fragments are determined. For RNA primary sequences, pancreatic ribonuclease and T1 ribonuclease are ordinarily used as fragmenting enzymes. A technique is described for evaluating such fragment data. It has the following properties: It is easily determined whether or not the fragment data is inconsistent. It is always possible to determine the first and last nucleotides of the unknown sequence from the data of two limit digests. Consistent data from two limit digests can always be fitted into a convenient conceptual framework developed within the theory of graphs. In most cases, partial digest information can be used to modify the framework constructed from two limit digests, as such information is obtained. An efficient analysis of all fragment data in this conceptual framework can always be made. One can detect inconsistencies and can generate the entire list of polymer sequences consistent with the fragment data.
Similar content being viewed by others
Literature
Dayhoff, M. O. 1964. “Computer Aids to Protein Sequence Determination”.J. Theor. Biol.,8, 97–112.
Ford, L. R. Jr. and D. R. Fulkerson. 1962. “Flows in Networks.” Princeton, New Jersey: Princeton University Press.
Goodman, H. M., J. Abelson, A. Landy, S. Brenner and J. D. Smith. 1968. “Amber Suppression: A Nucleotide Change in the Anticodon of a Tyrosine Transfer RNA.”Nature,217, 1019–1024 (Mar. 16).
Harary, F., R. Z. Norman and D. Cartwright. 1965. “Structural Models: An Introduction to the Theory of Directed Graphs.” New York: John Wiley & Sons, Inc.
Holley, R. W., J. Apgar, G. A. Everett, J. T. Madison, M. Marquisee, S. H. Merrill, J. R. Penswick and A. Zamir. 1965. “Structure of a Ribonucleic Acid”.Science,147, 1462–1465 (Mar. 19).
Hutchinson, G. 1968. “Evaluation of Polymer Sequence Data from Two Complete Digests.” Internal Report, National Institutes of Health.
Madison, J. T., G. A. Everett, and H. Kung 1966. “Nucleotide Sequence of a Yeast Tyrosine Transfer RNA”.Science,153, 531–534 (July 29).
Mosimann, J. E., M. B. Shapiro, C. R. Merril, D. F. Bradley and J. E. Vinton. 1966. “Reconstruction of Protein and Nucleic Acid Sequences: IV. The Algebra of Free Monoids and the Fragmentation Strategem. “Bull. Math. Biophysics,28, 235–260.
Mosimann, J. E., and J. E. Vinton. 1968. “Necessary and Sufficient Conditions for a Sequence to be Solvable with Complete Digest Fragments.” Manuscript, National Institutes of Health.
RajBhandary, U. L., S. H. Chang, A. Stuart, R. D. Faulkner, R. M. Hoskinson and H. G. Khorana. 1967. “Studies on Polynucleotides, LXVIII. The Primary Structure of Yeast Phenylalanine Transfer RNA”.Proc. N.A.S.,57, 751–758.
Shapiro, M. B., C. R. Merril, D. F. Bradley and J. E. Mosimann. 1965. “Reconstruction of Protein and Nucleic Acid Sequences: Alanine Transfer Ribonucleic Acid.”Science,150, 918–921 (Nov. 12).
—. 1967. “An Algorithm for Reconstructing Protein and RNA Sequences.”J. Assoc. Comp. Mach.,14, 720–731.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Hutchinson, G. Evaluation of polymer sequence fragment data using graph theory. Bulletin of Mathematical Biophysics 31, 541–562 (1969). https://doi.org/10.1007/BF02476636
Received:
Issue Date:
DOI: https://doi.org/10.1007/BF02476636