Journal of Classification

, Volume 1, Issue 1, pp 187–233 | Cite as

The generation of random, binary unordered trees

  • George W. Furnas
Authors Of Articles

Abstract

Several techniques are given for the uniform generation of trees for use in Monte Carlo studies of clustering and tree representations. First, general strategies are reviewed for random selection from a set of combinatorial objects with special emphasis on two that use random mapping operations. Theorems are given on how the number of such objects in the set (e.g., whether the number is prime) affects which strategies can be used. Based on these results, methods are presented for the random generation of six types of binary unordered trees. Three types of labeling and both rooted and unrooted forms are considered. Presentation of each method includes the theory of the method, the generation algorithm, an analysis of its computational complexity and comments on the distribution of trees over which it samples. Formal proofs and detailed algorithms are in appendices.

Keywords

Uniform sampling Tree construction Monte Carlo studies Tree algorithms Clustering methodology Classification methodology 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. BEYER, T., and HEDETNIEMI, S. M. (1980), “Constant Time Generation of Rooted Trees,”SIAM Journal of Computing, 9, 706–712.Google Scholar
  2. CARROLL, J. D., and CHANG, J.-J. (1973), “A Method for Fitting a Class of Hierarchical Tree Structure Models to Dissimilarities Data and Its Application to Some ‘Body Parts’ Data of Miller's,”Proceedings of the 81st annual convention of the American Psychological Association, 8, 1097–1098.Google Scholar
  3. CARROLL, J. D., and CHANG, J.-J. (1974), “A General Procedure for Fitting Tree Structure Models to Dissimilarity Data,” Unpublished Bell Laboratories Technical Memorandum. Presented at the annual meeting of the Classification Society, North American Branch.Google Scholar
  4. COMTET, L. (1970),Advanced Combinatorics, Boston: D. Reidel Publishing Co.Google Scholar
  5. DESOETE, G., DESARBO, W., FURNAS, G. W., and CARROLL, J. D. (1984), “The Representation of Nonsymmetric Rectangular Proximity Data by Ultrametric and Path Length Tree Structures,”Psychometrika, 49, 289–310.Google Scholar
  6. DINITS, E. A., and ZAITSEV, M. A. (1977), “Algorithm for the Generation of Nonisomorphic Trees,”Avtom. and Telemekh. (USSR)38 121–126, Trans in:Automation and Remote Control (USA), 4, 554–558.Google Scholar
  7. FELSENSTEIN, J. (1978), “The Number of Evolutionary Trees,”Systematic Zoology, 27, 27–33.Google Scholar
  8. FOWLKES, E. B., MALLOWS, C. L., and MCRAE, J. E. (1982), “Some methods for Studying the Shape of Hierarchical Trees,” Unpublished Bell Laboratories Technical Memorandum.Google Scholar
  9. FRANK, O., and SVENSSON, K. (1981), “On Probability Distributions of Single-Linkage Dendrograms,”Journal of Statistical Computation and Simulation, 12, 121–131.Google Scholar
  10. FURNAS, G. W. (1981), “The Construction of Random Terminally Labeled, Binary Trees,” Unpublished Bell Laboratories Technical Memorandum.Google Scholar
  11. GOBEL, F. (1981), “On a 1-1-Correspondence between Rooted Trees and Natural Numbers,”Journal of Combinatorial Theory Series B,29, 141–143.Google Scholar
  12. HARARY, F. (1969),Graph Theory, Reading, Mass: Addison-Wesley, Chapter 4.Google Scholar
  13. HARDING, E. F. (1971), “The Probabilities of Rooted Tree-Shapes Generated by Random Bifurcation,”Advances in Applied Probability, 3, 44–77.Google Scholar
  14. KNOTT, G. D. (1977), “A Numbering System for Binary Trees,”Communication of the Association for Computing machinery, 20(2).Google Scholar
  15. KNUTH, D. E., (1973),The Art of Computer Programming, second edition. Volume I. Reading, Mass: Addison-Wesley.Google Scholar
  16. KOZINA, A. V. (1979), “Coding and Generation of Nonisomorphic Trees,”Kibernetika (USSR) 15, 38–43, Trans. in:Cybernetics (USA) 15, 645–651.Google Scholar
  17. LAPTIN, Y. P. (1981), “An External Problem on Random Trees,” Plenum Publishing Corporation. Translated fromKibernetika, 2, 95–97.Google Scholar
  18. MEIR, A., and MOON, J. W. (1970), “The Distance between Points in Random Trees,”Journal of Combinatorial Theory, 8, 99–103.Google Scholar
  19. MURTAGH, F. (1983), “A Probability Theory of Hierarchic Clustering using Random Dendrograms,”Journal of Statistical Computation and Simulation, 18, 145–157.Google Scholar
  20. MURTAGH, F. (1984), “Counting Dendrograms: A Survey,”Discrete Applied Mathematics, 7, 191–199.Google Scholar
  21. NIJENHUIS, A., and WILF, H. S. (Eds.) (1978),Combinatorial Algorithms for Computers and Calculators, Second Edition, New York: Academic Press.Google Scholar
  22. ODEN, N. L. and SHAO, K-T (1984), “An Algorithm to Equiprobably Generate all Directed Trees with K Labeled Terminal Nodes and Unlabeled Interior Nodes,”Bulletin of Mathematical Biology, in press.Google Scholar
  23. PROSKUROWSKI, A. (1980), “On the Generation of Binary Trees,”Journal of the Association for Computing Machinery, 27, 1–2.Google Scholar
  24. PRUZANSKY, S., TVERSKY, A., and CARROLL, J. D. (1982), “Spatial vs. Tree Representations of Proximity Data,”Psychometrika, 47, 3–24.Google Scholar
  25. READ, R. C. (1972), “The Coding of Various Kinds of Unlabeled Trees,” InGraph Theory and Computing, ed. R. C. Read, New York: Academic Press, 153–182.Google Scholar
  26. READ, R. C. (1978), “Every one a Winner or How to Avoid Isomorphism Search when Cataloguing Combinatorial Configurations,”Annals of Discrete Mathematics, 2, 107–120.Google Scholar
  27. ROBINSON, R. W., and SCHWENK, A. J. (1975), “The Distribution of Degrees in a Large Random Tree,”Discrete Mathematics, 12, 359–372.Google Scholar
  28. ROHLF, F. J. (1983), “Numbering Binary Trees with Labeled Terminal Vertices,”Bulletin of Mathematical Biology, 45, 33–40.Google Scholar
  29. ROTEM, D., and VAROL, Y. L. (1978), “Generation of Binary Trees from Ballot Sequences,”Journal of the Association for Computing Machinery, 25, 396–404.Google Scholar
  30. RUSKEY, F. (1978), “Generating t-ary Trees Lexicographically,”SIAM Journal of Computing, 7(4, 424–439.Google Scholar
  31. RUSKEY, F., and HU, T. C. (1977), “Generating Binary Trees Lexicographically,”SIAM Journal of Computing, 6, 748–758.Google Scholar
  32. SATTATH, S., and TVERSKY, A. (1977), “Additive Similarity Trees,”Psychometrika, 42, 319–345.Google Scholar
  33. SIBSON, R. (1972), “Order Invariant Methods for Data Analysis,”Journal of the Royal Statistical Society, B34, 311–349.Google Scholar
  34. SOLOMON, M., and FINKEL, R. A. (1980), “A Note on Enumerating Binary Trees,”Journal of the Association for Computing Machinery, 27, 3–5.Google Scholar
  35. TROJANOWSKI, A. E. (1978), “Ranking and Listing Algorithms for k-ary Trees,”SIAM Journal of Computing, 7, 492–509.Google Scholar
  36. WILF, H. S. (1981), “The Uniform Selection of Free Trees,”Journal of Algorithms, 2, 204–207.Google Scholar
  37. ZAKS, S. (1982), “Generation and Ranking of k-ary Trees,”Information Processing Letters (Netherlands),14, 44–48.Google Scholar
  38. ZAKS, S., and RICHARDS, D. (1979), “Generating Trees and other Combinatorial Objects Lexicographically,”SIAM Journal of Computing, 8, 73–81.Google Scholar

Copyright information

© Springer-Verlag 1984

Authors and Affiliations

  • George W. Furnas
    • 1
  1. 1.Bell Communications ResearchMorristownUSA

Personalised recommendations