Summary
Based in a generalised recursive tree-building algorithm for populations partitioned into strata a method to obtain simple descriptions of strata is presented. Also strata with a common rule are obtained. Common predictors and criterion variable describe population in all strata or classes of individuals. Algorithm considers strata structure in tree-building algorithm and combines in each step maximisation of an information content measure for the criterion variable in a new binary partition of the population and selection of decisional nodes, based in quality of prediction for subsets of strata. Each decisional tree node is composed of a set of strata and a rule for individuals in these strata that will jointly explain the criterion variable.
Symbolic data analysis fits the method. Input of the algorithm is composed of classes of individuals. Algorithm is extended to individuals described by probabilistic symbolic objects. As output, symbolic objects describe tree, decisional nodes and strata.
Similar content being viewed by others
References
Bravo, M.C. & García-Santesmases, J.M. (1997), Segmentation trees for stratified data, in ‘Applied stochastic models and data analysis: the ins and outs of solving real problems’, Jansen, J. & Lauro C.N.(Eds.), Rocco Curto Ed, 37–42.
Bravo, M.C. & García-Santesmases, J.M. (1998), Symbolic object description of strata by segmentation trees, in ‘NTTS’98 International seminar on new techniques & technologies for Statistics’, Estudio Idea Ed, Napoli, 85–90.
Bravo Llatas, M.C. & García-Santesmases, J.M. (1999), Segmentation trees for stratified data, in ‘Analysis of symbolic data. Exploratory methods for extracting statistical information from complex data’, Bock, H.H & Diday, E. (Eds.), Springer Verlag, Heidelberg, 266–293.
Breiman, L., Friedman, J.H., Olshen, R.A. & Stone, C.J. (1984), Classification and Regression Trees. Wadsworth, Belmond, Ca.
Ciampi A. (1992), Constructing prediction trees from data: the RECPAM approach in ‘Proceedings from the Prague 1991 summer school on computational aspects of model choice’, Physica-Verlag, 105–152.
Ciampi, A., Diday, E., Lebbe, J., Perinel, E. & Vignes, R. (1996), Recursive partition with probabilistically imprecise data in ‘Ordinal and symbolic data analysis’, E. Diday et al. (Eds.), Springer Verlag, 201–212.
Diday, E. (1993), An introduction to symbolic data analysis, Tutorial of the 4th conference of IFCS, Rapport INRIA no. 1936, Paris.
Dubois, D., Xiu M. & Prade, H. (1991), Fuzzy discrimination trees, in ‘Fuzzy engineering toward human friendly systems, part II: fuzzy technology, IFES’91’ 250–260.
Quinlan, J.R. (1986), ‘Induction of Decision Trees’, Machine Learning I, 81–106.
— (1990), Probabilistic decision trees, in ‘Machine learning, an artificial intelligence approach, vol. III’, Kodratoff Y. & Michalski, R. (Eds.), Morgan Kaufmann Publishers, 140–152.
Author information
Authors and Affiliations
Corresponding author
Additional information
Partially supported by the European project Esprit-20821
Rights and permissions
About this article
Cite this article
Bravo, M.C., García-Santesmases, J.M. Symbolic object description of strata by segmentation trees. Computational Statistics 15, 13–24 (2000). https://doi.org/10.1007/s001800050032
Published:
Issue Date:
DOI: https://doi.org/10.1007/s001800050032