Abstract
This paper develops a framework of tree-based models for three-way data. Three-way data are measurements of variables on a sample of objects on different occasions (e.g., locations, time points, or categories of a factor); they arise when prior information plays a role in the analysis.
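As a minimal sketch of the data structure just described, a three-way data set can be held as an objects × variables × occasions array. The sizes, the random values, and the names `X` and `unfolded` below are illustrative assumptions, not taken from the paper.

```python
import numpy as np

# Hypothetical sizes: 5 objects, 3 variables, 2 occasions.
n_objects, n_variables, n_occasions = 5, 3, 2

rng = np.random.default_rng(0)
# X[i, j, k] is the value of variable j on object i at occasion k.
X = rng.normal(size=(n_objects, n_variables, n_occasions))

# Fixing the occasion gives an ordinary objects-by-variables matrix.
first_occasion = X[:, :, 0]

# Unfolding along the object mode places the occasion slices side by side,
# so column k * n_variables + j holds variable j at occasion k.
unfolded = X.transpose(0, 2, 1).reshape(n_objects, n_occasions * n_variables)
```

Unfolding in this way is one common route by which two-way methods are applied to three-way data; it discards the explicit occasion structure that three-way methods aim to keep.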
Three-way data can be analyzed by exploratory methods, i.e., the factorial approach (TUCKER, PARAFAC, CANDECOMP, etc.), as well as by confirmatory methods, i.e., the modelling approach (log-trilinear association models, simultaneous latent budget models, etc.).
Recently, we introduced a methodology for classification and regression trees designed specifically for three-way data. The main idea is to use a stratifying or instrumental variable to distinguish either groups of variables or groups of objects. Prior information thus enters the analysis, which yields a new framework of classification and regression trees for three-way data.
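To illustrate how a stratifying variable can change the evaluation of a candidate split, the sketch below scores a binary split by its Gini impurity decrease computed within each stratum and then averaged with stratum-size weights. This criterion and the function names (`gini`, `impurity_decrease`, `stratified_decrease`) are assumptions chosen for illustration; they are not the splitting criterion of the paper.

```python
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels (0.0 for an empty list)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def impurity_decrease(labels, goes_left):
    """Impurity decrease of a binary split, ignoring any stratification."""
    n = len(labels)
    if n == 0:
        return 0.0
    left = [y for y, g in zip(labels, goes_left) if g]
    right = [y for y, g in zip(labels, goes_left) if not g]
    return (gini(labels)
            - len(left) / n * gini(left)
            - len(right) / n * gini(right))

def stratified_decrease(labels, goes_left, strata):
    """Size-weighted average of the within-stratum impurity decreases."""
    n = len(labels)
    total = 0.0
    for s in set(strata):
        idx = [i for i, v in enumerate(strata) if v == s]
        ys = [labels[i] for i in idx]
        gs = [goes_left[i] for i in idx]
        total += len(idx) / n * impurity_decrease(ys, gs)
    return total

# Toy data (hypothetical): the split separates the classes perfectly
# inside each stratum, but in opposite directions.
labels = ["a", "a", "b", "b", "b", "b", "a", "a"]
goes_left = [True, True, False, False, True, True, False, False]
strata = [0, 0, 0, 0, 1, 1, 1, 1]

pooled = impurity_decrease(labels, goes_left)
conditional = stratified_decrease(labels, goes_left, strata)
```

On this toy data the pooled decrease is 0 while the stratified decrease is 0.5: the split looks uninformative when the strata are ignored and fully informative once they are taken into account.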
In this paper we introduce a tree-based method based on optimal scaling in order to account for the presence of groups of non-linearly correlated variables. The results of a real-world application to Tourist Satisfaction Analysis in Naples are also presented.
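One basic ingredient of optimal scaling is the replacement of category labels by numeric quantifications. As a hedged sketch of a single such step, and not of the full method in the paper: for a nominal variable and a fixed numeric target, the least-squares quantification of each category is the mean of the target within that category. The function name `nominal_quantification` and the toy values are hypothetical.

```python
from collections import defaultdict

def nominal_quantification(categories, target):
    """Least-squares quantification of each category: the within-category
    mean of a fixed numeric target (one step, with no normalisation)."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for c, t in zip(categories, target):
        sums[c] += t
        counts[c] += 1
    return {c: sums[c] / counts[c] for c in sums}

# Toy data (hypothetical): a satisfaction rating and a numeric score.
categories = ["low", "high", "low", "mid", "high"]
target = [1.0, 5.0, 3.0, 2.0, 7.0]

quant = nominal_quantification(categories, target)
# Numeric version of the categorical variable, usable by numeric methods.
scaled = [quant[c] for c in categories]
```

A full optimal scaling procedure would alternate steps of this kind with re-estimation of the model and would normalise the quantifications; both are omitted here.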
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tutore, V.A. (2011). Optimal Scaling Trees for Three-Way Data. In: Fichet, B., Piccolo, D., Verde, R., Vichi, M. (eds) Classification and Multivariate Analysis for Complex Data Structures. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13312-1_10
DOI: https://doi.org/10.1007/978-3-642-13312-1_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13311-4
Online ISBN: 978-3-642-13312-1
eBook Packages: Mathematics and Statistics (R0)