Abstract
In 1987, Diday added a new dimension to data analysis with his fundamental paper introducing the notions of symbolic data and their analyses. He and his colleagues, among others, have developed innumerable techniques to analyse symbolic data; yet even more is waiting to be done. One area that has seen much activity in recent years involves the search for a measure of dependence between two symbolic random variables. This paper presents a covariance function for interval-valued data. It also discusses how the total, between-interval, and within-interval variations relate; in particular, this relationship shows that a covariance function based only on interval midpoints does not capture all of the variation in the data. While important in its own right, the covariance function also plays a central role in many multivariate methods.
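The relationship between total, between-interval, and within-interval variation can be illustrated with a minimal sketch. It is not taken from the chapter: it assumes values are uniformly distributed within each interval, uses the divisor n, and computes the total from the familiar sum-of-squares form of the interval sample variance. The function name `interval_variance_components` and the sample data are hypothetical.

```python
from statistics import fmean


def interval_variance_components(intervals):
    """Return (total, between, within) variance for (a, b) intervals.

    Assumption (not from the chapter): values are uniformly distributed
    within each interval, and variances use the divisor n.
    """
    n = len(intervals)
    mids = [(a + b) / 2 for a, b in intervals]
    mean = fmean(mids)  # the sample mean is the mean of the midpoints

    # Between-interval part: ordinary variance of the midpoints.
    between = sum((m - mean) ** 2 for m in mids) / n

    # Within-interval part: a uniform variable on [a, b] has variance (b - a)^2 / 12.
    within = sum((b - a) ** 2 for a, b in intervals) / (12 * n)

    # Total variance computed directly from the interval endpoints.
    total = sum(a * a + a * b + b * b for a, b in intervals) / (3 * n) - mean ** 2
    return total, between, within


# Hypothetical data: three observations of one interval-valued variable.
data = [(1.0, 3.0), (2.0, 6.0), (4.0, 5.0)]
total, between, within = interval_variance_components(data)
print(f"total={total:.4f} between={between:.4f} within={within:.4f}")
```

Under these assumptions the total equals the sum of the two parts, so a variance based only on midpoints returns the between-interval part and leaves out the within-interval part whenever any interval has nonzero width.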
References
BERTRAND, P. and GOUPIL, F. (2000): Descriptive statistics for symbolic data. In: H.-H. Bock and E. Diday (Eds.): Analysis of Symbolic Data: Exploratory Methods for Extracting Statistical Information from Complex Data. Springer-Verlag, Berlin, 103–124.
BILLARD, L. and DIDAY, E. (2000): Regression analysis for interval-valued data. In: H.A.L. Kiers, J.-P. Rasson, P.J.F. Groenen, and M. Schader (Eds.): Data Analysis, Classification, and Related Methods. Springer-Verlag, Berlin, 369–374.
BILLARD, L. and DIDAY, E. (2002): Symbolic regression analysis. In: K. Jajuga, A. Sokolowski, and H.-H. Bock (Eds.): Classification, Clustering, and Data Analysis. Springer-Verlag, Berlin, 281–288.
BILLARD, L. and DIDAY, E. (2003): From the statistics of data to the statistics of knowledge: Symbolic data analysis. Journal of the American Statistical Association 98, 470–487.
BILLARD, L. and DIDAY, E. (2006): Symbolic Data Analysis: Conceptual Statistics and Data Mining. Wiley, Chichester.
BOCK, H.-H. and DIDAY, E. (Eds.) (2000): Analysis of Symbolic Data: Exploratory Methods for Extracting Statistical Information from Complex Data. Springer-Verlag, Berlin.
CORSARO, S. and MARINO, M. (2006): Interval linear systems: the state of the art. Computational Statistics 21, 365–384.
DE CARVALHO, F.A.T., LIMA NETO, E.A. and TENORIO, C.P. (2004): A new method to fit a linear regression model for interval-valued data. In: Lecture Notes in Computer Science, KI2004 Advances in Artificial Intelligence, Springer-Verlag, 295–306.
DIDAY, E. (1987): Introduction à l’approche symbolique en analyse des données. Premières Journées Symbolique-Numérique. CEREMADE, Université Paris Dauphine, 21–56.
GIOIA, F. and LAURO, C.N. (2006): Principal component analysis on interval data. Computational Statistics 21, 343–363.
LAURO, C. and GIOIA, F. (2006): Dependence and interdependence analysis for interval-valued variables. In: V. Batagelj, H.-H. Bock, A. Ferligoj and A. Ziberna (Eds.): Data Science and Classification. Springer-Verlag, 171–183.
LIMA NETO, E.A., DE CARVALHO, F.A.T., and FREIRE, E.S. (2005): Applying constrained linear regression models to predict interval-valued data. In: U. Furbach (Ed.): Lecture Notes in Computer Science, KI: Advances in Artificial Intelligence. Springer-Verlag, Berlin, 92–106.
LIMA NETO, E.A., DE CARVALHO, F.A.T. and TENORIO, C.P. (2004): Univariate and multivariate linear regression methods to predict interval-valued features. In: Lecture Notes in Computer Science, AI 2004 Advances in Artificial Intelligence. Springer-Verlag, 526–537.
MARINO, M. and PALUMBO, F. (2003): Interval arithmetic for the evaluation of imprecise data effects in least squares linear regression. Statistica Applicata, 3.
MOORE, R.E. (1966): Interval Analysis. Prentice Hall, Englewood Cliffs, NJ.
PALUMBO, F. and LAURO, C.N. (2003): A PCA for interval valued data based on midpoints and radii. In: H. Yanai et al. (Eds.): New Developments in Psychometrics. Psychometric Society, Springer-Verlag, Tokyo.
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Billard, L. (2007). Dependencies and Variation Components of Symbolic Interval-Valued Data. In: Brito, P., Cucumel, G., Bertrand, P., de Carvalho, F. (eds) Selected Contributions in Data Analysis and Classification. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73560-1_1
DOI: https://doi.org/10.1007/978-3-540-73560-1_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73558-8
Online ISBN: 978-3-540-73560-1
eBook Packages: Mathematics and Statistics (R0)