Advertisement

Dynamic Data Analysis of Evolving Association Patterns

  • Alfonso Iodice D’Enza
  • Francesco Palumbo
Conference paper
Part of the Studies in Classification, Data Analysis, and Knowledge Organization book series (STUDIES CLASS)

Abstract

Dealing with large amounts of data or data flows, it can be convenient or necessary to process them in different ‘pieces’; if the data in question refer to different occasions or positions in time or space, a comparative analysis of data stratified in batches can be suitable. The present approach combines clustering and factorial techniques to study the association structure of binary attributes over homogeneous subsets of data; moreover, it seeks to update the result as new statistical units are processed in order to monitor and describe the evolutionary patterns of association.

Keywords

Statistical Unit Multiple Correspondence Analysis Association Structure Binary Attribute Homogeneous Subset 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. Arabie, P., & Hubert, L. (1994). Cluster analysis in marketing research. IEEE Transactions on Automatic Control,19, 716–723.Google Scholar
  2. Borg, I., & Groenen, P. (2005). Modern multidimensional scaling. New York: Springer.Google Scholar
  3. Brijs, T., Swinnen, G., Vanhoof, K., & Wets, G. (1999). Using association rules for product assortment decisions: A case study. In Proceedings of the fifth ACM SIGKDD international conference on knowledge discovery and data mining, San Diego, California, United States (pp. 254–260). New York: ACM.Google Scholar
  4. Greenacre, M. J. (2007) Correspondence analysis in practice (2nd ed.). Boca Raton: Chapman and Hall/CRC.Google Scholar
  5. Hwang, H., Dillon, W. R., & Takane, Y. (2006). An extension of multiple correspondence analysis for identifying heterogenous subgroups of respondents. Psychometrika,71, 161–171.Google Scholar
  6. Iodice D’Enza, A., & Greenacre, M.J. (2010). Multiple correspondence analysis for the quantification and visualization of large categorical data sets. In Proceedings of SIS09 Statistical Methods for the Analysis of Large Data-Sets, Pescara. Padova: CLEUP.Google Scholar
  7. Mirkin B. (2001). Eleven ways to look at the Chi-squared coefficient for contingency tables. The American Statistician,55(2), 111–120.Google Scholar
  8. Palumbo F., & Iodice D’Enza A. (2010). A two-step iterative procedure for clustering of binary sequences. In: Data analysis And classification (pp. 50–60). Berlin: Springer.Google Scholar
  9. Vichi M., & Kiers H. (2001). Factorial k-means analysis for two way data. Computational Statistics and Data Analysis,37(1), 49–64.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  1. 1.Università di CassinoCassinoItaly
  2. 2.Università degli Studi di Napoli Federico IINaplesItaly

Personalised recommendations