Advertisement

Identifying Relationships in Transactional Data

  • Melissa Rodrigues
  • João Gama
  • Carlos Abreu Ferreira
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7637)

Abstract

Association rules is the traditional way used to study market basket or transactional data. One drawback of this analysis is the huge number of rules generated. As a complement to association rules, Association Rules Network (ARN), based on Social Network Analysis (SNA) has been proposed by several researchers. In this work we study a real market basket analysis problem, available in a Belgian supermarket, using ARNs. We learn ARNs by considering the relationships between items that appear more often in the consequent of the association rules. Moreover, we propose a more compact variant of ARNs: the Maximal Itemsets Social Network. In order to assess the quality of these structures, we compute SNA based metrics, like weighted degree and utility of community.

Keywords

social network analysis association rules network 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules. In: 20th Intl. Conf. on Very Large Data Bases (VLDB 1994), pp. 487–499 (1994)Google Scholar
  2. 2.
    Albert, R., Barabasi, A.: Statistical Mechanisms of Complex Networks. Reviews of Modern Physics 74, 47–97 (2002)MathSciNetzbMATHCrossRefGoogle Scholar
  3. 3.
    Blondel, V.D., Guillaumem, J., Lambiotte, R., Lefebrvre, E.: Fast Unfolding of Communities in Large Networks. Journal of Statistical Mechanics: Theory and Experiment P10008, 1–12 (2008)Google Scholar
  4. 4.
    Brijs, T., Swinnen, G., Vanhoof, K., Wets, G.: Using Association Rules for Product Assortment Decisions – A Case Study. In: 5th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD 1999), pp. 254–260. ACM (1999)Google Scholar
  5. 5.
    Easley, D., Kleinberg, J.: Networks, Crowds, and Markets – Reasoning About a Highly Connected World. Cambridge University Press (2010)Google Scholar
  6. 6.
    Faceli, K., Lorena, A.C., Gama, J., Carvalho, A.: Inteligência Artificial – Uma Abordagem de Aprendizado de Máquina. Livros Técnicos e Científicos (2011)Google Scholar
  7. 7.
    Fruchterman, T., Reingold, E.M.: Graph Drawing by Force-Directed Placement. Software: Practice and Experience 21(11), 1129–1164 (1991)CrossRefGoogle Scholar
  8. 8.
    Kretschmer, H., Kretschmer, T.: Application of a New Centrality Measure for Social Network Analysis to Bibliometric and Webometric Data. In: 1st IEEE Int. Conf. of Digital Information Management (ICDIM 2006), pp. 199–204. IEEE (2006)Google Scholar
  9. 9.
    Lopez-Fernandez, L., Robles, G., Gonzalez-Barahona, J.M.: Applying Social Network Analysis to the Information in CVS Repositories. In: 1st International Workshop on Mining Software Repositories (MSR 2004), pp. 101–105 (2004)Google Scholar
  10. 10.
    Newman, M.E.J.: The Structure and Function of Complex Networks. Society for Industrial and Applied Mathematics Review 45(2), 167–256 (2003)zbMATHGoogle Scholar
  11. 11.
    Newman, M.E.J.: Fast Algorithm for Detecting Community Structure in Networks. Physical Review E 69(6), 066133, 1–5 (2004)CrossRefGoogle Scholar
  12. 12.
    Pandey, G., Chawla, S., Poon, S., Arunasalam, B., Davis, J.: Association Rules Network – Definition and Applications. Statistical Analysis and Data Mining 1(4), 260–279 (2009)MathSciNetCrossRefGoogle Scholar
  13. 13.
    Raeder, T., Chawla, N.: Market Basket Analysis with Networks. Social Network Analysis and Mining 1(2), 97–113 (2011)CrossRefGoogle Scholar
  14. 14.
    Gouda, K., Zaki, M.: Efficiently Mining Maximal Frequent Itemsets. In: 2001 IEEE International Conference on Data Mining (ICDM 2001), pp. 163–170. IEEE (2001)Google Scholar
  15. 15.
    Bastian, M., Heymann, S., Jacomy, M.: Gephi – An Open Source Software for Exploring and Manipulating Networks. In: 3rd Intl. AAAI Conf. on Weblogs and Social Media (ICWSM 2009), pp. 361–362. AAAI (2009)Google Scholar
  16. 16.
    R Core Team: R – A Language and Environment for Statistical. R Foundation for Statistical Computing (2012)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Melissa Rodrigues
    • 1
  • João Gama
    • 1
    • 2
  • Carlos Abreu Ferreira
    • 2
    • 3
  1. 1.FEP - University of PortoPortoPortugal
  2. 2.LIAAD-INESC TECPortoPortugal
  3. 3.ISEP - Polytechnic Institute of PortoPortoPortugal

Personalised recommendations