Knowledge and Information Systems

, Volume 50, Issue 2, pp 569–584

Attributed graph mining in the presence of automorphism

  • Claude Pasquier
  • Frédéric Flouvat
  • Jérémy Sanhes
  • Nazha Selmaoui-Folcher
Regular Paper

Abstract

Attributed directed graphs are directed graphs in which nodes are associated with sets of attributes. Many data from the real world can be naturally represented by this type of structure, but few algorithms are able to directly handle these complex graphs. Mining attributed graphs is a difficult task because it requires combining the exploration of the graph structure with the identification of frequent itemsets. In addition, due to the combinatorics on itemsets, subgraph isomorphisms (which have a significant impact on performances) are much more numerous than in labeled graphs. In this paper, we present a new data mining method that can extract frequent patterns from one or more directed attributed graphs. We show how to reduce the combinatorial explosion induced by subgraph isomorphisms thanks to an appropriate processing of automorphic patterns.

Keywords

Attributed graph Frequent pattern mining Automorphism Structure mining Itemset mining 

References

  1. 1.
    Agrawal R, Imieliński T, Swami A (1993) Mining association rules between sets of items in large databases. SIGMOD Rec 22(2):207–216CrossRefGoogle Scholar
  2. 2.
    Agrawal R, Srikant R (1995) Mining sequential patterns. In: ICDE’95, pp 3–14Google Scholar
  3. 3.
    Ayres J, Flannick J, Gehrke J, Yiu T (2002) Sequential pattern mining using a bitmap representation. In: KDD’02, pp 429–435Google Scholar
  4. 4.
    Borgelt C (2007) Canonical forms for frequent graph mining. In: Decker R, Lenz H-J (eds) Advances in data analysis. Springer, Berlin, pp 337–349CrossRefGoogle Scholar
  5. 5.
    Borgelt C, Berthold M (2002) Mining molecular fragments: finding relevant substructures of molecules. In: ICDM’02, pp 51–58Google Scholar
  6. 6.
    Bringmann B, Nijssen S (2008) What is frequent in a single graph?. In: PAKDD’08, pp 858–863Google Scholar
  7. 7.
    Chi Y, Yang Y, Xia Y, Muntz RR (2004) Cmtreeminer: mining both closed and maximal frequent subtrees. In: PAKDD’04, pp 63–73Google Scholar
  8. 8.
    Fukuzaki M, Seki M, Kashima H, Sese J (2010) Finding itemset-sharing patterns in a large itemset-associated graph. In: PAKDD’10, pp 147–159Google Scholar
  9. 9.
    Huan J, Wang W, Prins J (2003) Efficient mining of frequent subgraphs in the presence of isomorphism. In: ICDM’05, pp 549–552Google Scholar
  10. 10.
    Inokuchi A, Washio T, Motoda H (2000) An apriori-based algorithm for mining frequent substructures from graph data. In: PKDD’00, pp 13–23Google Scholar
  11. 11.
    Inokuchi A, Washio T, Motoda H (2003) Complete mining of frequent patterns from graphs: mining graph data. Mach Learn 50(3):321–354CrossRefMATHGoogle Scholar
  12. 12.
    Jiang C, Coenen F, Zito M (2013) A survey of frequent subgraph mining algorithms. Knowl Eng Rev 28:75–105CrossRefGoogle Scholar
  13. 13.
    Johnsonbaugh R, Kalin M (1991) A graph generation software package. SIGCSE Bull 23(1):151–154CrossRefGoogle Scholar
  14. 14.
    Kuramochi M, Karypis G (2001) Frequent subgraph discovery. In: ICDM’01, pp 313–320Google Scholar
  15. 15.
    Kuramochi M, Karypis G (2004) An efficient algorithm for discovering frequent subgraphs. IEEE Trans Knowl Data Eng 16(9):1038–1051CrossRefGoogle Scholar
  16. 16.
    Mannila H, Toivonen H (2005) Multiple uses of frequent sets and condensed representations. In: KDD’05, pp 189–194Google Scholar
  17. 17.
    McAuley J, Leskovec J (2012) Learning to discover social circles in ego networks. Neural Inf Process Syst 25:548–556Google Scholar
  18. 18.
    Miyoshi Y, Ozaki T, Ohkawa T (2009) Frequent pattern discovery from a single graph with quantitative itemsets. In: ICDMW’09, pp 527–532Google Scholar
  19. 19.
    Pasquier C, Sanhes J, Flouvat F, Selmaoui-Folcher N (2015) Frequent pattern mining in attributed trees: algorithms and applications. Knowl Inf Syst 46(3):491–514CrossRefGoogle Scholar
  20. 20.
    Pasquier N, Bastide Y, Taouil R, Lakhal L (1999) Discovering frequent closed itemsets for association rules. In: ICDT’99, pp 398–416Google Scholar
  21. 21.
    Wörlein M, Meinl T, Fischer I, Philippsen M (2005) A quantitative comparison of the subgraph miners mofa, gspan, ffsm, and gaston. In: PKDD’05, pp 392–403Google Scholar
  22. 22.
    Yan X, Han J (2002) gspan: graph-based substructure pattern mining. In: ICDM’02, pp 721–724Google Scholar
  23. 23.
    Yan X, Han J (2003) CloseGraph: mining closed frequent graph patterns. In: KDD’03, pp 286–295Google Scholar
  24. 24.
    Yan X, Yu PS, Han J (2004) Graph indexing: a frequent structure-based approach. In: SIGMOD conference, pp 335–346Google Scholar

Copyright information

© Springer-Verlag London 2016

Authors and Affiliations

  • Claude Pasquier
    • 1
    • 2
    • 3
  • Frédéric Flouvat
    • 3
  • Jérémy Sanhes
    • 3
  • Nazha Selmaoui-Folcher
    • 3
  1. 1.Univ. Nice Sophia Antipolis, I3S, UMR 7271Sophia AntipolisFrance
  2. 2.CNRS, I3S, UMR 7271Sophia AntipolisFrance
  3. 3.Multidisciplinary Research Team on Material and Environment (PPME)University of New CaledoniaNouméaNew Caledonia

Personalised recommendations