Taxonomy-Driven Lumping for Sequence Mining

  • Francesco Bonchi
  • Carlos Castillo
  • Debora Donato
  • Aristides Gionis
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5781)

Abstract

In many application domains, events are naturally organized in a hierarchy. Whether events describe human activities, system failures, coordinates in a trajectory, or biomedical phenomena, there is often a taxonomy that should be taken into consideration. A taxonomy allow us to represent the information at a more general description level, if we choose carefully the most suitable level of granularity.

Given a taxonomy of events and a dataset of sequences of these events, we study the problem of finding efficient and effective ways to produce a compact representation of the sequences. This can be valuable by itself, or can be used to help solving other problems, such as clustering.

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Francesco Bonchi
    • 1
  • Carlos Castillo
    • 1
  • Debora Donato
    • 1
  • Aristides Gionis
    • 1
  1. 1.Yahoo! ResearchBarcelonaSpain

Personalised recommendations