Ranked Tiling

  • Thanh Le Van
  • Matthijs van Leeuwen
  • Siegfried Nijssen
  • Ana Carolina Fierro
  • Kathleen Marchal
  • Luc De Raedt
Conference paper

DOI: 10.1007/978-3-662-44851-9_7

Part of the Lecture Notes in Computer Science book series (LNCS, volume 8725)
Cite this paper as:
Le Van T., van Leeuwen M., Nijssen S., Fierro A.C., Marchal K., De Raedt L. (2014) Ranked Tiling. In: Calders T., Esposito F., Hüllermeier E., Meo R. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2014. Lecture Notes in Computer Science, vol 8725. Springer, Berlin, Heidelberg

Abstract

Tiling is a well-known pattern mining technique. Traditionally, it discovers large areas of ones in binary databases or matrices, where an area is defined by a set of rows and a set of columns. In this paper, we introduce the novel problem of ranked tiling, which is concerned with finding interesting areas in ranked data. In this data, each transaction defines a complete ranking of the columns. Ranked data occurs naturally in applications like sports or other competitions. It is also a useful abstraction when dealing with numeric data in which the rows are incomparable.

We introduce a scoring function for ranked tiling, as well as an algorithm using constraint programming and optimization principles. We empirically evaluate the approach on both synthetic and real-life datasets, and demonstrate the applicability of the framework in several case studies. One case study involves a heterogeneous dataset concerning the discovery of biomarkers for different subtypes of breast cancer patients. An analysis of the tiles by a domain expert shows that our approach can lead to the discovery of novel insights.

Keywords

tiling ranked data numerical data pattern mining 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2014

Authors and Affiliations

  • Thanh Le Van
    • 1
  • Matthijs van Leeuwen
    • 1
  • Siegfried Nijssen
    • 1
    • 2
  • Ana Carolina Fierro
    • 3
  • Kathleen Marchal
    • 3
    • 4
    • 5
  • Luc De Raedt
    • 1
  1. 1.Department of Computer ScienceKU LeuvenBelgium
  2. 2.Leiden Institute for Advanced Computer ScienceUniversiteit LeidenThe Netherlands
  3. 3.Department of Microbial and Molecular SystemsKU LeuvenBelgium
  4. 4.Department of Plant Biotechnology and BioinformaticsGhent UniversityBelgium
  5. 5.Department of Information TechnologyiMinds, Ghent UniversityBelgium

Personalised recommendations