LR-SDiscr: An Efficient Algorithm for Supervised Discretization

  • Habiba Drias
  • Hadjer Moulai
  • Nourelhouda Rehkab
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10751)


Discretization is the process of transforming continuous attributes into discrete ones. It is of great importance today, as continuous data arise in many domains such as health and industry. This paper describes a new supervised discretization method, LR-SDiscr (Left to Right Supervised Discretization), based on a left-to-right (LR) scanning technique. Using both merging and partitioning operations, LR-SDiscr discretizes the data in a single pass, which reduces the complexity of the process and ensures scalability. Because the algorithm accepts any discretization measure as input, various measures can be tested and compared. The preliminary results of experiments designed for classification purposes are encouraging.
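
The single-pass idea can be illustrated with a minimal sketch. It is not the authors' LR-SDiscr procedure, whose details are not given in this abstract; it only shows how a left-to-right scan might choose between merging and partitioning using a measure supplied as input. The function names (`lr_discretize`, `entropy`), the `min_gain` threshold, and the decision rule itself are assumptions made for illustration.

```python
from collections import Counter
from math import log2


def entropy(counts):
    """Shannon entropy of a class-count mapping (one possible measure)."""
    total = sum(counts.values())
    return -sum(c / total * log2(c / total) for c in counts.values() if c)


def lr_discretize(values, labels, measure=entropy, min_gain=0.1):
    """Return cut points found in one left-to-right scan of the sorted values.

    Hypothetical rule: the next block of equal values is merged into the
    current interval unless keeping it separate lowers the size-weighted
    measure by more than min_gain; in that case a cut point is emitted.
    """
    pairs = sorted(zip(values, labels), key=lambda p: p[0])
    if not pairs:
        return []

    # Group equal values into blocks so a cut never separates identical values.
    blocks = []
    for v, y in pairs:
        if blocks and blocks[-1][0] == v:
            blocks[-1][1][y] += 1
        else:
            blocks.append((v, Counter({y: 1})))

    cuts = []
    last_value, current = blocks[0][0], Counter(blocks[0][1])
    for v, counts in blocks[1:]:
        merged = current + counts
        n_cur, n_new = sum(current.values()), sum(counts.values())
        split_score = (n_cur * measure(current) + n_new * measure(counts)) / (n_cur + n_new)
        if measure(merged) - split_score > min_gain:
            cuts.append((last_value + v) / 2)  # partition: close the interval
            current = Counter(counts)
        else:
            current = merged  # merge: extend the current interval
        last_value = v
    return cuts


if __name__ == "__main__":
    print(lr_discretize([1, 2, 3, 10, 11, 12], ["a", "a", "a", "b", "b", "b"]))
```

Because the measure is an ordinary function argument, another criterion can be substituted for `entropy` without changing the scan, which mirrors the abstract's point that different measures can be tested and compared.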


Keywords: Data mining; Data pre-processing; Supervised classification; Supervised discretization; Division and merging framework; Scanner


