Abstract
Logical Analysis of Data deals with the classification of huge data set by boolean formulas and their synthetic representation by ternary string, referred to as patterns. In this context, the simple pattern minimality problem (SPMP) arises. It consists in determining the minimum number of patterns “explaining” an initial data set of binary strings. This problem is equivalent to the minimum disjunctive normal form problem and, hence, it has been widely tackled by set covering based heuristic approaches. In this work, we describe and tackle a particular variant of the SPMP coming from an application arising in the car industry production field. The main difference with respect to SPMP tackled in literature resides in the fact that the determined patterns must be partitions and not covers of the initial binary string data set. The problem is solved by an effective and fast heuristic, tested on several large size instances coming from a real application.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Avella, P., Boccia, M., Di Martino, C., Oliviero, G., Sforza, A., Vasilev, I.: A decomposition approach for a very large scale optimal diversity management problem. 4OR: Q. J. Oper. Res. 3(1), 23–37 (2005)
Briant, O., Naddef, D.: The optimal diversity management problem. Oper. Res. 52(4), 515–526 (2004)
Garey, M.R., Johnson, D.S.: Computers and intractability. A guide to the theory of NP-completeness. A Series of Books in the Mathematical Sciences. W. H. Freeman and Company, San Francisco (1979)
Hammer, P.L.: Partially defined boolean functions and cause-effect relationships. In: Lecture at the International Conference on Multi-attrubute Decision Making Via OR-Based Expert Systems. University of Passau, Passau, Germany (1986)
Lancia, G., Serafini P.: A set-covering approach with column generation for parsimony haplotyping. JOC: J. Comput. 21(1), 151–166 (2009)
Lancia, G., Serafini P.: The complexity of some pattern problems in the logical analysis of large genomic data sets. Sets. In: Ortuño, F., Rojas, I. (eds.), Bioinformatics and Biomedical Engineering. IWBBIO 2016. Lecture Notes in Computer Science, vol. 9656. Springer, Cham
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Boccia, M., Masone, A., Sforza, A., Sterle, C. (2017). A Partitioning Based Heuristic for a Variant of the Simple Pattern Minimality Problem. In: Sforza, A., Sterle, C. (eds) Optimization and Decision Science: Methodologies and Applications. ODS 2017. Springer Proceedings in Mathematics & Statistics, vol 217. Springer, Cham. https://doi.org/10.1007/978-3-319-67308-0_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-67308-0_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67307-3
Online ISBN: 978-3-319-67308-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)