The Maximum Box Problem and its Application to Data Analysis

Eckstein, Jonathan; Hammer, Peter L.; Liu, Ying; Nediak, Mikhail; Simeone, Bruno

doi:10.1023/A:1020546910706

The Maximum Box Problem and its Application to Data Analysis

Published: December 2002

Volume 23, pages 285–298, (2002)
Cite this article

Download PDF

Computational Optimization and Applications Aims and scope Submit manuscript

The Maximum Box Problem and its Application to Data Analysis

Download PDF

Jonathan Eckstein¹,
Peter L. Hammer²,
Ying Liu²,
Mikhail Nediak² &
…
Bruno Simeone³

283 Accesses
47 Citations
Explore all metrics

Abstract

Given two finite sets of points X ⁺ and X ⁻ in \(\mathbb{R}^n\) ⁿ, the maximum box problem consists of finding an interval (“box”) B = {x : l ≤ x ≤ u} such that B ∩ X ⁻ = ∅, and the cardinality of B ∩ X ⁺ is maximized. A simple generalization can be obtained by instead maximizing a weighted sum of the elements of B ∩ X ⁺. While polynomial for any fixed n, the maximum box problem is \( {\mathcal{N}}{\mathcal{P}}\)-hard in general. We construct an efficient branch-and-bound algorithm for this problem and apply it to a standard problem in data analysis. We test this method on nine data sets, seven of which are drawn from the UCI standard machine learning repository.

Article PDF

A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science

A random forest guided tour

Article 19 April 2016

Gérard Biau & Erwan Scornet

Learning from positive and unlabeled data: a survey

Article 02 April 2020

Jessa Bekker & Jesse Davis

References

E.M.L. Beale, “Branch and bound methods for mathematical programming systems.” Ann. Discrete Math., vol. 5, pp. 201–219, 1979. Also in Discrete Optimization (Proc. Adv. Res. Inst. Discrete Optimization and Systems Appl., Banff, Alberta, 1977), vol. II.
Google Scholar
R.E. Bixby, M. Fenelon, Z. Gu, E. Rothberg, and R. Wunderling, “MIP: Theory and practice-Closing the gap,” in System Modeling And Optimization (Cambridge, 1999), Kluwer Acad. Publ.: Boston, MA, 2000, pp. 19–49.
Google Scholar
C.L. Blake and C.J. Merz, “UCI repository of machine learning databases,” University of California, Irvine, Department of Information and Computer Sciences, 1998. Also available at http://www.ics. uci.edu/~mlearn/MLRepository.html.
Google Scholar
E. Boros, P.L. Hammer, T. Ibaraki, and A. Kogan, “Logical analysis of numerical data,” Mathematical Programming, vol. 79, pp. 163–190, 1997.
Google Scholar
E. Boros, P.L. Hammer, T. Ibaraki, A. Kogan, E. Mayoraz, and I. Muchnik, “An implementation of logical analysis of data,” IEEE Transactions of Knowledge and Data Engineering, vol. 12, no. 2, pp. 292–306, 2000.
Google Scholar
E. Boros, T. Ibaraki, L. Shi, and M. Yagiura, “Generating all ‘good’ patterns in polynomial expected time,” lecture at the 6th International Symposium on Artificial Intelligence and Mathematics, Ft. Lauderdale, Florida, January 2000.
Y. Crama, P.L. Hammer, and T. Ibaraki, “Cause-effect relationships and partially defined Boolean functions,” Annals of Operations Research, vol. 16, pp. 299–325, 1988.
Google Scholar
P.L. Hammer, A. Kogan, B. Simeone, and S. Szedmak, “Pareto-optimal patterns in logical analysis of data,” RUTCOR Research Report 7-2001, 2001.
Tjen-Sien Lim, Wei-Yin Loh, and Yu-Shan Shih, “Acomparison of prediction accuracy, complexity, and training time of thirty-three old and new classification algorithms,” Machine Learning, vol. 40, pp. 203–228, 2000.
Google Scholar

Download references

Author information

Authors and Affiliations

Rutgers Business School and RUTCOR, Rutgers University, 640 Bartholomew Road, Piscataway, NJ, 08854, USA
Jonathan Eckstein
RUTCOR, Rutgers University, 640 Bartholomew Road, Piscataway, NJ, 08854, USA
Peter L. Hammer, Ying Liu & Mikhail Nediak
Department of Statistics, “La Sapienza”, University, Piazzale Aldo Moro 5, 00185, Rome, Italy
Bruno Simeone

Authors

Jonathan Eckstein
View author publications
You can also search for this author in PubMed Google Scholar
Peter L. Hammer
View author publications
You can also search for this author in PubMed Google Scholar
Ying Liu
View author publications
You can also search for this author in PubMed Google Scholar
Mikhail Nediak
View author publications
You can also search for this author in PubMed Google Scholar
Bruno Simeone
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Eckstein, J., Hammer, P.L., Liu, Y. et al. The Maximum Box Problem and its Application to Data Analysis. Computational Optimization and Applications 23, 285–298 (2002). https://doi.org/10.1023/A:1020546910706

Download citation

Issue Date: December 2002
DOI: https://doi.org/10.1023/A:1020546910706

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

The Maximum Box Problem and its Application to Data Analysis

Abstract

Article PDF

Similar content being viewed by others

A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science

A random forest guided tour

Learning from positive and unlabeled data: a survey

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Navigation

The Maximum Box Problem and its Application to Data Analysis

Abstract

Article PDF

Similar content being viewed by others

A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science

A random forest guided tour

Learning from positive and unlabeled data: a survey

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation