A family of NP-complete data aggregation problems

Helman, Paul

doi:10.1007/BF00289148

Published: March 1989

Volume 26, pages 485–499, (1989)

Acta Informatica

Paul Helman¹

Summary

We consider a family of general aggregation problems and prove each of its members to be NP-complete in the strong sense. These problems require that we partition a set of objects into “aggregates”. The goal is to minimize the expected cost of satisfying an anticipated collection of requests for subsets of the objects, where the cost of satisfying a request includes both the number and the sizes of the aggregates which must be retrieved. The aggregation problems are viewed as very basic versions of important database optimization problems, including: the partitioning of data items into record types, the clustering of records into physical blocks of storage, and the partitioning of a database into granules to support locking. The NP-completeness results demonstrate that such optimization problems are intractable, even when simplified to the extreme. The fact that the problems are NP-complete in the strong sense also rules out pseudopolynomial time solutions, unless P = NP.
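As a rough illustration of the kind of problem described above, the following Python sketch scores a partition of objects against a set of weighted requests and finds the cheapest partition by exhaustive search. The summary does not give the paper's exact cost function, so the charge used here (a fixed cost per retrieved aggregate plus a cost proportional to its size) is an assumption, as are the names `expected_cost`, `set_partitions`, `best_partition`, `fixed_cost`, and `unit_cost`. The exhaustive search examines every partition, so its running time grows with the Bell numbers and it is usable only on tiny inputs.

```python
def expected_cost(partition, requests, fixed_cost=1.0, unit_cost=1.0):
    """Expected retrieval cost of a partition (assumed cost model).

    partition: iterable of frozensets (the aggregates).
    requests:  iterable of (probability, set_of_objects) pairs.
    A request must retrieve every aggregate it overlaps; each retrieved
    aggregate is charged fixed_cost + unit_cost * size (an assumption).
    """
    total = 0.0
    for prob, wanted in requests:
        for aggregate in partition:
            if aggregate & wanted:
                total += prob * (fixed_cost + unit_cost * len(aggregate))
    return total


def set_partitions(items):
    """Yield every partition of a list as a list of blocks (lists)."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for smaller in set_partitions(rest):
        # Place `first` into each existing block in turn...
        for i, block in enumerate(smaller):
            yield smaller[:i] + [[first] + block] + smaller[i + 1:]
        # ...or give it a block of its own.
        yield [[first]] + smaller


def best_partition(items, requests, fixed_cost=1.0, unit_cost=1.0):
    """Exhaustively search for a minimum-expected-cost partition."""
    candidates = (
        [frozenset(block) for block in blocks]
        for blocks in set_partitions(list(items))
    )
    return min(
        candidates,
        key=lambda p: expected_cost(p, requests, fixed_cost, unit_cost),
    )
```

For example, with objects `a`, `b`, `c`, `d` and two equally likely requests for `{a, b}` and `{c, d}`, `best_partition("abcd", [(0.5, {"a", "b"}), (0.5, {"c", "d"})])` groups each request's objects into its own aggregate. Under this assumed cost model, both over-large aggregates (which charge for unwanted objects) and over-small aggregates (which charge for many retrievals) are penalized.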

Author information

Authors and Affiliations

Department of Computer Science, University of New Mexico, Albuquerque, NM 87131, USA
Paul Helman

About this article

Cite this article

Helman, P. A family of NP-complete data aggregation problems. Acta Informatica 26, 485–499 (1989). https://doi.org/10.1007/BF00289148

Received: 01 August 1988
Issue Date: March 1989
DOI: https://doi.org/10.1007/BF00289148
