
A family of NP-complete data aggregation problems

Acta Informatica

Summary

We consider a family of general aggregation problems and prove each of its members to be NP-complete in the strong sense. These problems require that we partition a set of objects into “aggregates”. The goal is to minimize the expected cost of satisfying an anticipated collection of requests for subsets of the objects, where the cost of satisfying a request includes both the number and the sizes of the aggregates which must be retrieved. The aggregation problems are viewed as very basic versions of important database optimization problems, including: the partitioning of data items into record types, the clustering of records into physical blocks of storage, and the partitioning of a database into granules to support locking. The NP-completeness results demonstrate that such optimization problems are intractable, even when simplified to the extreme. The fact that the problems are NP-complete in the strong sense also rules out pseudopolynomial time solutions, unless P = NP.
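To make the objective concrete, the following is a minimal sketch of one possible cost model, not the formulation used in the article. The summary states only that the cost of a request depends on both the number and the sizes of the aggregates retrieved. The sketch therefore assumes that each retrieved aggregate incurs a fixed charge plus a charge proportional to its size, and that every aggregate containing at least one requested object must be retrieved in full. The names `expected_cost`, `set_partitions`, `best_partition`, `fixed_cost`, and `size_cost` are hypothetical. The search is exhaustive and so takes exponential time, which is practical only for a handful of objects.

```python
def expected_cost(partition, requests, fixed_cost=1.0, size_cost=1.0):
    """Expected retrieval cost of a partition under the assumed cost model.

    partition: iterable of frozensets (the aggregates).
    requests:  iterable of (probability, set_of_objects) pairs.
    """
    total = 0.0
    for probability, wanted in requests:
        for aggregate in partition:
            # Assumption: an aggregate is fetched whole if it holds any requested object.
            if aggregate & wanted:
                total += probability * (fixed_cost + size_cost * len(aggregate))
    return total


def set_partitions(items):
    """Yield every partition of items as a list of frozensets."""
    items = list(items)
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for smaller in set_partitions(rest):
        # Either place `first` in an aggregate of its own ...
        yield [frozenset([first])] + smaller
        # ... or add it to one of the existing aggregates.
        for i, block in enumerate(smaller):
            yield smaller[:i] + [block | {first}] + smaller[i + 1:]


def best_partition(objects, requests, **cost_params):
    """Brute-force search over all partitions; exponential in len(objects)."""
    return min(
        set_partitions(objects),
        key=lambda p: expected_cost(p, requests, **cost_params),
    )


# Three objects and two equally likely requests.
requests = [(0.5, {"a", "b"}), (0.5, {"c"})]
best = best_partition("abc", requests)
```

With these example requests and the default charges, the search groups `a` with `b` and leaves `c` alone, for an expected cost of 2.5. Placing all three objects in one aggregate costs 4.0, because every request then pays for the full aggregate.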




Cite this article

Helman, P. A family of NP-complete data aggregation problems. Acta Informatica 26, 485–499 (1989). https://doi.org/10.1007/BF00289148
