Searching a minimal semantically-equivalent subset of a set of partial values

The VLDB Journal

Abstract

Imprecise data exist in databases because values are unavailable or because of data/schema incompatibilities in a multidatabase system. Partial values have been used to represent imprecise data. Manipulation of partial values is therefore necessary to process queries involving imprecise data. In this article, we study the problem of eliminating redundant partial values that result from a projection on an attribute with partial values. The redundancy of partial values is defined through the interpretation of a set of partial values. This problem is equivalent to searching for a minimal semantically-equivalent subset of a set of partial values. A semantically-equivalent subset contains exactly the same information as the original set. We derive a set of useful properties and apply a graph matching technique to develop an efficient algorithm for finding such a minimal subset and thereby eliminating redundant partial values. This process not only provides a concise answer to the user, but also reduces the communication cost when partial values must be transmitted from one site to another in a distributed environment. Moreover, further manipulation of the partial values is simplified. This work is also extended to the case of multi-attribute projections.
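To make the notion of redundancy concrete, the following Python sketch may help. It is not the graph-matching algorithm of the article, which the abstract does not spell out. It is a brute-force illustration under a stated assumption: a partial value is modeled as a finite set of candidate values, an interpretation of a set of partial values is the set of definite values obtained by choosing one candidate from each, and two collections are semantically equivalent when they admit the same interpretations. The function names `interpretations` and `minimal_equivalent_subset` are hypothetical.

```python
from itertools import product


def interpretations(partial_values):
    """Return every set of definite values obtainable by picking
    one candidate from each partial value (assumed semantics)."""
    return {frozenset(choice) for choice in product(*partial_values)}


def minimal_equivalent_subset(partial_values):
    """Drop partial values one at a time whenever the remaining
    collection still admits exactly the same interpretations.

    This is exhaustive enumeration, exponential in the input size.
    It yields a subset from which nothing more can be dropped, and it
    only illustrates the definition; it is not an efficient method.
    """
    kept = list(partial_values)
    target = interpretations(kept)
    i = 0
    while i < len(kept):
        candidate = kept[:i] + kept[i + 1:]
        if interpretations(candidate) == target:
            kept = candidate      # kept[i] was redundant; recheck same index
        else:
            i += 1                # kept[i] carries information; keep it
    return kept


if __name__ == "__main__":
    # {"a", "b"} adds nothing once "a" and "b" are both known to occur.
    print(minimal_equivalent_subset([{"a"}, {"b"}, {"a", "b"}]))
```

Under this assumed semantics, the collection `[{"a"}, {"b"}, {"a", "b"}]` has the single interpretation `{"a", "b"}`, and so does `[{"a"}, {"b"}]`, so the third partial value is redundant. Two copies of `{"a", "b"}` are not redundant with each other, because together they also admit the interpretation in which both `a` and `b` occur.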




Cite this article

Tseng, F.S.C., Chen, A.L.P. & Yang, W.P. Searching a minimal semantically-equivalent subset of a set of partial values. VLDB Journal 2, 489–512 (1993). https://doi.org/10.1007/BF01263049


  • DOI: https://doi.org/10.1007/BF01263049
