Distributed and Parallel Databases

, Volume 1, Issue 3, pp 281–302 | Cite as

Answering heterogeneous database queries with degrees of uncertainty

  • Frank S. C. Tseng
  • Arbee L. P. Chen
  • Wei-Pang Yang
Article

Abstract

In heterogeneous database systems,partial values have been used to resolve some schema integration problems. Performing operations on partial values may producemaybe tuples in the query result which cannot be compared. Thus, users have no way to distinguish which maybe tuple is the most possible answer. In this paper, the concept of partial values is generalized toprobabilistic partial values. We propose an approach to resolve the schema integration problems using probabilistic partial values and develop a full set of extended relational operators for manipulating relations containing probabilistic partial values. With this approach, the uncertain answer tuples of a query are associated with degrees of uncertainty (represented by probabilities). That provides users a comparison among maybe tuples and a better understanding on the query results. Besides, extended selection and join are generalized to α-selection and α-join, respectively, which can be used to filter out maybe tuples with low probabilities — those which have probabilities smaller than α.

Keywords

Heterogeneous database systems probabilistic partial values schema integration problems uncertain query answers 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    D. Barbará, H. Garcia-Molina, and D. Porter, “A probabilistic relational data model,”Lecture Notes in Computer Science: Advances in Database Technology — EDBT'90, vol. 416, Springer-Verlag: Berlin, 1990, pp. 60–74.Google Scholar
  2. 2.
    C. Batini, M. Lenzerini, and S.B. Navathe, “A comparative analysis of methodologies for database schema integration,”ACM Comput. Surveys, vol. 18, pp. 323–364, 1986.Google Scholar
  3. 3.
    J. Biskup, “A foundation of Codd's relational maybe operations,”ACM Trans. Database Systems, vol. 8, pp. 608–636, 1983.Google Scholar
  4. 4.
    Y. Breitbart, “Multidatabase interoperability,”SIGMOD RECORD, vol. 19, pp. 53–60, 1990.Google Scholar
  5. 5.
    Y. Breitbart, P.L. Olson, and G.R. Thompson, “Database integration in a distributed heterogeneous database system,”Proc. IEEE Int. Conf. Data Eng., 1986, pp. 301–310.Google Scholar
  6. 6.
    A.L.P. Chen, “Outerjoin optimization in multidatabase systems,”Proc. IEEE Int. Symp. Databases in Parallel and Distributed Systems (DPDS), 1990, pp. 211–218.Google Scholar
  7. 7.
    A.L.P. Chen, “A localized approach to distributed query processing,”Lecture Notes in Computer Science: Advances in Database Technology — EDBT'90, vol. 416, Springer-Verlag: Berlin, 1990, pp. 188–202.Google Scholar
  8. 8.
    E.F. Codd, “Extending the database relational model to capture more meaning,”ACM Trans. Database Systems, vol. 4, pp. 397–434, 1979.Google Scholar
  9. 9.
    E.F. Codd, “Missing information (applicable and inapplicable) in relational databases,”SIGMOD RECORD, vol. 15, pp. 53–78, 1986.Google Scholar
  10. 10.
    B. Czejdo, M. Rusinkiewicz, and D.W. Embley, “An approach to schema integration and query formulation in federated database systems,”Proc. IEEE Int. Conf. Data Eng., 1987, pp. 477–484.Google Scholar
  11. 11.
    C.J. Date, “The outer join,”Proc. 2nd Int. Conf. Databases (ICOD-2), 1983.Google Scholar
  12. 12.
    U. Dayal and H.Y. Hwang, “View definition and generalization for database integration in a multidatabase system,”IEEE Trans. Software Eng., vol. 10, pp. 628–644, 1984.Google Scholar
  13. 13.
    S.M. Deen, R.R. Amin, and M.C. Taylor, “Data integration in distributed databases,”IEEE Trans. Software Eng., vol. 13, pp. 860–864, 1987.Google Scholar
  14. 14.
    L.G. DeMichiel, “Resolving database incompatibility: an approach to performing relational operations over mismatched domains,”IEEE Trans. Knowledge Data Eng., vol. 1, pp. 485–493, 1989.Google Scholar
  15. 15.
    J. Grant, “Partial values in a tabular database model,”Inform. Proces. Lett., vol. 9, pp. 97–99, 1979.Google Scholar
  16. 16.
    D. Heimbigner and D. McLeod, “A federated architecture for information management,”ACM Trans. Office Inform. Systems, vol. 3, pp. 253–278, 1985.Google Scholar
  17. 17.
    J.A. Larson, S.B. Navathe, and R. Elmasri, “A theory of attribute equivalence in databases with application to schema integration,”IEEE Trans. Software Eng., vol. 15, pp. 449–463, 1989.Google Scholar
  18. 18.
    W. Litwin and A. Abdellatif, “An overview of the multi-database manipulation language MDSL,”Proc. IEEE, vol. 75, pp. 621–632, 1987.Google Scholar
  19. 19.
    W. Litwin, A. Abdellatif, B. Nicolas, Ph. Vigier, and A. Zeronnal, “MSQL: a multi-database manipulation language,”Inform. Sci., vol. 49, pp. 59–101, 1987.Google Scholar
  20. 20.
    W. Litwin and Ph. Vigier, “Dynamic attributes in the multidatabase system MRDSM,”Proc. IEEE Int. Conf. Data Eng., pp. 103–110, 1986.Google Scholar
  21. 21.
    P.L. Meyer,Introductory Probability and Statistical Applications, 2nd ed., Addison-Wesley: Reading, MA, 1970.Google Scholar
  22. 22.
    A. Motro, “Superviews: virtual integration of multiple databases,”IEEE Trans. Software Eng., vol. 13, pp. 785–798, 1987.Google Scholar
  23. 23.
    P.S.M. Tsai and A.L.P. Chen, “Querying uncertain data in heterogeneous databases,”Proc. IEEE Int. Workshop Research Issues on Data Eng. (RIDE), to appear, 1993.Google Scholar
  24. 24.
    F.S.C. Tseng, A.L.P. Chen, and W.P. Yang, “Generalizing the division operation on indefinite databases,”Proc. 2nd Far-East Workshop Future Database Systems, Koyto, Japan, 1992, pp. 347–354.Google Scholar
  25. 25.
    F.S.C. Tseng, A.L.P. Chen, and W.P. Yang, “Evaluating aggregate operations over imprecise data,” submitted.Google Scholar
  26. 26.
    F.S.C. Tseng, A.L.P. Chen, and W.P. Yang, “Searching a minimal semantically-equivalent subset of a set of partial values,”Int. J. Very Large Data Bases, to appear.Google Scholar
  27. 27.
    UNIX TM Time-Sharing System: Unix Programmer's Manual, 7th ed., vol. 2B, Bell Lab., 1979.Google Scholar

Copyright information

© Kluwer Academic Publishers 1993

Authors and Affiliations

  • Frank S. C. Tseng
    • 1
  • Arbee L. P. Chen
    • 2
  • Wei-Pang Yang
    • 1
  1. 1.Department of Computer Science and Information EngineeringNational Chiao Tung UniversityHsinchuTaiwan, 30050 ROC
  2. 2.Department of Computer ScienceNational Tsing Hua UniversityHsinchuTaiwan, 30043 ROC

Personalised recommendations