Skip to main content

Uncertainty Handling for Distributed Database Integration and Knowledge Discovery

  • Chapter
Information, Uncertainty and Fusion

Part of the book series: The Springer International Series in Engineering and Computer Science ((SECS,volume 516))

  • 180 Accesses

Abstract

Distributed databases offer much as a potentially fruitful area for Knowledge Discovery since they allow us to combine data from different, heterogeneous, sources which have not previously been integrated. We provide a methodology for combining continuous or ordinal data that classifies by grouping contiguous values into the same class. Such data is obtained from data which have been horizontally fragmented within the distributed database management system, and which may be held at different levels of granularity. Integration is accomplished by using the intersection hypergraph to produce the integrated universal classification scheme and to determine the cardinalities of each class within the universal table.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. S. Dao and B. Perry. Applying a Data Miner to Heterogeneous Schema Integration. In Proceedings of the First Int. Conf on Knowledge Discovery and Data Mining, eds. UM Fayyad and R Uthurusamy,Montreal,63–68,1995.

    Google Scholar 

  2. W.J. Frawley, G. Piatetsky-Shapiro and C.J. Matheus. Knowledge Discovery in Databases: An Overview. In Knowledge Discovery in Databases eds. G Piatetsky-Shapiro and WJ Frawley, AAAI Press/The MIT Press, 1–27, 1991.

    Google Scholar 

  3. P. Höschka and W. Klösgen. A Support System for Interpreting Statistical Data. In Knowledge Discovery in Databases, eds. G Piatetsky-Shapeiro and W Frawley, AAAI Press/The MIT Press, 325–346, 1991.

    Google Scholar 

  4. M. Hosheimer and M. Kersten. A Perspective on Databases and Data Mining. In Proceedings of the First International Conference on Knowledge Discovery and Data Mining, eds. UM Fayyad and R Uthurusamy, Montreal, 150–155, 1995.

    Google Scholar 

  5. W. Klösgen. Knowledge Extraction: An Overview. In Proceedings of the Seminar on New Techniques and Technologies in Statistics (NTTS-95), eds. W Klösgen, Ph Nanopoulos and A Unwin, Bonn, 263–281, 1995.

    Google Scholar 

  6. J. Korczak, N. Louis and A. Ketterlin. An Approach to Conceptial Classification of International Trade Statistics. In Proceedings of the Seminar on New Techniques and Technologies in Statistics (NITS-95), eds. W Klosgen, Ph Nanopoulos, A Unwin, Bonn, 294–305, 1995.

    Google Scholar 

  7. T. Landers and R.L. Rosenberg. An Overview of Multibase. In Proceedings of the Proc. 2nd Int. Symp. on Distributed Databases, North Holland, 153–184, 1982.

    Google Scholar 

  8. F.M. Malvestuto. The Derivation Problem for Summary Data. In Proceedings of the ACM-SIGMOD 1988 Conference on Management of Data, ACM, New York, 87–96, 1988.

    Google Scholar 

  9. F.M. Malvestuto and M. Moscarini. Query Evaluability in Statistical Databases. IEEE Knowledge and Data Engineering, 2, 425–430, 1990.

    Article  Google Scholar 

  10. F.M. Malvestuto. A Universal-Scheme Approach to Statistical Databases Containing Homogeneous Summary Tables. ACM Transactions on Database Systems, 18, 678–708, 1993.

    Article  Google Scholar 

  11. S.I. McClean and B.W. Scotney. Probabilistic Partial Values for Distributed Database Integration. In Proceedings of Applied Decision Technologies, Brunel University, 155–182, 1995.

    Google Scholar 

  12. J.S. Ribeiro, K.A. Kaufman and L. Kerschberg. Knowledge Discovery from Multiple Databases. In Proceedings of the First Int. Conf. on Knowledge Discovery and Data Mining, eds. UM Fayyad and R Uthurusamy, Montreal, 240–245, 1995.

    Google Scholar 

  13. M.H. Sadreddini, D.A. Bell and S.I. McClean. A Model for Integration of Raw Data and Aggregate Views in Heterogeneous Statistical Databases. Database Technology, 4 (2), 115–127 1991.

    Google Scholar 

  14. M.H. Sadreddini, D.A. Bell and S.I. McClean. Providing Statistical Functionality in a Distributed Environment. In Proceedings of the Int. Conf. Survey and Statistical computing, eds. Westlake A, Banks R, Payne C and Orchard T, north Holland, 467–476, 1992.

    Google Scholar 

  15. B.W. Scotney and S.I. McClean. Using Database Technology to Facilitate Statistical Analysis of Distributed Data. In New Techniques and Technologies for Statistics II, IOS Press, Amsterdam, 203–213, 1997.

    Google Scholar 

  16. B.W. Scotney, S.I. McClean and M.C. Rodgers, M.C. Optimal and Efficient Integration of Summary Tables in a Distributed Database. To appear in The Journal of Data and Knowledge Engineering, 1999.

    Google Scholar 

  17. B.W. Scotney and S.I. McClean. Efficient Knowledge Discovery through the Integration of Heterogeneous Data. To appear in Information and Software Technology, 41 (2), 1999.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer Science+Business Media New York

About this chapter

Cite this chapter

McClean, S.I., Scotney, B.W. (2000). Uncertainty Handling for Distributed Database Integration and Knowledge Discovery. In: Bouchon-Meunier, B., Yager, R.R., Zadeh, L.A. (eds) Information, Uncertainty and Fusion. The Springer International Series in Engineering and Computer Science, vol 516. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-5209-3_16

Download citation

  • DOI: https://doi.org/10.1007/978-1-4615-5209-3_16

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4613-7373-5

  • Online ISBN: 978-1-4615-5209-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics