Abstract
Distributed databases offer much as a potentially fruitful area for Knowledge Discovery since they allow us to combine data from different, heterogeneous, sources which have not previously been integrated. We provide a methodology for combining continuous or ordinal data that classifies by grouping contiguous values into the same class. Such data is obtained from data which have been horizontally fragmented within the distributed database management system, and which may be held at different levels of granularity. Integration is accomplished by using the intersection hypergraph to produce the integrated universal classification scheme and to determine the cardinalities of each class within the universal table.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
S. Dao and B. Perry. Applying a Data Miner to Heterogeneous Schema Integration. In Proceedings of the First Int. Conf on Knowledge Discovery and Data Mining, eds. UM Fayyad and R Uthurusamy,Montreal,63–68,1995.
W.J. Frawley, G. Piatetsky-Shapiro and C.J. Matheus. Knowledge Discovery in Databases: An Overview. In Knowledge Discovery in Databases eds. G Piatetsky-Shapiro and WJ Frawley, AAAI Press/The MIT Press, 1–27, 1991.
P. Höschka and W. Klösgen. A Support System for Interpreting Statistical Data. In Knowledge Discovery in Databases, eds. G Piatetsky-Shapeiro and W Frawley, AAAI Press/The MIT Press, 325–346, 1991.
M. Hosheimer and M. Kersten. A Perspective on Databases and Data Mining. In Proceedings of the First International Conference on Knowledge Discovery and Data Mining, eds. UM Fayyad and R Uthurusamy, Montreal, 150–155, 1995.
W. Klösgen. Knowledge Extraction: An Overview. In Proceedings of the Seminar on New Techniques and Technologies in Statistics (NTTS-95), eds. W Klösgen, Ph Nanopoulos and A Unwin, Bonn, 263–281, 1995.
J. Korczak, N. Louis and A. Ketterlin. An Approach to Conceptial Classification of International Trade Statistics. In Proceedings of the Seminar on New Techniques and Technologies in Statistics (NITS-95), eds. W Klosgen, Ph Nanopoulos, A Unwin, Bonn, 294–305, 1995.
T. Landers and R.L. Rosenberg. An Overview of Multibase. In Proceedings of the Proc. 2nd Int. Symp. on Distributed Databases, North Holland, 153–184, 1982.
F.M. Malvestuto. The Derivation Problem for Summary Data. In Proceedings of the ACM-SIGMOD 1988 Conference on Management of Data, ACM, New York, 87–96, 1988.
F.M. Malvestuto and M. Moscarini. Query Evaluability in Statistical Databases. IEEE Knowledge and Data Engineering, 2, 425–430, 1990.
F.M. Malvestuto. A Universal-Scheme Approach to Statistical Databases Containing Homogeneous Summary Tables. ACM Transactions on Database Systems, 18, 678–708, 1993.
S.I. McClean and B.W. Scotney. Probabilistic Partial Values for Distributed Database Integration. In Proceedings of Applied Decision Technologies, Brunel University, 155–182, 1995.
J.S. Ribeiro, K.A. Kaufman and L. Kerschberg. Knowledge Discovery from Multiple Databases. In Proceedings of the First Int. Conf. on Knowledge Discovery and Data Mining, eds. UM Fayyad and R Uthurusamy, Montreal, 240–245, 1995.
M.H. Sadreddini, D.A. Bell and S.I. McClean. A Model for Integration of Raw Data and Aggregate Views in Heterogeneous Statistical Databases. Database Technology, 4 (2), 115–127 1991.
M.H. Sadreddini, D.A. Bell and S.I. McClean. Providing Statistical Functionality in a Distributed Environment. In Proceedings of the Int. Conf. Survey and Statistical computing, eds. Westlake A, Banks R, Payne C and Orchard T, north Holland, 467–476, 1992.
B.W. Scotney and S.I. McClean. Using Database Technology to Facilitate Statistical Analysis of Distributed Data. In New Techniques and Technologies for Statistics II, IOS Press, Amsterdam, 203–213, 1997.
B.W. Scotney, S.I. McClean and M.C. Rodgers, M.C. Optimal and Efficient Integration of Summary Tables in a Distributed Database. To appear in The Journal of Data and Knowledge Engineering, 1999.
B.W. Scotney and S.I. McClean. Efficient Knowledge Discovery through the Integration of Heterogeneous Data. To appear in Information and Software Technology, 41 (2), 1999.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer Science+Business Media New York
About this chapter
Cite this chapter
McClean, S.I., Scotney, B.W. (2000). Uncertainty Handling for Distributed Database Integration and Knowledge Discovery. In: Bouchon-Meunier, B., Yager, R.R., Zadeh, L.A. (eds) Information, Uncertainty and Fusion. The Springer International Series in Engineering and Computer Science, vol 516. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-5209-3_16
Download citation
DOI: https://doi.org/10.1007/978-1-4615-5209-3_16
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-7373-5
Online ISBN: 978-1-4615-5209-3
eBook Packages: Springer Book Archive