Abstract
When distributed, heterogeneous digital libraries have to be integrated, one of the crucial tasks is mapping between their different schemas. Because schemas can differ in granularity, and because schema attributes do not always match precisely, a general-purpose schema mapping approach must support uncertain mappings. In this paper we present one of the very few approaches for defining and using uncertain schema mappings. We combine several technologies: DAML+OIL, probabilistic Datalog (since DAML+OIL, like similar ontology languages, lacks rules), and XSLT for actually transforming queries and documents. This declarative approach is fully implemented in the project MIND (which develops methods for retrieval in networked multimedia digital libraries). However, as DAML+OIL lacks some important features, the proposed approach is only a stepping stone toward an integrated solution.
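To illustrate what an uncertain mapping between schema attributes might look like, the following is a minimal Python sketch, not the MIND implementation. It assumes each mapping rule carries a probability, that a derived fact's weight is the product of the rule probability and the source fact's probability, and that alternative derivations of the same fact are combined as independent events. The attribute names, rule weights, and the `map_document` helper are hypothetical and do not come from the paper.

```python
from collections import defaultdict

# Hypothetical uncertain mapping rules:
# (source attribute, target attribute, rule probability).
RULES = [
    ("creator", "author", 1.0),
    ("contributor", "author", 0.4),
    ("title", "title", 1.0),
]


def map_document(facts, rules=RULES):
    """Map weighted source facts into the target schema.

    facts: iterable of (attribute, value, probability) triples.
    Returns a dict {(target attribute, value): probability}.
    """
    # Collect the probability of every single derivation of a target fact.
    derivations = defaultdict(list)
    for attr, value, p_fact in facts:
        for src, dst, p_rule in rules:
            if src == attr:
                derivations[(dst, value)].append(p_rule * p_fact)

    # Assumption: derivations are independent events, so the fact holds
    # unless every derivation fails.
    result = {}
    for key, probs in derivations.items():
        p_none = 1.0
        for p in probs:
            p_none *= 1.0 - p
        result[key] = 1.0 - p_none
    return result


if __name__ == "__main__":
    source_doc = [
        ("creator", "A. Author", 1.0),
        ("contributor", "B. Helper", 0.5),
        ("title", "Some title", 1.0),
    ]
    for (attr, value), prob in sorted(map_document(source_doc).items()):
        print(f"{prob:.2f} {attr}({value!r})")
```

The independence assumption keeps the sketch short; a rule engine that tracks how each fact was derived could combine overlapping derivations more precisely.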
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nottelmann, H., Fuhr, N. (2003). Combining DAML+OIL, XSLT and Probabilistic Logics for Uncertain Schema Mappings in MIND. In: Koch, T., Sølvberg, I.T. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2003. Lecture Notes in Computer Science, vol 2769. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45175-4_19
DOI: https://doi.org/10.1007/978-3-540-45175-4_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40726-3
Online ISBN: 978-3-540-45175-4
eBook Packages: Springer Book Archive