Query Translation for Distributed Heterogeneous Structured and Semi-structured Databases

  • Fahad M. Al-Wasil
  • N. J. Fiddian
  • W. A. Gray
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4042)


The main purpose of building data integration systems is to facilitate access to a multitude of data sources. A data integration system must contain a module that uses source descriptions in order to reformulate user queries which are posed in terms of the composite global schema, into sub-queries that refer directly to the schemas of the component data sources. In this paper we propose a method for this user query translation task to target distributed heterogeneous structured data residing in relational databases and semi-structured data held in well-formed XML documents (XML documents which have no referenced DTD or XML schema) produced by Internet applications or human-coded. These XML documents can be XML files on local hard drives or remote documents on Web servers. Our method is based on mappings between the master (composite) view and the participating data source schema structures that are defined in a generated XML Metadata Knowledge Base (XMKB).


Relational Database User Query Document Type Definition Data Integration System Query Processor 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Hu, G., Fernandes, H.: Integration and querying of distributed databases. In: Proceedings of the IEEE International Conference on Information Reuse and Integration (IRI 2003), Las Vegas, NV, USA, October 27-29, pp. 167–174 (2003)Google Scholar
  2. 2.
    Segev, A., Chatterjee, A.: Data manipulation in heterogeneous databases. Sigmod Record 20(4), 64–68 (1991)CrossRefGoogle Scholar
  3. 3.
    Funderburk, J.E., et al.: XTABLES: Bridging Relational Technology and XML. IBM Systems Journal 41(4), 616–641 (2002)CrossRefGoogle Scholar
  4. 4.
    Shanmugasundaram, J., et al.: Efficiently Publishing Relational Data as XML Documents. In: Proceedings of the 26th International Conference on Very Large Databases (VLDB 2000), Cairo, Egypt, pp. 65–76 (September 2000)Google Scholar
  5. 5.
    Lehti, P., Fankhauser, P.: XML data integration with OWL: Experiences & challenges. In: Proceedings of the International Symposium on Applications and the Internet (SAINT 2004), Tokyo, Japan, pp. 160–170 (2004)Google Scholar
  6. 6.
    World Wide Web Consortium, Extensible Markup Language (XML) 1.0 W3C Recommendation, 3rd edn. (February 2004), http://www.w3.org/TR/2004/REC-xml-20040204/
  7. 7.
    Gardarin, G., Sha, F., Dang-Ngoc, T.: XML-based Components for Federating Multiple Heterogeneous Data Sources. In: Akoka, J., Bouzeghoub, M., Comyn-Wattiau, I., Métais, E. (eds.) ER 1999. LNCS, vol. 1728, pp. 506–519. Springer, Heidelberg (1999)Google Scholar
  8. 8.
    Lee, K., Min, J., Park, K.: A Design and Implementation of XML-Based Mediation Framework (XMF) for Integration of Internet Information Resources. In: HICSS 2002: Proceedings of the 35th Annual Hawaii International Conference on System Sciences (HICSS 2002), vol. 2, pp. 202–210. IEEE Computer Society, Los Alamitos (2002)Google Scholar
  9. 9.
    Kurgan, L., Swiercz, W., Cios, K.: Semantic Mapping of XML Tags using Inductive Machine Learning. In: Proceedings of the International Conference on Machine Learning and Applications - ICMLA 2002, Las Vegas, Nevada, USA (2002)Google Scholar
  10. 10.
    Young-Kwang, N., Joseph, G., Guilian, W.: A Metadata Integration Assistant Generator for Heterogeneous Distributed Databases. In: Meersman, R., Tari, Z., et al. (eds.) CoopIS 2002, DOA 2002, and ODBASE 2002. LNCS, vol. 2519, pp. 1332–1344. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  11. 11.
    Al-Wasil, F.M., Gray, W.A., Fiddian, N.J.: Establishing an XML Metadata Knowledge Base to Assist Integration of Structured and Semi-structured Databases. In: ADC 2006: Proceedings of The Seventeenth Australasian Database Conference, Tasmania, Australia, January 16 - 19 (2006)Google Scholar
  12. 12.
    Wiederhold, G.: Mediators in the Architecture of Future Information System. IEEE Computer 25(3), 38–49 (1992)Google Scholar
  13. 13.
    Carey, M.J., et al.: Towards heterogeneous multimedia information systems: the Garlic approach. In: RIDE 1995: Proceedings of the 5th International Workshop on Research Issues in Data Engineering-Distributed Object Management (RIDE-DOM 1995), pp. 124–131. IEEE Computer Society Press, Los Alamitos (1995)CrossRefGoogle Scholar
  14. 14.
    Kirk, T., et al.: The Information Manifold. In: Proceedings of the AAAI Spring Symposium on Information Gathering from Heterogeneous. Distributed Environments, pp. 85–91. Stanford University, Stanford (1995)Google Scholar
  15. 15.
    Tomasic, A., Raschid, L., Valduriez, P.: Scaling access to heterogeneous data sources with DISCO. IEEE Transactions on Knowledge and Data Engineering 10(5), 808–823 (1998)CrossRefGoogle Scholar
  16. 16.
    Ullman, J.D.: Information Integration Using Logical Views. In: Afrati, F.N., Kolaitis, P.G. (eds.) ICDT 1997. LNCS, vol. 1186, pp. 19–40. Springer, Heidelberg (1996)Google Scholar
  17. 17.
    Christophides, V., Cluet, S., Simèon, J.: On wrapping query languages and efficient XML integration. In: Proceedings of ACM SIGMOD Conference on Management of Data, Dallas, Texas, USA (May 2000)Google Scholar
  18. 18.
    Baru, C., et al.: XML-based information mediation with MIX. In: SIGMOD 1999: Proceedings of ACM SIGMOD International Conference on Management of Data, pp. 597–599. ACM Press, New York (1999)CrossRefGoogle Scholar
  19. 19.
    Papakonstantinou, Y., Garcia-Molina, H., Ullman, J.D.: MedMaker: A Mediation System Based on Declarative Specifications. In: ICDE 1996: Proceedings of the 12th International Conference on Data Engineering, pp. 132–141. IEEE Computer Society, Los Alamitos (1996)Google Scholar
  20. 20.
    Manolescu, I., Florescu, D., Kossmann, D.: Answering XML Queries over Heterogeneous Data Sources. In: Proceedings of the 27th International Conference on Very Large Data Bases (VLDB), Rome, Italy (September 2001)Google Scholar
  21. 21.
    Lenzerini, M.: Data integration: a theoretical perspective. In: Proceedings of the 21st ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, Madison, Wisconsin (2002) Google Scholar
  22. 22.
    Levy, A., et al.: Answering queries using views. In: Proceedings of the 14th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, San Jose, CA, USA (1995)Google Scholar
  23. 23.
    Fernndez, M., Tan, W.-C., Suciu, D.: SilkRoute: Trading between Relations and XML. In: Proceedings of the Ninth International World Wide Web Conference, Amsterdam (May 15 - 19, 2000)Google Scholar
  24. 24.
    Shanmugasundaram, J., et al.: Querying XML Views of Relational Data. In: Proceedings of the 27th International Conference on Very Large Data Bases (VLDB), Rome, Italy (September 2001)Google Scholar
  25. 25.
    World Wide Web Consortium XQuery 1.0: An XML Query Language, W3C Working Draft (November 2003), http://www.w3.org/TR/xquery/

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Fahad M. Al-Wasil
    • 1
  • N. J. Fiddian
    • 1
  • W. A. Gray
    • 1
  1. 1.School of Computer ScienceCardiff UniversityCardiffUK

Personalised recommendations