Mediators over taxonomy-based information sources

  • Regular Paper
The VLDB Journal

Abstract.

We propose a mediator model for providing integrated and unified access to multiple taxonomy-based sources. Each source comprises a taxonomy and a database that indexes objects under the terms of the taxonomy. A mediator comprises a taxonomy and a set of relations between the mediator’s and the sources’ terms, called articulations. By combining different modes of query evaluation at the sources and the mediator and different types of query translation, a flexible, efficient scheme of mediator operation is obtained that can accommodate various application needs and levels of answer quality. We adopt a simple conceptual modeling approach (taxonomies and intertaxonomy mappings) and we illustrate its advantages in terms of ease of use, uniformity, scalability, and efficiency. These characteristics make this proposal appropriate for a large-scale network of sources and mediators.
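The architecture described in the abstract can be illustrated with a minimal sketch. It is not the authors' formal model: the class names (`Source`, `Mediator`), the helper `reachable`, the sample terms and objects, and the reading of an articulation as "this source term is subsumed by this mediator term" are all assumptions made for illustration. Only one evaluation mode and one translation type are shown, whereas the paper combines several.

```python
from collections import defaultdict


def reachable(narrower, term):
    """Return the term plus every term reachable through narrower-term links."""
    seen, stack = set(), [term]
    while stack:
        t = stack.pop()
        if t not in seen:
            seen.add(t)
            stack.extend(narrower.get(t, ()))
    return seen


class Source:
    """Hypothetical source: a taxonomy plus an index of objects under its terms."""

    def __init__(self, name):
        self.name = name
        self.narrower = defaultdict(set)  # term -> directly narrower terms
        self.index = defaultdict(set)     # term -> objects indexed under it

    def add_subsumption(self, narrow, broad):
        self.narrower[broad].add(narrow)

    def add_object(self, obj, term):
        self.index[term].add(obj)

    def answer(self, term):
        # Objects indexed under the term or under any narrower term.
        result = set()
        for t in reachable(self.narrower, term):
            result |= self.index.get(t, set())
        return result


class Mediator:
    """Hypothetical mediator: its own taxonomy plus articulations to source terms."""

    def __init__(self, sources):
        self.sources = {s.name: s for s in sources}
        self.narrower = defaultdict(set)
        # mediator term -> {(source name, source term)}; assumed meaning:
        # the source term is subsumed by the mediator term.
        self.articulations = defaultdict(set)

    def add_subsumption(self, narrow, broad):
        self.narrower[broad].add(narrow)

    def articulate(self, mediator_term, source_name, source_term):
        self.articulations[mediator_term].add((source_name, source_term))

    def translate(self, term):
        # Collect source terms articulated under the term or any narrower term.
        pairs = set()
        for t in reachable(self.narrower, term):
            pairs |= self.articulations.get(t, set())
        return pairs

    def answer(self, term):
        # Forward each translated query to its source and merge the answers.
        result = set()
        for source_name, source_term in self.translate(term):
            result |= self.sources[source_name].answer(source_term)
        return result


# Illustrative data only.
s1 = Source("s1")
s1.add_subsumption("Dogs", "Animals")
s1.add_object("rex", "Dogs")
s1.add_object("tweety", "Animals")

s2 = Source("s2")
s2.add_subsumption("Canines", "Mammals")
s2.add_object("fido", "Canines")
s2.add_object("dumbo", "Mammals")

mediator = Mediator([s1, s2])
mediator.add_subsumption("Dog", "Animal")
mediator.articulate("Dog", "s1", "Dogs")
mediator.articulate("Dog", "s2", "Canines")
mediator.articulate("Animal", "s1", "Animals")
mediator.articulate("Animal", "s2", "Mammals")

print(sorted(mediator.answer("Dog")))  # ['fido', 'rex']
```

In this sketch the mediator stores no objects of its own. It answers a query purely by translating its term into source terms and delegating to the sources, which is the property that would let such mediators be layered over other mediators in a larger network.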

Author information

Corresponding author

Correspondence to Yannis Tzitzikas.

Additional information

Received: 29 November 2001, Accepted: 30 October 2003, Published online: 6 February 2004

Edited by: M. Missikoff

Yannis Tzitzikas: Current address: Istituto di Scienza e Tecnologie dell’Informazione, ISTI-CNR, Pisa, Italy

About this article

Cite this article

Tzitzikas, Y., Spyratos, N. & Constantopoulos, P. Mediators over taxonomy-based information sources. The VLDB Journal 14, 112–136 (2005). https://doi.org/10.1007/s00778-003-0119-8

  • DOI: https://doi.org/10.1007/s00778-003-0119-8
