Advertisement

Semantic Web Search Engines

  • Mathieu d’AquinEmail author
  • Li Ding
  • Enrico Motta

Abstract

The last couple of years have seen an increasing growth in the amount of Semantic Web data made available, and exploitable, on the Web. Compared to the Web, one unique feature of the Semantic Web is its friendly interface with software programs. In order to better serve human users with software programs, supporting infrastructures for finding and selecting the distributed online Semantic Web data are needed. A number of Semantic Web search engines have emerged recently. These systems are based on different design principles and provide different levels of support for users and/or applications. In this chapter, a survey of these Semantic Web search engines is presented, together with the detailed description of the design of two prominent systems: Swoogle and Watson. The way these systems are used to enable domain applications and support cutting-edge research on Semantic Web technologies is also discussed. In particular, this chapter includes examples of a new generation of semantic applications that, thanks to Semantic Web search engines, exploit online knowledge at runtime, without the need for laborious acquisition in specific domains. In addition, through collecting large amounts of semantic content online, Semantic Web search engines such as Watson and Swoogle allow researchers to better understand how knowledge is formally published online and how Semantic Web technologies are used. In other terms, by mining the collected semantic documents, it becomes possible to get an overview and explore the Semantic Web landscape today.

The first section below (Sect. 16.1) presents a general overview of the area, including the main challenges, related systems, as well as an abstract specification of what is called Semantic Web search engines. It also includes a detailed overview of the two systems more specifically considered as case studies, Swoogle (Sect. 16.1.4) and Watson (Sect. 16.1.5). Section 16.2 shows how these systems are currently being used and applied, both as development platforms to make possible the realization of applications exploiting Semantic Web content (Sect. 16.2.1), and as research platforms, allowing one to better understand the content of the Semantic Web, how knowledge is published online and how it is structured. Finally, Sect. 16.3 briefly introduces other resources to be considered in the area of Semantic Web search engines, and Sect. 16.4 concludes the chapter.

Keywords

Semantic Content Keyword Search Query Expansion Semantic Data SPARQL Query 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. 1.
    Sowa, J.F.: Conceptual graphs summary. In: Nagle, T.E., Nagle, J.A., Gerholz, L.L., Eklund, P.W. (eds.) Conceptual Structures: Current Research and Practice, pp. 3–51. Ellis Horwood, New York (1992) ISBN:0-13-175878-0Google Scholar
  2. 2.
    Hobbs, J.R., Ferguson, G., Allen, J., Fikes, R., Hayes, P., McDermott, D., Niles, I.: Adam Pease, Austin Tate, Mabry Tyson, Richard Waldinger. A daml ontology of time. http://www.cs.rochester.edu/~ferguson/daml/daml-time-nov2002.txt (2002)
  3. 3.
    Chen, H., Perich, F., Finin, T., Joshi, A.: SOUPA: standard ontology for ubiquitous and pervasive applications. In: Proceedings of the First International Conference on Mobile and Ubiquitous Systems: Networking and Services (Mobiquitous 2004), Boston (2004)Google Scholar
  4. 4.
    Hayes, P.: RDF semantics. http://www.w3.org/TR/2004/REC-rdf-mt-20040210/ (2004)
  5. 5.
    Patel-Schneider, P.F., Hayes, P., Horrocks, I.: OWL web ontology language semantics and abstract syntax. http://www.w3.org/TR/2004/REC-owl-semantics-20040210/ (2004)
  6. 6.
    Ayers, D., Völkel, M.: Cool URIs for the semantic web, W3C Interest Group Note. http://www.w3.org/TR/cooluris/ (Sept 2010)
  7. 7.
    Prud’hommeaux, E., Seaborne, A.: SPARQL query language for RDF. http://www.w3.org/TR/2006/WD-rdf-sparql-query-20060220/ (2006)
  8. 8.
    Bizer, C.: The emerging web of linked data. IEEE Intell. Syst. 24, 87–92 (2009)CrossRefGoogle Scholar
  9. 9.
    Bizer, C., Heath, T., Berners-Lee, T.: Linked data, the story so far. Int. J. Semantic Web Inf. Syst. 5(3), 1–22 (2009)CrossRefGoogle Scholar
  10. 10.
    Androutsopoulos, I., Ritchie, G.D., Thanisch, P.: Natural language interfaces to databases: an introduction. Nat. Lang. Eng. 1(1), 29–81 (1995)CrossRefGoogle Scholar
  11. 11.
    Harth, A., Umbrich, J., Decker, S.: Multi-crawler: a pipelined architecture for crawling and indexing semantic web data. In: Proceedings of the Fifth International Semantic Web Conference (ISWC 2006), Athens, GA. Lecture Notes in Computer Science, vol. 4273, pp. 258–271. Springer, Berlin (2006)Google Scholar
  12. 12.
    Ding, L., Pan, R., Finin, T., Joshi, A., Peng, Y., Kolari, P.: Finding and ranking knowledge on the semantic web. In: Proceedings of the Fourth International Semantic Web Conference (ISWC 2005), Galway. Lecture Notes in Computer Science, vol. 3729, pp. 156–170. Springer, Heidelberg (2005)Google Scholar
  13. 13.
    Alani, H., Brewster, C., Shadbolt, N.: Ranking ontologies with aktiverank. In: Proceedings of the Fifth International Semantic Web Conference (ISWC 2006), Athens, GA. Lecture Notes in Computer Science, vol. 4273, pp. 1–15. Springer, Berlin (2006)Google Scholar
  14. 14.
    d’Aquin, M., Euzenat, J., Le Duc, C., Lewen, H.: Sharing and reusing aligned ontologies with cupboard. In: Demo, Proceedings of the Fifth International Conference on Knowledge Capture (K-CAP 2009), Los Angeles (2009)Google Scholar
  15. 15.
    Ding, L., Finin, T., Joshi, A., Pan, R., Cost, R.S., Peng, Y., Reddivari, P., Doshi, V., and Sachs, J.: Swoogle: a search and metadata engine for the semantic web. In: Proceedings of the 13th ACM International Conference on Information and Knowledge Management (CIKM 2004), Washington, DC, pp. 652–659 (2004)Google Scholar
  16. 16.
    Sherman, C.: Metacrawlers and metasearch engines. http://searchenginewatch.com/links/article.php/2156241 (last visited on March 2006) (2004)
  17. 17.
    Gulli, A., Signorini, A.: The indexable web is more than 11.5 billion pages. In: Proceedings of the 14th International World Wide Web Conference (WWW 2005) (poster paper), Chiba (2005)Google Scholar
  18. 18.
    Page, L., Brin, S., Motwani, R., Wino-grad, T.: The PageRank citation ranking: bringing order to the web. Technical report, Stanford Digital Library Technologies Project (1998)Google Scholar
  19. 19.
    d’Aquin, M., Sabou, M., Motta, E., Angeletou, S., Gridinoc, L., Lopez, V., Zablith, F.: What can be done with the semantic web? An overview of Watson-based applications. In: Proceedings of the Fifth Workshop on Semantic Web Applications and Perspectives (SWAP 2008), Rome (2008)Google Scholar
  20. 20.
    d’Aquin, M., Motta, E., Sabou, M., Angeletou, S., Gridinoc, L., Lopez, V., Guidi, D.: Toward a new generation of semantic web applications. IEEE Intell. Syst. 23(3), 20–28 (2008)CrossRefGoogle Scholar
  21. 21.
    Angeletou, S., Sabou, M., Specia, L., Motta, E.: Bridging the gap between folksonomies and the semantic web: an experience report. In: Proceedings of the International Workshop on Bridging the Gap between Semantic Web and Web 2.0 at the Fourth European Semantic Web Conference (ESWC 2007), Innsbruck (2007)Google Scholar
  22. 22.
    Peroni, S., Motta, E., d’Aquin, M.: Identifying key concepts in an ontology through the integration of cognitive principles with statistical and topological measures. In: Proceedings of the Third Asian Semantic Web Conference (ASWC 2008), Bangkok. Lecture Notes in Computer Science, vol. 5367, pp. 242–256. Springer, Berlin (2009)Google Scholar
  23. 23.
    d’Aquin, M., Motta, E., Sabou, M., et al.: Towards a new generation of semantic web applications. IEEE Intell. Syst. 23(3), 20–28 (2008)CrossRefGoogle Scholar
  24. 24.
    Sabou, M., d’Aquin, M., Motta, E.: Exploring the semantic web as background knowledge for ontology matching. J. Data Semant. XI. Lecture Notes in Computer Science, vol. 5383, pp. 156–190, doi: 10.1007/978-3-540-92148-6_6 (2008)Google Scholar
  25. 25.
    Tummarello, G., Cyganiak, R., Catasta, M., Danielczyk, S., Delbru, R., Decker, S.: Sigma: live views on the Web of Data. In: Demonstration at the Proceedings of the 19th World Wide Web Conference (WWW 2010), Raleigh (2010)Google Scholar
  26. 26.
    Tummarello, G., Oren, E., Delbru, R.: Sindice.com: weaving the open linked data. In: Proceedings of the Sixth International Semantic Web Conference and Second Asian Semantic Web Conference (ISWC/ASWC 2007), Busan. Lecture Notes in Computer Science, vol. 4825, pp. 547–560. Springer, Berlin (2007)Google Scholar
  27. 27.
    Finin, T.W., Sachs, J., Parr, C.S.: Finding data, knowledge, and answers on the semantic web. In: Proceedings of the 20th International Florida Artificial Intelligence Research Society Conference (FLAIRS 2007), Key West, pp. 2–7. AAAI, Menlo Park (2007)Google Scholar
  28. 28.
    Haase, P., Stojanovic, L.: Consistent evolution of OWL ontologies. In: Proceedings of the Second European Semantic Web Conference (ESWC 2005), Heraklion. Lecture Notes in Computer Science, vol. 3532, pp. 182–197. Springer, Berlin (2005)Google Scholar
  29. 29.
    Zablith, F.: Dynamic ontology evolution. In: Proceedings of the Seventh International Semantic Web Conference (ISWC 2008), Doctoral Consortium, Karlsruhe (2008)Google Scholar
  30. 30.
    Zablith, F., Sabou, M., d’Aquin, M., Motta, E.: Using background knowledge for ontology evolution. In: Proceedings of the Second International Workshop on Ontology Dynamics (IWOD 2008) Co-located with Seventh International Semantic Web Conference (ISWC 2008), Karlsruhe (2008)Google Scholar
  31. 31.
    Fernàndez, S., Berrueta, D., Shi, L., Labra, J.E., Ordóñez de Pablos, P.: Mailing lists and social semantic web. In: Patricia Ordonez de Pablos Miltiadis D. Lytras (ed.) Social Web Evolution: Integrating Semantic Applications and Web 2.0 Technologies. Social Computing: Concepts, Methodologies, Tools, and Applications Editor(s): Subhasish Dasgupta (George Washington University, USA), pp. 335–349 (2010)Google Scholar
  32. 32.
    Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: a framework and graphical development environment for robust NLP tools and applications. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL 2002), Philadelphia (2002)Google Scholar
  33. 33.
    Lopez, V., Sabou, M., Motta, E.: PowerMap: mapping the real semantic web on the fly. In: Proceedings of the International Semantic Web Conference (ISWC 2006), Athens, GA. Lecture Notes in Computer Science, vol. 4273, pp. 414–427. Springer, Berlin (2006)Google Scholar
  34. 34.
    Angeletou, S., Sabou, M., Motta, E.: Semantically enriching folksonomies with FLOR. In: Proceedings of the First International Workshop on Collective Semantics: Collective Intelligence and the Semantic Web (CISWeb 2008), Tenerife (2008)Google Scholar
  35. 35.
    Ding, L., Finin, T.: Characterizing the semantic web on the web. In: Proceedings of the Fifth International Semantic Web Conference (ISWC 2006), Athens, GA. Lecture Notes in Computer Science, vol. 4273, pp. 242–257. Springer, Berlin (2006)Google Scholar
  36. 36.
    Lawrence, S., Lee Giles, C.: Accessibility of information on the web. Nature 400, 107–109 (1999)CrossRefGoogle Scholar
  37. 37.
    Bharat, K., Broder, A.: A technique for measuring the relative size and overlap of public web search engines. In: Proceedings of the Seventh International Conference on World Wide Web (WWW 1998), Brisbane, pp. 379–388 (1998)Google Scholar
  38. 38.
    d’Aquin, M., Baldassarre, C., Gridinoc, L., Angeletou, S., Sabou, M., Motta, E.: Characterizing knowledge on the semantic web with Watson. In: Workshop on Evaluation of Ontologies and Ontology-Based Tools (EON 2007), Madrid (2007)Google Scholar
  39. 39.
    Wang, T.D., Parsia, B., Hendler, J.: A survey of the web ontology landscape. In: Proceedings of the Fifth International Semantic Web Conference, (ISWC 2006), Athens, GA. Lecture Notes in Computer Science, vol. 4273, pp. 682–694. Springer, Berlin (2006)Google Scholar
  40. 40.
    Allocca, C., d’Aquin, M., Motta, E.: Detecting different versions of ontologies in large ontology repositories. In: Proceedings of the Third International Workshop on Ontology Dynamics (IWOD 2009), Washington, DC. CEUR-WS Online Proceedings, vol. 519 (2009)Google Scholar
  41. 41.
    d’Aquin, M., Allocca, C., Motta, E.: A platform for semantic web studies. In: Web Science Conference (WebSci 2010), Poster Session, Raleigh (2010)Google Scholar
  42. 42.
    d’Aquin, M.: Formally measuring agreement and disagreement in ontologies. In: Proceedings of the Fifth International Conference on Knowledge Capture (K-CAP 2009), Los Angeles (2009)Google Scholar
  43. 43.
    Cheng, G., Ge, W., Qu, Y.: Falcons: searching and browsing entities on the semantic web. In: Proceedings of the 17th International Conference on World Wide Web (WWW 2008), Beijing, pp. 1101–1102. ACM, New York (2008)Google Scholar
  44. 44.
    Harth, A., Hogan, A., Delbru, R., Umbrich, J., O’Riain, S., Decker, S.: SWSE: Answers before links! In: Proceedings of the Semantic Web Challenge, CEUR Workshop Proceedings, vol. 295 (2007)Google Scholar
  45. 45.
    Buitelaar, P., Eigner, T., Declerck, T.: Ontose lect: a dynamic ontology library with support for ontology selection. In: Proceedings of the Demo Session at the Third International Semantic Web Conference (ISWC 2004), Hiroshima (2004)Google Scholar
  46. 46.
    Thomas, E., Pan, J.Z., Sleeman, D.H.: On-tosearch2: searching ontologies semantically. In: Proceedings of the Third International Workshop on OWL: Experiences and Directions (OWLED 2007), Innsbruck. CEUR Workshop Proceedings, vol. 258 (2007)Google Scholar
  47. 47.
    Ungrangsi, R., Anutariya, C., Wuwongse, V.: SQORE-based ontology retrieval system. In: Proceedings of the 18th International Conference on Database and Expert Systems Applications (DEXA 2007), Regensburg. Lecture Notes in Computer Science, vol. 4653, pp. 720–729. Springer, Berlin (2007)Google Scholar
  48. 48.
    Myers, P., Espinosa, R., Parr, C.S., Jones, T., Hammond, G.S., Dewey, T. A.: The Animal Diversity Web (online). http://animaldiversity.org (2006). Accessed 5 Aug 2009

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  1. 1.Knowledge Media InstituteThe Open UniversityMilton KeynesUK
  2. 2.Tetherless World Constellation, Computer Science DepartmentRensselaer Polytechnic InstituteTroyUSA

Personalised recommendations