Advertisement

Topology of the Web of Data

  • Christian Bizer
  • Pablo N. Mendes
  • Anja Jentzsch
Chapter
Part of the Data-Centric Systems and Applications book series (DCSA)

Abstract

Over the last years, an increasing number of web sites have started to embed structured data into HTML documents as well as to publish structured data in addition to HTML documents directly on the Web. This trend has led to the extension of the Web with a global data space—the Web of Data. As the classic document Web, the Web of Data covers a wide variety of topics ranging from data describing people, organizations, and events over products and reviews to statistical data provided by governments as well as research data from various scientific disciplines. This chapter gives an overview of the topology of the Web of Data. We discuss the different techniques that are used to publish structured data on the Web and provide statistics about the amount and topics of the data currently published using each technique.

Keywords

Resource Description Framework Link Data Link Open Data Resource Description Framework Data British Broadcasting Corporation 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. 1.
    Adida, B., Birbeck, M.: RDFa primer—bridging the human and data webs—W3C recommendation. http://www.w3.org/TR/xhtml-rdfa-primer/ (2008). Accessed 1 February 2012, http://www.w3.org/TR/xhtml-rdfa-primer/
  2. 2.
    Beckett, D.: RDF/XML Syntax Specification (Revised)—W3C Recommendation. http://www.w3.org/TR/rdf-syntax-grammar/(2004). Accessed 1 February 2012
  3. 3.
    Belleau, F., Nolin, M., Tourigny, N., Rigault, P., Morissette, J.: Bio2rdf: Towards a mashup to build bioinformatics knowledge systems. J. Biomed. Inform. 41(5), 706–716 (2008)Google Scholar
  4. 4.
    BernersLee, T.: Linked data design issues. http://www.w3.org/DesignIssues/LinkedData.html(2006). Accessed 1 February 2012
  5. 5.
    Berners-Lee, T.: Giant global graph. http://dig.csail.mit.edu/breadcrumbs/node/215 (2007). Accessed 1 February 2012, http://dig.csail.mit.edu/breadcrumbs/node/215
  6. 6.
    Berners-Lee, T., Kagal, L.: The fractal nature of the semantic web. AI Magazine 29(3) (2008)Google Scholar
  7. 7.
    Bilke, A., Naumann, F.: Schema matching using duplicates. Proceedings of the International Conference on Data Engineering, 05–08 April 2005Google Scholar
  8. 8.
    Birbeck, M.: Rdfa and linked data in uk government web-sites. Nodalities Magazine 7(2009)Google Scholar
  9. 9.
    Bizer, C., Cyganiak, R.: Quality-driven information filtering using the WIQA policy framework. J. Web Semant. Sci. Serv. Agents World Wide Web 7(1), 1–10 (2009)Google Scholar
  10. 10.
    Bizer, C., Heath, T., Berners-Lee, T.: Linked data - the story so far. Int. J. Semant. Web Inf. Syst. 5(3), 1–22 (2009)Google Scholar
  11. 11.
    Bizer, C., Heese, R., Mochol, M., Oldakowski, R., Tolksdorf, R., Eckstein, R.: The impact of semantic web technologies on job recruitment processes. Proceedings of the 7th Internationale Tagung Wirtschaftsinformatik (WI2005), 2005Google Scholar
  12. 12.
    Bizer, C., Jentzsch, A., Cyganiak, R.: State of the LOD cloud. http://www4.wiwiss.fu-berlin.de/lodcloud/state/ (2011). Accessed 1 February 2012, http://www4.wiwiss.fu-berlin.de/lodcloud/state/
  13. 13.
    Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., Hellmann, S.: Dbpedia – a crystallization point for the web of data. J. Web Semant. Sci. Serv. Agents World Wide Web 7(3), 154–165 (2009)Google Scholar
  14. 14.
    Blanco, R., Halpin, H., Herzig, D.M., Mika, P., Pound, J., Thompson, H.S., Tran Duc, T.: Repeatable and reliable search system evaluation using crowdsourcing. Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information, SIGIR ’11, ACM, New York, NY, USA, 2011, pp. 923–932 DOI http://doi.acm.org/10.1145/2009916.2010039 http://doi.acm.org/10.1145/2009916.2010039
  15. 15.
    Blog, G.: Introducing schema.org: search engines come together for a richer web. http://googleblog.blogspot.com/2011/06/introducing-schemaorg-search-engines.html(2011). Accessed 1 February 2012
  16. 16.
    Blog, Y.S.: We now support microformats. http://www.ysearchblog.com/2006/06/21/we-now-support-microformats/(2006). Accessed 1 February 2012
  17. 17.
    Blog, Y.S.: The Yahoo! search open ecosystem. http://www.ysearchblog.com/archives/000527.html(2008). Accessed 1 February 2012
  18. 18.
    Callan, J.: The Sapphire Web Crawler – crawl statistics. http://boston.lti.cs.cmu.edu/crawler/crawlerstats.html(2009). Accessed 1 February 2012
  19. 19.
    Campinas, S., Ceccarelli, D., Perry, T.E., Delbru, R., Balog, K., Tummarello, G.: The Sindice-2011 dataset for entity-oriented search in the web of data. 1st international workshop on entity-oriented search (EOS) 2011Google Scholar
  20. 20.
    Central, G.W.: Introducing rich snippets. http://googlewebmastercentral.blogspot.com/2009/05/introducing-rich-snippets.html(2011). Accessed 1 February 2012
  21. 21.
    Cheng, G., Qu, Y.: Searching linked objects with falcons: approach, implementation and evaluation. Int. J. Semant. Web Inform. Syst. (IJSWIS) 5(3), 49–70 (2009)Google Scholar
  22. 22.
    Harth, A.: Visinav: a system for visual search and navigation on web data. Web Semant. Sci. Serv. Agents World Wide Web 8(4), 348–354 (2010). DOI DOI: 10.1016/j.websem.2010.08.001. http://www.sciencedirect.com/science/article/B758F-50THXFH-1/2/84c276a928b5889c9870f9e57eda2658
  23. 23.
    Harth, A.: The billion triple challenge datasets. http://km.aifb.kit.edu/projects/btc-2011/(2011). Accessed 1 February 2012
  24. 24.
    Harth, A., Hogan, A., Umbrich, J., Decker, S.: SWSE: objects before documents! Proceedings of the Semantic Web Challenge 2008, 2008Google Scholar
  25. 25.
    Heath, T., Bizer, C.: Linked Data – Evolving the Web into a Global Data Space. Morgan and Claypool Publishers, Seattle, WA USA (2011). http://linkeddatabook.com/
  26. 26.
    Heath, T., Motta, E.: Revyu: linking reviews and ratings into the web of data. J. Web Semant. Sci. Serv. Agents World Wide Web 6(4) (2008)Google Scholar
  27. 27.
    Hepp, M.: Goodrelations: an ontology for describing products and services offers on the web. Proceedings of the 16th International Conference on Knowledge Engineering and Knowledge Management, Acitrezza, Italy, 2008Google Scholar
  28. 28.
    Isele, R., Harth, A., Umbrich, J., Bizer, C.: LDSpider: an open-source crawling framework for the web of linked data. ISWC 2010 posters and demonstrations track: collected abstracts vol. 658, 2010Google Scholar
  29. 29.
    Jacobs, I., Walsh, N.: Architecture of the World Wide Web, Volume One (2004). http://www.w3.org/TR/webarch/, Accessed 1 February 2012
  30. 30.
    Jentzsch, A., Hassanzadeh, O., Bizer, C., Andersson, B., Stephens, S.: Enabling tailored therapeutics with linked data. Proceedings of the WWW2009 Workshop on Linked Data on the Web, 2009. http://events.linkeddata.org/ldow2009/papers/ldow2009\_paper9.pdf
  31. 31.
    Klyne, G., Carroll, J.J.: Resource description framework (RDF): concepts and abstract syntax – W3C recommendation (2004). Accessed 1 February 2012, Http://www.w3.org/TR/rdf-concepts/Google Scholar
  32. 32.
    Kobilarov, G., Scott, T., Raimond, Y., Oliver, S., Sizemore, C., Smethurst, M., Bizer, C., Lee, R.: Media meets semantic web – how the bbc uses dbpedia and linked data to make connections. The semantic web: research and applications, 6th European semantic web conference, 2009, pp. 723–737Google Scholar
  33. 33.
    Madhavan, J., Shawn, J.R., Cohen, S., Dong, X., Ko, D., Yu, C., Halevy, A.: Web-scale data integration: you can only afford to pay as you go. Proceedings of the Conference on Innovative Data Systems Research, 2007Google Scholar
  34. 34.
    Mika, P.: Microformats and RDFa deployment across the Web. http://tripletalk.wordpress.com/2011/01/25/rdfa-deployment-across-the-web/(2011). Accessed 1 February 2012
  35. 35.
    Möller, K., Heath, T., Handschuh, S., Domingue, J.: Recipes for semantic web dog food – the eswc and iswc metadata projects. Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference, Busan, Korea, 2007Google Scholar
  36. 36.
    Myers, J.: Creating local visibility to open box products with front-end semantic web. http://jay.beweep.com/2010/03/30/creating-local-visibility-to-open-box-products -with-front-end-semantic-web/(2010). Accessed 1 February 2012
  37. 37.
    Neubert, J.: Bringing the “thesaurus for economics” on to the web of linked data. Proceedings of the WWW2009 Workshop on Linked Data on the Web, 2009Google Scholar
  38. 38.
    Sheridan, J., Tennison, J.: Linking uk government data. Proceedings of the WWW2010 Workshop on Linked Data on the Web, 2010. http://ceur-ws.org/Vol-628/ldow2010\_paper14.pdf
  39. 39.
    Søren, A., Jens, L., Sebastian, H.: Linkedgeodata – adding a spatial dimension to the web of data. Proceedings of the International Semantic Web Conference, 2009Google Scholar
  40. 40.
    Suchanek, F.M., Kasneci, G., Weikum, G.: Yago: a core of semantic knowledge. In: Williamson C.L., Zurko M.E., Patel-Schneider P.F., Shenoy P.J. (eds.) Proceedings of the 16th International Conference on World Wide Web, ACM, Banff, Alberta, Canada, 8–12 May 2007, pp. 697–706, http://doi.acm.org/10.1145/1242572.1242667
  41. 41.
    Tummarello, G., Cyganiak, R., Catasta, M., Danielczyk, S., Delbru, R., Decker, S.: Sig.ma: live views on the web of data. Web Semant. Sci. Serv. Agents World Wide Web 8(4), 355–364 (2010). DOI DOI: 10.1016/j.websem.2010.08.003. http://www.sciencedirect.com/science/article/B758F-50THXFH-3/2/50a61206ad3a34d5541aebf5a465484a
  42. 42.
    Van de Sompel, H., Lagoze, C., Nelson, M., Warner, S., Sanderson, R., Johnston, P.: Adding escience assets to the data web. Proceedings of the 2nd Workshop on Linked Data on the Web (LDOW2009), 2009Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Christian Bizer
    • Pablo N. Mendes
      • 1
    • Anja Jentzsch
      • 2
    1. 1.Web-based Systems GroupFreie Universität BerlinBerlinGermany
    2. 2.Web-based Systems GroupFreie Universität BerlinBerlinGermany

    Personalised recommendations