Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web of Data

  • Benjamin Adrian
  • Jörn Hees
  • Ivan Herman
  • Michael Sintek
  • Andreas Dengel
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6317)


The emergence of Linked Open Data (LOD) was an important milestone on the way to a Web of Data. More and more RDF datasets are published to be consumed and integrated into a variety of applications. One such application is enriching web pages with semantic annotations, which gives readers the chance to look up the Semantic Web's knowledge about text passages. RDFa provides a well-defined basis for this, as it extends HTML tags in web pages with attributes that carry RDF data. Nevertheless, asking web authors to annotate their web pages manually is unrealistic. We present Epiphany, a service that automatically annotates web pages with Linked Data by creating RDFa-enhanced versions of the input HTML pages. In Epiphany, Linked Data can be any RDF dataset or mashup (e.g., DBpedia or BBC programs). Using ontology-based information extraction and the dataset, Epiphany generates an RDF graph about a web page's content. From this RDF graph, RDFa annotations are generated and integrated into an RDFa-enhanced version of the web page. Authors can use Epiphany to obtain RDFa-enhanced versions of their articles that link to Linked Data models. Readers may use Epiphany to receive RDFa-enhanced versions of web pages while surfing. We analysed Epiphany's results with Linked Data from the BBC about music biographies and show quality similar to the results of Open Calais. Epiphany provides annotations from several Linked Data sets.
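The last step described above, turning extracted resources into RDFa markup, can be illustrated with a minimal sketch. It is not Epiphany's implementation: the function `annotate`, the `ENTITIES` table, the example URI, and the choice of `mo:MusicGroup` and `rdfs:label` are all assumptions made for illustration, and the input is treated as a plain text passage rather than a full HTML page.

```python
import html
import re

# Hypothetical extraction result: surface label -> (resource URI, RDF type).
# These values are illustrative only, not output produced by Epiphany.
ENTITIES = {
    "The Beatles": ("http://example.org/artist/the-beatles", "mo:MusicGroup"),
}


def annotate(text, entities):
    """Wrap every known label found in a plain text passage in an RDFa span."""
    if not entities:
        return html.escape(text)
    # Try longer labels first so a long name wins over a shorter prefix of it.
    labels = sorted(entities, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(label) for label in labels))
    parts = []
    pos = 0
    for match in pattern.finditer(text):
        uri, rdf_type = entities[match.group(0)]
        # Copy the unannotated text before the match, escaped for HTML.
        parts.append(html.escape(text[pos:match.start()]))
        parts.append(
            '<span about="{}" typeof="{}" property="rdfs:label">{}</span>'.format(
                html.escape(uri, quote=True),
                html.escape(rdf_type, quote=True),
                html.escape(match.group(0)),
            )
        )
        pos = match.end()
    # Copy whatever follows the last match.
    parts.append(html.escape(text[pos:]))
    return "".join(parts)


if __name__ == "__main__":
    print(annotate("The Beatles formed in Liverpool.", ENTITIES))
```

For the resulting markup to be valid RDFa, the surrounding page would also have to declare the `mo` and `rdfs` prefixes; the sketch leaves that out.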


Semantic Annotation, Link Open Data, Music Group, Datatype Property, Text Snippet
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Benjamin Adrian (1)
  • Jörn Hees (2)
  • Ivan Herman (3)
  • Michael Sintek (1)
  • Andreas Dengel (1, 2)

  1. Knowledge Management Department, DFKI GmbH, Kaiserslautern, Germany
  2. CS Department, University of Kaiserslautern, Kaiserslautern, Germany
  3. Centre for Mathematics and Computer Sciences (CWI), Amsterdam, The Netherlands
