Extraction and Semantic Annotation of Workshop Proceedings in HTML Using RML
Despite the significant number of existing tools, incorporating data into the Linked Open Data cloud remains complicated; hence discouraging data owners to publish their data as Linked Data. Unlocking the semantics of published data, even if they are not provided by the data owners, can contribute to surpass the barriers posed by the low availability of Linked Data and come closer to the realisation of the envisaged Semantic Web. rml, a generic mapping language based on an extension over Open image in new window, the Open image in new window standard for mapping relational databases into rdf, offers a uniform way of defining the mapping rules for data in heterogeneous formats. In this paper, we present how we adjusted our prototype rml Processor, taking advantage of rml’s scalability, to extract and map data of workshop proceedings published in html to the rdf data model for the Semantic Publishing Challenge needs.
- 1.Coetzee, P., Heath, T., Motta, E.: Sparqplug: generating linked data from legacy HTML, SPARQL and the DOM (2008)Google Scholar
- 2.Connolly, D.: Gleaning resource descriptions from dialects of languages (GRDDL). W3C recommendation, September 2007Google Scholar
- 3.Dimou, A., Vander Sande, M., Colpaert, P., Verborgh, R., Mannens, E., Van de Walle, R.: RML: a generic language for integrated RDF mappings of heterogeneous data. In: Workshop on Linked Data on the Web (2013)Google Scholar
- 4.Dimou, A., Vander Sande, M., De Nies, T., Verborgh, R., Mannens, E., Van de Walle, R.: RDF mapping rules refinements according to data consumers feedback. In: 2nd International World Wide Web Conference, Poster Track Proceedings (2014)Google Scholar
- 5.Droop, M., et al.: Translating XPath queries into SPARQL queries. In: Meersman, R., Tari, Z. (eds.) OTM 2007 Workshops, Part I. LNCS, vol. 4805, pp. 9–10. Springer, Heidelberg (2007)Google Scholar