Toward RDF Normalization

  • Regina Ticona-Herrera
  • Joe Tekli
  • Richard Chbeir
  • Sébastien Laborie
  • Irvin Dongo
  • Renato Guzman
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9381)

Abstract

Billions of RDF triples are currently available on the Web through the Linked Open Data cloud (e.g., DBpedia, LinkedGeoData and New York Times). Governments, universities, and companies (e.g., BBC, CNN) are also producing huge collections of RDF triples and exchanging them through different serialization formats (e.g., RDF/XML, Turtle, N-Triples). However, RDF descriptions (i.e., graphs and serializations) are verbose in syntax, often contain redundancies, and can be generated differently even when describing the same resources, all of which negatively affects their processing. Hence, we propose an approach that cleans RDF descriptions and eliminates their redundancies, transforming different descriptions of the same information into a single representation that can then be tuned to the target application (information retrieval, compression, etc.). Experimental tests show significant improvements, namely reduced RDF description loading time and file size.
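
To make the idea of mapping different descriptions of the same information to one representation concrete, the following is a minimal Python sketch. It is not the normalization approach proposed in the paper: it only assumes simple N-Triples-style input with one triple per line, and the helper names (`parse_triple`, `normalize`, `serialize`) and the `example.org` IRIs are hypothetical. It removes duplicate triples and emits them in sorted order, and it deliberately ignores blank nodes, prefixes, and other disparities that a real normalizer would have to handle.

```python
def parse_triple(line):
    """Split one simple N-Triples-style line into (subject, predicate, object).

    Assumption: the line holds exactly one triple, optionally ending with '.'.
    """
    body = line.strip()
    if body.endswith("."):
        body = body[:-1].rstrip()  # drop the statement terminator
    subject, predicate, obj = body.split(None, 2)
    return (subject, predicate, obj)


def normalize(lines):
    """Return the unique triples of a description in sorted order."""
    triples = set()  # a set discards duplicate statements
    for line in lines:
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue  # skip blank lines and comments
        triples.add(parse_triple(stripped))
    return sorted(triples)


def serialize(triples):
    """Write triples back out, one per line, with uniform spacing."""
    return "\n".join(f"{s} {p} {o} ." for s, p, o in triples)


# Two hypothetical descriptions of the same information: the first repeats
# a triple with different spacing, the second uses another statement order.
desc_a = [
    '<http://example.org/a> <http://example.org/name> "Alice" .',
    '<http://example.org/a>   <http://example.org/name>   "Alice" .',
    '<http://example.org/a> <http://example.org/knows> <http://example.org/b> .',
]
desc_b = [
    "# same information, different order",
    '<http://example.org/a> <http://example.org/knows> <http://example.org/b> .',
    '<http://example.org/a> <http://example.org/name> "Alice".',
]

print(serialize(normalize(desc_a)))
print(serialize(normalize(desc_a)) == serialize(normalize(desc_b)))
```

Both inputs serialize to the same two-triple output, which is also smaller than the first input. Sorting is used here only to obtain a deterministic order; the choice of output order and format would in practice depend on the target application.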

Keywords

RDF graph, Serialization, Redundancies and disparities

Notes

Acknowledgments

This work has been partly supported by FINCyT (Fund for Innovation, Science and Technology) of Peru.

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Regina Ticona-Herrera (1, 3)
  • Joe Tekli (2)
  • Richard Chbeir (1)
  • Sébastien Laborie (1)
  • Irvin Dongo (1)
  • Renato Guzman (3)

  1. University of Pau and Adour Countries - LIUPPA, Anglet, France
  2. Lebanese American University, Byblos, Lebanon
  3. San Pablo Catholic University, Arequipa, Peru