RDSZ: An Approach for Lossless RDF Stream Compression

  • Norberto Fernández
  • Jesús Arias
  • Luis Sánchez
  • Damaris Fuentes-Lorenzo
  • Óscar Corcho
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8465)

Abstract

In many applications, such as social or sensor networks, the information generated can be represented as a continuous stream of RDF items, where each item describes an application event (a social network post, a sensor measurement, etc.). In this paper we focus on compressing RDF streams. In particular, we propose an approach for lossless RDF stream compression named RDSZ (RDF Differential Stream compressor based on Zlib). The approach exploits the structural similarities among items in a stream by combining a differential item encoding mechanism with the general-purpose stream compressor Zlib. Empirical evaluation on several RDF stream datasets shows that this combination improves compression ratios over using Zlib alone.
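
The combination described in the abstract can be illustrated with a minimal Python sketch. It is not the RDSZ encoding itself, whose details are not given here. It assumes, for illustration only, that each stream item is a flat mapping from predicate to object, that the delta is serialized as JSON, and that the names `diff_encode`, `diff_decode`, `StreamCompressor` and `StreamDecompressor` are hypothetical. Each item is encoded as its difference from the previous item, and the result is fed to a single long-lived Zlib stream.

```python
import json
import zlib


def diff_encode(prev, item):
    """Describe `item` relative to `prev` as (changed, removed).

    Assumption: an item is a flat dict {predicate: object}. This is a
    simplification for illustration, not the encoding used by RDSZ.
    """
    changed = {p: o for p, o in item.items() if p not in prev or prev[p] != o}
    removed = sorted(p for p in prev if p not in item)
    return changed, removed


def diff_decode(prev, changed, removed):
    """Rebuild an item from the previous item and its delta."""
    item = {p: o for p, o in prev.items() if p not in removed}
    item.update(changed)
    return item


class StreamCompressor:
    """Differential encoding followed by one shared Zlib stream."""

    def __init__(self):
        self._z = zlib.compressobj()
        self._prev = {}

    def compress(self, item):
        changed, removed = diff_encode(self._prev, item)
        self._prev = dict(item)
        payload = json.dumps([changed, removed]).encode("utf-8")
        # Z_SYNC_FLUSH emits everything buffered so far, so the receiver can
        # decode this item immediately, while the compression history is
        # kept for the following items.
        return self._z.compress(payload) + self._z.flush(zlib.Z_SYNC_FLUSH)


class StreamDecompressor:
    """Inverse of StreamCompressor; chunks must arrive in order."""

    def __init__(self):
        self._z = zlib.decompressobj()
        self._prev = {}

    def decompress(self, chunk):
        changed, removed = json.loads(self._z.decompress(chunk).decode("utf-8"))
        self._prev = diff_decode(self._prev, changed, removed)
        return dict(self._prev)


# Usage: two hypothetical sensor readings that differ in one value only.
readings = [
    {"rdf:type": "ex:Measurement", "ex:sensor": "ex:s1", "ex:value": "21.5"},
    {"rdf:type": "ex:Measurement", "ex:sensor": "ex:s1", "ex:value": "21.7"},
]
compressor, decompressor = StreamCompressor(), StreamDecompressor()
restored = [decompressor.decompress(compressor.compress(r)) for r in readings]
```

In this sketch the second reading is reduced to the single changed value before it reaches Zlib, which is the kind of structural similarity the approach aims to exploit. Whether and how much this helps on real data is an empirical question that the sketch does not answer.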

Keywords

#eswc2014Fernandez 

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Norberto Fernández (1)
  • Jesús Arias (1)
  • Luis Sánchez (1)
  • Damaris Fuentes-Lorenzo (1)
  • Óscar Corcho (2)

  1. Dpto. Ing. Telemática, Universidad Carlos III de Madrid, Spain
  2. Ontology Engineering Group, Universidad Politécnica de Madrid, Spain
