Skip to main content
Log in

Complex Event Processing on Linked Stream Data

  • SCHWERPUNKTBEITRAG
  • Published:
Datenbank-Spektrum Aims and scope Submit manuscript

Abstract

Social networks and Sensor Web technologies typically generate a massive amount of data published as streams. In order to give these streams a meaningful sense and enrich them with semantic descriptions, the concept of Linked Stream Data (LSD) has emerged. However, to support a wide range of LSD scenarios and queries comprehensive solutions providing not only classic data stream operators such as windows, but also for processing of complex events, linking of (static) datasets, and scalable processing are required. In this paper, we present our approach for processing LSD and addressing these requirements. In contrast to existing LSD engines relying on streaming extensions to SPARQL, our PipeFlow system is a (relational) dataflow language and engine providing support for complex event processing (CEP) and a few dedicated operators for RDF data. We describe this language and particularly the CEP model as well as the system architecture for parallel CEP and LSD processing by exploiting partitioning techniques for cluster environments. Finally, we report results from experiments evaluating our system in comparison to existing LSD engines.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Notes

  1. The maximum heap size of JVM instances when running CQELS was set to 4 GB.

  2. V. 1.0.0 from https://code.google.com/p/cqels/.

  3. V. 1.1 from https://code.google.com/p/etalis/.

References

  1. Abadi DJ, Ahmad Y, Balazinska M, Cherniack M, hyon Hwang J, Lindner W, Maskey AS, Rasin E. Ryvkina E, Tatbul N, Xing Y, Zdonik S (2005) The design of the borealis stream processing engine. CIDR Conference, Asilomar, pp 277–289

  2. Abadi DJ, Carney D, Cetintemel U, Cherniack M, Convey C, Lee S, Stonebraker M, Tatbul N, Zdonik S (2003) Aurora: a new model and architecture for data stream management. VLDB J 12(2):120–139

  3. Agrawal J, Diao Y, Gyllstrom D, Immerman N (2008) Efficient pattern matching over event streams. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, SIGMOD '08. ACM, New York, pp 147-160

  4. Anicic D, Fodor P, Rudolph S, Stojanovic N (2011) EP-SPARQL: a unified language for event processing and stream reasoning. In: Proceedings of the 20th International Conference on World Wide Web, WWW '11. ACM, New York, pp 635-633

  5. Barbieri DF, Braga D, Ceri S, Della Valle E, Grossniklaus M (2009) C-SPARQL: SPARQL for continuous querying. In: Proceedings of the 18th International Conference on World Wide Web, WWW '09. ACM, New York, pp 1061–1062

  6. Barbieri DF, Braga D, Ceri S, Valle ED, Grossniklaus M (2010) Querying RDF streams with C-SPARQL. SIGMOD Rec 39(1):20–26

  7. Beckett D Redland. http://librdf.org/. Accessed 15 Feb 2015

  8. Bolles A, Grawunder M, Jacobi J (2008) Streaming SPARQL extending SPARQL to process data streams. In: Proceedings of the 5th European Semantic Web Conference on The Semantic Web: Research and Applications, ESWC'08. Springer, Berlin, pp 448–462

  9. Chakravarthy S, Mishra D (1994) Snoop: an expressive event specification language for active databases. Data Knowl Eng 14(1):1–26

  10. Dean J, Ghemawat S (2004) Mapreduce: Simplified data processing on large clusters. In: OSDI. ACM, New York, pp 137-150

  11. Demers A, Gehrke J, Panda B, Riedewald M, Sharma V, White W (2007) Cayuga: a general purpose event monitoring system. In: Conference on Innovative Data Systems Research. ACM, New York, pp 412-422

  12. Digital Marketing Ramblings Amazing facebook user statictics. http://expandedramblings.com/index.php/by-the-numbers-17-amazing-facebook-stats/. Accessed 15 Feb 2015

  13. Distributed Stream Computing Platform: S4. http://incubator.apache.org/s4/. Accessed 15 Feb 2015

  14. Gedik B, Schneider S, Hirzel M, Wu K (2014) Elastic scaling for data stream processing. IEEE Trans Parallel Distrib Syst 25(6):1447–1463

  15. Le-Phuoc D, Dao-Tran M, Parreira JX, Hauswirth M (2011) A native and adaptive approach for unified processing of linked streams and linked data. In: Proceedings of the 10th ISWC—Volume Part I, ISWC'11. Springer, Berlin, pp. 370–388

  16. Le-Phuoc D, Nguyen Mau Quoc H, Le Van C, Hauswirth M (2013) Elastic and scalable processing of linked stream data in the cloud. In: The Semantic Web—ISWC 2013, Lecture Notes in Computer Science, vol. 8218. Springer, Berlin, pp. 280–297

  17. Margara A, Urbani J, van Harmelen F, Bal H (2014) Streaming the web: reasoning over dynamic data. Web Semantics: Science, Services and Agents on the World Wide Web 25(0)

  18. Olston C, Reed B, Srivastava U, Kumar R, Tomkins A (2008) Pig latin: a not-so-foreign language for data processing. In: SIGMOD. ACM, New York

  19. Phuoc DL, Dao-Tran M, Pham MD, Boncz PA, Eiter T, Fink M (2012) Linked stream data processing engines: facts and figures. In: International Semantic Web Conference (2), Lecture Notes in Computer Science, vol. 7650. Springer, pp. 300–312

  20. Saleh O, Betz H, Sattler K (2015) Partitioning for scalable complex event processing on data streams. In: New Trends in Database and Information Systems II, Advances in Intelligent Systems and Computing, vol. 312. Springer International Publishing, Cham, pp. 185–197

  21. Saleh O, Sattler K (2014) On efficient processing of linked stream data. In: On the Move to Meaningful Internet Systems: OTM 2014 Conferences, LNCS, vol. 8841. Springer, Berlin, pp. 700–717

  22. Spark A Spark: lightning-fast cluster computing. https://spark.apache.org. Accessed 15 Feb 2015

  23. Sriskandarajah S, Kasun G, Isuru LN, Subash C, Srinath P, Vishaka N (2011) Siddhi: a second look at complex event processing architectures. In: Proceedings of the 2011 ACM workshop on Gateway computing environments, GCE '11. ACM, New York, pp. 43–50

  24. {Stanford University}: Stanford Stream Data Manager. http://infolab.stanford.edu/stream/. Accessed 15 Feb 2015

  25. Tucker PA, Maier D, Sheard T, Fegaras L (2003) Exploiting punctuation semantics in continuous data streams. IEEE Trans Knowl Data Eng 15(3):555–568

  26. Twitter: Storm, distributed and fault-tolerant realtime computation. http://storm-project.net/. Accessed 15 Feb 2015

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Omran Saleh M.Sc..

Additional information

This work was partially supported by the Deutsche Forschungsgemeinschaft (DFG) under grant no. SA 782/22 and GRK 1487.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Saleh, O., Hagedorn, S. & Sattler, KU. Complex Event Processing on Linked Stream Data. Datenbank Spektrum 15, 119–129 (2015). https://doi.org/10.1007/s13222-015-0190-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13222-015-0190-5

Keywords

Navigation