Advertisement

A Data Services Composition Approach for Continuous Query on Social Media Streams

  • Guiling WangEmail author
  • Xiaojiang Zuo
  • Marc Hesenius
  • Yao Xu
  • Yanbo Han
  • Volker Gruhn
Chapter
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11360)

Abstract

We witness a rapid increase in the number of social media streams due to development of Web2.0, IoT and Cloud Computing technology. These sources include both traditional relational databases and streaming data from messaging infrastructure. We would like to use multiple social media streams to answer complex queries to enable information sharing and intelligence gathering for better collaboration. For this purpose, we adopt data services as the basic abstraction for both traditional relational databases and data streams retrieval. A flexible continuous data service model with continuous query as service operation is proposed. Service operation instance is modeled as a view defined on data streams. In the view, data part and time synchronization part are separated from each other. Based on the continuous data service model, we proposed a continuous data service composition algorithm for answering queries across data streams and relational data. The main idea is to find the contained rewriting of user query on views satisfying both data part and time synchronization part containment relationship. We also present use case and experimental studies that indicate that the approach is effective and efficient.

Keywords

Data streams Query rewriting Data services Service composition Continuous query 

Notes

Acknowledgments

This work is supported by Beijing Natural Science Foundation No. 4172018 (Building Stream Data Services for Spatio-Temporal Pattern Discovery in Cloud Computing Environment) and National Natural Science Foundation of China No. 61672042 (Models and Methodology of Data Services Facilitating Dynamic Correlation of Big Stream Data), and University Cooperation Projects Foundation of CETC Ocean Corp.

References

  1. 1.
    Carey, M.J., Onose, N., Petropoulos, M.: Data services. Commun. ACM 55(6), 86–97 (2012)CrossRefGoogle Scholar
  2. 2.
    Vaculín, R., Chen, H., Neruda, R., Sycara, K.: Modeling and discovery of data providing services. In: 2008 IEEE International Conference on Web Services, pp. 54–61, September 2008Google Scholar
  3. 3.
    Barhamgi, M., Benslimane, D., Medjahed, B.: A query rewriting approach for web service composition. IEEE Trans. Serv. Comput. 3(3), 206–222 (2010)CrossRefGoogle Scholar
  4. 4.
    Zhou, L., Chen, H., Yu, T., Ma, J., Wu, Z.: Ontology-based scientific data service composition: a query rewriting-based approach. In: AAAI Spring Symposium: Semantic Scientific Knowledge Integration, pp. 116–121 (2008)Google Scholar
  5. 5.
    Zhang, F., Wang, G., Han, Y.: Automatic generation of service composition plans for correlated queries. In: 2013 10th Web Information System and Application Conference, pp. 143–149, November 2013Google Scholar
  6. 6.
    Babcock, B., Babu, S., Datar, M., Motwani, R., Widom, J.: Models and issues in data stream systems. In: Proceedings of the Twenty-First ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, pp. 1–16. ACM (2002)Google Scholar
  7. 7.
    Ghanem, T.M., Elmagarmid, A.K., Larson, P.Å., Aref, W.G.: Supporting views in data stream management systems. ACM Trans. Database Syst. 35(1), 1–47 (2008)CrossRefGoogle Scholar
  8. 8.
    Zhao, W., Liu, C., Chen, J.: Automatic composition of information-providing web services based on query rewriting. Sci. China Inf. Sci. 55(11), 2428–2444 (2012)CrossRefGoogle Scholar
  9. 9.
    Barhamgi, M., Benslimane, D., Ouksel, A.M.: Composing and optimizing data providing web services. In: Proceedings of the 17th International Conference on World Wide Web, pp. 1141–1142. ACM (2008)Google Scholar
  10. 10.
    Doan, A., Halevy, A., Ives, Z.: Principles of Data Integration, 1st edn. Morgan Kaufmann Publishers Inc., San Francisco (2012)Google Scholar
  11. 11.
    Lemos, A.L., Daniel, F., Benatallah, B.: Web service composition: a survey of techniques and tools. ACM Comput. Surv. 48(3), 33:1–33:41 (2015)CrossRefGoogle Scholar
  12. 12.
    Pottinger, R., Halevy, A.: MiniCon: a scalable algorithm for answering queries using views. Int. J. Very Large Data Bases 10(2–3), 182–198 (2001)zbMATHGoogle Scholar
  13. 13.
    Zaharia, M., Das, T., Li, H., Hunter, T., Shenker, S., Stoica, I.: Discretized streams: fault-tolerant streaming computation at scale. In: Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles, SOSP 2013, pp. 423–438. ACM, New York (2013)Google Scholar
  14. 14.
    Wang, G., et al.: Building a replicated logging system with Apache Kafka. Proc. VLDB Endow. 8(12), 1654–1655 (2015)CrossRefGoogle Scholar
  15. 15.
    Fielding, R.T.: Architectural styles and the design of network-based software architectures. Ph.D. thesis, University of California (2000)Google Scholar
  16. 16.
    Hickson, I.: Server-sent events. https://www.w3.org/TR/eventsource/. Accessed 25 Oct 2015
  17. 17.
    Newman, M.E.: Power laws, Pareto distributions and Zipf’s law. Contemp. Phys. 46(5), 323–351 (2005)CrossRefGoogle Scholar
  18. 18.
    Han, Y., Liu, C., Su, S., Zhu, M., Zhang, Z., Zhang, S.: A proactive service model facilitating stream data fusion and correlation. Int. J. Web Serv. Res. (IJWSR) 14(3), 1–16 (2017)CrossRefGoogle Scholar
  19. 19.
    Billet, B., Issarny, V., Texier, G.: Composing continuous services in a CoAP-based IoT. In: 2017 IEEE International Conference on AI Mobile Services (AIMS), pp. 46–53, June 2017Google Scholar
  20. 20.
    Han, Y., Wang, G., Yu, J., Liu, C., Zhang, Z., Zhu, M.: A service-based approach to traffic sensor data integration and analysis to support community-wide green commute in China. IEEE Trans. Intell. Transp. Syst. 17(9), 2648–2657 (2016)CrossRefGoogle Scholar
  21. 21.
    Carbone, P., Katsifodimos, A., Ewen, S., Markl, V., Haridi, S., Tzoumas, K.: Apache Flink: stream and batch processing in a single engine. Bull. IEEE Comput. Soc. Tech. Comm. Data Eng. 36(4), 28–38 (2015)Google Scholar
  22. 22.
    Confluent: KSQL: Streaming SQL for Apache Kafka. https://www.confluent.io/product/ksql/. Accessed 25 July 2018
  23. 23.
    Genesereth, M.R., Keller, A.M., Duschka, O.M.: Infomaster: an information integration system. SIGMOD Rec. 26(2), 539–542 (1997)CrossRefGoogle Scholar
  24. 24.
    Levy, A.Y., Rajaraman, A., Ordille, J.J.: The world wide web as a collection of views: query processing in the information manifold. In: VIEWS, pp. 43–55 (1996)Google Scholar
  25. 25.
    Levy, A.Y., Rajaraman, A., Ordille, J.J.: Querying heterogeneous information sources using source descriptions. In: Proceedings of the 22th International Conference on Very Large Data Bases, VLDB 1996, pp. 251–262. Morgan Kaufmann Publishers Inc., San Francisco (1996)Google Scholar
  26. 26.
    Benedikt, M., Lopez-Serrano, R., Tsamoura, E.: Biological web services: integration, optimization, and reasoning. CEUR Workshop Proceedings, pp. 21–27 (2016)Google Scholar
  27. 27.
    Thakkar, S., Ambite, J.L., Knoblock, C.A.: Composing, optimizing, and executing plans for bioinformatics web services. VLDB J. 14(3), 330–353 (2005)CrossRefGoogle Scholar
  28. 28.
    Wang, G., Yang, S., Han, Y.: Mashroom: end-user mashup programming using nested tables. In: Proceedings of the 18th International Conference on World Wide Web, pp. 861–870. ACM (2009)Google Scholar
  29. 29.
    Han, Y., Wang, G., Ji, G., Zhang, P.: Situational data integration with data services and nested table. Serv. Oriented Comput. Appl. 7(2), 129–150 (2013)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag GmbH Germany, part of Springer Nature 2019

Authors and Affiliations

  • Guiling Wang
    • 1
    • 2
    Email author
  • Xiaojiang Zuo
    • 1
  • Marc Hesenius
    • 3
  • Yao Xu
    • 2
  • Yanbo Han
    • 1
  • Volker Gruhn
    • 3
  1. 1.Beijing Key Laboratory on Integration and Analysis of Large-Scale Stream DataNorth China University of TechnologyBeijingChina
  2. 2.Ocean Information Technology CompanyChina Electronics Technology Group Corporation (CETC Ocean Corp.)BeijingChina
  3. 3.paluno - The Ruhr Institute for Software TechnologyUniversity of Duisburg-EssenEssenGermany

Personalised recommendations