Skip to main content

Processing of Streaming Weakly Structured Data

  • Conference paper
  • First Online:
Digital and Information Technologies in Economics and Management (DITEM 2021)

Abstract

The article discusses systems that allow performing continuous queries, as well as their architecture. It was found that the work of such systems begins with the construction of a query execution plan. The request is executed by an executor similar to the executor used in database management systems (DBMS). The existing systems for performing continuous queries over data streams in JSON format currently do not meet the requirements. It is necessary to develop our own system for executing continuous queries over JSON streaming data based on modern technologies for streaming computing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Denisova, O.A., Kunsbaeva, G.A., Chiglintsiva, A.S.: Big data: some ways to solve the problems of higher education. In: Journal of Physics: Conference Series. International Scientific and Practical Conference Information Technologies and Intelligent Decision Making Systems, ITIDMS-II 2021, p. 012021 (2021)

    Google Scholar 

  2. Denisova, O.A.: Motivation of technical university students to study physics and methods of teaching it in the context of a pandemic. In: Journal of Physics: Conference Series, p. 12025. Krasnoyarsk Science and Technology City Hall (2020)

    Google Scholar 

  3. Denisova, O.A.: Big data technology: assessing the quality of the educational environment. In: Journal of Physics: Conference Series. International Scientific and Practical Conference Information Technologies and Intelligent Decision Making Systems, ITIDMS-II 2021, p. 012027 (2021)

    Google Scholar 

  4. Sharma, S., Gadia, S., Udoyara, S.: Subset, subquery and queryable-visualization in parametric big data model. Int. J. Inf. Manag. Data Insights 1(1), 100003 (2021)

    Google Scholar 

  5. Brahmia, Z., Grandi, F., Brahmia, S.: A graphical conceptual model for conventional and time-varying JSON data. Procedia Comput. Sci. 184, 823–828 (2021)

    Article  Google Scholar 

  6. XML: eXtensible Markup Language. http://www.w3.org/XML/. Accessed 21 Oct 2021

  7. Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. In: OSDI 2004: Proceedings of the 6th Conference on Symposium on Operation Systems Design and Implementation. USENIX Association (2004)

    Google Scholar 

  8. Ghemawat, S., Gobioff, H., Leung, S.: The Google file system. SIGOPS Oper. Syst. Rev. 37(5), 29–43 (2003)

    Article  Google Scholar 

  9. Neumeyer, L., Robbins, B., Nair, A., Kesari, A.: S4: distributed stream computing platform (2010)

    Google Scholar 

  10. Wang, J., Zhang, J., Zhong, R.: Big data analytics for intelligent manufacturing systems. J. Manuf. Syst. 1016 (2021)

    Google Scholar 

  11. Chandrasekaran, S., Cooper, O.: TelegraphCQ: continuous dataflow processing for an uncertain world. In: CIDR, vol. 20, p. 668 (2003)

    Google Scholar 

  12. Gray, P., Amosql, M.: Liu, L., Ozsu, M.T. (eds.) Encyclopedia of Database Systems. Springer (2019)

    Google Scholar 

  13. XQuery: An XML Query Language. http://www.w3.org/TR/xquery-30/. Accessed 21 Oct 2021

  14. Chen, J., DeWitt, D.J., Tian, F., Wang, Y.: NiagaraCQ: a scalable continuous query system for internet databases. In: SIGMOD 2000, pp. 379–390 (2000)

    Google Scholar 

  15. XML-QL: A Query Language for XML. http://www.w3.org/TR/NOTE-xml-ql/. Accessed 21 Oct 2021

  16. Wu, E., Diao, Y., Rizvi, S.: High-performance complex event processing over streams. In: Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, pp. 407–418 (2006)

    Google Scholar 

  17. XPath: XML Path Language. http://www.w3.org/TR/xpath20/. Accessed 21 Oct 2021

  18. Cranor, C., Johnson, T., Spataschek, O., Shkapenyuk, V.: Gigascope: a stream database for network applications. Network, pp. 647–651 (2013)

    Google Scholar 

  19. Gyllstrom, D., Sase, Wu, E.: Complex event processing over streams. In: Proceedings of the Third Biennial Conference on Innovative Data Systems Research (2007)

    Google Scholar 

  20. Cube: A system for collecting timestamped events and deriving metrics. https://github.com/square/cube

  21. Mongo, D.B.: An open-source document database. http://www.mongodb.org/. Accessed 21 Oct 2021

  22. Benítez-Hidalgo, A.: TITAN: a knowledge-based platform for Big Data workflow management. Knowl.-Based Syst. 232, 107489 (2021)

    Article  Google Scholar 

  23. Bauleo, E., Carnevale, S.: Design, realization and user evaluation of the SmartVortex Visual Query System for accessing data streams in industrial engineering applications. J. Vis. Lang. Comput. 25(5), 577–601 (2014)

    Article  Google Scholar 

  24. Sahal, R., Breslin, J.G., Intizar, M.: Big data and stream processing platforms for Industry 4.0 requirements mapping for a predictive maintenance use case. J. Manuf. Syst. 54, 138–151 (2020)

    Article  Google Scholar 

  25. Chu, Z., Yu, J., Hamdulla, A.: A novel deep learning method for query task execution time prediction in graph database. Future Gener. Comput. Syst. 112, 534–548 (2020)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Olga Denisova .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Denisova, O. (2022). Processing of Streaming Weakly Structured Data. In: Gibadullin, A. (eds) Digital and Information Technologies in Economics and Management. DITEM 2021. Lecture Notes in Networks and Systems, vol 432. Springer, Cham. https://doi.org/10.1007/978-3-030-97730-6_5

Download citation

Publish with us

Policies and ethics