Skip to main content

Operationalizing Analytics with NewSQL

  • Conference paper
  • First Online:
Software Engineering and Algorithms (CSOC 2021)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 230))

Included in the following conference series:

Abstract

Operational data and analytical data are no longer two separate disciplines and discussions. Data Analysis is gaining more ground and more request from companies that begin to base their strategies - as well as decision intelligence and decision management - on factual information. In surveying the state of the data industry and the trends in data management technologies, NewSQL systems appear to be more present. They are able to answer the question of bridging operational data storage and administration with providing real-time access to analytical data. This paper aims to provide a structured look into the features and capabilities offered by NewSQL systems that can be leveraged to allow Data Analysis over a variety of data types. Furthermore, it provides an overview of Realtime Analytics offerings, Map Reduce capabilities and hybrid (transactional and analytical) features.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Almassabi, A., Bawazeer, O., Adam, S.: Top NewSQL databases and features classification. Int. J. Database Manag. Syst. 10, 11–31 (2018)

    Article  Google Scholar 

  2. Altibase: Altibase (2020). https://altibase.com/

  3. Apache: Apache spark (2020). https://spark.apache.org/

  4. Arasu, A., Babcock, B., Babu, S., McAlister, J., Widom, J.: Characterizing memory requirements for queries over continuous data streams. ACM Trans. Database Syst. (TODS) 29(1), 162–194 (2004)

    Article  Google Scholar 

  5. Aslett, M.: What we talk about when we talk about NewSQL (2020). https://blogs.451research.com/information_management/2011/04/06/what-we-talk-about-when-we-talk-about-newsql/

  6. Barber, R., et al.: Evolving databases for new-gen big data applications. In: CIDR (2017)

    Google Scholar 

  7. Bestavros, A., Lin, K.J., Son, S.H.: Real-Time Database Systems: Issues and Applications, vol. 396. Springer, Boston (2012). https://doi.org/10.1007/b116080

  8. Binani, S., Gutti, A., Upadhyay, S.: SQL vs. NoSQL vs. NewSQL-a comparative study. Database 6(1), 1–4 (2016)

    Google Scholar 

  9. Bowman, J.S., Emerson, S.L., Darnovsky, M.: The Practical SQL Handbook: Using Structured Query Language. Addison-Wesley Longman Publishing Co., Inc., Boston (1996)

    Google Scholar 

  10. Brewer, E.: Spanner, truetime and the cap theorem (2017)

    Google Scholar 

  11. Chandra, U.: A comparative study on: NoSQL, NewSQL and Polygot persistence. Int. J. Soft Comput. Eng. (IJSE) 7 (2017)

    Google Scholar 

  12. Chen, M., Mao, S., Liu, Y.: Big data: a survey. Mobile Netw. Appl. 19(2), 171–209 (2014)

    Article  Google Scholar 

  13. CitusData: Citus (2020). https://www.citusdata.com/

  14. Cockroachlabs: Cockroachdb (2020). https://www.cockroachlabs.com/

  15. Corbellini, A., Mateos, C., Zunino, A., Godoy, D., Schiaffino, S.: Persisting big-data: the NoSQL landscape. Inf. Syst. 63, 1–23 (2017)

    Article  Google Scholar 

  16. Dean, J., Ghemawat, S.: MapReduce: Simplified data processing on large clusters (2004)

    Google Scholar 

  17. Dedehayir, O., Steinert, M.: The hype cycle model: a review and future directions. Technol. Forecast. Soc. Chang. 108, 28–41 (2016)

    Article  Google Scholar 

  18. Duan, Y., Fu, G., Zhou, N., Sun, X., Narendra, N.C., Hu, B.: Everything as a service (XAAS) on the cloud: origins, current and future trends. In: 2015 IEEE 8th International Conference on Cloud Computing, pp. 621–628 (2015)

    Google Scholar 

  19. Duggirala, S.: NewSQL databases and scalable in-memory analytics. In: Advances in Computers, vol. 109, pp. 49–76. Elsevier, Amsterdam (2018)

    Google Scholar 

  20. Emani, C.K., Cullot, N., Nicolle, C.: Understandable big data: a survey. Comput. Sci. Rev. 17, 70–81 (2015)

    Article  MathSciNet  Google Scholar 

  21. Fauna: Fauna (2020). https://fauna.com

  22. Fenn, J., Raskino, M.: Mastering the Hype Cycle: How to Choose the Right Innovation at the Right Time. Harvard Business Press (2008)

    Google Scholar 

  23. Gartner: Gartner (2020). https://www.gartner.com

  24. Geczy, P.: Big data characteristics. Macrotheme Rev. 3(6), 94–104 (2014)

    Google Scholar 

  25. solidIT consulting & software development GmbH: Db-engines (2020). https://db-engines.com/

  26. Gray, J., Reuter, A.: Transaction Processing: Concepts and Techniques. Elsevier, Amsterdam (1992)

    Google Scholar 

  27. Hajoui, O., Dehbi, R., Talea, M., Batouta, Z.I.: An advanced comparative study of the most promising NoSQL and NewSQL databases with a multi-criteria analysis method. J. Theor. Appl. Inf. Technol. 81(3), 579 (2015)

    Google Scholar 

  28. Han, J., Haihong, E., Le, G., Du, J.: Survey on NoSQL database. In: 2011 6th International Conference on Pervasive Computing and Applications, pp. 363–366. IEEE (2011)

    Google Scholar 

  29. Holubová, I., Svoboda, M., Lu, J.: Unified management of multi-model data. In: Laender, A., Pernici, B., Lim, E.P., de Oliveira, J. (eds.) Conceptual Modeling. ER 2019. Lecture Notes in Computer Science, vol. 11788, pp. 439–447. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-33223-5_36

  30. Huang, D., et al.: TIDB: a raft-based HTAP database. Proc. VLDB Endow. 13(12), 3072–3084 (2020)

    Article  Google Scholar 

  31. Inc., G.: Google trends (2020). https://trends.google.com

  32. Inc., G.: Spanner (2020). https://cloud.google.com/spanner

  33. Khasawneh, T.N., AL-Sahlee, M.H., Safia, A.A.: SQL, NewSQL, and NoSQL databases: a comparative survey. In: 2020 11th International Conference on Information and Communication Systems (ICICS), pp. 013–021 (2020)

    Google Scholar 

  34. Khurshid, K., Khan, A., Siddique, H., Rashid, I., et al.: Big data-9vs, challenges and solutions. Tech. J. 23(03), 28–34 (2018)

    Google Scholar 

  35. Leavitt, N.: Will NoSQL databases live up to their promise? Computer 43(2), 12–14 (2010)

    Article  Google Scholar 

  36. May, N., Bohm, A., Lehner, W.: Sap HANA-the evolution of an in-memory DBMS from pure OLAP processing towards mixed workloads. Datenbanksysteme für Business, Technologie und Web (BTW 2017) (2017)

    Google Scholar 

  37. Murazzo, M., Gómez, P., Rodríguez, N., Medel, D.: Database newsql performance evaluation for big data in the public cloud. In: Naiouf, M., Chichizola, F., Rucci, E. (eds.) Cloud Computing and Big Data. JCC&BD 2019. Communications in Computer and Information Science, vol. 1050. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-27713-0_10

  38. NuoDB: Nuodb (2020). https://nuodb.com/

  39. Ohlhorst, F.J.: Big Data Analytics: Turning Big Data into Big Money, vol. 65. Wiley, Hoboken (2012)

    Google Scholar 

  40. O’Leary, D.E.: Gartner’s hype cycle and information system research issues. Int. J. Account. Inf. Syst. 9(4), 240–252 (2008)

    Article  Google Scholar 

  41. Özsu, M.T., Valduriez, P.: NoSQL, NewSQL, and Polystores. In: Principles of Distributed Database Systems, pp. 519–557. Springer (2020). https://doi.org/10.1007/978-3-030-26253-2_11

  42. Pavlo, A., Aslett, M.: What’s really new with NewSQL? SIGMOD Rec. 45(2), 45–55 (2016)

    Article  Google Scholar 

  43. PingCap: Tidb (2020). https://pingcap.com/

  44. Pokorný, J.: How to store and process big data: are today’s databases sufficient? In: Saeed, K., Snášel, V. (eds.) Computer Information Systems and Industrial Management, pp. 5–10. Springer, Heidelberg (2014). https://link.springer.com/chapter/10.1007/978-3-662-45237-0_2

  45. Pokornỳ, J.: How to store and process big data: are today’s databases sufficient? In: Saeed, K., Snasel, V. (eds.) Computer Information Systems and Industrial Management. CISIM 2015. Lecture Notes in Computer Science, vol. 8838, pp. 5–10. Springer, Heidelberg (2015). https://doi.org/10.1007/978-3-662-45237-0_2

  46. Rifaie, M., Kianmehr, K., Alhajj, R., Ridley, M.J.: Data modelling for effective data warehouse architecture and design. Int. J. Inf. Decis. Sci. 1(3), 282–300 (2009)

    Google Scholar 

  47. SAP: Sap hana (2020). https://www.sap.com/products/hana.html

  48. SequoiaDB: Tidb (2020). http://www.sequoiadb.com/en/

  49. Shanker, U., Misra, M., Sarje, A.K.: Distributed real time database systems: background and literature review. Distrib. Parallel Databases 23(2), 127–149 (2008)

    Article  Google Scholar 

  50. Shoup, R., Pritchett, D.: The EBAY architecture. In: SD Forum (2006)

    Google Scholar 

  51. Shute, J., et al.: F1: A distributed SQL database that scales (2013)

    Google Scholar 

  52. SingleStore: Memsql (2020). https://www.singlestore.com/

  53. Steiner, A.: A generalisation approach to temporal data models and their implementations. Ph.D. thesis, ETH Zurich (1998)

    Google Scholar 

  54. TechJury: How much data is created every day (2020). https://techjury.net/blog/how-much-data-is-created-every-day

  55. Thomson, A., Diamond, T., Weng, S.C., Ren, K., Shao, P., Abadi, D.J.: Calvin: fast distributed transactions for partitioned database systems. In: Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, pp. 1–12 (2012)

    Google Scholar 

  56. Verma, S., Kawamoto, Y., Fadlullah, Z.M., Nishiyama, H., Kato, N.: A survey on network methodologies for real-time analytics of massive IoT data and open research issues. IEEE Commun. Surv. Tutor. 19(3), 1457–1477 (2017)

    Article  Google Scholar 

  57. VoltDB, I.: VoltDB (2020). https://www.voltdb.com/

  58. Ward, J.S., Barker, A.: Undefined by data: a survey of big data definitions. arXiv preprint arXiv:1309.5821 (2013)

Download references

Acknowledgments

This research was made possible by funding from the ICT-AGRI-FOOD 2020 Joint Call. This work was supported by a grant of the Romanian National Authority for Scientific Research and Innovation, CCCDI - UEFISCDI, project number COFUND-ICT-AGRI-FOOD-MUSHNOMICS 205/2020, within PNCDI III.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ionela Chereja .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chereja, I., Hahn, S.M.L., Matei, O., Avram, A. (2021). Operationalizing Analytics with NewSQL. In: Silhavy, R. (eds) Software Engineering and Algorithms. CSOC 2021. Lecture Notes in Networks and Systems, vol 230. Springer, Cham. https://doi.org/10.1007/978-3-030-77442-4_21

Download citation

Publish with us

Policies and ethics