Abstract
Operational data and analytical data are no longer two separate disciplines and discussions. Data Analysis is gaining more ground and more request from companies that begin to base their strategies - as well as decision intelligence and decision management - on factual information. In surveying the state of the data industry and the trends in data management technologies, NewSQL systems appear to be more present. They are able to answer the question of bridging operational data storage and administration with providing real-time access to analytical data. This paper aims to provide a structured look into the features and capabilities offered by NewSQL systems that can be leveraged to allow Data Analysis over a variety of data types. Furthermore, it provides an overview of Realtime Analytics offerings, Map Reduce capabilities and hybrid (transactional and analytical) features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Almassabi, A., Bawazeer, O., Adam, S.: Top NewSQL databases and features classification. Int. J. Database Manag. Syst. 10, 11–31 (2018)
Altibase: Altibase (2020). https://altibase.com/
Apache: Apache spark (2020). https://spark.apache.org/
Arasu, A., Babcock, B., Babu, S., McAlister, J., Widom, J.: Characterizing memory requirements for queries over continuous data streams. ACM Trans. Database Syst. (TODS) 29(1), 162–194 (2004)
Aslett, M.: What we talk about when we talk about NewSQL (2020). https://blogs.451research.com/information_management/2011/04/06/what-we-talk-about-when-we-talk-about-newsql/
Barber, R., et al.: Evolving databases for new-gen big data applications. In: CIDR (2017)
Bestavros, A., Lin, K.J., Son, S.H.: Real-Time Database Systems: Issues and Applications, vol. 396. Springer, Boston (2012). https://doi.org/10.1007/b116080
Binani, S., Gutti, A., Upadhyay, S.: SQL vs. NoSQL vs. NewSQL-a comparative study. Database 6(1), 1–4 (2016)
Bowman, J.S., Emerson, S.L., Darnovsky, M.: The Practical SQL Handbook: Using Structured Query Language. Addison-Wesley Longman Publishing Co., Inc., Boston (1996)
Brewer, E.: Spanner, truetime and the cap theorem (2017)
Chandra, U.: A comparative study on: NoSQL, NewSQL and Polygot persistence. Int. J. Soft Comput. Eng. (IJSE) 7 (2017)
Chen, M., Mao, S., Liu, Y.: Big data: a survey. Mobile Netw. Appl. 19(2), 171–209 (2014)
CitusData: Citus (2020). https://www.citusdata.com/
Cockroachlabs: Cockroachdb (2020). https://www.cockroachlabs.com/
Corbellini, A., Mateos, C., Zunino, A., Godoy, D., Schiaffino, S.: Persisting big-data: the NoSQL landscape. Inf. Syst. 63, 1–23 (2017)
Dean, J., Ghemawat, S.: MapReduce: Simplified data processing on large clusters (2004)
Dedehayir, O., Steinert, M.: The hype cycle model: a review and future directions. Technol. Forecast. Soc. Chang. 108, 28–41 (2016)
Duan, Y., Fu, G., Zhou, N., Sun, X., Narendra, N.C., Hu, B.: Everything as a service (XAAS) on the cloud: origins, current and future trends. In: 2015 IEEE 8th International Conference on Cloud Computing, pp. 621–628 (2015)
Duggirala, S.: NewSQL databases and scalable in-memory analytics. In: Advances in Computers, vol. 109, pp. 49–76. Elsevier, Amsterdam (2018)
Emani, C.K., Cullot, N., Nicolle, C.: Understandable big data: a survey. Comput. Sci. Rev. 17, 70–81 (2015)
Fauna: Fauna (2020). https://fauna.com
Fenn, J., Raskino, M.: Mastering the Hype Cycle: How to Choose the Right Innovation at the Right Time. Harvard Business Press (2008)
Gartner: Gartner (2020). https://www.gartner.com
Geczy, P.: Big data characteristics. Macrotheme Rev. 3(6), 94–104 (2014)
solidIT consulting & software development GmbH: Db-engines (2020). https://db-engines.com/
Gray, J., Reuter, A.: Transaction Processing: Concepts and Techniques. Elsevier, Amsterdam (1992)
Hajoui, O., Dehbi, R., Talea, M., Batouta, Z.I.: An advanced comparative study of the most promising NoSQL and NewSQL databases with a multi-criteria analysis method. J. Theor. Appl. Inf. Technol. 81(3), 579 (2015)
Han, J., Haihong, E., Le, G., Du, J.: Survey on NoSQL database. In: 2011 6th International Conference on Pervasive Computing and Applications, pp. 363–366. IEEE (2011)
Holubová, I., Svoboda, M., Lu, J.: Unified management of multi-model data. In: Laender, A., Pernici, B., Lim, E.P., de Oliveira, J. (eds.) Conceptual Modeling. ER 2019. Lecture Notes in Computer Science, vol. 11788, pp. 439–447. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-33223-5_36
Huang, D., et al.: TIDB: a raft-based HTAP database. Proc. VLDB Endow. 13(12), 3072–3084 (2020)
Inc., G.: Google trends (2020). https://trends.google.com
Inc., G.: Spanner (2020). https://cloud.google.com/spanner
Khasawneh, T.N., AL-Sahlee, M.H., Safia, A.A.: SQL, NewSQL, and NoSQL databases: a comparative survey. In: 2020 11th International Conference on Information and Communication Systems (ICICS), pp. 013–021 (2020)
Khurshid, K., Khan, A., Siddique, H., Rashid, I., et al.: Big data-9vs, challenges and solutions. Tech. J. 23(03), 28–34 (2018)
Leavitt, N.: Will NoSQL databases live up to their promise? Computer 43(2), 12–14 (2010)
May, N., Bohm, A., Lehner, W.: Sap HANA-the evolution of an in-memory DBMS from pure OLAP processing towards mixed workloads. Datenbanksysteme für Business, Technologie und Web (BTW 2017) (2017)
Murazzo, M., Gómez, P., RodrÃguez, N., Medel, D.: Database newsql performance evaluation for big data in the public cloud. In: Naiouf, M., Chichizola, F., Rucci, E. (eds.) Cloud Computing and Big Data. JCC&BD 2019. Communications in Computer and Information Science, vol. 1050. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-27713-0_10
NuoDB: Nuodb (2020). https://nuodb.com/
Ohlhorst, F.J.: Big Data Analytics: Turning Big Data into Big Money, vol. 65. Wiley, Hoboken (2012)
O’Leary, D.E.: Gartner’s hype cycle and information system research issues. Int. J. Account. Inf. Syst. 9(4), 240–252 (2008)
Özsu, M.T., Valduriez, P.: NoSQL, NewSQL, and Polystores. In: Principles of Distributed Database Systems, pp. 519–557. Springer (2020). https://doi.org/10.1007/978-3-030-26253-2_11
Pavlo, A., Aslett, M.: What’s really new with NewSQL? SIGMOD Rec. 45(2), 45–55 (2016)
PingCap: Tidb (2020). https://pingcap.com/
Pokorný, J.: How to store and process big data: are today’s databases sufficient? In: Saeed, K., Snášel, V. (eds.) Computer Information Systems and Industrial Management, pp. 5–10. Springer, Heidelberg (2014). https://link.springer.com/chapter/10.1007/978-3-662-45237-0_2
Pokornỳ, J.: How to store and process big data: are today’s databases sufficient? In: Saeed, K., Snasel, V. (eds.) Computer Information Systems and Industrial Management. CISIM 2015. Lecture Notes in Computer Science, vol. 8838, pp. 5–10. Springer, Heidelberg (2015). https://doi.org/10.1007/978-3-662-45237-0_2
Rifaie, M., Kianmehr, K., Alhajj, R., Ridley, M.J.: Data modelling for effective data warehouse architecture and design. Int. J. Inf. Decis. Sci. 1(3), 282–300 (2009)
SAP: Sap hana (2020). https://www.sap.com/products/hana.html
SequoiaDB: Tidb (2020). http://www.sequoiadb.com/en/
Shanker, U., Misra, M., Sarje, A.K.: Distributed real time database systems: background and literature review. Distrib. Parallel Databases 23(2), 127–149 (2008)
Shoup, R., Pritchett, D.: The EBAY architecture. In: SD Forum (2006)
Shute, J., et al.: F1: A distributed SQL database that scales (2013)
SingleStore: Memsql (2020). https://www.singlestore.com/
Steiner, A.: A generalisation approach to temporal data models and their implementations. Ph.D. thesis, ETH Zurich (1998)
TechJury: How much data is created every day (2020). https://techjury.net/blog/how-much-data-is-created-every-day
Thomson, A., Diamond, T., Weng, S.C., Ren, K., Shao, P., Abadi, D.J.: Calvin: fast distributed transactions for partitioned database systems. In: Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, pp. 1–12 (2012)
Verma, S., Kawamoto, Y., Fadlullah, Z.M., Nishiyama, H., Kato, N.: A survey on network methodologies for real-time analytics of massive IoT data and open research issues. IEEE Commun. Surv. Tutor. 19(3), 1457–1477 (2017)
VoltDB, I.: VoltDB (2020). https://www.voltdb.com/
Ward, J.S., Barker, A.: Undefined by data: a survey of big data definitions. arXiv preprint arXiv:1309.5821 (2013)
Acknowledgments
This research was made possible by funding from the ICT-AGRI-FOOD 2020 Joint Call. This work was supported by a grant of the Romanian National Authority for Scientific Research and Innovation, CCCDI - UEFISCDI, project number COFUND-ICT-AGRI-FOOD-MUSHNOMICS 205/2020, within PNCDI III.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Chereja, I., Hahn, S.M.L., Matei, O., Avram, A. (2021). Operationalizing Analytics with NewSQL. In: Silhavy, R. (eds) Software Engineering and Algorithms. CSOC 2021. Lecture Notes in Networks and Systems, vol 230. Springer, Cham. https://doi.org/10.1007/978-3-030-77442-4_21
Download citation
DOI: https://doi.org/10.1007/978-3-030-77442-4_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-77441-7
Online ISBN: 978-3-030-77442-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)