Abstract
In big data environments, Apache Cassandra is a distributed database that offers very high availability. It is an open-source database system designed to manage large volumes of transactional data across many servers worldwide. Its main features are high availability, very high fault tolerance, and a decentralized design with zero downtime. Traditional relational database management systems (RDBMSs) have been used for many years to store data for a wide range of applications, but change is needed because applications must now scale to levels that were once unimaginable. Scaling is not the only concern: companies also require applications that are always available and run fast, which is where RDBMSs fall short. Apache Cassandra is a fully distributed database whose architecture handles extreme data velocity with high availability and scalability and recovers easily from faults. In the Cassandra architecture, there is no master node controlling the other nodes in the ring or network. Data is distributed among the nodes in equal proportion. Cassandra creates an environment in which an entire datacenter can be lost while the system continues to operate as if nothing had happened. This paper provides a brief overview of Cassandra.
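The masterless ring and the roughly even spread of data described in the abstract can be illustrated with a small consistent-hashing sketch. This is not Cassandra's implementation: the `Ring` class, the `token` helper, the node names, the MD5-based hash, and the default values for `vnodes` and `replication_factor` are all assumptions made for illustration only.

```python
import bisect
import hashlib
from collections import Counter


def token(value: str) -> int:
    """Map a string to a ring position (MD5 is an illustrative stand-in hash)."""
    return int.from_bytes(hashlib.md5(value.encode("utf-8")).digest()[:8], "big")


class Ring:
    """Hypothetical masterless token ring: any node can compute ownership locally."""

    def __init__(self, nodes, vnodes=64, replication_factor=3):
        self.replication_factor = min(replication_factor, len(nodes))
        # Each physical node owns several tokens (virtual nodes) to even out load.
        self._entries = sorted(
            (token(f"{node}#{i}"), node) for node in nodes for i in range(vnodes)
        )
        self._tokens = [t for t, _ in self._entries]

    def replicas(self, key: str):
        """Walk clockwise from the key's token, collecting distinct nodes."""
        start = bisect.bisect_right(self._tokens, token(key)) % len(self._entries)
        owners = []
        for offset in range(len(self._entries)):
            node = self._entries[(start + offset) % len(self._entries)][1]
            if node not in owners:
                owners.append(node)
            if len(owners) == self.replication_factor:
                break
        return owners


# Assumed four-node cluster; count which node is the first owner of each key.
ring = Ring(["node-a", "node-b", "node-c", "node-d"])
primary_counts = Counter(ring.replicas(f"user:{i}")[0] for i in range(10_000))
```

Printing `primary_counts` shows how the 10,000 sample keys are shared among the four nodes. Because every key maps to several distinct replicas and no coordinator is involved in the lookup, losing one node in this sketch still leaves other owners for each key, which is the property the abstract attributes to Cassandra's architecture.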
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Wahid, A., Kashyap, K. (2019). Cassandra—A Distributed Database System: An Overview. In: Abraham, A., Dutta, P., Mandal, J., Bhattacharya, A., Dutta, S. (eds) Emerging Technologies in Data Mining and Information Security. Advances in Intelligent Systems and Computing, vol 755. Springer, Singapore. https://doi.org/10.1007/978-981-13-1951-8_47
DOI: https://doi.org/10.1007/978-981-13-1951-8_47
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1950-1
Online ISBN: 978-981-13-1951-8
eBook Packages: Intelligent Technologies and Robotics (R0)