Abstract
All the technology that has been used for the big data handling is inspired by technology that was explain in the Google paper back in 2003. HBase is of the top most used and preferred open source distributed system developed by the Apache including apache zookeeper, apache Hadoop HBase provide random access for the storing and retrieving the data. In HBase we can store any type of data in any format, data can be structured and semi structured. It is very malleable and dynamic in case of data model. It is a No-SQL database i.e. it doesn’t let any inter row transactions to occur. Unlike traditional systems HBase run on multiple or a cluster of computers instead of single one, number of computer in a cluster can be increased or decreased as per the requirement. This type of design provide a more powerful and scalable approach for the data handling. This chapter explains about the how efficient HBase architecture and its command, operations are different from traditional systems.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Gopalani, S., & Arora, R. (2015). Comparing apache spark and map reduce with performance analysis using K-means. International Journal of Computer Applications, 113(1).
Wiewiórka, M. S., et al. (2014). SparkSeq: Fast, scalable, cloud-ready tool for the interactive genomic data analysis with nucleotide precision. Bioinformatics, btu343.
Shoro, A. G., & Soomro, T. R. (2015). Big data analysis: Apache spark perspective. Global Journal of Computer Science and Technology,15(1).
Gu, L., & Li, H. (2013). Memory or time: Performance evaluation for iterative operation on hadoop and spark. In 2013 IEEE 10th International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC). IEEE.
Chen, H., et al. (2012). Hog: Distributed hadoop mapreduce on the grid. High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion. IEEE.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Saxena, A., Singh, S., Shakya, C. (2018). Concepts of HBase Archetypes in Big Data Engineering. In: Roy, S., Samui, P., Deo, R., Ntalampiras, S. (eds) Big Data in Engineering Applications. Studies in Big Data, vol 44. Springer, Singapore. https://doi.org/10.1007/978-981-10-8476-8_5
Download citation
DOI: https://doi.org/10.1007/978-981-10-8476-8_5
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8475-1
Online ISBN: 978-981-10-8476-8
eBook Packages: EngineeringEngineering (R0)