Abstract
In recent years, a massive breakthrough has attracted interest, dramatically altering the way companies produce, conduct, and operate data and analytics structures. Hadoop, NoSQL, and the cloud have ushered in a new age of scale-out, flexible, and real-time computation, resulting in the development of new data processing, collection, and analytical knowledge to enable advanced machine learning and optimization techniques. Over the last decade, many developments in data processing and development have occurred, including big data systems, cloud services, data lakes, optimization, self-service, data collation, machine learning, and many others. Today’s data management systems imitate the old data warehousing and BI systems on the outskirts, but with new data management concepts and updated processes. The need to modernize data processing architecture is widespread, as shown by the amount of architecture consultancy inquiries. As a result, this research explores a simple path to massive data handling framework as well as various popular file system architectures for handling big data for data mining. Furthermore, major frameworks for data mining are investigated and compared.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Azarmi B (2016) Scalable big data architecture. A practitioner’s guide to choosing relevant big data architecture. Apress, Berkeley. https://doi.org/10.1007/978-1-4842-1326-1
Burys J, Awan AJ, Heinis T (2018) Large-scale clustering using MPI-based canopy. https://doi.org/10.13140/RG.2.2.26139.11049
Chattopadhyay A, Chang C-H, Yu H (2017) Emerging technology and architecture for big-data analytics. https://doi.org/10.1007/978-3-319-54840-1
Fahmideh M, Beydoun G (2018) Big data analytics architecture design—an application in manufacturing systems. Comput Ind Eng. https://doi.org/10.1016/j.cie.2018.08.004
Godson K, Behera R (2019) Big data architectures: a detailed and application oriented review
Komal MS (2018) A review paper on big data analytics tools
Rafsanjani K, Asghari M, Emami Z, Nasibeh (2012) A survey of hierarchical clustering algorithms. J Math Comput Sci 5:229–240 https://doi.org/10.22436/jmcs.05.03.11
Kumar A, Ingle Y, Pande A, Dhule P (2014) Canopy clustering: a review on pre-clustering approach to k-means clustering
Mukherjee S (2019) Benefits of AWS in modern cloud. https://doi.org/10.5281/zenodo.2587217
Patil P, Karthikeyan A (2020) A survey on k-means clustering for analyzing variation in data. https://doi.org/10.1007/978-981-15-0146-3_29
Memon M, Soomro S, Jumani A, Kartio M (2017) Big data analytics and its applications. Ann Emerg Technol Comput 1. https://doi.org/10.33166/AETiC.2017.01.006
Mijwil M (2018) Microsoft azure what is it and where does microsoft bet with its cloud?
Kalipe GK, Behera RK (2019) Big data architectures: a detailed and application oriented review
Mijwil M (2018) Microsoft azure what is it and where does microsoft bet with its cloud?
Mukherjee S (2019) Benefits of AWS in modern cloud
Komal MS (2018) A review paper on big data analytics tools
Rafsanjani MK, Asghari Z, Emami N (2012) A survey of hierarchical clustering algorithms. J Mathe Comput Sci 5:229–240. https://doi.org/10.22436/jmcs.05.03.11
Kumar A, Ingle YS, Pande A, Dhule P (2014) Canopy clustering: a review on pre-clustering approach to K-Means clustering
Memon MA, Soomro S, Jumani AK, Kartio MA (2017) Big data analytics and its applications. Annals Emerge Technol Comput 1. https://doi.org/10.33166/AETiC.2017.01.006.
Nielsen F (2016) Hierarchical clustering. https://doi.org/10.1007/978-3-319-21903-5_8.
Ramesh B (2015) Big data architecture
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sivakumar, K., Kalaivani, S., Venkatesan, D., Vetrivel, V. (2022). An Empirical Analysis Data Mining Frameworks—An Overview. In: Hu, YC., Tiwari, S., Trivedi, M.C., Mishra, K.K. (eds) Ambient Communications and Computer Systems. Lecture Notes in Networks and Systems, vol 356. Springer, Singapore. https://doi.org/10.1007/978-981-16-7952-0_23
Download citation
DOI: https://doi.org/10.1007/978-981-16-7952-0_23
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-7951-3
Online ISBN: 978-981-16-7952-0
eBook Packages: EngineeringEngineering (R0)