Skip to main content

An Empirical Analysis Data Mining Frameworks—An Overview

  • Conference paper
  • First Online:
Ambient Communications and Computer Systems

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 356))

  • 529 Accesses

Abstract

In recent years, a massive breakthrough has attracted interest, dramatically altering the way companies produce, conduct, and operate data and analytics structures. Hadoop, NoSQL, and the cloud have ushered in a new age of scale-out, flexible, and real-time computation, resulting in the development of new data processing, collection, and analytical knowledge to enable advanced machine learning and optimization techniques. Over the last decade, many developments in data processing and development have occurred, including big data systems, cloud services, data lakes, optimization, self-service, data collation, machine learning, and many others. Today’s data management systems imitate the old data warehousing and BI systems on the outskirts, but with new data management concepts and updated processes. The need to modernize data processing architecture is widespread, as shown by the amount of architecture consultancy inquiries. As a result, this research explores a simple path to massive data handling framework as well as various popular file system architectures for handling big data for data mining. Furthermore, major frameworks for data mining are investigated and compared.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Azarmi B (2016) Scalable big data architecture. A practitioner’s guide to choosing relevant big data architecture. Apress, Berkeley. https://doi.org/10.1007/978-1-4842-1326-1

  2. Burys J, Awan AJ, Heinis T (2018) Large-scale clustering using MPI-based canopy. https://doi.org/10.13140/RG.2.2.26139.11049

  3. Chattopadhyay A, Chang C-H, Yu H (2017) Emerging technology and architecture for big-data analytics. https://doi.org/10.1007/978-3-319-54840-1

  4. Fahmideh M, Beydoun G (2018) Big data analytics architecture design—an application in manufacturing systems. Comput Ind Eng. https://doi.org/10.1016/j.cie.2018.08.004

  5. Godson K, Behera R (2019) Big data architectures: a detailed and application oriented review

    Google Scholar 

  6. Komal MS (2018) A review paper on big data analytics tools

    Google Scholar 

  7. Rafsanjani K, Asghari M, Emami Z, Nasibeh (2012) A survey of hierarchical clustering algorithms. J Math Comput Sci 5:229–240 https://doi.org/10.22436/jmcs.05.03.11

  8. Kumar A, Ingle Y, Pande A, Dhule P (2014) Canopy clustering: a review on pre-clustering approach to k-means clustering

    Google Scholar 

  9. Mukherjee S (2019) Benefits of AWS in modern cloud. https://doi.org/10.5281/zenodo.2587217

  10. Patil P, Karthikeyan A (2020) A survey on k-means clustering for analyzing variation in data. https://doi.org/10.1007/978-981-15-0146-3_29

  11. Memon M, Soomro S, Jumani A, Kartio M (2017) Big data analytics and its applications. Ann Emerg Technol Comput 1. https://doi.org/10.33166/AETiC.2017.01.006

  12. Mijwil M (2018) Microsoft azure what is it and where does microsoft bet with its cloud?

    Google Scholar 

  13. Kalipe GK, Behera RK (2019) Big data architectures: a detailed and application oriented review

    Google Scholar 

  14. Mijwil M (2018) Microsoft azure what is it and where does microsoft bet with its cloud?

    Google Scholar 

  15. Mukherjee S (2019) Benefits of AWS in modern cloud

    Google Scholar 

  16. Komal MS (2018) A review paper on big data analytics tools

    Google Scholar 

  17. Rafsanjani MK, Asghari Z, Emami N (2012) A survey of hierarchical clustering algorithms. J Mathe Comput Sci 5:229–240. https://doi.org/10.22436/jmcs.05.03.11

  18. Kumar A, Ingle YS, Pande A, Dhule P (2014) Canopy clustering: a review on pre-clustering approach to K-Means clustering

    Google Scholar 

  19. Memon MA, Soomro S, Jumani AK, Kartio MA (2017) Big data analytics and its applications. Annals Emerge Technol Comput 1. https://doi.org/10.33166/AETiC.2017.01.006.

  20. Nielsen F (2016) Hierarchical clustering. https://doi.org/10.1007/978-3-319-21903-5_8.

  21. Ramesh B (2015) Big data architecture

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sivakumar, K., Kalaivani, S., Venkatesan, D., Vetrivel, V. (2022). An Empirical Analysis Data Mining Frameworks—An Overview. In: Hu, YC., Tiwari, S., Trivedi, M.C., Mishra, K.K. (eds) Ambient Communications and Computer Systems. Lecture Notes in Networks and Systems, vol 356. Springer, Singapore. https://doi.org/10.1007/978-981-16-7952-0_23

Download citation

  • DOI: https://doi.org/10.1007/978-981-16-7952-0_23

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-16-7951-3

  • Online ISBN: 978-981-16-7952-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics