Abstract

In this paper, we study the methods, techniques, and algorithms used in data mining. Among these, we focus on clustering algorithms, and more precisely on the K-means algorithm. We first studied K-means with the Euclidean distance, then modified the distance between the clusters by using the Mahalanobis and Canberra distances instead. After implementing the algorithms in C/C++, we compared the clusterings produced by the three variants, after which we modified them and studied the distance between the clusters.
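The abstract does not reproduce the authors' implementation, so the following is only a minimal C++ sketch of how the three distances it names could be plugged into the assignment step of K-means. All identifiers (`Point`, `Matrix`, `Distance`, `euclidean`, `canberra`, `mahalanobis`, `nearest_centroid`) are hypothetical. The sketch also assumes that the caller supplies the inverse covariance matrix for the Mahalanobis distance, and that Canberra terms with a zero denominator are skipped.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <functional>
#include <limits>
#include <vector>

// Hypothetical type aliases, introduced only for this illustration.
using Point = std::vector<double>;
using Matrix = std::vector<std::vector<double>>;
using Distance = std::function<double(const Point&, const Point&)>;

// Euclidean distance: sqrt(sum_i (a_i - b_i)^2).
double euclidean(const Point& a, const Point& b) {
    assert(a.size() == b.size());
    double sum = 0.0;
    for (std::size_t i = 0; i < a.size(); ++i) {
        const double d = a[i] - b[i];
        sum += d * d;
    }
    return std::sqrt(sum);
}

// Canberra distance: sum_i |a_i - b_i| / (|a_i| + |b_i|).
// Assumption: a term whose denominator is zero contributes nothing.
double canberra(const Point& a, const Point& b) {
    assert(a.size() == b.size());
    double sum = 0.0;
    for (std::size_t i = 0; i < a.size(); ++i) {
        const double denom = std::fabs(a[i]) + std::fabs(b[i]);
        if (denom > 0.0) {
            sum += std::fabs(a[i] - b[i]) / denom;
        }
    }
    return sum;
}

// Mahalanobis distance: sqrt((a - b)^T * inv_cov * (a - b)).
// Assumption: inv_cov is the inverse covariance matrix, computed elsewhere.
double mahalanobis(const Point& a, const Point& b, const Matrix& inv_cov) {
    assert(a.size() == b.size() && inv_cov.size() == a.size());
    const std::size_t n = a.size();
    double sum = 0.0;
    for (std::size_t i = 0; i < n; ++i) {
        assert(inv_cov[i].size() == n);
        for (std::size_t j = 0; j < n; ++j) {
            sum += (a[i] - b[i]) * inv_cov[i][j] * (a[j] - b[j]);
        }
    }
    return std::sqrt(sum);
}

// K-means assignment step: index of the centroid closest to p under dist.
std::size_t nearest_centroid(const Point& p,
                             const std::vector<Point>& centroids,
                             const Distance& dist) {
    assert(!centroids.empty());
    std::size_t best = 0;
    double best_d = std::numeric_limits<double>::infinity();
    for (std::size_t k = 0; k < centroids.size(); ++k) {
        const double d = dist(p, centroids[k]);
        if (d < best_d) {
            best_d = d;
            best = k;
        }
    }
    return best;
}
```

Because `nearest_centroid` takes the distance as a parameter, switching variants only means passing a different callable: `euclidean` or `canberra` directly, or a lambda that captures the inverse covariance matrix and forwards to `mahalanobis`.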

Author information

Corresponding author

Correspondence to Cristescu Marian Pompiliu.

Copyright information

© 2020 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Ana-Maria Ramona, S., Marian Pompiliu, C., Stoyanova, M. (2020). Data Mining Algorithms for Knowledge Extraction. In: Fotea, S., Fotea, I., Văduva, S. (eds) Challenges and Opportunities to Develop Organizations Through Creativity, Technology and Ethics. GSMAC 2019. Springer Proceedings in Business and Economics. Springer, Cham. https://doi.org/10.1007/978-3-030-43449-6_20
