Abstract
In this paper, we study the methods, techniques, and algorithms used in data mining, and from the studied algorithms, we emphasized the clustering algorithms, more precisely on the K-means algorithm. This algorithm was first studied using the Euclidean distance, then modifying the distance between the clusters using the distances Mahalanobis and Canberra. After implementing the algorithms in C/C++, we compared the clustering of the three algorithms, after which we modified them and studied the distance between the clusters.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Chaudhuri, S., & Dayal U. (March, 1997). An overview of data warehousing and OLAP technology. Appears in ACM Sigmod Record.
Cocianu C. L. (2006). Supervised and unsupervised classification for pattern recognition purposes. Economic Informatics Journal, 4(40).
Inmon, B. (November, 1999). Data mart does not equal data warehouse. DM Direct Newsletter.
Ionescu, Ș. A. (2015). Hierarchical clustering with applications in financial analysis. Romanian Journal of Statistics (Supplement no. 8).
Moldovan, D. (2011). Economic intelligence: Data mining applied to financial data. Ph.D. thesis, Babeş-Bolyai University, Cluj-Napoca Faculty of Economics and Business Administration.
Moore, G. A. (2004). Darwin and the demon: Innovating within established enterprises. Harvard Business Review, 82(7–8), 86–92.
Moroz, M., & Polkowski, Z. (2016). The last mile issue and urban logistics: Choosing parcel machines in the context of the ecological attitudes of the Y generation consumers purchasing online. In 2nd International Conference “Green Cities - Green Logistics for Greener Cities”, Poland.
Pakhira, M. K. (2009). A modified k-means algorithm to avoid empty clusters. International Journal of Recent Trends in Engineering, 1(1).
Polkowski, Z., Zajac, D., Vasilev, J., & Florina, A. L. (June, 2016). A content analysis of existing educational portals for teaching data warehouse and business intelligence. In 2016 8th International Conference on Electronics, Computers and Artificial Intelligence (ECAI) (pp. 1–6). IEEE.
Stancu A.M. R. (2017). Knowledge-extracting solutions from large volumes of data. Ph.D. thesis, ASE Bucharest.
Taguchi, G., & Jugulum, R. (2002). The Mahalanobis-Taguchi strategy: A pattern technology system.
Wu, F. X. (2008). Genetic weighted k-means algorithm for clustering large-scale gene expression data. BMC Bioinformatics, 9.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Ana-Maria Ramona, S., Marian Pompiliu, C., Stoyanova, M. (2020). Data Mining Algorithms for Knowledge Extraction. In: Fotea, S., Fotea, I., Văduva, S. (eds) Challenges and Opportunities to Develop Organizations Through Creativity, Technology and Ethics. GSMAC 2019. Springer Proceedings in Business and Economics. Springer, Cham. https://doi.org/10.1007/978-3-030-43449-6_20
Download citation
DOI: https://doi.org/10.1007/978-3-030-43449-6_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-43448-9
Online ISBN: 978-3-030-43449-6
eBook Packages: Business and ManagementBusiness and Management (R0)