Abstract
Data Mining is a software tool dedicated to scan data repositories, generate information, and discover knowledge. Currently, the traditional data processing tools and its applications are not capable of managing the massive amounts of data available inside SMEs environments. Therefore, it is critical to use effective and efficient Data Mining tools which represent a valuable support for SMEs decision-making. In this paper we describe and analyze seven popular open source data mining tools—KEEL, KNIME, Orange, RapidMiner, R Project, Tanagra and WEKA.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Fayyad, M. U., Piatetsky-Shapiro, G., Smyth, P. Advances in knowledge discovery and data mining, pp. 1—34. American Association for Artificial Intelligence, Menlo Park, CA (1996)
Borges, C. L., Marques, M. V., Bernardino, J. Comparison of data mining techniques and tools for data classification. C3S2E ‘13 Proceedings of the International C* Conference on Computer Science and Software Engineering. pp 113–116 (2013)
Witten, H. I., Frank, E., Hall, A. M. Data Mining: Practical Machine Learning Tools and Techniques, 3rd Edition. Morgan Kaufmann, Massachusetts (2011)
Hasim, N., Haris, A. N. A study of open-source data mining tools for forecasting. IMCOM - Proc. of 9th Int. Conf, on Ubiquitous Information Management and Communication. (2015)
Shearer, C.: The CRISP-DM Model: The New Blueprint for Data Mining. Journal of Data Warehousing. 5, 13—23 (2000)
Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques, 3rd Edition. Morgan Kaufmann, Massachusetts (2012)
Bell, G., Gray, N. J.: The revolution yet to happen. Beyond calculation. Copernicus, New York (1997)
Jovic, A., Brkic, K., Bogunovic, N.. An overview of free software tools for general data mining. 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO). pp 1112 -– 1117 (2014)
Fernández, A., Río, S., López, V., Bawakid, A., Jesus, M. J., Benítez, J. M., & Herrera, F. Big Data with Cloud Computing: an insight on the computing environment, MapReduce, and programming frameworks. WIREs Data Mining Knowl Discov, 4: 380–409. (2014)
Alcalá-Fdez, J., Fernandez, A., Luengo, J., Derrac, J., García, S., Sánchez, L., Herrera, F.. KEEL Data-Mining Software Tool: Data Set Repository, Integration of Algorithms and Experimental Analysis Framework. Journal of Soft Computing 17:2-3 pp. 255—287 (2011)
KNIME, http://www.knime.org
Demšar, J., Curk, T., Erjavec, A. Orange: Data Mining Toolbox in Python; Journal of Machine Learning Research, vol. 14, pp. 2349−2353, (2013)
Rapid Miner, http://rapidminer.com
R Project, http://www.r-project.org/
Hornik, K. The R FAQ. http://cran.r-project.org/doc/FAQ/R-FAQ (2015)
Morandat F., Hill, B., Osvald, L., Vitek, J. Evaluating the design of the R language: objects and functions for data analysis. ECOOP’12 Proceedings of the 26th European conference on Object-Oriented Programming, pp 104–131 (2012)
Rakotomalala, R. TANAGRA : un logiciel gratuit pour l’enseignement et la recherché. Actes de EGC’2005, RNTI-E-3, vol. 2, pp.697–702, (2005)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, H. I.. The WEKA Data Mining Software: An Update. SIGKDD Explorations, vol. 11 (1), pp. 10–18 (2009)
Olowe-Adedoyin, M., Gabet, M., Stahl, F. A Survey of Data Mining Techniques for Social
Network Analysis. Journal of Data Mining & Digital Humanities, vol. 2014 (2014)
Talia, D. 2nd IEEE International Conference on Spatial Data Mining and Geographical Knowledge Services (ICSDM), pp 1-4 (2015)
Woerner, S., Wixom, B. Big data: extending the business strategy toolbox. Journal of Information Technology, vol. 30, pp 60-62 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Almeida, P., Bernardino, J. (2016). A Survey on Open Source Data Mining Tools for SMEs. In: Rocha, Á., Correia, A., Adeli, H., Reis, L., Mendonça Teixeira, M. (eds) New Advances in Information Systems and Technologies. Advances in Intelligent Systems and Computing, vol 444. Springer, Cham. https://doi.org/10.1007/978-3-319-31232-3_24
Download citation
DOI: https://doi.org/10.1007/978-3-319-31232-3_24
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-31231-6
Online ISBN: 978-3-319-31232-3
eBook Packages: EngineeringEngineering (R0)