Abstract
This paper analyses some tools that could be appropriate as teaching resources for undergraduate or postgraduate levels. A comparison is performed between two machine learning tools such as Weka and RapidMiner on one side, and with Minitab, on the other side, that is a more statistical tool and also covers some parts of the Cross Industry Standard Process for Data Mining. We describe the functionalities of those frameworks and also the installation and running procedure. A road-map is carried out in order to state the main tasks that are available in these tools and to encourage other researchers or lecturers to introduce them in laboratory classes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Addinsoft. Xlstat v. 2015.1. 01: Data analysis and statistics software for Microsoft Excel (2015)
Akthar, F., Hahne, C.: Rapidminer 5 operator reference. Rapid-I GmbH (2012)
Fayyad, U., Piatetsky-Shapiro, G., Smyth, P.: From data mining to knowledge discovery in databases. AI Mag. 17(3), 37 (1996)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The weka data mining software: an update. ACM SIGKDD Explor. Newslett. 11(1), 10–18 (2009)
Hosmer Jr., D.W., Lemeshow, S., Sturdivant, R.X.: Applied logistic regression, vol. 398. Wiley, Hoboken (2013)
Jolliffe, I.: Principal Component Analysis. Wiley, Hoboken (2002)
Kelleher, J.D., Mac Namee, B., D’Arcy, A.: Fundamentals of Machine Learning for Predictive Data Analytics: Algorithms, Worked Examples, and Case Studies. MIT Press, Cambridge (2015)
Larose, D.T.: Discovering Knowledge in Data: An Introduction to Data Mining. Wiley, Hoboken (2014)
Lloyd, S.: Least squares quantization in PCM. IEEE Trans. Inf. Theory 28(2), 129–137 (1982)
INC Minitab. Minitab statistical software. Minitab Release 13 (2000)
Ramamoorthy, C.V., Wah, B.W.: Knowledge and data engineering. IEEE Trans. Knowl. Data Eng. 1(1), 9–16 (1989)
SPSS Statistics. SPSS 20.0 SPSS Inc., Chicago, IL (2011)
Tallón-Ballesteros, A.J., Riquelme, J.C.: Tackling ant colony optimization meta-heuristic as search method in feature subset selection based on correlation or consistency measures. In: Corchado, E., Lozano, J.A., Quintián, H., Yin, H. (eds.) IDEAL 2014. LNCS, vol. 8669, pp. 386–393. Springer, Cham (2014). doi:10.1007/978-3-319-10840-7_47
Wirth, R., Hipp, J.: CRISP-DM: towards a standard process model for data mining. In: Proceedings of the 4th International Conference on the Practical Applications of Knowledge Discovery and Data Mining, pp. 29–39. Citeseer (2000)
Yin, H., Gao, Y., Li, B., Zhang, D., Yang, M., Li, Y., Klawonn, F., Tallón-Ballesteros, A.J. (eds.): IDEAL 2016. LNCS, vol. 9937. Springer, Cham (2016). doi:10.1007/978-3-319-46257-8
Zhang, S., Zhang, C., Yang, Q.: Data preparation for data mining. Appl. Artif. Intell. 17(5–6), 375–381 (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Cho, SB., Tallón-Ballesteros, A.J. (2017). Visual Tools to Lecture Data Analytics and Engineering. In: Ferrández Vicente, J., Álvarez-Sánchez, J., de la Paz López, F., Toledo Moreo, J., Adeli, H. (eds) Biomedical Applications Based on Natural and Artificial Computing. IWINAC 2017. Lecture Notes in Computer Science(), vol 10338. Springer, Cham. https://doi.org/10.1007/978-3-319-59773-7_56
Download citation
DOI: https://doi.org/10.1007/978-3-319-59773-7_56
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59772-0
Online ISBN: 978-3-319-59773-7
eBook Packages: Computer ScienceComputer Science (R0)