Abstract
This paper presents data preprocessing operations and visualisation techniques, carried out on the following datasets: Teaching Assistant Evaluation dataset, Statlog (Australian Credit Approval) dataset, Letter Recognition, Connectionist Bench (Sonar, Mines vs. Rocks) dataset, and Poker Hand dataset. These datasets are from the University of California Irvine (UCI) Machine Learning Repository. Further, appropriate visualisation techniques are applied to the five selected datasets depending on the properties that are supported by the visualisation techniques used. In the end, this paper offers a template for researchers, data scientists, and other data users, in selecting the right preprocessing operations and appropriate visualisation techniques when using these datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abdul Moiz, S.: Class level code smells: chernoff face visualization. CSI J. Comput. 3(2), 36–41 (2020). http://www.csi-india.org/downloads/pdf/4/csi
Ankerst, M.: Visual Data Mining. Ph.D. thesis, Faculty of Mathematics and Computer Science, University of Munich, Munich (2000)
Bruckner, L.A.: On chernoff-faces. In: Graphical Representation of Multivariate Data, pp. 93–121 (1978). https://www.sciencedirect.com/science/article/pii/B9780127347509500095
Ceneda, D., Gschwandtner, T., Miksch, S.: A review of guidance approaches in visual data analysis: a multifocal perspective. Comput. Graph. Forum 38(3), 861–879 (2019). https://doi.org/10.1111/cgf.13730
Cristobal, R., Sebastian, V.: Educational data mining and learning analytics: an updated survey. In: WIREs Data Mining Knowledge Discovery, pp. 1–22 (2020)
Gorman, R.P., Sejnowski, T.J.: Learned classification of sonar targets using a massively parallel network. IEEE Trans. Acoust. Speech Signal Process. 36(7), 1135–1140 (1988)
Keim, D., Mansmann, F., Schneidewind, J., Ziegler, H.: Challenges in visual data analysis. In: In Proceeding of International Conference on Information Visualization, pp. 26–36. ACM (2006)
Keim, D., North, S.: Visual data mining in large geospatial point sets. IEEE Comput. Graph. 24(5), 36–44 (2004)
Li, G.: Research on data analysis and mining technology based on computer visualization. In: CIPAE 2020: Proceedings of the 2020 International Conference on Computers, Information Processing and Advanced EducationOctober 2020, pp. 194–200. ACM (2020)
Mehta, A.Y., Cummings, R.D.: GLAD: glycan array dashboard, a visual analytics tool for glycan microarrays. Bioinformatics 35(18), 3536–3537 (2019). https://doi.org/10.1093/bioinformatics/btz075
Nayem, R.: A Taxonomy of Data Mining Problems. IGI Global Publishers (2020)
Rubio, E., Castillo, O., Valdez, F., Melin, P., Gonzalez, C.I., Martinez, G.: An extension of the fuzzy possibilistic clustering algorithm using type-2 fuzzy logic techniques. Adv. Fuzzy Syst. 2017, 23 (2017)
Simoff, S.J.: Visual Data Mining, pp. 3365–3370. Springer, Boston (2009). https://doi.org/10.1007/978-0-387-39940-9_1121
Simoff, S.J., Böhlen, M.H., Mazeika, A.: Visual data mining. In: LNCS 4404, pp. 1–12. Springer-Verlag, Berlin (2008)
Solmaz, M., Lane, A., Gonen, B., Akmamedova, O., Gunes, M.H., Komurov, K.: Graphical data mining of cancer mechanisms with SEMA. Bioinformatics 35(21), 4413–4418 (2019). https://doi.org/10.1093/bioinformatics/btz303
Supriyati, Abdillah, S.R.: Data mining in sales data grouping. IOP Conf. Ser. Mater. Sci. Eng. 879, 012116 (2020). https://doi.org/10.1088
UCI: Machine learning repository (2020). https://archive.ics.uci.edu/ml/index.php
Ying, Y., Yue, S.: Application of data mining combined visualization technology in visual communication. In: 2020 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA), pp. 874–879 (2020)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Mgboh, U., Ogbuokiri, B., Obaido, G., Aruleba, K. (2021). Visual Data Mining: A Comparative Analysis of Selected Datasets. In: Abraham, A., Piuri, V., Gandhi, N., Siarry, P., Kaklauskas, A., Madureira, A. (eds) Intelligent Systems Design and Applications. ISDA 2020. Advances in Intelligent Systems and Computing, vol 1351. Springer, Cham. https://doi.org/10.1007/978-3-030-71187-0_35
Download citation
DOI: https://doi.org/10.1007/978-3-030-71187-0_35
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-71186-3
Online ISBN: 978-3-030-71187-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)