Abstract
This study explores a new methodology for machine learning classification tasks in 2-dimensional visualization space (2-D ML) using Visual knowledge Discovery in lossless General Line Coordinates. It is shown that this is a full machine learning approach that does not require processing n-dimensional data in an abstract n-dimensional space. It enables discovering n-D patterns in 2-D space without loss of n-D information using graph representations of n-D data in 2-D. Specifically, this study shows that it can be done with static and dynamic In-line Based Coordinates in different modifications, which are a category of General Line Coordinates. Based on these inline coordinates, classification and regression methods were developed. The viability of the strategy was shown by two case studies based on benchmark datasets (Wisconsin Breast Cancer and Page Block Classification datasets). The characteristics of page block classification data led to the development of an algorithm for imbalanced high-resolution data with multiple classes, which exploits the decision trees as a model design facilitator producing a model, which is more general than a decision tree. This work accelerates the ongoing consolidation of an emerging field of full 2-D machine learning and its methodology. Within this methodology the end users can discover models and justify them as self-service. Providing interpretable ML models is another benefit of this approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Lipton Z (2018) The mythos of model interpretability. Commun ACM 61:36–43
Caruana R, Lou Y, Gehrke J, Koch P, Sturm M, Elhadad N (2015) Intelligible models for healthcare: predicting pneumonia risk and hospital 30-day readmission. In: Proceedings of the 21st ACM SIGKDD. ACM, pp 1721–1730
Kovalerchuk B, Ahmad MA, Teredesai A (2021) Survey of explainable machine learning with visual and granular methods beyond quasi-explanations. In: Pedrycz W, Chen SM (eds) Interpretable artificial intelligence: a perspective of granular computing. Springer, pp 217–267. https://arxiv.org/abs/2009.1022
Kovalerchuk B (2020) Enhancement of cross validation using hybrid visual and analytical means with shannon function. In: Beyond traditional probabilistic data processing techniques: interval, fuzzy etc. methods and their applications. Springer, pp 517–554
Molnar C (2020) Interpretable machine learning. https://christophm.github.io/-interpretable-ml-book/
Rudin C (2019) Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat Mach Intell 1(5):206–215
Kovalerchuk B (2018) Visual knowledge discovery and machine learning. Springer
Kovalerchuk B, Phan H (2021) Full interpretable machine learning in 2D with inline coordinates. In: 2021 25th international conference information visualisation. IEEE, pp 189–196
Dovhalets D, Kovalerchuk B, Vajda S, Andonie R (2018) Deep learning of 2-D images representing n-D data in general line coordinates. In: International symposium on affective science and engineering, pp 1–6. https://doi.org/10.5057/isase.2018-C000025
Kovalerchuk B, Kalla DC, Agarwal B (2022) Deep learning image recognition for non-images. In: Integrating artificial intelligence and visualization for visual knowledge discovery. Springer, pp 63–100
Kovalerchuk B, Gharawi A (2018) Decreasing occlusion and increasing explanation in interactive visual knowledge discovery, In: Human interface and the management of information. Interaction, visualization, and analytics. Springer, pp 505–526
McDonald R, Kovalerchuk B (2022) Non-linear Visual Knowledge Discovery With Elliptic Paired Coordinates. In: Integrating artificial intelligence and visualization for visual knowledge discovery. Springer, pp 141–172
Wagle SN, Kovalerchuk B (2020) Interactive visual self-service data classification approach to democratize machine learning. In: 2020 24th international conference information visualization (IV). IEEE, pp 280–285. https://doi.org/10.1109/IV51561.2020.00052
Inselberg A (1998) Visual data mining with parallel coordinates. Comput Stat 13(1)
Inselberg A (2009) Parallel coordinates: visual multidimensional geometry and its applications. Springer
Sharma A, Vans E, Shigemizu D, Boroevich KA, Tsunoda T (2019) Deep insight: a methodology to transform a non-image data to an image for convolution neural network architecture. Nat: Sci Rep 9(1):1–7
Sansen J, Richer G, Jourde T, Lalanne F, Auber D, Bourqui R (2017) Visual exploration of large multidimensional data using parallel coordinates on big data infrastructure. Informatics, vol 4, no 3, p 21. Multidisciplinary Digital Publishing Institute
Estivill-Castro V, Gilmore E, Hexel R (2020) Constructing interpretable decision trees using parallel coordinates. In: International conference on artificial intelligence and soft computing. Springer, pp 152–164
Tam GK, Kothari V, Chen M (2016) An analysis of machine and human analytics in classification. IEEE Trans Visual Comput Graphics 23(1):71–80
Xu Y, Hong W, Chen N, Li X, Liu W, Zhang T (2007) Parallel filter: a visual classifier based on parallel coordinates and multivariate data analysis. In: International conference on intelligent computing. Springer, pp 1172–1183
Kovalerchuk B, Hayes D, Discovering interpretable models in parallel coordinates, international information visualization conference. https://doi.org/10.1109/IV53921.2021.00037
Dua D, Graff C (2019) UCI machine learning repository, University of California, School of Information and Computer Science, Irvine, CA. https://archive.ics.uci.edu/ml
Zaky D, Gunawan PH (2020) Computational parallel of K-nearest neighbor on page blocks classification dataset. In: 2020 8th international conference on information and communication technology (ICoICT). IEEE, pp 1–4
Eschrich S, Chawla NV, Hall LO (2002) Generalization methods in bioinformatics. In: BIOKDD, vol 2, pp 25–32
Huber L, Kovalerchuk B, Recaido C (2023) Visual knowledge discovery with general line coordinates. https://arxiv.org/pdf/2305.18429
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Kovalerchuk, B., Phan, H. (2024). Full High-Dimensional Intelligible Learning in 2-D Lossless Visualization Space. In: Kovalerchuk, B., Nazemi, K., Andonie, R., Datia, N., Bannissi, E. (eds) Artificial Intelligence and Visualization: Advancing Visual Knowledge Discovery. Studies in Computational Intelligence, vol 1126. Springer, Cham. https://doi.org/10.1007/978-3-031-46549-9_2
Download citation
DOI: https://doi.org/10.1007/978-3-031-46549-9_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-46548-2
Online ISBN: 978-3-031-46549-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)