Abstract
Recent machine learning and deep learning algorithms have made important breakthroughs in tasks such as classification tasks. Multiple efficient methods and model architectures have been proposed to continuously improve the performance on these tasks. To determine the best classifier for a given task, machine learning experts generally rely on metrics such as the accuracy that measure both the number of correct predictions and the number of prediction errors. These metrics can be computed at a global scale, which means one value represents the performance of the whole model. They can also be computed at a class scale, which means the performance of the model is considered for each class individually. The most common way to efficiently represent the class-wise performance of such models is the standard Confusion Matrix. Yet, if this representation can efficiently highlight both well discriminated classes and problematic classes of a single classifier, the very few works leverage the matrix structure of this visualization to compare several models at a class scale. In this paper, we present the Relative Confusion Matrix (RCM), a matrix-based visualization leveraging a color encoding and a set of symbols to highlight class-wise differences of performance between two models. We conducted an evaluation with users to compare RCM with two confusion matrix design variants. Our results show that RCM’s encoding leads to a more efficient comparison of two models than existing approaches. This extended version develops the design choices with more details. It also includes an application scenario of RCM to compare various models generated through a Deep Neural Network visual pruning approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ren S, He K, Girshick R, Sun J (2016) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137–1149
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587
Krizhevsky A, Sutskever I, Hinton GE (2017) Imagenet classification with deep convolutional neural networks. Commun ACM 60(6):84–90
Hinterreiter A, Ruch P, Stitz H, Ennemoser M, Bernard J, Strobelt H, Streit M (2020) Confusion flow: a model-agnostic visualization for temporal analysis of classifier confusion. IEEE Trans Vis Comput Graph
Talbot J, Lee B, Kapoor A, Tan DS (2009) Ensemble matrix: interactive visualization to support machine learning with multiple classifiers. In: Proceedings of the SIGCHI conference on human factors in computing systems, pp 1283–1292
Alsallakh B, Hanbury A, Hauser H, Miksch S, Rauber A (2014) Visual methods for analyzing probabilistic classification data. IEEE Trans Visual Comput Graph 20(12):1703–1712
Ren D, Amershi S, Lee B, Suh J, Williams JD (2016) Squares: Supporting interactive performance analysis for multiclass classifiers. IEEE Trans Visual Comput Graphics 23(1):61–70
Powers DM (2020) Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. Preprint at arXiv:2010.16061
Pommé LE, Bourqui R, Giot R, Auber D (2022) Relative confusion matrix: efficient comparison of decision models. In: IV2022-IVE-information visualization evaluation and user studies
Mackinlay J (1986) Automating the design of graphical presentations of relational information. ACM Trans Graph 5(2):110–141
Purchase H (2012) Experimental human-computer interaction: a practical guide with visual examples. Cambridge University Press
Fawcett T (2006) An introduction to ROC analysis. Pattern Recogn Lett 27(8):861–874
Amershi S, Chickering M, Drucker SM, Lee B, Simard P, Suh J (2015) Model tracker: redesigning performance analysis tools for machine learning. In: Proceedings of the 33rd annual ACM conference on human factors in computing systems, pp 337–346
Görtler J, Hohman F, Moritz D, Wongsuphasawat K, Ren D, Nair R, Kirchner M, Patel K (2021) Neo: generalizing confusion matrix visualization to hierarchical and multi-output labels. In: CHI conference on human factors in computing systems, pp 1–13
Li G, Wang J, Shen HW, Chen K, Shan G, Lu Z (2020) CNN pruner: pruning convolutional neural networks with visual analytics. IEEE Trans Vis Comput Graph 1
Gleicher M (2017) Considerations for visualizing comparison. IEEE Trans Visual Comput Graph 24(1):413–423
Basak A, Bach B, Henry Riche N, Isenberg T, Fekete JD (2013) Weighted graph comparison techniques for brain connectivity analysis. In: Proceedings of the SIGCHI conference on human factors in computing systems, pp 483–492
Conover WJ, Iman RL (1979) On multiple-comparisons procedures, Technical Report LA-7677-MS, Los Alamos Scientific Laboratory
Pommé LE, Bourqui R, Giot R, Vallet J, Auber D (2023) NetPrune: a sparklines visualization for network pruning. Vis Inf
Xiao H, Rasul K, Vollgraf R (2017) Fashion-mnist: a novel image dataset for benchmarking machine learning algorithms. Preprint at arXiv:1708.07747
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Preprint at arXiv:1409.1556
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Pommé, L.E., Bourqui, R., Giot, R., Auber, D. (2024). Relative Confusion Matrix: An Efficient Visualization for the Comparison of Classification Models. In: Kovalerchuk, B., Nazemi, K., Andonie, R., Datia, N., Bannissi, E. (eds) Artificial Intelligence and Visualization: Advancing Visual Knowledge Discovery. Studies in Computational Intelligence, vol 1126. Springer, Cham. https://doi.org/10.1007/978-3-031-46549-9_7
Download citation
DOI: https://doi.org/10.1007/978-3-031-46549-9_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-46548-2
Online ISBN: 978-3-031-46549-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)