Skip to main content

Relative Confusion Matrix: An Efficient Visualization for the Comparison of Classification Models

  • Chapter
  • First Online:
Artificial Intelligence and Visualization: Advancing Visual Knowledge Discovery

Part of the book series: Studies in Computational Intelligence ((SCI,volume 1126))

  • 57 Accesses

Abstract

Recent machine learning and deep learning algorithms have made important breakthroughs in tasks such as classification tasks. Multiple efficient methods and model architectures have been proposed to continuously improve the performance on these tasks. To determine the best classifier for a given task, machine learning experts generally rely on metrics such as the accuracy that measure both the number of correct predictions and the number of prediction errors. These metrics can be computed at a global scale, which means one value represents the performance of the whole model. They can also be computed at a class scale, which means the performance of the model is considered for each class individually. The most common way to efficiently represent the class-wise performance of such models is the standard Confusion Matrix. Yet, if this representation can efficiently highlight both well discriminated classes and problematic classes of a single classifier, the very few works leverage the matrix structure of this visualization to compare several models at a class scale. In this paper, we present the Relative Confusion Matrix (RCM), a matrix-based visualization leveraging a color encoding and a set of symbols to highlight class-wise differences of performance between two models. We conducted an evaluation with users to compare RCM with two confusion matrix design variants. Our results show that RCM’s encoding leads to a more efficient comparison of two models than existing approaches. This extended version develops the design choices with more details. It also includes an application scenario of RCM to compare various models generated through a Deep Neural Network visual pruning approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Ren S, He K, Girshick R, Sun J (2016) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137–1149

    Article  Google Scholar 

  2. Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587

    Google Scholar 

  3. Krizhevsky A, Sutskever I, Hinton GE (2017) Imagenet classification with deep convolutional neural networks. Commun ACM 60(6):84–90

    Article  Google Scholar 

  4. Hinterreiter A, Ruch P, Stitz H, Ennemoser M, Bernard J, Strobelt H, Streit M (2020) Confusion flow: a model-agnostic visualization for temporal analysis of classifier confusion. IEEE Trans Vis Comput Graph

    Google Scholar 

  5. Talbot J, Lee B, Kapoor A, Tan DS (2009) Ensemble matrix: interactive visualization to support machine learning with multiple classifiers. In: Proceedings of the SIGCHI conference on human factors in computing systems, pp 1283–1292

    Google Scholar 

  6. Alsallakh B, Hanbury A, Hauser H, Miksch S, Rauber A (2014) Visual methods for analyzing probabilistic classification data. IEEE Trans Visual Comput Graph 20(12):1703–1712

    Article  Google Scholar 

  7. Ren D, Amershi S, Lee B, Suh J, Williams JD (2016) Squares: Supporting interactive performance analysis for multiclass classifiers. IEEE Trans Visual Comput Graphics 23(1):61–70

    Article  Google Scholar 

  8. Powers DM (2020) Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. Preprint at arXiv:2010.16061

  9. Pommé LE, Bourqui R, Giot R, Auber D (2022) Relative confusion matrix: efficient comparison of decision models. In: IV2022-IVE-information visualization evaluation and user studies

    Google Scholar 

  10. Mackinlay J (1986) Automating the design of graphical presentations of relational information. ACM Trans Graph 5(2):110–141

    Article  Google Scholar 

  11. Purchase H (2012) Experimental human-computer interaction: a practical guide with visual examples. Cambridge University Press

    Book  Google Scholar 

  12. Fawcett T (2006) An introduction to ROC analysis. Pattern Recogn Lett 27(8):861–874

    Article  MathSciNet  Google Scholar 

  13. Amershi S, Chickering M, Drucker SM, Lee B, Simard P, Suh J (2015) Model tracker: redesigning performance analysis tools for machine learning. In: Proceedings of the 33rd annual ACM conference on human factors in computing systems, pp 337–346

    Google Scholar 

  14. Görtler J, Hohman F, Moritz D, Wongsuphasawat K, Ren D, Nair R, Kirchner M, Patel K (2021) Neo: generalizing confusion matrix visualization to hierarchical and multi-output labels. In: CHI conference on human factors in computing systems, pp 1–13

    Google Scholar 

  15. Li G, Wang J, Shen HW, Chen K, Shan G, Lu Z (2020) CNN pruner: pruning convolutional neural networks with visual analytics. IEEE Trans Vis Comput Graph 1

    Google Scholar 

  16. Gleicher M (2017) Considerations for visualizing comparison. IEEE Trans Visual Comput Graph 24(1):413–423

    Article  Google Scholar 

  17. Basak A, Bach B, Henry Riche N, Isenberg T, Fekete JD (2013) Weighted graph comparison techniques for brain connectivity analysis. In: Proceedings of the SIGCHI conference on human factors in computing systems, pp 483–492

    Google Scholar 

  18. Conover WJ, Iman RL (1979) On multiple-comparisons procedures, Technical Report LA-7677-MS, Los Alamos Scientific Laboratory

    Google Scholar 

  19. Pommé LE, Bourqui R, Giot R, Vallet J, Auber D (2023) NetPrune: a sparklines visualization for network pruning. Vis Inf

    Google Scholar 

  20. Xiao H, Rasul K, Vollgraf R (2017) Fashion-mnist: a novel image dataset for benchmarking machine learning algorithms. Preprint at arXiv:1708.07747

  21. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Preprint at arXiv:1409.1556

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Luc Etienne Pommé .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Pommé, L.E., Bourqui, R., Giot, R., Auber, D. (2024). Relative Confusion Matrix: An Efficient Visualization for the Comparison of Classification Models. In: Kovalerchuk, B., Nazemi, K., Andonie, R., Datia, N., Bannissi, E. (eds) Artificial Intelligence and Visualization: Advancing Visual Knowledge Discovery. Studies in Computational Intelligence, vol 1126. Springer, Cham. https://doi.org/10.1007/978-3-031-46549-9_7

Download citation

Publish with us

Policies and ethics