Skip to main content

Performance Measures in Discrete Supervised Classification

  • Conference paper
  • First Online:
Data Analysis and Rationality in a Complex World (IFCS 2019)

Abstract

The evaluation of results in Cluster Analysis frequently appears in the literature, and a variety of evaluation measures have been proposed. On the contrary, in supervised classification, particularly in the discrete case, the subject of results’ evaluation is relatively scarce in this field of the literature. This is the motto underlying this study. The evaluation of the performance of any model of supervised classification is, generally, based on the number of cases correctly or incorrectly predicted by the model. However, these measures can lead to a misleading evaluation when the data is not balanced. More recently, other types of measures have been studied as association or agreement coefficients, the Huberty index, Mutual information, and even ROC curves. Exploratory studies were conducted in this study to understand the relationship between each measure and data characteristics, namely, sample size, balance, and separability of classes. To this end, simulated data and a Beta regression model in the performance of the models were used.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Bacelar-Nicolau, H.: The affinity coefficient in cluster analysis. Methods Oper. Res. 53, 507–512 (1985)

    MATH  Google Scholar 

  • Cribari-Neto, F., Zeileis, A.: Beta regression in R. J. Stat. Softw. 34, 1–24 (2010)

    Article  Google Scholar 

  • Cohen, J.: A coefficient of agreement for nominal scales. Educ. Psychol. Measur. 20, 37–46 (1960)

    Article  Google Scholar 

  • Ferreira, A.S., Cardoso, M.G.: Evaluating discriminant analysis results. In: Lita da Silva, J., Caeiro, F., Natário, I., Braumann C. (eds.) Advances in Regression, Survival Analysis, Extreme Values, Markov Processes and and Other Statistical Applications. Studies in Theoretical and Applied Statistics, pp. 155–162. Springer, Heidelberg (2013)

    Google Scholar 

  • Goldstein, M., Dillon, W.R.: Discrete Discriminant Analysis. Wiley, New York (1978)

    Google Scholar 

  • Ho, T.K., Basu, M.: Complexity measures of supervised classification problems. IEEE Trans. Pattern Anal. Mach. Intell. 24, 289–300 (2002)

    Article  Google Scholar 

  • Huberty, C.J., Olejnik, S.: Applied MANOVA and Discriminant Analysis. Wiley-Interscience. Wiley, New Jersey (2006)

    Google Scholar 

  • Paik, H.: The effect of prior probability on skill in two-group discriminant analysis. Qual Quant. 32, 201–211 (1998)

    Article  Google Scholar 

  • Santafe, G., Inza, I., Lozano, J.A.: Dealing with the evaluation of supervised classification algorithms. Artif. Intell. Rev. 44, 467–508 (2015)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ana Sousa Ferreira .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ferreira, A.S., Marques, A. (2021). Performance Measures in Discrete Supervised Classification. In: Chadjipadelis, T., Lausen, B., Markos, A., Lee, T.R., Montanari, A., Nugent, R. (eds) Data Analysis and Rationality in a Complex World. IFCS 2019. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-030-60104-1_6

Download citation

Publish with us

Policies and ethics