Abstract
In the field of machine learning, a crucial task is understanding the relative importance of the different input features in a predictive model. There is an approach in the literature whose aim is to analyze the predictive capacity of some features with respect to others. Can we explain a feature of the input space with others? Can we quantify this capacity? We propose a practical approach for analyzing the importance of features in a model and the explanatory capacity of some features over others. It is based on the adaptation of existing definitions from the literature that use the Shapley value and fuzzy measures. Our new approach aims to facilitate the understanding and application of these concepts by starting from a simple idea and considering well known methods. The main objective of this work is to provide a useful and practical approach for analyzing feature importance in real world cases.
Supported by the Government of Spain, Grant Plan Nacional de I+D+i, PID2020-116884GB-I00, PID2021-122905NB-C21.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alonso Moral, J., Castiello, C., Magdalena, L., Mencar, C.: Explainable Fuzzy Systems, Studies in Computational Intelligence, vol. 970. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-71098-9
Beliakov, G., Gómez, D., James, S., Montero, J., Rodríguez, J.: Approaches to learning strictly-stable weights for data with missing values. Fuzzy Sets Syst. 325, 97–113 (2017). https://doi.org/10.1016/j.fss.2017.02.003
Chu, C., Chan, D.: Feature selection using approximated high-order interaction components of the shapley value for boosted tree classifier. IEEE Access 8, 112742–112750 (2020)
Cramer, J.: The origins of logistic regression, vol. 119. Tinbergen Institute (2002)
Gutiérrez, I., Santos, D., Castro, J., Gómez, D., Espínola, R., Guevara, J.: On measuring features importance in machine learning models in a two-dimensional representation scenario. In: 2022 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), pp. 1–9 (2022)
Nelder, J., Wedderburn, R.: Generalized linear model. J. R. Stat. Soc. Series A 135(3), 370–384 (1972)
Okhrati, R., Lipani, A.: A multilinear sampling algorithm to estimate shapley values. Artif. Intell. 298 (2021)
Santos, D., Gutiérrez, I., Castro, J., Gómez, D., Guevara, J., Espínola, R.: Explanation of machine learning classification models with fuzzy measures: an approach to individual classification. In: Kahraman, C., Tolga, A.C., Cevik Onar, S., Cebi, S., Oztaysi, B., Sari, I.U. (eds.) INFUS 2022. LNNS, vol. 505, pp. 62–69. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-09176-6_7
Shapley, L.: A value for \(n-\)person games. Ann. Math. Stud. 2, 307–317 (1953)
Sugeno, M.: Fuzzy measures and fuzzy integrals: a survey. Fuzzy Automata Decis. Process 78 (1977)
Štrumbelj, E., Kononenko, I.: An efficient explanation of individual classifications using game theory. J. Mach. Learn. Res. 1, 1–18 (2010)
Štrumbelj, E., Kononenko, I., Robnik Šikonja, M.: Explaining instance classifications with interactions of subsets of feature values. Data Knowl. Eng. 68(10), 886–904 (2009)
Štrumbelj, E., Kononenko, I., Robnik Šikonja, M.: Explaining prediction models and individual predictions with feature contributions. Knowl. Inf. Syst. 41(4), 647–665 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Gutiérrez, I., Santos, D., Castro, J., Hernández-Gonzalo, J.A., Gómez, D., Espínola, R. (2023). Machine Learning and Fuzzy Measures: A Real Approach to Individual Classification. In: Massanet, S., Montes, S., Ruiz-Aguilera, D., González-Hidalgo, M. (eds) Fuzzy Logic and Technology, and Aggregation Operators. EUSFLAT AGOP 2023 2023. Lecture Notes in Computer Science, vol 14069. Springer, Cham. https://doi.org/10.1007/978-3-031-39965-7_12
Download citation
DOI: https://doi.org/10.1007/978-3-031-39965-7_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-39964-0
Online ISBN: 978-3-031-39965-7
eBook Packages: Computer ScienceComputer Science (R0)