Abstract
The popularization of social networks has considerably increased the volume of data generated from the interaction between people. Understanding this data can be useful both for companies and governments and for users. This work proposes to study how to infer the behavior of people on social networks from published comments, specifically using the Myers-Briggs Typological Indicator (MBTI) in a social network focused on discussions on behavioral issues. The analysis carried out employs Natural Language Processing (NLP) techniques, resampling of the data set and classification algorithms combined by Majority Vote. The results showed 90% efficiency of the combiner with the use of random oversampling. SVM and KNN were the best individual classifiers regardless of the resampling technique used. Although smaller compared to the best individual classifier, the combination approach shows a decrease in the misclassification for INFJ and INFP classes up to 11% and 34%, respectively.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
16personalities – https://www.16personalities.com/
- 2.
- 3.
Personality Cafe – https://www.personalitycafe.com/
References
E. Souza, D. Vitório, D. Castro, A.L.I. Oliveira, C. Gusmão, Characterizing opinion mining: A systematic mapping study of the Portuguese language, in Computational Processing of the Portuguese Language, Portugal, vol. 9727, (Springer, Cham, 2016), pp. 122–127
R.A. Sinoara, J. Antunes, S.O. Rezende, Text mining and semantics: A systematic mapping study. J. Braz. Comput. Soc. 23, 9 (2017)
K. Faceli, A.C. Lorena, J. Gama, A.C.P.L.F. de Carvalho, Inteligência artificial: Uma abordagem de aprendizado de máquina (Editora LTC, Rio de Janeiro, 2011)
D. Opitz, R. Maclin, Popular ensemble methods: An empirical study. J. Artif. Intell. Res. 11, 169–198 (1999)
C. Padurariu, M.E. Breaban, Dealing with data imbalance in text classification. Procedia Comput. Sci., Budapest, Hungary 159, 736–745 (2019)
I.B. Myers, P.B. Myers, Gifts Differing: Understanding Personality Type, 2nd edn. (Davies-Black, Boston, 2010)
C.G. Jung, O eu e o inconsciente, 27th edn. (Editora Vozes, Petrópolis, 2015)
G. Couto, D. Bartholomeu, J.M. Montiel, Estrutura interna do Myers Briggs Type Indicator (MBTI): evidência de validade. Avaliação Psicológica 15(1), 41–48 (2016)
I.B. Myers, L.K. Kirby, K.D. Myers, Introduction to Type: A Guide to Understanding Your Results on the Myers-Briggs Type Indicator, 6th edn. (Consulting Psychologists Press, Palo Alto, 2000)
C. Li et al., Feature extraction from social media posts for psychometric typing of participants, in AC 2018: Augmented Cognition: Intelligent Technologies, Las Vegas, NV, USA, vol. 10915, (Springer, Cham, 2018), pp. 267–286
L.I. Kuncheva, Combining Pattern Classifiers: Methods and Algorithms, 2nd edn. (Wiley, Hoboken, 2014)
A. Onan, S. Korukoğlu, H. Bulut, Ensemble of keyword extraction methods and classifiers in text classification. Expert Syst. Appl. 57, 232–247 (2016)
Y. Liu, T. Liu, Y.J. Wang, Research on micro-blog character analysis based on Naïve Bayes, in Seventh International Conference on Digital Image Processing (ICDIP 2015), Los Angeles, United States, vol. 9631, (SPIE, Bellingham, Washington, 2015), pp. 549–553
F. Provost, T. Fawcett, Data Science Para Negócios, 1st edn. (Alta Books, Rio de Janeiro, 2016)
N. Japkowicz, S. Stephen, The class imbalance problem: A systematic study. Intell. Data Anal. 6(5), 429–449 (2002)
N. Junsomboon, T. Phienthrakul, Combining over-sampling and under-sampling techniques for imbalance dataset, in Proceedings of the 9th International Conference on Machine Learning and Computing, Singapore (2017), pp. 243–247
T. Zhu, Y. Lin, Y. Liu, Synthetic minority oversampling technique for multiclass imbalance problems. Pattern Recogn. 72, 327–340 (2017). https://doi.org/10.1016/j.patcog.2017.07.024
N.V. Chawla, K.W. Bowyer, L.O. Hall, W.P. Kegelmeyer, SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 16(1), 321–357 (2002)
G.E.A.P.A. Batista, R.C. Prati, M.C. Monard, A study of the behavior of several methods for balancing machine learning training data. ACM SIGKDD Explor. Newsl. 6(1), 20–29 (2004)
R.E.S. Santos, J.S. Correia-Neto, E.P.R. Souza, C.V.C. de Magalhães, G. Vilar, Técnicas de processamento de linguagem natural aplicadas ao processo de mineração de textos: Resultados preliminares de um mapeamento sistemático. Revista de Sistemas e Computação 4(2), 116–125 (2014)
E. Souza, D. Vitório, D. Castro, A.L.I. Oliveira, C. Gusmão, Characterizing opinion mining: A systematic mapping study of the Portuguese language, in Computational Processing of the Portuguese Language, Portugal, vol. 9727, (Springer, Cham, 2016), pp. 122–127
H. Soong, N.B.A. Jalil, R.K. Ayyasamy, R. Akbar, The essential of sentiment analysis and opinion mining in social media: Introduction and survey of the recent approaches and techniques, in 2019 IEEE 9th Symposium on Computer Applications Industrial Electronics (ISCAIE), Kota Kinabalu, Malaysia (2019), pp. 272–277
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Mota, F., Paula, M., Drummond, I. (2021). Combined Classification Models Applied to People Personality Identification. In: Latifi, S. (eds) ITNG 2021 18th International Conference on Information Technology-New Generations. Advances in Intelligent Systems and Computing, vol 1346. Springer, Cham. https://doi.org/10.1007/978-3-030-70416-2_59
Download citation
DOI: https://doi.org/10.1007/978-3-030-70416-2_59
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-70415-5
Online ISBN: 978-3-030-70416-2
eBook Packages: EngineeringEngineering (R0)