Abstract
Personality detection is an old topic in psychology and automatic personality prediction (or perception) (APP) is the automated (computationally) forecasting of the personality on different types of human generated/exchanged contents (such as text, speech, image, and video). The principal objective of this study is to offer a shallow (overall) review of natural language processing approaches on APP since 2010. With the advent of deep learning and following it transfer-learning and pre-trained model in NLP, APP research area has been a hot topic, so in this review, methods are categorized into three: pre-trained independent, pre-trained model based, and multimodal approaches. In addition, to achieve a comprehensive comparison, reported results are informed by datasets.
Similar content being viewed by others
Data availability
Not applicable.
Notes
More information in [46].
A number of calculated readability measures based on simple surface characteristics of the text. These measures are basically linear regressions based on the number of words, syllables, and sentences.
Available on https://www.kaggle.com/datasnaek/mbti-type/.
References
Corr, P. J., & Matthews, G. (2009). The Cambridge handbook of personality psychology. Cambridge: Cambridge University Press.
Kazdin, A. E. (2000). Encyclopedia of psychology, volume 2. American Psychological Association Washington, DC. http://www.scielo.br/scielo.php?script=sci_nlinks &pid=S1516-4446201300040041600015 &lng=en
Bergner, R. M. (2020). What is personality? Two myths and a definition. New Ideas in Psychology, 57, 100759. https://doi.org/10.1016/J.NEWIDEAPSYCH.2019.100759.
Peters, E., & Killcoyne, H. L. (2015). Psychology. Encyclopaedia Britannica.
Schultz, D. P., & Schultz, S. Ellen. (2016). Theories of personality. Cengage Learning, 11 Edn. https://books.google.com/books/about/Theories_of_Personality.html?id=dXccCgAAQBAJ
Ewen, R. B. (2010). An introduction to theories of personality. Psychology Press. https://www.routledge.com/An-Introduction-to-Theories-of-Personality-7th-Edition/Ewen/p/book/9781841697468
Eysenck, H. (1998). Dimensions of personality. Routledge. https://doi.org/10.4324/9780203793268.
Jothilakshmi, S., Sangeetha, J., & Brindha, R. (2017). Speech based automatic personality perception using spectral features. International Journal of Speech Technology, 20(1), 43–50. https://doi.org/10.1007/s10772-016-9390-0.
Su, M. H., Wu, C. H., Huang, K. Y., Hong, Q. B., & Wang, H. M. (2018). Personality trait perception from speech signals using multiresolution analysis and convolutional neural networks. In Proceedings—9th Asia-Pacific signal and information processing association annual summit and conference, APSIPA ASC 2017 (vol. 2018-February, pp. 1532–1536). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/APSIPA.2017.8282287.
Gilpin, L. H., Olson, D. M. & Alrashed, T. (2018). Perception of speaker personality traits using speech signals. In Conference on human factors in computing systems—Proceedings (vol. 2018-April, pp. 1–6). Association for Computing Machinery. https://doi.org/10.1145/3170427.3188557.
Mohammadi, G., & Vinciarelli, A. (2012). Automatic personality perception: Prediction of trait attribution based on prosodic features. IEEE Transactions on Affective Computing, 3(3), 273–284. https://doi.org/10.1109/T-AFFC.2012.5.
Sang, J., Zhang, H., & Xu, C. (2016). Visual BFI: An exploratory study for image-based personality test. In Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics) (vol. 9916 LNCS, pp. 95–106). Springer. https://doi.org/10.1007/978-3-319-48890-5_10.
Allen, M. S., & Walter, E. E. (2016). Personality and body image: A systematic review. Body Image, 19, 79–88. https://doi.org/10.1016/j.bodyim.2016.08.012.
Chaudhari, K., & Thakkar, A. (2019). Survey on handwriting-based personality trait identification. Expert Systems with Applications, 124, 282–308. https://doi.org/10.1016/j.eswa.2019.01.028.
Lokhande, V. R., & Gawali, B. W. (2017). Analysis of signature for the prediction of personality traits. In Proceedings—1st international conference on intelligent systems and information management, ICISIM 2017 (vol. 2017-Janua, pp. 44–49). Institute of Electrical and Electronics Engineers Inc., nov . https://doi.org/10.1109/ICISIM.2017.8122145.
Kindiroglu, A. A., Akarun, L., & Aran, O. (2017). Multi-domain and multi-task prediction of extraversion and leadership from meeting videos. Eurasip Journal on Image and Video Processing, 1, 1–14. https://doi.org/10.1186/s13640-017-0224-z.
Aslan, S., & Güdükbay, U. (2019). Multimodal video-based apparent personality recognition using long short-term memory and convolutional neural networks. arXiv:1911.00381.
Ramezani, M., Feizi-Derakhshi, M.-R., Balafar, M.-A., Asgari-Chenaghlu, M., Feizi-Derakhshi, A.-R., Nikzad-Khasmakhi, N., et al. (2022). Automatic personality prediction: An enhanced method using ensemble modeling. Neural Computing and Applications. https://doi.org/10.1007/s00521-022-07444-6.
Han, S., Huang, H., & Tang, Y. (2020). Knowledge of words: An interpretable approach for personality recognition from social media. Knowledge-Based Systems, 194, 105550. https://doi.org/10.1016/j.knosys.2020.105550.
Xue, X., Feng, J., & Sun, X. (2021). Semantic-enhanced sequential modeling for personality trait recognition from texts. Applied Intelligence. https://doi.org/10.1007/s10489-021-02277-7.
Zhu, H., Li, L., Jiang, H., & Tan, A. (2020). Inferring personality traits from attentive regions of user liked images via weakly supervised dual convolutional network. Neural Processing Letters. https://doi.org/10.1007/s11063-019-09987-7.
Tadesse, M. M., Lin, H., Xu, B., & Yang, L. (2018). Personality predictions based on user behavior on the facebook social media platform. IEEE Access, 6, 61959–61969. https://doi.org/10.1109/access.2018.2876502.
Lima, A. C. E. S., & de Castro, L. N. (2014). A multi-label, semi-supervised classification approach applied to personality prediction in social media. Neural Networks, 58, 122–130. https://doi.org/10.1016/j.neunet.2014.05.020.
Küster, L., Trahms, C., & Voigt-Antons, JN. (2018). Predicting personality traits from touchscreen based interactions. In 2018 Tenth international conference on quality of multimedia experience (QoMEX) (pp. 1–6). IEEE.
Roy, S., Roy, U., & Sinha, D. D. (2018). The probability of predicting personality traits by the way user types on touch screen. Innovations in Systems and Software Engineering, 15(1), 27–34. https://doi.org/10.1007/s11334-018-0317-6.
Shen, J., Brdiczka, O., & Liu, J. (2013). Understanding email writers: Personality prediction from email messages. In Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics) (vol. 7899 LNCS, pp. 318–330). Springer. https://doi.org/10.1007/978-3-642-38844-6_29.
Yakoub, F., Zein, M., Yasser, K., Adl, A., & Hassanien, A.E. (2015). Predicting personality traits and social context based on mining the smartphones SMS data. In Advances in intelligent systems and computing (vol. 370, pp. 511–521). Springer. https://doi.org/10.1007/978-3-319-21206-7_44.
Arnoux, P.H., Xu, A., Boyette, N., Mahmud, J., Akkiraju, R., & Sinha, V. (2017). 25 Tweets to know you: A new model to predict personality with social media. In Eleventh international AAAI conference on web and social media. http://www.aaai.org.
Allport, G. W. (1937). Personality: A psychological interpretation. Holt. https://psycnet.apa.org/record/1938-01964-000
Cattell, R. B., Eber, H. W., & Tatsuoka, M. M. (1970). Handbook for the sixteen personality factor questionnaire (16 PF) in clinical, educational, industrial, and research psychology, for use with all forms of the test. Ill Institute for Personality and Ability Testing.
Eysenck, H., Eysenck, S. B., & Giuliett. (1975). Manual of the Eysenck personality questionnaire. Hodder and Stoughton.
Briggs, K. C., & Myers, I. B. (1976). Myers–Briggs type indicator. Consulting Psychologists Press.
John, O. P., & Srivastava, S. (1999). The Big Five Trait taxonomy: History, measurement, and theoretical perspectives—PsycNET. In Handbook of personality: Theory and research (pp. 102–138). Guilford Press. https://psycnet.apa.org/record/1999-04371-004
Pennebaker, J. W., & Francis, M. E. (1996). Cognitive, emotional, and language processes in disclosure. Cognition and Emotion, 10(6), 601–626. https://doi.org/10.1080/026999396380079.
Pennebaker, J. W., & King, L. A. (1999). Linguistic styles: Language use as an individual difference. Journal of Personality and Social Psychology, 77(6), 1296–1312. https://doi.org/10.1037/0022-3514.77.6.1296.
Tausczik, Y. R., & Pennebaker, J. W. (2010). The psychological meaning of words: LIWC and computerized text analysis methods. Journal of Language and Social Psychology, 29(1), 24–54. https://doi.org/10.1177/0261927X09351676.
LIWC. (2015). Commparing LIWC 2015 and LIWC 2007|LIWC. http://liwc.wpengine.com/compare-dictionaries/
Mairesse, F., Uk, M. A., Mehl, M. R., & Moore, R. K. (2007). Using linguistic cues for the automatic recognition of personality in conversation and text. Journal of Artificial Intelligence Research, 30, 457–500. https://doi.org/10.5555/1622637.1622649.
Coltheart, M. (1981). The mrc psycholinguistic database. The Quarterly Journal of Experimental Psychology Section A, 33(4), 497–505. https://doi.org/10.1080/14640748108400805.
Wilson, M. (1988). Mrc psycholinguistic database: Machine-usable dictionary, version 2.00. Behavior Research Methods, Instruments and Computers, 20(1), 6–10. https://doi.org/10.3758/bf03202594.
Mikolov, T., Sutskever, I., Chen, K., Corrado, G., & Dean, J. (2013a). Distributed representations of words and phrases and their compositionality. arXiv:1310.4546.
Mikolov, T., Chen, K., Corrado, G., Dean, J. (2013b). Efficient estimation of word representations in vector space. In 1st International conference on learning representations, ICLR 2013—Workshop track proceedings. International Conference on Learning Representations, ICLR. http://ronan.collobert.com/senna/
Bojanowski, P., Grave, E., Joulin, A., & Mikolov, T. (2017). Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics, 5:135–146. https://doi.org/10.1162/tacl_a_00051. http://www.isthe.com/chongo/tech/comp/fnv
Pennington, J., Socher, R., & Manning, C. D. (2014). Glove: Global vectors for word representation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP) (pp. 1532–1543).
Vaswani, A, Shazeer, N. Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., & Polosukhin, I. (2017). Attention is all you need.
Minaee, S., Kalchbrenner, N., Cambria, E., Nikzad, N., Chenaghlu, M., & Gao, J. (2020). Deep learning based text classification: A comprehensive review.
Devlin, J., Chang, M.W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. In NAACL HLT 2019—2019 Conference of the North American chapter of the association for computational linguistics: Human language technologies (vol. 1, pp. 4171–4186). Association for Computational Linguistics (ACL) . https://doi.org/10.18653/v1/N19-1423.
Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., Levy, O., Lewis, M., Zettlemoyer, L., & Stoyanov, V. (2019). RoBERTa: A robustly optimized BERT training approach. arXiv:1907.11692.
Lan, Z., Chen, M., Goodman, S., Gimpel, K., Sharma, P., & Soricut, R. (2020). Albert: A lite bert for self-supervised learning of language representations.
Sanh, V., Debut, L., Chaumond, J., & Wolf, T. (2020). Distilbert, a distilled version of bert: smaller, faster, cheaper and lighter.
Poria, S., Gelbukh, A., Agarwal, B., Cambria, E., & Howard, N. (2013). Common sense knowledge based personality recognition from text. In Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics) (vol. 8266 LNAI, pp. 484–496). Springer. https://doi.org/10.1007/978-3-642-45111-9_42.
Havasi, C., Speer, R., & Alonso, J. (2007). Conceptnet 3: A flexible, multilingual semantic network for common sense knowledge. Recent advances in natural language processing (pp. 27–29). John Benjamins.
Verhoeven, B., Daelemans, W., & Smedt, T. D. (2013). Ensemble methods for personality recognition. In Seventh international AAAI conference on weblogs and social media. https://www.aaai.org/ocs/index.php/ICWSM/ICWSM13/paper/view/6177
Wright, W. R., & Chin, D. N. (2014). Personality profiling from text: Introducing part-of-speech n-grams. In Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics) (vol. 8538, pp. 243–253). Springer. https://doi.org/10.1007/978-3-319-08786-3_21.
Tighe, E. P, Ureta, J. C, Andrei Pollo, B. L., Cheng, C. K., & de Dios Bulos, R. (2016). Personality trait classification of essays with the application of feature reduction. In 4th Workshop on sentiment analysis where AI meets psychology (SAAIP 2016) (pp. 22–28).
Liu, F., Perez, J., & Nowson, S. (2016). A recurrent and compositional model for personality trait recognition from short texts. In The workshop on computational modeling of people’s opinions, personality, and emotions in social media (pp. 20–29). https://www.aclweb.org/anthology/W16-4303.pdf
Liu, F., Perez, J., & Nowson, S. (2017). A language-independent and compositional model for personality trait recognition from short texts. In 15th Conference of the European chapter of the association for computational linguistics (vol. 1, pp. 754–764). https://www.aclweb.org/anthology/E17-1071
Zheng, H., & Wu, C. (2019). Predicting personality using facebook status based on semi-supervised learning. In ACM international conference proceeding series (vol. Part F148150, pp. 59–64). Association for Computing Machinery. https://doi.org/10.1145/3318299.3318363.
Chen, M., Weinberger, K. Q, & Chen, Y. (2011). Automatic feature decomposition for single view co-training. In ICML
Guan, Z., Wu, B., Wang, B., & Liu, H. (2020). Personality2vec: Network representation learning for personality. In 2020 IEEE Fifth international conference on data science in cyberspace (DSC) (pp. 30–37). Institute of Electrical and Electronics Engineers (IEEE). https://doi.org/10.1109/dsc50466.2020.00013.
Sun, X., Liu, B., Meng, Q., Cao, J., Luo, J., & Yin, H. (2020). Group-level personality detection based on text generated networks. World Wide Web, 23(3), 1887–1906. https://doi.org/10.1007/s11280-019-00729-2.
Wang, Z., Wu, C. H., Biao Li, Q., Yan, B., & Zheng, K. F. (2020). Encoding text information with graph convolutional networks for personality recognition. Applied Sciences (Switzerland), 10(12), 4081. https://doi.org/10.3390/APP10124081. www.mdpi.com/journal/applsci
Ramezani, M., Feizi-Derakhshi, M.-R., & Balafar, M.-A. (2022b). Knowledge graph-enabled text-based automatic personality prediction. Computational Intelligence and Neuroscience, 2022, 1–18. https://doi.org/10.1155/2022/3732351. https://www.hindawi.com/journals/cin/2022/3732351/.
Ramezani, M., Feizi-Derakhshi, M.-R., & Balafar, M.-A. (2022c). Text-based automatic personality prediction using KGrAt-Net; A knowledge graph attention network classifier. https://doi.org/10.48550/arxiv.2205.13780. arXiv:2205.13780.
Majumder, N., Poria, S., Gelbukh, A., & Cambria, E. (2017). Deep learning-based document modeling for personality detection from text. IEEE Intelligent Systems, 32(2), 74–79. https://doi.org/10.1109/MIS.2017.23.
Sun, X., Liu, B., Cao, J., Luo, J., & Shen, X. (2018). Who am I? Personality detection based on deep learning for texts. In IEEE International conference on communications (vol. 2018-May). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICC.2018.8422105.
Darliansyah, A., Asif Naeem, M., Mirza, F., & Pears, R. (2019). SENTIPEDE: A smart system for sentiment-based personality detection from short texts. Journal of Universal Computer Science, 25(10), 1323–1352.
Hutto, C., & Gilbert, E. (2014). Vader: A parsimonious rule-based model for sentiment analysis of social media text. In Proceedings of the international AAAI conference on web and social media (vol. 8).
Jiang, H., Zhang, X., & Choi, J. D. (2020). Automatic text-based personality recognition on monologues and multiparty dialogues using attentive networks and contextual embeddings. In The thirty-fourth aaai conference on artificial intelligence (pp. 13821–13822). Association for Computational Linguistics (ACL)
Mehta, Y., Fatehi, S., Kazameini, A., Stachl, C., Cambria, E., & Eetemadi, S. (2020). Bottom-up and top-down: Predicting personality with psycholinguistic and language model features. IEEE International Conference of Data Mining. https://doi.org/10.1109/ICDM50108.2020.00146.
Cambria, E., Poria, S., Hazarika, D., & Kwok, K. (2018). Senticnet 5: Discovering conceptual primitives for sentiment analysis by means of context embeddings. https://www.aaai.org/ocs/index.php/AAAI/AAAI18/paper/view/16839
Mohammad, S. M., & Turney, P. D. (2013). Crowdsourcing a word-emotion association lexicon.
Mohammad, S. (2018). Obtaining reliable human ratings of valence, arousal, and dominance for 20,000 English words. In Proceedings of the 56th annual meeting of the association for computational linguistics (Vol. 1: Long Papers, pp. 174–184). Association for Computational Linguistics. https://doi.org/10.18653/v1/P18-1017.
Kazameini, A., Fatehi, S., Mehta, Y., Eetemadi, S., & Cambria, E. (2020). Personality trait detection using bagged SVM over BERT word embedding ensembles. arXiv:2010.01309
Ren, Z., Shen, Q., Diao, X., & Xu, H. (2021). A sentiment-aware deep learning approach for personality detection from text. Information Processing and Management, 58(3), 102532. https://doi.org/10.1016/j.ipm.2021.102532.
Yang, F., Quan, X., Yang, Y., & Yu, Jg. (2021). Multi-document transformer for personality detection. AAAI. https://www.aaai.org/AAAI21Papers/AAAI-2371.YangF.pdf
Yu, J., & Markov, K. (2017). Deep learning based personality recognition from Facebook status updates. In Proceedings—2017 IEEE 8th international conference on awareness science and technology, iCAST 2017 (vol. 2018-January, pp. 383–387). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICAwST.2017.8256484.
Farnadi, G., Zoghbi, S., Moens, M.-F., & De Cock, M. (2013). Recognizing personality traits using facebook status updates. In Computational personality recognition (shared task) (pp. 14–18). AAAI. http://www.aaai.org.
Wei, H., Zhang, F., Yuan, N. J., Cao, C., Fu, H., Xie, X., Rui, Y., & Ma, W. Y. (2017). Beyond the words: Predicting user personality from heterogeneous information. In WSDM 2017—Proceedings of the 10th ACM international conference on web search and data mining (pp. 305–314). Association for Computing Machinery, Inc. https://doi.org/10.1145/3018661.3018717.
Kim, Y. (2014). Convolutional neural networks for sentence classification. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP) (pp. 1746–1751). Association for Computational Linguistics. https://doi.org/10.3115/v1/D14-1181.
An, G., & Levitan, R. (2018). Lexical and acoustic deep learning model for personality recognition. In Interspeech 2018 (vol. 2018-September, pp. 1761–1765). ISCA. https://doi.org/10.21437/Interspeech.2018-2263.
Biel, J. I., & Gatica-Perez, D. (2013). The youtube lens: Crowdsourced personality impressions and audiovisual analysis of vlogs. IEEE Transactions on Multimedia, 15(1), 41–55. https://doi.org/10.1109/TMM.2012.2225032.
Biel, J. I., Tsiminaki, V., Dines, J., & Gatica-Perez, D. (2013). Hi YouTube! Personality impressions and verbal content in social video. In 2013 ACM international conference on multimodal interaction (pp. 119–126). ACM Press. https://doi.org/10.1145/2522848.2522877.
Chen, Y.-H., & Choi, J. D. (2016). Character identification on multiparty conversation: Identifying mentions of characters in TV shows. In Proceedings of the 17th annual meeting of the special interest group on discourse and dialogue (pp. 90–100). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w16-3612. http://www.freebase.com
Carducci, G., Rizzo, G., Monti, D., Palumbo, E., & Morisio, M. (2018). TwitPersonality: Computing personality traits from tweets using word embeddings and supervised learning. Information, 9(5), 127. https://doi.org/10.3390/info9050127.
Xue, D., Wu, L., Hong, Z., Guo, S., Gao, L., Wu, Z., et al. (2018). Deep learning-based personality recognition from text posts of online social networks. Applied Intelligence, 48(11), 4232–4246. https://doi.org/10.1007/s10489-018-1212-4.
Mohammad, S. M., & Kiritchenko, S. (2013). Using nuances of emotion to identify personality. In ICWSM workshop on computational personality recognition. AAAI Publications. https://nrc-publications.canada.ca/eng/view/accepted/?id=0d2da4fd-27fd-42f1-a867-d3de866ad8ea
Hassanein, M., Hussein, W., Rady, S., & Gharib, T. F. (2019). Predicting personality traits from social media using text semantics. In Proceedings—2018 13th international conference on computer engineering and systems, ICCES 2018 (pp. 184–189). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICCES.2018.8639408.
Khan, A. S., Ahmad, H., Asghar, M. Z., Saddozai, F. K., Arif, A., & Khalid, H. A. (2020). Personality classification from online text using machine learning approach. International Journal of Advanced Computer Science and Applications, 11(3), 460–476. https://doi.org/10.14569/ijacsa.2020.0110358. http://www.ijacsa.thesai.org
Amirhosseini, M. H., & Kazemian, H. (2020). Machine learning approach to personality type prediction based on the Myers-Briggs type indicator. Multimodal Technologies and Interaction, 4(1), 9. https://doi.org/10.3390/mti4010009.
16PF Questionnaire—Wikipedia. https://en.wikipedia.org/wiki/16PF_Questionnaire
Acknowledgements
This project is supported by a research grant from the University of Tabriz (number S/806).
Funding
This project is supported by a research grant of the University of Tabriz (number S/806).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Feizi-Derakhshi, AR., Feizi-Derakhshi, MR., Ramezani, M. et al. Text-based automatic personality prediction: a bibliographic review. J Comput Soc Sc 5, 1555–1593 (2022). https://doi.org/10.1007/s42001-022-00178-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s42001-022-00178-4