Abstract
In this study, we illustrate an ongoing work regarding building an Italian textual dataset for emotion recognition for HRI. The idea is to build a dataset with a well-defined methodology based on creating ad-hoc dialogues from scratch. Once that the criteria had been defined, we used ChatGPT to help us generate dialogues. Human experts in psychology have revised each dialogue. In particular, we analyzed the generated dialogues to observe the balance of the dataset under different parameters. During the analysis, we calculated the distribution of context types, gender, consistency between context and emotion, and interaction quality. With “quality” we mean the adherence of text to the desired manifestation of emotions. After the analysis, the dialogues were modified to bring out specific emotions in specific contexts. Significant results emerged that allowed us to reorient the generation of subsequent dialogues. This preliminary study allowed us to draw lines to guide subsequent and more substantial dataset creation in order to achieve increasingly realistic interactions in HRI scenarios.
This research has been made in the context of the Excellence Chair in Big Data Management and Analytics at University of Paris City, Paris, France.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Crowdflower: the emotion in text (2016). https://www.figure-eight.com/data/sentiment-analysis-emotion-text/
Basile, V., Maria, D.M., Danilo, C., Passaro, L.C., et al.: EVALITA 2020: overview of the 7th evaluation campaign of natural language processing and speech tools for Italian. In: Proceedings of the Seventh Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2020), pp. 1–7. CEUR-ws (2020)
Braunhofer, M., Elahi, M., Ricci, F.: User personality and the new user problem in a context-aware point of interest recommender system. In: Tussyadiah, I., Inversini, A. (eds.) Information and Communication Technologies in Tourism 2015, pp. 537–549. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-14343-9_39. https://doi.org/10.13140/RG.2.1.2493.8001
Buechel, S., Hahn, U.: EmoBank: studying the impact of annotation perspective and representation format on dimensional emotion analysis. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, Valencia, Spain, pp. 578–585. Association for Computational Linguistics (2017). https://aclanthology.org/E17-2092
Caselli, T., Novielli, N., Patti, V., Rosso, P.: Sixth evaluation campaign of natural language processing and speech tools for Italian: final workshop (EVALITA 2018). In: EVALITA 2018. CEUR Workshop Proceedings. CEUR-WS.org (2018)
Chatterjee, A., Gupta, U., Chinnakotla, M.K., Srikanth, R., Galley, M., Agrawal, P.: Understanding emotions in text using deep learning and big data. Comput. Hum. Behav. 93, 309–317 (2019)
Chiavetta, F., Bosco, G.L., Pilato, G., et al.: A lexicon-based approach for sentiment classification of amazon books reviews in Italian language. In: WEBIST 2016, vol. 2, pp. 159–170 (2016)
Corazza, M., Menini, S., Cabrio, E., Tonelli, S., Villata, S.: A multilingual evaluation for online hate speech detection. ACM Trans. Internet Technol. (TOIT) 20(2), 1–22 (2020)
Coronato, A., Cuzzocrea, A.: An innovative risk assessment methodology for medical information systems. IEEE Trans. Knowl. Data Eng. 34(7), 3095–3110 (2022)
Cuzzocrea, A., Leung, C.K., MacKinnon, R.K.: Mining constrained frequent itemsets from distributed uncertain data. Future Gener. Comput. Syst. 37, 117–126 (2014)
Cuzzocrea, A., Martinelli, F., Mercaldo, F., Vercelli, G.V.: Tor traffic analysis and detection via machine learning techniques. In: 2017 IEEE International Conference on Big Data (IEEE BigData 2017), Boston, MA, USA, 11–14 December 2017, pp. 4474–4480. IEEE Computer Society (2017)
Cuzzocrea, A., Pilato, G.: A composite framework for supporting user emotion detection based on intelligent taxonomy handling. Log. J. IGPL 29(2), 207–219 (2021)
Fantini, A., Asta, A., Cuzzocrea, A., Pilato, G.: Building and assessing an Italian textual dataset for emotion recognition in human-robot interactions. In: The 29th International DMS Conference on Visualization and Visual Languages, San Francisco, USA, 29 June–3 July 2023. KSI Research Inc. (2023)
Goel, A., Gautam, J., Kumar, S.: Real time sentiment analysis of tweets using Naive Bayes. In: 2016 2nd International Conference on Next Generation Computing Technologies (NGCT), pp. 257–261. IEEE (2016)
Joshi, A., Tripathi, V., Bhattacharyya, P., Carman, M.J.: Harnessing sequence labeling for sarcasm detection in dialogue from TV series ‘Friends’. In: Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, Berlin, Germany, pp. 146–155. Association for Computational Linguistics (2016). https://doi.org/10.18653/v1/K16-1015. https://aclanthology.org/K16-1015
Koelstra, S., et al.: DEAP: a database for emotion analysis; using physiological signals. IEEE Trans. Affect. Comput. 3(1), 18–31 (2011)
Leung, C.K., Braun, P., Hoi, C.S.H., Souza, J., Cuzzocrea, A.: Urban analytics of big transportation data for supporting smart cities. In: Ordonez, C., Song, I.-Y., Anderst-Kotsis, G., Tjoa, A.M., Khalil, I. (eds.) DaWaK 2019. LNCS, vol. 11708, pp. 24–33. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-27520-4_3
Leung, C.K., Cuzzocrea, A., Mai, J.J., Deng, D., Jiang, F.: Personalized DeepInf: enhanced social influence prediction with deep learning and transfer learning. In: 2019 IEEE International Conference on Big Data (IEEE BigData), Los Angeles, CA, USA, 9–12 December 2019, pp. 2871–2880. IEEE (2019)
Mohammad, S., Bravo-Marquez, F.: WASSA-2017 shared task on emotion intensity. In: Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, Copenhagen, Denmark, pp. 34–49. Association for Computational Linguistics (2017). https://doi.org/10.18653/v1/W17-5205. https://aclanthology.org/W17-5205
Mohammad, S.M.: Word affect intensities. arXiv preprint arXiv:1704.08798 (2017)
Mohammad, S.M., Bravo-Marquez, F., Salameh, M., Kiritchenko, S.: SemEval-2018 task 1: affect in tweets. In: Proceedings of International Workshop on Semantic Evaluation (SemEval-2018), New Orleans, LA, USA (2018)
Mohammad, S.M., Sobhani, P., Kiritchenko, S.: Stance and sentiment in tweets. ACM Trans. Internet Technol. (TOIT) 17(3), 1–23 (2017)
Pilato, G., D’Avanzo, E.: Data-driven social mood analysis through the conceptualization of emotional fingerprints. Procedia Comput. Sci. 123, 360–365 (2018)
Poria, S., Majumder, N., Mihalcea, R., Hovy, E.: Emotion recognition in conversation: research challenges, datasets, and recent advances. IEEE Access 7, 100943–100953 (2019)
Rollo, F., Bonisoli, G., Po, L.: Supervised and unsupervised categorization of an imbalanced Italian crime news dataset. In: Ziemba, E., Chmielarz, W. (eds.) FedCSIS-AIST/ISM -2021. LNBIP, vol. 442, pp. 117–139. Springer, Cham (2022). https://doi.org/10.1007/978-3-030-98997-2_6
Scherer, K.R., Wallbott, H.G.: Evidence for universality and cultural variation of differential emotion response patterning: correction. J. Pers. Soc. Psychol. 67(1), 55 (1994)
Schuff, H., Barnes, J., Mohme, J., Padó, S., Klinger, R.: Annotation, modelling and analysis of fine-grained emotions on a stance and sentiment detection corpus. In: Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pp. 13–23 (2017)
Shmueli, B., Ku, L.W.: SocialNLP EmotionX 2019 challenge overview: predicting emotions in spoken dialogues and chats. arXiv preprint arXiv:1909.07734 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Asta, A., Cuzzocrea, A., Fantini, A., Pilato, G., Bringas, P.G. (2023). Supporting Emotion Recognition in Human-Robot Interactions: An Experimental Italian Textual Dataset. In: García Bringas, P., et al. Hybrid Artificial Intelligent Systems. HAIS 2023. Lecture Notes in Computer Science(), vol 14001. Springer, Cham. https://doi.org/10.1007/978-3-031-40725-3_41
Download citation
DOI: https://doi.org/10.1007/978-3-031-40725-3_41
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-40724-6
Online ISBN: 978-3-031-40725-3
eBook Packages: Computer ScienceComputer Science (R0)