Abstract
Emojis are now frequently used in online communication, which express rich meaningful information and emotional messages. However, communication will fail if the meaning of different Emojis is not well understood, especially for the speakers of different languages and those from different countries/regions. There are very few researches about the Emoji dataset currently, since the process of building an Emoji database is labor-intensive and time-consuming. To solve this problem, we propose an active learning-based framework for building Japanese text datasets containing Emoji. This approach aims to achieve fast and balanced labeling of data given a small and unevenly distributed source of Emoji data. The active learning algorithm selects unlabeled data with high information content for manual labeling and updates the model parameters with the manually labeled data, in which way a large Emoji database is iteratively constructed. The constructed Japanese Emoji database contains hundred types of Emojis, with at least hundred pieces of Our experiment suggests that the Emoji dataset can be efficiently constructed with balanced data and the result dataset can provide rich information for text emotion classification, by rendering an accuracy of over 82%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Al Rashdi, F.: Functions of Emojis in WhatsApp interaction among Omanis. Discourse Context Media 26, 117–126 (2018)
Njenga, K.: Social media information security threats: anthropomorphic Emoji analysis on social engineering, in Paper Presented at the IT Convergence and Security 2017 (Seoul) (2018)
Chik, A., Vasquez, C.: A comparative multimodal analysis of restaurant reviews from two geographical contexts. Vis. Commun. 16, 3–26 (2017)
Derks, D., Fischer, A.H., Bos, A.E.R.: The role of emotion in computer-mediated communication: a review. Comput. Hum. Behav. 24, 766–785 (2008)
Derks, D., Bos, A.E., Von Grumbkow, J.: Emoticons and online message interpretation. Soc. Sci. Comput. Rev. 26, 379–388 (2008)
Gao, M., Zhang, Z., Yu, G., Arık, S.Ö., Davis, L.S., Pfister, T.: Consistency-based semi-supervised active learning: towards minimizing labeling cost. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12355, pp. 510–526. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58607-2_30
Cheng, L.: Do I mean what I say and say what I mean? A cross cultural approach to the use of emoticons & Emojis in CMC messages. Fonseca J. Commun. (2017)
Kaneko, D., Toet, A., Ushiama, S., Brouwer, A.-M., Kallen, V., van Erp, J.B.F.: EmojiGrid: a 2D pictorial scale for cross-cultural emotion assessment of negatively and positively valenced food. Food Res. Int. 115, 541–551 (2019)
Ljubešić, N., Fišer, D.: A global analysis of Emoji usage. In: Paper Presented at the Proceedings of the 10th Web as Corpus Workshop (2016)
Barbieri, F., Kruszewski, G., Ronzano, F., Saggion, H.: How cosmopolitan are Emojis?: exploring Emojis usage and meaning over different languages with distributional semantics In: Paper Presented at the 2016 ACM on Multimedia Conference (2016)
Sugiyama, S.: Kawaii meiru and Maroyaka neko: mobile Emoji for relationship maintenance and aesthetic expressions among Japanese teens. First Monday 20, 1 (2015)
Barbieri, F., Ronzano, F., Saggion, H.: What does this Emoji mean? A vector space skip-gram model for Twitter Emojis. In: Paper presented at the International Conference on Language Resources and Evaluation, LERC (2016)
Tauch, C., Kanjo, E.: The roles of Emojis in mobile phone notifications. In: Paper Presented at the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing: Adjunct (Heidelberg) (2016)
Settles, B.: Active learning literature survey (2009)
Gilyazev, R., Turdakov, D.Y.: Active learning and crowdsourcing: a survey of optimization methods for data labeling. Program. Comput. Softw. 44(6), 476–491 (2018)
Schroder, C., Niekler, A.: A survey of active learning for text classification using deep neural networks, arXiv preprint arXiv:2008.07267 (2020)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Li, X., Kang, X., Ren, F. (2022). Use Active Learning to Construct Japanese Emoji Emotion Database. In: Yang, S., Lu, H. (eds) Artificial Intelligence and Robotics. ISAIR 2022. Communications in Computer and Information Science, vol 1701. Springer, Singapore. https://doi.org/10.1007/978-981-19-7943-9_29
Download citation
DOI: https://doi.org/10.1007/978-981-19-7943-9_29
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-7942-2
Online ISBN: 978-981-19-7943-9
eBook Packages: Computer ScienceComputer Science (R0)