Abstract
In Supervised-learning of emotions from human language, to keep the emotion labels balanced in the training set is a challenging task since emotion labels are highly biased in raw data of human language. In this paper, we propose a novel method based on active learning to partially inhibit the polarization of text samples with more frequently observed emotion labels for constructing the training set, and to encourage the selection of samples with less frequently observed emotion labels. For each batch of unlabeled samples, the selected samples by our approach are given the ground truth emotion labels from human experts before they are merged to the training data. Our experiment of multi-label emotion classification on Chinese Weibo messages suggests that the proposed method is effective in constructing the label-balanced training set for text emotion classification, and the supervised text emotion classification results have been steadily improved with such training set.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The argpartition(F(X), n) function selects n elements x from X which have larger scores F(·) than the other elements in X.
References
Lippmann W (2017) Public opinion Routledge
Ravi K, Ravi V (2015) A survey on opinion mining and sentiment analysis: tasks, approaches and applications. Knowl-Based Syst 89:14–46
González-Bailón S, Paltoglou G (2015) Signals of public opinion in online communication: a comparison of methods and data sources. ANNALS Am Acad Polit Soc Sci 659(1):95–107
López-de-Ipiña K, Alonso JB, Travieso CM et al (2013) On the selection of non-invasive methods based on speech analysis oriented to automatic Alzheimer disease diagnosis. Sensors 13(5):6730–6745
Friedman HS, Booth-Kewley S (1987) Personality, type A behavior, and coronary heart disease: the role of emotional expression. J Pers Soc Psychol 53(4):783
Tacconi D, Mayora O, Lukowicz P et al (2008) Activity and emotion recognition to support early diagnosis of psychiatric diseases. In: Pervasive computing technologies for healthcare. In: Second international conference on pervasive health 2008. IEEE, pp 100–102
Yu LC, Wu JL, Chang PC et al (2013) Using a contextual entropy model to expand emotion words and their intensity for the sentiment classification of stock market news. Knowl-Based Syst 41:89–97
Rao Y, Xie H, Li J et al (2016) Social emotion classification of short text via topic-level maximum entropy model. Inf Manag 53(8):978–986
Das S, Chen M (2001) Yahoo! for amazon: extracting market sentiment from stock message boards. In: Proceedings of the Asia Pacific finance association annual conference, vol 35
Desmet P (2003) A multilayered model of product emotions. Des J 6(2):4–13
Dave K, Lawrence S, Pennock DM (2003) Mining the peanut gallery: opinion extraction and semantic classification of product reviews. In: Proceedings of the 12th international conference on World Wide Web. ACM, pp 519–528
Hassenzahl M, Diefenbach S, Göritz A (2010) Needs, affect, and interactive products–facets of user experience. Interact Comput 22(5):353–362
Wang L, Ren F, Miao D (2016) Multi-label emotion recognition of weblog sentence based on Bayesian networks. IEEJ Trans Electr Electron Eng 11(2):178–184
Picard RW (1995) Affective computing
Ekman P (1992) An argument for basic emotions. Cogn Emot 6(3–4):169–200
Zhao YY, Qin B, Liu T (2010) Sentiment analysis. J Softw 21(8):1834–1848
Taboada M, Brooke J, Tofiloski M et al (2011) Lexicon-based methods for sentiment analysis. Comput Linguist 37(2):267–307
Yang C, Lin KHY, Chen HH (2007) Building emotion lexicon from weblog corpora. In: Proceedings of the 45th annual meeting of the ACL on interactive poster and demonstration sessions. Association for Computational Linguistics, pp 133–136
Matsumoto K, Ren F (2011) Estimation of word emotions based on part of speech and positional information. Comput Hum Behav 27(5):1553–1564
Picard RW, Vyzas E, Healey J (2001) Toward machine emotional intelligence: analysis of affective physiological state. IEEE Trans Pattern Anal Mach Intell 23(10):1175–1191
Bhowmick PK (2009) Reader perspective emotion analysis in text through ensemble based multi-label classification framework]. Comput Inf Sci 2(4):64
Tsoumakas G, Katakis I (2007) Multi-label classification: an overview. Int J Data Warehouse Min (IJDWM) 3(3):1–13
Tsoumakas G, Vlahavas I (2007) Random k-labelsets: an ensemble method for multilabel classification. In: European conference on machine learning. Springer, Berlin, Heidelberg, pp 406–417
Ren F, Kang X (2013) Employing hierarchical Bayesian networks in simple and complex emotion topic analysis. Comput Speech Lang 27(4):943–968
Liu H, Lieberman H, Selker T (2003) A model of textual affect sensing using real-world knowledge. In: Proceedings of the 8th international conference on Intelligent user interfaces. ACM, pp 125–132
Reyes O, Morell C, Ventura S (2018) Effective active learning strategy for multi-label learning. Neurocomputing 273:494–508
Li X, Guo Y (2013) Active learning with multi-label SVM classification. IJCAI, 1479–1485
Kang X, Wu Y, Ren F (2018) Progressively improving supervised emotion classification through active learning. In: International conference on multi-disciplinary trends in artificial intelligence. Springer, Cham, pp 49–57
Acknowledgements
This research has been partially supported by the Ministry of Education, Science, Sports and Culture of Japan, Grant-in-Aid for Scientific Research(A), 15H01712.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Shi, X., Kang, X., Liao, P., Ren, F. (2021). Building Label-Balanced Emotion Corpus Based on Active Learning for Text Emotion Classification. In: Lu, H. (eds) Artificial Intelligence and Robotics. ISAIR 2019. Studies in Computational Intelligence, vol 917. Springer, Cham. https://doi.org/10.1007/978-3-030-56178-9_2
Download citation
DOI: https://doi.org/10.1007/978-3-030-56178-9_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-56177-2
Online ISBN: 978-3-030-56178-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)