Skip to main content

The Corpus of Emotional Valences for 33,669 Chinese Words Based on Big Data

  • Conference paper
  • First Online:
HCI in Business, Government and Organizations (HCII 2022)

Abstract

Emotion theories are mainly classified as categorical or dimensional approaches. Given the importance of emotional words in emotion research, researchers have constructed a co-occurrence corpus of 7 types of emotion words through word co-occurrence and big data corpora. However, in addition to the categorical approach, the dimensional approach plays an important role in natural language processing. In particular, valence has an important influence on the study of emotion and language. In this study, the co-occurrence corpus of 7 types of emotion words constructed by Chen et al. [1] was expanded to create a corpus of emotional valences. Then, stepwise multiple regression analysis was performed with the predicted criterion variables and 15 predictor variables. The criterion variables were the emotional valences of 553 frequently occurring stimulus words included in the Chinese Word Association Norms [2]. The predictor variables included the emotion co-occurrences scores for 2 clusters (a cluster of literal emotion words and a cluster of metaphorical emotion words) and 7 types of emotions (happiness, love, surprise, sadness, anger, disgust, and fear) [the emotional words were common words from both the co-occurrence corpus of 7 types of emotion words constructed by Chen et al. [1] and the Chinese Word Association Norms established by Hu et al. [2]] and the virtue word co-occurrences score. The results showed that the scores for literal happiness word co-occurrences, metaphorical happiness word co-occurrences, literal disgust word co-occurrences, literal fear word co-occurrences, and virtue word co-occurrences could predict the valence values of emotion words, with the multiple correlation coefficients of multiple regression analyses reaching .729. Subsequently, the valence values of 33,669 words were established using the formula obtained from the multiple regression analysis of the 553 words. Next, the correlation between the actual valence values and the predicted valence values was analyzed to test the cross-validity of the established valences using the common words in the norm established by Lee and Lee [3] for the emotionality ratings and free associations of 267 common Chinese words. The results showed that the correlation between the 2 was .755, indicating that the predicted values generated by the big data corpora and word co-occurrence had a degree of similarity with the manually determined values. Based on theories and tests, this study used the co-occurrence data of 7 emotions and virtue to construct the corpus of emotional valences for 33,669 Chinese words. The results showed that the combined use of big data corpora and word co-occurrence can effectively expand existing corpora that were established based on emotional categories, improve the efficiency of manual construction of corpora, and establish a larger corpus of emotional words.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Chen, C.H., et al.: Building a “Corpus of 7 types emotion co-occurrences words” of Chinese emotional words with Big Data Corpus (in Chinese) [Paper presentation]. In: 24th International Conference on Human-Computer Interaction (HCII2022), Virtual Only Conference, June 2022 (2022)

    Google Scholar 

  2. Hu, J.-F., Chen, Y.-C., Zhuo, S.-L., Chen, H.-C., Chang, Y.-L., Sung, Y.-T.: Word “association” and “associated” norms for 1200 Chinese two-character words (in Chinese). Bull. Educ. Psychol. 49(1), 137–161 (2017). https://doi.org/10.6251/BEP.20161111

    Article  Google Scholar 

  3. Lee, H.M., Lee, Y.S.: Emotionality ratings and free association of 267 common Chinese words (in Chinese). Formosa J. Mental Health 24(4), 495–524 (2011)

    Google Scholar 

  4. Darwin, C., Ekman, P., Prodger, P.: The Expression of the Emotions in Man and Animals. Oxford University Press, New York (1998)

    Google Scholar 

  5. Russell, J.A., Pratt, G.: A description of the affective quality attributed to environments. J. Pers. Soc. Psychol. 38(2), 311–322 (1980). https://doi.org/10.1037/0022-3514.38.2.311

    Article  Google Scholar 

  6. Eibl-Eibesfeldt, I.: The Expressive Behavior of the Deaf-and-Blind Born Social Comm. and Mov., Ed. I., pp. 163–193. Academic Press (1973)

    Google Scholar 

  7. Plutchik, R.: A general psychoevolutionary theory of emotion. Theor. Emotion 1, 3–31 (1980)

    Article  Google Scholar 

  8. Ekman, P., Friesen, W.V.: Measuring facial movement. Environ. Psychol. Nonverb. Behav. 1(1), 56–75 (1976). https://doi.org/10.1007/bf01115465

    Article  Google Scholar 

  9. Ekman, P., Friesen, W.V., Ellsworth, P.: Emotion in the Human Face: Guidelines for Research and an Integration of Findings. Pergamon Press, Oxford (1972)

    Google Scholar 

  10. Shaver, P., Schwartz, J., Kirson, D., O’Connor, C.: Emotion knowledge: Further exploration of a prototype approach. J. Pers. Soc. Psychol. 52(6), 1061–1086 (1987). https://doi.org/10.1037/0022-3514.52.6.1061

    Article  Google Scholar 

  11. Fontaine, J.R., Scherer, K.R., Roesch, E.B., Ellsworth, P.C.: The world of emotions is not two-dimensional. Psychol. Sci. 18(12), 1050–1057 (2007). https://doi.org/10.1111/j.1467-9280.2007.02024.x

    Article  Google Scholar 

  12. Lang, P.J., Bradley, M.M., Cuthbert, B.N.: Emotion, attention, and the startle reflex. Psychol. Rev. 97(3), 377–395 (1990). https://doi.org/10.1037/0033-295X.97.3.377

    Article  Google Scholar 

  13. Larsen, R.J., Diener, E.: Promises and Problems with the Circumplex Model of Emotion, pp. 25–59. Sage Publications Inc., Thousand Oaks (1992)

    Google Scholar 

  14. Osgood, C.E., Suci, G.J., Tannenbaum, P.H.: The Measurement of Meaning. University of Illinois Press (1957)

    Google Scholar 

  15. Thayer, R.E.: Activation-deactivation adjective check list: current overview and structural analysis. Psychol. Rep. 58(2), 607–614 (1986). https://doi.org/10.2466/pr0.1986.58.2.607

    Article  Google Scholar 

  16. Kuperman, V., Estes, Z., Brysbaert, M., Warriner, A.B.: Emotion and language: valence and arousal affect word recognition. J. Exp. Psychol. Gen. 143(3), 1065 (2014)

    Article  Google Scholar 

  17. Kauschke, C., Bahn, D., Vesker, M., Schwarzer, G.: The role of emotional valence for the processing of facial and verbal stimuli—positivity or negativity bias? Front. Psychol. 10, 1654 (2019)

    Article  Google Scholar 

  18. Gendron, M., Lindquist, K.A., Barsalou, L., Barrett, L.F.: Emotion words shape emotion percepts. Emotion 12(2), 314–325 (2012). https://doi.org/10.1037/a0026007

    Article  Google Scholar 

  19. Ko, Y.H., Cho, S.L.: The relationship between proneness to borderline personality and suicide ideation based on the analysis of mental function and free association of emotion words (in Chinese). Fu-Jen Journal of Medicine 11(2), 59–71 (2013)

    Google Scholar 

  20. Bradley, M.M., Lang, P.J.: Affective Norms for English Words (ANEW): Instruction Manual and Affective Ratings, pp. 1–45. The Center for Research in Psychophysiology (1999)

    Google Scholar 

  21. Mukherjee, S., Heise, D.R.: Affective meanings of 1,469 Bengali concepts. Behav. Res. Methods 49(1), 184–197 (2016). https://doi.org/10.3758/s13428-016-0704-6

    Article  Google Scholar 

  22. Stadthagen-Gonzalez, H., Imbault, C., Pérez Sánchez, M.A., Brysbaert, M.: Norms of valence and arousal for 14,031 Spanish words. Behav. Res. Methods 49(1), 111–123 (2016). https://doi.org/10.3758/s13428-015-0700-2

    Article  Google Scholar 

  23. Warriner, A.B., Kuperman, V., Brysbaert, M.: Norms of valence, arousal, and dominance for 13,915 English lemmas. Behav. Res. Methods 45(4), 1191–1207 (2013). https://doi.org/10.3758/s13428-012-0314-x

    Article  Google Scholar 

  24. Wang, Y.N., Zhou, L.M., Luo, Y.J.: The pilot establishment and evaluation of Chinese affective words system (in Chinese). Chin. Ment. Health J. 22(8), 608–612 (2008)

    Google Scholar 

  25. Cho, S.L., Chen, H.C., Cheng, C.M.: Taiwan corpora of Chinese emotions and relevant psychophysiological data-a study on the norms of Chinese emotional words (in Chinese). Chinese J. Psychol. 55(4), 493–523 (2013). https://doi.org/10.6129/cjp.20131026

    Article  Google Scholar 

  26. Chen, H.C., Chan, Y.C., Feng, Y.J.: Taiwan corpora of Chinese emotions and relevant psychophysiological data-a norm of emotion metaphors in Chinese (in Chinese). Chinese J. Psychol. 55(4), 525–553 (2013). https://doi.org/10.6129/CJP.20130112b

    Article  Google Scholar 

  27. Yao, Z., Wu, J., Zhang, Y., Wang, Z.: Norms of valence, arousal, concreteness, familiarity, imageability, and context availability for 1,100 Chinese words. Behav. Res. Methods 49(4), 1374–1385 (2016). https://doi.org/10.3758/s13428-016-0793-2

    Article  Google Scholar 

  28. Haidt, J.: The Moral Emotions Handbook of Affective Sciences, pp. 852–870. Oxford University Press, New York (2003)

    Google Scholar 

  29. Rich, J.M.: Moral education and the emotions. J. Moral Educ. 9(2), 81–87 (1980). https://doi.org/10.1080/0305724800090202

    Article  Google Scholar 

  30. Rudolph, U., Tscharaktschiew, N.: An attributional analysis of moral emotions: naïve scientists and everyday judges. Emot. Rev. 6(4), 344–352 (2014). https://doi.org/10.1177/1754073914534507

    Article  Google Scholar 

  31. Jackson, M.: Emotion and Psyche. John Hunt Publishing Limited (2010)

    Google Scholar 

  32. Tangney, J.P., Stuewig, J., Mashek, D.J.: Moral emotions and moral behavior. Annu. Rev. Psychol. 58, 345–372 (2007). https://doi.org/10.1146/annurev.psych.56.091103.070145

    Article  Google Scholar 

  33. Hinojosa, J.A., et al.: Affective norms of 875 Spanish words for five discrete emotional categories and two emotional dimensions. Behav. Res. Methods 48(1), 272–284 (2015). https://doi.org/10.3758/s13428-015-0572-5

    Article  Google Scholar 

  34. Baccianella, S., Esuli, A., Sebastiani, F.: Sentiwordnet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. Paper Presented at the Lrec (2010)

    Google Scholar 

  35. Taboada, M., Brooke, J., Tofiloski, M., Voll, K., Stede, M.: Lexicon-based methods for sentiment analysis. Comput. Linguist. 37(2), 267–307 (2011). https://doi.org/10.1162/COLI_a_00049

    Article  Google Scholar 

  36. Ku, L.-W., Chen, H.-H.: Mining opinions from the web: beyond relevance retrieval. J. Am. Soc. Inf. Sci. Technol. 58(12), 1838–1850 (2007). https://doi.org/10.1002/asi.20630

    Article  Google Scholar 

  37. Pennebaker, J.W., Booth, R.J., Francis, M.E.: Linguistic Inquiry and Word Count (LIWC2007) (2007)

    Google Scholar 

  38. Pennebaker, J.W., Booth, R.J., Francis, M.E.: Operator’s manual: linguistic inquiry and word count: LIWC2007, Austin: LIWC. net (2007). http://homepage.psy.utexas.edu/HomePage/Faculty/Pennebaker/Reprints/LIWC2007_OperatorManual.pdf. Accessed 1 Oct 2013

  39. Pennebaker, J.W., Francis, M.E., Booth, R.J.: Linguistic Inquiry and Word Count: LIWC 2001, vol. 71. Lawrence Erlbaum Associates, Mahway (2001)

    Google Scholar 

  40. Jaffe, E.: What Big Data Means for Psychological Science. Observer 27(6) (2014)

    Google Scholar 

  41. Chang Chien, C.Y.: Analysis of Life Curriculum Textbooks in Elementary Schools with Regard to Character Building Education (in Chinese). Master’s thesis in the summer social education program of the Department of Further Education, National Taitung University (2010)

    Google Scholar 

  42. Baroni-Urbani, C., Buser, M.W.: Similarity of binary data. Syst. Biol. 25(3), 251–259 (1976). https://doi.org/10.2307/2412493

    Article  Google Scholar 

  43. Pecina, P.: Lexical association measures and collocation extraction. Lang. Resour. Eval. 44(1–2), 137–158 (2009). https://doi.org/10.1007/s10579-009-9101-4

    Article  Google Scholar 

Download references

Acknowledgements

This work was financially supported by the grant MOST-111-2634-F-002-004 from Ministry of Science and Technology (MOST) of Taiwan, the MOST AI Biomedical Research Center, and the “Institute for Research Excellence in Learning Sciences” and “Chinese Language and Technology Center” of National Taiwan Normal University from The Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education in Taiwan.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hsueh-Chih Chen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chang, CY. et al. (2022). The Corpus of Emotional Valences for 33,669 Chinese Words Based on Big Data. In: Fui-Hoon Nah, F., Siau, K. (eds) HCI in Business, Government and Organizations. HCII 2022. Lecture Notes in Computer Science, vol 13327. Springer, Cham. https://doi.org/10.1007/978-3-031-05544-7_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-05544-7_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-05543-0

  • Online ISBN: 978-3-031-05544-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics