Abstract
There are several standard methods used to measure personal values, including the Schwartz Values Survey and the World Values Survey. While these tools are based on well-established questionnaires, they are expensive to administer at a large scale and rely on respondents to self-report their values rather than observing what people actually choose to write about. We employ a lexicon-based method that can computationally measure personal values on a large scale. Our approach is not limited to word-counting as we explore and evaluate several alternative approaches to quantifying the usage of value-related themes in a given document. We apply our methodology to a large blog dataset comprised of text written by users from different countries around the world in order to quantify cultural differences in the expression of person values on blogs. Additionally, we analyze the relationship between the value themes expressed in blog posts and the values measured for some of the same countries using the World Values Survey.
Y. Shen and S. R. Wilson—Equal contributions
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Change history
11 November 2019
The original version of this chapter was revised. A missing citation was added and the bibliography was updated accordingly.
Notes
- 1.
The words for each category are available from the resource available in the “Values Lexicon” section at http://nlp.eecs.umich.edu/downloads.html.
- 2.
While other contextual word embeddings like ELMo [16] do a good job of capturing the meanings of words in specific contexts, lexicons such as the values lexicon that we use is this study do not provide contexts along with the category-specific words, and so further research would be required to determine how to best create, e.g., value-specific dictionary embeddings with ELMo to use within the DDR framework.
- 3.
- 4.
- 5.
- 6.
Based on estimations provided at https://en.wikipedia.org/wiki/List_of_countries_by_English-speaking_population.
- 7.
At least 1,000 users claim to be from that country.
- 8.
We collected these using code from https://github.com/costaspappus/Blogs-Scraper.
- 9.
We use https://www.crummy.com/software/BeautifulSoup/ to clean the HTML.
- 10.
- 11.
As the overall results are not expected to change by a noticeable degree based on our evaluation, we opt not to use the UCRC method in the present analysis.
- 12.
We use data from round 6 of the WVS, available at http://www.worldvaluessurvey.org/.
References
Ball-Rokeach, S., Rokeach, M., Grube, J.W.: The Great American Values Test: Influencing Behavior and Belief Through Television. Free Press, New York (1984)
Bojanowski, P., Grave, E., Joulin, A., Mikolov, T.: Enriching word vectors with subword information. arXiv preprint arXiv:1607.04606 (2016)
Boyd, R.L., Wilson, S.R., Pennebaker, J.W., Kosinski, M., Stillwell, D.J., Mihalcea, R.: Values in words: using language to evaluate and understand personal values. In: Ninth International AAAI Conference on Web and Social Media (2015)
Chung, C.K., Pennebaker, J.W.: Finding values in words: using natural language to detect regional variations in personal concerns. In: Geographical psychology: Exploring the interaction of environment and behavior, pp. 195–216 (2014). https://doi.org/10.1037/14272-011
Faulkner, S.L., Baldwin, J.R., Lindsley, S.L., Hecht, M.L.: Layers of meaning: an analysis of definitions of culture. In: Redefining Culture: Perspectives Across the Disciplines, pp. 27–51 (2006)
Garten, J., Hoover, J., Johnson, K.M., Boghrati, R., Iskiwitch, C., Dehghani, M.: Dictionaries and distributions: Combining expert knowledge and large scale textual data content analysis. Behav. Res. Methods 50(1), 344–361 (2018)
Graham, J., Haidt, J., Nosek, B.A.: Liberals and conservatives rely on different sets of moral foundations. J. Pers. Soc. Psychol. 96(5), 1029 (2009)
Hofstede, G.: Dimensionalizing cultures: the hofstede model in context. Online Readings Psychol. Cult. 2(1), 8 (2011)
Inglehart, R., et al.: World values survey: round six-country-pooled datafile 2010–2014. JD Systems Institute, Madrid (2014)
Kendall, M.G.: The treatment of ties in ranking problems. Biometrika 33(3), 239–251 (1945)
Knafo, A., Roccas, S., Sagiv, L.: The value of values in cross-cultural research: a special issue in honor of shalom schwartz (2011)
Landes, S., Leacock, C., Tengi, R.I.: Building semantic concordances. WordNet Electron. Lexical Database 199(216), 199–216 (1998)
Maaten, L.V.D., Hinton, G.: Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008)
Miller, G.: WordNet: An Electronic Lexical Database. MIT press, Cambridge (1998)
Pennebaker, J.W., Boyd, R.L., Jordan, K., Blackburn, K.: The development and psychometric properties of liwc2015. Technical report (2015)
Peters, M.E., et al.: Deep contextualized word representations. arXiv preprint arXiv:1802.05365 (2018)
Ramirez-Esparza, N., Chung, C.K., Kacewicz, E., Pennebaker, J.W.: The psychology of word use in depression forums in English and in Spanish: testing two text analytic approaches. In: In the International AAAI Conference on Web and Social Media (2008)
Rohan, M.J.: A rose by any name? the values construct. Pers. Soc. Psychol. Rev. 4(3), 255–277 (2000). https://doi.org/10.1207/S15327957PSPR0403_4
Rokeach, M.: Beliefs, Attitudes, and Values, vol. 34. Jossey-Bass, San Francisco (1968)
Rokeach, M.: The Nature of Human Values, vol. 438. Free press, New York (1973)
Schwartz, S.H.: Universals in the content and structure of values: theoretical advances and empirical tests in 20 countries. Adv. Exp. Soc. Psychol. 25, 1–65 (1992). https://doi.org/10.1016/S0065-2601(08)60281-6
Schwartz, S.H.: An overview of the Schwartz theory of basic values. Online Readings Psychol. Cult. 2(1), 11 (2012)
Taboada, M., Brooke, J., Tofiloski, M., Voll, K., Stede, M.: Lexicon-based methods for sentiment analysis. Comput. Linguist. 37(2), 267–307 (2011)
Wilson, S.R., Shen, Y., Mihalcea, R.: Building and validating hierarchical lexicons with a case study on personal values. In: Staab, S., Koltsova, O., Ignatov, D.I. (eds.) SocInfo 2018. LNCS, vol. 11185, pp. 455–470. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01129-1_28
Acknowledgments
This material is based in part upon work supported by the Michigan Institute for Data Science, by the National Science Foundation (grant #1815291), and by the John Templeton Foundation (grant #61156). Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author and do not necessarily reflect the views of the Michigan Institute for Data Science, the National Science Foundation, or the John Templeton Foundation.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Shen, Y., Wilson, S.R., Mihalcea, R. (2019). Measuring Personal Values in Cross-Cultural User-Generated Content. In: Weber, I., et al. Social Informatics. SocInfo 2019. Lecture Notes in Computer Science(), vol 11864. Springer, Cham. https://doi.org/10.1007/978-3-030-34971-4_10
Download citation
DOI: https://doi.org/10.1007/978-3-030-34971-4_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-34970-7
Online ISBN: 978-3-030-34971-4
eBook Packages: Computer ScienceComputer Science (R0)