Abstract
In 2015, the United Nation General Assembly adopted the 2030 Agenda for Sustainable Development and its 17 Sustainable Development Goals aiming at ending all forms of poverty, fighting inequalities, and tackling climate change. We collected Twitter data about the 2030 Agenda from May 9th to November 9th, 2018. The aim of this work is to obtain a classification of each tweet in the corpus according to the “Information”—“Action” categories, in order to detect whether a tweet refers to an event or it has only an informative-disclosure purpose. It seems particularly interesting to understand how and to what extent people and organizations are playing a more active role in shaping the process of responding locally and internationally to climate change. Explicit intention to act or inform had been captured by hand coding of a randomly selected sample of tweets and then the classification had been extended to the whole corpus through a supervised machine learning method. Overall, our classification supervised model has produced satisfactory results.
Similar content being viewed by others
Notes
The 17 SDGs: 1. No Poverty, 2. Zero Hunger, 3. Good Health and Well-being, 4. Quality Education, 5. Gender Equality, 6. Clean Water and Sanitation, 7. Affordable and Clean Energy, 8. Decent Work and Economic Growth, 9. Industry, Innovation and Infrastructure, 10. Reduced Inequality, 11. Sustainable Cities and Communities, 12. Responsible Consumption and Production, 13. Climate Action, 14. Life Below Water, 15. Life on Land, 16. Peace and Justice Strong Institutions, 17. Partnerships to achieve the Goals.
References
Beck, U., Blok, A., Tyfield, D., & Zhang, J. Y. (2013). Cosmopolitan communities of climate risk: Conceptual and empirical suggestions for a new research agenda. Global Networks, 13(1), 1–21.
Boström, M., Lidskog, R., & Uggla, Y. (2017). A reflexive look at reflexivity in environmental sociology. Environmental Sociology, 3(1), 6–16.
Boyd, D., & Crawford, K. (2012). Critical questions for big data: Provocations for a cultural, technological, and scholarly phenomenon. Information, Communication & Society, 15(5), 662–679. https://doi.org/10.1080/1369118X.2012.678878.
Boyom, C., Callens, S., & Cherfi, S. (2016). Cultures of sustainability according to Ulrich Beck scheme: Territorial strategies for electromobility. Journal of Innovation Economics Management, 1, 135–158.
Ceron, A., Curini, L., & Iacus, S. M. (2016). iSA: A fast, scalable and accurate algorithm for sentiment analysis of social media content. Information Sciences, 367, 105–124.
Ceron, A., Curini, L., & Iacus, S. M. (2017). Politics and Big Data: Nowcasting and Forecasting Elections with Social Media. New York: Routledge.
Chen, T., He, T., Benesty, M., Khotilovich, V., & Tang, Y. (2019). xgboost: Extreme gradient boosting. R package version 0.81.0.1, 1–4.
Friedman, J. H. (2002). Stochastic gradient boosting. Computational Statistics & Data Analysis, 38(4), 367–378.
Groves, C. (2019). Sustainability and the future: Reflections on the ethical and political significance of sustainability. Sustainability Science, 14(4), 915–924.
Hopkins, D. J., & King, G. (2010). A method of automated nonparametric content analysis for social science. American Journal of Political Science, 54(1), 229–247.
Lauro, N. C., Amaturo, E., Grassia, M. G., Aragona, B., & Marino, M. (Eds.). (2017). Data science and social research: Epistemology, methods, technology and applications. Heidelberg: Springer.
McCormick, T. H., Lee, H., Cesare, N., Shojaie, A., & Spiro, E. S. (2017). Using Twitter for demographic and social science research: Tools for data collection and processing. Sociological Methods & Research, 46(3), 390–421.
Morozov, E. (2009). The Brave New World of Slacktivism. Foreign Policy. Accessed March 15, 2019, from https://foreignpolicy.com/2009/05/19/the-brave-new-world-of-slacktivism
Salton, G., & Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Information Processing & Management, 24(5), 513–523.
Sampson, J., Morstatter, F., Maciejewski, R., & Liu, H. (2015). Surpassing the limit: Keyword clustering to improve Twitter sample coverage. In Proceedings of the 26th ACM conference on hypertext & social media (pp. 237–245). ACM.
Surian, A., & Sciandra, A. (2019). City Prosperity Index: A comparative analysis of Latin American and Mediterranean cities based on well-being and social inclusion features. In Book of short papers ASA conference 2019, statistics for health and well-being (pp. 210–214). Padova: Cleup.
Symeonidis, S., Effrosynidis, D., & Arampatzis, A. (2018). A comparative evaluation of pre-processing techniques and their interactions for twitter sentiment analysis. Expert Systems with Applications, 110, 298–310.
United Nations General Assembly. (2015). Transforming our world: The 2030 Agenda for Sustainable Development. Resolution 70/1.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Sciandra, A., Surian, A. & Finos, L. Supervised Machine Learning Methods to Disclose Action and Information in “U.N. 2030 Agenda” Social Media Data. Soc Indic Res 156, 689–699 (2021). https://doi.org/10.1007/s11205-020-02523-4
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11205-020-02523-4