Abstract
With the adjustment of social relations and interest patterns brought about by the comprehensive deepening reform in China, new and old contradictions are intertwined, various risks are increased, collective actions occasionally occur, and some new trends are observed. However, due to there is no authoritative database of collective action in China, it is difficult to observe the trend of collective actions. There has been significant research show that news coverage is an effective way to obtain collective action information. Thus, we examine the recent news coverage shift in terms of collective action. We collected 5354 news coverages from 2014 to 2018. Then, we constructed a collective action domain-specific word dictionary and presented a method to automatically detect temporal, spatial, and topical trends of collective action. The proposed framework is based on text mining analysis that collects data from news outlets and extracts valuable data for perceiving the collective action trends. The results show that the proposed method is an effective tool to identify the trends in collective action via machine learning.
Similar content being viewed by others
References
Bogdanor, V.: The Blackwell encyclopaedia of political institutions. Blackwell Publishing (1987)
Bokányi, E., Kondor, D., Dobos, L., Seb/Hok, T., Stéger, J., Csabai, I., Vattay, G.: Race, religion and the city: twitter word frequency patterns reveal dominant demographic dimensions in the United States. Palgrave Commun. 2, 16010 (2016)
Buitinck, L., Louppe, G., Blondel, M., Pedregosa, F., Mueller, A., Grisel, O., Niculae, V., Prettenhofer, P., Gramfort, A., Grobler, J., Layton, R., VanderPlas, J., Joly, A., Holt, B., Varoquaux, G.: API design for machine learning software: experiences from the scikit-learn project. In: ECML PKDD Workshop: Languages for Data Mining and Machine Learning, pp. 108–122 (2013)
Chiu, B., Crichton, G., Korhonen, A., Pyysalo, S.: How to train good word embeddings for biomedical NLP. In: Proceedings of the 15th workshop on biomedical natural language processing, pp. 166–174 (2016)
Fisher, D.R., Andrews, K.T., Caren, N., Chenoweth, E., Heaney, M.T., Leung, T., Nathan Perkins, L., Pressman, J.: The science of contemporary street protest: new efforts in the United States. Sci. Adv 5, 5461–5484 (2019)
Franzosi, R.: From words to numbers: a set theory framework for the collection, organization, and analysis of narrative data. Sociol. methodol. 24, 105–136 (1994)
Franzosi, R.: Quantitative narrative analysis. 162. Sage (2010)
Franzosi, R., De Fazio, G., Vicari, S.: Ways of measuring agency: an application of quantitative narrative analysis to lynchings in Georgia (1875–1930). Sociol. Methodol. 42, 1–42 (2012)
Hamilton, W. L., Leskovec, J., Jurafsky, D.: Diachronic word embeddings reveal statistical laws of semantic change (2016). arXiv preprint arXiv:1605.09096
Hanna, A.: MPEDS: Automating the Generation of Protest Event Data 1–40 (2017)
King, B.G., Bentele, K.G., Soule, S.A.: Protest and policymaking: explaining fluctuation in congressional attention to rights issues, 1960–1986. Soc. Forces 86, 137–163 (2007)
Koopmans, R., Rucht, D.: Protest event analysis. Methods soc. movem. res. 16, 231–259 (2002)
Lansdall-Welfare, T., Cristianini, N.: History playground: a tool for discovering temporal trends in massive textual corpora. Digit Scholarsh Humanit 35, 328–341 (2020)
Leetaru, K., Schrodt, P.A.: Gdelt: global data on events, location, and tone, 1979–2012. ISA annual convent. 2, 1–49 (2013)
Lindén, K.: A finnish news corpus for named entity recognition. Language Res. Evaluat. 54, 1–26 (2019)
López-Solaz, T.: An approach to the use of word embeddings in an opinion classification task. Expert Syst. Appl. 66, 1–6 (2016)
Mastrorocco, N., Minale, L.: News media and crime perceptions: evidence from a natural experiment. J. Public Econ. 165, 230–255 (2018)
McAdam, D., Su, Y.: The war at home: Antiwar protests and congressional voting, 1965 to 1973. Am. sociol. rev. 696–721 (2002)
McAdam, D., Tarrow, S., Tilly, C.: Dynamics of contention. Soc. Movement Stud. 2, 99–102 (2003)
Mowafi, Y., Zmily, A., Abou-Tair, D.E.D.I., Abu-Saymeh, D.: Tracking human mobility at mass gathering events using WISP (2013)
Rand, W.M.: Objective criteria for the evaluation of clustering methods. Publ. Am. Statist. Assoc. 66, 846–850 (1971)
Řehůřek, R., Sojka, P.: Software framework for topic modelling with large corpora. In: Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, pp. 45–50. ELRA, Valletta, Malta (2010)
Robert Tibshirani Guenther Walther, T.H.: Estimating the number of clusters in a data set via the gap statistic. J. Royal Statist. Soc.: Series B (Statistical Methodology) (2001)
Rucht, D., Neidhardt, F.: Methodological issues in collecting protest event data: Units of analysis, sources and sampling, coding problems. Acts of dissent: New developments in the study of protest 65–89 (1999)
Shao, P., Wang, Y.: How does social media change Chinese political culture? the formation of fragmentized public sphere. Telematics Inform. 34, 694–704 (2017)
Tilly, C.: Contentious performances. Cambridge University Press (2008)
Vicari, S.: Measuring collective action frames: a linguistic approach to frame analysis. Poetics 38, 504–525 (2010)
Wren, J.: Cross-type biomedical named entity recognition with deep multi-task learning. Bioinformatics 10 (2018)
Xu, X.: Chinese Named Entity Recognition Based on CNN-BiLSTM-CRF. In: IEEE International Conference on Software Engineering and Service Science (2018)
Zhang, H., Pan, J.: CASM: A deep-learning approach for identifying collective action events with text and image data from social media, volume 49 (2019)
Zhou, J., Wang, E., Chen, Y., Wu, X., Ma, Y., Tian, Y.: Forecasting model of mass incidents in China-An explorative research based on suppport vector machine. In: 2009 International Conference on Business Intelligence and Financial Engineering, pp. 152–155. IEEE (2009)
Acknowledgements
The authors acknowledge financial support from the National Natural Science Foundation of China (No. 71774154,72074205,71573247).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Ethical Statement
I certify that this manuscript is original and has not been published and will not be submitted elsewhere for publication while being considered by Quality & Quantity. And the study is not split up into several parts to increase the quantity of submissions and submitted to various journals or to one journal over time. No data have been fabricated or manipulated (including images) to support your conclusions. No data, text, or theories by others are presented as if they were our own. The submission has been received explicitly from all co-authors. And authors whose names appear on the submission have contributed sufficiently to the scientific work and therefore share collective responsibility and accountability for the results.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Ying, L., Linlin, L. & Qianqian, L. The clues in the news media coverage: detecting Chinese collective action trend from a text analytics research framework. Qual Quant 56, 729–749 (2022). https://doi.org/10.1007/s11135-021-01137-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11135-021-01137-3