Characteristics and Evolution of Citation Distance Based on LDA Method
The research behavior of scholars is a core issue in the study of science, and the ideas and methods of complex networks offer a new perspective on it. Citation networks and scientific collaboration networks are widely used to study scholars' citation behavior and the dissemination of scientific ideas, and they have yielded a number of results. However, research based solely on network topology is limited because it lacks information on the content of the articles. By adding textual content analysis through latent Dirichlet allocation (LDA), this paper studies the distribution of content correlation between articles linked by citation and how that distribution evolves over time. It finds that the distribution of citation distance has normal characteristics, but that the reference distance is visibly short. Authors show citation preferences for documents at a distance.
Keywords: Scientific reference; Citation distance; Scientist's behavior; LDA
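As an illustration of the idea summarized in the abstract, the sketch below computes a content distance for each citing/cited pair from per-article topic distributions. It is a minimal sketch under stated assumptions, not the paper's procedure: the abstract does not say which distance measure is used, so the Jensen-Shannon distance is an assumption here, and the topic vectors, article IDs, and the helper names `js_distance` and `citation_distances` are hypothetical. In practice the topic vectors would come from an LDA model fitted on the article texts.

```python
import math


def js_distance(p, q):
    """Jensen-Shannon distance (base-2 logs) between two topic distributions.

    Assumption: this metric is chosen for illustration only; the source
    does not specify how citation distance is measured.
    """
    # Mixture distribution halfway between p and q.
    m = [(a + b) / 2 for a, b in zip(p, q)]

    def kl(x, y):
        # Terms with x_i == 0 contribute nothing to the KL divergence.
        return sum(a * math.log2(a / b) for a, b in zip(x, y) if a > 0)

    divergence = 0.5 * kl(p, m) + 0.5 * kl(q, m)
    # Guard against tiny negative values caused by floating-point rounding.
    return math.sqrt(max(divergence, 0.0))


def citation_distances(topics, citations):
    """Return one distance per (citing, cited) pair in the citation list."""
    return [js_distance(topics[src], topics[dst]) for src, dst in citations]


# Hypothetical topic vectors, as an LDA model with 3 topics might produce.
topics = {
    "A": [0.7, 0.2, 0.1],
    "B": [0.6, 0.3, 0.1],
    "C": [0.0, 0.1, 0.9],
}
# Hypothetical citation edges: article A cites B and C.
citations = [("A", "B"), ("A", "C")]

distances = citation_distances(topics, citations)
for (src, dst), d in zip(citations, distances):
    print(f"{src} -> {dst}: {d:.3f}")
```

With base-2 logarithms the distance lies between 0 (identical topic mixtures) and 1 (no shared topics), so the values for all citation pairs can be collected into a histogram and compared across publication years.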
We appreciate the comments and helpful suggestions from Prof. Zengru Di, Prof. Chensheng Wu, and Ms. Weiwei Gu. This work was supported by the Chinese National Natural Science Foundation (71701018, 61673070 and 71671017).