An Analysis of the Evolving Coverage of Computer Science Sub-fields in the DBLP Digital Library
- 1.5k Downloads
Many scientists and research groups make use of the DBLP bibliographic project collection in various ways. Most of them are unaware of its internal structure, although it can have significant influence on their results. Prior work has shown that the collection does not cover all sub-fields of computer science in the same quality but has not provided an explanation for these differences. We introduce an extension of the DBLP data set which gives us a detailed picture on how DBLP has evolved since 1995. We show that the project started with a narrow focus on two sub-fields and discuss how additional themes have been added in recent years. We analyze the relations between sub-fields at different times and provide a model which explains the differences in coverage.
KeywordsLogic Programming Betweenness Centrality Conference Paper Historic Collection Very Large Data Base
Unable to display preview. Download preview PDF.
- 1.Deng, H., King, I., Lyu, M.R.: Formal Models for Expert Finding on DBLP Bibliography Data. In: Proc. of the ICDM 2008, pp. 163–172. IEEE CS, New York (2008)Google Scholar
- 3.Huang, Z., Yan, Y., Qui, Y., Qiao, S.: Exploring Emergent Semantic Communities from DBLP Bibliography Database. In: Proc. of the ASONAM, pp. 219–224. IEEE CS, New York (2009)Google Scholar
- 7.Martins, W.S., Gonçalves, M.A., Laender, A.H.F., Pappa, G.L.: Learning to assess the quality of scientific conferences: a case study in computer science. In: Proc. of the JCDL, pp. 193–202. ACM, New York (2008)Google Scholar