Author Cooperation Based on Terms of Article Titles from DBLP
The DBLP database is a valuable source of information about scientific publishing in computer science. It provides bibliographic information on the major publications from conferences, journals, and books in this area. In this article we deal with extracting the strength of the relationship between authors based on their association. The research presented here is partly motivated by the work of Mori et al. From their paper we adopted the approach to extracting initial metadata, and it inspired our use of the principles of the Jaccard coefficient to describe the strength of associations between authors. The method can be used to build a synthetic co-author network whose input is a set of words that describe the network (words the authors used in publication titles).
Keywords: Expansion Query, Association Strength, Input Text, Jaccard Coefficient, Bibliographic Information
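As a rough illustration of the idea in the abstract, the sketch below computes a Jaccard coefficient between authors, assuming each author is represented by the set of terms drawn from their publication titles. The author names, the term sets, and the helper names `jaccard` and `association_strengths` are hypothetical; the abstract does not specify how the article itself builds or weights these sets, so this is only one plausible reading.

```python
from itertools import combinations


def jaccard(a: set, b: set) -> float:
    """Jaccard coefficient |A & B| / |A | B|; returns 0.0 if both sets are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Hypothetical toy data: author -> terms taken from that author's titles.
# Real input would come from preprocessed DBLP titles (an assumption here).
author_terms = {
    "Author A": {"network", "social", "graph"},
    "Author B": {"network", "graph", "cluster"},
    "Author C": {"query", "index"},
}


def association_strengths(terms_by_author: dict) -> dict:
    """Return the Jaccard coefficient for every unordered pair of authors."""
    return {
        (x, y): jaccard(terms_by_author[x], terms_by_author[y])
        for x, y in combinations(sorted(terms_by_author), 2)
    }


strengths = association_strengths(author_terms)
for pair, value in strengths.items():
    print(pair, value)
```

Pairs with a nonzero coefficient could then serve as weighted edges of a synthetic co-author network; whether and how to threshold those weights is left open here.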
This work is supported by SGS, VŠB—Technical University of Ostrava, Czech Republic, under the grant No. SP2011/172.
- 1. Mori, J., Matsuo, Y., Ishizuka, M., Faltings, B.: Keyword extraction from the Web for FOAF metadata. In: Proceedings of the 1st Workshop on Friend of a Friend, Social Networking and the (Semantic) Web (2004)
- 2. Konchady, M.: Text Mining Application Programming (Programming Series). Charles River Media, Rockland (2006)
- 3. Porter, M.F.: An algorithm for suffix stripping. Program 14, 130–137 (1980)
- 4. Lopez, P., Laurent, R.: HUMB: automatic key term extraction from scientific articles in GROBID, pp. 248–251. Computational Linguistics, July (2010)
- 5. Jacquemin, C., Didier, B.: Term extraction and automatic indexing. In: Mitkov, R. (ed.) Handbook of Computational Linguistics, pp. 599–615. Oxford University Press, Oxford (2003)
- 7. Deza, E., Deza, M.: Dictionary of Distances, pp. 1–391. Elsevier, Amsterdam (2006)