Author Cooperation Based on Terms of Article Titles from DBLP

  • Štěpán MinksEmail author
  • Jan Martinovič
  • Pavla Dráždilová
  • Kateřina Slaninová
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 179)


Very interesting source of information about scientific publishing in computer science is database DBLP. This database allows bibliographic information about main publications from conferences, journals and books in this area. In the article we deal with strength extraction between authors based on their association. The research presented in this article is partly motivated by work of Mori et al. From this paper we have used the approach for extraction of initial metadata, and we have inspired how to take advantage from Jaccard coefficient principals for description of the strength of associations between authors. Method is usable for development of synthetic coauthors network, where as input is used the set of words, which will describe the network (the authors used these words in publication titles).


Expansion Query Association Strength Input Text Jaccard Coefficient Bibliographic Information 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.



This work is supported by SGS, VŠB—Technical University of Ostrava, Czech Republic, under the grant No. SP2011/172.


  1. 1.
    Mori, J., Matsuo, Y., Ishizuka, M., Faltings, B.: Keyword extraction from the Web for FOAF metadata. In: Proceedings of the 1st Workshop on Friend of a Friend, Social Networking and the (Semantic) Web, (2004)Google Scholar
  2. 2.
    Konchady, M.: Text Mining Application Programming (Programming Se-ries). Charles River Media, Rockland (2006)Google Scholar
  3. 3.
    Porter, M.F.: An algorithm for suffix stripping. Program 14, 130–137 (1980)Google Scholar
  4. 4.
    Lopez, P.,Laurent, R.:HUMB:automatic key term extraction from scientific articles inGROBID, pp. 248–251. Computational Linguistics, July (2010)Google Scholar
  5. 5.
    Jacquemin, C., Didier, B.: Term extraction and automatic indexing. In: Mitkov, R. (ed.) Handbook of Computational Linguistics, pp. 599–615. Oxford University Press, Oxford (2003)Google Scholar
  6. 6.
    Lashkari, A.H., Mahdavi, F., Ghomi, V.: A boolean model in information retrieval for search engines (2009). doi: 10.1109/ICIME.2009.101 Google Scholar
  7. 7.
    Deza, E., Deza, M.: Dictionary of Distances, pp. 1–391. Elsevier, Amsterdam (2006)Google Scholar
  8. 8.
    Ding, Y.: Scientific collaboration and endorsement: Network analysis of coauthorship and citation networks. J. informetr. 5(1), 187–203 (2011)CrossRefGoogle Scholar
  9. 9.
    Santamaría, R.R.T.: Overlapping clustered graphs: co-authorship networks visualization. Lect. Notes Comput. Sci. 5166, 190–199 (2008)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Štěpán Minks
    • 1
    Email author
  • Jan Martinovič
    • 1
  • Pavla Dráždilová
    • 1
  • Kateřina Slaninová
    • 1
  1. 1.Department of Computer ScienceVŠB—Technical University of OstravaOstravaCzech Republic

Personalised recommendations