Exactus Expert—Search and Analytical Engine for Research and Development Support

  • Gennady Osipov
  • Ivan Smirnov
  • Ilya Tikhomirov
  • Ilya Sochenkov
  • Artem Shelmanov
Chapter
Part of the Studies in Computational Intelligence book series (SCI, volume 586)

Abstract

The paper presents the system-“Exactus Expert”—search and analytical engine. The system aims to provide comprehensive tools for analysis of large-scale collections of scientific documents for experts and researchers. The system challenges many tasks, among them full-text search, search for similar documents, automatic quality assessment, term and definition extraction, results extraction and comparison, detection of scientific directions and analysis of references. These features help to aggregate information about different sides of scientific activity and can be useful for evaluation of research projects and groups. The paper discusses general architecture of the system, implemented methods of scientific publication analysis and some experimental results.

Notes

Acknowledgements

The research is supported by Russian Foundation for Basic Research, project No. 14-29-05008-ofi_m.

References

  1. 1.
    SciVal. http://info.scival.com. Cited 15 July 2013
  2. 2.
    R.A. Day, The origins of the scientific paper: the IMRAD format. Am. Med. Writers Assoc. J. 4(2), 16–18 (1989)Google Scholar
  3. 3.
    D. Deviatkin, A. Shevets, Experimental method of automatic extraction of research topics and research groups, in Proceedings of Thirteenth National Conference on Artificial Intelligence with international participation CAI-2012, vol. 2, BGTU, Belgorod, Russia (2012) pp. 90–100. (In Russian)Google Scholar
  4. 4.
    W.B. Frakes, R. Baeza-Yates, Information retrieval: data structures and algorithms (Prentice-Hall Inc, Upper Saddle River, NJ, 1992)Google Scholar
  5. 5.
    C. Hui Han, L. Giles, E. Manavoglu, H. Zha, Z. Zhang, E.A. Fox, Automatic document metadata extraction using support vector machines, in JCDL ’03: Proceedings of the 3rd ACM/IEEE-CS Joint Conference on Digital Libraries. IEEE (2003), pp. 37–48Google Scholar
  6. 6.
    T. Joachims, A probabilistic analysis of the rocchio algorithm with TFIDF for text categorization, Proceedings of the Fourteenth International Conference on Machine Learning, ICML ’97 (Morgan Kaufmann Publishers Inc., San Francisco, CA, 1997), pp. 143–151Google Scholar
  7. 7.
    L. Leydesdorff, S. Carley, I. Rafols, Global maps of science based on the new Web-of-Science categories. Scientometrics 94(2), 589–593 (2013)CrossRefGoogle Scholar
  8. 8.
    E. Mbaykodzhi, A. Dral, I. Sochenkov, Method for automatic classification of short text messages. Inf. Technol. Comput. Syst. 3, 93–102 (2012). (In Russian)Google Scholar
  9. 9.
    M.A. Montemurro, Beyond the Zipf-Mandelbrot law in quantitative linguistics. Elsevier (2001), pp. 567–578Google Scholar
  10. 10.
    F. Moya-Anegon, Z. Chinchilla-Rodriguez, B. Vargas-Quesada, E. Corera-Alvarez, F.J. Munoz-Fernandez, A. Gonzalez-Molina, V. Herrero-Solana, Coverage analysis of Scopus: a journal metric approach. Scientometrics 73(1), 53–78 (2007)CrossRefGoogle Scholar
  11. 11.
    M. Nekrestyanova, I. Nekrestyanov, ROMIP 2008 evaluation: rules, methodology and adhoc decisions, in Proceedings of ROMIP’2008, NU CZSI, Saint Petersburg, Russia (2008), pp. 5–26. (In Russian)Google Scholar
  12. 12.
    G. Osipov, Formulation of subject domain models: Part 1. Heterogeneous semantic nets. J. Comput. Syst. Sci. Int. 30(5) (1992) Google Scholar
  13. 13.
    G. Osipov, Methods for extracting semantic types of natural language statements from texts, in 10th IEEE International Symposium on Intelligent Control. Monterey, California, USA (1995), pp. 292–299.Google Scholar
  14. 14.
    G. Osipov, I. Smirnov, I. Tikhomirov, Relational-situational method for text search and analysis and its applications. Sci. Tech. Inf. Process. 37(6), 432–437 (2010)CrossRefGoogle Scholar
  15. 15.
    G. Osipov, I. Smirnov, I. Tikhomirov, A. Shelmanov, Relational-situational method for intelligent search and analysis of scientific publications, in Proceedings of the Workshop on Integrating IR technologies for Professional Search, in conjunction with the 35th European Conference on Information Retrieval (ECIR’13), vol. 968. (2013), pp. 57–64Google Scholar
  16. 16.
    G. Osipov, I. Smirnov, I. Tikhomirov, O. Zavjalova, Application of linguistic knowledge to search precision improvement, in Proceedings of 4th International IEEE Conference on Intelligent Systems, Vol. 2. IEEE (2008), pp. 17-2–17-5Google Scholar
  17. 17.
    G. Osipov, I. Smirnov, I. Tikhomirov, I. Sochenkov, A. Shelmanov, A. Shvets, Information Retrieval for R&D Support // Professional Search in the Modern World. Lect. Notes Comput. Sc. 8830, 45–69 (2014).Google Scholar
  18. 18.
    L. Padro, E. Stanilovsky, Freeling 3.0: towards wider multilinguality, in Proceedings of the Language Resources and Evaluation Conference (LREC 2012). ELRA, Istanbul, Turkey (2012)Google Scholar
  19. 19.
    A. Shelmanov, Method for automatic extraction of multiword terms from texts of scientific publications, in Proceedings of Thirteenth National Conference on Artificial Intelligence with international participation CAI-2012, Vol. 1. BGTU, Belgorod, Russia (2012), pp. 268–274. (In Russian)Google Scholar
  20. 20.
    A. Shvets, A Method of Automatic Detection of Pseudoscientific Publications, in Proceedings of the 7th IEEE International Conference Intelligent Systems (IS’2014 IEEE). Advances in Intelligent Systems and Computing (AISC), Vol. 2. Warsaw (2015), pp. 533–539.Google Scholar
  21. 21.
    A. Shvets, D. Devyatkin, I. Sochenkov, I. Tikhomirov, K. Popov, K. Yarygin, Detection of Current Research Directions Based on Full-Text Clustering, in Proceedings of Science and Information Conference 2015, London, (2015), pp. 483–488.Google Scholar
  22. 22.
    I. Smirnov, I. Tikhomirov, Heterogeneous semantic networks for text representation in intelligent search engine EXACTUS, Proceedings of Workshop SENSE’09—Conceptual Structures for Extracting Natural language SEmantics, The 17th International Conference on Conceptual Structures (ICCS’09) (CEUR Workshop Proceedings, Moscow, Russia, 2009), pp. 1–9Google Scholar
  23. 23.
    I. Sochenkov, Text comparision method for search and analytical tasks. Artif. Intell. Decis. Mak. 2, 95–106 (2013). (In Russian)Google Scholar
  24. 24.
    A. Sokirko, A short description of Dialing project (2001). http://www.aot.ru/docs/sokirko/sokirko-candid-eng.html. Cited 15 July 2013
  25. 25.
    L.B. Sollaci, M.G. Pereira, The introduction, methods, results, and discussion (IMRAD) structure: a fifty-year survey. J. Med. Libr. Assoc. 92(3), 364–371 (2004)Google Scholar
  26. 26.
    R. Suvorov, I. Sochenkov, Detection of similarity of scientific documents using thematic importance characteristic. Artif. Intell. Decis. Mak. 1, 33–40 (2013). (In Russian)Google Scholar
  27. 27.
    R. Suvorov, I. Sochenkov, I. Tikhomirov, Method for pornography filtering in the web based on automatic classification and natural language processing, in M. Železnỳ et al. (ed.) Lecture Notes in Artificial Intelligence, vol. 8113. (Springer, Berlin 2013), pp. 233–240Google Scholar
  28. 28.
    J. Zobel, How reliable are the results of large-scale information retrieval experiments? in SIGIR ‘98: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, Melbourne, Australia (1998), pp. 307–314Google Scholar
  29. 29.
    G. Zolotova, Syntactic dictionary: repertory of elementary units of Russian Syntax (Nauka, Moscow, Russia, 1988). (In Russian)Google Scholar
  30. 30.
    G. Zolotova, N. Onipenko, M. Sidorova, Communicative grammar of Russian language (V.V. Vinogradov Russian Language Institute RAS, Moscow, Russia, 2004). (In Russian)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  • Gennady Osipov
    • 1
  • Ivan Smirnov
    • 1
  • Ilya Tikhomirov
    • 1
  • Ilya Sochenkov
    • 1
  • Artem Shelmanov
    • 1
  1. 1.Institute for Systems Analysis of the Russian Academy of SciencesMoscowRussia

Personalised recommendations