A Method of Automatic Detection of Pseudoscientific Publications

  • Alexander Shvets
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 323)


Currently, pseudoscientific theories are actively promoted being published in a large amount of papers. They appear in mass media, in patents and even in scientific journals, and it is rather difficult for non-expert to distinguish scientific paper from pseudoscientific. A method for identifying pseudoscientific publications based on automatic text analysis is proposed. At first, the text is partitioned into small fragments consisting of several paragraphs. Then feature extraction occurs using an automatic linguistic analysis and classification of text fragments is implemented by support vector machines. Experiments show that the method divides scientific and pseudoscientific publications into different classes with high accuracy.


Identifying of pseudoscientific papers Intelligent text analysis Support vector machine 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
  2. 2.
    Aleksandrov, E.B.: Answers to Questions about Pseudoscience. Journal (In defense of science) 8 (2011) (in Russian)Google Scholar
  3. 3.
    Gitelson, I.I.: Necessity of Government Protection of the People from the Onslaught of Fake Medicine. Journal (In defense of science) 2, 52–55 (2007) (in Russian)Google Scholar
  4. 4.
    Sceptic Society,
  5. 5.
    Skeptical Inquirer,
  6. 6.
    Journal (In defense of science) 12 (2013)Google Scholar
  7. 7.
    Labbé, C., Labbé, D.: Duplicate and fake publications in the scientific literature: How many SCIgen papers in computer science? Scientometrics. Scientometrics 94(1), 379–396 (2013)CrossRefGoogle Scholar
  8. 8.
    Osipov, G., Smirnov, I., Tikhomirov, I., Shelmanov, A.: Relational–situational method for intelligent search and analysis of scientific publications. In: Proceedings of the Workshop on Integrating IR Technologies for Professional Search, in Conjunction with the 35th European Conference on Information Retrieval (ECIR 2013), Moscow, Russia. CEUR Workshop Proceedings, vol. 968 (2013)Google Scholar
  9. 9.
    Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Information Processing & Management 24(5), 513–523 (1988)CrossRefGoogle Scholar
  10. 10.
    Cortes, C., Vapnik, V.: Support-vector networks. Machine Learning 20(3), 273 (1995)zbMATHGoogle Scholar
  11. 11.
    LIBSVM – A Library for Support Vector Machines,
  12. 12.
    Powers, D.M.W.: Evaluation: From Precision, Recall and F-Factor to ROC, Informedness, Markedness and Correlation. Journal of Machine Learning Technologies 2(1), 37–63 (2011)MathSciNetGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  1. 1.Institute for Systems Analysis of Russian Academy of SciencesMoscowRussia

Personalised recommendations