Abstract
Currently, pseudoscientific theories are actively promoted being published in a large amount of papers. They appear in mass media, in patents and even in scientific journals, and it is rather difficult for non-expert to distinguish scientific paper from pseudoscientific. A method for identifying pseudoscientific publications based on automatic text analysis is proposed. At first, the text is partitioned into small fragments consisting of several paragraphs. Then feature extraction occurs using an automatic linguistic analysis and classification of text fragments is implemented by support vector machines. Experiments show that the method divides scientific and pseudoscientific publications into different classes with high accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
RationalWiki, http://rationalwiki.org
Aleksandrov, E.B.: Answers to Questions about Pseudoscience. Journal (In defense of science)Â 8 (2011) (in Russian)
Gitelson, I.I.: Necessity of Government Protection of the People from the Onslaught of Fake Medicine. Journal (In defense of science) 2, 52–55 (2007) (in Russian)
Sceptic Society, http://www.skeptic.com
Skeptical Inquirer, http://www.csicop.org
Journal (In defense of science) 12 (2013)
Labbé, C., Labbé, D.: Duplicate and fake publications in the scientific literature: How many SCIgen papers in computer science? Scientometrics. Scientometrics 94(1), 379–396 (2013)
Osipov, G., Smirnov, I., Tikhomirov, I., Shelmanov, A.: Relational–situational method for intelligent search and analysis of scientific publications. In: Proceedings of the Workshop on Integrating IR Technologies for Professional Search, in Conjunction with the 35th European Conference on Information Retrieval (ECIR 2013), Moscow, Russia. CEUR Workshop Proceedings, vol. 968 (2013)
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Information Processing & Management 24(5), 513–523 (1988)
Cortes, C., Vapnik, V.: Support-vector networks. Machine Learning 20(3), 273 (1995)
LIBSVM – A Library for Support Vector Machines, http://w.csie.org/~cjlin/libsvm
Powers, D.M.W.: Evaluation: From Precision, Recall and F-Factor to ROC, Informedness, Markedness and Correlation. Journal of Machine Learning Technologies 2(1), 37–63 (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Shvets, A. (2015). A Method of Automatic Detection of Pseudoscientific Publications. In: Filev, D., et al. Intelligent Systems'2014. Advances in Intelligent Systems and Computing, vol 323. Springer, Cham. https://doi.org/10.1007/978-3-319-11310-4_46
Download citation
DOI: https://doi.org/10.1007/978-3-319-11310-4_46
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11309-8
Online ISBN: 978-3-319-11310-4
eBook Packages: EngineeringEngineering (R0)