Abstract
The aim of this work is to construct computer methods for objective quality assessment of scientific documents (science and technical papers, dissertations, reports on R&D and design projects, application documents to hold them, patent documentation, etc.). Unlike computer programs, databases, handbooks of physical constants and other documents written in a specially structured formal language, such documents are unstructured. To objectify procedures for quality assessment of scientific texts written in natural languages, an approach is proposed that leverages computational analysis of semantic models of individual documents and collections of documents.
Similar content being viewed by others
References
Ph. J. Pritchard, MathCad: A Tool for Engineering Problem Solving (B.E.S.T. Series) (McGraw-Hill Science/Engineering/Math, New York, 1998).
B. Maxfield, Essential Mathcad for Engineering, Science, and Math, Second Edition (Acad. Press, New York, 2009).
K. L. McMillan, “Lazy Annotation for Program Testing and Verification,” Lecture Notes in Computer Science 6174, 104–118 (2010).
S. V. Bredikhin and A. Yu. Kuznetsov, Bibliometric Methods and Electronic Scientific Publishing Market (IVMiMG SO RAN, NEIKON, Novosibirsk, 2012) [in Russian].
N. S. Red’kina, “Bibliometrics: History and Modern State,” Molodye v bibliotechnom dele, No. 2, 76–86 (2003).
V. V. Nalimov and Z. M. Mul’chenko, Measurement of Science (Nauka, Moscow, 1969) [in Russian].
Ch. D. Manning and H. Schutze, Foundations of Statistical Natural Language Processing (MIT Press, Boston, 1999).
A. Broder, S. Glassman, M. Manasse, et al., “Syntactic Clustering of the Web,” SRC Technical Note, Pal Alto California: SRC, Digital Equipment Corporation 1997–015 (1997) http://www.std.org/msm/common/clustering.html.
T. H. Cormen, Ch. E. Leiserson, R. L. Rivest, et al., Introduction to Algorithms, Second Edition (MIT Press, McGraw-Hill, New York, 2001).
Zh. Yiu, J. Rong, and Zh. Zhi-Hua, “Understanding Bag-Of-Words Model: A Statistical Framework,” Intern. J. Machine Learning and Cybernetics 1, 43–52 (2010).
B. Gipp and J. Beel, “Citation Based Plagiarism Detection — A New Approach to Identifying Plagiarized Work Language Independently,” in Proceedings of the 21st ACM Conference on Hypertext and Hypermedia (HT’10) (ACM, Eindhoven, 2010).
P. Juola, “Authorship Attribution,” Foundations and Trends in Information Retrieval 1, 233–334 (2006).
M. G. Kreines and A. A. Afonin, “Clusterization of Text Collections: Help in Content Search and Analytical Tool,” Internet-portaly: soderzhanie i tekhnologii. Vyp. 4 (Prosveshchenie, Moscow, 2007), pp. 510–537.
M. G. Kreines, “Models and Technologies for Extraction of Aggregated Knowledge to Control Processes of the Retrieval of Non-Structured Information,” Comput. Syst. Sci. Int. 48, 272 (2009).
M. G. Kreines, “Model of Text Collection for Searching Information in Natural Languages: Keywords, Their Relations and Contexts,” in Proceedings of the 6th Moscow International Conference on Operations Research (MAKS Press., Moscow, 2010), pp. 150–151 [in Russian].
M. G. Kreines, “KEYS TO TEXTS Information Technology,” Rechevye tekhnologii, No. 4, 97–106 (2009), http://speechtechnology.ru/files/4-2009.pdf.
B. Russell, An Inquiry into Meaning and Truth (George Allen & Unwin, Ltd., London, 1940; Ideya-Press, Dom intellektual’noi knigi, Moscow, 1999).
Author information
Authors and Affiliations
Additional information
Original Russian Text © M.G. Kreines, 2013, published in Izvestiya Akademii Nauk. Teoriya i Sistemy Upravleniya, 2013, No. 2, pp. 64–75.
Rights and permissions
About this article
Cite this article
Kreines, M.G. Methods of computational analysis of semantic models for quality assessment of scientific texts. J. Comput. Syst. Sci. Int. 52, 226–236 (2013). https://doi.org/10.1134/S1064230713020044
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1134/S1064230713020044