NLP-Based Detection of Mathematics Subject Classification

  • Yihe DongEmail author
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10931)


We present a classifier for the Mathematics Subject Classification (MSC) system, combining techniques in unsupervised learning such as nearest neighbors, and supervised learning such as neural networks. We will discuss the challenges presented in the classification task, such as the large number of possible classes, many with overlapping scope; and describe the data processing and experimental methodologies employed.


Text classification NLP Neural networks 



We would like to thank Jeremy Michelson and Michael Trott for continuously lending their ears and ideas throughout this project, and the ICMS reviewer for constructive comments on an earlier draft of this paper.


  1. 1.
    Řehůřek, R., Sojka, P.: Automated classification and categorization of mathematical knowledge. In: Autexier, S., Campbell, J., Rubio, J., Sorge, V., Suzuki, M., Wiedijk, F. (eds.) CICM 2008. LNCS (LNAI), vol. 5144, pp. 543–557. Springer, Heidelberg (2008). Scholar
  2. 2.
    Furnas, G., Dumais, S., Landauer, T., Harshman, R., Streeter, L., Lochbaum, K.: Information retrieval using a singular value decomposition model of latent semantic structure. In: Proceedings of SIGIR (1998)Google Scholar
  3. 3.
  4. 4.
    arXiv Bulk Data Access.
  5. 5.
    Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space.
  6. 6.
    Pennington, J., Socher, R., Manning,. C.D.: GloVe: global vectors for word representation. In: EMNLP (2014)Google Scholar
  7. 7.
    Kingma, D., Lei Ba, J.: Adam: a method for stochastic optimization. In: ICLR (2015)Google Scholar
  8. 8.
    Corpus of Contemporary American English.
  9. 9.
    Joulin, A., Grave, E., Bojanowski, P., Mikolov, T.: Bag of tricks for efficient text classification. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (2017)Google Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Wolfram ResearchChampaignUSA

Personalised recommendations