Overview of the NLPCC 2018 Shared Task: Automatic Tagging of Zhihu Questions

  • Bo Huang
  • Zhenyu Zhao
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11109)


In this paper, we give an overview of the shared task at the CCF Conference on Natural Language Processing & Chinese Computing (NLPCC 2018): Automatic Tagging of Zhihu Questions. The dataset for this shared task, collected from the Chinese question-answering website Zhihu, consists of 25,551 tags and 721,608 training samples. This is a multi-label text classification task, and each question can have as many as five relevant tags. The dataset can be accessed at
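To make the task format concrete, here is a minimal sketch of how a multi-label prediction with at most five tags per question might be represented. The function names, the 0.5 score threshold, the tag names, and the scores are all hypothetical illustrations; they are not taken from the shared task data or from any participating system.

```python
from typing import Dict, List

MAX_TAGS = 5  # the task allows at most five relevant tags per question


def top_tags(scores: Dict[str, float],
             threshold: float = 0.5,
             max_tags: int = MAX_TAGS) -> List[str]:
    """Return up to max_tags tags whose score meets the threshold, best first.

    The threshold is an assumption made for this sketch only.
    """
    # Sort by descending score; break ties alphabetically for determinism.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [tag for tag, score in ranked if score >= threshold][:max_tags]


def multi_hot(tags: List[str], vocabulary: List[str]) -> List[int]:
    """Encode a tag set as a 0/1 vector over a fixed tag vocabulary."""
    present = set(tags)
    return [1 if tag in present else 0 for tag in vocabulary]


# Hypothetical per-tag scores for one question (not real task data).
example_scores = {
    "machine learning": 0.91,
    "python": 0.75,
    "nlp": 0.66,
    "deep learning": 0.58,
    "statistics": 0.52,
    "math": 0.51,
    "career": 0.12,
    "movies": 0.03,
}

predicted = top_tags(example_scores)
encoded = multi_hot(predicted, sorted(example_scores))
```

Six tags clear the threshold in this toy example, so the five-tag cap removes the lowest-scoring one. In the real task the vocabulary would hold all 25,551 tags rather than eight.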


Keywords: Automatic tagging · Multi-label classification · Text classification



Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  1. Zhihu Institute, Beijing, China
