Answering Multiple-Choice Questions in Geographical Gaokao with a Concept Graph

  • Jiwei Ding
  • Yuan Wang
  • Wei Hu
  • Linfeng Shi
  • Yuzhong QuEmail author
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10843)


Answering questions in Gaokao (the national college entrance examination in China) brings a great challenge for recent AI systems, where the difficulty of questions and the lack of formal knowledge are two main obstacles, among others. In this paper, we focus on answering multiple-choice questions in geographical Gaokao. Specifically, a concept graph is automatically constructed from textbook tables and Chinese wiki encyclopedia, to capture core concepts and relations in geography. Based on this concept graph, a graph search based question answering approach is designed to find explainable inference paths between questions and options. We developed an online system called CGQA and conducted experiments on two real datasets created from the last ten year geographical Gaokao. Our experimental results demonstrated that CGQA can generate accurate judgments and provide explainable solving procedures. Additionally, CGQA showed promising improvement by combining with existing approaches.


Concept graph Geographical Gaokao Question answering CGQA 



This work is funded by the National Natural Science Foundation of China (No. 61772264) and the National High-tech R&D Program of China (No. 2015AA015406). We thank all participants in the evaluation for their time and effort.


  1. 1.
    Abujabal, A., Yahya, M., Riedewald, M., Weikum, G.: Automated template generation for question answering over knowledge graphs. In: Proceedings of the 26th International Conference on World Wide Web, pp. 1191–1200. International World Wide Web Conferences Steering Committee (2017)Google Scholar
  2. 2.
    Agrawal, R., Golshan, B., Papalexakis, E.: Toward data-driven design of educational courses: a feasibility study. J. Educ. Data Min. 8(1), 1–21 (2016)Google Scholar
  3. 3.
    Angele, J., Moench, E., Oppermann, H., Staab, S., Wenke, D.: Ontology-based query and answering in chemistry: OntoNova Project Halo. In: Fensel, D., Sycara, K., Mylopoulos, J. (eds.) ISWC 2003. LNCS, vol. 2870, pp. 913–928. Springer, Heidelberg (2003). Scholar
  4. 4.
    Cheng, G., Zhu, W., Wang, Z., Chen, J., Qu, Y.: Taking up the Gaokao challenge: an information retrieval approach. In: Proceedings of the 25th International Joint Conference on Artificial Intelligence, pp. 2479–2485. AAAI (2016)Google Scholar
  5. 5.
    Clark, P., Etzioni, O., Khot, T., Sabharwal, A., Tafjord, O., Turney, P.D., Khashabi, D.: Combining retrieval, statistics, and inference to answer elementary science questions. In: Proceedings of the 30th AAAI Conference on Artificial Intelligence, pp. 2580–2586. AAAI (2016)Google Scholar
  6. 6.
    Fujita, A., Kameda, A., Kawazoe, A., Miyao, Y.: Overview of Todai robot project and evaluation framework of its NLP-based problem solving. In: Proceedings of the 9th International Conference on Language Resources and Evaluation, pp. 2590–2597 (2014)Google Scholar
  7. 7.
    Guo, S., Zeng, X., He, S., Liu, K., Zhao, J.: Which is the effective way for Gaokao: information retrieval or neural networks? In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, pp. 111–120. ACL (2017)Google Scholar
  8. 8.
    Hu, W., Li, H., Sun, Z., Qian, X., Xue, L., Cao, E., Qu, Y.: Clinga: bringing Chinese physical and human geography in linked open data. In: Groth, P., Simperl, E., Gray, A., Sabou, M., Krötzsch, M., Lecue, F., Flöck, F., Gil, Y. (eds.) ISWC 2016. LNCS, vol. 9982, pp. 104–112. Springer, Cham (2016). Scholar
  9. 9.
    Khashabi, D., Khot, T., Sabharwal, A., Clark, P., Etzioni, O., Roth, D.: Question answering via integer programming over semi-structured knowledge. In: Proceedings of the 25th International Joint Conference on Artificial Intelligence, pp. 1145–1152. AAAI (2016)Google Scholar
  10. 10.
    Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P.N., Hellmann, S., Morsey, M., van Kleef, P., Auer, S., Bizer, C.: DBpedia - a large-scale, multilingual knowledge base extracted from Wikipedia. Semant. Web J. 6(2), 167–195 (2015)Google Scholar
  11. 11.
    Li, H., Xu, J.: Semantic matching in search. Found. Trends Inf. Retr. 7(5), 343–469 (2014)CrossRefGoogle Scholar
  12. 12.
    Lu, H., Kong, M.: Community-based question answering via contextual ranking metric network learning. In: Proceedings of the 31st AAAI Conference on Artificial Intelligence, pp. 4963–4964. AAAI (2017)Google Scholar
  13. 13.
    Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. In: Proceedings of the 1st International Conference on Learning Representations (2013)Google Scholar
  14. 14.
    Miller, G.A.: WordNet: an electronic lexical database. Commun. ACM 38(11), 39–41 (1995)CrossRefGoogle Scholar
  15. 15.
    Richardson, M., Burges, C.J.C., Renshaw, E.: MCTest: a challenge dataset for the open-domain machine comprehension of text. In: Proceedings of the 20th Conference on Empirical Methods in Natural Language Processing, pp. 193–203. ACL (2013)Google Scholar
  16. 16.
    Ritze, D., Lehmberg, O., Oulabi, Y., Bizer, C.: Profiling the potential of web tables for augmenting cross-domain knowledge bases. In: Proceedings of the 25th International World Wide Web Conference, pp. 251–261. ACM (2016)Google Scholar
  17. 17.
    Sukhbaatar, S., Szlam, A., Weston, J., Fergus, R.: End-to-end memory networks. In: Proceedings of the 29th International Conference on Neural Information Processing Systems, pp. 2440–2448. Curran Associates, Inc. (2015)Google Scholar
  18. 18.
    Wang, Z., Li, J., Wang, Z., Li, S., Li, M., Zhang, D., Shi, Y., Liu, Y., Zhang, P., Tang, J.: XLore: a large-scale English-Chinese bilingual knowledge graph. In: Proceedings of the 12th International Semantic Web Conference, pp. 121–124 (2013)Google Scholar
  19. 19.
    Zou, L., Huang, R., Wang, H., Yu, J.X., He, W., Zhao, D.: Natural language question answering over RDF: a graph data driven approach. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, pp. 313–324. ACM (2014)Google Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. 1.National Key Laboratory for Novel Software TechnologyNanjing UniversityNanjingChina

Personalised recommendations