Abstract
An ever-growing amount of information is continuously added to the Internet as structured and unstructured data, feeding knowledge bases such as DBpedia and Wikidata with billions of statements describing millions of entities. Question answering systems aim to let lay users access such data in natural language, without having to write formal queries. However, users often submit complex questions that require a certain level of abstraction and reasoning before they can be decomposed into basic graph patterns. In this short paper, we explore the use of Neural SPARQL Machines, architectures based on Neural Machine Translation, to learn pattern compositions. We show that sequence-to-sequence models are a viable and promising option for transforming long utterances into complex SPARQL queries.
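To make the notion of pattern composition concrete, the following minimal Python sketch joins two basic graph patterns into a single SPARQL query and flattens it into a whitespace-separated token sequence of the kind a sequence-to-sequence model could be trained to emit. The example question, the helper names (`compose_query`, `encode`) and the token vocabulary (`brack_open`, `sep_dot`, `var_`, and so on) are assumptions made for illustration and are not taken from the paper.

```python
# Illustrative sketch only: the token names below are hypothetical and are
# not claimed to match the vocabulary used by Neural SPARQL Machines.
REPLACEMENTS = [
    ("dbo:", "dbo_"),          # ontology prefix
    ("dbr:", "dbr_"),          # resource prefix
    ("?", "var_"),             # query variables
    ("{", " brack_open "),
    ("}", " brack_close "),
    (" . ", " sep_dot "),      # separator between triple patterns
]


def compose_query(patterns, target="?x"):
    """Join basic graph patterns (triple patterns) into one SELECT query."""
    body = " . ".join(patterns)
    return f"SELECT DISTINCT {target} WHERE {{ {body} }}"


def encode(query):
    """Rewrite a SPARQL string as a whitespace-separated token sequence."""
    for old, new in REPLACEMENTS:
        query = query.replace(old, new)
    # Collapse the extra spaces introduced by the replacements.
    return " ".join(query.split())


if __name__ == "__main__":
    # Hypothetical complex question that needs two patterns to answer.
    question = "What is located in the country whose capital is Berlin?"
    patterns = [
        "?c dbo:capital dbr:Berlin",   # first pattern: find the country
        "?x dbo:country ?c",           # second pattern: things in that country
    ]
    query = compose_query(patterns)
    print(question)
    print(query)
    print(encode(query))
```

In a translation setting, the natural language question would be the source sequence and the encoded query the target sequence. The shared variable `?c` is what links the two patterns into a composition.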
Notes
- 1.
- 2. The metadata can be fetched from http://mappings.dbpedia.org/server/ontology/classes/.
- 3.
- 4. Retrieved on 19/10/2020 from https://dbpedia.org/sparql.
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Panchbhai, A., Soru, T., Marx, E. (2020). Exploring Sequence-to-Sequence Models for SPARQL Pattern Composition. In: Villazón-Terrazas, B., Ortiz-Rodríguez, F., Tiwari, S.M., Shandilya, S.K. (eds) Knowledge Graphs and Semantic Web. KGSWC 2020. Communications in Computer and Information Science, vol 1232. Springer, Cham. https://doi.org/10.1007/978-3-030-65384-2_12
DOI: https://doi.org/10.1007/978-3-030-65384-2_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-65383-5
Online ISBN: 978-3-030-65384-2
eBook Packages: Computer Science, Computer Science (R0)