Exploring Sequence-to-Sequence Models for SPARQL Pattern Composition

Panchbhai, Anand; Soru, Tommaso; Marx, Edgard

doi:10.1007/978-3-030-65384-2_12

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1232)

Included in the following conference series:

Iberoamerican Knowledge Graphs and Semantic Web Conference

Abstract

A booming amount of information is continuously added to the Internet as structured and unstructured data, feeding knowledge bases such as DBpedia and Wikidata with billions of statements describing millions of entities. The aim of Question Answering systems is to allow lay users to access such data using natural language without needing to write formal queries. However, users often submit questions that are complex and require a certain level of abstraction and reasoning to decompose them into basic graph patterns. In this short paper, we explore the use of architectures based on Neural Machine Translation called Neural SPARQL Machines to learn pattern compositions. We show that sequence-to-sequence models are a viable and promising option to transform long utterances into complex SPARQL queries.

Notes

1. https://github.com/LiberAI/NSpM/wiki/Compositionality.
2. The metadata can be fetched from http://mappings.dbpedia.org/server/ontology/classes/.
3. https://github.com/paulhoule/telepath/wiki/SubjectiveEye3D.
4. Retrieved on 19/10/2020 from https://dbpedia.org/sparql.

Author information

Authors and Affiliations

AKSW, University of Leipzig, Leipzig, Germany
Tommaso Soru & Edgard Marx
Leipzig University of Applied Sciences, Leipzig, Germany
Edgard Marx
Liber AI Research, London, UK
Anand Panchbhai, Tommaso Soru & Edgard Marx

Corresponding author

Correspondence to Tommaso Soru.

Editor information

Editors and Affiliations

Tinámica, Madrid, Spain
Boris Villazón-Terrazas
Tamaulipas Autonomous University, Ciudad Victoria, Mexico
Fernando Ortiz-Rodríguez
Tamaulipas Autonomous University, Ciudad Victoria, Mexico
Sanju M. Tiwari
VIT Bhopal University, Bhopal, India
Shishir K. Shandilya

Cite this paper

Panchbhai, A., Soru, T., Marx, E. (2020). Exploring Sequence-to-Sequence Models for SPARQL Pattern Composition. In: Villazón-Terrazas, B., Ortiz-Rodríguez, F., Tiwari, S.M., Shandilya, S.K. (eds) Knowledge Graphs and Semantic Web. KGSWC 2020. Communications in Computer and Information Science, vol 1232. Springer, Cham. https://doi.org/10.1007/978-3-030-65384-2_12

DOI: https://doi.org/10.1007/978-3-030-65384-2_12
Published: 10 December 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-65383-5
Online ISBN: 978-3-030-65384-2
eBook Packages: Computer Science, Computer Science (R0)
