Abstract
The development of semantic information systems is a highly topical issue in various domains, e.g., financial news, which both ordinary users and experts deal with. Until now, queries in natural language on open vocabulary from, e.g., query.wikidata.org were hard to answer. The development of such software offers the possibility to make semantic information systems more effective, efficient, and user-friendly. This paper proposes a semantic knowledge-based question answering system called cisqa. It develops entity linking and relation linking approaches that have been experimentally proven to be superior to the state of the art. cisqa also handles question ambiguity, translates natural language questions into SPARQL-queries, and delivers answers to the user in an appropriate manner.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Laurent, D., Séguéla, P., Nègre, S.: QA better than IR?. In: Proceedings of the Workshop on Multilingual Question Answering-MLQA 2006 (2006)
Quarteroni, S., Manandhar, S.: A chatbot-based interactive question answering system. DecaLog 2007, 83 (2007)
Al-Zubaide, H., Issa, A.A.: OntBot: ontology based chatbot. In: International Symposium on Innovations in Information and Communications Technology, pp. 7–12 (2011)
Baksi, K.D.: Recent advances in automated question answering in biomedical domain. ArXiv, abs/2111.05937 (2021)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Radford, A., Narasimhan, K., Salimans, T., Sutskever, I.: Improving language understanding by generative pre-training (2018)
Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., Sutskever, I.: Language models are unsupervised multitask learners. OpenAI Blog 1(8), 9 (2019)
Brown, T., et al.: Language models are few-shot learners. In: Advances in Neural Information Processing Systems, vol. 33, pp. 1877–1901 (2020)
Liu, J., Shen, D., Zhang, Y., Dolan, B., Carin, L., Chen, W.: What makes good in-context examples for GPT-\$3 \$?. arXiv preprint arXiv:2101.06804 (2021)
Shahapure, K.R., Nicholas, C.: Cluster quality analysis using silhouette score. In: 2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA), pp. 747–748. IEEE (2020)
Qundus, J.A., Paschke, A.: Investigating the effect of attributes on user trust in social media. In: Elloumi, M., et al. (eds.) DEXA 2018. CCIS, vol. 903, pp. 278–288. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-99133-7_23
Al Qundus, J., Paschke, A., Kumar, S., Gupta, S.: Calculating trust in domain analysis: theoretical trust model. Int. J. Inf. Manage. 48, 1–11 (2019)
Al Qundus, J., Paschke, A., Gupta, S., Alzouby, A.M., Yousef, M.: Exploring the impact of short-text complexity and structure on its quality in social media. J. Enterprise Inf. Manag. (2020)
Cerone, A., Naghizade, E., Scholer, F., Mallal, D., Skelton, R., Spina, D.: Watch ‘n’ Check: towards a social media monitoring tool to assist fact-checking experts. In: 2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA), pp. 607–613. IEEE (2020)
Yousef, M., Qundus, J.A., Peikert, S., Paschke, A.: TopicsRanksDC: distance-based topic ranking applied on two-class data. In: Kotsis, G., et al. (eds.) DEXA 2020. CCIS, vol. 1285, pp. 11–21. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-59028-4_2
Al Qundus, J., Peikert, S., Paschke, A.: AI supported topic modeling using KNIME-workflows. arXiv preprint arXiv:2104.09428 (2021)
Rehm, G., et al.: QURATOR: innovative technologies for content and data curation. arXiv preprint arXiv:2004.12195 (2020)
Ehrlinger, L., Wöß, W.: Towards a definition of knowledge graphs. SEMANTiCS (Posters, Demos, SuCCESS) 48(1–4), 2 (2016)
Purkayastha, S., Dana, S., Garg, D., Khandelwal, D., Bhargav, G.P.: Knowledge graph question answering via SPARQL silhouette generation. ArXiv, abs/2109.09475 (2021)
Unger, C., Bühmann, L., Lehmann, J., Ngonga Ngomo, A. C., Gerber, D., Cimiano, P.: Template-based question answering over RDF data. In: Proceedings of the 21st International Conference on World Wide Web, pp. 639–648 (2012)
To, N.D., Reformat, M.Z: Question-answering system with linguistic terms over RDF knowledge graphs. In: 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 4236–4243. IEEE (2020)
Diefenbach, D., Both, A., Singh, K., Maret, P.: Towards a question answering system over the semantic web. Semantic Web 11(3), 421–439 (2020)
Dubey, M., Banerjee, D., Abdelkawi, A., Lehmann, J.: LC-QuAD 2.0: a large dataset for complex question answering over Wikidata and DBpedia. In: Ghidini, C., et al. (eds.) ISWC 2019. LNCS, vol. 11779, pp. 69–78. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30796-7_5
Vrandečić, D.: Wikidata: a new platform for collaborative data collection. In: Proceedings of the 21st International Conference on World Wide Web, pp. 1063–1064 (2012)
Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: a nucleus for a web of open data. In: Aberer, K., et al. (eds.) ASWC/ISWC -2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-76298-0_52
Suchanek, F.M., Kasneci, G., Weikum, G.: YAGO: a core of semantic knowledge. In: Proceedings of the 16th International Conference on World Wide Web, pp. 697–706 (2007)
Etzioni, O., et al.: Web-scale information extraction in KnowitAll: (preliminary results). In: Proceedings of the 13th International Conference on World Wide Web, pp. 100–110 (2004)
Silva, J., Coheur, L., Mendes, A.C., Wichert, A.: From symbolic to sub-symbolic information in question classification. Artif. Intell. Rev. 35(2), 137–154 (2011)
Dubey, M., Dasgupta, S., Sharma, A., Höffner, K., Lehmann, J.: AskNow: a framework for natural language query formalization in SPARQL. In: Sack, H., Blomqvist, E., d’Aquin, M., Ghidini, C., Ponzetto, S.P., Lange, C. (eds.) ESWC 2016. LNCS, vol. 9678, pp. 300–316. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-34129-3_19
Ho, V.T., Ibrahim, Y., Pal, K., Berberich, K., Weikum, G.: Qsearch: answering quantity queries from text. In: Ghidini, C., et al. (eds.) ISWC 2019. LNCS, vol. 11778, pp. 237–257. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30793-6_14
Zimina, E., Nummenmaa, J., Järvelin, K., Peltonen, J., Stefanidis, K., Hyyrö, H.: GQA: grammatical question answering for RDF data. In: Buscaldi, D., Gangemi, A., Reforgiato Recupero, D. (eds.) SemWebEval 2018. CCIS, vol. 927, pp. 82–97. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00072-1_8
Zafar, H., Dubey, M., Lehmann, J., Demidova, E.: IQA: interactive query construction in semantic question answering systems. J. Web Semant. 64, 100586 (2020)
Lin, X., Li, H., Xin, H., Li, Z., Chen, L.: KBPearl: a knowledge base population system supported by joint entity and relation linking. Proc. VLDB Endowment 13(7), 1035–1049 (2020)
Etzioni, O., Banko, M., Soderland, S., Weld, D.S.: Open information extraction from the web. Commun. ACM 51(12), 68–74 (2008)
Dubey, M., Banerjee, D., Chaudhuri, D., Lehmann, J.: EARL: joint entity and relation linking for question answering over knowledge graphs. In: Vrandečcić, D. (ed.) ISWC 2018. LNCS, vol. 11136, pp. 108–126. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00671-6_7
Sakor, A., et al.: Old is gold: linguistic driven approach for entity and relation linking of short text. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1 (Long and Short Papers), pp. 2336–2346 (2019)
Sakor, A., Singh, K., Patel, A., Vidal, M.E.: Falcon 2.0: an entity and relation linking tool over Wikidata. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, pp. 3141–3148 (2020)
Federici, S., Montemagni, S., Pirrelli, V.: Shallow parsing and text chunking: a view on under specification in syntax (2021)
Norvig, P.: Natural language corpus data. Beautiful Data, pp. 219–242 (2009)
Athan, T., Bell, R., Kendall, E., Paschke, A., Sottara, D.: API4KP metamodel: a meta-API for heterogeneous knowledge platforms. In: Bassiliades, N., Gottlob, G., Sadri, F., Paschke, A., Roman, D. (eds.) RuleML 2015. LNCS, vol. 9202, pp. 144–160. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-21542-6_10
Acknowledgment
The research presented in this article is partially funded by the German Federal Ministry of Education and Research (BMBF) through the project PANQURA, grant no. 03COV03F https://qurator.ai/panqura/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Vu, L.D.S., Qundus, J.A., Jung, J., Peikert, S., Paschke, A. (2022). CISQA: Corporate Smart Insights Question Answering System. In: Pardede, E., Delir Haghighi, P., Khalil, I., Kotsis, G. (eds) Information Integration and Web Intelligence. iiWAS 2022. Lecture Notes in Computer Science, vol 13635. Springer, Cham. https://doi.org/10.1007/978-3-031-21047-1_43
Download citation
DOI: https://doi.org/10.1007/978-3-031-21047-1_43
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-21046-4
Online ISBN: 978-3-031-21047-1
eBook Packages: Computer ScienceComputer Science (R0)