Skip to main content

CISQA: Corporate Smart Insights Question Answering System

  • Conference paper
  • First Online:
Information Integration and Web Intelligence (iiWAS 2022)

Abstract

The development of semantic information systems is a highly topical issue in various domains, e.g., financial news, which both ordinary users and experts deal with. Until now, queries in natural language on open vocabulary from, e.g., query.wikidata.org were hard to answer. The development of such software offers the possibility to make semantic information systems more effective, efficient, and user-friendly. This paper proposes a semantic knowledge-based question answering system called cisqa. It develops entity linking and relation linking approaches that have been experimentally proven to be superior to the state of the art. cisqa also handles question ambiguity, translates natural language questions into SPARQL-queries, and delivers answers to the user in an appropriate manner.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.omgwiki.org/API4KB/doku.php.

  2. 2.

    de_core_news_lg-2.3.0 https://spacy.io/models/de#de_core_news_lg.

  3. 3.

    https://spacy.io/.

  4. 4.

    https://www.nltk.org/.

  5. 5.

    https://github.com/jdlauret/SpellChecker.

References

  1. Laurent, D., Séguéla, P., Nègre, S.: QA better than IR?. In: Proceedings of the Workshop on Multilingual Question Answering-MLQA 2006 (2006)

    Google Scholar 

  2. Quarteroni, S., Manandhar, S.: A chatbot-based interactive question answering system. DecaLog 2007, 83 (2007)

    Google Scholar 

  3. Al-Zubaide, H., Issa, A.A.: OntBot: ontology based chatbot. In: International Symposium on Innovations in Information and Communications Technology, pp. 7–12 (2011)

    Google Scholar 

  4. Baksi, K.D.: Recent advances in automated question answering in biomedical domain. ArXiv, abs/2111.05937 (2021)

    Google Scholar 

  5. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)

  6. Radford, A., Narasimhan, K., Salimans, T., Sutskever, I.: Improving language understanding by generative pre-training (2018)

    Google Scholar 

  7. Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., Sutskever, I.: Language models are unsupervised multitask learners. OpenAI Blog 1(8), 9 (2019)

    Google Scholar 

  8. Brown, T., et al.: Language models are few-shot learners. In: Advances in Neural Information Processing Systems, vol. 33, pp. 1877–1901 (2020)

    Google Scholar 

  9. Liu, J., Shen, D., Zhang, Y., Dolan, B., Carin, L., Chen, W.: What makes good in-context examples for GPT-\$3 \$?. arXiv preprint arXiv:2101.06804 (2021)

  10. Shahapure, K.R., Nicholas, C.: Cluster quality analysis using silhouette score. In: 2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA), pp. 747–748. IEEE (2020)

    Google Scholar 

  11. Qundus, J.A., Paschke, A.: Investigating the effect of attributes on user trust in social media. In: Elloumi, M., et al. (eds.) DEXA 2018. CCIS, vol. 903, pp. 278–288. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-99133-7_23

    Chapter  Google Scholar 

  12. Al Qundus, J., Paschke, A., Kumar, S., Gupta, S.: Calculating trust in domain analysis: theoretical trust model. Int. J. Inf. Manage. 48, 1–11 (2019)

    Google Scholar 

  13. Al Qundus, J., Paschke, A., Gupta, S., Alzouby, A.M., Yousef, M.: Exploring the impact of short-text complexity and structure on its quality in social media. J. Enterprise Inf. Manag. (2020)

    Google Scholar 

  14. Cerone, A., Naghizade, E., Scholer, F., Mallal, D., Skelton, R., Spina, D.: Watch ‘n’ Check: towards a social media monitoring tool to assist fact-checking experts. In: 2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA), pp. 607–613. IEEE (2020)

    Google Scholar 

  15. Yousef, M., Qundus, J.A., Peikert, S., Paschke, A.: TopicsRanksDC: distance-based topic ranking applied on two-class data. In: Kotsis, G., et al. (eds.) DEXA 2020. CCIS, vol. 1285, pp. 11–21. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-59028-4_2

    Chapter  Google Scholar 

  16. Al Qundus, J., Peikert, S., Paschke, A.: AI supported topic modeling using KNIME-workflows. arXiv preprint arXiv:2104.09428 (2021)

  17. Rehm, G., et al.: QURATOR: innovative technologies for content and data curation. arXiv preprint arXiv:2004.12195 (2020)

  18. Ehrlinger, L., Wöß, W.: Towards a definition of knowledge graphs. SEMANTiCS (Posters, Demos, SuCCESS) 48(1–4), 2 (2016)

    Google Scholar 

  19. Purkayastha, S., Dana, S., Garg, D., Khandelwal, D., Bhargav, G.P.: Knowledge graph question answering via SPARQL silhouette generation. ArXiv, abs/2109.09475 (2021)

    Google Scholar 

  20. Unger, C., Bühmann, L., Lehmann, J., Ngonga Ngomo, A. C., Gerber, D., Cimiano, P.: Template-based question answering over RDF data. In: Proceedings of the 21st International Conference on World Wide Web, pp. 639–648 (2012)

    Google Scholar 

  21. To, N.D., Reformat, M.Z: Question-answering system with linguistic terms over RDF knowledge graphs. In: 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 4236–4243. IEEE (2020)

    Google Scholar 

  22. Diefenbach, D., Both, A., Singh, K., Maret, P.: Towards a question answering system over the semantic web. Semantic Web 11(3), 421–439 (2020)

    Google Scholar 

  23. Dubey, M., Banerjee, D., Abdelkawi, A., Lehmann, J.: LC-QuAD 2.0: a large dataset for complex question answering over Wikidata and DBpedia. In: Ghidini, C., et al. (eds.) ISWC 2019. LNCS, vol. 11779, pp. 69–78. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30796-7_5

  24. Vrandečić, D.: Wikidata: a new platform for collaborative data collection. In: Proceedings of the 21st International Conference on World Wide Web, pp. 1063–1064 (2012)

    Google Scholar 

  25. Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: a nucleus for a web of open data. In: Aberer, K., et al. (eds.) ASWC/ISWC -2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-76298-0_52

    Chapter  Google Scholar 

  26. Suchanek, F.M., Kasneci, G., Weikum, G.: YAGO: a core of semantic knowledge. In: Proceedings of the 16th International Conference on World Wide Web, pp. 697–706 (2007)

    Google Scholar 

  27. Etzioni, O., et al.: Web-scale information extraction in KnowitAll: (preliminary results). In: Proceedings of the 13th International Conference on World Wide Web, pp. 100–110 (2004)

    Google Scholar 

  28. Silva, J., Coheur, L., Mendes, A.C., Wichert, A.: From symbolic to sub-symbolic information in question classification. Artif. Intell. Rev. 35(2), 137–154 (2011)

    Article  Google Scholar 

  29. Dubey, M., Dasgupta, S., Sharma, A., Höffner, K., Lehmann, J.: AskNow: a framework for natural language query formalization in SPARQL. In: Sack, H., Blomqvist, E., d’Aquin, M., Ghidini, C., Ponzetto, S.P., Lange, C. (eds.) ESWC 2016. LNCS, vol. 9678, pp. 300–316. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-34129-3_19

    Chapter  Google Scholar 

  30. Ho, V.T., Ibrahim, Y., Pal, K., Berberich, K., Weikum, G.: Qsearch: answering quantity queries from text. In: Ghidini, C., et al. (eds.) ISWC 2019. LNCS, vol. 11778, pp. 237–257. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30793-6_14

    Chapter  Google Scholar 

  31. Zimina, E., Nummenmaa, J., Järvelin, K., Peltonen, J., Stefanidis, K., Hyyrö, H.: GQA: grammatical question answering for RDF data. In: Buscaldi, D., Gangemi, A., Reforgiato Recupero, D. (eds.) SemWebEval 2018. CCIS, vol. 927, pp. 82–97. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00072-1_8

    Chapter  Google Scholar 

  32. Zafar, H., Dubey, M., Lehmann, J., Demidova, E.: IQA: interactive query construction in semantic question answering systems. J. Web Semant. 64, 100586 (2020)

    Google Scholar 

  33. Lin, X., Li, H., Xin, H., Li, Z., Chen, L.: KBPearl: a knowledge base population system supported by joint entity and relation linking. Proc. VLDB Endowment 13(7), 1035–1049 (2020)

    Google Scholar 

  34. Etzioni, O., Banko, M., Soderland, S., Weld, D.S.: Open information extraction from the web. Commun. ACM 51(12), 68–74 (2008)

    Article  Google Scholar 

  35. Dubey, M., Banerjee, D., Chaudhuri, D., Lehmann, J.: EARL: joint entity and relation linking for question answering over knowledge graphs. In: Vrandečcić, D. (ed.) ISWC 2018. LNCS, vol. 11136, pp. 108–126. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00671-6_7

  36. Sakor, A., et al.: Old is gold: linguistic driven approach for entity and relation linking of short text. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1 (Long and Short Papers), pp. 2336–2346 (2019)

    Google Scholar 

  37. Sakor, A., Singh, K., Patel, A., Vidal, M.E.: Falcon 2.0: an entity and relation linking tool over Wikidata. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, pp. 3141–3148 (2020)

    Google Scholar 

  38. Federici, S., Montemagni, S., Pirrelli, V.: Shallow parsing and text chunking: a view on under specification in syntax (2021)

    Google Scholar 

  39. Norvig, P.: Natural language corpus data. Beautiful Data, pp. 219–242 (2009)

    Google Scholar 

  40. Athan, T., Bell, R., Kendall, E., Paschke, A., Sottara, D.: API4KP metamodel: a meta-API for heterogeneous knowledge platforms. In: Bassiliades, N., Gottlob, G., Sadri, F., Paschke, A., Roman, D. (eds.) RuleML 2015. LNCS, vol. 9202, pp. 144–160. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-21542-6_10

    Chapter  Google Scholar 

Download references

Acknowledgment

The research presented in this article is partially funded by the German Federal Ministry of Education and Research (BMBF) through the project PANQURA, grant no. 03COV03F https://qurator.ai/panqura/

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Le Duyen Sandra Vu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Vu, L.D.S., Qundus, J.A., Jung, J., Peikert, S., Paschke, A. (2022). CISQA: Corporate Smart Insights Question Answering System. In: Pardede, E., Delir Haghighi, P., Khalil, I., Kotsis, G. (eds) Information Integration and Web Intelligence. iiWAS 2022. Lecture Notes in Computer Science, vol 13635. Springer, Cham. https://doi.org/10.1007/978-3-031-21047-1_43

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-21047-1_43

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-21046-4

  • Online ISBN: 978-3-031-21047-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics