The Qanary Ecosystem: Getting New Insights by Composing Question Answering Pipelines

  • Dennis Diefenbach
  • Kuldeep Singh
  • Andreas Both
  • Didier Cherix
  • Christoph Lange
  • Sören Auer
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10360)

Abstract

The field of Question Answering (QA) is very multi-disciplinary as it requires expertise from a large number of areas such as natural language processing (NLP), artificial intelligence, machine learning, information retrieval, speech recognition and semantic technologies. In the past years a large number of QA systems were proposed using approaches from different fields and focusing on particular tasks in the QA process. Unfortunately, most of these systems cannot be easily reused, extended, and results cannot be easily reproduced since the systems are mostly implemented in a monolithic fashion, lack standardized interfaces and are often not open source or available as Web services. To address these issues we developed the knowledge-based Qanary methodology for choreographing QA pipelines distributed over the Web. Qanary employs the qa vocabulary as an exchange format for typical QA components. As a result, QA systems can be built using the Qanary methodology in a simpler, more flexible and standardized way while becoming knowledge-driven instead of being process-oriented. This paper presents the components and services that are integrated using the qa vocabulary and the Qanary methodology within the Qanary ecosystem. Moreover, we show how the Qanary ecosystem can be used to analyse QA processes to detect weaknesses and research gaps. We illustrate this by focusing on the Entity Linking (EL) task w.r.t. textual natural language input, which is a fundamental step in most QA processes. Additionally, we contribute the first EL benchmark for QA, as open source. Our main goal is to show how the research community can use Qanary to gain new insights into QA processes.

Keywords

Semantic web Software reusability Question answering Service composition Semantic search Ontologies Annotation model 

References

  1. 1.
    Atdag, S., Labatut, V.: A comparison of named entity recognition tools applied to biographical texts. In: 2nd International Conference on Systems and Computer Science (ICSCS) (2013)Google Scholar
  2. 2.
    Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: a nucleus for a web of open data. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC/ISWC -2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007). doi:10.1007/978-3-540-76298-0_52 CrossRefGoogle Scholar
  3. 3.
    Both, A., Diefenbach, D., Singh, K., Shekarpour, S., Cherix, D., Lange, C.: Qanary – a methodology for vocabulary-driven open question answering systems. In: The Semantic Web. Latest Advances and New Domains: 13th International Conference, ESWC 2016, Heraklion, Crete, Greece, 29 May–2 June 2016, Proceedings (2016)Google Scholar
  4. 4.
    Both, A., Ngonga Ngomo, A.-C., Usbeck, R., Lukovnikov, D., Lemke, C., Speicher, M.: A service-oriented search framework for full text, geospatial and semantic search. In: Proceedings of the 10th International Conference on Semantic Systems, SEM 2014, pp. 65–72. ACM (2014)Google Scholar
  5. 5.
    Cabrio, E., Cojan, J., Aprosio, A.P., Magnini, B., Lavelli, A., Gandon, F.: QAKiS: an open domain QA system based on relational patterns. In: Glimm, B., Huynh, D. (eds.) Proceedings of the ISWC 2012 Posters & Demonstrations Track, vol. 914, CEUR Workshop Proceedings (2012). CEUR-WS.org
  6. 6.
    Diefenbach, D., Amjad, S., Both, A., Singh, K., Maret, P.: Trill: a reusable front-end for QA systems. In: ESWC P&D (2017)Google Scholar
  7. 7.
    Diefenbach, D., Singh, K., Maret, P.: Wdaqua-core0: a question answering component for the research community. In: ESWC, 7th Open Challenge on Question Answering over Linked Data (QALD-7) (2017)Google Scholar
  8. 8.
    Dima, C.: Answering natural language questions with intui3. In: CLEF (Working Notes) (2014)Google Scholar
  9. 9.
    Ferrández, Ó., Spurk, C., Kouylekov, M., Dornescu, I., Ferrández, S., Negri, M., Izquierdo, R., Tomás, D., Orasan, C., Neumann, G., Magnini, B., González, J.L.V.: The QALL-ME framework: a specifiable-domain multilingual question answering architecture. Web Semant. Sci. Serv. Agents World Wide Web 9(2), 137–145 (2011)CrossRefGoogle Scholar
  10. 10.
    Finkel, J.R., Grenager, T., Manning, C.: Incorporating non-local information into information extraction systems by Gibbs sampling. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, ACL 2005 (2005)Google Scholar
  11. 11.
    Freitas, A., Oliveira, J., Curry, E., O’Riain, S., da Silva, J.: Treo: combining entity-search, spreading activation and semantic relatedness for querying linked data. In: 1st Workshop on Question Answering over Linked Data (QALD-2011) (2011)Google Scholar
  12. 12.
    Lytra, I., Vidal, M.-E., Lange, C., Auer, S., Demidova, E.: WDAqua - answering questions using web data. In: Mannens, E., Dragoni, M., Nixon, L., Corcho, O. (eds.) EU Project Networking (2016)Google Scholar
  13. 13.
    Marx, E., Usbeck, R., Ngonga Ngomo, A., Höffner, K., Lehmann, J., Auer, S.: Towards an open question answering architecture. In: 10th International Conference on Semantic Systems (2014)Google Scholar
  14. 14.
    Mendes, P.N., Jakob, M., García-Silva, A., Bizer, C.: DBpedia spotlight: shedding light on the web of documents. In: Proceedings of the 7th International Conference on Semantic Systems, I-Semantics 2011 (2011)Google Scholar
  15. 15.
    Nakatani, S.: Language detection library for Java (2010). https://github.com/shuyo/language-detection
  16. 16.
    Rizzo, G., Troncy, R.: NERD: a framework for unifying named entity recognition and disambiguation extraction tools. In: 13th Conference of the European Chapter of the Association for Computational Linguistics (2012)Google Scholar
  17. 17.
    Shekarpour, S., Marx, E., Ngonga Ngomo, A.-C., Auer, S.: SINA: semantic interpretation of user queries for question answering on interlinked data. Web Semant. Sci. Serv. Agents WWW 30, 39–51 (2015)CrossRefGoogle Scholar
  18. 18.
    Singh, K., Both, A., Diefenbach, D., Shekarpour, S.: Towards a message-driven vocabulary for promoting the interoperability of question answering systems. In: 2016 IEEE Tenth International Conference on Semantic Computing (ICSC) (2016)Google Scholar
  19. 19.
    Singh, K., Both, A., Diefenbach, D., Shekarpour, S., Cherix, D., Lange, C.: Qanary-the fast track to create a question answering system with linked data technology. In: The Semantic Web: ESWC 2016 Satellite Events, Heraklion, Crete, Greece, 29 May–2 June 2016, Revised Selected Papers (2016)Google Scholar
  20. 20.
    Speck, R., Ngonga Ngomo, A.-C.: Ensemble learning for named entity recognition. In: Mika, P., Tudorache, T., Bernstein, A., Welty, C., Knoblock, C., Vrandečić, D., Groth, P., Noy, N., Janowicz, K., Goble, C. (eds.) ISWC 2014. LNCS, vol. 8796, pp. 519–534. Springer, Cham (2014). doi:10.1007/978-3-319-11964-9_33 Google Scholar
  21. 21.
    Unger, C., Forascu, C., Lopez, V., Ngonga Ngomo, A., Cabrio, E., Cimiano, P., Walter, S.: Question answering over linked data (QALD-5). In: CLEF (Working Notes) (2015)Google Scholar
  22. 22.
    Usbeck, R., Ngonga Ngomo, A.-C., Röder, M., Gerber, D., Coelho, S.A., Auer, S., Both, A.: AGDISTIS - graph-based disambiguation of named entities using linked data. In: Mika, P., Tudorache, T., Bernstein, A., Welty, C., Knoblock, C., Vrandečić, D., Groth, P., Noy, N., Janowicz, K., Goble, C. (eds.) ISWC 2014. LNCS, vol. 8796, pp. 457–471. Springer, Cham (2014). doi:10.1007/978-3-319-11964-9_29 Google Scholar
  23. 23.
    Usbeck, R., Röder, M., Ngonga Ngomo, A., Baron, C., Both, A., Brümmer, M., Ceccarelli, D., Cornolti, M., Cherix, D., Eickmann, B., Ferragina, P., Lemke, C., Moro, A., Navigli, R., Piccinno, F., Rizzo, G., Sack, H., Speck, R., Troncy, R., Waitelonis, J., Wesemann, L.: GERBIL: general entity annotator benchmarking framework. In: 24th International Conference on World Wide Web (2015)Google Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Dennis Diefenbach
    • 1
  • Kuldeep Singh
    • 2
    • 3
  • Andreas Both
    • 4
  • Didier Cherix
    • 5
  • Christoph Lange
    • 2
    • 3
  • Sören Auer
    • 2
    • 3
  1. 1.Laboratoire Hubert CurienSaint EtienneFrance
  2. 2.Fraunhofer IAISSankt AugustinGermany
  3. 3.University of BonnBonnGermany
  4. 4.DATEV eGNurembergGermany
  5. 5.FLAVIA IT-Management GmbHKasselGermany

Personalised recommendations