An Effective Corpus-Based Question Answering Pipeline for Italian

  • Emanuele DamianoEmail author
  • Raffaele Spinelli
  • Massimo Esposito
  • Giuseppe De Pietro
Conference paper
Part of the Smart Innovation, Systems and Technologies book series (SIST, volume 76)


Question Answering is a longevous field in computer science, aimed at realizing systems able to answer questions expressed in natural language. However, building Question Answering systems for Italian and able to extract answers from a corpus pertaining a closed domain is still an open research problem. Indeed, extracting clues from a question to generate a query for the information retrieval engine as well as determining the likelihood that a candidate answer is correct are two very thorny tasks. To face these issues, the paper presents a Question Answering pipeline for Italian and based on a corpus of documents pertaining a closed domain. In particular, this pipeline exhibits functionalities for: (i) analyzing natural language questions in Italian by using lexical features; (ii) handling both factoid and description answer types and, depending on them, filtering contextual stop words from questions; (iii) scoring and selecting candidate answers with respect to their type in order to determine the best one. The proposed solution has been subject to an evaluation of its performance using standard metrics, showing promising results.


Question Answering Cognitive Systems NLP 


  1. 1.
    Amato, F., Moscato, F.: Exploiting cloud and workflow patterns for the analysis of composite cloud services. Future Gener. Comput. Syst. 67, 255–265 (2017)CrossRefGoogle Scholar
  2. 2.
    Amato, F., Moscato, F.: Pattern-based orchestration and automatic verification of composite cloud services. Comput. Electr. Eng. 56, 842–853 (2016)CrossRefGoogle Scholar
  3. 3.
    Amato, F., Moscato, F.: Model transformations of mapreduce design patterns for automatic development and verification. JPDC (2016)Google Scholar
  4. 4.
    Baudiš, P.: Yodaqa: a modular question answering system pipeline. In: POSTER 2015–19th International Student Conference on Electrical Engineering (2015)Google Scholar
  5. 5.
    Carvalho, G., de Matos, D.M., Rocio, V.: IdSay: question answering for Portuguese, pp. 345–352. Springer, Heidelberg (2009)Google Scholar
  6. 6.
    Damiano, E., Spinelli, R., Esposito, M., De Pietro, G.: Towards a framework for closed-domain question answering in Italian. In: Proceedings Workshop KARE 2016 (2016)Google Scholar
  7. 7.
    Feng, M., Xiang, B., Glass, M.R., Wang, L., Zhou, B.: Applying deep learning to answer selection: a study and an open task. CoRR abs/1508.01585 (2015)Google Scholar
  8. 8.
    Gallagher, S., Zadrozny, W., Shalaby, W., Avadhani, A.: Watsonsim: overview of a question answering engine. arXiv preprint arXiv:1412.0879 (2014)
  9. 9.
    Gondek, D., Lally, A., Kalyanpur, A., Murdock, J.W., Duboué, P.A., Zhang, L., Pan, Y., Qiu, Z., Welty, C.: A framework for merging and ranking of answers in DeepQA. IBM J. Res. Dev. 56(3.4), 14:1 (2012)CrossRefGoogle Scholar
  10. 10.
    Hauswald, J., Laurenzano, M.A., Zhang, Y., Yang, H., Kang, Y., Li, C., Rovinski, A., Khurana, A., Dreslinski, R.G., Mudge, T., Petrucci, V., Tang, L., Mars, J.: Designing future warehouse-scale computers for sirius, an end-to-end voice and vision personal assistant. ACM Trans. Comput. Syst. 34(1), 2:1–2:32 (2016)CrossRefGoogle Scholar
  11. 11.
    Kamdi, R.P., Agrawal, A.J.: Keywords based closed domain question answering system for Indian penal code sections and Indian amendment laws. Int. J. Intell. Syst. Appl. 7(12), 57–67 (2015)Google Scholar
  12. 12.
    Lally, A., Prager, J.M., McCord, M.C., Boguraev, B.K., Patwardhan, S., Fan, J., Fodor, P., Chu-Carroll, J.: Question analysis: how Watson reads a clue. IBM J. Res. Dev. 56(3.4), 2:1 (2012)CrossRefGoogle Scholar
  13. 13.
    Li, T., Hao, Y., Zhu, X., Zhang, X.: A Chinese question answering system for specific domain, pp. 590–601. Springer International Publishing, Cham (2014)Google Scholar
  14. 14.
    Molino, P., Basile, P., Caputo, A., Lops, P., Semeraro, G.: Exploiting distributional semantic models in question answering. In: 2012 IEEE Sixth International Conference on Semantic Computing (ICSC), pp. 146–153. IEEE (2012)Google Scholar
  15. 15.
    Morales, A., Premtoon, V., Avery, C., Felshin, S., Katz, B.: Learning to answer questions from wikipedia infoboxes. In: Proceedings of the EMNLP 2016, Austin, Texas, USA, 1–4 November 2016, pp. 1930–1935 (2016)Google Scholar
  16. 16.
    Pipitone, A., Tirone, G., Pirrone, R.: QuASIt: a cognitive inspired approach to question answering for the Italian language, pp. 464–476. Springer (2016)Google Scholar
  17. 17.
    Prager, J., Brown, E., Coden, A., Radev, D.: Question-answering by predictive annotation. In: Proceedings of the 23rd ACM SIGIR Conference, SIGIR 2000, pp. 184–191. ACM, New York (2000)Google Scholar
  18. 18.
    Schlaefer, N., Gieselmann, P., Schaaf, T., Waibel, A.: A pattern learning approach to question answering within the ephyra framework. In: International Conference on Text, Speech and Dialogue. pp. 687–694. Springer (2006)Google Scholar
  19. 19.
    Solorio, T., Pérez-Coutino, M., Montes-y Gémez, M., Villasenor-Pineda, L., López-López, A.: A language independent method for question classification. In: Proceedings of Coling 2004, p. 1374 (2004)Google Scholar
  20. 20.
    Vargas-Vera, M., Lytras, M.D.: AQUA: a closed-domain question answering system. Inf. Syst. Manag. 27(3), 217–225 (2010)CrossRefGoogle Scholar
  21. 21.
    Wang, C., Kalyanpur, A., Fan, J., Boguraev, B.K., Gondek, D.: Relation extraction and scoring in DeepQA. IBM J. Res. Dev. 56(34), 91 (2012)CrossRefGoogle Scholar
  22. 22.
    Weis, K.: A case based reasoning approach for answer reranking in question answering. CoRR abs/1503.02917 (2015)Google Scholar
  23. 23.
    Xie, Z., Zeng, Z., Zhou, G., He, T.: Knowledge base question answering based on deep learning models, pp. 300–311. Springer (2016)Google Scholar
  24. 24.
    Yao, X., Van Durme, B., Clark, P.: Automatic coupling of answer extraction and information retrieval. In: Proceedings of ACL Short (2013)Google Scholar
  25. 25.
    Yu, L., Hermann, K.M., Blunsom, P., Pulman, S.: Deep learning for answer sentence selection. arXiv preprint arXiv:1412.1632 (2014)

Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  • Emanuele Damiano
    • 1
    Email author
  • Raffaele Spinelli
    • 1
  • Massimo Esposito
    • 1
  • Giuseppe De Pietro
    • 1
  1. 1.National Research Council of ItalyInstitute for High Performance Computing and Networking - ICARNaplesItaly

Personalised recommendations