A Hybrid Approach to DBQA

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 10102)

Abstract

Document-based question answering (DBQA) is a sub-task of open-domain question answering that aims to select the answer sentence(s) for a question from the given documents. In this paper, we propose a hybrid approach to answer sentence selection that combines existing models via a rank SVM model. Specifically, we capture the relationship between the question and candidate answer sentences from three aspects: surface string similarity, deep semantic similarity, and relevance based on information retrieval models. Our experiments show that an improved retrieval model outperforms the other methods, including the deep learning models. By applying a rank SVM model to combine all these features, we achieve 0.8120 in mean reciprocal rank (MRR) and 0.8111 in mean average precision (MAP) on the open test.
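The two evaluation measures named in the abstract can be illustrated with a minimal sketch. It assumes each question comes with a list of candidate sentences already sorted by model score, represented only by binary relevance labels (1 for a correct answer sentence, 0 otherwise). The function names and the toy labels are hypothetical and are not taken from the paper or from any evaluation script used in it.

```python
from typing import Callable, List, Sequence


def reciprocal_rank(labels: Sequence[int]) -> float:
    """Return 1 / rank of the first correct sentence, or 0.0 if none is correct."""
    for rank, label in enumerate(labels, start=1):
        if label:
            return 1.0 / rank
    return 0.0


def average_precision(labels: Sequence[int]) -> float:
    """Average the precision values measured at the rank of each correct sentence."""
    hits = 0
    precision_sum = 0.0
    for rank, label in enumerate(labels, start=1):
        if label:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / hits if hits else 0.0


def mean_over_questions(
    metric: Callable[[Sequence[int]], float],
    ranked_labels: List[Sequence[int]],
) -> float:
    """Average a per-question metric over all questions (0.0 for an empty set)."""
    if not ranked_labels:
        return 0.0
    return sum(metric(labels) for labels in ranked_labels) / len(ranked_labels)


# Toy data (assumed): two questions, candidates already sorted by model score.
toy_rankings = [[0, 1, 0], [1, 0, 1]]
mrr = mean_over_questions(reciprocal_rank, toy_rankings)
map_score = mean_over_questions(average_precision, toy_rankings)
```

MRR rewards only the position of the first correct sentence, while MAP accounts for every correct sentence in the ranking, which is why the two scores can differ when a question has more than one answer sentence.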

Acknowledgments

This work is supported by the Natural Science Foundation of China (Grant Nos. 61272384, 61402134, and 61370170).

Author information

Corresponding author

Correspondence to Muyun Yang.

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Wu, F., Yang, M., Zhao, T., Han, Z., Zheng, D., Zhao, S. (2016). A Hybrid Approach to DBQA. In: Lin, CY., Xue, N., Zhao, D., Huang, X., Feng, Y. (eds) Natural Language Understanding and Intelligent Applications. ICCPOL 2016, NLPCC 2016. Lecture Notes in Computer Science, vol 10102. Springer, Cham. https://doi.org/10.1007/978-3-319-50496-4_87

  • DOI: https://doi.org/10.1007/978-3-319-50496-4_87

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-50495-7

  • Online ISBN: 978-3-319-50496-4

  • eBook Packages: Computer Science, Computer Science (R0)
