Translation Language Model Enhancement for Community Question Retrieval Using User Adoption Answer

Chen, Ming; Li, Lin; Xie, Qing

doi:10.1007/978-3-319-63579-8_20

Ming Chen¹⁸,
Lin Li¹⁸ &
Qing Xie¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10366))

Included in the following conference series:

Asia-Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint Conference on Web and Big Data

1875 Accesses
1 Citations

Abstract

Community Question Answering (CQA) services on Web provide an important alternative for knowledge acquisition. As an essential component of CQA services, question retrieval can help users save much time by finding relevant questions. However, there is a “gap” between queried questions and candidate questions, which is called lexical chasm or word mismatch problem. In this paper, we improve traditional Topic inference based Translation Language Model (T\(^2\)LM) by using the topic information of queries. Moreover, we make use of user information, specifically the number of user adoption answers, for further enhancing our proposed model. In our model, the translation model and the topic model “bridge” the word gap by linking different words. Besides, user information that has no direct relation with semantics is used to help us “bypass” the gap. By combining both of them we obtain a considerable improvement for the performance of question retrieval. Experimental results on a real Chinese CQA data set show that our proposed model improves the retrieval performance over T\(^2\)LM baseline by 7.5% in terms of Mean Average Precision (MAP).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Ponte, J.M., Croft, W.B.: A language modeling approach to information retrieval. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 275–281 (1998)
Google Scholar
Jeon, J., Croft, W.B., Lee, J.H.: Finding similar questions in large question and answer archives. In: CIKM, pp. 84–90 (2005)
Google Scholar
Berger, A.L., Caruana, R., Cohn, D., Freitag, D., Mittal, V.O.: Bridging the lexical chasm: statistical approaches to answer-finding. In: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 192–199 (2000)
Google Scholar
Riezler, S., Vasserman, A., Tsochantaridis, I., Mittal, V.O., Liu, Y.: Statistical machine translation for query expansion in answer retrieval. In: ACL, pp. 464–471 (2007)
Google Scholar
Xue, X., Jeon, J., Croft, W.B.: Retrieval models for question and answer archives. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 475–482 (2008)
Google Scholar
Bernhard, D., Gurevych, I.: Combining lexical semantic resources with question & answer archives for translation-based answer finding. In: ACL, pp. 728–736 (2009)
Google Scholar
Zhou, G., Cai, L., Zhao, J., Liu, K.: Phrase-based translation model for question retrieval in community question answer archives. In: ACL, pp. 653–662 (2011)
Google Scholar
Wei, W., Croft, W.B.: LDA-based document models for ad-hoc retrieval. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 178–185 (2006)
Google Scholar
Cai, L., Zhou, G., Liu, K., Zhao, J.: Learning the latent topics for question retrieval in community QA. In: IJCNLP, pp. 273–281 (2011)
Google Scholar
Ji, Z., Xu, F., Wang, B., He, B.: Question-answer topic model for question retrieval in community question answering. In: CIKM, pp. 2471–2474 (2012)
Google Scholar
Zhang, W.N., Zhang, Y., Liu, T.: A topic inference based translation model for question retrieval in community-based question answering services. Chin. J. Comput. 38(2), 313–321 (2015)
MathSciNet Google Scholar
Hofmann, T.: Unsupervised learning by probabilistic latent semantic analysis. Mach. Learn. 45, 177–196 (2001)
Article MATH Google Scholar
Blei, M.D., Ng, Y.A., Jordan, I.M.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
MATH Google Scholar
Cao, X., Cong, G., Cui, B., Jensen, C.S., Yuan, Q.: Approaches to exploring category information for question retrieval in community question-answer archives. Acm Trans. Inf. Syst. 30(2), 1–38 (2012)
Article Google Scholar
Och, F.J., Ney, H.: Improved statistical alignment models. In: ACL, pp. 440–447 (2000)
Google Scholar
Zhang, W.N., Ming, Z.Y., Zhang, Y., Liu, T., Chua, T.S.: Capturing the semantics of key phrases using multiple languages for question retrieval. IEEE Trans. Knowl. Data Eng. 28(4), 888–900 (2016)
Article Google Scholar
Chen, L., Jose, J.M., Yu, H., Yuan, F., Zhang, D.: A semantic graph based topic model for question retrieval in community question answering. In: The Ninth ACM International Conference on Web Search and Data Mining, pp. 287–296 (2016)
Google Scholar
Omari, A., Carmel, D., Rokhlenko, O., Szpektor, I.: Novelty based ranking of human answers for community questions. In: International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 215–224 (2016)
Google Scholar
Yuan, Q., Cong, G., Sun, A., Lin, C.Y., Thalmann, N.M.: Category hierarchy maintenance: a data-driven approach. In: The 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 791–800 (2012)
Google Scholar
Zhang, W., Ming, Z., Zhang, Y., Nie, L., Liu, T., Chua, T.S.: The use of dependency relation graph to enhance the term weighting in question retrieval. In: COLING, pp. 3105–3120 (2012)
Google Scholar
Dijk, D.V., Tsagkias, M., Rijke, M.D.: Early detection of topical expertise in community question answering. In: The International ACM SIGIR Conference, pp. 995–998 (2015)
Google Scholar
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval, pp. 139–159. Cambridge University Press, Cambridge (2008)
Book MATH Google Scholar

Download references

Acknowledgement

This research project is supported by the National Natural Science Foundation of China (Grant Nos: 61602353, 61303029), National Social Science Foundation of China (Grant No: 15BGL048), Hubei Province Science and Technology Support Project (2015BAA072), 863 Program (2015AA015403).

Author information

Authors and Affiliations

School of Computer Science and Technology, Wuhan University of Technology, Wuhan, China
Ming Chen, Lin Li & Qing Xie

Authors

Ming Chen
View author publications
You can also search for this author in PubMed Google Scholar
Lin Li
View author publications
You can also search for this author in PubMed Google Scholar
Qing Xie
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lin Li .

Editor information

Editors and Affiliations

Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong, China
Lei Chen
Computer Science, Aarhus University, Aarhus N, Denmark
Christian S. Jensen
Computer Science, University of Southern California, Los Angeles, California, USA
Cyrus Shahabi
Northeastern University, Shenyang, China
Xiaochun Yang
Kent State University, Kent, Ohio, USA
Xiang Lian

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, M., Li, L., Xie, Q. (2017). Translation Language Model Enhancement for Community Question Retrieval Using User Adoption Answer. In: Chen, L., Jensen, C., Shahabi, C., Yang, X., Lian, X. (eds) Web and Big Data. APWeb-WAIM 2017. Lecture Notes in Computer Science(), vol 10366. Springer, Cham. https://doi.org/10.1007/978-3-319-63579-8_20

Download citation

DOI: https://doi.org/10.1007/978-3-319-63579-8_20
Published: 03 August 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-63578-1
Online ISBN: 978-3-319-63579-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics