Cloud-based learning system for answer ranking

Yuan, Li Wei; Su, Lei; Zhang, Yin; Fang, Guang; Shu, Peng

doi:10.1007/s10586-017-0888-2

Cloud-based learning system for answer ranking

Published: 12 May 2017

Volume 20, pages 2253–2266, (2017)
Cite this article

Cluster Computing Aims and scope Submit manuscript

Li Wei Yuan¹,
Lei Su ORCID: orcid.org/0000-0003-0210-6506¹,
Yin Zhang²,
Guang Fang¹ &
…
Peng Shu¹

304 Accesses
2 Citations
Explore all metrics

Abstract

Community question answering (Q&A) is a new knowledge-sharing model where a large number of questions and answers are accumulated through the user’s submission. When the user submits a new question, the Q&A system can provide the accurate answers list by the learning model. The traditional ranking algorithm mainly uses a large number of labeled data to train the model. However, a ranking model trained in the source domain may lead to poor performance in the target domain because of the lack of labeled training samples in the new domain. To address this challenge, this paper proposes a transfer learning algorithm based on feature selection for ranking. Suppose that the source domain and the target domain share the low-dimensional feature representation, and due to the user features exist share knowledge in source domain and target, so we use the user features are integrated into the answer space. Then the features of the target domain are shared for knowledge transfer. Furthermore, to improve the computational efficiency for the huge amount of data in the community Q&A, the learning model is distributed and processed by the Spark technology. Experimental results show that the proposed method could effectively exploit the cross-domain knowledge to enhance the effect of ranking.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Mao, X.L., Li, X.M.: A survey of question–answer system. J. Comput. Sci. Explor. 6(3), 193–207 (2012)
Article Google Scholar
Lian, X.: Research on some problems in community question-answering system, pp. 2–3. School of Computer and Control Engineering, Nankai University (2014)
Google Scholar
You, L., Zhou, Y.Q., Huang, X.Q., Wu, L.D.: Confidence score algorithm for OA system based on maximum entropy model. J. Softw. 16(8), 1407–1414 (2005)
Article MATH Google Scholar
Quoc, C., Le, V.: Learning to rank with nonsmooth cost functions. Proc. Adv. Neural Inform. Process. Syst. 19, 193–200 (2007)
Google Scholar
Cao, Z., Qin, T., Liu, T.Y., et al.: Learning to rank: from pairwise approach to listwise approach. In: Proceedings of the 24th International Conference on Machine Learning, pp. 129–136 (2007)
Qin, T., Zhang, X.D., Tsai, M.F., et al.: Query-level loss functions for information retrieval. Inform. Process. Manag. 44(2), 838–855 (2008)
Article Google Scholar
Suzuki, J., Sasaki, Y., Maeda, E.: SVM answer selection for open-domain question answering. In: Proceedings of the 19th International Conference on Computational linguistics, vol. 1, pp. 1–7 (2002)
He, Y., Alani, H.: Automatic identification of best answers in online enquiry communities. In: 9th Extended Semantic Web Conference (2012)
Dalip, D.H., Gonalves, M.A., Cristo, M., et al.: Exploiting user feedback to learn to rank answers in Q&A forums: a case study with stack overflow. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval. pp. 43–552 (2013)
Zhiltsov, N., Kotov, A.,. Nikolaev, F.: Fielded sequential dependence model for ad-hoc entity retrieval in the web of data. In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 253–262 (2015)
Xu, X., He, L., Lu, H., Taniguchi, R.: Non-linear Matrix Completion for Social Image Tagging. Piscataway, IEEE (2016)
Google Scholar
Zong, H.Y.: Research on the Ranking of Answers in Field Question Answering System, pp. 1–20. Kunming University of, Science and Technology (2011)
Google Scholar
Salton, G., Lesk, M.E.: Computer evaluation of indexing and text processing. J. ACM 15(1), 8–36 (1968)
Article MATH Google Scholar
Salton, G.: The Smart Retrieval System-Experiments in Automatic Document Processing, vol. 556. Prentice-Hall Inc, Upper Saddle River (1971)
Google Scholar
Robertson, S.E., Jones, K.S.: Relevance weighting of search terms. J. Am. Soc. Inform. Sci. 27(3), 129–146 (1976)
Article Google Scholar
Ravichandran, D., Hovy, E., Josef Och, F.: Statistical QA-classifier vs. re-ranker: what’s the difference? In: Proceedings of the ACI Workshop on Muhilingual Summarization and Question Answering Machine Learning, pp. 69–75 (2003)
Porter, M.F.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)
Article Google Scholar
Arnold, A., Nallapati, R., Cohen, W.: Exploiting feature hierarchy for transfer learning in named entity recognition. In: Proceedings of ACL (2008)
Richman, A.E., Schone, P.: Mining wiki resources for multilingual named entity recognition. In: proceedings of 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 1–9 (2008)
Goldwasser, D., Roth, D.: Active sample selection for named entity transliteration. In: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics, pp. 53–56 (2008)
Pan, S.J., Yang, Q.: A survey on transfer learning. Knowl. Data Eng. 22(10), 1345–1359 (2010)
Article Google Scholar
Zaharia, M., Chowdhury, M., Das, T. et al.: Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing. In: Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation, vol. 70(2), pp. 141–146 (2012)
Chan, Y., Ng, H.T.: Word sense disambiguation with distribution estimation. In: Proceedings of the 19th International Joint Conference on Artificial Intelligence, vol. 4, pp. 1010–1015 (2005)
Zhang, Y., Wang, S., et al.: Binary PSO with mutation operator for feature selection using decision tree applied to spam detection. In: Knowledge-Based Systems (2014)
Chen, D., Xiong, Y., Yan, J., Xue, G.-R., Chen, Z.: Knowledge transfer for cross domain learning to rank. Inform. Retriev. J. 13(3), 236–253 (2010)
Chen, D., Yan, J., Wang, G., Xiong, Y., Fan, W.: A novel algorithm for transfer of rank learning. In: IEEE 13th International Conference on Data Mining Workshops, pp. 106–115 (2008)
Wenyuan, D., Qiang, Y., Gui-Rong, X., Yong, Y.: Boosting for transfer learning. In: Proceedings of the Twenty-Fourth International Conference on Machine Learning, pp. 20–24 (2007)
Xue, G.-R., Dai, W., Yang, Q., Yu, Y.: Topic-bridged PLSA for cross-domain text classification. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 20–24 (2008)
Argyriou, A., Evgeniou, T., Pontil, M.: Convex multi-task feature learning. Adv. Neural Inform. Process. Syst. 73(3), 243–272 (2008)
Google Scholar
Caruana, R.: Multitask learning. In: The 28th International Conference on Machine Learning, vol. 28(1), pp. 41–45 (1997)
Herbrich, R., Graepel, T., Obermayer, K.: Large margin rank boundaries for ordinal regression. Advances in Large Margin Classifiers, pp. 115–132. MIT Press, Cambridge, MA (2000)
Sun, A., Jiang, M., Ma, Y.: An instance-based approach for pinpointing answers in Chinese question answering. In: Signal Processing 8th International Conference on IEEE, pp. 16–20(2006)

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (No. 61365010).

Author information

Authors and Affiliations

School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China
Li Wei Yuan, Lei Su, Guang Fang & Peng Shu
School of Information and Safety Engineering, Zhongnan University of Economics and Law, Wuhan, China
Yin Zhang

Authors

Li Wei Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Lei Su
View author publications
You can also search for this author in PubMed Google Scholar
Yin Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Guang Fang
View author publications
You can also search for this author in PubMed Google Scholar
Peng Shu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lei Su.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yuan, L.W., Su, L., Zhang, Y. et al. Cloud-based learning system for answer ranking. Cluster Comput 20, 2253–2266 (2017). https://doi.org/10.1007/s10586-017-0888-2

Download citation

Received: 02 December 2016
Revised: 28 February 2017
Accepted: 25 April 2017
Published: 12 May 2017
Issue Date: September 2017
DOI: https://doi.org/10.1007/s10586-017-0888-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Cloud-based learning system for answer ranking

Abstract

Access this article

Similar content being viewed by others

Artificial intelligence in recommender systems

A systematic review: machine learning based recommendation systems for e-learning

A systematic review and research perspective on recommender systems

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Cloud-based learning system for answer ranking

Abstract

Access this article

Similar content being viewed by others

Artificial intelligence in recommender systems

A systematic review: machine learning based recommendation systems for e-learning

A systematic review and research perspective on recommender systems

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation