Predicting answer acceptability for question-answering system

Roy, Pradeep Kumar

doi:10.1007/s00799-023-00357-2

Predicting answer acceptability for question-answering system

Published: 05 May 2023

(2023)
Cite this article

International Journal on Digital Libraries Aims and scope Submit manuscript

Pradeep Kumar Roy ORCID: orcid.org/0000-0001-5513-2834¹

111 Accesses
Explore all metrics

Abstract

Question-answering (QA) platforms such as Stack Overflow, Quora, and Stack Exchange have become favourite places to exchange knowledge with community users. Finding answers to simple or complex questions is easier on QA platforms nowadays. Due to a large number of responses from users all around the world, these CQA systems are currently facing massive problems. Stack Overflow allows users to ask questions and give answers or comments on others’ posts. Consequently, Stack Overflow also rewards those users whose posts are appreciated by the community in the form of reputation points. The accepted answer provides maximum reputation points to the answerer. More reputation points allow getting more website privileges. Hence, each answerer needs to get their answer accepted. Very little research has been done to check whether the user’s answers will be accepted or not. This paper proposes a model that predicts answer acceptability and its reason. The model’s findings help the answerer know about the answer acceptance; if the model predicted the probability of acceptance is less, the answerer might revise their answer immediately. The comparison with the state-of-the-art literature confirmed that the proposed model achieves better performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Enhanced Question Answering System with Trustworthy Answers

Exploiting Content Quality and Question Difficulty in CQA Reputation Systems

Finding and Ranking High-Quality Answers in Community Question Answering Sites

Article 07 November 2017

Data availability

Data will be made available on request.

References

Saikh, T., Ghosal, T., Mittal, A., Ekbal, A., Bhattacharyya, P.: Scienceqa: a novel resource for question answering on scholarly articles. Int. J. Dig. Libr. 1–13 (2022)
Le, L.T., Shah, C., Choi, E.: Assessing the quality of answers autonomously in community question-answering. Int. J. Digit. Libr. 20(4), 351–367 (2019)
Article Google Scholar
Seki, Y., Zhao, K., Oguni, M., Sugiyama, K.: Cnn-based framework for classifying temporal relations with question encoder. Int. J. Digit. Libr. 23(2), 167–177 (2022)
Article Google Scholar
Ye, D., Xing, Z., Kapre, N.: The structure and dynamics of knowledge network in domain-specific q &a sites: a case study of stack overflow. Empir. Softw. Eng. 22(1), 375–406 (2017)
Article Google Scholar
May, A., Wachs, J., Hannák, A.: Gender differences in participation and reward on stack overflow. Empir. Softw. Eng. 24(4), 1997–2019 (2019)
Article Google Scholar
Shah, C., Pomerantz, J.: Evaluating and predicting answer quality in community QA. In: Proceedings of the 33rd International ACM SIGIR conference on Research and Development in Information Retrieval, pp. 411–418. ACM (2010)
Tian, Y., Kochhar, P. S., Lim, E.-P., Zhu, F., Lo, D.: Predicting best answerers for new questions: an approach leveraging topic modeling and collaborative voting. In: Workshops at the International Conference on Social Informatics, pp. 55–68. Springer (2013)
Perera, R., Nand, P., Naeem, A.: Utilizing typed dependency subtree patterns for answer sentence generation in question answering systems. Prog. Artif. Intell. 6(2), 105–119 (2017)
Article Google Scholar
Roy, P. K., Singh, J. P.: A tag2vec approach for questions tag suggestion on community question answering sites. In: Machine Learning and Data Mining in Pattern Recognition: 14th International Conference, MLDM 2018, New York, NY, USA, July 15–19, 2018, Proceedings, Part II 14, pp. 168–182. Springer (2018)
Roy, P.K., Saumya, S., Singh, J.P., Banerjee, S., Gutub, A.: Analysis of community question-answering issues via machine learning and deep learning: state-of-the-art review. CAAI Trans. Intell. Technol. (2022)
Roy, P.K., Singh, J.P., Baabdullah, A.M., Kizgin, H., Rana, N.P.: Identifying reputation collectors in community question answering (CQA) sites: exploring the dark side of social media. Int. J. Inf. Manag. (Elsevier) 42, 25–35 (2018)
Article Google Scholar
Tian, Q., Zhang, P., Li, B.: Towards predicting the best answers in community-based question-answering services. In: ICWSM, pp. 725–728 (2013)
Sahu, T.P., Nagwani, N.K., Verma, S.: Selecting best answer: an empirical analysis on community question answering sites. IEEE Access 4, 4797–4808 (2016)
Article Google Scholar
Roitman, H., Erera, S., Feigenblat, G.: A study of query performance prediction for answer quality determination. In: Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval, pp. 43–46, ACM (2019)
Kelley, J., Stewart, C., Morris, N., Tiwari, D., He, Y., Elnikety, S.: Obtaining and managing answer quality for online data-intensive services. ACM Trans. Model. Perform. Evaluat. Comput. Syst.(TOMPECS) 2(2), 11 (2017)
Google Scholar
Yulianti, E., Chen, R.-C., Scholer, F., Croft, W.B., Sanderson, M.: Ranking documents by answer-passage quality. In: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 335–344, ACM (2018)
Tay, Y., Tuan, L. A., Hui, S.C.: Cross temporal recurrent networks for ranking question answer pairs. In: 32nd AAAI Conference on Artificial Intelligence, pp. 5512–5519 (2018)
Roy, P.K., Ahmad, Z., Singh, J.P., Alryalat, M.A.A., Rana, N.P., Dwivedi, Y.K.: Finding and ranking high-quality answers in community question answering sites. Glob. J. Flex. Syst. Manag. 19(1), 53–68 (2018)
Article Google Scholar
Li, L., He, D., Jeng, W., Goodwin, S., Zhang, C.: Answer quality characteristics and prediction on an academic q &a site: a case study on researchgate. in Proceedings of the 24th International Conference on World Wide Web, pp. 1453–1458. ACM (2015)
Ahasanuzzaman, M., Asaduzzaman, M., Roy, C.K., Schneider, K.A.: Mining duplicate questions in stack overflow. In: Proceedings of the 13th International Conference on Mining Software Repositories, pp. 402–412, ACM (2016)
Roy, P.K., Singh, J.P., Banerjee, S.: Is this question going to be closed? answering question closibility on stack exchange. J. Inf. Sci. 01655515221118665 (2022)
Shah, C., Pomerantz, J.: Evaluating and predicting answer quality in community QA. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 411–418. ACM (2010)
Xie, Z., Nie, Y., Jin, S., Li, S., Li, A.: Answer quality assessment in cqa based on similar support sets. In: Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, pp. 309–325. Springer (2015)
Elalfy, D., Gad, W., Ismail, R.: Predicting best answer in community questions based on content and sentiment analysis. In: 2015 IEEE 7th International Conference on Intelligent Computing and Information Systems (ICICIS), pp. 585–590. IEEE (2015)
Lee, C. T., Rodrigues, E. M., Kazai, G., Milic-Frayling, N., Ignjatovic, A.: Model for voter scoring and best answer selection in community q &a services. In: IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technologies, 2009. WI-IAT’09,vol. 1, pp. 116–123. IEEE (2009)
John, B.M., Chua, A.Y.-K., Goh, D.H.-L.: What makes a high-quality user-generated answer? IEEE Internet Comput. 15(1), 66–71 (2011)
Article Google Scholar
Blooma, M.J., Hoe-Lian Goh, D., Yeow-Kuan Chua, A.: Predictors of high-quality answers. Online Inf. Rev. 36(3), 383–400 (2012)
Article Google Scholar
Lou, J., Fang, Y., Lim, K.H., Peng, J.Z.: Contributing high quantity and quality knowledge to online q &a communities. J. Am. Soc. Inform. Sci. Technol. 64(2), 356–371 (2013)
Article Google Scholar
Toba, H., Ming, Z.-Y., Adriani, M., Chua, T.-S.: Discovering high quality answers in community question answering archives using a hierarchy of classifiers. Inf. Sci. 261, 101–115 (2014)
Article MathSciNet Google Scholar
Dong, H., Wang, J., Lin, H., Xu, B., Yang, Z.: Predicting best answerers for new questions: An approach leveraging distributed representations of words in community question answering. In:2015 9th International Conference on Frontier of Computer Science and Technology, pp. 13–18. IEEE (2015)
Yao, Y., Tong, H., Xie, T., Akoglu, L., Xu, F., Lu, J.: Detecting high-quality posts in community question answering sites. Inf. Sci. 302, 70–82 (2015)
Article Google Scholar
Lin, Y., Shen, H.: Smartq: a question and answer system for supplying high-quality and trustworthy answers. IEEE Trans. Big Data 4(4), 600–613 (2017)
Article Google Scholar
Elalfy, D., Gad, W., Ismail, R.: A hybrid model to predict best answers in question answering communities. Egypt. Inf. J. 19(1), 21–31 (2018)
Google Scholar
Calefato, F., Lanubile, F., Novielli, N.: An empirical assessment of best-answer prediction models in technical q &a sites. Empir. Softw. Eng. 24(2), 854–901 (2019)
Article Google Scholar
Shao, T., Guo, Y., Chen, H., Hao, Z.: Transformer-based neural network for answer selection in question answering. IEEE Access 7, 26146–26156 (2019)
Article Google Scholar
Roy, P.K.: Multilayer convolutional neural network to filter low quality content from quora. Neural Process. Lett. 52(1), 805–821 (2020)
Article MathSciNet Google Scholar
Roy, P.K., Singh, J.P.: Predicting closed questions on community question answering sites using convolutional neural network. Neural Comput. Appl. (Springer) 32, 10555–10572 (2019)
Article Google Scholar
Kang, Y.-B., Du, H., Forkan, A.R.M., Jayaraman, P.P., Aryani, A., Sellis, T.: Expfinder: a hybrid model for expert finding from text-based expertise data. Expert Syst. Appl. 211, 118691 (2023)
Article Google Scholar
Peng, Q., Wang, W., Liu, H., Wang, Y., Xu, H., Shao, M.: Towards comprehensive expert finding with a hierarchical matching network. Knowl.-Based Syst. 257, 109933 (2022)
Article Google Scholar
Liu, Y., Tang, W., Liu, Z., Ding, L., Tang, A.: High-quality domain expert finding method in CQA based on multi-granularity semantic analysis and interest drift. Inf. Sci. 596, 395–413 (2022)
Article Google Scholar
Roy, P.K., Ahmad, Z., Singh, J.P., Banerjee, S.: Feature extraction to filter out low-quality answers from social question answering sites. IETE J. Res. 1–12 (2022)
Lee, C.T., Rodrigues, E.M., Kazai, G., Milic-Frayling, N., Ignjatovic, A.: Model for voter scoring and best answer selection in community q &a services. In: IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technologies (2009). WI-IAT’09. vol. 1, pp. 116–123. IEEE (2009)
Liu, M., Liu, Y., Yang, Q.: Predicting best answerers for new questions in community question answering. In: International Conference on Web-Age Information Management, pp. 127–138. Springer (2010)
Zhou, J., Wang, S., Bezemer, C.-P., Hassan, A.E.: Bounties on technical q &a sites: a case study of stack overflow bounties. Empir. Softw. Eng. 25(1), 139–177 (2020)
Article Google Scholar
askubuntu.com, Stackexchange official website. [online] http://askubuntu.com/help/whats-reputation. Accessed 21 Feb 2017. (2017)
Morán-Fernández, L., Bolón-Canedo, V., Alonso-Betanzos, A.: On the use of different base classifiers in multiclass problems. Progr. Artif. Intell. 6(4), 315–323 (2017)
Article Google Scholar
Singh, J.P., Irani, S., Rana, N.P., Dwivedi, Y.K., Saumya, S., Roy, P.K.: Predicting the “helpfulness" of online consumer reviews. J. Bus. Res. 70, 346–355 (2017)
Article Google Scholar
Feng, L., Jansche, M., Huenerfauth, M., Elhadad, N.: A comparison of features for automatic readability assessment. In: Proceedings of the 23rd International Conference on Computational Linguistics: Posters, pp. 276–284. Association for Computational Linguistics (2010)
Roy, P.K., Singh, J.P., Nag, A.: Finding active expert users for question routing in community question answering sites. In: International Conference on Machine Learning and Data Mining in Pattern Recognition, pp. 440–451. Springer (2018)
Kowsari, K., Jafari Meimandi, K., Heidarysafa, M., Mendu, S., Barnes, L., Brown, D.: Text classification algorithms: a survey. Information 10(4), 150 (2019)
Article Google Scholar
Roy, P.K., Singh, J.P., Banerjee, S.: Deep learning to filter SMS spam. Futur. Gener. Comput. Syst. 102, 524–533 (2020)
Article Google Scholar
Natekin, A., Knoll, A.: Gradient boosting machines, a tutorial. Front. Neurorobot. 7, 21 (2013)
Article Google Scholar
Ho, T.K.: Random decision forests. In: Proceedings of 3rd international conference on document analysis and recognition, vol. 1, pp. 278–282. IEEE (1995)
Krawczyk, B.: Learning from imbalanced data: open challenges and future directions. Progr. Artif. Intell. 5(4), 221–232 (2016)
Article Google Scholar
Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: Smote: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)
Article MATH Google Scholar
Burel, G., He, Y., Alani, H.: Automatic identification of best answers in online enquiry communities. In: Extended Semantic Web Conference, pp. 514–529. Springer (2012)
Suggu, S.P., Goutham, K.N., Chinnakotla, M.K., Shrivastava, M.: Deep feature fusion network for answer quality prediction in community question answering. arXiv preprint arXiv:1606.07103, pp. 1–7 (2016)

Download references

Funding

No funding was received for this research.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Indian Institute of Information Technology, Surat, India
Pradeep Kumar Roy

Authors

Pradeep Kumar Roy
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pradeep Kumar Roy.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Roy, P.K. Predicting answer acceptability for question-answering system. Int J Digit Libr (2023). https://doi.org/10.1007/s00799-023-00357-2

Download citation

Received: 27 July 2022
Revised: 05 March 2023
Accepted: 16 March 2023
Published: 05 May 2023
DOI: https://doi.org/10.1007/s00799-023-00357-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Predicting answer acceptability for question-answering system

Abstract

Access this article

Similar content being viewed by others

Enhanced Question Answering System with Trustworthy Answers

Exploiting Content Quality and Question Difficulty in CQA Reputation Systems

Finding and Ranking High-Quality Answers in Community Question Answering Sites

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Predicting answer acceptability for question-answering system

Abstract

Access this article

Similar content being viewed by others

Enhanced Question Answering System with Trustworthy Answers

Exploiting Content Quality and Question Difficulty in CQA Reputation Systems

Finding and Ranking High-Quality Answers in Community Question Answering Sites

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation