Skip to main content

Quality Prediction of Newly Proposed Questions in CQA by Leveraging Weakly Supervised Learning

  • Conference paper
  • First Online:
Advanced Data Mining and Applications (ADMA 2017)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10604))

Included in the following conference series:

Abstract

Community Question Answering (CQA) websites provide a platform to ask questions and share their knowledge. Good questions in CQA websites can improve user experiences and attract more users. To the best of our knowledge, a few researches have been studied on the question quality, especially the quality of newly proposed questions. In this work, we consider that a good question is popular and answerable in CQA websites. The community features of questions are extracted automatically and utilized to acquire massive good questions. The text features and asker features of good questions are utilized to train our weakly supervised model based on Convolutional Neural Network to recognize good newly proposed questions. We conduct extensive experiments on the publicly available dataset from StackExchange and our best result achieves F1-score at 91.5%, outperforming the baselines.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://stackexchange.com/.

  2. 2.

    answers.yahoo.com.

  3. 3.

    www.quora.com.

  4. 4.

    www.zhihu.com.

  5. 5.

    https://www.quora.com/profile/Yoshua-Bengio.

  6. 6.

    https://stackoverflow.com/questions/2334712/how-do-i-update-from-a-select-in-sql-server.

  7. 7.

    https://movies.stackexchange.com/questions/72110/simpsons-episode-with-1000.

  8. 8.

    stackoverflow.com/questions/231767/what-does-the-yield-keyword-do-in-python.

  9. 9.

    stackoverflow.com/questions/37302912/command-not-working-in-an-interpreter-i- made.

  10. 10.

    https://stackoverflow.com/users/2901002/jezrael?tab=profile.

  11. 11.

    https://stackoverflw.com/users/5198106/m654?tab=profile.

  12. 12.

    https://archive.org/details/stackexchange.

  13. 13.

    https://android.stackexchange.com/.

  14. 14.

    https://movies.stackexchange.com/.

  15. 15.

    https://music.stackexchange.com/.

  16. 16.

    https://sports.stackexchange.com/.

References

  1. Jeon, J., Croft, W.B., Lee, J.H., Park, S.: A framework to predict the quality of answers with non-textual features. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 228–235. ACM (2006)

    Google Scholar 

  2. Shah, C., Pomerantz, J.: Evaluating and predicting answer quality in community QA. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 411–418. ACM (2010)

    Google Scholar 

  3. Bian, J., Liu, Y., Zhou, D., Agichtein, E., Zha, H.: Learning to recognize reliable users and content in social media with coupled mutual reinforcement. In: Proceedings of the 18th International Conference on World Wide Web, pp. 51–60. ACM (2009)

    Google Scholar 

  4. Ko, J., Nyberg, E., Si, L.: A probabilistic graphical model for joint answer ranking in question answering. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 343–350. ACM (2007)

    Google Scholar 

  5. Blooma, M.J., Chua, A.Y., Goh, D.H.-L.: A predictive framework for retrieving the best answer. In: Proceedings of the 2008 ACM Symposium on Applied Computing, pp. 1107–1111. ACM (2008)

    Google Scholar 

  6. Harper, F.M., Raban, D., Rafaeli, S., Konstan, J.A.: Predictors of answer quality in online Q&A sites. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 865–874. ACM (2008)

    Google Scholar 

  7. Suryanto, M.A., Lim, E.P., Sun, A., Chiang, R.H.: Quality-aware collaborative question answering: methods and evaluation. In: Proceedings of the Second ACM International Conference on Web Search and Data Mining, pp. 142–151. ACM (2009)

    Google Scholar 

  8. Le, L.T., Shah, C., Choi, E.: Evaluating the quality of educational answers in community question-answering. In: 2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL), pp. 129–138. IEEE (2016)

    Google Scholar 

  9. Liu, H., Huang, J., An, C., Fu, X.: Answer quality prediction joint textual and non-textual features. In: 13th Web Information Systems and Applications Conference, 2016, pp. 144–148. IEEE (2016)

    Google Scholar 

  10. Suggu, S.P., Goutham, K.N., Chinnakotla, M.K., Shrivastava, M.: Deep feature fusion network for answer quality prediction in community question answering. arXiv preprint arXiv:1606.07103 (2016)

  11. Luo, M., Nie, F., Chang, X., Yang, Y., Hauptmann, A.G., Zheng, Q.: Adaptive unsupervised feature selection with structure regularization. IEEE Trans. Neural Netw. Learn. Syst. PP(99), 1–13 (2017)

    Google Scholar 

  12. Luo, M., Chang, X., Yang, Y., Nie, L., Hauptmann, A.G., Zheng, Q.: Simple to complex cross-modal learning to rank, arXiv preprint arXiv:1702.01229 (2017)

  13. Li, B., Jin, T., Lyu, M.R., King, I., Mak, B.: Analyzing and predicting question quality in community question answering services. In: Proceedings of the 21st International Conference on World Wide Web, pp. 775–782. ACM (2012)

    Google Scholar 

  14. Agichtein, E., Castillo, C., Donato, D., Gionis, A., Mishne, G.: Finding high-quality content in social media. In: Proceedings of the 2008 International Conference on Web Search and Data Mining, pp. 183–194. ACM (2008)

    Google Scholar 

  15. Ravi, S., Pang, B., Rastogi, V., Kumar, R.: Great question! question quality in community Q&A. In: ICWSM, vol. 14, pp. 426–435 (2014)

    Google Scholar 

  16. Baltadzhieva, A., Chrupala, G., Angelova, G., Bontcheva, K., Mitkov, R.: Predicting the quality of questions on stackoverflow. In: RANLP, 2015, pp. 32–40 (2015)

    Google Scholar 

Download references

Acknowledgments

This work is sponsored by The Fundamental Theory and Applications of Big Data with Knowledge Engineering under the National Key Research and Development Program of China with grant number 2016YFB1000903, Ministry of Education Innovation Research Team No. IRT 17R86, Innovative Research Group of the National Natural Science Foundation of China (61721002); National Science Foundation of China under Grant Nos. 61672419, 61532004, 61532015, the MOE Research Program for Online Education under Grant No. 2016YB166.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuanhao Zheng .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Zheng, Y. et al. (2017). Quality Prediction of Newly Proposed Questions in CQA by Leveraging Weakly Supervised Learning. In: Cong, G., Peng, WC., Zhang, W., Li, C., Sun, A. (eds) Advanced Data Mining and Applications. ADMA 2017. Lecture Notes in Computer Science(), vol 10604. Springer, Cham. https://doi.org/10.1007/978-3-319-69179-4_46

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-69179-4_46

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-69178-7

  • Online ISBN: 978-3-319-69179-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics