Skip to main content

Automated Assessment of Question Quality on Online Community Forums

  • Conference paper
  • First Online:
Digital Technologies and Applications (ICDTA 2021)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 211))

Included in the following conference series:

Abstract

People around the world often rely on online community forums for answers to their queries. These forums have become hugely popular in the last decade, leading to a spurt in the number of users and questions. For a better user experience, quality monitoring is essential. However, manual moderation of millions of questions is infeasible. Prior works mostly rely on handcrafted features which is ineffective or use community feedback as part of learning which makes them unsuitable for monitoring during question creation. In this work, we use recent deep learning techniques to assess the quality of questions in online community forums at creation time. We evaluate our model on the StackOverflow dataset that contains 60000 questions across three qualities. Our model achieves an F1 score of 0.92 on this dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    http://stackoverflow.com/help/how-to-ask.

References

  1. Agichtein E, Castillo C, Donato D, Gionis A, Mishne G (2008) Finding high-quality content in social media. In: Proceedings of the 2008 international conference on web search and data mining, pp 183–194

    Google Scholar 

  2. Movshovitz-Attias D, Movshovitz-Attias Y, Steenkiste P, Faloutsos C (2013) Analysis of the reputation system and user contributions on a question answering website: Stackoverflow. In: International conference on advances in social networks analysis and mining, pp 886–893

    Google Scholar 

  3. Correa D, Sureka A (2014) Characterization and modeling of deleted questions on stack overflow

    Google Scholar 

  4. Maron ME, Kuhns JL (1960) On relevance, probabilistic indexing and information retrieval. J ACM (JACM) 7(3):216–244

    Article  Google Scholar 

  5. Li YH, Jain AK (1998) Classification of text documents. Comput J 41(8):537–546

    Article  Google Scholar 

  6. Weiss S, Kasif S, Brill E (1996) Text classification in use net newsgroups: a progress report. In: Proceedings of the AAAI spring symposium on machine learning in information access, pp 125–127

    Google Scholar 

  7. Hull D, Pedersen J, Schutze H (1996) Document routing as statistical classification. In: AAAI spring symposium on machine learning in information access, vol 12, pp 49–54

    Google Scholar 

  8. Lewis DD, Ringuette M (1994) A comparison of two learning algorithms for text categorization. In: Third annual symposium on document analysis and information retrieval, vol 33, pp 81–93

    Google Scholar 

  9. Schutze H, Hull DA, Pedersen JO (1995) A comparison of classifiers and document representations for the routing problem. In: Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval, pp 229–237

    Google Scholar 

  10. Zhang D, Lee WS (2003) Question classification using support vector machines. In: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval, pp 26–32

    Google Scholar 

  11. Chen T, Xu R, He Y, Wang X (2017) Improving sentiment analysis via sentence type classification using BiLSTM-CRF and Cnn. Expert Syst Appl 72:221–230

    Article  Google Scholar 

  12. Ranjan MNM, Ghorpade Y, Kanthale G, Ghorpade A, Dubey A (2017) Document classification using LSTM neural network. J Data Mining Manage 2(2):1–9

    Google Scholar 

  13. Zhou C, Sun C, Liu Z, Lau F (2015) A C-LSTM neural network for text classification. arXiv preprint arXiv:1511.08630

  14. Srba I, Bielikova M (2016) A comprehensive survey and classification of approaches for community question answering. ACM Trans Web (TWEB) 10(3):1–63

    Google Scholar 

  15. Shah C, Kitzie V, Choi E (2014) Questioning the question–addressing the answer-ability of questions in community question-answering. In: 2014 47th Hawaii international conference on system sciences, pp 1386–1395. IEEE

    Google Scholar 

  16. Ponzanelli L, Mocci A, Bacchelli A, Lanza M, Fullerton D (2014) Improving low quality stack overflow post detection. In: 2014 IEEE international conference on software maintenance and evolution, pp 541–544. IEEE

    Google Scholar 

  17. Tausczik YR, Pennebaker JW (2011) Predicting the perceived quality of online mathematics contributions from users’ reputations. In: Proceedings of the SIGCHI conference on human factors in computing systems, pp 1885–1888

    Google Scholar 

  18. Ravi S, Pang B, Rastogi V, Kumar R (2014) Great question! Question quality in community Q&A

    Google Scholar 

  19. Pennington J, Socher R, Manning CD (2014) Glove: global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp 1532–1543

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Rithish, H., Deepak, G., Santhanavijayan, A. (2021). Automated Assessment of Question Quality on Online Community Forums. In: Motahhir, S., Bossoufi, B. (eds) Digital Technologies and Applications. ICDTA 2021. Lecture Notes in Networks and Systems, vol 211. Springer, Cham. https://doi.org/10.1007/978-3-030-73882-2_72

Download citation

Publish with us

Policies and ethics