Abstract
People around the world often rely on online community forums for answers to their queries. These forums have become hugely popular in the last decade, leading to a spurt in the number of users and questions. For a better user experience, quality monitoring is essential. However, manual moderation of millions of questions is infeasible. Prior works mostly rely on handcrafted features which is ineffective or use community feedback as part of learning which makes them unsuitable for monitoring during question creation. In this work, we use recent deep learning techniques to assess the quality of questions in online community forums at creation time. We evaluate our model on the StackOverflow dataset that contains 60000 questions across three qualities. Our model achieves an F1 score of 0.92 on this dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Agichtein E, Castillo C, Donato D, Gionis A, Mishne G (2008) Finding high-quality content in social media. In: Proceedings of the 2008 international conference on web search and data mining, pp 183–194
Movshovitz-Attias D, Movshovitz-Attias Y, Steenkiste P, Faloutsos C (2013) Analysis of the reputation system and user contributions on a question answering website: Stackoverflow. In: International conference on advances in social networks analysis and mining, pp 886–893
Correa D, Sureka A (2014) Characterization and modeling of deleted questions on stack overflow
Maron ME, Kuhns JL (1960) On relevance, probabilistic indexing and information retrieval. J ACM (JACM) 7(3):216–244
Li YH, Jain AK (1998) Classification of text documents. Comput J 41(8):537–546
Weiss S, Kasif S, Brill E (1996) Text classification in use net newsgroups: a progress report. In: Proceedings of the AAAI spring symposium on machine learning in information access, pp 125–127
Hull D, Pedersen J, Schutze H (1996) Document routing as statistical classification. In: AAAI spring symposium on machine learning in information access, vol 12, pp 49–54
Lewis DD, Ringuette M (1994) A comparison of two learning algorithms for text categorization. In: Third annual symposium on document analysis and information retrieval, vol 33, pp 81–93
Schutze H, Hull DA, Pedersen JO (1995) A comparison of classifiers and document representations for the routing problem. In: Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval, pp 229–237
Zhang D, Lee WS (2003) Question classification using support vector machines. In: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval, pp 26–32
Chen T, Xu R, He Y, Wang X (2017) Improving sentiment analysis via sentence type classification using BiLSTM-CRF and Cnn. Expert Syst Appl 72:221–230
Ranjan MNM, Ghorpade Y, Kanthale G, Ghorpade A, Dubey A (2017) Document classification using LSTM neural network. J Data Mining Manage 2(2):1–9
Zhou C, Sun C, Liu Z, Lau F (2015) A C-LSTM neural network for text classification. arXiv preprint arXiv:1511.08630
Srba I, Bielikova M (2016) A comprehensive survey and classification of approaches for community question answering. ACM Trans Web (TWEB) 10(3):1–63
Shah C, Kitzie V, Choi E (2014) Questioning the question–addressing the answer-ability of questions in community question-answering. In: 2014 47th Hawaii international conference on system sciences, pp 1386–1395. IEEE
Ponzanelli L, Mocci A, Bacchelli A, Lanza M, Fullerton D (2014) Improving low quality stack overflow post detection. In: 2014 IEEE international conference on software maintenance and evolution, pp 541–544. IEEE
Tausczik YR, Pennebaker JW (2011) Predicting the perceived quality of online mathematics contributions from users’ reputations. In: Proceedings of the SIGCHI conference on human factors in computing systems, pp 1885–1888
Ravi S, Pang B, Rastogi V, Kumar R (2014) Great question! Question quality in community Q&A
Pennington J, Socher R, Manning CD (2014) Glove: global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp 1532–1543
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Rithish, H., Deepak, G., Santhanavijayan, A. (2021). Automated Assessment of Question Quality on Online Community Forums. In: Motahhir, S., Bossoufi, B. (eds) Digital Technologies and Applications. ICDTA 2021. Lecture Notes in Networks and Systems, vol 211. Springer, Cham. https://doi.org/10.1007/978-3-030-73882-2_72
Download citation
DOI: https://doi.org/10.1007/978-3-030-73882-2_72
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-73881-5
Online ISBN: 978-3-030-73882-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)