Measuring the Effectiveness of Software Code Review Comments

Hossain, Syeda Sumbul; Arafat, Yeasir; Hossain, Md. Ekram; Arman, Md. Shohel; Islam, Anik

doi:10.1007/978-981-15-6634-9_23

Syeda Sumbul Hossain¹²,
Yeasir Arafat¹²,
Md. Ekram Hossain¹²,
Md. Shohel Arman¹² &
…
Anik Islam¹²

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1244))

Included in the following conference series:

International Conference on Advances in Computing and Data Sciences

1419 Accesses
5 Citations

Abstract

Code reviewing becomes a more popular technique to find out early defects in source code. Nowadays practitioners are going for peer reviewing their codes by their co-developers to make the source code clean. Working on a distributed or dispersed team, code review is mandatory to check the patches to merge. Code reviewing can also be a form of validating functional and non-functional requirements. Sometimes reviewers do not put structured comments, which becomes a bottle neck to developers for solving the findings or suggestions commented by the reviewers. For making the code review participation more effective, structured and efficient review comments is mandatory. Mining the repositories of five commercialized projects, we have extracted 15223 review comments and labelled them. We have used 8 different machine learning and deep learning classifiers to train our model. Among those Stochastic Gradient Descent (SGD) technique achieves higher accuracy of 80.32%. This study will help the practitioners to build up structured and effective code review culture among global software developers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Hossain, S.S.: Challenges and mitigation strategies in reusing requirements in large-scale distributed agile software development: a survey result. In: Arai, K., Bhatia, R., Kapoor, S. (eds.) CompCom 2019. AISC, vol. 998, pp. 920–935. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-22868-2_63
Chapter Google Scholar
Bacchelli, A., Bird, C.: Expectations, outcomes, and challenges of modern code review. In: Proceedings of the 2013 International Conference on Software Engineering, pp. 712–721. IEEE Press (2013)
Google Scholar
Holzmann, G.J.: SCRUB: a tool for code reviews. Innov. Syst. Softw. Eng. 6(4), 311–318 (2010). https://doi.org/10.1007/s11334-010-0136-x
Article MathSciNet Google Scholar
Ahmed, T., Bosu, A., Iqbal, A., Rahimi, S.: SentiCR: a customized sentiment analysis tool for code review interactions. In: Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering, pp. 106–111. IEEE Press (2017)
Google Scholar
Moskowitz, M., Potter, W., King, W.: Automatic computer code review tool, 26 May 2005. US Patent App. 10/769,535
Google Scholar
Sadowski, C., Söderberg, E., Church, L., Sipko, M., Bacchelli, A.: Modern code review: a case study at Google. In: Proceedings of the International Conference on Software Engineering: Software Engineering in Practice, pp. 181–190. Association for Computing Machinery (2018)
Google Scholar
Henley, A.Z., Muçlu, K., Christakis, M., Fleming, S.D., Bird, C.: Cfar: a tool to increase communication, productivity, and review quality in collaborative code reviews. In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, pp. 1–13 (2018)
Google Scholar
Vinodhini, G., Chandrasekaran, R.M.: Sentiment analysis and opinion mining: a survey. Int. J. 2(6), 282–292 (2012)
Google Scholar
Rahman, S., Hossain, S.S., Islam, S., Chowdhury, M.I., Rafiq, F.B., Badruzzaman, K.B.M.: Context-based news headlines analysis using machine learning approach. In: Nguyen, N.T., Chbeir, R., Exposito, E., Aniorté, P., Trawiński, B. (eds.) ICCCI 2019. LNCS (LNAI), vol. 11684, pp. 167–178. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-28374-2_15
Chapter Google Scholar
Madera, M., ł Tomoń, R.: A case study on machine learning model for code review expert system in software engineering. In: 2017 Federated Conference on Computer Science and Information Systems (FedCSIS), pp. 1357–1363. IEEE (2017)
Google Scholar
Kalyan, A., Chiam, M., Sun, J., Manoharan, S.: A collaborative code review platform for GitHub. In 2016 21st International Conference on Engineering of Complex Computer Systems (ICECCS), pp. 191–196. IEEE (2016)
Google Scholar
Shi, S.-T., Li, M., Lo, D., Thung, F., Huo, X.: Automatic code review by learning the revision of source code. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 4910–4917 (2019)
Google Scholar
Lal, H., Pahwa, G.: Code review analysis of software system using machine learning techniques. In: 2017 11th International Conference on Intelligent Systems and Control (ISCO), pp. 8–13. IEEE (2017)
Google Scholar
Louis, A., Dash, S.K., Barr, E.T., Sutton, C.: Deep learning to detect redundant method comments. arXiv preprint arXiv:1806.04616 (2018)
Movshovitz-Attias, D., Cohen, W.W.: Natural language models for predicting programming comments. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), vol. 2, pp. 35–40 (2013)
Google Scholar
White, M., Tufano, M., Vendome, C., Poshyvanyk, D.: Deep learning code fragments for code clone detection. In: Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering, pp. 87–98. ACM (2016)
Google Scholar
Gupta, A., Sundaresan, N.: Intelligent code reviews using deep learning (2018)
Google Scholar
Li, H.-Y., et al.: DeepReview: automatic code review using deep multi-instance learning. In: Yang, Q., Zhou, Z.-H., Gong, Z., Zhang, M.-L., Huang, S.-J. (eds.) PAKDD 2019. LNCS (LNAI), vol. 11440, pp. 318–330. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-16145-3_25
Chapter Google Scholar
Brown, P.F., Desouza, P.V., Mercer, R.L., Della Pietra, V.J., Lai, J.C.: Class-based n-gram models of natural language. Comput. Linguist. 18(4), 467–479 (1992)
Google Scholar
Ho, T.K., Hull, J.J., Srihari, S.N.: Decision combination in multiple classifier systems. IEEE Trans. Pattern Anal. Mach. Intell. 16(1), 66–75 (1994)
Article Google Scholar
Rennie, J.D., Shih, L., Teevan, J., Karger, D.R.: Tackling the poor assumptions of Naive Bayes text classifiers. In: Proceedings of the 20th International Conference on Machine Learning (ICML-03), pp. 616–623 (2003)
Google Scholar
Gunn, S.R., et al.: Support vector machines for classification and regression. ISIS Tech. Rep. 14(1), 5–16 (1998)
Google Scholar
McCallum, A., Nigam, K., et al.: A comparison of event models for Naive Bayes text classification. In: AAAI-98 Workshop on Learning for Text Categorization, vol. 752, pp. 41–48. Citeseer (1998)
Google Scholar
Bottou, L.: Large-scale machine learning with stochastic gradient descent. In: Lechevallier, Y., Saporta, G. (eds.) Proceedings of COMPSTAT 2010, pp. 177–186. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-7908-2604-3_16
Chapter Google Scholar
Chen, T., Guestrin, C.: XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 785–794 (2016)
Google Scholar
Stephen, I.: Perceptron-based learning algorithms. IEEE Trans. Neural Netw. 50(2), 179 (1990)
Google Scholar
Crammer, K., Dekel, O., Keshet, J., Shalev-Shwartz, S., Singer, Y.: Online passive-aggressive algorithms. J. Mach. Learn. Rese. 7(Mar), 551–585 (2006)
MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Daffodil International University, Dhaka, Bangladesh
Syeda Sumbul Hossain, Yeasir Arafat, Md. Ekram Hossain, Md. Shohel Arman & Anik Islam

Authors

Syeda Sumbul Hossain
View author publications
You can also search for this author in PubMed Google Scholar
Yeasir Arafat
View author publications
You can also search for this author in PubMed Google Scholar
Md. Ekram Hossain
View author publications
You can also search for this author in PubMed Google Scholar
Md. Shohel Arman
View author publications
You can also search for this author in PubMed Google Scholar
Anik Islam
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Syeda Sumbul Hossain .

Editor information

Editors and Affiliations

University of KwaZulu-Natal, Durban, South Africa
Mayank Singh
Jaypee University of Information Technology, Waknaghat, Himachal Pradesh, India
P. K. Gupta
Jaypee University of Engineering and Technology, Guna, Madhya Pradesh, India
Vipin Tyagi
Institute of Information Theory and Automation, Prague, Czech Republic
Jan Flusser
University of Ottawa, Ottawa, ON, Canada
Tuncer Ören
University of Malta, Valletta, Malta
Gianluca Valentino

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hossain, S.S., Arafat, Y., Hossain, M.E., Arman, M.S., Islam, A. (2020). Measuring the Effectiveness of Software Code Review Comments. In: Singh, M., Gupta, P., Tyagi, V., Flusser, J., Ören, T., Valentino, G. (eds) Advances in Computing and Data Sciences. ICACDS 2020. Communications in Computer and Information Science, vol 1244. Springer, Singapore. https://doi.org/10.1007/978-981-15-6634-9_23

Download citation

DOI: https://doi.org/10.1007/978-981-15-6634-9_23
Published: 18 July 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-6633-2
Online ISBN: 978-981-15-6634-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics