Abstract
Past and recent studies have shown that design smells, which are poor solutions to recurring design problems, make object-oriented systems difficult to maintain and negatively impact class change- and fault-proneness. More recently, lexical smells have been introduced to capture recurring poor practices in the naming, documentation, and choice of identifiers during the implementation of an entity. Although recent studies show that developers perceive lexical smells as impairing program understanding, no study has evaluated the relationship between lexical smells and software quality, or their interaction with design smells. In this paper, we detect 29 smells, consisting of 13 design smells and 16 lexical smells, in 30 releases of three projects: ANT, ArgoUML, and Hibernate. We analyze to what extent classes containing lexical smells have higher (or lower) odds of changing, or of being subject to fault fixing, than classes containing design smells. Our results bring empirical evidence that lexical smells can, in some cases, make classes with design smells more fault-prone. In addition, we empirically demonstrate that classes containing only design smells are more change- and fault-prone than classes containing only lexical smells.
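The abstract's central measure is the odds of a class changing (or being fixed) given the presence of smells. As a minimal illustration of that kind of analysis, the sketch below computes an odds ratio from a 2x2 contingency table of smelly vs. clean classes; all counts are invented for illustration and are not taken from the study's data.

```python
# Hedged sketch: quantifying "higher (or lower) odds to change" with an
# odds ratio (OR) over a 2x2 contingency table. OR > 1 means smelly
# classes have higher odds of changing than clean ones.

def odds_ratio(a, b, c, d):
    """OR for the table [[a, b], [c, d]]:
    a = smelly & changed,   b = smelly & unchanged
    c = clean  & changed,   d = clean  & unchanged
    """
    return (a * d) / (b * c)

# Hypothetical counts for one release snapshot (illustrative only):
smelly_changed, smelly_unchanged = 40, 60
clean_changed, clean_unchanged = 20, 180

or_value = odds_ratio(smelly_changed, smelly_unchanged,
                      clean_changed, clean_unchanged)
print(f"odds ratio = {or_value:.2f}")  # 40*180 / (60*20) = 6.00
```

In an actual empirical setup, such tables would typically be accompanied by a significance test (e.g., Fisher's exact test) before drawing conclusions about change- or fault-proneness.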
Guerrouj, L., Kermansaravi, Z., Arnaoudova, V. et al. Investigating the relation between lexical smells and change- and fault-proneness: an empirical study. Software Qual J 25, 641–670 (2017). https://doi.org/10.1007/s11219-016-9318-6