Skip to main content

Automatic Decomposition of Java Open Source Pull Requests: A Replication Study

  • 1199 Accesses

Part of the Lecture Notes in Computer Science book series (LNTCS,volume 10706)

Abstract

The presence of large changesets containing several independent modifications (e.g. bug fixes, refactorings, features) can negatively affect the efficacy of code review. To cope with this problem, Barnett et al. developed ClusterChanges — a lightweight static analysis technique for decomposing changesets in different partitions that can be reviewed independently. They have found that ClusterChanges can indeed decompose such changesets and that developers agree with their decomposition. However, the authors’ restricted their analysis to software that is: (i) closed source, (ii) written in C# and (iii) developed by a single organization. To address this threat to validity, we implemented JClusterChanges, a free and open source (FOSS) implementation of ClusterChanges for Java software, and replicated the original Barnett et al. study using changesets from Java open source projects hosted on GitHub. We found that open source changesets are similar to the changesets of the original study. Thus, our research confirms that the problem is relevant to other environments and provides a FOSS implementation of ClusterChanges that not only shows the feasibility of the technique in other contexts but can also be used to help future research.

Keywords

  • Modern code review
  • Changeset decomposition

This is a preview of subscription content, access via your institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • DOI: 10.1007/978-3-319-73117-9_18
  • Chapter length: 14 pages
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
eBook
USD   89.00
Price excludes VAT (USA)
  • ISBN: 978-3-319-73117-9
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Softcover Book
USD   119.99
Price excludes VAT (USA)
Fig. 1.
Fig. 2.
Fig. 3.
Fig. 4.
Fig. 5.

References

  1. Bacchelli, A., Bird, C.: Expectations, outcomes, and challenges of modern code review. In: Proceedings of the 2013 International Conference on Software Engineering, pp. 712–721. IEEE Press (2013)

    Google Scholar 

  2. Ciolkowski, M., Laitenberger, O., Biffl, S.: Software reviews: the state of the practice. IEEE Softw. 20(6), 46–51 (2003)

    CrossRef  Google Scholar 

  3. Harjumaa, L., Tervonen, I., Huttunen, A.: Peer reviews in real life - motivators and demotivators, pp. 29–36. IEEE (2005)

    Google Scholar 

  4. Shull, F., Seaman, C.: Inspecting the history of inspections: an example of evidence-based technology diffusion. IEEE Softw. 25(1), 88–90 (2008)

    CrossRef  Google Scholar 

  5. Gerrit team: Gerrit. https://www.gerritcodereview.com/. Accessed 21 July 2016

  6. GitHub: Github’s features. https://github.com/features. Accessed 21 July 2016

  7. Phacility Inc: Phabricator. https://www.phacility.com/phabricator/. Accessed 21 July 2016

  8. Tao, Y., Dang, Y., Xie, T., Zhang, D., Kim, S.: How do software engineers understand code changes? an exploratory study in industry. In: Proceedings of the ACM SIGSOFT 20th International Symposium on the Foundations of Software Engineering, Article no. 51. ACM (2012)

    Google Scholar 

  9. Herzig, K., Zeller, A.: The impact of tangled code changes. In: 2013 10th IEEE Working Conference on Mining Software Repositories (MSR), pp. 121–130. IEEE (2013)

    Google Scholar 

  10. Tao, Y., Kim, S.: Partitioning composite code changes to facilitate code review. In: Proceedings of the 2015 IEEE/ACM 12th Working Conference on Mining Software Repositories, pp. 180–190. IEEE, May 2015

    Google Scholar 

  11. Barnett, M., Bird, C., Brunet, J., Lahiri, S.K.: Helping developers help themselves: automatic decomposition of code review changesets. In: Proceedings of the 37th International Conference on Software Engineering. IEEE (2015)

    Google Scholar 

  12. Dias, M., Ducasse, S., Cassou, D., Uquillas-Gmez, V.: Do tools support code integration? a survey. J. Object Technol. 16(2), 1–20 (2016)

    CrossRef  Google Scholar 

  13. Mockus, A., Fielding, R.T., Herbsleb, J.D.: Two case studies of open source software development: Apache and Mozilla. ACM Trans. Softw. Eng. Methodol. (TOSEM) 11(3), 309–346 (2002)

    CrossRef  Google Scholar 

  14. Rigby, P.C., German, D.M., Cowen, L., Storey, M.A.: Peer review on open-source software projects: parameters, statistical models, and theory. ACM Trans. Softw. Eng. Methodol. 23(4), 1–33 (2014)

    CrossRef  Google Scholar 

  15. Basili, V.R., Shull, F., Lanubile, F.: Building knowledge through families of experiments. IEEE Trans. Softw. Eng. 25(4), 456–473 (1999)

    CrossRef  Google Scholar 

  16. Black Duck Software Inc: Open Hub - comparison between C# and Java with regard to monthly commits. https://www.openhub.net/languages/compare?language_name%5B%5D=csharp&language_name%5B%5D=java&language_name%5B%5D=-1&language_name%5B%5D=-1&measure=commits. Accessed 01 Aug 2016

  17. The Eclipse Foundation: Eclipse Java development tools (JDT). https://www.eclipse.org/jdt/. Accessed 26 Nov 2015

  18. Gómez, O.S., Juristo, N., Vegas, S.: Understanding replication of experiments in software engineering: a classification. Inf. Softw. Technol. 56(8), 1033–1048 (2014)

    CrossRef  Google Scholar 

  19. GitHub: About GitHub. https://github.com/about. Accessed 01 Aug 2016

  20. Gousios, G., Pinzger, M., van Deursen, A.: An exploratory study of the pull-based software development model. In: Proceedings of the 36th International Conference on Software Engineering, pp. 345–355. ACM (2014)

    Google Scholar 

  21. Kalliamvakou, E., Gousios, G., Blincoe, K., Singer, L., German, D.M., Damian, D.: The promises and perils of mining GitHub. In: Proceedings of the 11th Working Conference on Mining Software Repositories, pp. 92–101. ACM (2014)

    Google Scholar 

  22. Bessey, A., Engler, D., Block, K., Chelf, B., Chou, A., Fulton, B., Hallem, S., Henri-Gros, C., Kamsky, A., McPeak, S.: A few billion lines of code later: using static analysis to find bugs in the real world. Commun. ACM 53(2), 66–75 (2010)

    CrossRef  Google Scholar 

  23. Dias, M., Bacchelli, A., Gousios, G., Cassou, D., Ducasse, S.: Untangling fine-grained code changes. In: 2015 IEEE 22nd International Conference on Software Analysis, Evolution and Reengineering (SANER), pp. 341–350. IEEE (2015)

    Google Scholar 

  24. Kirinuki, H., Higo, Y., Hotta, K., Kusumoto, S.: Hey! Are you committing tangled changes? In: Proceedings of the 22nd International Conference on Program Comprehension, ICPC 2014, pp. 262–265. ACM, New York (2014)

    Google Scholar 

  25. Nguyen, H.A., Nguyen, A.T., Nguyen, T.N.: Filtering noise in mixed-purpose fixing commits to improve defect prediction and localization. In: 2013 IEEE 24th International Symposium on Software Reliability Engineering (ISSRE), pp. 138–147. IEEE (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Victor da C. Luna Freire .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Verify currency and authenticity via CrossMark

Cite this paper

Luna Freire, V.d.C., Brunet, J., de Figueiredo, J.C.A. (2018). Automatic Decomposition of Java Open Source Pull Requests: A Replication Study. In: Tjoa, A., Bellatreche, L., Biffl, S., van Leeuwen, J., Wiedermann, J. (eds) SOFSEM 2018: Theory and Practice of Computer Science. SOFSEM 2018. Lecture Notes in Computer Science(), vol 10706. Edizioni della Normale, Cham. https://doi.org/10.1007/978-3-319-73117-9_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-73117-9_18

  • Published:

  • Publisher Name: Edizioni della Normale, Cham

  • Print ISBN: 978-3-319-73116-2

  • Online ISBN: 978-3-319-73117-9

  • eBook Packages: Computer ScienceComputer Science (R0)