Skip to main content
Log in

License usage and changes: a large-scale study on gitHub

  • Published:
Empirical Software Engineering Aims and scope Submit manuscript

Abstract

Open source software licenses determine, from a legal point of view, under which conditions software can be integrated and redistributed. The reason why developers of a project adopt (or change) a license may depend on various factors, e.g., the need for ensuring compatibility with certain third-party components, the perspective towards redistribution or commercialization of the software, or the need for protecting against somebody else’s commercial usage of the software. This paper reports a large empirical study aimed at quantitatively and qualitatively investigating when and why developers adopt or change software licenses. Specifically, we first identify license changes in 1,731,828 commits, representing the entire history of 16,221 Java projects hosted on GitHub. Then, to understand the rationale of license changes, we perform a qualitative analysis on 1,160 projects written in seven different programming languages, namely C, C++, C#, Java, Javascript, Python, and Ruby—following an open coding approach inspired by grounded theory—on commit messages and issue tracker discussions concerning licensing topics, and whenever possible, try to build traceability links between discussions and changes. On one hand, our results highlight how, in different contexts, license adoption or changes can be triggered by various reasons. On the other hand, the results also highlight a lack of traceability of when and why licensing changes are made. This can be a major concern, because a change in the license of a system can negatively impact those that reuse it. In conclusion, results of the study trigger the need for better tool support in guiding developers in choosing/changing licenses and in keeping track of the rationale of license changes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Notes

  1. http://sourceforge.net

  2. https://github.com

  3. https://www.apache.org

  4. http://www.gnu.org

  5. http://www.eclipse.org/

  6. https://www.ibm.com/developerworks/community/blogs/6e6f6d1b-95c3-46df-8a26-b7efd8ee4b57/ entry/why_big_companies_are_embracing_open_source119?lang=en

  7. https://www.blackducksoftware.com/future-of-open-source

  8. We looked for the target keywords only in the issue titles, because we found that including the issue descriptions in the search generates a considerable number of false positives.

  9. http://gplv3.fsf.org/wiki/index.php/Compatible_licenses

  10. http://www.gnu.org/licenses/old-licenses/gpl-2.0-faq.html#CanIUseGPLToolsForNF

  11. http://choosealicense.com

References

Download references

Acknowledgments

This work is supported in part by NSF CAREER CCF-1253837 grant. Massimiliano Di Penta is partially supported by the Markos project, funded by the European Commission under Contract Number FP7-317743. Any opinions, findings, and conclusions expressed herein are the authors’ and do not necessarily reflect those of the sponsors.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Christopher Vendome.

Additional information

Communicated by: Lin Tan

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Vendome, C., Bavota, G., Penta, M.D. et al. License usage and changes: a large-scale study on gitHub. Empir Software Eng 22, 1537–1577 (2017). https://doi.org/10.1007/s10664-016-9438-4

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10664-016-9438-4

Keywords

Navigation