Abstract
A software developer joining a large software project faces a steep learning curve before they are able to make real contributions. One challenge is finding the subject matter experts who can answer questions about a specific area of the software or to review changes. This is especially true of large projects with many modules and a large number of authors. In this paper, we describe a method to model a software project as a network using information mined from the project’s version control repository, and demonstrate how network analysis techniques can be used to identify the key authors and subject matter experts. We investigate metrics that can be gathered using network analysis, such as which groups of authors typically work together, and how closely knit the developers are on a project. We analyze several specific projects to demonstrate the applicability of these techniques and several hundred projects to show general trends.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Collins-Sussman, B., Fitzpatrick, B.W., Pilato, C.M.: Version control with subversion (2007), http://svnbook.red-bean.com/en/1.4/index.html
Howison, J., Crowston, K.: The perils and pitfalls of mining sourceforge. In: Proceedings of the International Workshop on Mining Software Repositories (MSR 2004), pp. 7–11 (2004)
Huang, S.K., Liu, K.M.: Mining version histories to verify the learning process of legitimate peripheral participants. In: Proceedings of the 2005 International Workshop on Mining Software Repositories, MSR 2005, pp. 1–5. ACM, New York (2005)
The igraph website (2010), http://igraph.sourceforge.net/
Kagdi, H., Yusuf, S., Maletic, J.I.: Mining sequences of changed-files from version histories. In: Proceedings of the 2006 International Workshop on Mining Software Repositories, MSR 2006, pp. 47–53. ACM, New York (2006)
Linstead, E., Rigor, P., Bajracharya, S., Lopes, C., Baldi, P.: Mining eclipse developer contributions via author-topic models. In: Fourth International Workshop on Mining Software Repositories, ICSE Workshops, MSR 2007, p. 30 (2007), doi:10.1109/MSR.2007.20
Lopez-Fernandez, L., Robles, G., Gonzalez-Barahona, J.M.: Applying social network analysis to the information in cvs repositories. In: Proceedings of 26th International Conference on Software Engineering, ICSE 2004 (2004), doi:10.1109/ICSE.2004.1317529
Newman, M.E.J.: Coauthorship networks and patterns of scientific collaboration. Proceedings of the National Academy of Sciences of the United States of America 101(suppl. 1), 5200–5205 (2004), http://www.pnas.org/content/101/suppl.1/5200.abstract , doi:10.1073/pnas.0307545100
Newman, M.E.J.: Networks an Introduction. Oxford University Press, New York (2010)
Ordonez, M., Haddad, H.: The state of metrics in software industry. In: Fifth International Conference on Information Technology: New Generations, ITNG 2008, pp. 453–458 (2008), doi:10.1109/ITNG.2008.106
Reichardt, J., Bornholdt, S.: Statistical mechanics of community detection. Phys. Rev. EÂ 74(1), 016,110 (2006), doi:10.1103/PhysRevE.74.016110
Sommerville, I.: Software Engineering, 8th edn. Addison-Wesley, Harlow (2007)
Umarji, M., Shull, F.: Measuring developers: Aligning perspectives and other best practices. IEEE Software 26(6), 92–94 (2009), doi:10.1109/MS.2009.180
Voinea, L., Telea, A.: Mining software repositories with cvsgrab. In: Proceedings of the 2006 International Workshop on Mining Software Repositories, MSR 2006, pp. 167–168. ACM, New York (2006)
Voinea, L., Telea, A.: An open framework for cvs repository querying, analysis and visualization. In: Proceedings of the 2006 International Workshop on Mining Software Repositories, MSR 2006, pp. 33–39. ACM, New York (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Dittrich, A., Gunes, M.H., Dascalu, S. (2013). Network Analysis of Software Repositories: Identifying Subject Matter Experts. In: Menezes, R., Evsukoff, A., González, M. (eds) Complex Networks. Studies in Computational Intelligence, vol 424. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30287-9_20
Download citation
DOI: https://doi.org/10.1007/978-3-642-30287-9_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-30286-2
Online ISBN: 978-3-642-30287-9
eBook Packages: EngineeringEngineering (R0)