Network Analysis of Software Repositories: Identifying Subject Matter Experts

Part of the book series: Studies in Computational Intelligence ((SCI,volume 424))

Abstract

A software developer joining a large software project faces a steep learning curve before being able to make real contributions. One challenge is finding the subject matter experts who can answer questions about a specific area of the software or review changes to it. This is especially difficult in large projects with many modules and many authors. In this paper, we describe a method to model a software project as a network using information mined from the project’s version control repository, and demonstrate how network analysis techniques can be used to identify the key authors and subject matter experts. We investigate metrics that can be gathered using network analysis, such as which groups of authors typically work together and how closely knit the developers on a project are. We analyze several specific projects to demonstrate the applicability of these techniques, and several hundred projects to show general trends.
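
The general idea of turning a version control history into an author network can be illustrated with a minimal Python sketch. It is not the chapter's actual method, which the abstract does not spell out. The sketch assumes that two authors are linked when they have modified the same file, that the edge weight is the number of files they share, and that commit counts under a path prefix are a rough proxy for expertise in a module. The commit log, author names, and function names are all hypothetical.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical commit log: (author, files touched). Invented for illustration.
COMMITS = [
    ("alice", ["net/socket.c", "net/tcp.c"]),
    ("bob", ["net/tcp.c"]),
    ("alice", ["net/tcp.c"]),
    ("carol", ["ui/window.c"]),
    ("bob", ["ui/window.c", "net/socket.c"]),
]


def coauthor_network(commits):
    """Link two authors if they modified the same file (assumed rule).

    Returns {(author_a, author_b): number_of_shared_files}.
    """
    authors_by_file = defaultdict(set)
    for author, files in commits:
        for path in files:
            authors_by_file[path].add(author)

    weights = defaultdict(int)
    for authors in authors_by_file.values():
        # sorted() gives each undirected edge one canonical key.
        for a, b in combinations(sorted(authors), 2):
            weights[(a, b)] += 1
    return dict(weights)


def weighted_degree(edges):
    """Sum of edge weights per author, a simple centrality measure."""
    degree = defaultdict(int)
    for (a, b), weight in edges.items():
        degree[a] += weight
        degree[b] += weight
    return dict(degree)


def commits_per_author(commits, prefix):
    """Count each author's commits that touch any file under `prefix`."""
    counts = defaultdict(int)
    for author, files in commits:
        if any(path.startswith(prefix) for path in files):
            counts[author] += 1
    return dict(counts)


edges = coauthor_network(COMMITS)
degree = weighted_degree(edges)
net_counts = commits_per_author(COMMITS, "net/")

print("edges:", edges)
print("weighted degree:", degree)
print("commits under net/:", net_counts)
```

In this toy log, the author with the highest weighted degree is the one who shares files with the most collaborators. The per-prefix commit counts give a separate, module-level view of who might be asked about a given area.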

Author information

Corresponding author

Correspondence to Andrew Dittrich.

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Dittrich, A., Gunes, M.H., Dascalu, S. (2013). Network Analysis of Software Repositories: Identifying Subject Matter Experts. In: Menezes, R., Evsukoff, A., González, M. (eds) Complex Networks. Studies in Computational Intelligence, vol 424. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30287-9_20

  • DOI: https://doi.org/10.1007/978-3-642-30287-9_20

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-30286-2

  • Online ISBN: 978-3-642-30287-9

  • eBook Packages: Engineering, Engineering (R0)
