An Analysis of Author Contribution Patterns in Eclipse Foundation Project Source Code

  • Quinn C. Taylor
  • Jonathan L. Krein
  • Alexander C. MacLean
  • Charles D. Knutson
Part of the IFIP Advances in Information and Communication Technology book series (IFIPAICT, volume 365)


Collaborative development is a key tenet of open source software, but if not properly understood and managed, it can become a liability. We examine author contribution data for the newest revision of 251,633 Java source files in 592 Eclipse projects. We use this observational data to analyze collaboration patterns within files, and to explore relationships between file size, author count, and code authorship. We calculate author entropy to characterize the contributions of multiple authors to a given file, with an eye toward understanding the degree of collaboration and the most common interaction patterns.


Source Code Open Source Software Authorship Pattern Open Source Software Development Line Count 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Bird, C., Pattison, D., D’Souza, R., Filkov, V., Devanbu, P.: Chapels in the Bazaar? Latent Social Structure in OSS. In: FSE (2008),
  2. 2.
    Brooks Jr., F.P.: The Mythical Man-Month: Essays on Software Engineering. Addison-Wesley Longman Publishing Co., Inc., Amsterdam (1975)Google Scholar
  3. 3.
    Casebolt, J.R., Krein, J.L., MacLean, A.C., Knutson, C.D., Delorey, D.P.: Author Entropy vs. File Size in the GNOME Suite of Applications. In: Proceedings of the 6th IEEE Working Conference on Mining Software Repositories (MSR 2009), pp. 91–94 ( May 2009)Google Scholar
  4. 4.
    Conway, M.E.: Do Committees Invent? Datamation 14(4), 28–31 (1968)Google Scholar
  5. 5.
    Dinh-Trong, T.T., Bieman, J.M.: The FreeBSD Project: A Replication Case Stdy of Open Source Development. IEEE Transactions of Software Engineering 31(6), 481–494 (2005)CrossRefGoogle Scholar
  6. 6.
    Ducheneaut, N.: Socialization in an Open Source Software Community: A Socio-Technical Analysis. In: Computer Supported Cooperative Work, vol. 14, pp. 323–368 (2005)Google Scholar
  7. 7.
    Gilbert, E., Karahalios, K.: CodeSaw: A social visualization of distributed software development. In: Baranauskas, C., Abascal, J., Barbosa, S.D.J. (eds.) INTERACT 2007. LNCS, vol. 4663, pp. 303–316. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  8. 8.
    Jensen, C., Scacchi, W.: Role Migration and Advancement Processes in OSSD Projects: A Comparative Case Study. In: 29th International Conference on Software Engineering, ICSE 2007 Minneapolis, MN, pp. 364–374 (May 2007)Google Scholar
  9. 9.
    Krein, J.L., MacLean, A.C., Knutson, C.D., Delorey, D.P., Eggett, D.L.: Impact of Programming Language Fragmentation on Developer Productivity: a SourceForge Empirical Study. International Journal of Open Source Software and Processes (IJOSSP) 2(2), 41–61 (2010)CrossRefGoogle Scholar
  10. 10.
    Lotka, A.J.: The frequency distribution of scientific productivity. Journal of the Washington Academy of Sciences 16(12), 317–324 (1926)Google Scholar
  11. 11.
    Mitchell, T.M.: Machine Learning, pp. 55–57. McGraw-Hill, New York (1997)zbMATHGoogle Scholar
  12. 12.
    Mockus, A., Fielding, R.T., Herbsleb, J.D.: Two Case Studies of Open Source Software Development: Apache and Mozilla. ACM Transactions on Software Engineering and Methodology 11(3), 309–346 (2002)CrossRefGoogle Scholar
  13. 13.
    Raymond, E.S.: The Cathedral and the Bazaar: Musings on Linux and Open Source by an Accidental Revolutionary. O’Reilly and Associates, Inc., Sebastopol (2001)Google Scholar
  14. 14.
    Shannon, C.E.: A Mathematical Theory of Communication. The Bell System Technical Journal 27, 379–423 (1948)MathSciNetzbMATHGoogle Scholar
  15. 15.
    Taylor, Q.C., Stevenson, J.E., Delorey, D.P., Knutson, C.D.: Author Entropy: A Metric for Characterization of Software Authorship Patterns. In: Proceedings of the 3rd International Workshop on Public Data about Software Development (WoPDaSD 2008), Milan, Italy (September 2008)Google Scholar
  16. 16.
    von Krogh, G., Spacth, S., Lakhani, K.R.: Community, Joining, and Specialization in Open Source Software Innovation: A Case Study (Open Source Software Development). Research Policy 32(7), 1217–1241 (2003)CrossRefGoogle Scholar

Copyright information

© IFIP International Federation for Information Processing 2011

Authors and Affiliations

  • Quinn C. Taylor
    • 1
  • Jonathan L. Krein
    • 1
  • Alexander C. MacLean
    • 1
  • Charles D. Knutson
    • 1
  1. 1.SEQuOIA LabBrigham Young UniversityProvoUSA

Personalised recommendations