Abstract
Nowadays Open-Source Software is developed mostly by decentralized teams of developers cooperating on-line. GitHub portal is an online social network that supports development of software by virtual teams of programmers. Since there is no central mechanism that governs the process of team formation, it is interesting to investigate if there are any significant correlations between project quality and the characteristics of the team members. However, for such analysis to be possible, we need good metrics of a project quality. This paper develops two such metrics, first one reflecting project’s popularity, and the second one - the quality of support offered by team members to users. The first metric is based on the number of ‘stars’ a project is given by other GitHub members, the second is obtained using survival analysis techniques applied to issues reported on the project by its users. After developing the metrics we have gathered characteristics of several GitHub projects and analyzed their influence on the project quality using statistical regression techniques.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cleveland, W.S., Devlin, S.J.: Journal of the American Statistical Association 83(403), 596–610 (1988)
Crowston, K., Wei, K., Howison, J., Wiggins, A.: Free/libre open-source software development: What we know and what we do not know. ACM Comput. Surv. 44(2), 7:1–7:35 (2008)
Dabbish, L., Stuart, C., Tsay, J., Herbsleb, J.: Social coding in github: Transparency and collaboration in an open software repository. In: Proceedings of the ACM 2012 Conference on Computer Supported Cooperative Work, CSCW 2012, pp. 1277–1286. ACM, New York (2012)
Farah, G., Tejada, J.S., Correal, D.: Openhub: a scalable architecture for the analysis of software quality attributes. In: Proceedings of the 11th Working Conference on Mining Software Repositories, pp. 420–423. ACM (2014)
Ferenc, R., Hegedus, P., Gyimothy, T.: Software product quality models. In: Mens, T., Serebrenik, A., Cleve, A. (eds.) Evolving Software Systems, pp. 65–100. Springer, Heidelberg (2014)
Fischer, M., Pinzger, M., Gall, H.: Analyzing and relating bug report data for feature tracking. In: 2013 20th Working Conference on Reverse Engineering (WCRE), p. 90. IEEE Computer Society (2003)
Hosmer, D.W., Lemeshow, S.: Applied logistic regression. Wiley-Interscience Publication (2000)
Hupa, A., Rzadca, K., Wierzbicki, A., Datta, A.: Interdisciplinary matchmaking: Choosing collaborators by skill, acquaintance and trust. In: Abraham, A., Hassanien, A., Snášel, V. (eds.) Computational Social Network Analysis. Computer Communications and Networks, pp. 319–347. Springer, London (2010)
Kalbfleisch, J.D., Prentice, R.L.: The statistical analysis of failure time data. John Wiley & Sons (2002)
Kalliamvakou, E., Gousios, G., Blincoe, K., Singer, L., German, D.M., Damian, D.: The promises and perils of mining github. In: Proceedings of the 11th Working Conference on Mining Software Repositories, MSR 2014, pp. 92–101. ACM, New York (2014)
Khondhu, J., Capiluppi, A., Stol, K.-J.: Is it all lost? a study of inactive open source projects. In: Open Source Software: Quality Verification, pp. 61–79. Springer (2013)
McDonald, N., Goggins, S.: Performance and participation in open source software on github. In: CHI 2013 Extended Abstracts on Human Factors in Computing Systems, CHI EA 2013, pp. 139–144. ACM, New York (2013)
Michlmayr, M., Senyard, A.: A statistical analysis of defects in debian and strategies for improving quality in free software projects. The Economics of Open Source Software Development, 131–148 (2006)
O’Mahony, S., Ferraro, F.: The emergence of governance in an open source community. Academy of Management Journal 50(5), 1079–1106 (2007)
Oram, A., Wilson, G.: Making Software: What Really Works, and Why We Believe It. O’Reilly Media (2010)
Rahmani, C., Khazanchi, D.: A study on defect density of open source software. In: 2010 IEEE/ACIS 9th International Conference on Computer and Information Science (ICIS), pp. 679–683. IEEE (2010)
Turek, P.: Wikiteams: How do they achieve success? IEEE Potentials 30(5), 15–20 (September 2011)
Turek, P., Wierzbicki, A., Nielek, R., Hupa, A., Datta, A.: Learning about the quality of teamwork from wikiteams. In: 2010 IEEE Second International Conference on Social Computing (SocialCom), pp. 17–24 (August 2010)
Wierzbicki, A., Turek, P., Nielek, R.: Learning about team collaboration from wikipedia edit history. In: Proceedings of the 6th International Symposium on Wikis and Open Collaboration, WikiSym 2010, pp. 27:1–27:2. ACM, New York (2010)
Zimmermann, T., Weissgerber, P.: Mining version histories to guide software changes. In: 26th International Conference on Software Engineering (ICSE 2004), pp. 563–572 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Jarczyk, O., Gruszka, B., Jaroszewicz, S., Bukowski, L., Wierzbicki, A. (2014). GitHub Projects. Quality Analysis of Open-Source Software. In: Aiello, L.M., McFarland, D. (eds) Social Informatics. SocInfo 2014. Lecture Notes in Computer Science, vol 8851. Springer, Cham. https://doi.org/10.1007/978-3-319-13734-6_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-13734-6_6
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13733-9
Online ISBN: 978-3-319-13734-6
eBook Packages: Computer ScienceComputer Science (R0)