Abstract
Open source software (OSS) community relies on volunteers and developers contributions for its survival. However, only a few projects reach success and popularity in open source community. Then, it is important to know the success factors of OSS projects. In this paper, we have applied time series clustering on open source projects hosted on a social coding platform to understand the main effective attributes of an OSS project on its popularity trends. We have applied exploratory data analysis on each cluster to see the effect of projects’ performance and attributes on projects’ reputation inside the OSS community. Finally, we have applied machine learning techniques to predict the popularity trend of OSS projects. Having access to the social coding data expands our view on project popularity on both social and technical factors. Results of this empirical study can help project owners and members to manage and promote the project reputation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Blincoe, K., Sheoran, J., Goggins, S., Petakovic, E., Damian, D.: Understanding the popular users: following, affiliation influence and leadership on Github. Inf. Softw. Technol. 70, 30–39 (2016)
Borges, H., Hora, A., Valente, M.T.: Understanding the factors that impact the popularity of Github repositories. arXiv preprint arXiv:1606.04984) (2016)
Cosentino, V., Izquierdo, J.L.C., Cabot, J.: A systematic mapping study of software development with Github. IEEE Access 5, 7173–7192 (2017)
Dabbish, L., Stuart, C., Tsay, J., Herbsleb, J.: Leveraging transparency. IEEE Softw. 30(1), 37–43 (2013)
Gousios, G., Spinellis, D.: Ghtorrent: Github’s data from a firehose. In: 2012 9th IEEE Working Conference on Mining Software Repositories (MSR), pp. 12–21. IEEE (2012)
Grewal, R., Lilien, G.L., Mallapragada, G.: Location, location, location: how network embeddedness affects project success in open source systems. Manage. Sci. 52(7), 1043–1056 (2006)
Jarczyk, O., Gruszka, B., Jaroszewicz, S., Bukowski, L., Wierzbicki, A.: GitHub projects. quality analysis of open-source software. In: Aiello, L.M., McFarland, D. (eds.) SocInfo 2014. LNCS, vol. 8851, pp. 80–94. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-13734-6_6
Jiang, J., Lo, D., He, J., Xia, X., Kochhar, P.S., Zhang, L.: Why and how developers fork what from whom in Github. Empirical Softw. Eng. 22(1), 547–578 (2017)
Kalliamvakou, E., Gousios, G., Blincoe, K., Singer, L., German, D.M., Damian, D.: The promises and perils of mining Github. In: Proceedings of the 11th Working Conference on Mining Software Repositories, pp. 92–101. ACM (2014)
Lee, S.-Y.T., Kim, H.-W., Gupta, S.: Measuring open source software success. Omega 37(2), 426–438 (2009)
Midha, V., Palvia, P.: Factors affecting the success of open source software. J. Syst. Softw. 85(4), 895–905 (2012)
Oates, T., Firoiu, L., Cohen, P.R.: Clustering time series with hidden markov models and dynamic time warping. In: Proceedings of the IJCAI-99 Workshop on Neural, Symbolic and Reinforcement Learning Methods for Sequence Learning, Sweden Stockholm, pp. 17–21 (1999)
Schall, D.: Who to follow recommendation in large-scale online development communities. Inf. Softw. Technol. 56(12), 1543–1555 (2014)
Schilling, A., Laumer, S., Weitzel, T.: Who will remain? An evaluation of actual person-job and person-team fit to predict developer retention in floss projects. In: 2012 45th Hawaii International Conference on System Science (HICSS), pp. 3446–3455. IEEE (2012)
Stewart, K.J., Ammeter, A.P., Maruping, L.M.: Impacts of license choice and organizational sponsorship on user interest and development activity in open source software projects. Inf. Syst. Res. 17(2), 126–144 (2006)
Subramaniam, C., Sen, R., Nelson, M.L.: Determinants of open source software project success: a longitudinal study. Decis. Support Syst. 46(2), 576–585 (2009)
Tsay, J., Dabbish, L., Herbsleb, J.: Influence of social and technical factors for evaluating contribution in Github. In: Proceedings of the 36th International Conference on Software Engineering, pp. 356–366. ACM (2014)
Weber, S., Luo, J.: What makes an open source code popular on Git Hub? In: 2014 IEEE International Conference on Data Mining Workshop, pp. 851–855. IEEE (2014)
Wu, J., Goh, K.Y.: Evaluating longitudinal success of open source software projects: a social network perspective. In: 42nd Hawaii International Conference on System Sciences, HICSS’2009, pp. 1–10. IEEE (2009)
Yamashita, K., Kamei, Y., McIntosh, S., Hassan, A.E., Ubayashi, N.: Magnet or Sticky? measuring project characteristics from the perspective of developer attraction and retention. J. Inf. Process. 24(2), 339–348 (2016)
Yu, Y., Wang, H., Filkov, V., Devanbu, P., Vasilescu, B.: Wait for it: determinants of pull request evaluation latency on Github. In: 2015 IEEE/ACM 12th Working Conference on Mining Software Repositories (MSR), pp. 367–371. IEEE (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Bayati, S., Heidary, M. (2020). Time Series Analysis of Open Source Projects Popularity. In: Lang, K.R., et al. Smart Business: Technology and Data Enabled Innovative Business Models and Practices. WeB 2019. Lecture Notes in Business Information Processing, vol 403. Springer, Cham. https://doi.org/10.1007/978-3-030-67781-7_8
Download citation
DOI: https://doi.org/10.1007/978-3-030-67781-7_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-67780-0
Online ISBN: 978-3-030-67781-7
eBook Packages: Computer ScienceComputer Science (R0)