Using Repository of Repositories (RoRs) to Study the Growth of F/OSS Projects: A Meta-Analysis Research Approach

  • Sulayman K. Sowe
  • L. Angelis
  • I. Stamelos
  • Y. Manolopoulos
Part of the IFIP — The International Federation for Information Processing book series (IFIPAICT, volume 234)

Abstract

Free/Open Source Software (F/OSS) repositories contain valuable data and their usefulness in studying software development and community activities continues to attract a lot of research attention. A trend in F/OSS studies is the use of metadata stored in a repository of repositories or RoRs. This paper utilizes data obtained from such RoRs -FLOSSmole- to study the types of projects being developed by the F/OSS community. We downloaded projects by topics data in five areas (Database, Internet, Software Development, Communications, and Games/Entertainment) from Flossmole’s raw and summary data of the sourceforge repository. Time series analysis show the numbers of projects in the five topics are growing linearly. Further analysis supports our hypothesis that F/OSS development is moving “up the stack” from developer tools and infrastructure support to end-user applications such as Databases. The findings have implications for the interpretation of the F/OSS landscape, the utilization and adoption of open source databases, and problems researchers might face in obtaining and using data from RoRs.

Key words

Open Source Software Repositories Metadata Time Series Analysis Missing Values Estimation Projects Growth Linear Trend Open Source Databases 

References

  1. 1.
    Hahsler M, Koch S., Discussion of a Large-Scale Open Source Data Collection Methodology, Proceedings of the 38th Hawaii International Conference on System Sciences (IEEE, HICSS’ 05-Track 7), Jan 03–06, Big Island, Hawaii, 2005, page 197b.Google Scholar
  2. 2.
    Barahona, J. et al., Analysing the Anatomy of GNU/Linux Distributions: Methodology and Case Studies (Red Hat and Debian), In Koch, S. (Ed.), Free/Open Source Software Development, Idea Group Inc., 2005, pp: 27–58.Google Scholar
  3. 3.
    govtech.net, SourceForge.net Surpasses 100,000 Open Source Projects, http:// www.govtech.net/news/news.php?id=94043, May, 2005.Google Scholar
  4. 4.
    Nikolai Bezroukov, Open Source Software Development as a Special Type of Academic Research (Critique of Vulgar Raymondism), Firstmonday, 1999, vol. 4(10).Google Scholar
  5. 5.
    Scott Christley and Greg Madey, Collection of Activity Data for SourceForge Projects, Technical Report: TR-2005-15, University of Notre Dame, 2005.Google Scholar
  6. 6.
    Megan S Conklin, Beyond Low-Hanging Fruit: Seeking the Next Generation in FLOSS Data Mining, In Damiani, E., Fitzerald, B., Scacchi, W., Scott, M., Succi, G (Eds.), IFIP International Federation for Information Processing, Open Source Systems, Boston: Springer, Vol. 203, 2006, pp: 261–266.Google Scholar
  7. 7.
    Kevin Crowston, Hala Annabi, and James Howison, Defining Open Source Software Project Success, Proc. of International Conference on Information Systems, ICIS 2003, 2003.Google Scholar
  8. 8.
    G. Madey, V. Freeh, and R. Tynan, The open source software development phenomenon: An analysis based on social network theory, In Americas conf. on Information Systems (AMCIS2002), 2002, pp: 18061813.Google Scholar
  9. 9.
    German, D. and Mockus, A., Automating the Measurement of Open Source Projects, ICSE’ 03 Workshop on Open Source Software Engineering, Portland, Oregon, May 3–10, 2003.Google Scholar
  10. 10.
    Ghosh, A.R., Clustering and dependencies in free/open source software development: Methodology and tools, Firstmonday, Vol. 8(4), 2004.Google Scholar
  11. 11.
    Howison, J., Conklin, M., and Crowston, K., FLOSSmole: A collaborative repository for FLOSS research data and analyses, International Journal of Information Technology and Web Engineering, Vol. 1(3), 2006, pp: 17–26.Google Scholar
  12. 12.
    J. Howison, and K. Crowston, The Perils and pitfalls of mining SourceForge, 26th International Conference on Software Engineering, Edinburgh, Scotland, 2004.Google Scholar
  13. 13.
    Koch S, Schneider G., Effort, cooperation and coordination in an open source software project: Gnome., Information Systems Journal, Vol. 12(1), 2002, pp: 27–42.CrossRefGoogle Scholar
  14. 14.
    Krogh G, Spaeth S, and Lakhani, K., Community, joining, and specialisation in open source software innovation: a case study, Research Policy, Vol. 32, 2003, pp: 1217–1241.CrossRefGoogle Scholar
  15. 15.
    Lakhani K, Hippel von E., How open source software works: “free” user-to-user assistance, Research Policy, Vol. 32, 2003, pp: 923–943.CrossRefGoogle Scholar
  16. 16.
    David M. Nichols and Michael B. Twidale, Usability processes in open source projects, Software Process: Improvement and Practice, Vol. 11(2), 2006, pp: 149–162.CrossRefGoogle Scholar
  17. 17.
    Masao Ohira, Naoki Ohsugi, Tetsuya Ohoka, and Ken-ichi Matsumoto, Accelerating cross-project knowledge collaboration using collaborative filtering and social networks, MSR’ 05: Proceedings of the 2005 international workshop on Mining software repositories, ACM Press, 2005, pp: 1–5.Google Scholar
  18. 18.
    FLOSSmole Project, FLOSSmole Project (2004–2006) Sourceforge, http://ossmole.Sourceforge.net.Google Scholar
  19. 19.
    Raymond, E.S., The Cathedral and the Bazaar. Musings on Linux and Open Source by an Accidental Revolutionary, OReilly, Sebastopol, USA., 1999.Google Scholar
  20. 20.
    Gregorio Robles and Jesus M. Gonzalez-Barahona, Geographic location of developers at SourceForge, In MSR’ 06: Proceedings of the 2006 international workshop on Mining software repositories, ACM Press, 2006, pp: 144–150.Google Scholar
  21. 21.
    Sulayman K. Sowe and Ioannis. Stamelos, Identification of Knowledge Brokers in F/OSS Projects through Social and Collaborative Networks, In Proc. of 10th Panhellenic Conference on Informatics, Volos, Greece, 2005, pp.285–303.Google Scholar
  22. 22.
    Sulayman K. Sowe, Ioannis Samoladas, and Ioannis Stamelos, Trends in Open Source Database Management Systems. In Laura C. Rivero, Jorge H. Doorn, and Viviana E. Ferraggine (Eds.) Encyclopedia of Database Technologies and Applications, Idea Group, Inc., 2005, pp: 457–462.Google Scholar
  23. 23.
    David J. Sheskin, Handbook of Parametric and Nonparametric Statistical Procedures, Chapman and Halucrc, 2004.Google Scholar
  24. 24.
    Sulayman K. Sowe, Ioannis Stamelos, and Karoulis, Anastesiou, A constructivist View on Knowledge Management in Open Source Virtual Communities. In Figueiredo, D. A, and Paula, A. (Eds.), Managing Learning in Virtual Settings: The Role of Context, Idea Group, Inc., 2005, pp: 290–308.Google Scholar
  25. 25.
    Sulayman K. Sowe, Ioannis Stamelos, Lefteris Angelis, Identifying knowledge brokers that yield software engineering knowledge in OSS projects, Information and Software Technology, 2006, vol. 48, pp: 1025–1033.CrossRefGoogle Scholar
  26. 26.
    Katherine J. Stewart, David P. Darcy, and Sherae L. Daniel, Observations on patterns of development in open source software projects, 5-WOSSE: Proceedings of the fifth workshop on Open source software engineering, St. Louis, Missouri, ACM Press, 2005, pp: 1–5.Google Scholar
  27. 27.
    Walt Scacchi, Joseph Feller, Brian Fitzgerald, Scott A. Hissam, and Karim Lakhani, Understanding Free/Open Source Software Development Processes, Software Process: Improvement and Practice, 2006, vol. 11(2), pp: 95–105.CrossRefGoogle Scholar
  28. 28.
    Xu J, Gao Y, Christley, S, Madey S., A topological Analysis of the Open Source Software Development Community, IEEE Proceedings of the 38th Hawaii International Conference on System Sciences, (IEEE, HICSS’ 05-Track 7), Jan 03–06, Big Island, Hawaii., 2005, page 198a.Google Scholar
  29. 29.
    Valverde, S., Theraulaz, G., Gautrais, J., Fourcassie, V., and Sole, R. V. Self-Organization Patterns in Wasp and Open Source Communities. IEEE Intelligent Systems, 2006; 21(2): 36–40.CrossRefGoogle Scholar
  30. 30.
    N. Bassiliades, I. Vlahavas, A. Elmagarmid, E. Houstis, Interbase-KB: A Knowledge-based Multidatabase Sustem for Data Warehousing, IEEE Transactions on Knowledge and Data Engineering, Vol. 15, No. 5, pp. 1188–1205, 2003.CrossRefGoogle Scholar

Copyright information

© International Federation for Information Processing 2007

Authors and Affiliations

  • Sulayman K. Sowe
    • 1
  • L. Angelis
    • 1
  • I. Stamelos
    • 1
  • Y. Manolopoulos
    • 1
  1. 1.Department of InformaticsAristotle UniversityThessalonikiGreece

Personalised recommendations