Abstract
The Open Source Software(OSS) movement has attracted considerable attention in the last few years. In this paper we report our results of mining data acquired from SourceForge.net, the largest open source software hosting website. In the process we introduce Association Rules Network(ARN), a (hyper)graphical model to represent a special class of association rules. Using ARNs we discover important relationships between the attributes of successful OSS projects. We verify and validate these relationships using Factor Analysis, a classical statistical technique related to Singular Value Decomposition(SVD).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Rakesh Agrawal and Ramakrishnan Srikant. Fast algorithms for mining association rules. In Jorge B. Bocca, Matthias Jarke, and Carlo Zaniolo, editors, Proc. 20th Int. Conf. Very Large Data Bases, VLDB, pages 487–499. Morgan Kaufmann, 12–15 1994.
Sanjay Chawla, Bavani Arunasalam, and Joseph Davis. Mining open source software(oss) data using association rules network. Technical Report TR 535, School of IT, University of Sydney, Sydney, NSW, Australia, 2003.
A. Dutoit and B. Bruegge. Communication metrics for software development. IEEE Transactions On Software Engineering, 24(8):615–628, 1998.
L. Feng, J. Yu, H. Lu, and J. Han. A template model for multi-dimensional, inter-transactional association rules. VLDB Journal, 11(2):153–175, 2002.
R.L Glass. The sociology of open source: of cults and cultures. IEEE Software, 17(3):104–105, 2000.
Eui-Hong Han, George Karypis, Vipin Kumar, and Bamshad Mobasher. Clustering based on association rule hypergraphs. In Proceedings SIGMOD Workshop Research Issues on Data Mining and Knowledge Discovery(DMKD’ 97), 1997.
Han, J., Kamber, M., 2001. Data Mining, Concepts and Trends. Morgan Kaufmann.
Hand, D., Mannila, H., Smyth, P., 2001. Principles of Data Mining. M.I.T Press.
Bing Liu, Wynne Hsu, and Yiming Ma. Integrating classification and association rule mining. In Knowledge Discovery and Data Mining, pages 80–86, 1998.
E.S. Raymond. The Cathedral and Bazaar:Musings on Open Source and Linux by an Accidental Revolutionary. O’Reilly, 2001.
L. Torwalds. The linux edge. Communications of the ACM, 42(4):38–39, 1999.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chawla, S., Arunasalam, B., Davis, J. (2003). Mining Open Source Software (OSS) Data Using Association Rules Network. In: Whang, KY., Jeon, J., Shim, K., Srivastava, J. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2003. Lecture Notes in Computer Science(), vol 2637. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36175-8_46
Download citation
DOI: https://doi.org/10.1007/3-540-36175-8_46
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-04760-5
Online ISBN: 978-3-540-36175-6
eBook Packages: Springer Book Archive