Skip to main content

Mining Open Source Software (OSS) Data Using Association Rules Network

  • Conference paper
  • First Online:
Advances in Knowledge Discovery and Data Mining (PAKDD 2003)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2637))

Included in the following conference series:

Abstract

The Open Source Software(OSS) movement has attracted considerable attention in the last few years. In this paper we report our results of mining data acquired from SourceForge.net, the largest open source software hosting website. In the process we introduce Association Rules Network(ARN), a (hyper)graphical model to represent a special class of association rules. Using ARNs we discover important relationships between the attributes of successful OSS projects. We verify and validate these relationships using Factor Analysis, a classical statistical technique related to Singular Value Decomposition(SVD).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Rakesh Agrawal and Ramakrishnan Srikant. Fast algorithms for mining association rules. In Jorge B. Bocca, Matthias Jarke, and Carlo Zaniolo, editors, Proc. 20th Int. Conf. Very Large Data Bases, VLDB, pages 487–499. Morgan Kaufmann, 12–15 1994.

    Google Scholar 

  2. Sanjay Chawla, Bavani Arunasalam, and Joseph Davis. Mining open source software(oss) data using association rules network. Technical Report TR 535, School of IT, University of Sydney, Sydney, NSW, Australia, 2003.

    Google Scholar 

  3. A. Dutoit and B. Bruegge. Communication metrics for software development. IEEE Transactions On Software Engineering, 24(8):615–628, 1998.

    Article  Google Scholar 

  4. L. Feng, J. Yu, H. Lu, and J. Han. A template model for multi-dimensional, inter-transactional association rules. VLDB Journal, 11(2):153–175, 2002.

    Article  Google Scholar 

  5. R.L Glass. The sociology of open source: of cults and cultures. IEEE Software, 17(3):104–105, 2000.

    Google Scholar 

  6. Eui-Hong Han, George Karypis, Vipin Kumar, and Bamshad Mobasher. Clustering based on association rule hypergraphs. In Proceedings SIGMOD Workshop Research Issues on Data Mining and Knowledge Discovery(DMKD’ 97), 1997.

    Google Scholar 

  7. Han, J., Kamber, M., 2001. Data Mining, Concepts and Trends. Morgan Kaufmann.

    Google Scholar 

  8. Hand, D., Mannila, H., Smyth, P., 2001. Principles of Data Mining. M.I.T Press.

    Google Scholar 

  9. Bing Liu, Wynne Hsu, and Yiming Ma. Integrating classification and association rule mining. In Knowledge Discovery and Data Mining, pages 80–86, 1998.

    Google Scholar 

  10. E.S. Raymond. The Cathedral and Bazaar:Musings on Open Source and Linux by an Accidental Revolutionary. O’Reilly, 2001.

    Google Scholar 

  11. L. Torwalds. The linux edge. Communications of the ACM, 42(4):38–39, 1999.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chawla, S., Arunasalam, B., Davis, J. (2003). Mining Open Source Software (OSS) Data Using Association Rules Network. In: Whang, KY., Jeon, J., Shim, K., Srivastava, J. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2003. Lecture Notes in Computer Science(), vol 2637. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36175-8_46

Download citation

  • DOI: https://doi.org/10.1007/3-540-36175-8_46

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-04760-5

  • Online ISBN: 978-3-540-36175-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics