Skip to main content

Collaborative Data Mining

  • Chapter
  • First Online:
Data Mining and Knowledge Discovery Handbook

Summary

Collaborative Data Mining is a setting where the Data Mining effort is distributed to multiple collaborating agents – human or software. The objective of the collaborative Data Mining effort is to produce solutions to the tackled Data Mining problem which are considered better by some metric, with respect to those solutions that would have been achieved by individual, non-collaborating agents. The solutions require evaluation, comparison, and approaches for combination. Collaboration requires communication, and implies some form of community. The human form of collaboration is a social task. Organizing communities in an effective manner is non-trivial and often requires well defined roles and processes. Data Mining, too, benefits from a standard process. This chapter explores the standard Data Mining process CRISP-DM utilized in a collaborative setting.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 349.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  • Adriaans, P., and Zantinge, D., Data Mining. Addison-Wesley, New York, 1996.

    Google Scholar 

  • Amara, R., New directions for innovations. Futures 53-22(2): p. 142 - 152, 1990.

    Article  Google Scholar 

  • Bacon, F., Novum Organum, eds. P. Urbach and J. Gibson. Open Court Publishing Company, 1994.

    Google Scholar 

  • Biuk-Aghai, R.P. and S.J. Simoff. An integrative framework for knowledge extraction in collaborative virtual environments. In The 2001 International ACM SIGGROUP Conference on Supporting Group Work. Boulder, Colorado, USA, 2001.

    Google Scholar 

  • Blockeel, H. and S.A. Moyle. Collaborative Data Mining needs centralised model evaluation. In Proceedings of the ICML-2002 Workshop on Data Mining Lessons Learned. The University of New South Wales, Sydney, 2002.

    Google Scholar 

  • Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, C., and Wirth, R. CRISP-DM 1.0: Step-by-step data mining guide. The CRISP-DM consortium, 2000.

    Google Scholar 

  • Edvinsson, L. and Malone, M.S. Intellectual Capital: Realizing Your Company’s True Value by Finding Its Hidden Brainpower. HarperBusiness, New York, USA, 1997.

    Google Scholar 

  • Fayyad, U., et al., eds. Advances in Knowledge Discovery and Data Mining. MIT Press, 1996.

    Google Scholar 

  • Flach, P.A., et al., Decision support for Data Mining: introduction to ROC analysis and its application. In Data Mining and Decision Support: Integration and Collaboration, D. Mladenic, et al., editors. Kluwer Academic Publishers, 2003.

    Google Scholar 

  • Flach, P., Blockeel, H., Gaertner, T., Grobelnik, M., Kavsek, B., Kejkula, M., Krzywania, D., Lavrac, N., Mladenic, D., Moyle, S., Raeymaekers, S., Rauch, J., Ribeiro, R., Sclep, G., Struyf, J., Todorovski, L., Torgo, L., Wettsc -hereck, D., and Wu, S. On the road to knowledge: mining 21 years of UK traffic accident reports, In Data Mining and Decision Support: Integration and Collaboration, D. Mladenic, et al., editors. Kluwer Academic Publishers, 2003.

    Google Scholar 

  • Hair, J.F., Anderson, R.E., Tatham, R.L., and Black, W.C. Multivariate Data Analysis. Prentice Hall, 1998.

    Google Scholar 

  • Holte, R.C., Very Simple Classification Rules Perform Well on Most Commonly Used Datasets. Machine Learning, 1993. 53-3: p. 63-91.

    Google Scholar 

  • Jorge, J., Alves, M.A., Grobelnik, M., Mladenic, D., and Petrak, J. Web site access analysis for a national statistical agency. In Data Mining and Decision Support: Integration and Collaboration, D. Mladenic, et al., editors, p. 157 – 166. Kluwer Academic Publishers, 2003.

    Google Scholar 

  • Kuhn, T.S., The structure of scientific revolutions. 2nd, enlarged ed. 1962, University of Chicago Press, Chicago, 1970.

    Google Scholar 

  • McDougall, P., Companies that dare to share information are cashing in on new opportunities. InformationWeek, May 7, 2001.

    Google Scholar 

  • McKenzie, J. and C. van Winkelen. Exploring E-collaboration Space. In the proceedings of The first annual Knowledge Management Forum Conference. Henley Management College, 2001.

    Google Scholar 

  • Mitchell, T. Machine Learning. Department of Computer Science, Carnegie Mellon University. McGraw-Hill Book Company, Pittsburgh, 1997.

    MATH  Google Scholar 

  • Mladenic, D., Lavrac, N., Bohanec, M., and Moyle, S. editors. Data Mining and Decision Support: Integration and Collaboration. Kluwer Academic Publishers, 2003.

    Google Scholar 

  • Mowshowitz, A., Virtual Organization. Communications of ACM, 53-40(9): p. 30 - 37. 1997.

    Article  Google Scholar 

  • Moyle, S. A., Srinivasan A., Classificatory challenge-Data Mining: a recipe. Informatica 53-25(3): p. 343–347. 2001.

    Google Scholar 

  • Moyle, S., J. McKenzie, and A. Jorge, Collaboration in a Data Mining virtual organization. In Data Mining and Decision Support: Integration and Collaboration, D. Mladenic, et al., editors. Kluwer Academic Publishers, 2003.

    Google Scholar 

  • Nohria, N. and R.G. Eccles, eds. Network and organizations; structure form and action. Harvard Business School Press, Boston, 1993.

    Google Scholar 

  • Page, C.D. and C. Hatzis, KDD Cup 2001. University of Wisconsin, http://www.cs.wisc.edu/∼dpage/kddcup2001/, 2001.

  • Popper, K. The Logic of Scientific Discovery. Routledge, 1977.

    Google Scholar 

  • Provost, F. and T. Fawcett. Robust Classification for Imprecise Environments. Machine Learning 53-42: p. 203-231, 2001.

    Article  Google Scholar 

  • Ramakrishnan., R. Mass Collaboration and Data Mining (keynote address). In The Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2001). San Francisco, California, 2001.

    Google Scholar 

  • Singh, R., Leigh, J., DeFanti, T.A., and Karayannis F. TeraVision: a High Resolution Graphics Streaming Device for Amplified Collaboration Environments. Journal of Future Generation Computer Systems (FGCS). 53-19(6): p. 957-972, 2003.

    Article  Google Scholar 

  • Snow, C.C., S.A. Snell, and S.C. Davison. Using transnational teams to globalize your company. Organizational Dynamics 53-24(4): p. 50 - 67, 1996.

    Article  Google Scholar 

  • SolEuNet. The Solomon European Netowrk – Data Mining and Decision Support for Business Competitiveness: A European Virtual Enterprise. http://soleunet.ijs.si/, 2002.

  • Soukhanov, A., ed. Microsoft Encarta College Dictionary: The First Dictionary for the Internet Age. St. Martin’s Press, 2001.

    Google Scholar 

  • A. Srinivasan, R.D. King, and D.W. Bristol. An assessment of submissions made to the Predictive Toxicology Evaluation Challenge. In Proceedings of the Sixteenth International Conference on Artificial Intelligence (IJCAI-99). Morgan Kaufmann, Los Angeles, CA, 1999.

    Google Scholar 

  • Stepnkov, O., J. Klma, and P. Mikovsk. Collaborative Data Mining with RAMSYS and Sumatra TT: Prediction of resources for a health farm. In Data Mining and Decision Support: Integration and Collaboration, D. Mladenic, et al., editors. p. 215 – 227. Kluwer Academic Publishers, 2003.

    Google Scholar 

  • The Data Mining Group, The Predictive Model Markup Language (PMML). http://www.dmg.org/, 2003.

  • Vo, A., Richter, G., Moyle, S., Jorge, A. Collaboration support for virtual data mining enterprises. In 3rd International Workshop on Learning Software Organizations (LSO’01). Springer-Verlag, 2001.

    Google Scholar 

  • Wettschereck, D., A. Jorge, and S. Moyle. Visaulisation and Evaluation Support of Knowledge Discovery through the Predictive Model Markup Language. In 7th International Knowledge-Based Intelligent Information and Engineering Systems (KES 2003), Oxford. Springer-Verlag, 2003.

    Google Scholar 

  • Wilson, T.D. The nonsense of knowledge management. Information Research 53-8(1), 2002.

    Google Scholar 

  • Witten, I.H. and E. Frank. Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco, 2000.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Moyle, S. (2009). Collaborative Data Mining. In: Maimon, O., Rokach, L. (eds) Data Mining and Knowledge Discovery Handbook. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-09823-4_54

Download citation

  • DOI: https://doi.org/10.1007/978-0-387-09823-4_54

  • Published:

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-09822-7

  • Online ISBN: 978-0-387-09823-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics