Skip to main content

Does Relevance Matter to Data Mining Research?

  • Chapter
Data Mining: Foundations and Practice

Part of the book series: Studies in Computational Intelligence ((SCI,volume 118))

Summary

Data mining (DM) and knowledge discovery are intelligent tools that help to accumulate and process data and make use of it. We review several existing frameworks for DM research that originate from different paradigms. These DM frameworks mainly address various DM algorithms for the different steps of the DM process. Recent research has shown that many real-world problems require integration of several DM algorithms from different paradigms in order to produce a better solution elevating the importance of practice-oriented aspects also in DM research. In this chapter we strongly emphasize that DM research should also take into account the relevance of research, not only the rigor of it. Under relevance of research in general, we understand how good this research is in terms of the utility of its results. This chapter motivates development of such a new framework for DM research that would explicitly include the concept of relevance. We introduce the basic idea behind such framework and propose one sketch for the new framework for DM research based on results achieved in the information systems area having some tradition related to the relevance aspects of research.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Benbasat, I., Zmud, R. W.: The Identity Crisis Within the IS Discipline: Defining and Communicating the Discipline’s Core Properties, MIS Quarterly, 27(2), 2003, 183–194.

    Google Scholar 

  2. Bensusan, H.: Is Machine Learning Experimental Philosophy of Science?, Technical report, Bristol, UK, 2000.

    Google Scholar 

  3. Boulicaut, J.-F., Klemettinen, M., Mannila, H.: Modeling KDD Processes Within the Inductive Database Framework, DaWaK’99: Proceedings of the First International Conference on Data Warehousing and Knowledge Discovery, Springer, Berlin Heidelberg New York, London, 1999, 293–302.

    Google Scholar 

  4. Burrell, G., Morgan, G.: Sociological Paradigms and Organizational Analysis, Heinemann, London, UK, 1979.

    Google Scholar 

  5. Carvalho, D. R., Freitas, A. A., Ebecken, N. F. F.: Evaluating the Correlation Between Objective Rule Interestingness Measures and Real Human Interest, PKDD, 2005, 453–461.

    Google Scholar 

  6. Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, C., Wirth, R.: CRISP-DM 1.0 – Step-by-Step Data Mining Guide, The CRISPDM Consortium SPSS Inc., 2000, Availabe on http://www.crisp-dm.org.

  7. Coppi, R.: A Theoretical Framework for Data Mining: The Informational Paradigm, Computational Statistics and Data Analysis, 38(4), 2002, 501–515.

    Article  MATH  MathSciNet  Google Scholar 

  8. Coppock, D. S.: Data Mining and Modeling: So You have a Model, Now What?, DM Review, February 2003.

    Google Scholar 

  9. Davis, G. B.: Information Systems Conceptual Foundations: Looking Backward and Forward, Organizational and Social Perspectives on Information Technology, Kluwer, Boston, 2000, 61–82.

    Google Scholar 

  10. DeLone, W. H., McLean, E. R.: Information Systems Success: The Quest for the Dependent Variable, Information Systems Research, 3(1), 1992, 60–95.

    Article  Google Scholar 

  11. DeLone, W. H., McLean, E. R.: The DeLone and McLean Model of Information Systems Success: A Ten-Year Update, Journal of MIS, 19(4), 2003, 9–30.

    Google Scholar 

  12. Dunkel, B., Soparkar, N., Szaro, J., Uthurusamy, R.: Systems for KDD: From concepts to practice, Future Generation Computer Systems, 13(2–3), 1997, 231–242.

    Article  Google Scholar 

  13. Fayyad, U. M.: Data Mining and Knowledge Discovery: Making Sense Out of Data, IEEE Expert: Intelligent Systems and Their Applications, 11(5), 1996, 20–25.

    Google Scholar 

  14. Friedman, J. H.: Data Mining and Statistics: What’s the Connection?, Proceedings of the 29th Symposium on the Interface (Scott, D. Ed.), 1999.

    Google Scholar 

  15. Giraud-Carrier, C.: Success Stories in Data/Text Mining, Technical report, Brigham Young University, 2004, (An updated version of an ELCA Informatique SA White Paper).

    Google Scholar 

  16. Hand, D. J.: Data Mining: Statistics and More?, The American Statistician, 52, 1998, 112–118.

    Article  Google Scholar 

  17. Hand, D. J.: Statistics and Data Mining: Intersecting Disciplines, SIGKDD Explorations, 1, 1999, 16–19.

    Article  Google Scholar 

  18. Hermiz, K. B.: Critical Success Factors for Data Mining Projects, DM Review, February 1999.

    Google Scholar 

  19. Hevner, A. R., March, S. T., Park, J., Ram, S.: Design Science in Information Systems Research, MIS Quarterly, 26(1), 2004, 75–105.

    Google Scholar 

  20. Iivari, J., Hirschheim, R., Klein, H. K.: A Paradigmatic Analysis Contrasting Information Systems Development Approaches and Methodologies, Information Systems Research, 9(2), 1998, 164–193.

    Article  Google Scholar 

  21. Imielinski, T., Mannila, H.: A Database Perspective on Knowledge Discovery, Communications of ACM, 39(11), 1996, 58–64.

    Article  Google Scholar 

  22. Ives, B., Hamilton, S., Davis, G. B.: A Framework for Research in Computer-based Management Information Systems, Management Science, 26(9), 1980, 910–934.

    Article  Google Scholar 

  23. Järvinen, P.: Research methods, Opinpaja, Tampere, Finland, 2002, (http://www.uta.fi/pj/).

  24. Järvinen, P.: Action Research as an Approach in Design Science, Technical Report TR D-2005–2, Department of Computer Science, University of Tampere, Finland, 2005.

    Google Scholar 

  25. Nunamaker, J. F., Chen, M., Purdin, T. D. M.: Systems Development in Information Systems Research, Journal of Management Information Systems, 7(3), 90/91, 89–106.

    Google Scholar 

  26. Kleinberg, J., Papadimitriou, C., Raghavan, P.: A Microeconomic View of Data Mining, Data Mining and Knowledge Discovery, 2(4), 1998, 311–324.

    Article  Google Scholar 

  27. Lin, T. Y.: Granular Computing of Binary Relations I: Data Mining and Neighborhood Systems, Rough Sets and Knowledge Discovery (Polkowski, L. Skowron A., Eds.), Physica, Heidelberg, 1998, 107–140.

    Google Scholar 

  28. Lin, T. Y.: Data Mining: Granular Computing Approach, PAKDD’99: Proceedings of third Pacific-Asia Conference, Methodologies for Knowledge Discovery and Data Mining, 1999, 24–33.

    Google Scholar 

  29. Lyytinen, K.: Different Perspectives on Information Systems: Problems and Solutions, ACM Computing Surveys, 19(1), 1987, 5–46.

    Article  Google Scholar 

  30. Mannila, H.: Theoretical Frameworks for Data Mining, SIGKDD Explorations, 1(2), 2000, 30–32.

    Article  Google Scholar 

  31. Mason, R.: Experimentation and Knowledge – A Paradigmatic Perspective, Knowledge: Creation, Diffusion, Utilization, 10(1), 1988, 3–24.

    Google Scholar 

  32. Mehta, M., Rissanen, J., Agrawal, R.: MDL-Based Decision Tree Pruning, KDD’95, 1995, 216–221.

    Google Scholar 

  33. Michalski, R. S.: Seeking Knowledge in the Deluge of Facts, Fundamenta Informaticae, 30(3–4), 1997, 283–297.

    Google Scholar 

  34. Pechenizkiy, M., Puuronen, S., Tsymbal, A.: The Iterative and Interactive Data Mining Process: The ISD and KM Perspectives, FDM’04: Proceedings of Foundations of Data Mining Workshop, 2004, 129–136.

    Google Scholar 

  35. Pechenizkiy, M., Puuronen, S., Tsymbal, A.: Competitive Advantage from Data Mining: Lessons Learnt in the Information Systems Field, DEXA’05 Workshop: Philosophies and Methodologies for Knowledge Discovery (PMKD’05), IEEE CS Press, New York, 2005, 733–737, (Invited Paper).

    Google Scholar 

  36. Pechenizkiy, M., Puuronen, S., Tsymbal, A.: Why Data Mining Does Not Contribute to Business?, DMBiz’05: Proceedings of Data Mining for Business Workshop, 2005, 67–71.

    Google Scholar 

  37. Reinartz, T.: Focusing Solutions for Data Mining: Analytical Studies and Experimental Results in Real-World Domains, Springer, Berlin Heidelberg New York, 1999.

    Book  MATH  Google Scholar 

  38. Tan, P.-N., Kumar, V., Srivastava, J.: Selecting the Right Objective Measure for Association Analysis, Information Systems, 29(4), 2004, 293–313.

    Article  Google Scholar 

  39. Vapnik, V. N.: The Nature of Statistical Learning Theory, Springer, Berlin Heidelberg New York, 1995.

    MATH  Google Scholar 

  40. Wu, X., Yu, P. S., Piatetsky-Shapiro, G., Cercone, N., Lin, T. Y., Kotagiri, R., Wah, B. W.: Data Mining: How Research Meets Practical Development?, Knowledge and Information Systems, 5(2), 2003, 248–261.

    Article  Google Scholar 

  41. Yao, J. T., Yao, Y. Y.: A Granular Computing Approach to Machine Learning, FSKD’02: Proceedings of the First International Conference on Fuzzy Systems and Knowledge Discovery, 2002, 732–736.

    Google Scholar 

  42. Zadeh, L. A.: Toward a Theory of Fuzzy Information Granulation and its Centrality in Human Reasoning and Fuzzy Logic, Fuzzy Sets Systems, 90(2), 1997, 111–127.

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Pechenizkiy, M., Puuronen, S., Tsymbal, A. (2008). Does Relevance Matter to Data Mining Research?. In: Lin, T.Y., Xie, Y., Wasilewska, A., Liau, CJ. (eds) Data Mining: Foundations and Practice. Studies in Computational Intelligence, vol 118. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78488-3_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-78488-3_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-78487-6

  • Online ISBN: 978-3-540-78488-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics