Summary
Data mining (DM) and knowledge discovery are intelligent tools that help to accumulate and process data and make use of it. We review several existing frameworks for DM research that originate from different paradigms. These DM frameworks mainly address various DM algorithms for the different steps of the DM process. Recent research has shown that many real-world problems require integration of several DM algorithms from different paradigms in order to produce a better solution elevating the importance of practice-oriented aspects also in DM research. In this chapter we strongly emphasize that DM research should also take into account the relevance of research, not only the rigor of it. Under relevance of research in general, we understand how good this research is in terms of the utility of its results. This chapter motivates development of such a new framework for DM research that would explicitly include the concept of relevance. We introduce the basic idea behind such framework and propose one sketch for the new framework for DM research based on results achieved in the information systems area having some tradition related to the relevance aspects of research.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Benbasat, I., Zmud, R. W.: The Identity Crisis Within the IS Discipline: Defining and Communicating the Discipline’s Core Properties, MIS Quarterly, 27(2), 2003, 183–194.
Bensusan, H.: Is Machine Learning Experimental Philosophy of Science?, Technical report, Bristol, UK, 2000.
Boulicaut, J.-F., Klemettinen, M., Mannila, H.: Modeling KDD Processes Within the Inductive Database Framework, DaWaK’99: Proceedings of the First International Conference on Data Warehousing and Knowledge Discovery, Springer, Berlin Heidelberg New York, London, 1999, 293–302.
Burrell, G., Morgan, G.: Sociological Paradigms and Organizational Analysis, Heinemann, London, UK, 1979.
Carvalho, D. R., Freitas, A. A., Ebecken, N. F. F.: Evaluating the Correlation Between Objective Rule Interestingness Measures and Real Human Interest, PKDD, 2005, 453–461.
Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, C., Wirth, R.: CRISP-DM 1.0 – Step-by-Step Data Mining Guide, The CRISPDM Consortium SPSS Inc., 2000, Availabe on http://www.crisp-dm.org.
Coppi, R.: A Theoretical Framework for Data Mining: The Informational Paradigm, Computational Statistics and Data Analysis, 38(4), 2002, 501–515.
Coppock, D. S.: Data Mining and Modeling: So You have a Model, Now What?, DM Review, February 2003.
Davis, G. B.: Information Systems Conceptual Foundations: Looking Backward and Forward, Organizational and Social Perspectives on Information Technology, Kluwer, Boston, 2000, 61–82.
DeLone, W. H., McLean, E. R.: Information Systems Success: The Quest for the Dependent Variable, Information Systems Research, 3(1), 1992, 60–95.
DeLone, W. H., McLean, E. R.: The DeLone and McLean Model of Information Systems Success: A Ten-Year Update, Journal of MIS, 19(4), 2003, 9–30.
Dunkel, B., Soparkar, N., Szaro, J., Uthurusamy, R.: Systems for KDD: From concepts to practice, Future Generation Computer Systems, 13(2–3), 1997, 231–242.
Fayyad, U. M.: Data Mining and Knowledge Discovery: Making Sense Out of Data, IEEE Expert: Intelligent Systems and Their Applications, 11(5), 1996, 20–25.
Friedman, J. H.: Data Mining and Statistics: What’s the Connection?, Proceedings of the 29th Symposium on the Interface (Scott, D. Ed.), 1999.
Giraud-Carrier, C.: Success Stories in Data/Text Mining, Technical report, Brigham Young University, 2004, (An updated version of an ELCA Informatique SA White Paper).
Hand, D. J.: Data Mining: Statistics and More?, The American Statistician, 52, 1998, 112–118.
Hand, D. J.: Statistics and Data Mining: Intersecting Disciplines, SIGKDD Explorations, 1, 1999, 16–19.
Hermiz, K. B.: Critical Success Factors for Data Mining Projects, DM Review, February 1999.
Hevner, A. R., March, S. T., Park, J., Ram, S.: Design Science in Information Systems Research, MIS Quarterly, 26(1), 2004, 75–105.
Iivari, J., Hirschheim, R., Klein, H. K.: A Paradigmatic Analysis Contrasting Information Systems Development Approaches and Methodologies, Information Systems Research, 9(2), 1998, 164–193.
Imielinski, T., Mannila, H.: A Database Perspective on Knowledge Discovery, Communications of ACM, 39(11), 1996, 58–64.
Ives, B., Hamilton, S., Davis, G. B.: A Framework for Research in Computer-based Management Information Systems, Management Science, 26(9), 1980, 910–934.
Järvinen, P.: Research methods, Opinpaja, Tampere, Finland, 2002, (http://www.uta.fi/pj/).
Järvinen, P.: Action Research as an Approach in Design Science, Technical Report TR D-2005–2, Department of Computer Science, University of Tampere, Finland, 2005.
Nunamaker, J. F., Chen, M., Purdin, T. D. M.: Systems Development in Information Systems Research, Journal of Management Information Systems, 7(3), 90/91, 89–106.
Kleinberg, J., Papadimitriou, C., Raghavan, P.: A Microeconomic View of Data Mining, Data Mining and Knowledge Discovery, 2(4), 1998, 311–324.
Lin, T. Y.: Granular Computing of Binary Relations I: Data Mining and Neighborhood Systems, Rough Sets and Knowledge Discovery (Polkowski, L. Skowron A., Eds.), Physica, Heidelberg, 1998, 107–140.
Lin, T. Y.: Data Mining: Granular Computing Approach, PAKDD’99: Proceedings of third Pacific-Asia Conference, Methodologies for Knowledge Discovery and Data Mining, 1999, 24–33.
Lyytinen, K.: Different Perspectives on Information Systems: Problems and Solutions, ACM Computing Surveys, 19(1), 1987, 5–46.
Mannila, H.: Theoretical Frameworks for Data Mining, SIGKDD Explorations, 1(2), 2000, 30–32.
Mason, R.: Experimentation and Knowledge – A Paradigmatic Perspective, Knowledge: Creation, Diffusion, Utilization, 10(1), 1988, 3–24.
Mehta, M., Rissanen, J., Agrawal, R.: MDL-Based Decision Tree Pruning, KDD’95, 1995, 216–221.
Michalski, R. S.: Seeking Knowledge in the Deluge of Facts, Fundamenta Informaticae, 30(3–4), 1997, 283–297.
Pechenizkiy, M., Puuronen, S., Tsymbal, A.: The Iterative and Interactive Data Mining Process: The ISD and KM Perspectives, FDM’04: Proceedings of Foundations of Data Mining Workshop, 2004, 129–136.
Pechenizkiy, M., Puuronen, S., Tsymbal, A.: Competitive Advantage from Data Mining: Lessons Learnt in the Information Systems Field, DEXA’05 Workshop: Philosophies and Methodologies for Knowledge Discovery (PMKD’05), IEEE CS Press, New York, 2005, 733–737, (Invited Paper).
Pechenizkiy, M., Puuronen, S., Tsymbal, A.: Why Data Mining Does Not Contribute to Business?, DMBiz’05: Proceedings of Data Mining for Business Workshop, 2005, 67–71.
Reinartz, T.: Focusing Solutions for Data Mining: Analytical Studies and Experimental Results in Real-World Domains, Springer, Berlin Heidelberg New York, 1999.
Tan, P.-N., Kumar, V., Srivastava, J.: Selecting the Right Objective Measure for Association Analysis, Information Systems, 29(4), 2004, 293–313.
Vapnik, V. N.: The Nature of Statistical Learning Theory, Springer, Berlin Heidelberg New York, 1995.
Wu, X., Yu, P. S., Piatetsky-Shapiro, G., Cercone, N., Lin, T. Y., Kotagiri, R., Wah, B. W.: Data Mining: How Research Meets Practical Development?, Knowledge and Information Systems, 5(2), 2003, 248–261.
Yao, J. T., Yao, Y. Y.: A Granular Computing Approach to Machine Learning, FSKD’02: Proceedings of the First International Conference on Fuzzy Systems and Knowledge Discovery, 2002, 732–736.
Zadeh, L. A.: Toward a Theory of Fuzzy Information Granulation and its Centrality in Human Reasoning and Fuzzy Logic, Fuzzy Sets Systems, 90(2), 1997, 111–127.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Pechenizkiy, M., Puuronen, S., Tsymbal, A. (2008). Does Relevance Matter to Data Mining Research?. In: Lin, T.Y., Xie, Y., Wasilewska, A., Liau, CJ. (eds) Data Mining: Foundations and Practice. Studies in Computational Intelligence, vol 118. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78488-3_15
Download citation
DOI: https://doi.org/10.1007/978-3-540-78488-3_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78487-6
Online ISBN: 978-3-540-78488-3
eBook Packages: EngineeringEngineering (R0)