A Survey of Methodologies and Techniques for Data Mining and Intelligent Data Discovery

  • Ricardo Gonzalez
  • Ali Kamrani
Part of the Massive Computing book series (MACO, volume 3)


This paper gives a description of data mining and its methodology. First, the definition of data mining along with the purposes and growing needs for such a technology are presented. A six-step methodology for data mining is then presented and discussed. The goals and methods of this process are then explained, coupled with a presentation of a number of techniques that are making the data mining process faster and more reliable. These techniques include the use of neural networks and genetic algorithms, which are presented and explained as a way to overcome several complexity problems that the data mining process possesses. A deep survey of the literature is done to show the various purposes and achievements that these techniques have brought to the study of data mining.


Genetic Algorithm Data Mining Data Mining Algorithm Structure Query Language Learning Classifier System 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. Adriaans, P. and D. Zantinge, Data Mining, Harlow: Addison-Wesley, 1996.Google Scholar
  2. Andrade, M. and P. Bork, “Automated extraction of information in molecular biology,” FEBS Letters, 476: 12–17, 2000.CrossRefGoogle Scholar
  3. Delesie, L. and L. Croes, “Operations research and knowledge discovery: a data mining method applied to health care management,” International Transactions in Operational Research, 7: 159–170, 2000.CrossRefGoogle Scholar
  4. Fayyad, U., D. Madigan, G. Piatetsky-Shapiro and P. Smyth, “From data mining to knowledge discovery in databases,” AI Magazine, 17: 37–54, 1996.Google Scholar
  5. Fayyad, U. and P. Stolorz, “Data Mining and KDD: Promise and challenges,” Future Generation Computer Systems, 13: 99–115, 1997.CrossRefGoogle Scholar
  6. Feelders, A., H. Daniels and M. Holsheimer, “Methodological and practical aspects of data mining,” Information & Management, 37: 271–281, 2000.CrossRefGoogle Scholar
  7. Fu, Z., “Dimensionality optimization by heuristic greedy learning vs. genetic algorithms in knowledge discovery and data mining,” Intelligent Data Analysis, 3: 211–225, 1999.CrossRefGoogle Scholar
  8. Glymour, C., D. Madigan, D. Pregibon and P. Smyth, “Statistical themes and lessons for data mining,” Data Mining and Knowledge Discovery, 1: 11–28, 1997.CrossRefGoogle Scholar
  9. Hand, D.J., “Data mining: statistics and more?,” The American Statistician, 52: 112–118, 1998.Google Scholar
  10. Heckerman, D., “Bayesian Networks for Knowledge Discovery,” Advances in Knowledge Discovery and Data Mining, AAAI Press, 273–305, 1996.Google Scholar
  11. Holmes, J.H., D.R. Durbin and F.K. Winston, `The learning classifier system: an evolutionary computation approach to knowledge discovery in epidemiologic surveillance,“ Artificial Intelligence in Medicine, 19: 53–74, 2000.CrossRefGoogle Scholar
  12. Koonce, D.A., C. Fang and S. Tsai, “A data mining tool for learning from manufacturing systems,” Computers & Industrial Engineering, 33: 27–30, 1997Google Scholar
  13. Kusiak, A., Computational Intelligence in Design and Manufacturing. Wiley-Interscience Publications, pp. 498–526, 1999.Google Scholar
  14. Lin, X., X. Zhou and C. Liu, “Efficient computation of a proximity matching in spatial databases,” Data & Knowledge Engineering, 33: 85–102, 2000.CrossRefMATHGoogle Scholar
  15. Michalski, R.S., “Learnable evolution model: evolutionary processes guided by machine learning,” Machine Learning, 38: 9–40, 2000.CrossRefMATHGoogle Scholar
  16. Russell, S., P. Norvig, Artificial Intelligence: A Modern Approach. New Jersey: Prentice Hall, 1995.MATHGoogle Scholar
  17. Scott, P.D. and E. Wilkins, “Evaluating data mining procedures: techniques for generation artificial data sets,” Information and software technology, 41: 579–587, 1999.CrossRefGoogle Scholar
  18. Skarmeta, A., A. Bensaid and N. Tazi, “Data mining for text categorization with semi-supervised agglomerative hierarchical clustering,” International Journal of Intelligent Systems, 15: 633–646, 2000.CrossRefMATHGoogle Scholar
  19. Subramanian, A., L.D. Smith, A.C. Nelson, J.F. Campbell and D,A. Bird, “Strategic planning for data warehousing,” Information and Management, 33: 99–113, 1997.CrossRefGoogle Scholar
  20. Yevich, R., “Data Mining,” in Data Warehouse: Practical Advice from the Experts, pp. 309–321, Prentice Hall, 1997.Google Scholar
  21. Yuanhui, Z., L. Yuchang and S. Chunyi, “Mining Classification Rules in Multi-strategy Learning Approach,” Intelligent Data Analysis, 2: 165–185, 1998.CrossRefGoogle Scholar
  22. Vila, M.A., J.C. Cubero, J.M. Medina and O. Pons, “Soft computing: a new perspective for some data mining problems,” Vistas in Astronomy, 41: 379–386, 1997.CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media Dordrecht 2001

Authors and Affiliations

  • Ricardo Gonzalez
    • 1
  • Ali Kamrani
    • 1
  1. 1.Rapid Prototyping Laboratory, College of Engineering and Computer ScienceThe University of MichiganDearbornUSA

Personalised recommendations