Knowledge Discovery and Data Mining

Wei, Chih-Ping; Piramuthu, Selwyn; Shaw, Michael J.

doi:10.1007/978-3-540-24748-7_9

Chih-Ping Wei²,
Selwyn Piramuthu³ &
Michael J. Shaw⁴

Part of the book series: International Handbooks on Information Systems ((INFOSYS,volume 2))

988 Accesses
4 Citations

Abstract

With the advances of information technology and widespread diffusion of databases systems in organizations, large volumes of data are generated and collected by organizations. This dramatic expansion of data has generated an urgent need for new analysis techniques that can intelligently and automatically transform the processed data into useful information and knowledge. As a result, knowledge discovery and data mining have increased in importance and economic value. Knowledge discovery refers to the overall process of discovering useful knowledge from data, while data mining refers to the extraction of patterns from data. This chapter provides a reasonably comprehensive review of knowledge discovery and its associated data mining techniques. Based on the kinds of knowledge that can be discovered in databases, data mining techniques can be broadly structured into several categories, including classification, clustering, dependency analysis, data visualization, and text mining. Representative data mining techniques for each category are depicted in this chapter.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agrawal, R., R. Bayardo, and R. Srikant, “Athena: Mining-based Interactive Management of Text Databases,” Proceedings of the 6th International Conference on Extending Database Technology, July 1999, 365–379.
Google Scholar
Agrawal, R., T. Imielinski, and A. Swami, “Mining Association Rules Between Sets of Items in Large Databases,” Proceedings of the ACM SIGMOD International Conference on Management of Data, Washington DC, 1993, 207–216.
Google Scholar
Agrawal, R. and R. Srikant, “Fast Algorithms for Mining Association Rules,” Proceedings of the 20th International Conference on Very Large Data Bases, Santiago, Chile, 1994, 487–499.
Google Scholar
Agrawal R. and R. Srikant, “Mining Sequential Patterns,” Proceedings of the 1995 Conference on Data Engineering, Taiepi, Taiwan, 1995, 3–14.
Google Scholar
Anand, S. S., A. E. Smith, P. W. Hamilton, J. S. Anand, J. G. Hughes, and P. H. Bartels, “An Evaluation of Intelligent Prognostic Systems for Colorectal Cancer,” Artificial Intelligence in Medicine, Vol. 15, No. 2, 1999, 193–214.
Article Google Scholar
Anderberg, M. R., Cluster Analysis for Applications, New York: Academic Press, 1973.
Google Scholar
Aoki, N., M. J. Wall, J. Demsar, B. Zupan, T. Granchi, M. A. Schreiber, J. B. Holcomb, M. Byrne, K. R. Liscum, G. Goodwin, J. R. Beck, and K. K. Mattox, “Predictive Model for Survival at the Conclusion of A Damage Control Laparotomy,” The American Journal of Surgery, Vol. 180, No. 6, December 2000, 540–545.
Article Google Scholar
Apté, C., F. Damerau, and S. Weiss, “Automated Learning of Decision Rules for Text Categorization,” ACM Transactions on Information Systems, Vol. 12, No. 3, 1994, 233–251.
Article Google Scholar
Arya, S., D. M. Mount, N. S. Netanyahu, R. Silverman, and A. Wu, “An Optimal Algorithm for Approximate Nearest Neighbor Searching,” ACM-SIAM Symposium on Discrete Algorithms (SODA), Arlington, VA, 1994, 573–582.
Google Scholar
Atlas, L., R. Cole, J. Connor, M. El-Sharkawi, R. J. Marks II, Y. Muthusamy, and E. Barnard, “Performance Comparisons between Backpropagation Networks and Classification Trees on Three Real-World Applications,” in Turetzky, D.S. (ed.), Neural Information Processing Systems (NIPS) 2, San Mateo, CA: Morgan Kaufmann, 1990, 622–629.
Google Scholar
Azuaje, F., W. Dubitzky, P. Lopes, N. Black, K. Adamson, X. Wu, and J. A. White, “Predicting Coronary Disease Risk Based on Short-term RR Interval Measurements: A Neural Network Approach,” Artificial Intelligence in Medicine, Vol. 15, No. 3, 1999, 275–297.
Article Google Scholar
Baker, L. D. and A. K. McCallum, “Distributional Clustering of Words for Text Classification,” Proceedings of the 21st International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia, August 1998, 96–103.
Google Scholar
Becker, R. A., S. G. Eick, and A. R. Wilks, “Visualizing Network Data,” IEEE Transactions on Visualization and Computer Graphics, Vol. 1, No. 1, March 1995, 16–28.
Article Google Scholar
Berry, M. J. and G. Linoff, Data Mining Techniques: For Marketing, Sales, and Customer Support, New York: John Wiley & Sons, Inc., 1997.
Google Scholar
Berson, A. and S. J. Smith, Data Warehousing, Data Mining, and OLAP, New York: McGraw-Hill, 1997.
Google Scholar
Berson, A., S. Smith, and K. Thearling, Building Data Mining Applications for CRM, New York: McGraw-Hill, 2000.
Google Scholar
Beyer, K., J. Goldstein, R. Ramakrishnan, and U. Shaft, “When is ‘Nearest Neighbor’ Meaningful?” Proceedings of the 7th International Conference on Data Theory (ICDT), Jerusalem, Israel, 1999, 217–235.
Google Scholar
Blum, A., Neural Network in C++, New York: Wiley, 1992.
Google Scholar
Bonchi, F., F. Giannotti, C. Gozzi, G. Manco, M. Nanni, D. Pedreschi, C. Renso, and S. Ruggieri, “Web Log Data Warehousing and Mining for Intelligent Web Caching,” Data and Knowledge Engineering, Vol. 39, No. 2, 2001, 165–189.
Article Google Scholar
Breiman, L., J. Friedman, R. Olshen and C. Stone, Classification and Regression Trees, Pacific Grove, CA: Wadsworth, 1984.
Google Scholar
Brin, S., R. Motwani, J. Ullman, and S. Tsur, “Dynamic Itemset Counting and Implication Rules for Market Basket Data,” Proceedings of the ACM SIGMOD International Conference on Management of Data, Tucson, AZ, 1997, 255–264.
Google Scholar
Cabena, P., P. Hadjinian, R. Stadler, J. Verhees and A. Zanasi, Discovering Data Mining: From Concept to Implementation, Upper Saddle River, NJ: Prentice Hall, 1997.
Google Scholar
Can, F. and E. A. Ozkarahan, “Concepts and Effectiveness of the CoverCoefficient-Based Clustering Methodology for Text Databases,” ACM Transactions on Database Systems, Vol. 15, No. 4, 1990, 483–517.
Article Google Scholar
Carter, C. and J. Catlett, “Assessing Credit Card Applications Using Machine Learning,” IEEE Expert, Fall 1987, 71–79.
Google Scholar
Chae, Y. M., S. H. Ho, K. W. Cho, D. H. Lee, and S. H. Ji, “Data Mining Approach to Policy Analysis in A Health Insurance Domain,” International Journal of Medical Informatics, Vol. 62, No. 2–3, 2001, 103–111.
Article Google Scholar
Chen, M. S., J. Han, and P. S. Yu, “Data Mining: An Overview from a Database Perspective,” IEEE Transactions on Knowledge and Data Engineering, Vol. 8, No. 6, 1996, 866–883.
Article Google Scholar
Cheung, D., J. Han, V. Ng, and C. Y. Wong, “Maintenance of Discovered Association Rules in Large Databases: An Incremental Updating Technique,” Proceedings of the International Conference on Data Engineering, New Orleans, LA, 1996, 106–114.
Google Scholar
Cheung, D., S. D. Lee, and B. Kao, “A General Incremental Technique for Maintaining Discovered Association Rules,” Proceedings of the 5th International Conference on Database Systems for Advanced Applications, Melbourne, Australia, 1997, 185–194.
Google Scholar
Clark, P. and T. Niblett, “The CN2 Induction Algorithm,” Machine Learning, Vol. 3, No. 4, 1989, 261–283.
Google Scholar
Cohen, W. W. and Y. Singer, “Context-sensitive Learning Methods for Text Categorization,” ACM Transactions on Information Systems, 17, 2, 1999, 141–173.
Article Google Scholar
Consens, M. P. and A. O. Mendelzon, “Hy+: A Hygraph-Based Query and Visualization System,” Proceedings of ACM SIGMOD International Conference on Management of Data, Washington D.C., 1993, 511–516.
Google Scholar
Cover, T. M. and P. E. Hart, “Nearest Neighbor Pattern Classification,” IEEE Transactions on Information Theory, IT-13, 1, 1967, 21–27.
Article Google Scholar
Davenport, T. H., D. W. DeLong, and M. C. Beers, “Successful Knowledge Management Projects,” Sloan Management Review, Winter 1998, 43–57.
Google Scholar
Davenport, T. H. and L. Prusak, Working Knowledge: How Organizations Manager What They Know, Boston, MA: Harvard Business School Press, 1998.
Google Scholar
Dey, D., S. Sarkar, and P. De, “Entity Matching in Heterogeneous Databases,” Proceedings of the 31st Hawaii International Conference on System Sciences, Kona, Hawaii, 1998.
Google Scholar
Dobkin, D. and R. J. Lipton, “Multidimensional Search Problems,” SIAM Journal of Computing, 5, 2, 1976, 181–186.
Article Google Scholar
Dumais, S., J. Platt, D. Heckerman, and M. Sahami, “Inductive Learning Algorithms and Representations for Text Categorization,” Proceedings of the 1998 ACM 7th International Conference on Information and Knowledge Management (CIKM ′98), Washington D.C., November 1998, 148–155.
Google Scholar
El-Hamdouchi, A. and P. Willett, “Hierarchical Document Clustering Using Ward’s Method,” Proceedings of ACM Conference on Research and Development in Information Retrieval, Pisa, Italy, September 1986, 149–156.
Google Scholar
Esposito, F., D. Malerba, and G. Semeraro, “A Comparative Analysis of Methods for Pruning Decision Trees,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 19, No. 5, 1997, 476–491.
Article Google Scholar
Estivill-Castro, V. and A. T. Murray, “Spatial Clustering for Data Mining with Generic Algorithms,” Technical Report FIT-TR-97–10, Queensland University of Technology, Faculty of Information Management, September 1997.
Google Scholar
Ezawa, K. J. and S. W. Norton, “Constructing Bayesian Networks to Predict Uncollectible Telecommunications Accounts,” IEEE Expert, Vol. 11, No. 5, 1996, 45–51.
Article Google Scholar
Feng, L., T. Dillon, and J. Liu, “Inter-transactional Association Rules for Multidimensional Contexts for Prediction and Their Application to Studying Meteorological Data,” Data and Knowledge Engineering, Vol. 37, No. 1, 2001, 85115.
Article Google Scholar
Frawley, W., G. Piatetsky-Shapiro, and C. J. Matheus. “Knowledge Discovery in Databases: An Overview,” in Piatesky-Shapiro, G. and Frawley, W.J. (eds.), Knowledge Discovery in Databases, Cambridge, MA: AAAI/MIT Press, 1991, 1–30.
Google Scholar
Fu, L., “Knowledge Discovery Based on Neural Networks,” Communications of the ACM, 42, 11, 1999, 47–50.
Article Google Scholar
Fu, L. and E. H. Shortliffe. “The Application of Certainty Factors to Neural Computing for Rule Discovery,” IEEE Transactions on Neural Networks, 11, 3, 2000, 647–657.
Article Google Scholar
Gerritsen, R., “Assessing Loan Risks: A Data Mining Case Study,” IT Professional, 1, 6, 1999, 16–21.
Article Google Scholar
Han, J. and Y. Fu, “Mining Multiple-Level Association Rules in Large Databases,” IEEE Transactions on Knowledge and Data Engineering, 11, 5, 1999, 798–805.
Article Google Scholar
Heckerman, D., “Bayesian Networks for Data Mining,” Data Mining and Knowledge Discovery, 1, 1, 1997, 79–119.
Article Google Scholar
Hui, S. C. and G. Jha, “Data Mining for Customer Service Support,” Information and Management, 38, 1, 2000, 1–13.
Article Google Scholar
Iwayama, M. and T. Tokunaga, “Cluster-based Text Categorization: A Comparison of Category Search Strategies,” Proceedings of 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, WA, July 1995, 273–281.
Google Scholar
John, G. H., P. Miller, and R. Kerber, “Stock Selection Using Rule Induction,” IEEE Expert, 11, 5, 1996, 52–58.
Article Google Scholar
Kappert, C. B. and S. W. F. Omta, “Neural Networks and Business Modeling—An Application of Neural Modeling Techniques to Prospect Profiling in the Telecommunications Industry,” Proceedings of the 30th Hawaii International Conference on System Sciences, Maui, Hawaii, 1997, 465–473.
Google Scholar
Kass, G. V., “An Exploratory Technique for Investigating Large Quantities of Categorical Data,” Applied Statistics, 29, 1980, 119–127.
Article Google Scholar
Kaufman, L. and P. J. Rousseeuw, Finding Groups in Data: An Introduction to Cluster Analysis, New York: John Wiley & Sons, Inc., 1990.
Book Google Scholar
Keim, D. A. and H. Kriegel, “Visualization Techniques for Mining Large Databases: A Comparison,” IEEE Transactions on Knowledge and Data Engineering, 8, 6, 1996, 923–927.
Article Google Scholar
Klemettinen, M., H. Mannila, and H. Toivonen, “Interactive Exploration of Interesting Findings in the Telecommunication Network Alarm Sequence Analyzer (TASA),” Information and Software Technology, 41, 9, 1999, 557–567.
Article Google Scholar
Kim, H. and S. Lee, “A Semi-Supervised Document Clustering Technique for Information Organization,” Proceedings of the 9th International Conference on Information and Knowledge Management (CIKM), McLean, VA, November 2000, 30–37.
Google Scholar
Kim, S. H. and H. J. Noh, “Predictability of Interest Rates Using Data Mining Tools: A Comparative Analysis of Korea and the US,” Expert Systems with Applications, 13, 2, 1997, 85–95.
Article Google Scholar
Kohonen, T., Self-Organization and Associative Memory, Berlin: Springer, 1989.
Book Google Scholar
Kohonen, T., Self-Organizing Maps, Berlin: Springer, 1995.
Book Google Scholar
Kukar, M., I. Kononenko, C. Groselj, K. Kralj, and J. Fettich, “Analysing and Improving the Diagnosis of Ischaemic Heart Disease with Machine Learning,” Artificial Intelligence in Medicine, 16, 1, 1999, 25–50.
Article Google Scholar
Lam, W. and C. Y. Ho, “Using A Generalized Instance set for Automatic Text Categorization,” Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melboume, Australia, August 1998, 81–89.
Google Scholar
Larkey, L. and W. Croft, “Combining Classifiers in Text Categorization,” Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Zurich, Switzerland, August 1996, 289–297.
Google Scholar
Larsen, B. and C. Aone, “Fast and Effective Text Mining Using Linear-time Document Clustering,” Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, August 1999, 16–22.
Google Scholar
LeBlanc, J., M. O. Ward, and N. Wittels, “Exploring N-Dimensional Databases,” Proceedings of Visualization, San Francisco, CA, 1990, 230–237.
Google Scholar
Lee, C. H., Y. H. Kim, and P. K. Rhee, “Web Personalization Expert with Combining Collaborative Filtering and Association Rule Mining Technique,” Expert Systems with Applications, 21, 3, 2001, 131–137.
Article Google Scholar
Lee, H. Y. and H. L. Ong, “Visualization Support for Data Mining,” IEEE Expert, 11, 5, 1996, 69–75.
Article Google Scholar
Leu, S. S., C. N. Chen, and S. L. Chang, “Data Mining for Tunnel Support Stability: Neural Network Approach,” Automation in Construction, 10, 4, 2001, 429–441.
Article Google Scholar
Leung, M. T., A. S. Chen, and H. Daouk, “Forecasting Exchange Rates Using General Regression Neural Networks,” Computers and Operations Research, 27, 11–12, 2000, 1093–1110.
Article Google Scholar
Lewis, D. and M. Ringuette, “A Comparison of Two Learning Algorithms for Text Categorization,” Proceedings of Symposium on Document Analysis and Information Retrieval, 1994.
Google Scholar
Lin, F., S. Chou, S. Pan, and Y. Chen, “Mining Time Dependency Patterns in Clinical Pathways,” International Journal of Medical Informatics, 62, 1, 2001, 11–25.
Article Google Scholar
Lin, F. Y. and S. McClean, “A Data Mining Approach to the Prediction of Corporate Failure,” Knowledge-Based Systems, 14, 3–4, 2001, 189–195.
Article Google Scholar
Luchetta, A., S. Manetti, and F. Francini, “Forecast: A Neural System for Diagnosis and Control of Highway Surfaces,” IEEE Intelligent Systems, 13, 3, 1998, 20–26.
Article Google Scholar
Mannila, H., H. Toivonen, and A. I. Verkamo, “Discovering Frequent Episodes in Sequences,” Proceedings of First International Conference on Knowledge Discovery and Data Mining (KDD’95), Montreal, Canada, August 1995, 210–215.
Google Scholar
Mannila, H. and H. Toivonen, “Discovering Generalized Episodes Using Minimal Occurrences,” Proceedings of Second International Conference on KnowledgeDiscovery and Data Mining, Portland, Oregon, August 2–4, 1996.
Google Scholar
Marble, R. P. and J. C. Healy, “A Neural Network Approach to the Diagnosis of Morbidity Outcomes in Trauma Care,” Artificial Intelligence in Medicine, 15, 3, 1999, 299–307.
Article Google Scholar
McCallum, A. K. and K. Nigam, “A Comparison of Event Models for Naïve Bayes Text Classification,” Proceedings of AAAI-98 Workshop on Learning for Text Categorization, 1998.
Google Scholar
Mingers, J., “An Empirical Comparison of Selection Measures for Decision-Tree Induction,” Machine Learning, 3, 1989a, 319–341.
Google Scholar
Mingers, J., “An Empirical Comparison of Pruning Methods for Decision Tree Induction,” Machine Learning, 4, 2, 1989b, 227–243.
Article Google Scholar
Murthy, S. K., S. Kasif, and S. Salzberg, “A System for Induction of Oblique Decision Trees,” Journal of Artificial Intelligence Research, 2, 1994, 1–32.
Google Scholar
Ng, H. T., W. B. Goh, and K. L. Low, “Feature Selection, Perceptron Learning, and A Usability Case Study for Text Categorization,” Proceedings of Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ′97), Philadelphia, PA, July 1997, 67–73.
Google Scholar
Ng, R. and J. Han, “Efficient and Effective Clustering Methods for Spatial Data Mining,” Proceedings of International Conference on Very Large Data Bases, Santiago, Chile, Sept. 1994, 144–155.
Google Scholar
Niblett, T. and I. Bratko, “Learning Decision Rules in Noisy Domains,” Research and Development in Expert Systems III: Proceedings of the 6th Technical Conference of the British Computer Society Specialist Group on Expert Systems, Brignton, December 1986, 25–34.
Google Scholar
Pickett, R. M. and G. G. Grinstein, “Iconographics Displays for Visualizing Multidimensional Data,” Proceedings of IEEE Conference on Systems, Man and Cybernetics, 1988, 514–519.
Google Scholar
Quinlan, J. R., “Induction of Decision Trees,” Machine Learning , 1, 1, 1986, 81106.
Google Scholar
Quinlan, J. R., C4.5: Programs for Machine Learning, San Mateo, C A: Morgan Kaufmann, 1993.
Google Scholar
Ramaswamy, S., S. Mahajan, and A. Silberschatz, “On the Discovery of Interesting Patterns in Association Rules,” Proceedings of the 24th International Conference on Very Large Data Bases, 1998.
Google Scholar
Ronco, A. L., “Use of Artificial Neural Networks in Modeling Associations of Discriminant Factors: Towards An Intelligent Selective Breast Cancer Screening,” Artificial Intelligence in Medicine, 16, 3, 1999, 299–309.
Article Google Scholar
Roussinov, D. and H. Chen, “Document Clustering for Electronic Meetings: An Experimental Comparison of Two Techniques,” Decision Support Systems, 27, 1–2, 1999, 67–79.
Article Google Scholar
Rumelhart, D. E., G. E. Hinton, and R. J. Williams, “Learning Internal Representations by Error Propagation,” in Rumelhart, D.E. and McClelland J.L. (eds.), Parallel Distributed Processing: Explorations in the Microstructures of Cognition, Vol. 1, Cambridge, MA: MIT Press, 1986, 318–362.
Google Scholar
Shaw, M. J., C. Subramaniam, G. W. Tan, and M. E. Welge, “Knowledge Management and Data Mining for Marketing,” Decision Support Systems, 31, 1, 2001, 127–137.
Article Google Scholar
Song, H. S., J. K. Kim, and S. H. Kim, “Mining the Change of Customer Behavior in An Internet Shopping Mall,” Expert Systems with Applications, 21, 3, 2001, 157–168.
Article Google Scholar
Srikant, R. and R. Agrawal, “Mining Generalized Association Rules,” Future Generation Computer Systems, 13, 2–3, 1997, 161–180.
Article Google Scholar
Srikant, R. and R. Agrawal, “Mining Sequential Patterns: Generalizations and Performance Improvements,” Proceedings of the 5th International Conference on Extending Database Technology (EDBT), Avignon, France, March 1996, 3–17.
Google Scholar
Tickle, A. B., R. Andrews, M. Golea, and J. Diederich, “The Truth Will Come to Light: Directions and Challenges in Extracting the Knowledge Embedded Within Trained Artificial Neural Networks,” IEEE Transactions on Neural Networks, 9, 6, 1998, 1057–1068.
Article Google Scholar
Toivonen, H., “Sampling Large Databases for Association Rules,” Proceedings of the 22nd International Conference on Very Large Data Bases, Bombay, India, 1996, 134–145.
Google Scholar
Tufte, E. R., The Visual Display of Quantitative Information, Cheshire, CT: Graphics Press, 1983.
Google Scholar
Voorhees, E. M., “Implementing Agglomerative Hierarchical Clustering Algorithms for Use in Document Retrieval,” Information Processing and Management, 22, 1986, 465–476.
Article Google Scholar
Walczak, S. and J. E. Scharf, “Reducing Surgical Patient Costs Through Use of An Artificial Neural Network to Predict Transfusion Requirements,” Decision Support Systems, 30, 2, 2000, 125–138.
Article Google Scholar
Walley, W. J. and M. A. O’Connor, “Unsupervised Pattern Recognition for the Interpretation of Ecological Data,” Ecological Modelling, 146, 1–3, 2001, 219–230.
Article Google Scholar
Wei, C., Y. H. Lee and C. M. Hsu, “Empirical Comparison of Fast Clustering Algorithms for Large Data Sets,” Proceedings of 33rd Hawaii International Conference on System Sciences, Maui, Hawaii, January 2000a.
Google Scholar
Wei, C., S. Y. Hwang, and W. S. Yang, “Mining Frequent Temporal Patterns in Process Databases,” Proceedings of 10th Workshop on Information Technologies and Systems (WITS 2000), Brisbane, Australia, December 2000b, 175–180.
Google Scholar
Wei, C. and Y. H. Lee, “Event Detection for Supporting Environmental Scanning: An Information Extraction-based Approach,” Proceedings of 5th Pacific Asia Conference on Information Systems (PACIS), Seoul, Korea, June 2001.
Google Scholar
Wei, J., S. Bressan, and B. C. Ooi, “Mining Term Association Rules for Automatic Global Query Expansion: Methodology and Preliminary Results,” Proceedings of the First International Conference on Web Information Systems Engineering, 2000c, 366–373.
Google Scholar
Weiss, S. M., C. Apte, F. J. Damerau, D. E. Johnson, F. J. Oles, T. Goetz, and T. Hampp, “Maximizing Text-Mining Performance,” IEEE Intelligent Systems, 14, 4, 1999, 63–69.
Article Google Scholar
West, D., “Neural Network Credit Scoring Models,” Computers and Operations Research, 27, 11–12, 2000, 1131–1152.
Article Google Scholar
Wiener, W., J. O. Pedersen, and A. S. Weigend, “A Neural Network Approach to Topic Spotting,” Proceedings of the 4th Annual Symposium on Document Analysis and Information Retrieval (SDAIR ′95), Las Vegas, NV, 1995, 317332.
Google Scholar
Wright, W., “Business Visualization Applications,” IEEE Computer Graphics and Applications, 17, 4, 1997, 66–70.
Article Google Scholar
Xu, J. and W. B. Croft, “Query Expansion Using Local and Global Document Analysis,” Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1996, 4–11.
Google Scholar
Yang, Y., “Expert Network: Effective and Efficient Learning from Human Decisions in Text Categorization and Retrieval,” Proceedings of the 17th International ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, Ireland, July 1994, 13–22.
Google Scholar
Yang, Y., J. G. Carbonell, R. D. Brown, T. Pierce, B. T. Archibald, and X. Liu, “Learning Approaches for Detecting and Tracking News Events,” IEEE Intelligent Systems, 14, 4, 1999, 32–43.
Article Google Scholar
Yang, Y. and C. G. Chute, “An Example-Based Mapping Method for Text Categorization and Retrieval,” ACM Transactions on Information Systems, 12, 3, 1994, 252–277.
Article Google Scholar
Yang, Y. and X. Liu, “A Re-examination of Text Categorization Methods,” Proceedings of 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Berkeley, CA, August 1999, 42–49.
Google Scholar
Yang, Y., T. Pierce, and J. G. Carbonell, “A Study on Retrospective and On-line Event Detection,” Proceedings of 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1998, 28–36.
Google Scholar
Zaki, M., S. Parthasarathy, W. Li, and M. Ogihara, “Evaluation of Sampling for Data Mining of Association Rules,” Proceedings of the 7th Workshop on Research Issues in Data Engineering, Birmingham, UK, 1997, 42–50.
Google Scholar
Zaki, M., S. Parthasarathy, M. Ogihara, and W. Li, “New Algorithms for Fast Discovery of Association Rules,” Technical Report 651, Computer Science Department, University of Rochester, 1998.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information Management, National Sun Yat-Sen University, Kaohsiung, Taiwan
Chih-Ping Wei
Department of Decision and Information Sciences, University of Florida, Gainesville, FL, USA
Selwyn Piramuthu
Department of Business Administration and Beckman Institute for Advanced Sciences and Technology, University of Illinois, Urbana, IL, USA
Michael J. Shaw

Authors

Chih-Ping Wei
View author publications
You can also search for this author in PubMed Google Scholar
Selwyn Piramuthu
View author publications
You can also search for this author in PubMed Google Scholar
Michael J. Shaw
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Rosenthal Endowed Chair in Management Information Systems, University of Kentucky, 425B Gatton Building of Business and Economics, 40506-0034, Lexington, KY, USA
Clyde W. Holsapple

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Wei, CP., Piramuthu, S., Shaw, M.J. (2003). Knowledge Discovery and Data Mining. In: Holsapple, C.W. (eds) Handbook on Knowledge Management. International Handbooks on Information Systems, vol 2. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24748-7_9

Download citation

DOI: https://doi.org/10.1007/978-3-540-24748-7_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20019-2
Online ISBN: 978-3-540-24748-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics