Abstract
In recent years, new decision support system (DSS) based on the technologies of data warehouse, data mining and on-line analytical processing appeared. As the accumulated amount of data becomes enormous too much, the data quantitative problem, the data qualitative problem and the data presentation problem occur in data mining in large-scale databases and data warehouses. An effective way to enhance the power and flexibility of data mining in data warehouses and large databases is to integrate data mining with OLAP in DSS. Parallel and distributed processing are also two important components of successful large-scale data mining applications. In this paper, a high performance data mining scheme is proposed. The overall architecture and the mechanism of the system are described.
Chapter PDF
References
M. S. Scott Morton, Management Decision Systems: Computer Based Support for Decision Making, Boston, Division of Research, Graduate School of Business Administration, Harvard University, 1971.
S. Alter, Decision Support Systems: Current Practice and Continuing Challenges, Addison-Wesley, Reading, MA, 1980.
F. Jane and B. W. Gerald, Representing Modeling Knowledge in an Intelligent Decision Support System, Decision Support Systems, 1986.
M. S. Y. Wang and James F. Courtney, JR., A Conceptual Architecture for Generalized Decision Support System Software, IEEE Transaction on S.M.C., Vol. Smc-14, 1984.
LIU Zhen, WANG Shuwen and JIANG Zhanhua, Research on Decision Support System Architecture, Journal of Jilin University of Technology, No. 3, 1994.
W. H. Inmon, Building the Data Warehouse, New York: John Wiley & Sons, 1993.
W. H. Inmon, Claudia Imhoff and Ryan Sousa, Corporate Information Factory, Wiley & Sons, 1997.
W. H. Inmon and R. H. Terdeman, Claudia Imhoff, Exploration Warehousing: Turning Business Information into Business Opportunity, Wiley, 2000.
U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press, 1996.
M.S. Chen, J. Han, and P.S. Yu. Data mining: An overview from a database perspective, IEEE Transactions on Knowledge and Data Engineering, 1996.
Frawley, W., Piatetsky-Shapiro, G., and Matheus, C, Knowledge Discovery in Databases: An Overview, Knowledge Discovery in Databases, eds. G. Piatetsky-Shapiro and W. Frawley, 1–27, Cambridge, Mass.: AAAI Press / The MIT Press, 1991.
E. F. Codd, E. S. Codd and C. T. Salley, Beyond Decision Support, Computer-world, Vol. 27, No. 30, July 1993.
Qing Chen, Mining Exceptions and Quantitative Association Rules in Olap Data Cube, IEEE Transactions on Knowledge and Data Engineering, 1999.
Y. Chang, et al., An object Transaction Service Based on the CORBA Architecture, International Conference on Distributed Platforms, Dresden, 1996.
R. Hubert, Distributed Object Technology in EDS, International Conference on distributed Platforms, Dresden, 1996. 1
C.C. Fabris and A.A. Freitas. Incorporating deviation-detection functionality into the OLAP paradigm. Proc. XVI Brazilian Symp. on Databases (SBBD-2001), pp. 274–285. Rio de Janeiro, Brazil, 2001.
Masato Oguchi, Masaru Kitsuregawa, Data Mining on PC Cluster connected with Storage Area Network: Its Preliminary Experimental Results, IEEE International Conference on Communications, Helsinki, Finland, 2001.
Masato Oguchi and Masaru Kitsuregawa, Using Available Remote Memory Dynamically for Parallel Data Mining Application on ATM-Connected PC Cluster, Proc. of the International Parallel and Distributed Processing Symposium, IEEE Computer Society, 2000
C.C. Bojarczuk, H.S. Lopes, A.A. Freitas. Genetic programming for knowledge discovery in chest pain diagnosis. IEEE Engineering in Medicine and Biology magazine-special issue on data mining and knowledge discovery, 19(4), July/Aug. 2000.
D.L.A. Araujo, H.S. Lopes, A.A. Freitas. A parallel genetic algorithm for rule discovery in large databases. Proc. 1999 IEEE Systems, Man and Cybernetics Conf., v. III, Tokyo, Oct. 1999.
Mohammed J. Zaki, Parallel Sequence Mining on Shared-Memory Machines, Journal of Parallel and Distributed Computing, 61, 2001.
Jeffrey P. Bradford and Jose A. B. Fortes, Characterization and Parallelization of Decision-Tree Induction, Journal of Parallel and Distributed Computing, 61, 2001
Diane J. Cook, Lawrence B. Holder, Gehad Galal, and Ron Maglothin, Approched to Parallel Graph-Based Knowledge Discovery, Journal of Parallel and Distributed Computing, 61, 2001.
Sanjay Goil, PARSIMONY: An Infrastructure for Paralle Multidimensional Analysis and Data Mining, Journal of Parallel and Distributed Computing, 61, 2001.
Liu Zhen and Guo Minyi, A Proposal of Integrating Data Mining and On-Line Analytical Processing in Data Warehouse, Proceedings of 2001 International Conferences on Info-tech and Info-net, 2001.
A.A. Freitas, Generic, Set-Oriented Primitives to Support Data-Parallel Knowledge Discovery in Relational Databases Systems, thesis of the doctoral degree, Department of Computer Science, University of Essex, 1997.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, Z., Guo, M. (2002). A Proposal of High Performance Data Mining System. In: Fagerholm, J., Haataja, J., Järvinen, J., Lyly, M., Råback, P., Savolainen, V. (eds) Applied Parallel Computing. PARA 2002. Lecture Notes in Computer Science, vol 2367. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48051-X_12
Download citation
DOI: https://doi.org/10.1007/3-540-48051-X_12
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43786-4
Online ISBN: 978-3-540-48051-8
eBook Packages: Springer Book Archive