A Proposal of High Performance Data Mining System

Liu, Zhen; Guo, Minyi

doi:10.1007/3-540-48051-X_12

A Proposal of High Performance Data Mining System

Zhen Liu⁵ &
Minyi Guo⁶

Conference paper
First Online: 01 January 2002

489 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2367))

Abstract

In recent years, new decision support system (DSS) based on the technologies of data warehouse, data mining and on-line analytical processing appeared. As the accumulated amount of data becomes enormous too much, the data quantitative problem, the data qualitative problem and the data presentation problem occur in data mining in large-scale databases and data warehouses. An effective way to enhance the power and flexibility of data mining in data warehouses and large databases is to integrate data mining with OLAP in DSS. Parallel and distributed processing are also two important components of successful large-scale data mining applications. In this paper, a high performance data mining scheme is proposed. The overall architecture and the mechanism of the system are described.

Download to read the full chapter text

Chapter PDF

References

M. S. Scott Morton, Management Decision Systems: Computer Based Support for Decision Making, Boston, Division of Research, Graduate School of Business Administration, Harvard University, 1971.
Google Scholar
S. Alter, Decision Support Systems: Current Practice and Continuing Challenges, Addison-Wesley, Reading, MA, 1980.
Google Scholar
F. Jane and B. W. Gerald, Representing Modeling Knowledge in an Intelligent Decision Support System, Decision Support Systems, 1986.
Google Scholar
M. S. Y. Wang and James F. Courtney, JR., A Conceptual Architecture for Generalized Decision Support System Software, IEEE Transaction on S.M.C., Vol. Smc-14, 1984.
Google Scholar
LIU Zhen, WANG Shuwen and JIANG Zhanhua, Research on Decision Support System Architecture, Journal of Jilin University of Technology, No. 3, 1994.
Google Scholar
W. H. Inmon, Building the Data Warehouse, New York: John Wiley & Sons, 1993.
Google Scholar
W. H. Inmon, Claudia Imhoff and Ryan Sousa, Corporate Information Factory, Wiley & Sons, 1997.
Google Scholar
W. H. Inmon and R. H. Terdeman, Claudia Imhoff, Exploration Warehousing: Turning Business Information into Business Opportunity, Wiley, 2000.
Google Scholar
U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press, 1996.
Google Scholar
M.S. Chen, J. Han, and P.S. Yu. Data mining: An overview from a database perspective, IEEE Transactions on Knowledge and Data Engineering, 1996.
Google Scholar
Frawley, W., Piatetsky-Shapiro, G., and Matheus, C, Knowledge Discovery in Databases: An Overview, Knowledge Discovery in Databases, eds. G. Piatetsky-Shapiro and W. Frawley, 1–27, Cambridge, Mass.: AAAI Press / The MIT Press, 1991.
Google Scholar
E. F. Codd, E. S. Codd and C. T. Salley, Beyond Decision Support, Computer-world, Vol. 27, No. 30, July 1993.
Google Scholar
Qing Chen, Mining Exceptions and Quantitative Association Rules in Olap Data Cube, IEEE Transactions on Knowledge and Data Engineering, 1999.
Google Scholar
Y. Chang, et al., An object Transaction Service Based on the CORBA Architecture, International Conference on Distributed Platforms, Dresden, 1996.
Google Scholar
R. Hubert, Distributed Object Technology in EDS, International Conference on distributed Platforms, Dresden, 1996. 1
Google Scholar
C.C. Fabris and A.A. Freitas. Incorporating deviation-detection functionality into the OLAP paradigm. Proc. XVI Brazilian Symp. on Databases (SBBD-2001), pp. 274–285. Rio de Janeiro, Brazil, 2001.
Google Scholar
Masato Oguchi, Masaru Kitsuregawa, Data Mining on PC Cluster connected with Storage Area Network: Its Preliminary Experimental Results, IEEE International Conference on Communications, Helsinki, Finland, 2001.
Google Scholar
Masato Oguchi and Masaru Kitsuregawa, Using Available Remote Memory Dynamically for Parallel Data Mining Application on ATM-Connected PC Cluster, Proc. of the International Parallel and Distributed Processing Symposium, IEEE Computer Society, 2000
Google Scholar
C.C. Bojarczuk, H.S. Lopes, A.A. Freitas. Genetic programming for knowledge discovery in chest pain diagnosis. IEEE Engineering in Medicine and Biology magazine-special issue on data mining and knowledge discovery, 19(4), July/Aug. 2000.
Google Scholar
D.L.A. Araujo, H.S. Lopes, A.A. Freitas. A parallel genetic algorithm for rule discovery in large databases. Proc. 1999 IEEE Systems, Man and Cybernetics Conf., v. III, Tokyo, Oct. 1999.
Google Scholar
Mohammed J. Zaki, Parallel Sequence Mining on Shared-Memory Machines, Journal of Parallel and Distributed Computing, 61, 2001.
Google Scholar
Jeffrey P. Bradford and Jose A. B. Fortes, Characterization and Parallelization of Decision-Tree Induction, Journal of Parallel and Distributed Computing, 61, 2001
Google Scholar
Diane J. Cook, Lawrence B. Holder, Gehad Galal, and Ron Maglothin, Approched to Parallel Graph-Based Knowledge Discovery, Journal of Parallel and Distributed Computing, 61, 2001.
Google Scholar
Sanjay Goil, PARSIMONY: An Infrastructure for Paralle Multidimensional Analysis and Data Mining, Journal of Parallel and Distributed Computing, 61, 2001.
Google Scholar
Liu Zhen and Guo Minyi, A Proposal of Integrating Data Mining and On-Line Analytical Processing in Data Warehouse, Proceedings of 2001 International Conferences on Info-tech and Info-net, 2001.
Google Scholar
A.A. Freitas, Generic, Set-Oriented Primitives to Support Data-Parallel Knowledge Discovery in Relational Databases Systems, thesis of the doctoral degree, Department of Computer Science, University of Essex, 1997.
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Human Environment, Nagasaki Institute of Applied Science, 536 Aba-machi, Nagasaki, 851-0193, Japan
Zhen Liu
Department of Computer Software, The University of Aizu, Aizu Wakamatsu City, Fukushima, 965-8580, Japan
Minyi Guo

Authors

Zhen Liu
View author publications
You can also search for this author in PubMed Google Scholar
Minyi Guo
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

CSC, P.O. Box 405, 02101, Espoo, Finland
Juha Fagerholm , Juha Haataja , Jari Järvinen , Mikko Lyly , Peter Råback & Ville Savolainen , , , , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, Z., Guo, M. (2002). A Proposal of High Performance Data Mining System. In: Fagerholm, J., Haataja, J., Järvinen, J., Lyly, M., Råback, P., Savolainen, V. (eds) Applied Parallel Computing. PARA 2002. Lecture Notes in Computer Science, vol 2367. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48051-X_12

Download citation

DOI: https://doi.org/10.1007/3-540-48051-X_12
Published: 04 July 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43786-4
Online ISBN: 978-3-540-48051-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics