A Requirements Analysis for Parallel KDD Systems

  • William A. Maniatty
  • Mohammed J. Zaki
Conference paper

DOI: 10.1007/3-540-45591-4_47

Part of the Lecture Notes in Computer Science book series (LNCS, volume 1800)
Cite this paper as:
Maniatty W.A., Zaki M.J. (2000) A Requirements Analysis for Parallel KDD Systems. In: Rolim J. (eds) Parallel and Distributed Processing. IPDPS 2000. Lecture Notes in Computer Science, vol 1800. Springer, Berlin, Heidelberg

Abstract

The current generation of data mining tools have limited capacity and performance, since these tools tend to be sequential. This paper explores a migration path out of this bottleneck by considering an integrated hardware and software approach to parallelize data mining. Our analysis shows that parallel data mining solutions require the following components: parallel data mining algorithms, parallel and distributed data bases, parallel file systems, parallel I/O, tertiary storage, management of online data, support for heterogeneous data representations, security, quality of service and pricing metrics. State of the art technology in these areas is surveyed with an eye towards an integration strategy leading to a complete solution.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2000

Authors and Affiliations

  • William A. Maniatty
    • 1
  • Mohammed J. Zaki
    • 2
  1. 1.Computer Science Dept.University at AlbanyAlbany
  2. 2.Computer Science Dept.Rensselaer Polytechnic InstituteTroy

Personalised recommendations