On Designing and Composing Grid Services for Distributed Data Mining

  • Antonio Congiusta
  • Domenico Talia
  • Paolo Trunfio


The use of computers is changing our way to make discoveries and is improving both speed and quality of the discovery processes and in some cases of the obtained results. In this scenario, future Grids can be effectively used as an environment for distributed data mining and knowledge discovery in large data sets. To utilize Grids for high-performance knowledge discovery, software tools and mechanisms are needed. To this purpose we designed a system called Knowledge Grid and we are implementing its services as Grid Services. This chapter describes the design and composition of distributed knowledge discovery services, according to the OGSA model, by using the Knowledge Grid environment. We present Grid Services for searching Grid resources, composing software and data elements, and executing the resulting data mining application on a Grid.


distributed data mining Grid services OGSA WSRF 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. [1 ]
    M. Cannataro and D. Talia. The Knowledge Grid. Communitations of the ACM. 46(1):89–93, 2003.CrossRefGoogle Scholar
  2. [2]
    M. Cannataro, A. Congiusta, D. Talia and P. Trunfio. A Data Mining Toolset for Distributed High-Performance Platforms. Proc. 3rd Int. Conference Data Mining 2002, WIT Press, Bologna, Italy, pp. 41–50, 2002.Google Scholar
  3. [3]
    M. Cannataro, A. Congiusta, A. Pugliese, D. Talia and P. Trunfio. Distributed Data Mining on Grids: Services, Tools, and Applications. IEEE Transactions on Systems, Man, and Cybernetics, Part B. 34(6):2451–2465, 2004.CrossRefGoogle Scholar
  4. [4]
    The European Molecular Biology Laboratory. The Swiss-Prot protein database. Scholar
  5. [5]
    G. Bueti, A. Congiusta and D. Talia. Developing Distributed Data Mining Applications in the KNOWLEDGE GRID Framework. Proc. Sixth International Meeting on High Performance Computing for Computational Science (VECPAR’04), Valencia, Spain, 2004.Google Scholar
  6. [6]
    M. Cannataro, C. Comito, A. Congiusta and P. Veltri. PROTEUS: a Bioinformatics Problem Solving Environment on Grids. Parallel Processing Letters. 14(2):217–237, 2004.MathSciNetCrossRefGoogle Scholar
  7. [7]
    K. Channabasavaiah, K. Holley and E.M. Tuggle. Migrating to a service-oriented architecture. 2003. Scholar
  8. [8]
    I. Foster, C. Kesselman, J. Nick and S. Tuecke. The Physiology of the Grid. In: F. Berman, G. Fox and A. Hey (eds.), Grid Computing: Making the Global Infrastructure a Reality, Wiley, pages 217–249, 2003.Google Scholar
  9. [9]
    D. Box et al. Simple Object Access Protocol (SOAP) 1.1, W3C Note 08 May 2000. Scholar
  10. [10]
    E. Christensen, F. Curbera, G. Meredith and S. Weerawarana. Web Services Description Language (WSDL) 1.1, W3C Note 15 March 2001. Scholar
  11. [11]
    I. Foster, C. Kesselman, J.M. Nick and S. Tuecke. Grid Services for Distributed System Integration. IEEE Computer. 35(6):37–46, 2002.Google Scholar
  12. [12]
    S. Tuecke et al. Open Grid Services Infrastructure (OGSI) Version 1.0. Scholar
  13. [13]
    The Global Grid Forum (GGF). Scholar
  14. [14]
    K. Czajkowski et al. The WS-Resource Framework Version 1.0. Scholar
  15. [15]
    D. Box et al. Web Services Addressing (WS-Addressing), W3C Member Submission 10 August 2004. Scholar
  16. [16]
    K. Czajkowski et al. From Open Grid Services Infrastructure to WS-Resource Framework: Refactoring & Evolution. Scholar
  17. [17]
    The UCI Machine Learning Repository. Scholar

Copyright information

© Springer Science+Business Media, Inc. 2006

Authors and Affiliations

  • Antonio Congiusta
    • 1
  • Domenico Talia
    • 1
  • Paolo Trunfio
    • 1
  1. 1.DEISUniversity of CalabriaRendeItaly

Personalised recommendations