On Designing and Composing Grid Services for Distributed Data Mining

  • Antonio Congiusta
  • Domenico Talia
  • Paolo Trunfio

Abstract

The use of computers is changing our way to make discoveries and is improving both speed and quality of the discovery processes and in some cases of the obtained results. In this scenario, future Grids can be effectively used as an environment for distributed data mining and knowledge discovery in large data sets. To utilize Grids for high-performance knowledge discovery, software tools and mechanisms are needed. To this purpose we designed a system called Knowledge Grid and we are implementing its services as Grid Services. This chapter describes the design and composition of distributed knowledge discovery services, according to the OGSA model, by using the Knowledge Grid environment. We present Grid Services for searching Grid resources, composing software and data elements, and executing the resulting data mining application on a Grid.

Keywords

distributed data mining Grid services OGSA WSRF 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. [1 ]
    M. Cannataro and D. Talia. The Knowledge Grid. Communitations of the ACM. 46(1):89–93, 2003.CrossRefGoogle Scholar
  2. [2]
    M. Cannataro, A. Congiusta, D. Talia and P. Trunfio. A Data Mining Toolset for Distributed High-Performance Platforms. Proc. 3rd Int. Conference Data Mining 2002, WIT Press, Bologna, Italy, pp. 41–50, 2002.Google Scholar
  3. [3]
    M. Cannataro, A. Congiusta, A. Pugliese, D. Talia and P. Trunfio. Distributed Data Mining on Grids: Services, Tools, and Applications. IEEE Transactions on Systems, Man, and Cybernetics, Part B. 34(6):2451–2465, 2004.CrossRefGoogle Scholar
  4. [4]
    The European Molecular Biology Laboratory. The Swiss-Prot protein database. http://www.embl-heidelberg.de.Google Scholar
  5. [5]
    G. Bueti, A. Congiusta and D. Talia. Developing Distributed Data Mining Applications in the KNOWLEDGE GRID Framework. Proc. Sixth International Meeting on High Performance Computing for Computational Science (VECPAR’04), Valencia, Spain, 2004.Google Scholar
  6. [6]
    M. Cannataro, C. Comito, A. Congiusta and P. Veltri. PROTEUS: a Bioinformatics Problem Solving Environment on Grids. Parallel Processing Letters. 14(2):217–237, 2004.MathSciNetCrossRefGoogle Scholar
  7. [7]
    K. Channabasavaiah, K. Holley and E.M. Tuggle. Migrating to a service-oriented architecture. 2003. http://www-106.ibm.com/developerworks/library/ws-migratesoaGoogle Scholar
  8. [8]
    I. Foster, C. Kesselman, J. Nick and S. Tuecke. The Physiology of the Grid. In: F. Berman, G. Fox and A. Hey (eds.), Grid Computing: Making the Global Infrastructure a Reality, Wiley, pages 217–249, 2003.Google Scholar
  9. [9]
    D. Box et al. Simple Object Access Protocol (SOAP) 1.1, W3C Note 08 May 2000. http://www.w3.org/TR/2000/NOTE-SOAP-20000508.Google Scholar
  10. [10]
    E. Christensen, F. Curbera, G. Meredith and S. Weerawarana. Web Services Description Language (WSDL) 1.1, W3C Note 15 March 2001. http://www.w3.org/TR/2001/NOTE-wsdl-20010315.Google Scholar
  11. [11]
    I. Foster, C. Kesselman, J.M. Nick and S. Tuecke. Grid Services for Distributed System Integration. IEEE Computer. 35(6):37–46, 2002.Google Scholar
  12. [12]
    S. Tuecke et al. Open Grid Services Infrastructure (OGSI) Version 1.0. http://www-unix.globus.org/toolkit/draft-ggf-ogsi-gridservice-33_2003-06-27.eps.Google Scholar
  13. [13]
    The Global Grid Forum (GGF). http://www.ggf.org.Google Scholar
  14. [14]
    K. Czajkowski et al. The WS-Resource Framework Version 1.0. http://www-106.ibm.com/developerworks/library/ws-resource/ws-wsrf.eps.Google Scholar
  15. [15]
    D. Box et al. Web Services Addressing (WS-Addressing), W3C Member Submission 10 August 2004. http://www.w3.org/Submission/2004/SUBM-ws-addressing-20040810.Google Scholar
  16. [16]
    K. Czajkowski et al. From Open Grid Services Infrastructure to WS-Resource Framework: Refactoring & Evolution. http://www-106.ibm.com/developerworks/library/ws-resource/ogsi.to_wsrf_1.0.eps.Google Scholar
  17. [17]
    The UCI Machine Learning Repository. http://www.ics.uci.edu/mlearn/MLRepository.html.Google Scholar

Copyright information

© Springer Science+Business Media, Inc. 2006

Authors and Affiliations

  • Antonio Congiusta
    • 1
  • Domenico Talia
    • 1
  • Paolo Trunfio
    • 1
  1. 1.DEISUniversity of CalabriaRendeItaly

Personalised recommendations