Hyper: A Framework for Peer-to-Peer Data Integration on Grids

Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3226)


Data Grids allow for seeing heterogeneous, distributed, and dynamic informational resources as if they were a uniform, stable, secure, and reliable database. According to this view, current proposals for data integration on Grids are based on the notion of global schema built over a collection of autonomous information sources. On the other hand, in dynamic and distributed environments, such a hierarchical and centralized architecture is not well suited for effective information integration. Peer-to-peer data integration aims at overcoming these drawbacks by modeling autonomous information systems as peers, and establishing mappings among peers without resorting to any hierarchical structure. In this paper, we present Hyper, a joint research initiative of Università di Roma “La Sapienza” and IBM Italia, which aims at developing principles and techniques for peer-to-peer data integration on a Grid infrastructure. The main contributions presented are a semantic characterization of P2P data integration, the deployment of our P2P framework on a Grid architecture, and the design of a query answering algorithm that is coherent both with the semantics and with the Grid infrastructure.


Data Integration First Order Logic Epistemic Logic Conjunctive Query Grid Infrastructure 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    W3C semantic web (2001),
  2. 2.
    The globus alliance (2004),
  3. 3.
    Open grid services architecture data access and integration (2004),
  4. 4.
    Open grid services infrastructure (2004),
  5. 5.
    The semantic grid (2004),
  6. 6.
    The ws-resource framework (2004),
  7. 7.
    Aberer, K., Cudré-Mauroux, P., Hauswirth, M., Pelt, T.V.: Gridvine: Building internet-scale semantic overlay networks. Technical Report IC/2004/38, EPFL (2004)Google Scholar
  8. 8.
    Antonioletti, M., Krause, A., Hastings, S., Langella, S., Malaika, S., Magowan, J., Laws, S., Paton, N.W.: Grid data service specification: The relational realisation. Technical report, DAIS Working Group (2003)Google Scholar
  9. 9.
    Calvanese, D., De Giacomo, G., Lenzerini, M., Rosati, R.: Logical foundations of peer-to-peer data integration. In: Proc. of the 23nd ACM SIGACT SIGMOD SIGART Symp. on Principles of Database Systems (PODS 2004) (2004) (to appear)Google Scholar
  10. 10.
    Camarinha-Matos, L., Afsarmanesh, H., Garita, C., Lima, C.: Towards an architecture for virtual enterprises. J. Intelligent Manufacturing 9(2) (1998)Google Scholar
  11. 11.
    Crespo, A., Garcia-Molina, H.: Semantic overlay networks for P2P systems. Technical report, Computer Science Department, Stanford University (2002)Google Scholar
  12. 12.
    Doan, A., Lu, Y., Lee, Y., Han, J.: Profile-based object matching for information integration. IEEE Intelligent Systems 18(5), 54–59 (2003)CrossRefGoogle Scholar
  13. 13.
    Foster, I., Kesselman, C., Nick, J., Tuecke, S.: The physiology of the grid: An open grid services architecture for distributed systems integration. In: Open Grid Service Infrastructure WG, Global Grid Forum (2002)Google Scholar
  14. 14.
    Foster, I., Kesselman, C., Tuecke, S.: The anatomy of the grid: Enabling scalable virtual organizations. International J. Supercomputer Applications 15(3) (2001)Google Scholar
  15. 15.
    Friedman, M., Levy, A., Millstein, T.: Navigational plans for data integration. In: Proc. of the 16th Nat. Conf. on Artificial Intelligence (AAAI 1999), pp. 67–73. AAAI Press/The MIT Press (1999)Google Scholar
  16. 16.
    Halevy, Y.: Answering queries using views: A survey. Very Large Database Journal 10(4), 270–294 (2001)zbMATHCrossRefGoogle Scholar
  17. 17.
    Halevy, Y., Ives, Z.G., Mork, P., Tatarinov, I.: Piazza: data management infrastructure for semantic web applications. In: Proc. of the 12th Int. World Wide Web Conference (WWW 2003), pp. 556–567 (2003)Google Scholar
  18. 18.
    Lenzerini, M.: Data integration: A theoretical perspective. In: Proc. of the 21st ACM SIGACT SIGMOD SIGART Symp. on Principles of Database Systems (PODS 2002), pp. 233–246 (2002)Google Scholar
  19. 19.
    Levesque, H.J., Lakemeyer, G.: The Logic of Knowledge Bases. MIT Press, Cambridge (2001)zbMATHGoogle Scholar
  20. 20.
    McBrien, P., Poulovassilis, A.: Distributed databases. In: Piattini, M., Diaz, O. (eds.) Advanced Database Technology and Design, Artech House, Norwood (2000)Google Scholar
  21. 21.
    Paton, N., Atkinson, M., Dialani, V., Pearson, D., Storey, T., Watson, P.: Database access and integration services on the grid. Technical Report UKeS-2002-03, UK e-Science Programme, National e-Science Centre (2002)Google Scholar
  22. 22.
    Zhou, G., Hull, R., King, R., Franchitti, J.-C.: Using object matching and materialization to integrate heterogeneous databases, pp. 4–18 (1995)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  1. 1.Faculty of Computer ScienceFree University of Bolzano/Bozen 
  2. 2.Dipartimento di Informatica e SistemisticaUniversità di Roma “La Sapienza” 
  3. 3.IBMItalia

Personalised recommendations