Semantic-Based Query Routing and Heterogeneous Data Integration in Peer-to-Peer Semantic Link Networks

  • Hai Zhuge
  • Jie Liu
  • Liang Feng
  • Chao He
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3226)


A semantic link P2P network specifies and manages semantic relationships between peers’ data schemas. The proposed approach includes a tool for constructing and maintaining P2P semantic link networks, a semantic-based peer similarity measurement approach for efficient query routing, and peer schema mapping algorithms for query reformulation and heterogeneous data integration. The advantages of the proposed approach include three aspects: First, it uses semantic links to enrich relationships between peers’ data schemas. Second, it considers not only node but also structure in measuring the similarity between schemas so as to efficiently and accurately forward queries to relevant peers. Finally, it deals with semantic heterogeneity, structural heterogeneity and data inconsistency to enable peers to exchange and translate heterogeneous information in single semantic image.


Semantic Relationship Semantic Mapping Target Schema Semantic Link Breadth First Search 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Aberer, K., Cudre-Mauroux, P., Hauswirth, M.: The Chatty Web: Emergent Semantics through Gossiping. In: WWW 2003, Budapest, Hungary (May 2003)Google Scholar
  2. 2.
  3. 3.
    Bernstein, P., et al.: Data Management for Peer-to-Peer Computing: A Vision. In: ACM SIGMOD WebDB Workshop 2002, Madison, Wisconsin (June 2002)Google Scholar
  4. 4.
    DBLP XML Database,
  5. 5.
    Deutsch, Tannen, V.: MARS: A System for Publishing XML from Mixed and Redundant Storage. In: Proceedings of the 29th VLDB Conference, Berlin, Germany (September 2003)Google Scholar
  6. 6.
    Giunchiglia, F., Zaihrayeu, I.: Making Peer Databases Interact — A Vision for an Architecture Supporting Data Coordination. In: Proc. of the Conference on Information Agents (CIA 2002), Madrid, Spain (September 2002)Google Scholar
  7. 7.
    Halevy, et al.: Schema Mediation in Peer Data Management Systems. In: Proc. of ICDE 2003, Bangalore, India (March 2003)Google Scholar
  8. 8.
    Halevy, et al.: Piazza: Data Management Infrastructure for Semantic Web Applications. In: Proc. of the Intl. WWW Conf. 2003, Budapest, Hungary (May 2003)Google Scholar
  9. 9.
    Kementsietsidis, Arenas, M., Miller, R.: Mapping Data in Peer-to-Peer Systems: Semantics and Algorithmic Issues. In: Proc. of the ACM SIGMOD International Conference on Management of Data 2003, San Diego, California (June 2003)Google Scholar
  10. 10.
    Loser, et al.: Semantic Overlay Clusters within Supper-Peer Networks. In: International Workshop on Databases, Information Systems, and P2P Computing, Berlin, Germany (September 2003)Google Scholar
  11. 11.
    Lu, S.Y.: A Tree-Matching Algorithm Based on Node Splitting and Merging. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 6(2), 249–256 (1984)zbMATHCrossRefGoogle Scholar
  12. 12.
    Madhavan, J., Bernstein, P., Rahm, E.: Generic Schema Matching with Cupid.In: Proceedings of the 27th VLDB Conference, Roma, Italy (September 2001)Google Scholar
  13. 13.
    Nejdl, W., et al.: Super-Peer-Based Routing and Clustering Strategies for RDF-Based Peer- To-Peer Networks. In:WWW2003, Budapest, Hungary (May 2003)Google Scholar
  14. 14.
    Ng, W.S., et al.: PeerDB: A P2P-Based System for Distributed Data Sharing. In: Intl. Conf. on Data Engineering (ICDE) 2003, Bangalore, India (March 2003)Google Scholar
  15. 15.
    Ooi, B., Shu, Y., Tan, K.: DB-Enabled Peers for Managing Distributed Data. In: 5th Asia- Pacific Web Conference, APWeb2003, Xian, China (April 2003)Google Scholar
  16. 16.
    Rahm, E., Bernstein, P.: A Survey of Approaches to Automatic Schema Matching. VLDB Journal 10(4), 334–350 (2001)zbMATHCrossRefGoogle Scholar
  17. 17.
    Stoica, et al.: Chord: A Scalable Peer-to-Peer Lookup Protocol for Internet Applications. IEEE/ACM Transactions on Networking 11, 17–32 (2003)CrossRefGoogle Scholar
  18. 18.
    Tatarinov, Halevy, A.: Efficient Query Reformulation in Peer-Data Management Systems. In: ACM SIGMOD 2004, Paris, France (June 2004)Google Scholar
  19. 19.
    Zhuge, H.: Active E-Document Framework ADF: Model and Tool. Information and Management 41(1), 87–97 (2003)CrossRefGoogle Scholar
  20. 20.
    Zhuge, H.: Clustering Soft-Devices in Semantic Grid. IEEE Computing in Science and Engineering 4(6), 60–63 (2002)Google Scholar
  21. 21.
    Zhuge, H.: China’s E-Science Knowledge Grid Environment. IEEE Intelligent Systems 19(1), 13–17 (2004)CrossRefGoogle Scholar
  22. 22.
    Zhuge, H.: Future Interconnection Environment — Dream, Principle, Challenge and Practice. In: Keynote at The 5th International Conference on Web-Age Information Management, Dalian, China (July 2004),
  23. 23.
    Gnutella website,
  24. 24.
    Napster website,

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Hai Zhuge
    • 1
  • Jie Liu
    • 1
    • 2
  • Liang Feng
    • 1
    • 2
  • Chao He
    • 1
    • 2
  1. 1.China Knowledge Grid Research Group, Key Lab of Intelligent Information Processing, Institute of Computing TechnologyChinese Academy of SciencesBeijingChina
  2. 2.Graduate School of the Chinese Academy of Sciences 

Personalised recommendations