SemreX: A Semantic Peer-to-Peer System for Literature Documents Retrieval

  • Hai Jin
  • Hanhua Chen
  • Xiaomin Ning
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4185)


The decentralized structure of peer-to-peer networks, together with their self-organization and fault tolerance, makes them a promising model for information sharing. However, efficient content-based search remains a serious challenge in large-scale peer-to-peer networks. In this paper, we present SemreX, a peer-to-peer system for sharing literature documents. SemreX networks have two main features: 1) they provide semantically supported retrieval of literature documents, and 2) peers self-organize into a semantic overlay according to the similarity of their documents, which belong to different topics, and queries are routed to semantically similar peers to reduce the number of messages. Experimental results show that SemreX improves search efficiency for literature document retrieval in peer-to-peer networks.
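
The idea of routing a query to semantically similar peers can be illustrated with a minimal sketch. The abstract does not specify how SemreX represents peers or measures similarity, so everything here is an assumption made for illustration: each peer is summarized by a hypothetical term-weight profile, similarity is plain cosine similarity, and the names `cosine`, `route_query`, and `fanout` are invented for this example.

```python
import math


def cosine(a, b):
    """Cosine similarity between two sparse term-weight vectors (dicts)."""
    dot = sum(w * b.get(term, 0.0) for term, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def route_query(query_vec, neighbor_profiles, fanout=2):
    """Pick the `fanout` neighbors whose profiles are most similar to the query.

    Forwarding only to these neighbors, rather than flooding all of them,
    is what reduces the number of messages.
    """
    ranked = sorted(
        neighbor_profiles,
        key=lambda peer_id: cosine(query_vec, neighbor_profiles[peer_id]),
        reverse=True,
    )
    return ranked[:fanout]


# Hypothetical neighbor profiles (assumed data, not taken from the paper).
neighbors = {
    "peer_a": {"p2p": 0.9, "overlay": 0.4},
    "peer_b": {"svm": 0.8, "kernel": 0.6},
    "peer_c": {"p2p": 0.3, "retrieval": 0.9},
}
query = {"p2p": 1.0, "retrieval": 0.5}

print(route_query(query, neighbors, fanout=2))
```

In this sketch, `peer_b` shares no terms with the query and is never contacted, which is the message saving the abstract refers to. A real system would also need a way to build and update the peer profiles.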


Support Vector Machine; Semantic Similarity; Recall Rate; Distributed Hash Table; Latent Semantic Indexing
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Hai Jin (1)
  • Hanhua Chen (1)
  • Xiaomin Ning (1)
  1. Cluster and Grid Computing Lab, Huazhong University of Science and Technology, Wuhan, China
