Knowledge and Information Systems

, Volume 39, Issue 1, pp 153–173 | Cite as

Modeling and exploiting tag relevance for Web service mining

  • Liang Chen
  • Jian Wu
  • Zibin Zheng
  • Michael R. Lyu
  • Zhaohui Wu
Regular Paper


Web service tags, i.e., terms annotated by users to describe the functionality or other aspects of Web services, are being treated as collective user knowledge for Web service mining. Since user tagging is inherently uncontrolled, ambiguous, and overly personalized, a critical and fundamental problem is how to measure the relevance of a user-contributed tag with respect to the functionality of the annotated Web service. In this paper, we propose a hybrid mechanism by using Web Service Description Language documents and service-tag network information to compute the relevance scores of tags by employing semantic computation and Hyperlink-Induced Topic Search model, respectively. Further, we introduce tag relevance measurement mechanism into three applications of Web service mining: (1) Web service clustering; (2) Web service tag recommendation; and (3) tag-based Web service retrieval. To evaluate the accuracy of tag relevance measurement and its impact to Web service mining, experiments are implemented based on Titan which is a Web service search engine constructed based on 15,968 real Web services. Comprehensive experiments demonstrate the effectiveness of the proposed tag relevance measurement mechanism and its active promotion to the usage of tagging data in Web service mining.


Web service Tag Relevance Service clustering HITS 



This research was partially supported by the National Technology Support Program under Grant No. 2011BAH16B04, the National Natural Science Foundation of China under Grant No. 61173176, National High-Tech Research and Development Plan of China under Grant No. 2013AA01A604, the Shenzhen Basic Research Program (Project No. JCYJ20120619153834216, JC201104220300A), National Key Science and Technology Research Program of China (2009ZX01043-003- 003), and the Research Grants Council of the Hong Kong Special Administrative Region, China (Project No. CUHK 415311 of General Research Fund).


  1. 1.
    George Z, Athman B (2010) Web service mining. Springer, BerlinMATHGoogle Scholar
  2. 2.
    Ames M, Naaman M (2007) Why we tag: motivations for annotation in mobile and online media. In: Proceedings of the SIGCHI conference on human factors in computing systems (CHI), pp 71–980Google Scholar
  3. 3.
    Sigurbjrnsson B, van Zwol R (2008) Flickr tag recommendation based on collective knowledge. In: Proceedings of the 17th international conference on World Wide Web (WWW), pp 327–336Google Scholar
  4. 4.
    Chen L, Hu L, Zheng Z, Wu J (2011) Wtcluster: utilizing tags for web services clustering. In: Proceedings of the 9th international conference on service oriented computing (ICSOC), pp 204–218Google Scholar
  5. 5.
    Wu J, Chen L, Zheng Z, Lyu MR, Wu Z (2013) Clustering web services to facilitate service discovery. Int J Knowl Inf Syst (KAIS), p. to appearGoogle Scholar
  6. 6.
    Averbakh A, Krause D, Skoutas D (2009) Exploiting user feedback to improve semantic web service discovery. In: International semantic web conference (ISWC), pp 33–48Google Scholar
  7. 7.
    Hou J, Zhang J, Nayak R, Bose A (2012) Semantics-based web service discovery using information retrieval techniques. In: Comparative evaluation of focused retrieval, pp 336–346Google Scholar
  8. 8.
    Bouillet E, Feblowitz M, Feng H, Liu Z, Ranganathan A, Riabov A (2008) A folksonomy-based model of web services for discovery and automatic composition. In: IEEE international conference on services computing, pp 389–396Google Scholar
  9. 9.
    Kennedy LS, Chang S-F, Kozintsev IV (2006), To search or to label?: predicting the performance of search-based automatic image classifiers. In: Proceedings of the 8th ACM international workshop on multimedia information retrieval, pp 249–258Google Scholar
  10. 10.
    Chen L, Zheng Z, Feng Y, Wu J, Lyu MR (2012), Wstrank: ranking tags to facilitate web service mining. In: Proceedings of the 10th international conference on service oriented computing, pp 574–581Google Scholar
  11. 11.
    Li X, Snoek CG, Worring M (2008), Learning tag relevance by neighbor voting for social image retrieval. In: Proceedings of the 1st ACM international conference on multimedia information retrieval, pp 180–187Google Scholar
  12. 12.
    Wu L, Yang L, Yu N (2009) Learning to tag. In: Proceedings of the 18st international conference companion on World Wide Web (WWW), pp 361–370Google Scholar
  13. 13.
    Li L, Shang Y, Zhang W (2002) Improvement of hits-based algorithms on web documents. In: Proceedings of the 11th international World Wide Web conference, pp 527–535Google Scholar
  14. 14.
    Dong X, Halevy A, Madhavan J, Nemes E, Zhang J (2004) Similarity search for web services. In: International conference on very large databases, pp 372–383Google Scholar
  15. 15.
    Liu W, Wong W (2009) Web service clustering using text mining techniques. Int J Agent-Oriented Softw Eng 3(1):6–26CrossRefGoogle Scholar
  16. 16.
    Elgazzar K, Hassan AE, Martin P (2009) Clustering wsdl documents to bootstrap the discovery of web services. In: International conference on web services, pp 147–154Google Scholar
  17. 17.
    Chukmol U, Benharkat A-N, Amghar Y (2011) Bringing socialized semantics into web services based on user-centric collaborative tagging and usage experience. In: IEEE Asia-Pacific services computing conference, pp 450–455Google Scholar
  18. 18.
    Azmeh Z, My Falleri J-R, Huchard M, Tibermacine C (2011) Automatic web service tagging using machine learning and wordnet synsets. Lecture notes in business information processing 75(1):46–59Google Scholar
  19. 19.
    Fang L, Wang L, Li M, Zhao J, Zou Y, Shao L (2012) Towards automatic tagging for web services. In: 19th IEEE international conference on web services, pp 528–535Google Scholar
  20. 20.
    Katakis I, Pallis G, Dikaiakos MD, Onoufriou O (2012), Automated tagging for the retrieval of software resources in grid and cloud infrastructures In: 12th IEEE/ACM international symposium on cluster, cloud and grid computing, pp 628–635Google Scholar
  21. 21.
    Ding Z, Lei D, Yan J, Bin Z, Lun A (2010) A web service discovery method based on tag. In: International conference on complex, intelligent and software intensive systems, pp 404–408Google Scholar
  22. 22.
    Fernandez A, Hayes C, Loutas N, Peristeras V (2008) Closing the service discovery gap by collaborative tagging and clustering techniques. In: International workshop on service matchmaking and resource retrieval in the Semantic Web, pp 115–128Google Scholar
  23. 23.
    Nayak R (2008) Data mining in web service discovery and monitoring. Int J Web Serv Res 5(1):62–80CrossRefGoogle Scholar
  24. 24.
    Porter MF (1980) An algorithm for suffix stripping. Program 14(3):130–137CrossRefGoogle Scholar
  25. 25.
    Church K, Gale W (1995) Inverse document frequency (idf): a measure of deviations from poisson. In: Proceedings of the ACL 3rd workshop on very large corpora, pp 121–130Google Scholar
  26. 26.
    Cilibrasi RL, Vitnyi PMB (2007) The google similarity distance. IEEE Trans Knowl Data Eng 19(3):370–383CrossRefGoogle Scholar
  27. 27.
    Kleinberg JM (1999) Hubs, authorities, and communities. ACM Comput Surv 31(4):1–3MathSciNetGoogle Scholar
  28. 28.
    Arvelin KJ, Kekalainen J (2002) Cumulated gain-based evaluation of IR techniques. ACM Trans Inf Syst 20(4):422–446CrossRefGoogle Scholar
  29. 29.
    Wu J, Chen L, Xie Y, Zheng Z (2012) Titan: a system for effective web service discovery. In: Proceedings of the 21st international conference companion on World Wide Web (WWW), pp 441–444Google Scholar

Copyright information

© Springer-Verlag London 2013

Authors and Affiliations

  • Liang Chen
    • 1
  • Jian Wu
    • 1
  • Zibin Zheng
    • 2
  • Michael R. Lyu
    • 2
  • Zhaohui Wu
    • 1
  1. 1.College of Computer ScienceZhejiang UniversityHangzhouChina
  2. 2.Department of Computer Science and EngineeringThe Chinese University of Hong KongHong KongChina

Personalised recommendations