Web Service Discovery Based on Information Gain Theory and BiLSTM with Attention Mechanism

  • Xiangping Zhang
  • Jianxun Liu
  • Buqing CaoEmail author
  • Qiaoxiang Xiao
  • Yiping Wen
Conference paper
Part of the Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering book series (LNICST, volume 268)


Web service discovery is an important problem in service-oriented computing with the increasing number of Web services. Clustering or classifying Web services according to their functionalities has been proved to be an effective way to Web service discovery. Recently, semantic-based Web services clustering exploits topic model to extract latent topic features of Web services description document to improve the accuracy of service clustering and discovery. However, most of them don’t consider deep and fine-grained level information of description document, such as the weight (importance) for each word or the word order. While the deep and fine-grained level information can be fully used to argument service clustering and discovery. To address this problem, we proposed a Web service discovery approach based on information gain theory and BiLSTM with attention mechanism. This method firstly obtains the effective words through information gain theory and then adds them to an attention-based BiLSTM neural network for Web service clustering. The comparative experiments are performed on ProgrammableWeb dataset, and the results show that a significant improvement is achieved for our proposed method, compared with baseline methods.


Web service clustering Mashup creation Information gain Attention layer BiLSTM 



The work was supported by the Hunan Provincial Natural Science Foundation of China under grant No. 2017JJ2098, 2017JJ4036, 2018JJ2139, 2018JJ2136, Open Foundation of State Key Laboratory of Networking and Switching Technology (Beijing University of Posts and Telecommunications) under grant No. SKLNST-2016-2-26, National Natural Science Foundation of China under grant No. 61572187, 61772193, 61702181, 61872139, 61873316, Innovation Platform Open Foundation of Hunan Provincial Education Department of China under grant No. 17K033.


  1. 1.
    Xia, B., Fan, Y., Tan, W., et al.: Category-aware API clustering and distributed recommendation for automatic mashup creation. IEEE Trans. Serv. Comput. 8(5), 674–687 (2015)CrossRefGoogle Scholar
  2. 2.
    Samanta, P., Liu, X.: Recommending services for new mashups through service factors and top-k neighbors. In: ICWS 2017, pp. 381–388 (2017)Google Scholar
  3. 3.
    Cao, B., Liu, X., Li, B., et al.: Mashup service clustering based on an integration of service content and network via exploiting a two-level topic model. In: ICWS 2016, pp. 212–219 (2016)Google Scholar
  4. 4.
    Li, C., Zhang, R., Huai, J., et al.: A probabilistic approach for web service discovery. In: ICWS 2013, pp. 101–108 (2013)Google Scholar
  5. 5.
    Chen, L., Wang, Y., Yu, Q., Zheng, Z., Wu, J.: WT-LDA: user tagging augmented LDA for web service clustering. In: Basu, S., Pautasso, C., Zhang, L., Fu, X. (eds.) ICSOC 2013. LNCS, vol. 8274, pp. 162–176. Springer, Heidelberg (2013). Scholar
  6. 6.
    Rodriguez Mier, P., Pedrinaci, C., Lama, M., et al.: An integrated semantic web service discovery and composition framework. IEEE Trans. Serv. Comput. 9(4), 537–550 (2016)CrossRefGoogle Scholar
  7. 7.
    Cheng, B., Zhao, S., Li, C., et al.: A web services discovery approach based on mining underlying interface semantics. IEEE Trans. Knowl. Data Eng. 99, 1–18 (2017)Google Scholar
  8. 8.
    Lu, Y., Mei, Q., Zhai, C.: Investigating task performance of probabilistic topic models: an empirical study of PLSA and LDA. Inf. Retrieval 14(2), 178–203 (2011)CrossRefGoogle Scholar
  9. 9.
    Shi, M., Liu, J., Zhou, D., Cao, B., et al.: WE-LDA: a word embeddings augmented LDA model for web services clustering. In: ICWS 2017, pp. 9–16 (2017)Google Scholar
  10. 10.
    Chen, F., Lu, C., Wu, H., et al.: A semantic similarity measure integrating multiple conceptual relationships for web service discovery. Expert Syst. Appl. Int. J. 67(C), 19–31 (2017)CrossRefGoogle Scholar
  11. 11.
    Zhu, L., Wang, G., Zou, X.: Improved information gain feature selection method for Chinese text classification based on word embedding. In: ICSCA 2017, pp. 72–76 (2017)Google Scholar
  12. 12.
    Greff, K., Srivastava, R.K., Koutnik, J., et al.: LSTM: a search space odyssey. IEEE Trans. Neural Netw. Learn. Syst. 28(10), 2222–2232 (2017)MathSciNetCrossRefGoogle Scholar
  13. 13.
    Zhou, P., Shi, W., Tian, J., et al.: Attention-based bidirectional long short-term memory networks for relation classification. In: ACL 2016, pp. 207–212 (2016)Google Scholar
  14. 14.
    Řehůřek, R., Sojka, P.: Software framework for topic modelling with large corpora. In: Proceedings of LREC 2010 Workshop New Challenges for NLP Frameworks (2010)Google Scholar
  15. 15.
    Chen, L., et al.: WTCluster: utilizing tags for web services clustering. In: Kappel, G., Maamar, Z., Motahari-Nezhad, H.R. (eds.) ICSOC 2011. LNCS, vol. 7084, pp. 204–218. Springer, Heidelberg (2011). Scholar
  16. 16.
    Tian, G., He, K., Sun, C., et al.: Ontology learning from web service descriptions. J. Front. Comput. Sci. Technol. 9(5), 575–585 (2015)Google Scholar
  17. 17.
    Huang, Z., Xu, W., Yu, K.: Bidirectional LSTM-CRF models for sequence tagging. Comput. Sci. (2015)Google Scholar
  18. 18.
    Mikolov, T., Chen, K., Corrado, G., et al.: Efficient estimation of word representations in vector space. Comput. Sci. (2013)Google Scholar
  19. 19.
    Wang, Z., Hamza, W., Florian, R.: Bilateral multi-perspective matching for natural language sentences (2017)Google Scholar
  20. 20.
    Cao, B., Liu, X., Rahman, M.D., Li, B., Liu, J., Tang, M.: Integrated content and network-based service clustering and web APIs recommendation for mashup development. IEEE Trans. Serv. Comput. Accepted 22 Mar 2017
  21. 21.
    Wu, Z., Zheng, X., Dahlmeier, D.: Character-based text classification using top down semantic model for sentence representation (2017)Google Scholar
  22. 22.
    Liu, X., Fulia, I.: Incorporating user, topic, and service related latent factors into web service recommendation. In: ICWS 2015, pp. 185–192 (2015)Google Scholar
  23. 23.
    Zhang, X., Zhao, J., Lecun, Y.: Character-level convolutional networks for text classification, pp. 649–657 (2015)Google Scholar
  24. 24.
    Yang, Z., Yang, D., Dyer, C., et al.: Hierarchical attention networks for document classification. In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1480–1489 (2017)Google Scholar
  25. 25.
    Elgazzar, K., Hassan, A., Martin, P.: Clustering WSDL documents to bootstrap the discovery of web services. In: ICWS 2010, pp. 147–154 (2010)Google Scholar

Copyright information

© ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering 2019

Authors and Affiliations

  • Xiangping Zhang
    • 1
  • Jianxun Liu
    • 1
  • Buqing Cao
    • 1
    Email author
  • Qiaoxiang Xiao
    • 1
  • Yiping Wen
    • 1
  1. 1.Key Laboratory of Knowledge Processing and Networked ManufacturingHunan University of Science and TechnologyXiangtanChina

Personalised recommendations