Skip to main content

XPloreRank: exploring XML data via you may also like queries


In many cases, users are not familiar with their exact information needs while searching complicated data sources. This lack of understanding may cause the users to feel dissatisfaction when the system retrieves insufficient results after they issue queries. However, using their original query results, we may recommend additional queries which are highly relevant to the original query. This paper presents XPloreRank to recommend top-l highly relevant keyword queries called “You May Also Like” (YMAL) queries to the users in XML keyword search. To generate such queries, we firstly analyze the original keyword query results content and construct a weighted co-occurring keyword graph. Then, we generate the YMAL queries by traversing the co-occurring keyword graph and rank them based on the following correlation aspects: (a) external correlation, which measures the similarity of the YMAL query to the original query and (b) internal correlation, which measures the capability of the YMAL query keywords in producing meaningful results with respect to the data source. Due to the complexity of generating YMAL queries, we propose a novel A* search-based technique to generate top-l YMAL queries efficiently. We also present a greedy-based approximation for it to improve the performance further. Extensive experiments verify the effectiveness and efficiency of our approach.

This is a preview of subscription content, access via your institution.

Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9


  1. Adomavicius, G., Tuzhilin, A.: Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Trans. Knowl. Data Eng. 17(6), 734–749 (2005)

    Article  Google Scholar 

  2. Akbarnejad, J., Chatzopoulou, G., Eirinaki, M., Koshy, S., Mittal, S., On, D., Polyzotis, N., Varman, J.S.V.: SQL Querie recommendations. PVLDB 3(2), 1597–1600 (2010)

    Google Scholar 

  3. Baid, A., Wu, W., Sun, C., Doan, A., Naughton, J.F.: On debugging non-answers in keyword search systems. In: EDBT, pp 37–48 (2015)

  4. Bao, Z., Ling, T. W., Chen, B., Lu, J.: Effective Xml keyword search with relevance oriented ranking. In: ICDE, pp 517–528 (2009)

  5. Bao, Z., Zeng, Y., Ling, T.W., Zhang, D., Li, G., Jagadish, H.V.: A general framework to resolve the mismatch problem in XML keyword search. VLDB J. 24(4), 493–518 (2015)

    Article  Google Scholar 

  6. Cilibrasi, R., Vitányi, P.M.B.: The google similarity distance. IEEE Trans. Knowl. Data Eng. 19(3), 370–383 (2007)

    Article  Google Scholar 

  7. Cohen, S., Brodianskiy, T.: Correcting queries for XML. Inf. Syst. 34(8), 690–710 (2009)

    Article  Google Scholar 

  8. Drosou, M., Pitoura, E.: Ymaldb: exploring relational databases via result-driven recommendations. VLDB J. 22(6), 849–874 (2013)

    Article  Google Scholar 

  9. Ehsan, H., Sharaf, M.A., Chrysanthis, P.K.: Muve: efficient multi-objective view recommendation for visual data exploration. In: ICDE, pp 731–742 (2016)

  10. Ge, X., Xue, Y., Luo, Z., Sharaf, M.A., Chrysanthis, P.K.: REQUEST: a scalable framework for interactive construction of exploratory queries. In: IEEE Bigdata, pp 646–655 (2016)

  11. Guo, L., Shao, F., Botev, C., Shanmugasundaram, J.: Xrank: ranked keyword search over xml documents. In: SIGMOD, pp 16–27 (2003)

  12. Huang, H., Chen, Z., Liu, C., Huang, H., Zhang, X.: An effective suggestion method for keyword search of databases. World Wide Web 20(4), 729–747 (2017)

    Article  Google Scholar 

  13. Islam, M.S., Liu, C., Li, J.: Efficient answering of why-not questions in similar graph matching. IEEE Trans. Knowl Data Eng. 27(10), 2672–2686 (2015)

    Article  Google Scholar 

  14. Islam, M.S., Liu, C., Zhou, R.: Flexiq: a flexible interactive querying framework by exploiting the skyline operator. J. Syst. Softw. 97, 97–117 (2014)

    Article  Google Scholar 

  15. Islam, M.S., Zhou, R., Liu, C.: On answering why-not questions in reverse skyline queries. In: 29th IEEE International Conference on Data Engineering, ICDE 2013, Brisbane, Australia, April 8-12, 2013, pp 973–984 (2013)

  16. Jagadish, H.V., Chapman, A., Elkiss, A., Jayapandian, M., Li, Y., Nandi, A., Yu, C.: Making database systems usable. In: SIGMOD, pp 13–24 (2007)

  17. Kalinin, A., Çetintemel, U., Zdonik, S.B.: Interactive data exploration using semantic windows. In: International Conference on Management of Data, SIGMOD 2014, Snowbird, UT, USA, June 22-27, 2014, pp 505–516 (2014)

  18. Li, F., Jagadish, H.V.: Usability, databases, and HCI. IEEE Data Eng. Bull. 35(3), 37–45 (2012)

    Google Scholar 

  19. Li, J., Liu, C., Yu, J.X.: Context-based diversification for keyword queries over XML data. IEEE Trans. Knowl Data Eng. 27(3), 660–672 (2015)

    Article  Google Scholar 

  20. Li, J., Liu, C., Zhou, R., Wang, W.: XML Keyword search with promising result type recommendations. World Wide Web 17(1), 127–159 (2014)

    Article  Google Scholar 

  21. Mishra, C., Koudas, N.: Interactive query refinement. In: EDBT, pp 862–873 (2009)

  22. Mooney, R.J., Roy, L.: Content-based book recommending using learning for text categorization. In: ACM Conference on Digital Libraries, pp 195–204 (2000)

  23. Nambiar, U., Kambhampati, S.: Answering imprecise queries over autonomous web databases. In: ICDE (2006)

  24. Naseriparsa, M., Islam, M.S., Liu, C., Moser, I.: No-but-semantic-match: computing semantically matched xml keyword search results. World Wide Web, 1–35 (2017)

  25. Palmisano, C., Tuzhilin, A., Gorgoglione, M.: Using context to improve predictive modeling of customers in personalization applications. IEEE Trans. Knowl. Data Eng. 20(11), 1535–1549 (2008)

    Article  Google Scholar 

  26. Pazzani, M.J., Billsus, D.: Learning and revising user profiles: the identification of interesting web sites. Mach. Learn. 27(3), 313–331 (1997)

    Article  Google Scholar 

  27. Schenkel, R., Theobald, A., Weikum, G.: Semantic similarity search on semistructured data with the XXL search engine. Inf. Retr. 8(4), 521–545 (2005)

    Article  Google Scholar 

  28. Sun, C., Chan, C.-Y., Goenka, A.K.: Multiway Slca-based keyword search in xml data. In: World Wide Web, pp 1043–1052 (2007)

  29. Sun, J., Xu, J., Zheng, K., Liu, C.: Interactive spatial keyword querying with semantics. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, CIKM 2017, Singapore, November 06 - 10, 2017, pp 1727–1736 (2017)

  30. Xu, Y., Papakonstantinou, Y.: Efficient keyword search for smallest lcas in xml databases. In: SIGMOD, pp 527–538 (2005)

  31. Xu, Y., Papakonstantinou, Y.: Efficient lca based keyword search in xml data. In: EDBT, pp 535–546 (2008)

  32. Zhou, R., Liu, C., Li, J.: Fast elca computation for keyword queries on xml data. In: EDBT, pp 549–560 (2010)

Download references


This work is supported by the Australian Research Council Discovery Grants DP140103499, DPDP170104747, and DP180100212.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Mehdi Naseriparsa.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Naseriparsa, M., Liu, C., Islam, M.S. et al. XPloreRank: exploring XML data via you may also like queries. World Wide Web 22, 1727–1750 (2019).

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI:


  • XML keyword search
  • Data exploration
  • Recommendations