From Structure-Based to Semantics-Based: Towards Effective XML Keyword Search

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 8217)

Abstract

Existing XML keyword search approaches can be categorized into tree-based search and graph-based search. Both are structure-based because they rely mainly on exploring the structural features of the document. Such structure-based approaches cannot fully exploit the hidden semantics in an XML document, which causes serious problems when processing some classes of keyword queries. In this paper, we thoroughly point out the mismatches between the answers returned by structure-based search and the expectations of common users. Through detailed analysis of these mismatches, we show the importance of semantics in XML keyword search and propose a semantics-based approach to processing XML keyword queries. In particular, we propose to use the Object Relationship (OR) graph, which fully captures the semantics of objects, relationships and attributes, to represent an XML document, and we develop algorithms based on the OR graph to return more comprehensive answers. Experimental results show that the proposed semantics-based approach can resolve the problems of structure-based search and significantly improve both effectiveness and efficiency.
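
To make the kind of mismatch described above concrete, the following is a minimal sketch, not the authors' algorithm. It assumes a hypothetical toy document in which the same course is duplicated under two students, and it assumes that the `code` element identifies a course object. A plain lowest-common-ancestor (LCA) computation stands in for tree-based search, and a simple set intersection on object identifiers stands in for an object-aware view; neither is the OR graph construction from the paper.

```python
import xml.etree.ElementTree as ET

# Hypothetical toy document: the same course object (code C1) is
# duplicated under two different students.
DOC = """
<university>
  <student><name>Anna</name>
    <course><code>C1</code><title>Databases</title></course>
  </student>
  <student><name>Bill</name>
    <course><code>C1</code><title>Databases</title></course>
  </student>
</university>
"""

def paths_to_matches(root, keyword):
    """Return root-to-node paths for elements whose text equals the keyword."""
    found = []

    def walk(node, path):
        path = path + [node]
        if (node.text or "").strip().lower() == keyword.lower():
            found.append(path)
        for child in node:
            walk(child, path)

    walk(root, [])
    return found

def lca(path_a, path_b):
    """Lowest common ancestor: the last element shared by both paths."""
    common = None
    for a, b in zip(path_a, path_b):
        if a is not b:
            break
        common = a
    return common

def courses_of(student):
    """Course identifiers under a student (assumes <code> identifies a course)."""
    return {c.findtext("code") for c in student.findall("course")}

root = ET.fromstring(DOC)
anna = paths_to_matches(root, "Anna")[0]
bill = paths_to_matches(root, "Bill")[0]

# Structure-based view: the only node connecting both names is the root.
answer = lca(anna, bill)
print(answer.tag)

# Object-aware view: both students are related through the same course object.
shared = courses_of(anna[1]) & courses_of(bill[1])
print(shared)
```

In this toy case the structural answer for the query {Anna, Bill} is the whole document, while a user would more plausibly expect the course the two students share. Recognizing that the two `course` elements describe one object is the sort of semantic information that a purely structural search does not use.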

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Le, T.N., Wu, H., Ling, T.W., Li, L., Lu, J. (2013). From Structure-Based to Semantics-Based: Towards Effective XML Keyword Search. In: Ng, W., Storey, V.C., Trujillo, J.C. (eds) Conceptual Modeling. ER 2013. Lecture Notes in Computer Science, vol 8217. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41924-9_29

  • DOI: https://doi.org/10.1007/978-3-642-41924-9_29

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-41923-2

  • Online ISBN: 978-3-642-41924-9

  • eBook Packages: Computer Science, Computer Science (R0)
