XML Keyword Pattern Refinement

Lu, Jiaheng

doi:10.1007/978-3-642-34555-5_7

XML Keyword Pattern Refinement

Jiaheng Lu²

Chapter
First Online: 01 January 2013

1139 Accesses

Abstract

In XML keyword search, users’ queries usually contain irrelevant or mismatched terms, typos, etc., which may easily lead to empty or meaningless results. In this chapter, we first introduce the problem of content-aware XML keyword query refinement, aiming to integrate the job of finding the desired refined queries and generating their matching results as a single problem. Furthermore, a statistics-based query ranking model, which takes into account of both keyword dependencies and the relevance, is proposed. The ranking model evaluates the quality of a refined query, which captures the morphological/semantic similarity between the original query and refined queries and the dependency of keywords of the refined queries over the XML data. In addition, two adaptive query refinement algorithms are also proposed. Finally, we experimentally demonstrate the efficiency and effectiveness of the approach presented in this chapter.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.00; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
To facilitate our discussion, the dissimilarity score of a single term deletion rule is 2 throughout all examples in this chapter.
2.
http://www.ibiblio.org/xml/books/biblegold/examples/baseball/
3.
http://xmldb.ddns.comp.nus.edu.sg

References

Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: SIGMOD 1993, Washington, DC (1993)
Google Scholar
Bao, Z., Chen, B., Ling, T.W., Lu, J.: Demonstrating effective ranked XML keyword search with meaningful result display. In: DASFAA 2009, Brisbane (2009)
Google Scholar
Bao, Z., Ling, T.W., Chen, B., Lu, J.: Effective XML keyword search with relevance oriented ranking. In: ICDE, Shanghai (2009)
Google Scholar
Fellbaum, F.C.: WordNet: a electronic lexical database. Cambridge, MA: MIT Press (1998)
Google Scholar
Fain, D.C., Pedersen, J.O.: Sponsored search. In: Bulletin of the American Society for Information Science and Technology (2005)
Google Scholar
Guo, L., Shao, F., Botev, C., Shanmugasundaram, J.: XRANK: ranked keyword search over XML documents. In: SIGMOD, San Diego (2003)
Google Scholar
Guo, J., Xu, G., Li, H., Cheng, X.: A unified and discriminative model for query refinement. In: SIGIR, Singapore (2008)
Google Scholar
Jarvelin, K., Kekalainen, J.: Cumulated gain-based evaluation of IR techniques. ACM Trans. Inf. Syst. 20(4), 422 (2002)
Google Scholar
Jones, R., Rey, B., Madani, O., Greiner, W.: Generating query substitutions. In: WWW (2006)
Google Scholar
Liu, Z., Chen, Y.: Identifying meaningful return information for XML keyword search. In: SIGMOD, Beijing (2007)
Google Scholar
Liu, Z., Chen, Y.: Reasoning and identifying relevant matches for XML keyword search. PVLDB 1(1), 921–932 (2008)
Google Scholar
Li, Y., Yu, C., Jagadish, H.V.: Schema-free XQuery. In: VLDB, Toronto, pp. 72–83 (2004)
Google Scholar
Mass, Y., Mandelbrod, M.: Component ranking and automatic query refinement for XML retrieval. In: INEX, Dagstuhl (2004)
Google Scholar
Pan, H., Theobald, A., Schenkel, R.: Query refinement by relevance feedback in an XML retrieval system. In: ER, Shanghai (2004)
Google Scholar
Pu, K.Q., Yu, X.: Keyword query cleaning. In: VLDB, Auckland (2008)
Google Scholar
Jones, R., Fain, D.: Query word deletion prediction. In: SIGIR, Toronto (2003)
Google Scholar
Ruthven, I.: Re-examining the potential effectiveness of interactive query expansion. In: SIGIR, Toronto (2003)
Google Scholar
Sun, C., Chan, C.Y., Goenka, A.K.: Multiway SLCA-based keyword search in XML data. In: WWW, 2007, Banff (2007)
Google Scholar
Theobald, M., Bast, H., Majumdar, D., Schenkel, R., Weikum, G.: Topx: efficient and versatile top-k query processing for semistructured data. VLDB J. 17(1), 81–115 (2008)
Google Scholar
Xu, J., Croft, W.B.: Improving the effectiveness of information retrieval with local context analysis. ACM Trans. Inf. Syst. 18(1), 79–112 (2000)
Google Scholar
Xu, Y., Papakonstantinou, Y.: Efficient keyword search for smallest LCAs in XML databases. In: SIGMOD, Baltimore (2005)
Google Scholar
Xu, Y., Papakonstantinou, Y.: Efficient LCA based keyword search in XML data. In: EDBT, Nantes (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Information, Renmin University of China, Beijing, People’s Republic of China
Jiaheng Lu

Authors

Jiaheng Lu
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Lu, J. (2013). XML Keyword Pattern Refinement. In: An Introduction to XML Query Processing and Keyword Search. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34555-5_7

Download citation

DOI: https://doi.org/10.1007/978-3-642-34555-5_7
Published: 28 January 2013
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34554-8
Online ISBN: 978-3-642-34555-5
eBook Packages: Computer Science

Publish with us

Policies and ethics