Abstract
There has been extensive research on XML keyword-based and loosely structured querying. Some frameworks work well for certain types of XML data models and fail for others. The reason is that the proposed techniques find relationships solely between individual nodes while overlooking the context of these nodes. The context of a leaf node is determined by its parent node, because a leaf node specifies one of the characteristics of its parent. Building relationships between individual leaf nodes without considering their parents may yield relationships that are semantically disconnected. Since leaf nodes are nothing but characteristics of their parents, we observe that each parent-children set of nodes can be treated as one unified entity. We then find semantic relationships between the different unified entities. Based on these observations, we propose an XML semantic search engine called OOXSearch, which answers loosely structured queries. The recall and precision of the engine were evaluated experimentally and compared with two recently proposed systems [1, 2], and the results showed marked improvement.
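The idea of treating a parent and its leaf children as one unified entity can be illustrated with a short sketch. This is not the OOXSearch implementation; it is a minimal illustration that assumes an entity is any interior element together with its leaf children, and the sample document, the function names `unified_entities` and `entities_matching`, and the dictionary layout are all hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document, used only for illustration.
SAMPLE = """
<library>
  <book>
    <title>XML Basics</title>
    <year>2005</year>
    <author>
      <name>Smith</name>
      <area>Databases</area>
    </author>
  </book>
</library>
"""


def unified_entities(root):
    """Group each interior element with its leaf children into one entity.

    Assumption: a leaf is an element with no element children. Repeated
    leaf tags under one parent overwrite each other in this simple sketch.
    """
    entities = []
    for parent in root.iter():
        leaves = {
            child.tag: (child.text or "").strip()
            for child in parent
            if len(child) == 0  # no element children, so it is a leaf
        }
        if leaves:
            entities.append({"entity": parent.tag, "attributes": leaves})
    return entities


def entities_matching(entities, keyword):
    """Return the tags of entities with a leaf value containing the keyword."""
    needle = keyword.lower()
    return [
        e["entity"]
        for e in entities
        if any(needle in value.lower() for value in e["attributes"].values())
    ]


ENTITIES = unified_entities(ET.fromstring(SAMPLE))

if __name__ == "__main__":
    for entity in ENTITIES:
        print(entity)
    print(entities_matching(ENTITIES, "smith"))
```

In this sketch a keyword is resolved to the entity that owns the matching leaf rather than to the leaf itself, so any later relationship is built between entities such as `book` and `author` instead of between isolated nodes such as `title` and `name`.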
References
Cohen, S., Mamou, J., Kanza, Y., Sagiv, Y.: XSEarch: A Semantic Search Engine for XML. In: Aberer, K., Koubarakis, M., Kalogeraki, V. (eds.) Databases, Information Systems, and Peer-to-Peer Computing. LNCS, vol. 2944, Springer, Heidelberg (2004)
Li, Y., Jagadish, H.: Schema-Free XQuery. In: VLDB 2004 (2004)
Xu, Y., Papakonstantinou, Y.: Efficient Keyword Search for Smallest LCAs in XML Databases. In: SIGMOD 2005 (2005)
Amer-Yahia, S., Deutsch, A.: Flexible and Efficient XML Search with Complex Full-Text Predicates. In: SIGMOD 2006 (2006)
Al-Khalifa, S., Yu, C., Jagadish, H.: Querying Structured Text in an XML Database. In: SIGMOD 2003 (2003)
Hristidis, V., Papakonstantinou, Y.: DISCOVER: Keyword Search in Relational Databases. In: Bressan, S., Chaudhri, A.B., Lee, M.L., Yu, J.X., Lacroix, Z. (eds.) CAiSE 2002 and VLDB 2002. LNCS, vol. 2590, Springer, Heidelberg (2003)
Chaudhuri, S., Das, G., Agrawal, S.: DBXplorer: a System for Keyword-Based Search Over Relational Databases. In: ICDE 2002 (2002)
Aditya, B., Sudarshan, S.: BANKS: Browsing and Keyword Searching in Relational Databases. In: Bressan, S., Chaudhri, A.B., Lee, M.L., Yu, J.X., Lacroix, Z. (eds.) CAiSE 2002 and VLDB 2002. LNCS, vol. 2590, Springer, Heidelberg (2003)
Balmin, A., Hristidis, V., Koudas, N.: A System for Keyword Proximity Search on XML Databases. In: Aberer, K., Koubarakis, M., Kalogeraki, V. (eds.) Databases, Information Systems, and Peer-to-Peer Computing. LNCS, vol. 2944, Springer, Heidelberg (2004)
Shao, F., Guo, L., Botev, C.: XRANK: Ranked Keyword Search over XML Documents. In: SIGMOD 2003 (2003)
XML Query Use Cases, W3C Working Draft 8 (June 2006), available at http://www.w3.org/TR/xquery-use-cases/
Elmasri, R., Taha, K.: Caching: An Efficient XML Query Mechanism in Client-Server Architecture. In: SWDIM 2006 (2006)
XQEngine version 0.69, downloaded from http://sourceforge.net/projects/xqengine
ToXgene, a template-based generator for large XML documents, available at: http://www.cs.toronto.edu/tox/toxgene/
Boag, S.: XQuery 1.0: An XML Query Language. W3C Recommendation 2006, http://www.w3.org/TR/2006/CR-xquery-20060608/
Florescu, D., et al.: Integrating keyword search into XML query processing. Computer Networks 33, 119–135 (2000)
Bray, T.: Extensible Markup Language (XML). W3C, At: http://www.w3.org/TR/2004/REC-xml11-20040204/
Berglund, A.: XML Path Language (XPath) 2.0. W3C Working Draft 15 (September 2005)
XMark - An XML Benchmark Project, available at http://monetdb.cwi.nl/xml/downloads.html
XML Validation Benchmark, Sarvega (an Intel company), http://www.sarvega.com/xml-validation-benchmark.html
Hristidis, V., Balmin, A.: Keyword Proximity Search on XML Graphs. In: ICDE 2003 (2003)
Cohen, S., Kimelfeld, B.: Interconnection Semantics for Keyword Search in XML. In: CIKM 2005 (2005)
Amer-Yahia, S., Deutsch, A.: Flexible and Efficient XML Search with Complex Full-Text Predicates. In: VLDB 2006 (2006)
Pradhan, S.: An Algebraic Query Model for Effective and Efficient Retrieval of XML Fragments. In: VLDB 2006 (2006)
Balmin, A.: Authority Based Keyword Search in Databases. In: VLDB 2004 (2004)
Castor is an Open Source data-binding framework for Java, available at http://www.castor.org/
The complete version of this paper is available at: http://www.students.uta.edu/ks/kst0035/
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Taha, K., Elmasri, R. (2007). OOXSearch: A Search Engine for Answering Loosely Structured XML Queries Using OO Programming. In: Cooper, R., Kennedy, J. (eds) Data Management. Data, Data Everywhere. BNCOD 2007. Lecture Notes in Computer Science, vol 4587. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73390-4_9
DOI: https://doi.org/10.1007/978-3-540-73390-4_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73389-8
Online ISBN: 978-3-540-73390-4
eBook Packages: Computer Science (R0)