Abstract
This article proposes a bottom-up method for indexing XML documents. We first discuss the underlying properties of the method, its architecture, and its creation and query algorithms, and then report a set of experiments conducted with reference to the Timber and XIndice systems. The demo system shows that, on a normal PC, the method maintains excellent indexing and querying performance for the given queries on the DBLP XML test set, whose size is 315M, so it can be regarded as a promising approach with good performance.
Keywords: XML, indexing, Inter-relevant Successive Trees.
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Wang, X., Zhang, C., Wang, J., Hu, Y. (2007). Xistree: Bottom-Up Method of XML Indexing. In: Abramowicz, W. (eds) Business Information Systems. BIS 2007. Lecture Notes in Computer Science, vol 4439. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72035-5_25
DOI: https://doi.org/10.1007/978-3-540-72035-5_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72034-8
Online ISBN: 978-3-540-72035-5
eBook Packages: Computer Science (R0)