Xistree: Bottom-Up Method of XML Indexing

Wang, Xinyin; Zhang, Chenghong; Wang, Jingyuan; Hu, Yunfa

doi:10.1007/978-3-540-72035-5_25

Xinyin Wang¹,
Chenghong Zhang²,
Jingyuan Wang¹ &
…
Yunfa Hu¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4439))

Included in the following conference series:

International Conference on Business Information Systems

1865 Accesses

Abstract

This article mainly proposes a bottom-up method to index XML document. Firstly we discuss the underlying properties of the method, architecture, creation algorithm and query algorithm, then conduct a set of experiments referring to the Timber and XIndice system. The demo system convinces that, this method can maintain excellent indexing and querying performance under given queries with normal PC on the DBLP XML test set of which the size is 315M, so it can be regarded as a prospective application with good performance. XML, indexing, Inter-relevant Successive Trees.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Jagadish, H.V., Al-Khalifa, S., Chapman, A., et al.: Timber: A native XML database. The VLDB Journal 11, 274–291 (2002)
Article MATH Google Scholar
Goldman, R., Widom, J.: Dataguides: enabling query formulation and optimization in semistructured databases. In: Proc. of VLDB’97, pp. 436–445 (1997)
Google Scholar
Cooper, B., et al.: A fast index for semistructured data. In: Proc. of VLDB’01, pp. 341–350 (2001)
Google Scholar
Li, Q., Moon, B.: Indexing and querying xml data for regular path expressions. In: Proc. of VLDB’01, pp. 361–370 (2001)
Google Scholar
Wang, H., et al.: Vist: A dynamic index method for querying xml data by tree structures. In: Proc. of ACM SIGMOD’03, pp. 110–121 (2003)
Google Scholar
Chen, Q.H., Lim, A., Ong, K.: D(k)-index: An adaptive structural summary for graph-structured data. In: Proc. of ACM SIGMOD’03, pp. 134–144 (2003)
Google Scholar
Gottlob, G., Koch, C., Pichler, R.: Efficient algorithms for processing xpath queries. In: Proc. of VLDB’02, pp. 95–106 (2002)
Google Scholar
Ishikawa, Y., Nagai, T., Kitagawa, H.: Transforming XPath Queries for Bottom-Up Query Processing. In: Proc. of ISDB’02, pp. 210–215 (2002)
Google Scholar
Catania, B., Maddalena, A., Vakali, A.: XML document indexes: A classification. IEEE Internet Computing 9(5), 64–71 (2005)
Article Google Scholar
XIndice (2006), available at http://xml.apache.org/xindice/index.html
Zou, Q., Liu, S., Chu, W.: Ctree: a compact tree for indexing XML data. In: Proc. of WIDM’04, pp. 39–46 (2004)
Google Scholar
Kazai, G., Lalmas, M., Vries, A.: The Overlapping problem in Content-Oriented XML Retrieval Evaluation. In: Proc. of SIGIR’04, pp. 72–79 (2004)
Google Scholar
Zhou, S., Hu, Y., Guan, J.: Adjacency matrix based full-text indexing models. Journal of Software 13(10), 1933–1942 (2000)
Google Scholar
Hu, Y.: Inter-relevant successive trees – a new mathematical model for full-text database. Technical Report no. TR022031, Department of Computer and Information Technology, Fudan University (2002)
Google Scholar
Ma, H.-B., et al.: Mining frequent patterns based on is+-tree model. Journal of Computer Research and Development 42(4), 588–593 (2005)
Article Google Scholar
Chung, C., Min, J., Shim, K.: Apex: An adaptive path index for xml data. In: Proc. of ACM SIGMOD’02, pp. 121–132 (2002)
Google Scholar
Timber (2004), available at http://www.eecs.umich.edu/db/timber
DBLP (2006), available at http://dblp.uni-trier.de/xml/

Download references

Author information

Authors and Affiliations

Dept. of Computer and Info. Tech., Fudan University, Shanghai 200433, China
Xinyin Wang, Jingyuan Wang & Yunfa Hu
School of Management,Fudan University, Shanghai 200433, China
Chenghong Zhang

Authors

Xinyin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Chenghong Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jingyuan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yunfa Hu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Witold Abramowicz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, X., Zhang, C., Wang, J., Hu, Y. (2007). Xistree: Bottom-Up Method of XML Indexing. In: Abramowicz, W. (eds) Business Information Systems. BIS 2007. Lecture Notes in Computer Science, vol 4439. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72035-5_25

Download citation

DOI: https://doi.org/10.1007/978-3-540-72035-5_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72034-8
Online ISBN: 978-3-540-72035-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics