Abstract
Tree structures provide a popular means to organize the information on the Web. Taxonomies of thematic categories, concept hierarchies, e-commerce product catalogs are examples of such structures. Querying multiple data sources that use tree structures to organize their data is a challenging issue due to name mismatches, structural differences and structural inconsistencies that occur in such structures, even for a single knowledge domain. In this paper, we present a method to query tree-structured data. We introduce dimensions which are sets of semantically related nodes in tree structures. Based on dimensions, we suggest dimension graphs. Dimension graphs can be automatically extracted from trees and abstract their structural information. They are semantically rich constructs that provide query guidance to pose and evaluate queries on trees. We design a query language to query tree-structured data. A key feature of this language is that queries are not restricted by the structure of the trees. We present a technique for evaluating queries and we provide necessary and sufficient conditions for checking query unsatisfiability. We also show how dimension graphs can be used to query multiple trees in the presence of structural differences and inconsistencies.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Exchangeable Faceted Metadata Language, XFML (2003), http://www.xfml.org/
XML Topic Maps (XTM) (2001), http://www.topicmaps.org
World Wide Web Consortium site (W3C), http://www.w3c.org
XML Path Language (XPath). World Wide Web Consortium site. W3C (2003-2005), http://www.w3c.org/TR/xpath20/
XML Query (XQuery). World Wide Web Consortium site, The Architecture Domain. W3C (2003-2005), http://www.w3.org/XML/Query
Abiteboul, S., Buneman, P., Suciu, D.: Data on the Web: from Relations to Semistructured Data and XML. Morgan Kaufmann Publishers, San Francisco (2000)
Amann, B., Beeri, C., Fundulaki, I., Scholl, M.: Ontology-based Integration of XML Web Resources. In: Proc. of the ICSW 2002 Conference, Sardinia, Italy (2002)
Behrens, R.: A grammar based model for XML schema integration. In: Jeffery, K., Lings, B. (eds.) BNCOD 2000. LNCS, vol. 1832, p. 172. Springer, Heidelberg (2000)
Bergamaschi, S., Guerra, F., Vincini, M.: A Data Integration Framework for E-commerce Product Classification. In: Proc. of the ICSW 2002 Conference, Sardinia, Italy (2002)
Buneman, P., Davidson, S.B., Fernandez, M.F., Suciu, D.: Adding Structure to Unstructured Data. In: Afrati, F.N., Kolaitis, P.G. (eds.) ICDT 1997. LNCS, vol. 1186. Springer, Heidelberg (1996)
Chaudhri, A.B., Rashid, A., Zicari, R.: XML Data Management. Addison Wesley, Reading (2003)
Christophides, V., Cluet, S., Simeon, J.: On Wrapping Query Languages and Efficient XML Integration. In: Proc. of the ACM SIGMOD 2000 Conference, USA (2000)
Cluet, S., Veltri, P., Vodislav, D.: Views in a Large Scale XML Repository. In: Proc. of the VLDB 2001 Conference, Rome, Italy (2001)
Garofalakis, M., Gionis, A., Rastogi, R., Seshadri, S., Shim, K.: XTRACT: A System for Extracting Document Type Descriptors from XML Documents. In: Proc. of the ACM SIGMOD 2000 Conference, Dallas, Texas, USA (2000)
Goldman, R., Widom, J.: DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases. In: Proc. of the VLDB 1997 Conference, Athens, Greece (1997)
Kim, D., Kim, J., Lee, S.-G.: Catalog Integration for Electronic Commerce through Category-hierarchy Merging Technique. In: Proc. of the RIDE 2002 Workshop, San Jose, USA (2002)
Lee, M.L., Yang, L.H., Hsu, W., Yang, X.: Xclust: Clustering XML Schemas for Effective Integration. In: Proc. of the CIKM 2002 Conference, Virginia, USA (2002)
Manolescu, I., Florescu, D., Kossmann, D.: Answering XML Queries over Heterogeneous Data Sources. In: Proc. of the VLDB 2001 Conference, Rome, Italy (2001)
Marron, P.J., Lausen, G., Weber, M.: Catalog Integration Made Easy. In: Proc. of the ICDE 2003 Conference, Bangalore, India (poster) (2003)
Rahm, E., Bernstein, P.A.: A Survey of Approaches to Automatic Schema Matching. VLDB Journal 10(4) (2001)
Tzitzikas, Y., Spyratos, N., Constantopoulos, P., Analyti, A.: Extended Faceted Taxonomies for Web Catalogs. In: Proc. of the WISE 2002 Conference, Singapore (December 2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Theodoratos, D., Dalamagas, T. (2005). Querying Tree-Structured Data Using Dimension Graphs. In: Pastor, O., Falcão e Cunha, J. (eds) Advanced Information Systems Engineering. CAiSE 2005. Lecture Notes in Computer Science, vol 3520. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11431855_15
Download citation
DOI: https://doi.org/10.1007/11431855_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26095-0
Online ISBN: 978-3-540-32127-9
eBook Packages: Computer ScienceComputer Science (R0)