Xaggregation: Flexible Aggregation of XML Data

  • Hongzhi Wang
  • Jianzhong Li
  • Zhenying He
  • Hong Gao
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2762)


XML is an important format of information exchange and representation. One of its features is that it has tag representing semantics. Based on this feature, an extensive aggregation of operation of XML data, Xaggregation, is represented in this paper. Xaggregation permits XPath expression decorating dimension property and measure property. With Xaggregation, statistics of XML data becomes more flexible with function of aggregating heterogeneous data and hierarchy data along some path of XML. Xaggregation could be embedded in query language of XML such as XQuery. In this paper, the definition of Xaggregation is presented, as well as the semantics and application of it. Implementation of Xaggregation based on native XML database with XML as tree structure is also designed.


Relational Database Common Root Aggregation Result Dimension Property Dimension List 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Abiteboul, S., Buneman, P., Suciu, D.: Data on the Web: From Relations to Semistructured Data and XML. Morgan Kaufmann Publishers, San Francisco (2000)Google Scholar
  2. 2.
    World Wide Web Consortium: XML Path Language (XPath) 2.0,
  3. 3.
    World Wide Web consortium: XQuery 1.0: An XML Query Language,
  4. 4.
    Gray, J., Bosworth, A., Layman, A., Piramish, H.: Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals. In: Proc. of ICDE (1996)Google Scholar
  5. 5.
    Bonifati, A., Ceri, S.: Comparative Analysis of Five XML Query Languages. ACM SIGMOD Record 29(1) (2000)Google Scholar
  6. 6.
    Pedersen, D., Riis, K., Pedersen, T.B.: XML-Extended OLAP Querying. In: Proc of 14th International Conference on Scientific and Statistical Database Management (2002)Google Scholar
  7. 7.
    Pedersen, D., Riis, K., Pedersen, T.B.: Cost Modeling and Estimation for OLAP-XML Federations. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds.) DaWaK 2002. LNCS, vol. 2454, pp. 245–254. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  8. 8.
    Pedersen, D., Riis, K., Pedersen, T.B.: Query optimization for OLAP-XML federations. In: Proc. Of the fifth ACM international workshop on Data Warehousing and OLAP (2002)Google Scholar
  9. 9.
    Deutsch, A., Fernandez, M.F., Suciu, D.: Storing Semi-structured Data with STORED. In: Proc of SIGMOD Conference (1999)Google Scholar
  10. 10.
    Shanmugasundaram, J., Tufte, K., Zhang, C., He, G., DeWitt, D.J., Naughton, J.F.: Relational Databases for Querying XML Documents: Limitations and Opportunities. In: Proc of VLDB Conference (1999)Google Scholar
  11. 11.
    Tian, F., DeWitt, D.J., Chen, J., Zhang, C.: The Design and Performance Evaluation of Alternative XML Storage Strategies. SIGMOD Record special issue on Data Management Issues in E-commerce (2002)Google Scholar
  12. 12.
    Tufte, K., Mater, D.: Aggregation and Accumulation of XML Data. IEEE Data Engineering Bulletin 24(2), 34–39 (2001)Google Scholar
  13. 13.
  14. 14.
    Beech, D., Malhotra, A., Rys, M.: A formal data model and algebra for xml. Communication to the W3C (1999)Google Scholar
  15. 15.
    Fankhauser, P., Fernandez, M., Malhotra, A., Rys, M., Simeon, J., Wadler, P.: The XML Query Algebra,
  16. 16.
    Galanis, L., Viglas, E., DeWitt, D.J., Naughton, J.F., Maier, D.: Following the Paths of XML Data: An Algebraic Framework for XML Query Evaluation (2001), Available at
  17. 17.
    Shim, K., Chung, C., Min, J.: APEX: An Adaptive Path Index for XML data. In: Proc. of ACM SIGMOD (2002)Google Scholar
  18. 18.
    Grust, T.: Accelerating XPath location steps. In: Proc. of ACM SIGMOD (2002)Google Scholar
  19. 19.
    Gottlob, G., Koch, C., Pichler, R.: Efficient algorithms for processing XPath queries. In: Proc of 28th VLDB Conference (2002)Google Scholar
  20. 20.
    Chien, S., Vagena, Z., Zhang, D., Tsotras, V.J.: Efficient Structural Joins on Indexed XML Documents. In: Proc of 28th VLDB Conference (2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Hongzhi Wang
    • 1
  • Jianzhong Li
    • 1
  • Zhenying He
    • 1
  • Hong Gao
    • 1
  1. 1.Department of Computer Science and TechnologyHarbin Institute of Technology 

Personalised recommendations