Efficiently Processing XML Queries over Fragmented Repositories with PartiX

  • Alexandre Andrade
  • Gabriela Ruberg
  • Fernanda Baião
  • Vanessa P. Braganholo
  • Marta Mattoso
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4254)


The data volume of XML repositories and the response time of query processing have become critical issues for many applications, especially for those in the Web. An interesting alternative to improve query processing performance consists in reducing the size of XML databases through fragmentation techniques. However, traditional fragmentation definitions do not directly apply to collections of XML documents. This work formalizes the fragmentation definition for collections of XML documents, and shows the performance of query processing over fragmented XML data. Our prototype, PartiX, exploits intra-query parallelism on top of XQuery-enabled sequential DBMS modules. We have analyzed several experimental settings, and our results showed a performance improvement of up to a 72 scale up factor against centralized databases.


Path Expression Hybrid Fragmentation Vertical Fragmentation XQuery Query Fragmentation Design 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Abiteboul, S., Bonifati, A., Cobena, G., Manolescu, I., Milo, T.: Dynamic XML documents with distribution and replication. In: SIGMOD, pp. 527–538 (2003)Google Scholar
  2. 2.
    Amer-Yahia, S., Kotidis, Y.: A web-services architecture for efficient XML data exchange. In: ICDE, pp. 523–534 (2004)Google Scholar
  3. 3.
    Andrade, A., Ruberg, G., Baião, F., Braganholo, V., Mattoso, M.: Partix: Processing XQueries over fragmented XML repositories. Technical Report ES-691, COPPE/UFRJ (2005),
  4. 4.
    Baião, F., Mattoso, M., Zaverucha, G.: A distribution design methodology for object DBMS. Distributed and Parallel Databases 16(1), 45–90 (2004)CrossRefGoogle Scholar
  5. 5.
    Barbosa, D., Mendelzon, A., Keenleyside, J., Lyons, K.: ToXgene: a template-based data generator for XML. In: WebDB, pp. 621–632 (2002)Google Scholar
  6. 6.
    Bonifati, A., Matrangolo, U., Cuzzocrea, A., Jain, M.: XPath lookup queries in P2P networks. In: WIDM, pp. 48–55 (2004)Google Scholar
  7. 7.
    Bose, S., Fegaras, L., Levine, D., Chaluvadi, V.: XPath lookup queries in p2p networks. In: WIDM, pp. 48–55 (2004)Google Scholar
  8. 8.
    Bremer, J.-M., Gertz, M.: On distributing XML repositories. In: WebDB (2003)Google Scholar
  9. 9.
    Fernández, M., Siméon, J., Wadler, P.: An algebra for XML query. In: Kapoor, S., Prasad, S. (eds.) FST TCS 2000. LNCS, vol. 1974, pp. 11–45. Springer, Heidelberg (2000)CrossRefGoogle Scholar
  10. 10.
    Jagadish, H., Lakshmanan, L., Srivastava, D., Thompson, K.: TAX: A tree algebra for XML. In: Ghelli, G., Grahne, G. (eds.) DBPL 2001. LNCS, vol. 2397, pp. 149–164. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  11. 11.
    Lima, A., Mattoso, M., Valduriez, P.: Adaptive virtual partitioning for olap query processing in a database cluster. In: SBBD, pp. 92–105 (2004)Google Scholar
  12. 12.
    Ma, H., Schewe, K.-D.: Fragmentation of XML documents. In: SBBD (2003)Google Scholar
  13. 13.
    Meier, W.: eXist: Open source native XML database (2000), Available at:
  14. 14.
    Navathe, S., Karlapalem, K., Ra, M.: A mixed fragmentation methodology for initial distributed database design. Journal of Computer and Software Engineering 3(4) (1995)Google Scholar
  15. 15.
    Ozsu, T., Valduriez, P.: Principles of Distributed Database Systems. Prentice Hall, Englewood Cliffs (1999)Google Scholar
  16. 16.
    Paparizos, S., Wu, Y., Lakshmanan, L., Jagadish, H.: Tree logical classes for efficient evaluation of XQuery. In: SIGMOD, pp. 71–82 (2004)Google Scholar
  17. 17.
    Yao, B., Ozsu, M., Khandelwal, N.: Xbench benchmark and performance testing of XML DBMSs. In: ICDE, pp. 621–632 (2004)Google Scholar
  18. 18.
    Zhang, X., Pielech, B., Rundesnteiner, E.: Honey, I shrunk the XQuery!: an XML algebra optimization approach. In: WIDM, pp. 15–22 (2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Alexandre Andrade
    • 1
  • Gabriela Ruberg
    • 1
  • Fernanda Baião
    • 2
  • Vanessa P. Braganholo
    • 1
  • Marta Mattoso
    • 1
  1. 1.Computer Science DepartmentCOPPE/Federal Univ. of Rio de JaneiroBrazil
  2. 2.Applied Informatics DepartmentUniversity of Rio de JaneiroBrazil

Personalised recommendations