Optimizing the execution of XSLT stylesheets for querying transformed XML data

  • Regular Paper
Knowledge and Information Systems

Abstract

We have to deal with different data formats whenever data formats evolve or data must be integrated from heterogeneous systems. Data represented in XML for data exchange cannot be shared freely among applications without data transformation. A common approach to this problem is to convert the entire XML data from the source format to the applications’ target formats using the transformation rules specified in XSLT stylesheets. In many cases, however, only the smaller part of the XML data that is described by a user’s query (application) needs to be transformed. In this paper, we present an approach that optimizes the execution time of an XSLT stylesheet for answering a given XPath query by modifying the XSLT stylesheet so that it (a) captures only the parts of the XML data that are relevant to the query and (b) processes only those XSLT instructions that are relevant to the query. We prove the correctness of our optimization approach, analyze its complexity, and present experimental results. The experimental results show that our approach performs best in terms of execution time, especially when many cost-intensive XSLT instructions can be excluded from the XSLT stylesheet.
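
The idea of restricting a transformation to the query-relevant part can be illustrated with a small Python sketch. It is a toy model and not the algorithm of the paper: the "stylesheet" is assumed to be a plain mapping from source element names to target element names (the hypothetical `RULES`), the query is a simple child path without predicates (the hypothetical `QUERY`), and `transform` and `prune_rules` are illustrative helpers. Under these assumptions, dropping the rules whose output the query never touches leaves the query answer unchanged while less data is transformed.

```python
import xml.etree.ElementTree as ET

# Source document in the original format (illustrative data).
SOURCE = """<library>
  <book><title>XSLT Basics</title><price>30</price></book>
  <book><title>XPath in Depth</title><price>45</price></book>
</library>"""

# Toy stand-in for a stylesheet: source tag -> target tag.
# Real XSLT stylesheets are far richer; this mapping is an assumption.
RULES = {"library": "catalog", "book": "item", "title": "name", "price": "cost"}

# Query on the transformed document, relative to its root element.
QUERY = "item/name"


def transform(node, rules):
    """Copy node into the target format; subtrees without a rule are skipped."""
    if node.tag not in rules:
        return None
    out = ET.Element(rules[node.tag])
    out.text = node.text
    for child in node:
        converted = transform(child, rules)
        if converted is not None:
            out.append(converted)
    return out


def prune_rules(rules, query, root_tag):
    """Keep the root rule and the rules whose target tag is a step of the query."""
    steps = set(query.split("/"))
    return {src: tgt for src, tgt in rules.items()
            if src == root_tag or tgt in steps}


root = ET.fromstring(SOURCE)

# Baseline: transform everything, then query the result.
full = transform(root, RULES)
naive = [e.text for e in full.findall(QUERY)]

# Pruned: transform only what the query can reach, then query.
pruned = transform(root, prune_rules(RULES, QUERY, root.tag))
optimized = [e.text for e in pruned.findall(QUERY)]

assert naive == optimized
print(optimized)
```

The pruning is safe in this sketch only because every rule is a one-to-one renaming and the query has no predicates; general XSLT instructions and XPath expressions need the more careful analysis that the paper describes.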

Author information

Correspondence to Sven Groppe.

About this article

Cite this article

Groppe, S., Groppe, J., Böttcher, S. et al. Optimizing the execution of XSLT stylesheets for querying transformed XML data. Knowl Inf Syst 18, 331–391 (2009). https://doi.org/10.1007/s10115-008-0144-4

  • DOI: https://doi.org/10.1007/s10115-008-0144-4
