Advertisement

Knowledge and Information Systems

, Volume 18, Issue 3, pp 331–391 | Cite as

Optimizing the execution of XSLT stylesheets for querying transformed XML data

  • Sven Groppe
  • Jinghua Groppe
  • Stefan Böttcher
  • Thomas Wycisk
  • Le Gruenwald
Regular Paper

Abstract

We have to deal with different data formats whenever data formats evolve or data must be integrated from heterogeneous systems. These data when implemented in XML for data exchange cannot be shared freely among applications without data transformation. A common approach to solve this problem is to convert the entire XML data from their source format to the applications’ target formats using the transformations rules specified in XSLT stylesheets. However, in many cases, not all XML data are required to be transformed except for a smaller part described by a user’s query (application). In this paper, we present an approach that optimizes the execution time of an XSLT stylesheet for answering a given XPath query by modifying the XSLT stylesheet in such a way that it would (a) capture only the parts in the XML data that are relevant to the query and (b) process only those XSLT instructions that are relevant to the query. We prove the correctness of our optimization approach, analyze its complexity and present experimental results. The experimental results show that our approach performs the best in terms of execution time, especially when many cost-intensive XSLT instructions can be excluded in the XSLT stylesheet.

Keywords

XML XSLT Query reformulation Optimization Transformation Views 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Abiteboul S (1999) On views and XML. In: Proceedings of the eighteenth ACM SIGACT-SIGMOD-SIGART symposium on principles of database systems (PODS), Philadelphia, PennsylvaniaGoogle Scholar
  2. 2.
    Abiteboul S, Cluet S, Milo T (1997) Correspondence and translation for heterogeneous data. In: Proceedings of the 6th international conference on database theory (ICDT), Delphi, GreeceGoogle Scholar
  3. 3.
    Apache Software Foundation (2003) Xalan-Java. http://xml.apache.org/xalan-j/index.html
  4. 4.
    Apache Software Foundation (2003) Xerces2 Java Parser 2.5.0 Release. http://xml.apache.org/xerces2-j
  5. 5.
    Böttcher S, Steinmetz R (2004) Optimized Internet search based on an intersection test for XPath expressions under a DTD, In: Proceedings of the international conference on internet computing (IC), Las Vegas, USAGoogle Scholar
  6. 6.
    Böttcher S, Türling A (2003) Checking XPath expressions for synchronization, access control and reuse of query results on mobile clients. In: Proceedings of the workshop on database mechanisms for mobile applications, Karlsruhe, GermanyGoogle Scholar
  7. 7.
    Bourret R, Bornhövd C, Buchmann, AP (2000) A generic load/extract utility for data transfer between XML documents and relational databases. In: Proceedings of the 2nd international workshop on advanced issues of EC and Web-based information systems (WECWIS), San JoseGoogle Scholar
  8. 8.
    Chang CCK, Garcia-Molina H (2000) Approximate query translation across heterogeneous information sources. In: Proceedings of the 26th international conference on very large data bases (VLDB), Cairo, EgyptGoogle Scholar
  9. 9.
    Chen YB, Ling TW, Lee ML (2002) Designing valid XML views. In: Proceedings of the 21st international conference on conceptual modeling (ER), Tampere, FinlandGoogle Scholar
  10. 10.
    Cluet S, Delobel C, Simon J, Smaga K (1998) Your mediators need data conversion! In: Proceedings ACM SIGMOD international conference on management of data, Seattle, Washington, USAGoogle Scholar
  11. 11.
    Cluet S, Veltri P, Vodislav D (2001) Views in a large scale XML repository. In: Proceedings of 27th international conference on very large data bases (VLDB), Roma, ItalyGoogle Scholar
  12. 12.
    Deutsch A, Tannen V (2003) Reformulation of XML queries and constraints. In: Proceedings of the 9th international conference on database theory (ICDT) 2003, Siena, ItalyGoogle Scholar
  13. 13.
    Developer (2005) XSLT Mark Version 2.1.0. http://www.datapower.com/xmldev/xsltmark.html
  14. 14.
    Diao Y, Altinel M, Franklin MJ, Zhang H, Fischer P: Path sharing and predicate evaluation for high-performance XML filtering. ACM Trans Database Syst 28(4), 467–516 (2003)CrossRefGoogle Scholar
  15. 15.
    Diao Y, Rizvi S, Franklin MJ (2004) Towards an Internet-Scale XML dissemination service. In: Proceedings of the thirtieth international conference on very large data bases (VLDB), Toronto, CanadaGoogle Scholar
  16. 16.
    Fernández M, Kadiyska Y, Suciu D, Morishima A, Tan WC: SilkRoute, a framework for publishing relational data in XML. ACM Trans Database Syst 27(4), 438–493 (2002)CrossRefGoogle Scholar
  17. 17.
    Fisher D, Lam F, Wong RK (2004) Algebraic transformation and optimization for XQuery. In: Proceedings of the 6th Asia-Pacific Web conference (APWeb), Hangzhou, ChinaGoogle Scholar
  18. 18.
    Franceschet M (2005) XPathMark—an XPath benchmark for the XMark generated data. In: Proceedings of the third international XML database symposium (XSym 2005), Trondheim, NorwayGoogle Scholar
  19. 19.
    Gottlob G, Koch C, Pichler R (2003) The complexity of XPath query evaluation. In: Proceedings of the 22th ACM SIGMOD-SIGACT-SIGART symposium of principles of database systems (PODS), San Diego, California, USAGoogle Scholar
  20. 20.
    Grinev M, Kuznetsov S (2002) Towards an exhaustive set of rewriting rules for XQuery optimisation: BizQuery experience. In: Proceedings of the 6th East European conference on advances in databases and information systems (ADBIS), Bratislava, SlovakiaGoogle Scholar
  21. 21.
    Grinev M, Pleshachkov P (2005) Rewriting-based Optimization for XQuery transformational queries. In: Proceedings of the 9th international database engineering and applications symposium (IDEAS 2005), Montreal, CanadaGoogle Scholar
  22. 22.
    Groppe S, Böttcher S (2003) XPath query transformation based on XSLT stylesheets. In: Proceedings of the fifth international workshop on web information and data management (WIDM), New Orleans, Louisiana, USAGoogle Scholar
  23. 23.
    Groppe S, Böttcher S, Birkenheuer G (2004) Efficient querying of transformed XML Documents. In: Proceedings of the 6th international conference on enterprise information systems (ICEIS), Porto, PortugalGoogle Scholar
  24. 24.
    Groppe S, Böttcher S, Birkenheuer G, Höing A: Reformulating XPath queries and XSLT queries on XSLT views. Data Knowl Eng J (DKE) 57(1), 64–110 (2006)CrossRefGoogle Scholar
  25. 25.
    Groppe S, Böttcher S, Heckel R, Birkenheuer G (2004) Using XSLT stylesheets to transform XPath queries. In: Proceedings of the eighth East-European conference on advances in databases and information systems (ADBIS), Budapest, HungaryGoogle Scholar
  26. 26.
    Jain S, Mahajan R, Suciu D (2002) Translating XSLT programs to efficient SQL queries. In: Proceedings of the eleventh international world wide web conference (WWW2002), Honolulu, Hawaii, USAGoogle Scholar
  27. 27.
    Kay MH (2004) Saxon - The XSLT and XQuery Processor. http://saxon.sourceforge.net
  28. 28.
    Krishnamurthy R, Kaushik R, Naughton JF (2004) Efficient XML-to-SQL query translation: where to add the intelligence? In: Proceedings of the thirtieth international conference on very large data bases (VLDB), Toronto, CanadaGoogle Scholar
  29. 29.
    Lechner S, Preuner G, Schrefl M (2001) Translating XQuery into XSLT, In: ER Workshops, Yokohama, JapanGoogle Scholar
  30. 30.
    Leung Hp, Chung KFL, Chan SCf: On the use of hierarchical information in sequential mining-based XML document similarity computation. Knowl Inf Syst 7(4), 476–498 (2005)CrossRefGoogle Scholar
  31. 31.
    Megginson D (2000) SAX. http://www.saxproject.org/
  32. 32.
    Marian A, Siméon J (2003) Projecting XML documents. In: Proceedings of the 29th international conference on very large data bases (VLDB), Berlin, GermanyGoogle Scholar
  33. 33.
    Moerkotte G (2002) Incorporating XSL processing into database engines. In: Proceedings of the 28th international conference on very large data bases (VLDB), Hong Kong, ChinaGoogle Scholar
  34. 34.
    Mukherjea S: Discovering and analyzing world wide web collections. Knowl Inf Syst 6(2), 230–241 (2004)Google Scholar
  35. 35.
    Nayak R: Fast and effective clustering of XML data using structural information. Knowl Inf Syst 14(2), 197–215 (2008)CrossRefMathSciNetGoogle Scholar
  36. 36.
    Paparizos S, Wu Y, Lakshmanan LVS, Jagadish HV (2004) Tree logical classes for efficient evaluation of XQuery. In: Proceedings of the ACM SIGMOD international conference on management of data (SIGMOD), Paris, FranceGoogle Scholar
  37. 37.
    Rys M (2001) Bringing the Internet to your database: Using SQL Server 2000 and XML to build loosely coupled systems. In: Proceedings of the 17th international conference on data engineering (ICDE), Heidelberg, GermanyGoogle Scholar
  38. 38.
    Shanmugasundaram J, Kiernan J, Shekita E, Fan C, Funderburk J (2001) Querying XML views of relational data. In: Proceedings of 27th international conference on very large data bases (VLDB), Roma, ItalyGoogle Scholar
  39. 39.
    Shanmugasundaram J, Shekita E, Barr R, Carey M, Lindsay B, Pirahesh H, Reinwald B: Efficiently publishing relational data as XML documents. VLDB J 10(2–3), 133–154 (2001)zbMATHGoogle Scholar
  40. 40.
    University Trier (2005) Computer Science Bibliographie. http://dblp.uni-trier.de/
  41. 41.
    Wang L, Mulchandani M, Rundensteiner EA (2003) Updating XQuery views published over relational data: a round-trip case study. In: Proceedings of the first international XML database symposium (XSym), Berlin, GermanyGoogle Scholar
  42. 42.
    Wilfred Ng, Lam WY, Wood PT, Levene M: XCQ: a queriable XML compression system. Knowl Inf Syst 10(4), 421–452 (2006)CrossRefGoogle Scholar
  43. 43.
    W3C (2004) Document Object Model (DOM) Level 3 Core Specification Version 1.0, W3C Recommendation. http://www.w3.org/TR/2004/REC-DOM-Level-3-Core-20040407/
  44. 44.
    W3C (2001) Extensible Stylesheet Language (XSL). W3C Recommendation. http://www.w3.org/Style/XSL/
  45. 45.
    W3C (1999) XML Path Language (XPath) Version 1.0. W3C Recommendation. http://www.w3.org/TR/xpath/

Copyright information

© Springer-Verlag London Limited 2008

Authors and Affiliations

  • Sven Groppe
    • 1
  • Jinghua Groppe
    • 1
  • Stefan Böttcher
    • 2
  • Thomas Wycisk
    • 2
  • Le Gruenwald
    • 3
  1. 1.Institute of Information Systems (IFIS)University of LübeckLübeckGermany
  2. 2.University of PaderbornPaderbornGermany
  3. 3.School of Computer ScienceUniversity of OklahomaNormanUSA

Personalised recommendations