Evolution of XPath Lists for Document Data Selection

  • Pablo García-Sánchez
  • Juan J. Merelo Guervós
  • Pedro Ángel Castillo
  • Jesús González
  • Juan L. Jiménez Laredo
  • Antonio M. Mora García
  • Maria I. García Arenas
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6239)

Abstract

XML has become a standard for structured data, and transformations from one specific format to another are very often needed. XSLT stylesheets are programs designed for this purpose, and they use XPath expressions to select sets of nodes within the document. This paper presents a new version of an evolutionary algorithm that creates XSLT stylesheets from examples. It improves on previously obtained results by testing a new individual representation with a new set of operators, based mainly on the evolution of XPath expressions within a fixed XSLT program structure. The experiments show that this new representation, and a smaller set of operators, yield better results in fewer generations than our previous version.
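
As a rough illustration of the idea summarised above (a list of XPath expressions evaluated inside a fixed program structure and scored against an example output), the following minimal Python sketch may help. It is not the authors' implementation: the sample document, the function names `select_text`, `apply_individual` and `fitness`, and the mismatch-counting score are all assumptions made for this example. It uses the standard-library `xml.etree.ElementTree` module, which supports only a limited subset of XPath.

```python
import xml.etree.ElementTree as ET

# Hypothetical input document, invented for this example.
INPUT_XML = """
<library>
  <book><title>Dune</title><author>Herbert</author></book>
  <book><title>Emma</title><author>Austen</author></book>
</library>
"""


def select_text(root, xpath):
    """Return the text of every node matched by one XPath expression."""
    return [node.text for node in root.findall(xpath)]


def apply_individual(root, xpath_list):
    """Fixed program structure (assumed): evaluate each XPath in order
    and concatenate the selected text values."""
    output = []
    for xpath in xpath_list:
        output.extend(select_text(root, xpath))
    return output


def fitness(produced, expected):
    """Assumed score, lower is better: mismatched positions plus the
    difference in length between produced and expected output."""
    mismatches = sum(1 for p, e in zip(produced, expected) if p != e)
    return mismatches + abs(len(produced) - len(expected))


root = ET.fromstring(INPUT_XML)
expected = ["Dune", "Emma"]   # example output the individual should reproduce
good = ["./book/title"]       # candidate individual: one XPath in the list
bad = ["./book/author"]       # another candidate that selects the wrong nodes

print(fitness(apply_individual(root, good), expected))
print(fitness(apply_individual(root, bad), expected))
```

In this sketch only the XPath list changes between candidates while the surrounding program stays fixed, so an evolutionary search would only need operators that edit the strings in the list.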

Keywords

Evolutionary Algorithm; XPath Expression; Input Document; Document Engineer; Population Genetic Program
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Pablo García-Sánchez (1)
  • Juan J. Merelo Guervós (1)
  • Pedro Ángel Castillo (1)
  • Jesús González (1)
  • Juan L. Jiménez Laredo (1)
  • Antonio M. Mora García (1)
  • Maria I. García Arenas (1)
  1. Dept. of Computer Architecture and Technology, University of Granada, Spain
