Abstract
We investigate the incremental validation of XML documents with respect to DTDs and XML Schemas, under updates consisting of element tag renamings, insertions and deletions. DTDs are modeled as extended context-free grammars, and XML Schemas are abstracted as “specialized DTDs”, which allow element types to be decoupled from element tags. For DTDs, we exhibit an O(m log n) incremental validation algorithm using an auxiliary structure of size O(n), where n is the size of the document and m is the number of updates. For specialized DTDs, we provide an O(m log² n) incremental algorithm, again using an auxiliary structure of size O(n). This is a significant improvement over brute-force re-validation from scratch.
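To give a feel for how a logarithmic per-update cost can arise, the following is a minimal sketch for a single sequence of sibling tags checked against one content model. It is an illustration under stated assumptions, not the construction from the paper: the content model is assumed to be given as a complete DFA, only tag renamings are handled, and the names `IncrementalDFAChecker`, `rename`, `valid` and `DELTA` are hypothetical. Each node of a balanced tree stores the state-to-state map induced by its segment of the word, so a renaming recomputes only the maps on one leaf-to-root path.

```python
from typing import Dict, Iterable, Sequence, Tuple


class IncrementalDFAChecker:
    """Balanced tree over a word; each node stores the DFA state map of its segment."""

    def __init__(self, n_states: int, delta: Dict[Tuple[int, str], int],
                 start: int, accepting: Iterable[int], word: Sequence[str]) -> None:
        self.k = n_states
        self.delta = delta              # assumed total: defined for every (state, symbol)
        self.start = start
        self.accepting = set(accepting)
        self.size = 1
        while self.size < max(len(word), 1):
            self.size *= 2
        identity = tuple(range(self.k))  # padding leaves leave the state unchanged
        self.tree = [identity] * (2 * self.size)
        for i, sym in enumerate(word):
            self.tree[self.size + i] = self._leaf(sym)
        for node in range(self.size - 1, 0, -1):
            self.tree[node] = self._compose(self.tree[2 * node], self.tree[2 * node + 1])

    def _leaf(self, sym: str) -> Tuple[int, ...]:
        # State map of a single symbol: q -> delta(q, sym).
        return tuple(self.delta[(q, sym)] for q in range(self.k))

    @staticmethod
    def _compose(left: Tuple[int, ...], right: Tuple[int, ...]) -> Tuple[int, ...]:
        # Read the left segment first, then the right one.
        return tuple(right[q] for q in left)

    def rename(self, pos: int, sym: str) -> None:
        """Replace the tag at position pos and refresh the ancestors of that leaf."""
        node = self.size + pos
        self.tree[node] = self._leaf(sym)
        node //= 2
        while node >= 1:
            self.tree[node] = self._compose(self.tree[2 * node], self.tree[2 * node + 1])
            node //= 2

    def valid(self) -> bool:
        """The whole word is accepted iff the root map sends start to an accepting state."""
        return self.tree[1][self.start] in self.accepting


# Hypothetical content model "a, b*": state 0 = start, 1 = accepting, 2 = dead.
DELTA = {(0, "a"): 1, (0, "b"): 2,
         (1, "a"): 2, (1, "b"): 1,
         (2, "a"): 2, (2, "b"): 2}

checker = IncrementalDFAChecker(3, DELTA, start=0, accepting=[1], word="abbb")
```

With a fixed automaton, each call to `rename` touches one node per tree level, so m renamings cost on the order of m log n map compositions, and the tree itself takes space linear in the word length. Insertions and deletions would need a tree that can be rebalanced, and a full document would need one such structure per sequence of siblings; neither is shown here.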
Authors supported in part by the NSF under grant numbers IRI-9734548, IRI-9221268 and Digital Government 9983510.
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Papakonstantinou, Y., Vianu, V. (2003). Incremental Validation of XML Documents. In: Calvanese, D., Lenzerini, M., Motwani, R. (eds) Database Theory — ICDT 2003. ICDT 2003. Lecture Notes in Computer Science, vol 2572. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36285-1_4
DOI: https://doi.org/10.1007/3-540-36285-1_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00323-6
Online ISBN: 978-3-540-36285-2
eBook Packages: Springer Book Archive