Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

XML Parsing, SAX/DOM

  • Chengkai LiEmail author
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_769


XML parsing is the process of reading an XML document and providing an interface to a user application for accessing the document. An XML parser is a software apparatus that accomplishes such tasks. In addition, most XML parsers check the well-formedness of the XML document and many can also validate the document with respect to a DTD (Document Type Definition) or XML schema. Through the parsing interface, the user application can focus on application logic itself, without dwelling on the details of XML.

There are mainly two categories of XML programming interfaces – DOM (Document Object Model) and SAX (Simple API for XML). DOM is a tree-based interface that models an XML document as a tree of nodes, via which an application can search for nodes, read their information, and update the contents of the nodes. SAX is an event-driven interface. The application registers with the parser various event handlers. As the parser reads an XML document, it generates events for the...

This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Chiu K, Govindaraju M, Bramley R. Investigating the limits of SOAP performance for scientific computing. In: Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing; 2002. p. 246–54.Google Scholar
  2. 2.
    Document Type Declaration, http://www.w3.org/TR/REC-xml/#dt-doctype
  3. 3.
    Farfán F, Hristidis V, Rangaswami R. Beyond lazy XML parsing. In: Proceedings of the 18th International Conference on Database and Expert Systems Applications; 2007. p. 75–86.Google Scholar
  4. 4.
    Harold ER. Processing XML with Java(TM): a guide to SAX, DOM, JDOM, JAXP, and TrAX. Addison-Wesley; 2002.Google Scholar
  5. 5.
    Kostoulas M, Matsa M, Mendelsohn N, Perkins E, Heifets A, Mercaldi M. XML screamer: an integrated approach to high performance XML parsing, validation and deserialization. In: Proceedings of the 15th International World Wide Web Conference; 2006. p. 93–102.Google Scholar
  6. 6.
    Nicola M, John J. XML parsing: a threat to database performance. In: Proceedings of the International Conference on Information and knowledge Management; 2003. p. 175–8.Google Scholar
  7. 7.
    Noga M, Schott S, Löwe W. Lazy XML processing. In: Proceedings of the 2nd ACM Symposium on Document Engineering; 2002. p. 88–94.Google Scholar
  8. 8.
    Takase T, Miyashita H, Suzumura T, Tatsubori M. An adaptive, fast, and safe XML parser based on byte sequences memorization. In: Proceedings of the 14th International World Wide Web Conference; 2005. p. 692–701.Google Scholar
  9. 9.
    Thompson H, Tobin R. Using finite state automata to implement W3C XML schema content model validation and restriction checking. In: Proceedings of the XML Europe; 2003. p. 246–54.Google Scholar
  10. 10.
    Van Engelen R. Constructing finite state automata for high performance XML web services. In: Proceedings of the International Symposium on Web Services; 2004. p. 975–81.Google Scholar
  11. 11.
  12. 12.
    Zhang W, Van Engelen R. A table-driven streaming XML parsing methodology for high-performance web services. In: Proceedings of the IEEE International Conferences on Web Services; 2006. p. 197–204.Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.University of Texas at ArlingtonArlingtonUSA

Section editors and affiliations

  • Sihem Amer-Yahia
    • 1
  1. 1.Laboratoire d’Informatique de GrenobleCNRS and LIGGrenobleFrance