Abstract
This paper describes the structural classification method used in a strategy for retrospective conversion of documents. This strategy consists in an cycle in which document analysis and document understanding interact. This cycle is initialized by the extraction of the outline of the layout and logical structures of the document. Then, each iteration of the cycle consists in the detection and the processing of inconsistencies in the document modeling. The cycle ends when no more inconsistency occurs.
A structural representation is used to describe documents. This representation is detailed.
Retrospective conversion consists in identifying each entity of the document and its structures as well. The structural classification method based on graph comparison is used at several levels of this process. Graph comparison is also used in the learning of generic entities.
Chapter PDF
Similar content being viewed by others
References
Sébastien Diana, Éric Trupin, Frédéric Jouzel, Jacques Labiche, and Yves Lecoutier. From acquisition to modelisation of a form base to retrieve information. In Fouth International Conference on Document Analysis and Recognition. IAPR, 1997.
Pierre Héroux, Sébastien Diana, Éric Trupin, and Yves Lecourtier. A structural classifier to automatically identify form classes. Advances in Pattern Recognition, Lecture Notes in Computer Science, 1451:429–439, 1998.
International Standard Organization. ISO 8613: Information Processing. Text and Office System, Office Document Architecture (ODA) and Interchange Format, 1989.
Laurent Miclet. Méthodes structurelles pour la reconnaissances de formes. Eyrolles, 1984.
Jean-Marc Ogier, Rémy Mullot, Jacques Labiche, and Yves Lecourtier. Interprétation de document par cycles perceptifs de construction d’objets cohérents. application aux données cadastrales. Traitement du Signal, 12(6):627–637, 1995.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Héroux, P., Trupin, É., Lecoutier, Y. (2000). Structural Classification for Retrospective Conversion of Documents. In: Ferri, F.J., Iñesta, J.M., Amin, A., Pudil, P. (eds) Advances in Pattern Recognition. SSPR /SPR 2000. Lecture Notes in Computer Science, vol 1876. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44522-6_16
Download citation
DOI: https://doi.org/10.1007/3-540-44522-6_16
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67946-2
Online ISBN: 978-3-540-44522-7
eBook Packages: Springer Book Archive