Structural Classification for Retrospective Conversion of Documents
This paper describes the structural classification method used in a strategy for retrospective conversion of documents. This strategy consists in an cycle in which document analysis and document understanding interact. This cycle is initialized by the extraction of the outline of the layout and logical structures of the document. Then, each iteration of the cycle consists in the detection and the processing of inconsistencies in the document modeling. The cycle ends when no more inconsistency occurs.
A structural representation is used to describe documents. This representation is detailed.
Retrospective conversion consists in identifying each entity of the document and its structures as well. The structural classification method based on graph comparison is used at several levels of this process. Graph comparison is also used in the learning of generic entities.
Keywordsretrospective conversion document structure
- 1.Sébastien Diana, Éric Trupin, Frédéric Jouzel, Jacques Labiche, and Yves Lecoutier. From acquisition to modelisation of a form base to retrieve information. In Fouth International Conference on Document Analysis and Recognition. IAPR, 1997.Google Scholar
- 3.International Standard Organization. ISO 8613: Information Processing. Text and Office System, Office Document Architecture (ODA) and Interchange Format, 1989.Google Scholar
- 4.Laurent Miclet. Méthodes structurelles pour la reconnaissances de formes. Eyrolles, 1984.Google Scholar
- 5.Jean-Marc Ogier, Rémy Mullot, Jacques Labiche, and Yves Lecourtier. Interprétation de document par cycles perceptifs de construction d’objets cohérents. application aux données cadastrales. Traitement du Signal, 12(6):627–637, 1995.Google Scholar