Automatic Metadata Retrieval from Ancient Manuscripts
The paper presents a document analysis system to retrieve metadata from digitized ancient manuscripts. This platform has been developed to assist researchers, historians and libraries to process a wide variety of manuscripts written in different languages. In order to retrieve different metadata from various digitized documents, we propose a user-training system, which use robust approaches based on a sequential bottom-up process. We develop a low-level segmentation and a basic recognition stage which do not use prior knowledge on documents contents. Our objective was to study the feasibility to process a large variety of manuscripts with the same platform, which can be used by non-specialists in image analysis.
KeywordsDigitize Document Document Image Analysis Regular Text Metadata Extraction Chapter Title
- 1.LeBourgeois, F., et al.: Document Images Analysis Solutions for Digital Libraries. In: Proc. Of the DIAL 2004, Palo Alto, California, January 23-24, pp. 2–24 (2004)Google Scholar
- 2.Kaileh, H.: L’accès à distance aux manucrits arabes numérisés en mode image. PhD, university Lyon II, France, January 28, p. 445 (2004)Google Scholar
- 5.Bres, S., Jolion, J.-M., Le Bourgeois, F.: Traitement et analyse des images numériques, Hermes, p. 412 (2003)Google Scholar