Low-Level Document Image Analysis and Description: From Appearance to Structure

Salerno, Emanuele; Savino, Pasquale; Tonazzini, Anna

doi:10.1007/978-3-642-22913-8_16

Emanuele Salerno⁴,
Pasquale Savino⁴ &
Anna Tonazzini⁴

Part of the book series: Studies in Computational Intelligence ((SCI,volume 375))

653 Accesses

Abstract

This chapter deals with the problem of processing and analyzing digital images of ancient or degraded documents to increase the possibilities of inferring their structures. Classification and recognition are needed to infer structure but, when dealing with degraded documents, they are particularly difficult to apply directly to unprocessed images. This is why an intermediate step is needed that extracts automatically the “perceptual components” of the documents from their appearance. By “appearance” of a document, we mean the “raw” data set, containing the “sensorial components” of the object under study. Ancient documents of historical importance pose specific problems that are now being solved with the help of information technology. As much information as possible should be drawn from the physical documents and should be structured so as to permit specialized searches to be performed in large databases. The tools we use to treat unstructured, low-level information are both mathematical and descriptive. Under a mathematical point of view, we model our appearance as a function of all the perceptual components, or patterns we want to identify. Once the model has been established, its parameters can be learned from the data available and from reasonable assumptions on both the model itself and the patterns. Our descriptive tools form a specialized metadata schema that can help both the storage and the indexing of all the digital objects produced to represent the original document, and provides a complete description of all the processing performed. Suitable links fully interconnect the various descriptions in order to relate the different representations of the physical object and to trace the history of all the processing performed. Inferring structure is much easier by analyzing the patterns and their mutual relationships than by analyzing the appearance.

This work was partially supported by European funds through POR Calabria FESR 2007-2013, PIA 2008 project No. 1220000119. Partners: TEA sas di Elena Console & C. (CZ), Istituto di Scienza e Tecnologie dell’Informazione CNR (PI), Dipartimento di Meccanica Università dalla Calabria (CS).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Amato, G., Gennaro, C., Rabitti, F., Savino, P., Milos, A.: A multimedia content management system for digital library applications. In: ECDL 2004, pp. 14–25 (September 12-17, 2004)
Google Scholar
Amato, G., Cigarrán, J.M., Gonzalo, J., Peters, C., Savino, P.: MultiMatch – multilingual/Multimedia access to cultural heritage. In: Kovács, L., Fuhr, N., Meghini, C. (eds.) ECDL 2007. LNCS, vol. 4675, pp. 505–508. Springer, Heidelberg (2007)
Chapter Google Scholar
Bedini, L., Gerace, I., Tonazzini, A.: A deterministic algorithm for reconstructing images with interacting discontinuities. CVGIP: Graph. Models Image Process. 56(2), 109
Google Scholar
Bell, A.J., Sejnowski, T.J.: An Information-Maximization Approach to Blind Separation and Blind Deconvolution. Neural Comp. 7, 1129
Google Scholar
Blake, A., Zissermann, A.: Visual Reconstruction. MIT Press, Cambridge (1987)
Google Scholar
Cichocki, A., Amari, S.-I.: Adaptive Blind Signal and Image Processing. Wiley, New York (2002)
Book Google Scholar
Debole, F., Savino, P., Eckes, G.: Searching and browsing film archives: The European film gateway approach. In: Proc. 4th Int. Congr. Sci. & Technol. Safeguard Cultural Heritage in the Mediterranean Basin, Cairo, Egypt, December 6-8, vol. I, pp. 359–364 (2009)
Google Scholar
The Dublin Core Metadata Initiative, http://dublincore.org/
Functional Requirements for Bibliographic Records, http://www.ifla.org/en/publications/functional-requirements-for-bibliographic-records
Gennaro, C., Rabitti, F., Savino, P.: The Use of XML in a video digital library. Intelligent Search on XML Data, 19–38 (2003)
Google Scholar
Hyvärinen, A., Karhunen, J., Oja, E.: Independent component analysis. Wiley, New York (2001)
Book Google Scholar
Merrikh-Bayat, F., Babaie-Zadeh, M., Jutten, C.: A Nonlinear blind source separation solution for removing the show-through effect in the scanned documents. In: Proc. Eur. Signal Processing Conf, EUSIPCO 2008, Lausanne, Switzerland, August 25-29 (2008)
Google Scholar
Ohta, Y., Kanade, T., Sakai, T.: Color information for region segmentation. Computer Graphics and Image Processing 13(3), 222
Google Scholar
Salembier, P., Sikora, T., Manjunath, B.: Introduction to MPEG-7: Multimedia content description interface. Wiley, New York (2002)
Google Scholar
Salerno, E., Tonazzini, A., Bedini, L.: Digital image analysis to enhance underwritten text in the Archimedes palimpsest. Int. J. Doc. Anal. Rec. 9(2-4), 79
Google Scholar
Savino, P., Peters, C.: ECHO: a digital library for historical film archives, Int. J. Int. J. on Digital Libraries 4(3-7), 1
Google Scholar
Sharma, G.: Show-through cancellation in scans of duplex printed documents. IEEE Trans. Image Processing 10(5), 736
Google Scholar
Tai, Y.W., Jia, J., Tang, C.K.: Soft color segmentation and its applications. IEEE Trans. Patt. Anal. Mach. Intell. 29, 1520
Google Scholar
Tonazzini, A., Salerno, E., Mochi, M., Bedini, L.: Bleed-through removal from degraded documents using a color decorrelation method. In: Marinai, S., Dengel, A.R. (eds.) DAS 2004. LNCS, vol. 3163, pp. 229–240. Springer, Heidelberg (2004)
Chapter Google Scholar
Tonazzini, A., Salerno, E., Mochi, M., Bedini, L.: Blind source separation techniques for detecting hidden texts and textures in document images. In: Campilho, A.C., Kamel, M.S. (eds.) ICIAR 2004. LNCS, vol. 3212, pp. 241–248. Springer, Heidelberg (2004)
Chapter Google Scholar
Tonazzini, A., Bedini, L., Salerno, E.: Independent component analysis for document restoration. Int. J. Doc. Anal. Rec. 7(1), 17
Google Scholar
Tonazzini, A., Bedini, L., Salerno, E.: A Markov model for blind image separation by a mean-field EM algorithm. IEEE Trans. Image Processing 15(2), 473
Google Scholar
Tonazzini, A., Salerno, E., Bedini, L.: Fast correction of bleed-through distortion in grayscale documents by a blind source separation technique. Int. J. Doc. Anal. Rec. 10(1), 17
Google Scholar
Tonazzini, A., Bianco, G., Salerno, E.: Registration and enhancement of double-sided degraded manuscripts acquired in multispectral modality. In: Ramos, O., Karatzas, D. (eds.) Int. Conf. Doc. Anal. Rec., ICDAR 2009, pp. 546–550. IEEE Computer Society Press, Los Alamitos (2009)
Google Scholar
Tonazzini, A., Gerace, I., Martinelli, F.: Multichannel blind separation and deconvolution of images for document analysis. IEEE Trans. Image Processing 19(4), 912
Google Scholar
Extensible Markup Language, http://www.w3.org/XML/

Download references

Author information

Authors and Affiliations

CNR - Istituto di Scienza e Tecnologie dell’Informazione, Via Moruzzi 1, 56124, Pisa, Italy
Emanuele Salerno, Pasquale Savino & Anna Tonazzini

Authors

Emanuele Salerno
View author publications
You can also search for this author in PubMed Google Scholar
Pasquale Savino
View author publications
You can also search for this author in PubMed Google Scholar
Anna Tonazzini
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of New York Tirana, Rr. Komuna E Parisit,, Tirana, Albania
Marenglen Biba
Technical University of Catalonia, Campus Nord, Ed. Omega, C/Jordi Girona 1-3, 08034, Barcelona, Spain
Fatos Xhafa

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Salerno, E., Savino, P., Tonazzini, A. (2011). Low-Level Document Image Analysis and Description: From Appearance to Structure. In: Biba, M., Xhafa, F. (eds) Learning Structure and Schemas from Documents. Studies in Computational Intelligence, vol 375. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22913-8_16

Download citation

DOI: https://doi.org/10.1007/978-3-642-22913-8_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22912-1
Online ISBN: 978-3-642-22913-8
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics