User-driven page layout analysis of historical printed books

Ramel, J. Y.; Leriche, S.; Demonet, M. L.; Busson, S.

doi:10.1007/s10032-007-0040-6

User-driven page layout analysis of historical printed books

Original Paper
Published: 02 March 2007

Volume 9, pages 243–261, (2007)
Cite this article

International Journal of Document Analysis and Recognition (IJDAR) Aims and scope Submit manuscript

J. Y. Ramel¹,
S. Leriche¹,
M. L. Demonet² &
…
S. Busson²

308 Accesses
35 Citations
Explore all metrics

Abstract

In this paper, based on the study of the specificity of historical printed books, we first explain the main error sources in classical methods used for page layout analysis. We show that each method (bottom-up and top-down) provides different types of useful information that should not be ignored, if we want to obtain both a generic method and good segmentation results. Next, we propose to use a hybrid segmentation algorithm that builds two maps: a shape map that focuses on connected components and a background map, which provides information about white areas corresponding to block separations in the page. Using this first segmentation, a classification of the extracted blocks can be achieved according to scenarios produced by the user. These scenarios are defined very simply during an interactive stage. The user is able to make processing sequences adapted to the different kinds of images he is likely to meet and according to the user needs. The proposed “user-driven approach” is capable of doing segmentation and labelling of the required user high level concepts efficiently and has achieved above 93% accurate results over different data sets tested. User feedbacks and experimental results demonstrate the effectiveness and usability of our framework mainly because the extraction rules can be defined without difficulty and parameters are not sensitive to page layout variation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Akindele, O., Belaid, A.: Page segmentation by segment tracing. In: Proceedings of the 2nd International Conference One Document Analysis and Recognition, Tsukuba pp. 341–344 (1993)
Antonacopoulos A. (1998). Page segmentation using the description of the background. Comput. Vis. Image Underst. Spec Issue Doc. Image Underst. Retrieval 70(3): 350–369
Google Scholar
Baird, H.S., Jones, S.E., Fortune, S.J.: Image segmentation by shape-directed covers. In: Proceedings of International Conference on Pattern Recognition, Atlantic City, NJ, 820–825 June 1990
Baird, H.: Background structure in document images. In: Bunke, H. (ed.) Proceedings of the Advances in Structural and Syntactical Pattern Recognition, Singapore pp. 253–269 (1992)
Belaïd, A.: Computer aided design of models of page for their use in recognition of documents, Workshop one Electronic Page Models (LAMPE’ 97), Lausanne (1997)
Breuel, T.M.: An algorithm for finding maximal whitespace rectangles at arbitrary orientations for document layout analysis. International Conference for Document Analysis and Recognition (ICDAR 2003), Edinburg vol. 1, pp. 66–70 (2003)
Chen S. and Haralick R.M. (1995). Recursive erosion, dilation, opening and closing transforms. IEEE. Trans. Image. Process. 4(3): 335–345
Article Google Scholar
Book scanner web site: http://www.i2s-bookscanner.com/fr/ default.asp
Dori, D., Doermann, D., Shin, C., Haralick, R., Phillips, I., Buchman, M., Ross, D.: The representation of document structure: a generic object-process analysis. In: Bunke, H., Wang, P.S.P. (eds.) Handbook of Character Recognition and Document Image Analysis, chapter 16, pp. 421–456. World Scientific, London (1997)
Etemad K., Doermann D. and Chellappa R. (1997). Multiscale segmentation of unstructured document pages using soft decision integration. IEEE Trans. Pattern Anal. Mach. 19(1): 92–96
Article Google Scholar
Jain A.K. and Bhattacharjee S. (1992). Text segmentation using gabor fillters for automatic document processing. Mach. Vis. Appl. 5(3): 169–184
Google Scholar
Jain A.K. and Zhong Y. (1996). Page segmentation using texture analysis. Pattern Recognit. 29(5): 743–770
Article Google Scholar
Hadjar, K., Hitz, O., Ingold, R.: Newpaper page decomposition using Split and merge approach. In: Proceedings of the 5th International Conference one Document Analysis and Recognition, Seatle pp. 1186–1191 (2001)
Hadjar, K., Hitz, O., Robadey, L., Ingold, R.: Configuration recognition model for complex transfers methods engineering: 2(CREM). In: Proceedings of the 5th International Workshop one Document Analysis Systems, Princeton pp. 469–479 (2002)
He, J., Downton, A.: User-assisted archive document image analysis for digital library construction. In: Proceedings of the 6th International Conference one Document Analysis and Recognition, Edinburg pp. 498–502 (2003)
Journet, N., Eglin, V., Ramel, J.Y., Mullot, R.: Text/graphic labelling of ancient printed documents. In: Proceedings of the International Conference on Document Analysis and Recognition (ICDAR’05). Séoul, Corée, pp. 1010–1014 (2005)
Kise K., Sato A. and Iwata M. (1998). Segmentation of page images using the area voronoi diagram. Comput. Vis. Image Underst. Spec. issue doc. Image Underst. Retrieval 70(3): 370–382
Google Scholar
Le D.S., Thoma G.R. and Wechsler H. (1994). Automated page orientation and skew angle detection for binary document images. Pattern Recognit. 27(10): 1325–1344
Article Google Scholar
LeBourgeois, F., Emptoz, H.: document analysis in gray level and typography extraction using character pattern redundancies. In: proceedings of the 5th International Conference on Document Analysis and Recognition, Bengalore, India, pp. 177–180 (1999)
Lebourgeois F., Emptoz H. and Trinh E. (2003). Compression et accessibilité aux images de documents numérisés/application au projet debora. Doc. Numérique. 7(3–4): 103–127
Article Google Scholar
Lee S.W. and Ryu D.S. (2001). Parameter-free geometric document layout analysis. IEEE Trans. Pattern Anal. Mach. 23(11): 1240–1256
Article Google Scholar
Marinai S., Gori M. and Soda G. (2005). Artificial neural networks for document analysis and recognition. IEEE Trans. Pattern Anal. Mach. Intell. 27(1): 23–35
Article Google Scholar
Min, Y., Cho, S.-B., Lee, Y.: A data reduction method for efficient document skew estimation based on hough transformation. In: Proceedings of the 13th International Conference on Pattern Recognition, Vienna, Austria, pp. 732–736 (1996)
Nagy, G., Seth, S.: Hierarchical representation of optically scanned documents. In: Proceedings of the 7th International Conference on Pattern Recognition (ICPR), Montreal pp. 347–349 (1984)
Nagy G., Seth S., Krishnamoorthy M. and Viswanathan M. (1993). Syntactic segmentation and labeling of digitized pages from technical journals. IEEE Trans. Pattern Anal. Mach. Intell. 15(7): 737–747
Article Google Scholar
O’ Gorman L. (1993). The document spectrum for page analysis layout. IEEE Trans. Pattern Anal. Mach. Intell. 15(11): 1162–1173
Article Google Scholar
O’ Gorman L. and Kasturi R. (1995). Document image analysis. IEEE Computer Society Press, Los Alamitos, CA
Google Scholar
Ramel, J.Y., Vincent, N., Emptoz, H.: Extraction contextuelle d’entités graphiques dans les dessins : du plus simple au plus complexe.... Colloque International Francophone sur l’Ecrit et le Document, Quebec, Canada, pp. 453–462 (1998)
Trinh, E.: De la numérization à la consultation de documents anciens. Thèse de doctorat en Informatique, Insa de Lyon, (2003)
Sauvola J. and Pietikainen M. (2000). Adaptive document image binarization. Pattern Recognit. 33: 225–236
Article Google Scholar
Spitz A.L. (1998). Analysis of compressed document images for dominant skew, multiple skew and logotype detection. Comput. Vis. Image Underst. 70(3): 321–334
Article Google Scholar
Wang D. and Srihari S.N. (1989). Classification of newspaper image blocks using texture analysis. Comput. Vis. Graph. Image Process. 47: 327–352
Article Google Scholar
Wang, S.-Y., Yagasaki, T.: Block selection: a method for segmenting page image of various editing styles. In: Proceedings of the 3th International Conference on Document Analysis and Recognition, Montreal, Canada, pp. 128–133 (1995)
Wang Y., Phillips IT. and Haralick RM. (2006). Document zone content classification and its performance evaluation. Pattern Recognit. 39: 57–73
Article Google Scholar
Wong K.Y., Casey R.G. and Wahl F.M. (1982). Document Analysis System. IBM J. Res. Dev. 26(6): 647
Article Google Scholar

Download references

Author information

Authors and Affiliations

Lab. d’Informatique, Ecole Polytechnique de l’Université de Tours, 64, avenue Jean Portalis, 37200, Tours, France
J. Y. Ramel & S. Leriche
CESR, UMR 6576 du CNRS, 59, rue Néricault-Destouches, BP 11328, 37013, Tours, France
M. L. Demonet & S. Busson

Authors

J. Y. Ramel
View author publications
You can also search for this author in PubMed Google Scholar
S. Leriche
View author publications
You can also search for this author in PubMed Google Scholar
M. L. Demonet
View author publications
You can also search for this author in PubMed Google Scholar
S. Busson
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to J. Y. Ramel.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ramel, J.Y., Leriche, S., Demonet, M.L. et al. User-driven page layout analysis of historical printed books. IJDAR 9, 243–261 (2007). https://doi.org/10.1007/s10032-007-0040-6

Download citation

Received: 10 February 2005
Revised: 12 January 2007
Accepted: 30 January 2007
Published: 02 March 2007
Issue Date: April 2007
DOI: https://doi.org/10.1007/s10032-007-0040-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

User-driven page layout analysis of historical printed books

Abstract

Access this article

Similar content being viewed by others

A comprehensive survey of image segmentation: clustering methods, performance parameters, and benchmark datasets

A survey of density based clustering algorithms

Datasets and annotations for layout analysis of scientific articles

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

User-driven page layout analysis of historical printed books

Abstract

Access this article

Similar content being viewed by others

A comprehensive survey of image segmentation: clustering methods, performance parameters, and benchmark datasets

A survey of density based clustering algorithms

Datasets and annotations for layout analysis of scientific articles

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation