Abstract
This paper presents a bottom-up approach for segmenting document images and labeling the segmented regions with logical names. Our method uses image features in terms of the characteristics of text lines, such as margin and character size, then our method can analyze an unstable document image that has floating elements such as figures and tables. Experimental application of this method to images of technical journals written in Japanese yielded classification rates of 98.6 % for the front pages and 90.0 % for the final pages that have floating elements.
This is a preview of subscription content, log in via an institution.
References
A.Dengel,“ANASTASIL: A System for Low-Level and High-Level Geometric Analysis of Printed Documents,” Structured Document Image Analysis, pp. 70–98, Springer-Verlag, 1992.
K.Iwane, M.Yamaoka and O.Iwaki, “A Functional Classification Approach to Layout Analysis of Document Images,” Proceedings of the Second International Conference on Document Analysis and Recognition, pp. 778–781, 1993.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yamaoka, M., Iwaki, O. (1995). Document layout analysis using pattern classification method. In: Chin, R.T., Ip, H.H.S., Naiman, A.C., Pong, TC. (eds) Image Analysis Applications and Computer Graphics. ICSC 1995. Lecture Notes in Computer Science, vol 1024. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60697-1_156
Download citation
DOI: https://doi.org/10.1007/3-540-60697-1_156
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60697-0
Online ISBN: 978-3-540-49298-6
eBook Packages: Springer Book Archive