Abstract
This paper describes a multi-agent system that extracts text by operating on a pyramid structure. The method rests on the observation that text strings appear as different groupings of connected components at appropriate resolutions. The pyramid, a multi-resolution image representation, lends itself to parallel detection of text strings. Agents in the system work individually and concurrently, each looking for groups of connected components at the appropriate levels, and may in turn spawn new agents when connected components become disjoint at finer resolution levels. Unlike other existing works, the agent-based pyramidal operations detect text strings without expensive feature analysis among different connected components.
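The abstract gives no algorithmic detail, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a binary image, a pyramid built by 2x2 OR-reduction, 8-connected components, and a sequential work queue standing in for concurrent agents. All function names (`reduce_level`, `build_pyramid`, `components`, `project_down`, `run_agents`) are hypothetical.

```python
from collections import deque


def reduce_level(img):
    """Halve the resolution: a coarse pixel is 1 if any pixel in its 2x2 block is 1.
    (Assumed reduction rule; the paper may use a different one.)"""
    h, w = len(img), len(img[0])
    return [[int(any(img[y][x]
                     for y in range(2 * r, min(2 * r + 2, h))
                     for x in range(2 * c, min(2 * c + 2, w))))
             for c in range((w + 1) // 2)]
            for r in range((h + 1) // 2)]


def build_pyramid(img, levels):
    """Return [finest, ..., coarsest]; index 0 is the original image."""
    pyramid = [img]
    for _ in range(levels - 1):
        pyramid.append(reduce_level(pyramid[-1]))
    return pyramid


def components(img, region=None):
    """8-connected foreground components, optionally restricted to a set of (row, col)."""
    h, w = len(img), len(img[0])
    if region is None:
        region = {(r, c) for r in range(h) for c in range(w)}
    candidates = {(r, c) for (r, c) in region if r < h and c < w and img[r][c]}
    seen, found = set(), []
    for start in sorted(candidates):
        if start in seen:
            continue
        seen.add(start)
        queue, comp = deque([start]), set()
        while queue:
            r, c = queue.popleft()
            comp.add((r, c))
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    n = (r + dr, c + dc)
                    if n in candidates and n not in seen:
                        seen.add(n)
                        queue.append(n)
        found.append(comp)
    return found


def project_down(comp):
    """Map coarse pixels to the 2x2 blocks they cover one level finer."""
    return {(2 * r + dr, 2 * c + dc) for r, c in comp for dr in (0, 1) for dc in (0, 1)}


def run_agents(pyramid):
    """Each agent owns one component. Descending a level, it spawns one agent per
    piece the component splits into. Agents are simulated sequentially here."""
    top = len(pyramid) - 1
    agents = deque((top, comp) for comp in components(pyramid[top]))
    trace = []  # (level, component size in pixels, number of spawned agents)
    while agents:
        level, comp = agents.popleft()
        if level == 0:
            trace.append((level, len(comp), 0))
            continue
        pieces = components(pyramid[level - 1], project_down(comp))
        trace.append((level, len(comp), len(pieces)))
        agents.extend((level - 1, piece) for piece in pieces)
    return trace
```

In this sketch, two character-like blobs separated by a one-pixel gap merge into a single component one level up, so the single top-level agent spawns two agents when it descends. A level at which neighbouring characters fuse into one component is a candidate text string under these assumptions.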
Copyright information
© 1999 Springer-Verlag London Limited
About this paper
Cite this paper
Tan, C.L., Yuan, B., Ang, C.H. (1999). Agent-Based Text Extraction from Pyramid Images. In: Singh, S. (eds) International Conference on Advances in Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-0833-7_35
DOI: https://doi.org/10.1007/978-1-4471-0833-7_35
Publisher Name: Springer, London
Print ISBN: 978-1-4471-1214-3
Online ISBN: 978-1-4471-0833-7
eBook Packages: Springer Book Archive