Agent-Based Text Extraction from Pyramid Images

  • Conference paper
International Conference on Advances in Pattern Recognition

Abstract

This paper describes a system in which multiple agents operate on a pyramid structure to extract text. The method is based on the observation that text strings appear as different groupings of connected components at appropriate resolutions. The pyramid structure, a multi-resolution image representation, is amenable to parallel processing for the detection of text strings. Agents in the system individually and concurrently look for groups of connected components at appropriate levels. They may in turn spawn new agents when connected components become disjoint at finer resolution levels. Unlike other existing methods, the agent-based pyramidal operations do not require expensive feature analysis among different connected components to detect text strings.
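The descend-and-spawn idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the 2x2 OR-reduction used to build the pyramid, the 8-connectivity, the sequential queue standing in for concurrent agents, and all function names (`reduce_level`, `build_pyramid`, `components`, `children`, `run_agents`) are assumptions made for illustration. The sketch covers only the bookkeeping of agents following components down the pyramid and spawning when a component splits; it does not decide which groups are text.

```python
from collections import deque


def reduce_level(img):
    """Halve the resolution. A coarse pixel is 1 if any pixel in its
    2x2 block is 1 (OR-reduction, an assumed reduction rule)."""
    h, w = len(img), len(img[0])
    out = [[0] * ((w + 1) // 2) for _ in range((h + 1) // 2)]
    for y in range(h):
        for x in range(w):
            if img[y][x]:
                out[y // 2][x // 2] = 1
    return out


def build_pyramid(img, levels):
    """Return [finest, ..., coarsest] for a binary image."""
    pyr = [img]
    for _ in range(levels - 1):
        pyr.append(reduce_level(pyr[-1]))
    return pyr


def components(img, region=None):
    """8-connected foreground components, optionally restricted to a
    set of (y, x) cells. Each component is a set of cells."""
    h, w = len(img), len(img[0])
    if region is None:
        region = {(y, x) for y in range(h) for x in range(w)}
    seen, comps = set(), []
    for start in sorted(region):
        if start in seen or not img[start[0]][start[1]]:
            continue
        comp, queue = set(), deque([start])
        seen.add(start)
        while queue:
            y, x = queue.popleft()
            comp.add((y, x))
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    n = (y + dy, x + dx)
                    if n in region and n not in seen and img[n[0]][n[1]]:
                        seen.add(n)
                        queue.append(n)
        comps.append(comp)
    return comps


def children(cells, finer):
    """Cells at the next finer level covered by the given coarse cells."""
    h, w = len(finer), len(finer[0])
    return {
        (2 * y + dy, 2 * x + dx)
        for y, x in cells
        for dy in (0, 1)
        for dx in (0, 1)
        if 2 * y + dy < h and 2 * x + dx < w
    }


def run_agents(pyr):
    """Start one agent per component at the coarsest level. Each agent
    moves one level finer; if its component has become disjoint there,
    one agent continues per part (the extra ones count as spawned)."""
    top = len(pyr) - 1
    agents = deque((top, comp) for comp in components(pyr[top]))
    finest, spawned = [], 0
    while agents:
        level, comp = agents.popleft()
        if level == 0:
            finest.append(comp)
            continue
        finer = pyr[level - 1]
        parts = components(finer, children(comp, finer))
        spawned += len(parts) - 1
        agents.extend((level - 1, part) for part in parts)
    return finest, spawned
```

For example, with `img = [[1, 0, 0, 1], [0, 0, 0, 0]]` and `build_pyramid(img, 2)`, the two foreground pixels merge into a single component at the coarse level, and `run_agents` reports one spawned agent when that component separates again at the finer level.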

Copyright information

© 1999 Springer-Verlag London Limited

About this paper

Cite this paper

Tan, C.L., Yuan, B., Ang, C.H. (1999). Agent-Based Text Extraction from Pyramid Images. In: Singh, S. (eds) International Conference on Advances in Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-0833-7_35

  • DOI: https://doi.org/10.1007/978-1-4471-0833-7_35

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-1214-3

  • Online ISBN: 978-1-4471-0833-7

  • eBook Packages: Springer Book Archive
