Distinguishing between Handwritten and Machine Printed Text in Bank Cheque Images

  • José Eduardo Bastos Dos Santos
  • Bernard Dubuisson
  • Flávio Bortolozzi
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2423)

Abstract

In the current literature about textual element identification in bank cheque images, many strategies put forward are strongly dependent on document layout. This means searching and employing contextual information as a pointer to a search region on the image. However human handwriting, as well as machine printed characters, are not dependent on the document in which they are inserted. Components of handwritten and machine printed behavior can be maintained in a generic and independent way. Based on these observations this paper presents a new approach to identifying textual elements from a set of local features enabling the category of a textual element to be established, without needing to observe its environment. The use of local features might allow a more generic and reach classificatory process, enabling it in some cases to be used over different sorts of documents. Based on this assumption, in our tests we used bank cheque images from Brazil, USA, Canada and France. The preliminary results show the efficiency and the potential of this approach.

References

  1. 1.
    John D. Hobby. Using shape and layout information to find signatures, text and graphics. Computer Vision and Image Understanding, 80(1): 88–110, October, 2000.MATHCrossRefGoogle Scholar
  2. 2.
    J. E. B. Santos, B. Dubuisson and F. Bortolozzi. Handwritten Text Extraction from Bank Cheque Images by a Multivariate Classification Process. 6th World Multi Conference on Systemics, Cybernetics and Informatics-SCI’02, Orlando-USA, 2002.Google Scholar
  3. 3.
    Nikolay Gorski, Valery Anisimov, Emmanuel Augustin, Olivier Baret, Sergey Maximov. Industrial bank check processing: the a2ia check reader. International Journal on Document Analysis and Recognition, 3(4):196–206, May, 2001.CrossRefGoogle Scholar
  4. 4.
    P. Clark and M. Mirhehdi. Combining statistical measures to find image text regions. In ICPR’00, pages 450–453, Barcelona-España, 2000.Google Scholar
  5. 5.
    Xiangyun Ye, Mohamed Cheriet and Ching Y. Suen. A generic system to extract and clean handwritten data from business forms. In Seventh International Workshop on Frontiers in Handwriting Recognition, pages 63–72, Amsterdam, 2000.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • José Eduardo Bastos Dos Santos
    • 1
    • 2
  • Bernard Dubuisson
    • 1
  • Flávio Bortolozzi
    • 2
  1. 1.Heudiasyc - Université de Technologie de Compiègne(UTC)Compiegne cedexFrance
  2. 2.LUCIAPontifícia Universidade Católica do Paraná (PUCPR)CuritibaBrasil

Personalised recommendations