Skip to main content
Log in

A generic method of cleaning and enhancing handwritten data from business forms

  • Published:
International Journal on Document Analysis and Recognition Aims and scope Submit manuscript

Abstract.

The automation of business form processing is attracting intensive research interests due to its wide application and its reduction of the heavy workload due to manual processing. Preparing clean and clear images for the recognition engines is often taken for granted as a trivial task that requires little attention. In reality, handwritten data usually touch or cross the preprinted form frames and texts, creating tremendous problems for the recognition engines. In this paper, we contribute answers to two questions: “Why do we need cleaning and enhancement procedures in form processing systems?” and “How can we clean and enhance the hand-filled items with easy implementation and high processing speed?” Here, we propose a generic system including only cleaning and enhancing phases. In the cleaning phase, the system registers a template to the input form by aligning corresponding landmarks. A unified morphological scheme is proposed to remove the form frames and restore the broken handwriting from gray or binary images. When the handwriting is found touching or crossing preprinted texts, morphological operations based on statistical features are used to clean it. In applications where a black-and-white scanning mode is adopted, handwriting may contain broken or hollow strokes due to improper thresholding parameters. Therefore, we have designed a module to enhance the image quality based on morphological operations. Subjective and objective evaluations have been studied to show the effectiveness of the proposed procedures.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Author information

Authors and Affiliations

Authors

Additional information

Received January 19, 2000 / Revised March 20, 2001

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ye, X., Cheriet, M. & Suen, C. A generic method of cleaning and enhancing handwritten data from business forms. IJDAR 4, 84–96 (2001). https://doi.org/10.1007/s100320100056

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s100320100056

Navigation