Table Structure Extraction from Form Documents Based on Gradient-Wavelet Scheme

  • Dihua Xi
  • Seong-Whan Lee
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1655)

Abstract

Based on gradient and wavelet analyses, a novel scheme has been developed to extract table structures from skewed form document images. In this scheme, first, a skewed form document image is rotated according to the angle obtained from the gradient algorithm. Then the deskewed image is decomposed into four sub-images by divisible Multiresolution Analysis(MRA) wavelets. Afterwards, the table structure image which represents the geometric structure of the form can be obtained from the sub-images by a modified wavelet reconstruction algorithm. Meanwhile, another document image without table lines can be produced by Minkowski operation and is referred to as a table free image. Experimental results indicate that this new scheme can be applied to process the skewed form document images with promising achievements.

Reference

  1. 1.
    R. G. Casey, D. R. Ferguson, K. M. Mohiuddin, and E. Walach, “ Intelligent Forms Processing System,” Machine Vision and Application, Vol. 5, No. 3, pp. 143–155, 1992.CrossRefGoogle Scholar
  2. 2.
    ICDAR'95, Proc. Third Int. Conf. on Document Analysis and Recognition, Montreal, Canada, August 14-16, 1995.Google Scholar
  3. 3.
    ICDAR'97. Proc. Fourth Int. Conf. on Document Analysis and Recognition, Ulm-Germany, August 18-20, 1997.Google Scholar
  4. 4.
    Y. Y. Tang, H. Ma, J. Liu, B. Li, and D. Xi, “ Multiresolution Analysis in Extraction of Reference Lines from Documents with Gray Level Background,” IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 19, No. 8, pp. 921–926, 1997.CrossRefGoogle Scholar
  5. 5.
    S. Mallat, “ A Theory of Multiresolution Signal Decomposition: the Wavelet Representation,” IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 11, pp. 674–693, 1989.MATHCrossRefGoogle Scholar
  6. 6.
    R. Jain, “ Extraction of Motion Information from Peripheral Processes,” IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 3, No. 5, pp. 489–503, 1981.CrossRefGoogle Scholar
  7. 7.
    S. Mallat, A Wavelet Tour of Signal Processing, San Diego: Academic Press, 1998.MATHGoogle Scholar
  8. 8.
    E. Turolla, Y. Belaid, and A. Belaid, “Form Item Extraction Based on Line Searching”, in Graphics Recognition: Method and Applications, Lecture Notes in Computer Science, Vol. 1072, Springer-Verlag, Berlin Heidelberg New York, pp. 69–79, 1996.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1999

Authors and Affiliations

  • Dihua Xi
    • 1
  • Seong-Whan Lee
    • 1
  1. 1.Center for Artificial Vision ResearchKorea UniversitySeongbuk-ku, SeoulKorea

Personalised recommendations