Table Structure Extraction from Form Documents Based on Gradient-Wavelet Scheme
Based on gradient and wavelet analyses, a novel scheme has been developed to extract table structures from skewed form document images. In this scheme, first, a skewed form document image is rotated according to the angle obtained from the gradient algorithm. Then the deskewed image is decomposed into four sub-images by divisible Multiresolution Analysis(MRA) wavelets. Afterwards, the table structure image which represents the geometric structure of the form can be obtained from the sub-images by a modified wavelet reconstruction algorithm. Meanwhile, another document image without table lines can be produced by Minkowski operation and is referred to as a table free image. Experimental results indicate that this new scheme can be applied to process the skewed form document images with promising achievements.
- 2.ICDAR'95, Proc. Third Int. Conf. on Document Analysis and Recognition, Montreal, Canada, August 14-16, 1995.Google Scholar
- 3.ICDAR'97. Proc. Fourth Int. Conf. on Document Analysis and Recognition, Ulm-Germany, August 18-20, 1997.Google Scholar
- 8.E. Turolla, Y. Belaid, and A. Belaid, “Form Item Extraction Based on Line Searching”, in Graphics Recognition: Method and Applications, Lecture Notes in Computer Science, Vol. 1072, Springer-Verlag, Berlin Heidelberg New York, pp. 69–79, 1996.Google Scholar