Abstract
We survey the methods developed to date for crude vectorization of document images and classify them into six categories: thinning-based, Hough transform-based, contour-based, run-graph-based, mesh-pattern-based, and sparse-pixel-based. Crude vectorization is a relatively mature subject in the Document Analysis and Recognition field, though there is still room for improvement. The purpose of this survey is to give researchers a comprehensive overview of these techniques so that they can choose a suitable method when developing their own vectorization algorithms and systems.
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wenyin, L., Dori, D. (1998). A survey of non-thinning based vectorization methods. In: Amin, A., Dori, D., Pudil, P., Freeman, H. (eds) Advances in Pattern Recognition. SSPR/SPR 1998. Lecture Notes in Computer Science, vol 1451. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0033241
DOI: https://doi.org/10.1007/BFb0033241
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64858-1
Online ISBN: 978-3-540-68526-5
eBook Packages: Springer Book Archive