Abstract
A typical paper-based document consists of regions of text, graphics, and halftone images. The goal of this research is to develop algorithms that automate the input of such documents. This paper covers the design of algorithms that process raster-oriented binary images of paper-based graphics to obtain vector-oriented description files. The image is first preprocessed to suppress noise and other digitization artifacts. The graphical components are then represented as a union of maximal squares using the maximal square moving algorithm. The pointers describing the connectivity of the maximal squares are analyzed to generate a linked list of squares. Straight line segments, curves, junctions, and large areas are then identified by extensive processing of this linked list. The output of the algorithm is a graphics description file, which is then used as input to a graphics recognition system.
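To make the phrase "union of maximal squares" concrete, here is a minimal sketch in Python. It is not the maximal square moving algorithm used in the paper. It is a simple dynamic-programming stand-in that finds every all-foreground square not contained in a larger one. The function names `largest_square_sides` and `maximal_squares` and the nested-list image format (1 for foreground, 0 for background) are assumptions made for illustration only.

```python
from typing import List, Tuple


def largest_square_sides(image: List[List[int]]) -> List[List[int]]:
    """For each pixel, return the side of the largest all-foreground square
    whose bottom-right corner is that pixel (0 for background pixels)."""
    rows = len(image)
    cols = len(image[0]) if rows else 0
    side = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            if image[r][c]:
                if r == 0 or c == 0:
                    side[r][c] = 1
                else:
                    side[r][c] = 1 + min(
                        side[r - 1][c], side[r][c - 1], side[r - 1][c - 1]
                    )
    return side


def maximal_squares(image: List[List[int]]) -> List[Tuple[int, int, int]]:
    """Return (top, left, size) for every foreground square that is not
    contained in any larger foreground square."""
    side = largest_square_sides(image)
    rows = len(image)
    cols = len(image[0]) if rows else 0

    def side_at(r: int, c: int) -> int:
        # Positions outside the image cannot be corners of any square.
        return side[r][c] if r < rows and c < cols else 0

    result = []
    for r in range(rows):
        for c in range(cols):
            s = side[r][c]
            if s == 0:
                continue
            # A square is non-maximal exactly when some square of side s + 1
            # contains it; such a square must have its bottom-right corner
            # one step to the right, below, or diagonally below-right.
            grows = any(
                side_at(r + dr, c + dc) >= s + 1
                for dr, dc in ((0, 1), (1, 0), (1, 1))
            )
            if not grows:
                result.append((r - s + 1, c - s + 1, s))
    return result


if __name__ == "__main__":
    demo = [
        [1, 1, 0],
        [1, 1, 1],
        [0, 1, 1],
    ]
    print(maximal_squares(demo))
```

In this sketch each square is reported by its top-left corner and side length. Linking overlapping or adjacent squares into a list, as the abstract describes, would be a separate step that is not shown here.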
Cite this article
Shih, Cc., Kasturi, R. Extraction of graphic primitives from images of paper based line drawings. Machine Vis. Apps. 2, 103–113 (1989). https://doi.org/10.1007/BF01212371