Abstract
The demand for digitisation of complex engineering drawings becomes increasingly important for the industry given the pressure to improve the efficiency and time effectiveness of operational processes. There have been numerous attempts to solve this problem, either by proposing a general form of document interpretation or by establishing an application dependant framework. Moreover, text/graphics segmentation has been presented as a particular form of addressing document digitisation problem, with the main aim of splitting text and graphics into different layers. Given the challenging characteristics of complex engineering drawings, this paper presents a novel sequential heuristics-based methodology which is aimed at localising and detecting the most representative symbols of the drawing. This implementation enables the subsequent application of a text/graphics segmentation method in a more effective form. The experimental framework is composed of two parts: first we show the performance of the symbol detection system and then we present an evaluation of three different state of the art text/graphic segmentation techniques to find text on the remaining image.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ablameyko, S.V., Uchida, S.: Recognition of engineering drawing entities: review of approaches. Int. J. Image Graph. 07(04), 709–733 (2007)
Arias, J.F., Lai, C.P., Chandran, S., Kasturi, R., Chhabra, A.: Interpretation of telephone system manhole drawings. Pattern Recognit. Lett. 16(4), 365–368 (1995)
Ballard, D.H.: Generalizing the Hough transform to detect arbitrary shapes. Pattern Recognit. 13(2), 111–122 (1981)
Bunke, H.: Automatic interpretation of lines and text in circuit diagrams. In: Kittler, J., Fu, K.S., Pau, L.F. (eds.) Pattern Recognition Theory and Applications, vol. 81, pp. 297–310. Springer, Dordrecht (1982)
Cao, R., Tan, C.L.: Text/graphics separation in maps. In: Blostein, D., Kwon, Y.-B. (eds.) GREC 2001. LNCS, vol. 2390, pp. 167–177. Springer, Heidelberg (2002). doi:10.1007/3-540-45868-9_14
Chowdhury, S.P., Mandal, S., Das, A.K., Chanda, B.: Segmentation of text and graphics from document images. In: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, vol, 2 (Sect. 4), pp. 619–623 (2007)
Cordella, L.P., Vento, M.: Symbol recognition in documents: a collection of techniques? Int. J. Doc. Anal. Recognit. 3(2), 73–88 (2000)
De, P., Mandal, S., Bhowmick, P.: Identification of annotations for circuit symbols in electrical diagrams of document images. In: 2014 Fifth International Conference on Signal and Image Processing, pp. 297–302 (2014)
Dori, D., Wenyin, L.: Vector-based segmentation of text connected to graphics in engineering drawings. In: Perner, P., Wang, P., Rosenfeld, A. (eds.) SSPR 1996. LNCS, vol. 1121, pp. 322–331. Springer, Heidelberg (1996). doi:10.1007/3-540-61577-6_33
Douglas, D.H., Peucker, T.K.: Algorithms for the reduction of the number of points required to represent a digitized line or its caricature. Cartogr. Int. J. Geogr. Inf. Geovisualization 10(2), 112–122 (1973)
Duda, R.O., Hart, P.E.: Use of the Hough transformation to detect lines and curves in pictures. Commun. ACM 15, 11–15 (1971)
Fan, K.C., Liu, C.H., Wang, Y.K.: Segmentation and classification of mixed text/graphics/image documents. Pattern Recognit. Lett. 15(12), 1201–1209 (1994)
Fletcher, L.A., Kasturi, R.: Robust algorithm for text string separation from mixed text/graphics images. IEEE Trans. Pattern Anal. Mach. Intell. 10(6), 910–918 (1988)
Freeman, H.: On the encoding of arbitrary geometric configurations. IRE Trans. Electron. Comput. EC–10, 260–268 (1960)
Gellaboina, M.K., Venkoparao, V.G.: Graphic symbol recognition using auto associative neural network model. In: Proceedings of the 7th International Conference on Advances in Pattern Recognition, ICAPR 2009, pp. 297–301 (2009)
Gray, S.B.: Local properties of binary images in two dimensions. IEEE Trans. Comput. 20(5), 551–561 (1971)
He, S., Abe, N.: A clustering-based approach to the separation of text strings from mixed text/graphics documents. Proc. - Int. Conf. Pattern Recognit. 3, 706–710 (1996)
Hough, P.V.C.: Method and means for recognizing complex patterns. US Patent 3,069,654, 18 December 1962
Hu, M.K.: Visual pattern recognition by moment invariants. IRE Trans. Inf. Theory 8, 179–187 (1962)
Kasturi, R., Bow, S.T., El-Masri, W., Shah, J., Gattiker, J.R.: A system for interpretation of line drawings. IEEE Trans. Pattern Anal. Mach. Intell. 12(10), 978–992 (1990)
Kim, S.H., Suh, J.W., Kim, J.H.: Recognition of logic diagrams by identifying loops and rectilinear polylines. In Proceedings of the Second International Conference on Document Analysis and Recognition - ICDAR 1993, pp. 349–352 (1993)
LladĂłs, J., Valveny, E., Sánchez, G., MartĂ, E.: Symbol recognition: current advances and perspectives. In: Blostein, D., Kwon, Y.-B. (eds.) GREC 2001. LNCS, vol. 2390, pp. 104–128. Springer, Heidelberg (2002). doi:10.1007/3-540-45868-9_9
Lu, Z.: Detection of text regions from digital engineering drawings. IEEE Trans. Pattern Anal. Mach. Intell. 20(4), 431–439 (1998)
Luo, H., Agam, G., Dinstein, I.: Directional mathematical morphology approach for line thinning and extraction of character strings from maps and line drawings. In: Proceedings of 3rd International Conference on Document Analysis and Recognition, vol. 1, pp. 257–260, 1 August 1995
Matas, J., Galambos, C., Kittler, J.: Robust detection of lines using the progressive probabilistic hough transform. Comput. Vis. Image Underst. 78(1), 119–137 (2000)
Moreno-GarcĂa, C.F., CortĂ©s, X., Serratosa, F.: Partial to full image registration based on candidate positions and multiple correspondences. In: Bayro-Corrochano, E., Hancock, E. (eds.) CIARP 2014. LNCS, vol. 8827, pp. 745–753. Springer, Cham (2014). doi:10.1007/978-3-319-12568-8_90
Okazaki, A., Kondo, T., Mori, K., Tsunekawa, S., Kawamoto, E.: Automatic circuit diagram reader with loop-structure-based symbol recognition. IEEE Trans. Pattern Anal. Mach. Intell. 10(3), 331–341 (1988)
Pratt, W.K.: Digital Image Processing, 4th edn. Wiley, Los Altos (2013)
Roy, P.P., Vazquez, E., Lladós, J., Baldrich, R., Pal, U.: A system to segment text and symbols from color maps. In: Liu, W., Lladós, J., Ogier, J.-M. (eds.) GREC 2007. LNCS, vol. 5046, pp. 245–256. Springer, Heidelberg (2008). doi:10.1007/978-3-540-88188-9_23
Tan, C., Ng, P.O.: Text extraction using pyramid. Pattern Recognit. 31(1), 63–72 (1998)
Tombre, K., Tabbone, S., Pélissier, L., Lamiroy, B., Dosch, P.: Text/graphics separation revisited. In: Lopresti, D., Hu, J., Kashi, R. (eds.) DAS 2002. LNCS, vol. 2423, pp. 200–211. Springer, Heidelberg (2002). doi:10.1007/3-540-45869-7_24
Wahl, F.M., Wong, K.Y., Casey, R.G.: Block segmentation and text extraction in mixed text/image documents. Comput. Graph. Image Process. 20(4), 375–390 (1982)
Wei, Y., Zhang, Z., Shen, W., Zeng, D., Fang, M., Zhou, S.: Text detection in scene images based on exhaustive segmentation. Signal Process. Image Commun. 50, 1–8 (2017)
Yu, Y., Samal, A., Seth, S.C.: A system for recognizing a large class of engineering drawings. IEEE Trans. Pattern Anal. Mach. Intell. 19(8), 868–890 (1997)
Acknowledgement
We would like to thank Dr. Brian Bain from DNV-GL Aberdeen for his feedback and collaboration in the project. This work is supported by a Scottish national project granted by the Data Lab Innovation Centre.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Moreno-GarcĂa, C.F., Elyan, E., Jayne, C. (2017). Heuristics-Based Detection to Improve Text/Graphics Segmentation in Complex Engineering Drawings. In: Boracchi, G., Iliadis, L., Jayne, C., Likas, A. (eds) Engineering Applications of Neural Networks. EANN 2017. Communications in Computer and Information Science, vol 744. Springer, Cham. https://doi.org/10.1007/978-3-319-65172-9_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-65172-9_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-65171-2
Online ISBN: 978-3-319-65172-9
eBook Packages: Computer ScienceComputer Science (R0)