Abstract
This chapter presents methods for character segmentation from text lines and recognition of video characters. It is noted that character segmentation from video text lines detected by video text detection method is not as easy as segmenting characters from scanned document images due to low resolution and complex background of video. This chapter presents a method for word segmentation based on the combination of Fourier and moments. Then, the segmented words are used for character segmentation using top and bottom profile features of the words. This chapter also presents a method which does not require words for character segmentation. Instead, it segments character from text lines directly by exploring gradient vector flow (GVF) for identifying the space between words. Further, this chapter introduces a recognition method without the use of an OCR engine. The method proposes structural features based on eight-directional sectors to facilitate character recognition y calculating representatives for each class of the characters.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
OCR Engine used: http://code.google.com/p/tesseract-ocr/
Jung K, Kim KI, Jain AK (2004) Text information extraction in images and video: a survey. Pattern Recogn 37(5):977–997
Shivakumara P, Phan TQ, Tan CL (2011) A Laplacian approach to multi-oriented text detection in video. IEEE Trans PAMI 33(2):412–419
Mori M, Sawaki M, Hagita N (2003) Video text recognition using feature compensation as category-dependent feature extraction. In: Proceedings of the ICDAR, pp 645–649
Lienhart R, Wernicke A (2002) Localizing and segmenting text in images and videos. IEEE Trans Circ Syst Video Technol 12(4):256–268
Huang X, Ma H, Zhang H (2009) A new video text extraction approach. In: Proceedings of the ICME, pp 650–653
Miao G, Zhu G, Jiang S, Huang Q, Xu C, Gao W (2007) A real-time score detection and recognition approach for broadcast basketball video. In: Proceedings of the ICME, pp 1691–1694
Kopf S, Haenselmann T, Effelsberg W (2005) Robust character recognition in low-resolution images and videos. Technical report, University of Mannheim
Tse J, Jones C, Curtis D, Yfantis E (2007) An OCR-independent character segmentation using shortest-path in grayscale document images. In: Proceedings of the international conference on machine learning and applications, pp 142–147
Kim W, Kim C (2009) A new approach for overlay text detection and extraction from complex video scene. IEEE Trans Image Process 18(2):401–411
Saidane Z, Garcia C (2007) Robust binarization for video text recognition. In: Proceedings of the ICDAR, pp 874–879
Chen D, Odobez J (2005) Video text recognition using sequential Monte Carlo and error voting methods. Pattern Recogn Lett 26(9):1386–1403
Lee SH, Kim JH (2008) Complementary combination of holistic and component analysis for recognition of low resolution video character images. Pattern Recogn Lett 29:383–391
Chen D, Odobez JM, Bourland H (2004) Text detection and recognition in images and video frames. Pattern Recogn 37(3):595–608
Tang X, Gao X, Liu J, Zhang H (2002) A spatial-temporal approach for video caption detection and recognition. IEEE Trans Neural Netw 13:961–971
Doermann D, Liang J, Li H (2003) Progress in camera-based document image analysis. In: Proceedings of the ICDAR, pp 606–616
Wolf C, Jolion JM (2003) Extraction and Recognition of artificial text in multimedia documents. Pattern Anal Applic 6(4):309–326
Zang J, Kasturi R (2008) Extraction of text objects in video documents: recent progress. In: Proceedings of the DAS, pp 5–17
Jain AK, Yu B (1998) Automatic text location in images and video frames. Pattern Recogn 31:2055–2076
Li H, Doermann D, Kia O (2000) Automatic text detection and tracking in digital video. IEEE Trans Image Process 9:147–156
Kim KL, Jung K, Kim JH (2003) Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. IEEE Trans PAMI 25:1631–1639
Saidane Z, Garcia C (2007) Robust binarization for video text recognition. In: Proceedings of the ICDAR, pp 874–879
Zhou Z, Li L, Tan CL (2010) Edge based binarization for video text images. In: Proceedings of the ICPR, pp 133–136
Jung K (2001) Neural network-based text location in color images. Pattern Recogn Lett 22:1503–1515
Hearn D, Pauline Baker M (1994) Computer graphics C version. 2nd edn. Prentice-Hall, Bresenham Line Drawing Algorithm
Xu C, Prince JL (1998) Snakes, shapes, and gradient vector flow. IEEE Trans Image Process 7(3):359–369
Kass M, Witkin A, Terzopoulos D (1987) Snakes: active contour models. Int J Comput Vision 1(4):321–331
Wang J, Jean J (1993) Segmentation of merged characters by neural networks and shortest path. In: Proceedings of the ACM/SIGAPP symposium on applied computing, pp 762–769
Su B, Lu S, Tan CL (2010) Binarization of historical document images using the local maximum and minimum. In: Proceedings of the international workshop on document analysis systems, pp 159–166
Bolan S, Shijian L, Tan CL (2010) Binarization of historical document images using the local maximum and minimum. In: Proceedings of the DAS, pp 159–165
Shivakumara P, Rajan D, Sadanathan SA (2008) Classification of images: are rule based systems effective when classes are fixed and known? In: Proceedings of the ICPR
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag London
About this chapter
Cite this chapter
Lu, T., Palaiahnakote, S., Tan, C.L., Liu, W. (2014). Character Segmentation and Recognition. In: Video Text Detection. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-6515-6_6
Download citation
DOI: https://doi.org/10.1007/978-1-4471-6515-6_6
Published:
Publisher Name: Springer, London
Print ISBN: 978-1-4471-6514-9
Online ISBN: 978-1-4471-6515-6
eBook Packages: Computer ScienceComputer Science (R0)