Character Segmentation and Recognition

Lu, Tong; Palaiahnakote, Shivakumara; Tan, Chew Lim; Liu, Wenyin

doi:10.1007/978-1-4471-6515-6_6

Tong Lu⁷,
Shivakumara Palaiahnakote⁸,
Chew Lim Tan⁹ &
…
Wenyin Liu¹⁰

Part of the book series: Advances in Computer Vision and Pattern Recognition ((ACVPR))

1124 Accesses
1 Citations

Abstract

This chapter presents methods for character segmentation from text lines and recognition of video characters. It is noted that character segmentation from video text lines detected by video text detection method is not as easy as segmenting characters from scanned document images due to low resolution and complex background of video. This chapter presents a method for word segmentation based on the combination of Fourier and moments. Then, the segmented words are used for character segmentation using top and bottom profile features of the words. This chapter also presents a method which does not require words for character segmentation. Instead, it segments character from text lines directly by exploring gradient vector flow (GVF) for identifying the space between words. Further, this chapter introduces a recognition method without the use of an OCR engine. The method proposes structural features based on eight-directional sectors to facilitate character recognition y calculating representatives for each class of the characters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Hardcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

OCR Engine used: http://code.google.com/p/tesseract-ocr/
Jung K, Kim KI, Jain AK (2004) Text information extraction in images and video: a survey. Pattern Recogn 37(5):977–997
Article Google Scholar
Shivakumara P, Phan TQ, Tan CL (2011) A Laplacian approach to multi-oriented text detection in video. IEEE Trans PAMI 33(2):412–419
Article Google Scholar
Mori M, Sawaki M, Hagita N (2003) Video text recognition using feature compensation as category-dependent feature extraction. In: Proceedings of the ICDAR, pp 645–649
Google Scholar
Lienhart R, Wernicke A (2002) Localizing and segmenting text in images and videos. IEEE Trans Circ Syst Video Technol 12(4):256–268
Article Google Scholar
Huang X, Ma H, Zhang H (2009) A new video text extraction approach. In: Proceedings of the ICME, pp 650–653
Google Scholar
Miao G, Zhu G, Jiang S, Huang Q, Xu C, Gao W (2007) A real-time score detection and recognition approach for broadcast basketball video. In: Proceedings of the ICME, pp 1691–1694
Google Scholar
Kopf S, Haenselmann T, Effelsberg W (2005) Robust character recognition in low-resolution images and videos. Technical report, University of Mannheim
Google Scholar
Tse J, Jones C, Curtis D, Yfantis E (2007) An OCR-independent character segmentation using shortest-path in grayscale document images. In: Proceedings of the international conference on machine learning and applications, pp 142–147
Google Scholar
Kim W, Kim C (2009) A new approach for overlay text detection and extraction from complex video scene. IEEE Trans Image Process 18(2):401–411
Article MathSciNet Google Scholar
Saidane Z, Garcia C (2007) Robust binarization for video text recognition. In: Proceedings of the ICDAR, pp 874–879
Google Scholar
Chen D, Odobez J (2005) Video text recognition using sequential Monte Carlo and error voting methods. Pattern Recogn Lett 26(9):1386–1403
Article Google Scholar
Lee SH, Kim JH (2008) Complementary combination of holistic and component analysis for recognition of low resolution video character images. Pattern Recogn Lett 29:383–391
Article Google Scholar
Chen D, Odobez JM, Bourland H (2004) Text detection and recognition in images and video frames. Pattern Recogn 37(3):595–608
Article Google Scholar
Tang X, Gao X, Liu J, Zhang H (2002) A spatial-temporal approach for video caption detection and recognition. IEEE Trans Neural Netw 13:961–971
Article Google Scholar
Doermann D, Liang J, Li H (2003) Progress in camera-based document image analysis. In: Proceedings of the ICDAR, pp 606–616
Google Scholar
Wolf C, Jolion JM (2003) Extraction and Recognition of artificial text in multimedia documents. Pattern Anal Applic 6(4):309–326
MathSciNet Google Scholar
Zang J, Kasturi R (2008) Extraction of text objects in video documents: recent progress. In: Proceedings of the DAS, pp 5–17
Google Scholar
Jain AK, Yu B (1998) Automatic text location in images and video frames. Pattern Recogn 31:2055–2076
Article Google Scholar
Li H, Doermann D, Kia O (2000) Automatic text detection and tracking in digital video. IEEE Trans Image Process 9:147–156
Article Google Scholar
Kim KL, Jung K, Kim JH (2003) Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. IEEE Trans PAMI 25:1631–1639
Article Google Scholar
Saidane Z, Garcia C (2007) Robust binarization for video text recognition. In: Proceedings of the ICDAR, pp 874–879
Google Scholar
Zhou Z, Li L, Tan CL (2010) Edge based binarization for video text images. In: Proceedings of the ICPR, pp 133–136
Google Scholar
Jung K (2001) Neural network-based text location in color images. Pattern Recogn Lett 22:1503–1515
Article MATH Google Scholar
Hearn D, Pauline Baker M (1994) Computer graphics C version. 2nd edn. Prentice-Hall, Bresenham Line Drawing Algorithm
Google Scholar
Xu C, Prince JL (1998) Snakes, shapes, and gradient vector flow. IEEE Trans Image Process 7(3):359–369
Article MATH MathSciNet Google Scholar
Kass M, Witkin A, Terzopoulos D (1987) Snakes: active contour models. Int J Comput Vision 1(4):321–331
Article Google Scholar
Wang J, Jean J (1993) Segmentation of merged characters by neural networks and shortest path. In: Proceedings of the ACM/SIGAPP symposium on applied computing, pp 762–769
Google Scholar
Su B, Lu S, Tan CL (2010) Binarization of historical document images using the local maximum and minimum. In: Proceedings of the international workshop on document analysis systems, pp 159–166
Google Scholar
Bolan S, Shijian L, Tan CL (2010) Binarization of historical document images using the local maximum and minimum. In: Proceedings of the DAS, pp 159–165
Google Scholar
Shivakumara P, Rajan D, Sadanathan SA (2008) Classification of images: are rule based systems effective when classes are fixed and known? In: Proceedings of the ICPR
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Technology, Nanjing University, Nanjing, China
Tong Lu
Faculty of CSIT, University of Malaya, Kuala Lumpur, Malaysia
Shivakumara Palaiahnakote
National University of Singapore, Singapore, Singapore
Chew Lim Tan
Multimedia Software Engineering Research Center, City University of Hong Kong, Kowloon Tong, Hong Kong SAR
Wenyin Liu

Authors

Tong Lu
View author publications
You can also search for this author in PubMed Google Scholar
Shivakumara Palaiahnakote
View author publications
You can also search for this author in PubMed Google Scholar
Chew Lim Tan
View author publications
You can also search for this author in PubMed Google Scholar
Wenyin Liu
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Lu, T., Palaiahnakote, S., Tan, C.L., Liu, W. (2014). Character Segmentation and Recognition. In: Video Text Detection. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-6515-6_6

Download citation

DOI: https://doi.org/10.1007/978-1-4471-6515-6_6
Published: 30 June 2014
Publisher Name: Springer, London
Print ISBN: 978-1-4471-6514-9
Online ISBN: 978-1-4471-6515-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics