Skip to main content

Character Segmentation and Recognition

  • Chapter
  • First Online:
Video Text Detection

Part of the book series: Advances in Computer Vision and Pattern Recognition ((ACVPR))

Abstract

This chapter presents methods for character segmentation from text lines and recognition of video characters. It is noted that character segmentation from video text lines detected by video text detection method is not as easy as segmenting characters from scanned document images due to low resolution and complex background of video. This chapter presents a method for word segmentation based on the combination of Fourier and moments. Then, the segmented words are used for character segmentation using top and bottom profile features of the words. This chapter also presents a method which does not require words for character segmentation. Instead, it segments character from text lines directly by exploring gradient vector flow (GVF) for identifying the space between words. Further, this chapter introduces a recognition method without the use of an OCR engine. The method proposes structural features based on eight-directional sectors to facilitate character recognition y calculating representatives for each class of the characters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. OCR Engine used: http://code.google.com/p/tesseract-ocr/

  2. Jung K, Kim KI, Jain AK (2004) Text information extraction in images and video: a survey. Pattern Recogn 37(5):977–997

    Article  Google Scholar 

  3. Shivakumara P, Phan TQ, Tan CL (2011) A Laplacian approach to multi-oriented text detection in video. IEEE Trans PAMI 33(2):412–419

    Article  Google Scholar 

  4. Mori M, Sawaki M, Hagita N (2003) Video text recognition using feature compensation as category-dependent feature extraction. In: Proceedings of the ICDAR, pp 645–649

    Google Scholar 

  5. Lienhart R, Wernicke A (2002) Localizing and segmenting text in images and videos. IEEE Trans Circ Syst Video Technol 12(4):256–268

    Article  Google Scholar 

  6. Huang X, Ma H, Zhang H (2009) A new video text extraction approach. In: Proceedings of the ICME, pp 650–653

    Google Scholar 

  7. Miao G, Zhu G, Jiang S, Huang Q, Xu C, Gao W (2007) A real-time score detection and recognition approach for broadcast basketball video. In: Proceedings of the ICME, pp 1691–1694

    Google Scholar 

  8. Kopf S, Haenselmann T, Effelsberg W (2005) Robust character recognition in low-resolution images and videos. Technical report, University of Mannheim

    Google Scholar 

  9. Tse J, Jones C, Curtis D, Yfantis E (2007) An OCR-independent character segmentation using shortest-path in grayscale document images. In: Proceedings of the international conference on machine learning and applications, pp 142–147

    Google Scholar 

  10. Kim W, Kim C (2009) A new approach for overlay text detection and extraction from complex video scene. IEEE Trans Image Process 18(2):401–411

    Article  MathSciNet  Google Scholar 

  11. Saidane Z, Garcia C (2007) Robust binarization for video text recognition. In: Proceedings of the ICDAR, pp 874–879

    Google Scholar 

  12. Chen D, Odobez J (2005) Video text recognition using sequential Monte Carlo and error voting methods. Pattern Recogn Lett 26(9):1386–1403

    Article  Google Scholar 

  13. Lee SH, Kim JH (2008) Complementary combination of holistic and component analysis for recognition of low resolution video character images. Pattern Recogn Lett 29:383–391

    Article  Google Scholar 

  14. Chen D, Odobez JM, Bourland H (2004) Text detection and recognition in images and video frames. Pattern Recogn 37(3):595–608

    Article  Google Scholar 

  15. Tang X, Gao X, Liu J, Zhang H (2002) A spatial-temporal approach for video caption detection and recognition. IEEE Trans Neural Netw 13:961–971

    Article  Google Scholar 

  16. Doermann D, Liang J, Li H (2003) Progress in camera-based document image analysis. In: Proceedings of the ICDAR, pp 606–616

    Google Scholar 

  17. Wolf C, Jolion JM (2003) Extraction and Recognition of artificial text in multimedia documents. Pattern Anal Applic 6(4):309–326

    MathSciNet  Google Scholar 

  18. Zang J, Kasturi R (2008) Extraction of text objects in video documents: recent progress. In: Proceedings of the DAS, pp 5–17

    Google Scholar 

  19. Jain AK, Yu B (1998) Automatic text location in images and video frames. Pattern Recogn 31:2055–2076

    Article  Google Scholar 

  20. Li H, Doermann D, Kia O (2000) Automatic text detection and tracking in digital video. IEEE Trans Image Process 9:147–156

    Article  Google Scholar 

  21. Kim KL, Jung K, Kim JH (2003) Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. IEEE Trans PAMI 25:1631–1639

    Article  Google Scholar 

  22. Saidane Z, Garcia C (2007) Robust binarization for video text recognition. In: Proceedings of the ICDAR, pp 874–879

    Google Scholar 

  23. Zhou Z, Li L, Tan CL (2010) Edge based binarization for video text images. In: Proceedings of the ICPR, pp 133–136

    Google Scholar 

  24. Jung K (2001) Neural network-based text location in color images. Pattern Recogn Lett 22:1503–1515

    Article  MATH  Google Scholar 

  25. Hearn D, Pauline Baker M (1994) Computer graphics C version. 2nd edn. Prentice-Hall, Bresenham Line Drawing Algorithm

    Google Scholar 

  26. Xu C, Prince JL (1998) Snakes, shapes, and gradient vector flow. IEEE Trans Image Process 7(3):359–369

    Article  MATH  MathSciNet  Google Scholar 

  27. Kass M, Witkin A, Terzopoulos D (1987) Snakes: active contour models. Int J Comput Vision 1(4):321–331

    Article  Google Scholar 

  28. Wang J, Jean J (1993) Segmentation of merged characters by neural networks and shortest path. In: Proceedings of the ACM/SIGAPP symposium on applied computing, pp 762–769

    Google Scholar 

  29. Su B, Lu S, Tan CL (2010) Binarization of historical document images using the local maximum and minimum. In: Proceedings of the international workshop on document analysis systems, pp 159–166

    Google Scholar 

  30. Bolan S, Shijian L, Tan CL (2010) Binarization of historical document images using the local maximum and minimum. In: Proceedings of the DAS, pp 159–165

    Google Scholar 

  31. Shivakumara P, Rajan D, Sadanathan SA (2008) Classification of images: are rule based systems effective when classes are fixed and known? In: Proceedings of the ICPR

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag London

About this chapter

Cite this chapter

Lu, T., Palaiahnakote, S., Tan, C.L., Liu, W. (2014). Character Segmentation and Recognition. In: Video Text Detection. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-6515-6_6

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-6515-6_6

  • Published:

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-6514-9

  • Online ISBN: 978-1-4471-6515-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics