Skip to main content

Introduction to Video Text Detection

  • Chapter
  • First Online:
Video Text Detection

Part of the book series: Advances in Computer Vision and Pattern Recognition ((ACVPR))

Abstract

Text plays a dominant role in video viewing and understanding as text carries rich and important information relevant to the video contents. Studies have shown that humans often pay first attention to text over other objects in a video as text helps in getting semantics relevant to the content of the video. With this in mind, this chapter introduces research in video text detection. It first reviews relevant literature and then discusses characteristics and difficulties of video text detection faced by the majority of the methods under review. Various issues such as low resolution of video images, the presence of both caption and scene text in video, and background complexity variations are examined. This chapter also presents a brief historical overview to show how video text detection has evolved from the field of document image analysis and how the document analysis community has explored various methods proposed in different fields, including image processing, pattern recognition, computer vision, and artificial intelligence to find solution to text detection in video. Finally, this chapter discusses potential applications of video text detection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Sun Q-Y, Lu Y (2012) Text location in scene images using visual attention model. Int J Pattern Recogn Artif Intell 26(04):1–22

    Article  MathSciNet  Google Scholar 

  2. Jung K, Kim KI, Jain AK (2004) Text information extraction in images and video: a survey. Pattern Recogn 37:977–997

    Article  Google Scholar 

  3. Chen D, Luttin J, Shearer K (2000) A survey of text detection and recognition in images and videos, IDIAP research report, pp 1–21

    Google Scholar 

  4. Zhang J, Kasturi R (2008) Extraction of text objects in video documents: recent progress. In: Proceedings of the eighth IAPR workshop on document analysis systems (DAS), pp 5–17

    Google Scholar 

  5. Doremann D, Liang J, Li H (2003) Progress in camera-based document image analysis. In: Proceedings of the seventh international conference on document analysis and recognition (ICDAR)

    Google Scholar 

  6. Chen D, Odobez JM (2005) Video text recognition using sequential Monte Carlo and error voting methods. Pattern Recogn Lett 1386–1403

    Google Scholar 

  7. Nagy G (2000) Twenty years of document image analysis. IEEE Trans Pattern Anal Mach Intel (PAMI) 38–62

    Google Scholar 

  8. Tang YY, Lee SW, Suen CY (1996) Automatic document processing: a survey. Pattern Recogn 1931–1952

    Google Scholar 

  9. Abele L, Wahl F, Scheri W (1981) Procedures for an automatic segmentation of text graphics and halftone regions in document, Scandinavian conference on image analysis, pp 177–182

    Google Scholar 

  10. Toyoda J, Noguchi Y, Nishmiura Y (1982) Study of extracting Japanese newspaper. Int Conf Pattern Recog 1113–1115

    Google Scholar 

  11. Wong IY, Casey RG, Wahl FM (1982) Document analysis system. IBM Res Dev 647–656

    Google Scholar 

  12. Jain AK, Chen Y (1994) Address block location using color and texture analysis, CVGIP. Image Underst 179–190

    Google Scholar 

  13. Jain AK, Bhattcharjee SK (1992) Address block location using Gabor filters. Pattern Recogn 1459–1477

    Google Scholar 

  14. Jain AK, Farrokhina F (1991) Unsupervised texture segmentation using Gabor filter. Pattern Recogn 1167–1186

    Google Scholar 

  15. Ohya J, Shio A, Akamatsu S (1994) Recognizing characters in scene images. IEEE Trans Pattern Anal Mach Intel (PAMI) 214–224

    Google Scholar 

  16. Zhong Y, Karu K, Jain AK (1995) Locating text in complex color images. Pattern Recogn 1523–1535

    Google Scholar 

  17. Kim HK (1996) Efficient automatic text location method and content-based indexing and structuring of video database. J Vis Commun Image Represent 7:336–344

    Article  Google Scholar 

  18. Shim JC, Dorai C, Bolle R (1998) Automatic text extraction from video for content-based annotation and retrieval. In: Proceedings of international conference on pattern recognition (ICPR), vol 1, pp 618–620

    Google Scholar 

  19. Lienhart R, Stuber F (1996) Automatic text recognition in digital videos. In: Proceedings of SPIE, pp 180–188

    Google Scholar 

  20. Lienhart V, Effelsberg W (1998) Automatic text segmentation and text recognition for video indexing, Technical Report TR-98-009, PraktscheInformatik IV, University of Mannhein

    Google Scholar 

  21. Jain AK, Yu B (1998) Automatic text location in images and video frames. Pattern Recogn 31:2055–2076

    Article  Google Scholar 

  22. Wu V, Manmatha R, Risean EM (1999) Text finder: an automatic system to detect and recognize text in images. IEEE Trans Pattern Anal Mach Intell (PAMI) 21:1224–1229

    Article  Google Scholar 

  23. Wu V, Manmatha R, Risean EM (1997) Finding text in images. In: Proceedings of ACM international conference on digital libraries, pp 1–10

    Google Scholar 

  24. Mao W, Chung F, Lanm K, Siu W (2002) Hybird Chinese/English text detection in images and video frames. In: Proceedings of the international conference on pattern recognition (ICPR), vol 3, pp 1015–1018

    Google Scholar 

  25. Jeong KY, Jung K, Kim EY Kim JJ (1999) Neural network-based text location for news video indexing. In: Proceedings of the international conference on image processing (ICIP), pp 319–323

    Google Scholar 

  26. Jung K, Kim K, Kurata T, Kourogi M, Han J (2002) Text scanner with text detection technology on image sequence. In: Proceedings of the international conference on pattern recognition (ICPR), vol 3, pp 473–476

    Google Scholar 

  27. Kim KI, Jung J, Park SH, Kim HJ (2001) Support vector machine-based text detection in digital video. Pattern Recogn 34:527–529

    Article  Google Scholar 

  28. Li H, Doermann D (2000) A video text detection system based on automated training. In: Proceedings of the international conference on pattern recognition (ICPR), pp 223–226

    Google Scholar 

  29. Li H, Doerman D, Kia O (2000) Automatic text detection and tracking in digital video. IEEE Trans Pattern Anal Mach Intell (PAMI) 9:147–156

    Google Scholar 

  30. Chen D, Shearer K, Bourlard H (2001) Text enhancement with asymmetric filter for video OCR. In: Proceedings of the international conference on image analysis and processing, pp 192–197

    Google Scholar 

  31. Shivakumara P, Phan TQ, Tan CL (2009) A robust wavelet transform based technique for video text detection. In: Proceedings of ICDAR, pp 1285–1289

    Google Scholar 

  32. Shivakumara P, Phan TQ, Tan CL (2010) New fourier-statistical features in RGB space for video text detection. IEEE Trans Circ Syst Video Technol (TCSVT) 20:1520–1532

    Article  Google Scholar 

  33. Shivakumara P, Phan TQ, Tan CL(2010) New wavelet and color features for text detection in video. In: Proceedings of ICPR, pp 3996–3999

    Google Scholar 

  34. Shivakumara P, Dutta A, Tan CL, Pal U (2010) A new wavelet-median-moment based method for multi-oriented video text detection. In: Proceedings of DAS, pp 279–288

    Google Scholar 

  35. Shivakumara P, Phan TQ, Tan CL (2011) A laplacian approach to multi-oriented text detection in video. IEEE Trans Pattern Anal Mach Intell (TPAMI) 33:412–419

    Article  Google Scholar 

  36. Shivakumara P, Huang W, Tan CL (2008) An efficient edge based technique for text detection in video frames. In: Proceedings of the international workshop on document analysis systems (DAS2008), pp 307–314

    Google Scholar 

  37. Shivakumara P, Huang W, Tan CL (2008) Efficient video text detection using edge features. In: Proceedings of the international conference on pattern recognition (ICPR08)

    Google Scholar 

  38. Shivakumara P, Phan TQ, Tan CL (2009) Video text detection based on filters and edge analysis. In: Proceedings of ICME, 2009, pp 514–517

    Google Scholar 

  39. Shivakumara P, Phan TQ, Tan CL (2009) A gradient difference based technique for video text detection. In: Proceedings of ICDAR, 2009, pp 156–160

    Google Scholar 

  40. Phan TQ, Shivakumara P, Tan CL (2009) A Laplacian method for video text detection. In: Proceedings of ICDAR, pp 66–70

    Google Scholar 

  41. Shivakumara P, Huang W, Trung PQ, Tan CL (2010) Accurate video text detection through classification of low and high contrast images. Pattern Recogn 43:2165–2185

    Article  Google Scholar 

  42. Shivakumara P, Sreedhar RP, Phan TQ, Shijian L, Tan CL (2012) Multi-oriented video scene text detection through Bayesian classification and boundary growing. IEEE Trans Circ Syst Video Technol (TCSVT) 22:1227–1235

    Article  Google Scholar 

  43. Sharma N, Shivakumara P, Pal U, Blumenstein M, Tan CL (2012) A new method for arbitrarily-oriented text detection in video. In: Proceedings of DAS, pp 74–78

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag London

About this chapter

Cite this chapter

Lu, T., Palaiahnakote, S., Tan, C.L., Liu, W. (2014). Introduction to Video Text Detection. In: Video Text Detection. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-6515-6_1

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-6515-6_1

  • Published:

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-6514-9

  • Online ISBN: 978-1-4471-6515-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics