Skip to main content

Video Text Detection Systems

  • Chapter
  • First Online:
Video Text Detection

Part of the book series: Advances in Computer Vision and Pattern Recognition ((ACVPR))

  • 1125 Accesses

Abstract

Nowadays, a large number of video text detection systems have been developed for daily used video applications such as transportation surveillance, electronic payment, traffic safety detection, sport videos retrieval, and even commercial online advertisements, in which the existing closed-circuit television, road-rule enforcement cameras, or online videos can be the data sources. These systems have the same requirement of video content analysis, in which automatic video text detection is believed an essential task. The detected video texts can be recognized using OCR systems and further passed to a speech system, which potentially has other usages such as providing video services for blind people and automatic navigation for drivers. This chapter introduces several typical real-life video text detection applications, including license plate recognition, navigation assistance, sport video analysis, and online video advertising. The discussed techniques in these applications can be similarly adopted by or extended to many other real-life systems such as video content retrieval, person identification from videos, E-education or E-meeting, and even karaoke music entertainments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Huang Y-P et al (2009) An intelligent strategy for checking the annual inspection status of motorcycles based on license plate recognition. Expert Syst Appl 36(5):9260–9267

    Article  Google Scholar 

  2. Tao M, Xian-Sheng H, Shipeng L (2009) Video sense: a contextual in-video advertising system. IEEE Trans Circ Syst Vid Technol 19(12):1866–1879

    Article  Google Scholar 

  3. IAB internet advertising revenue report. Available from: http://www.iab.net/insights_research/industry_data_and_landscape/adrevenuereport

  4. FreeWheel manages the economics of content for the enterprise-class world of entertainment. Available from: http://www.freewheel.tv/

  5. Neumann L, Matas J (2012) A real-time scene text to speech system, in computer vision – ECCV 2012. In: Fusiello A, Murino V, Cucchiara R (eds) Workshops and demonstrations. Springer, Berlin, pp 619–622

    Google Scholar 

  6. Weiming H et al (2011) A survey on visual content-based video indexing and retrieval. IEEE Trans Syst Man Cybern C Appl Rev 41(6):797–819

    Article  Google Scholar 

  7. Chua T-S, Ruan L-Q (1995) A video retrieval and sequencing system. ACM Trans Inf Syst 13(4):373–407

    Article  Google Scholar 

  8. Poignant J et al (2011) Text detection and recognition for person identification in videos. In: 9th international workshop on content-based multimedia indexing (CBMI), 2011

    Google Scholar 

  9. Ming-yu C, Hauptmann A (2004) Searching for a specific person in broadcast news video. In: Proceedings (ICASSP ’04). IEEE international conference on acoustics, speech, and signal processing, 2004

    Google Scholar 

  10. Erol B, Ying L (2005) An overview of technologies for e-meeting and e-lecture. In: ICME 2005. IEEE international conference on multimedia and expo, 2005

    Google Scholar 

  11. Quanfu F et al (2011) Robust spatiotemporal matching of electronic slides to presentation videos. IEEE Trans Image Process 20(8):2315–2328

    Article  MathSciNet  Google Scholar 

  12. Dorai C, Oria V, Neelavalli V (2003) Structuralizing educational videos based on presentation content. In: ICIP 2003. Proceedings of the international conference on image processing, 2003

    Google Scholar 

  13. Zhu Y, Chen K, Sun Q Multimodal content-based structure analysis of karaoke music. In: Proceedings of the 13th annual ACM international conference on Multimedia 2005, ACM, Hilton, Singapore, pp 638–647

    Google Scholar 

  14. Ying W et al (2011) An algorithm for license plate recognition applied to intelligent transportation system. IEEE Trans Intell Transp Syst 12(3):830–845

    Article  Google Scholar 

  15. Shyang-Lih C et al (2004) Automatic license plate recognition. IEEE Trans Intell Transp Syst 5(1):42–53

    Article  Google Scholar 

  16. Anagnostopoulos CNE et al (2006) A license plate-recognition algorithm for intelligent transportation system applications. IEEE Trans Intell Transp Syst 7(3):377–392

    Article  Google Scholar 

  17. Shapiro V, Gluhchev G, Dimov D (2006) Towards a multinational car license plate recognition system. Mach Vis Appl 17(3):173–183

    Article  Google Scholar 

  18. Abolghasemi V, Ahmadyfard A (2009) An edge-based color-aided method for license plate detection. Image Vis Comput 27(8):1134–1142

    Article  Google Scholar 

  19. Zhigang X, Honglei Z (2007) An efficient method of locating vehicle license plate. In: ICNC 2007. Third international conference on Natural computation, 2007

    Google Scholar 

  20. Caner H, Gecim HS, Alkar AZ (2008) Efficient embedded neural-network-based license plate recognition system. IEEE Trans Veh Technol 57(5):2675–2683

    Article  Google Scholar 

  21. Jin L et al (2012) License plate recognition algorithm for passenger cars in Chinese residential areas. Sensors 12(6):8355–8370

    Article  Google Scholar 

  22. Road safety statistics. Available from: http://www.tmr.qld.gov.au/Safety/Transport-and-road-statistics/Road-safety-statistics.aspx

  23. Wu W, Chen X, Yang J Incremental detection of text on road signs from video with application to a driving assistant system. In: Proceedings of the 12th annual ACM international conference on Multimedia 2004, ACM, New York, pp 852–859

    Google Scholar 

  24. Marmo R, Lombardi L (2007) Milepost sign detection. In: CAMP 2006. International workshop on computer architecture for machine perception and sensing, 2006

    Google Scholar 

  25. Verma B, Stockwell D (2011) An automated system for the analysis of the status of road safety using neural networks. In: Lu B-L, Zhang L, Kwok J (eds) Neural information processing. Springer, Berlin, pp 530–537

    Chapter  Google Scholar 

  26. Pazio M et al (2007) Text detection system for the blind. In: 15th European signal processing conference EUSIPCO

    Google Scholar 

  27. Wang JR, Parameswaran N Survey of sports video analysis: research issues and applications. In: Proceedings of the Pan-Sydney area workshop on visual information processing 2004, Australian Computer Society, Inc., pp 87–90

    Google Scholar 

  28. Chih-Yi C et al (2012) Tagging webcast text in baseball videos by video segmentation and text alignment. IEEE Trans Circ Syst Vid Technol 22(7):999–1013

    Article  Google Scholar 

  29. Lien C-C, Chiang C-L, Lee C-H (2007) Scene-based event detection for baseball videos. J Vis Commun Image Represent 18(1):1–14

    Article  Google Scholar 

  30. Han M et al An integrated baseball digest system using maximum entropy method. In: Proceedings of the tenth ACM international conference on Multimedia 2002, ACM, Juan-les-Pins, pp 347–350

    Google Scholar 

  31. Takahashi M, Fujii M, Yagi N (2008) Automatic pitch type recognition from baseball broadcast videos. In: ISM 2008. Tenth IEEE international symposium on multimedia, 2008

    Google Scholar 

  32. Zhang D, Chang SF Event detection in baseball video using superimposed caption recognition. In: Proceedings of the tenth ACM international conference on Multimedia 2002, ACM, Juan-les-Pins, pp 315–318

    Google Scholar 

  33. Jinjun W et al (2007) Generation of personalized music sports video using multimodal cues. IEEE Trans Multimedia 9(3):576–588

    Article  Google Scholar 

  34. Online Publisher Association. Available from: http://www.online-publishers.org

  35. McCoy S et al (2007) The effects of online advertising. Commun ACM 50(3):84–88

    Article  MathSciNet  Google Scholar 

  36. Mehta A et al (2007) AdWords and generalized online matching. J ACM 54(5):22

    Article  MathSciNet  Google Scholar 

  37. Srinivasan SH, Sawant N, Wadhwa S vADeo: video advertising system. In: Proceedings of the 15th international conference on Multimedia 2007, ACM, Augsburg, pp 455–456

    Google Scholar 

  38. Mei T et al (2012) Image sense: towards contextual image advertising. ACM Trans Multimedia Comput Commun Appl 8(1):1–18

    Article  Google Scholar 

  39. Albayrak S et al (2011) Towards “Semantic IPTV”. In: Prasad AR, Buford JF, Gurbani VK (eds) Advances in next generation services and service architectures, River Publishers, pp 197–230

    Google Scholar 

  40. Ulges A, Borth D, Koch M (2013) Content analysis meets viewers: linking concept detection with demographics on YouTube. Int J Multimedia Inf Retr 2(2):145–157

    Article  Google Scholar 

  41. Gilly D, Raimond K (2013) A survey on license plate recognition systems. Int J Comput Appl 61(6):34–40

    Google Scholar 

  42. Joakim Kristian Olle Arfvidsson, Sriram Thirthala. Labeling features of maps using road signs. US patent (US8483447 B1)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag London

About this chapter

Cite this chapter

Lu, T., Palaiahnakote, S., Tan, C.L., Liu, W. (2014). Video Text Detection Systems. In: Video Text Detection. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-6515-6_7

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-6515-6_7

  • Published:

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-6514-9

  • Online ISBN: 978-1-4471-6515-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics