Skip to main content

Color and Gradient Features for Text Segmentation from Video Frames

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 213))

Abstract

Text segmentation in a video is drawing attention of researchers in the field of image processing, pattern recognition and document image analysis because it helps in annotating and labeling video events accurately. We propose a novel idea of generating an enhanced frame from the R, G, and B channels of an input frame by grouping high and low values using Min–Max clustering criteria. We also perform sliding window on enhanced frame to group high and low values from the neighboring pixel values to further enhance the frame. Subsequently, we use k-means with k = 2 clustering algorithm to separate text and non-text regions. The fully connected components will be identified in the skeleton of the frame obtained by k-means clustering. Concept of connected component analysis based on gradient feature has been adapted for the purpose of symmetry verification. The components which satisfy symmetric verification are selected to be the representatives of text regions and they are permitted to grow to cover their respective region fully containing text. The method is tested on variety of video frames to evaluate the performance of the method in terms of recall, precision and f-measure. The results show that method is promising and encouraging.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Sharma N, Pal U, Blumenstein M (2012) Recent advances in video based document processing: a review. In: Proceedings of DAS, pp 63–68

    Google Scholar 

  2. Zang J, Kasturi J (2008) Extraction of text objects in video documents: recent progress. In: Proceedings of DAS, pp 5–17

    Google Scholar 

  3. Jung K, Kim K, Jain K (2004) Text information extraction in images and video: a survey. Pattern Recogn 37:977–997

    Google Scholar 

  4. Doermann D, Liang J, Li J (2003) Progress in camera-based document image analysis. In: Proceedings of ICDAR, pp 606–616

    Google Scholar 

  5. Jung K (2001) Neural network-based text location in color images. Pattern Recogn Lett 22:1503–1515

    Google Scholar 

  6. Ye Q, Huang Q, Gao W, Zhao D (2005) Fast and robust text detection in images and videos frames. Image Vis Comput 23:565–576

    Google Scholar 

  7. Chen D, Odobez JM, Bourlard H (2004) Text detection and recognition in images and video frames. Pattern Recog 37:595–608

    Google Scholar 

  8. Neumann L, Matas J (2012) Real-time scene text localization and recognition. In: Proceedings of CVPR, pp 3538–3545

    Google Scholar 

  9. Yao C, Bai X, Liu W, Ma Y, Tu Z (2012) Detecting texts of arbitrary orientations in natural images. In: Proceedings of CVPR, pp 1083–1090

    Google Scholar 

  10. Epshtein B, Ofek E, Wexler Y (2010) Detecting text in natural scenes with stroke width transform. In: Proceedings of CVPR, pp 2963–2970

    Google Scholar 

  11. Jain AK, Yu B (1998) Automatic text location in images and video frames. Pattern Recogn 31:2055–2076 (1998)

    Google Scholar 

  12. Mariano VY, Kasturi R (2000) Locating uniform-colored text in video frames. In: Proceedings of ICPR, pp 539–542

    Google Scholar 

  13. Wu V, Manmatha V, Riseman EM (1999) Text finder: an automatic system to detect and recognize text in images. IEEE Trans PAMI 21:1224–1229

    Google Scholar 

  14. Kim KL, Jung K, Kim JH (2003) Texture-based approach for text detection in images using support vector machines and continuous adaptive mean shift algorithm. IEEE Trans PAMI 25:1631–1639

    Google Scholar 

  15. Shivakumara P, Phan TQ, Tan CL (2011) A laplacian approach to multi-oriented text detection in video. IEEE Trans PAMI 33:412–419

    Google Scholar 

  16. Shivakumara P, Sreedhar RP, Phan TQ, Lu S, Tan CL (2012) Multi-oriented video scene text detection through bayesian classification and boundary growing. IEEE Trans CSVT 22:1227–1235

    Google Scholar 

  17. Zhou J, Xu L, Xiao B, Dai R (2007) A robust system for text extraction in video. In: Proceedings of ICMV, pp 119–124

    Google Scholar 

  18. Lu C, Wang C, Dai R (2005) Text detection in images based on unsupervised classification of edge-based features. In: Proceedings of ICDAR pp 610–614

    Google Scholar 

  19. Wong EK, Chen M (2003) A new robust algorithm for video text extraction. Pattern Recogn 36:1397–1406

    Google Scholar 

  20. Guru DS, Manjunath S, Shivakumara P, Tan CL (2010) An eigen value based approach for text detection in video. In: Proceedings of DAS, pp 501–506

    Google Scholar 

  21. Basavanna M, Shivakumara P, Srivatsa SK, Hemantha Kumar G (2011) A new run-length based method for scene text detection. In: Proceedings of IICAI, pp 1730–1736

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to P. Shivakumara .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer India

About this paper

Cite this paper

Shivakumara, P., Guru, D.S., Basavaraju, H.T. (2013). Color and Gradient Features for Text Segmentation from Video Frames. In: Swamy, P., Guru, D. (eds) Multimedia Processing, Communication and Computing Applications. Lecture Notes in Electrical Engineering, vol 213. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1143-3_22

Download citation

  • DOI: https://doi.org/10.1007/978-81-322-1143-3_22

  • Published:

  • Publisher Name: Springer, New Delhi

  • Print ISBN: 978-81-322-1142-6

  • Online ISBN: 978-81-322-1143-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics