Text Extraction in Digital News Video Using Morphology
Abstract
In this paper, a new method is presented to extract both superimposed and embedded scene texts in digital news videos. The algorithm is summarized in the following three steps : preprocessing, extracting candidate regions, and filtering candidate regions. For the first preprocessing step, a color image is converted into a gray-level image and a modified local adaptive thresholding is applied to the contrast-stretched image. In the second step, various morphological operations and Geo-correction method are applied to remove non-text components while retaining the text components. In the third filtering step, non-text components are removed based on the characteristics of each candidate component such as the number of pixels and the bounding box of each connected component Acceptable results have been obtained using the proposed method on 300 domestic news images with a recognition rate of 93.6%. Also, the proposed method gives good performance on the various kinds of images such as foreign news and film videos.
Keywords
Text Component Text Line Morphological Operation Text Region Scene ImageReferences
- 1.Jae-Chang Shim, Chitra Dorai, and Ruud Bolle, Automatic Text Extraction from Video for Content-Based Annotation and Retrieval, Proceedings of Fourteenth International Conference on Pattern Recognition, Vol. 1, pp. 618–620, 1998.Google Scholar
- 2.Anil K. Jain and Bin Yu, Automatic text location in images and video frames, Pattern Recognition, Vol. 31, No. 12, pp. 2055–2076, 1998.CrossRefGoogle Scholar
- 3.H. Kuwano, Y. Taniguchi, H. Arai, M. Mori, S. Kuraka-ke, and H. Kojima, Telop-on-demand: video structuring and retrieval based on text recognition, IEEE International Conference on Multimedia and Expo, Vol. 2, pp. 759–762, 2000.Google Scholar
- 4.U. Gargi, S. Antani, and R. Kasturi, Indexing text events in digital video databases, Proceedings of Fourteenth International Conference on Pattern Recognition, Vol. 1, pp. 916–918, 1998.Google Scholar
- 5.Sameer Antani, Ullas Gargi, David Crandall, Tarak Gandhi, and Rangachar Kasturi, Extraction of Text in Video, Dept. of Computer. Science and Eng., Pennsylvania State Univ., Technical Report, CSE-99-016, 1999.Google Scholar
- 6.S. Messelodi and C.M. Modena, Automatic identification and skew estimation of text lines in real scene images, Pattern Recognition, Vol. 32, pp. 791–810, 1999.CrossRefGoogle Scholar
- 7.S. Antani, D. Crandall, and R. Kasturi, Robust extraction of text in video, Proceedings of 15th International Conference on Pattern Recognition, Vol. 1, pp. 831–834, 2000.Google Scholar
- 8.Y. Lu, Machine printed character segmentation-An overview, Pattern Recognition, Vol. 28, pp. 67–80, 1995.CrossRefGoogle Scholar
- 9.Y. Zhong, K. Karu, and A. K. Jain, Locating text in complex color images, Pattern Recognition, Vol. 28, pp. 1523–1535, 1995.CrossRefGoogle Scholar
- 10.J. Ohya, A. Shio, and S. Akamatsu, Recognizing characters in scene images, IEEE Trans on Pattern Analysis and Machine Intelligence. PAMI-16, pp.214–220, 1994.Google Scholar
- 11.H. K. Kim, Efficient automatic text location method and content-based indexing and structuring of video database, J. Visual Commun. Image Representation, Vol. 7, pp. 336–344, 1996.CrossRefGoogle Scholar
- 12.M. A. Smith and T. Kanade, Video skimming for quick browsing base on audio and image characterization, Technical Report CMU-CS-95-186, Carnegie Mellon University, July 1995.Google Scholar
- 13.J. Serra, Image Analysis and Mathematical Morphology. New York: Academic, 1982.MATHGoogle Scholar
- 14.Pyeoung-Kee Kim, Automatic Text Location in Complex Color Images using Local Color Quantization, TENCON 99. Proceedings of the IEEE Region 10 Conference, Vol. 1, pp. 629–632, 1999.Google Scholar
- 15.R. Lienhart and F. Stuber, Automatic text recognition in digital videos, SPIE Image and Video Processing IV, pp 2666–2669, 1996Google Scholar