Abstract
The paper describes a texture-based fast text location scheme which operates directly in the Discrete Wavelet Transform (DWT) domain. By the distinguishing texture characteristics encoded in wavelet transform domain, the text is fast detected from complex background images stored in the compressed format such as JPEG2000 without full decompress. Compared with some traditional character location methods, the proposed scheme has the advantages of low computational cost, robust to size and font of characters and high accuracy. Preliminary experimental results show that the proposed scheme is efficient and effective.
Similar content being viewed by others
References
J. Eakins, M. Graham, Content-based image retrieval. Technical Report, University of Northumbria at Newcastle, Research Report, 1999.
Y. Rui, T. S. Huang, S. F. Chang, Image retrieval: Current techniques, promising directions, and open issues, Journal of Visual Communication and Image Representation, 10(1999)4, 39–62.
P. K. Kim, Automatic text location in complex color images using local color quantization, TENCON 99, Proceedings of the IEEE Region 10 Conference, Cheju Island South Korea, 1999, 1, 629–632.
J. Y. Zhou, D. Lopresti, Extracting text from WWW images, Proceedings of the Fourth International Conference on Document Analysis and Recognition, Ulm Germany, 1997, 1, 248–252.
C. Li, X. Q. Ding, Y. S. Wu, Automatic text location in natural scene images, Proc. Sixth International Conference on Document Analysis and Recognition, Seattle, WA, USA, 2001, 1069–1073.
Ki-Young Jeong, Keechul Jung, et al., Neural network-based text location for news video indexing, 1999 International Conference on Image Processing, Kobe, Japan, 1999, 3, 319–323.
V. Wu, R. Manmatha, E. M. Riseman, Textfinder: An automatic system to detect and recognize text in images, IEEE Trans. on Pattern Analysis and Machine Intelligence, 21(1999)11, 1224–1229.
C. S. Shin, K. I. Kim, et al., Support vector machine-based text detection in digital video, IEEE Proc. of Signal Processing Society Workshop on Neural Networks for Signal Processing, Sydney, NSW, Australia, 2000, 2, 634–641.
Yu Zhong, Hongjiang Zhang, A. K. Jain, Automatic caption localization in compressed video, IEEE Trans. on Pattern Analysis and Machine Intelligence, 22(2000)4, 385–392.
S. Mallat, A theory for multiresolution signal decomposition: The wavelet representation, IEEE Trans. on Pattern Analysis and Machine Intelligence, 11(1989)7, 674–693.
Author information
Authors and Affiliations
Additional information
Supported by the National Natural Science Foundation of China(No.60402036) and the Natural Science Foundation of Beijing(No.4042008).
Communication author: Li Xiaohua, born in 1973, female, Ph.D. Signal & Information Processing Lab., Beijing University of Technology, Beijing 100022, China.
About this article
Cite this article
Li, X., Shen, L. Fast text location based on discrete wavelet transform. J. of Electron.(China) 22, 385–394 (2005). https://doi.org/10.1007/BF02687926
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF02687926