A learning-based method to detect and segment text from scene images


This paper proposes a learning-based method for text detection and text segmentation in natural scene images. First, the input image is decomposed into multiple connected components (CCs) by the Niblack clustering algorithm. Then all the CCs, both text and non-text, are verified on their text features by a two-stage classification module: an attentional cascade classifier discards most non-text CCs, and an SVM further verifies the remaining ones. All accepted CCs are output to form a text-only binary image. Experiments on many images from different scenes showed satisfactory performance of the proposed method.
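The pipeline described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. It assumes dark text on a lighter background, uses the standard Niblack local threshold T = m + k·s for the decomposition step, and labels components with 4-connectivity. All function names, the window radius, and the value of k are hypothetical, and the cascade stages and the final verifier (an SVM in the paper) are passed in as plain callables because the abstract does not specify the text features.

```python
from collections import deque
from math import sqrt


def niblack_binarize(img, radius=1, k=-0.2):
    """Mark a pixel as foreground (1) when it is darker than the local
    Niblack threshold T = mean + k * std of its window (clipped at borders).
    radius and k are illustrative values, not taken from the paper."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            vals = [img[j][i]
                    for j in range(max(0, y - radius), min(h, y + radius + 1))
                    for i in range(max(0, x - radius), min(w, x + radius + 1))]
            mean = sum(vals) / len(vals)
            std = sqrt(sum((v - mean) ** 2 for v in vals) / len(vals))
            out[y][x] = 1 if img[y][x] < mean + k * std else 0
    return out


def connected_components(binary):
    """Return a list of CCs, each a list of (row, col) foreground pixels,
    found by breadth-first search with 4-connectivity."""
    h, w = len(binary), len(binary[0])
    seen = [[False] * w for _ in range(h)]
    comps = []
    for y in range(h):
        for x in range(w):
            if binary[y][x] and not seen[y][x]:
                seen[y][x] = True
                queue, comp = deque([(y, x)]), []
                while queue:
                    cy, cx = queue.popleft()
                    comp.append((cy, cx))
                    for ny, nx in ((cy - 1, cx), (cy + 1, cx),
                                   (cy, cx - 1), (cy, cx + 1)):
                        if (0 <= ny < h and 0 <= nx < w
                                and binary[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
                comps.append(comp)
    return comps


def two_stage_filter(comps, cascade_stages, final_verifier):
    """Keep a CC only if every cheap cascade stage accepts it and the final
    verifier (a stand-in for the SVM) also accepts it. all() stops at the
    first rejecting stage, which mirrors the early rejection of a cascade."""
    return [c for c in comps
            if all(stage(c) for stage in cascade_stages) and final_verifier(c)]


def render(comps, h, w):
    """Draw the accepted CCs into a text-only binary image."""
    out = [[0] * w for _ in range(h)]
    for comp in comps:
        for y, x in comp:
            out[y][x] = 1
    return out
```

A caller would chain the four functions: binarize, label, filter, render. In this sketch a cascade stage can be as simple as `lambda cc: len(cc) >= 2`; the actual features and trained classifiers are described in the full paper, not in the abstract.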

Project supported by the OMRON and SJTU Collaborative Foundation under PVS project (2005.03–2005.10)

Jiang, Rj., Qi, Fh., Xu, L. et al. A learning-based method to detect and segment text from scene images. J. Zhejiang Univ. - Sci. A 8, 568–574 (2007).

  • Text detection
  • Text segmentation
  • Text feature
  • Attentional cascade

  • CLC number: TP391.41