Script Identification in Natural Scene Images: A Dataset and Texture-Feature Based Performance Evaluation

  • Manisha VermaEmail author
  • Nitakshi Sood
  • Partha Pratim Roy
  • Balasubramanian Raman
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 460)


Recognizing text with occlusion and perspective distortion in natural scenes is a challenging problem. In this work, we present a dataset of multi-lingual scripts and performance evaluation of script identification in this dataset using texture features. A ‘Station Signboard’ database that contains railway sign-boards written in 5 different Indic scripts is presented in this work. The images contain challenges like occlusion, perspective distortion, illumination effect, etc. We have collected a total of 500 images and corresponding ground-truths are made in semi-automatic way. Next, a script identification technique is proposed for multi-lingual scene text recognition. Considering the inherent problems in scene images, local texture features are used for feature extraction and SVM classifier, is employed for script identification. From the preliminary experiment, the performance of script identification is found to be 84 % using LBP feature with SVM classifier.


Texture feature Local binary pattern Script identification SVM classifier k-NN classifier 


  1. 1.
    Chanda, S., Franke, K., Pal, U.: Text independent writer identification for oriya script. In: Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on. pp. 369–373. IEEE (2012)Google Scholar
  2. 2.
    Ghosh, D., Dube, T., Shivaprasad, A.P.: Script recognition–a review. Pattern Analysis and Machine Intelligence, IEEE Transactions on 32(12), 2142–2161 (2010)Google Scholar
  3. 3.
    Heikkilä, M., Pietikäinen, M., Schmid, C.: Description of interest regions with local binary patterns. Pattern recognition 42(3), 425–436 (2009)Google Scholar
  4. 4.
    Murala, S., Maheshwari, R., Balasubramanian, R.: Directional local extrema patterns: a new descriptor for content based image retrieval. International Journal of Multimedia Information Retrieval 1(3), 191–203 (2012)Google Scholar
  5. 5.
    Ojala, T., Pietikäinen, M., Mäenpää, T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. Pattern Analysis and Machine Intelligence, IEEE Transactions on 24(7), 971–987 (2002)Google Scholar
  6. 6.
    Pal, U., Sinha, S., Chaudhuri, B.: Multi-script line identification from indian documents. In: Proceedings of Seventh International Conference on Document Analysis and Recognition. pp. 880–884. IEEE (2003)Google Scholar
  7. 7.
    Phan, T.Q., Shivakumara, P., Ding, Z., Lu, S., Tan, C.L.: Video script identification based on text lines. In: International Conference on Document Analysis and Recognition (ICDAR). pp. 1240–1244. IEEE (2011)Google Scholar
  8. 8.
    Shi, B., Yao, C., Zhang, C., Guo, X., Huang, F., Bai, X.: Automatic script identification in the wild. In: Proceedings of ICDAR. No. 531–535 (2015)Google Scholar
  9. 9.
    Shijian, L., Tan, C.L.: Script and language identification in noisy and degraded document images. Pattern Analysis and Machine Intelligence, IEEE Transactions on 30(1), 14–24 (2008)Google Scholar
  10. 10.
    Shivakumara, P., Yuan, Z., Zhao, D., Lu, T., Tan, C.L.: New gradient-spatial-structural features for video script identification. Computer Vision and Image Understanding 130, 35–53 (2015)Google Scholar
  11. 11.
    Singhal, V., Navin, N., Ghosh, D.: Script-based classification of hand-written text documents in a multilingual environment. In: Proceedings of 13th International Workshop on Research Issues in Data Engineering: Multi-lingual Information Management (RIDE-MLIM). pp. 47–54. IEEE (2003)Google Scholar
  12. 12.
    Sun, Q.Y., Lu, Y.: Text location in scene images using visual attention model. International Journal of Pattern Recognition and Artificial Intelligence 26(04), 1–22 (2012)Google Scholar
  13. 13.
    Ullrich, C.: Support vector classification. In: Forecasting and Hedging in the Foreign Exchange Markets, pp. 65–82. Springer (2009)Google Scholar

Copyright information

© Springer Science+Business Media Singapore 2017

Authors and Affiliations

  • Manisha Verma
    • 1
    Email author
  • Nitakshi Sood
    • 2
  • Partha Pratim Roy
    • 3
  • Balasubramanian Raman
    • 3
  1. 1.Mathematics DepartmentIIT RoorkeeRoorkeeIndia
  2. 2.University Institute of Engineering and Technology, Panjab UniversityChandigarhIndia
  3. 3.Computer Science and Engineering DepartmentIIT RoorkeeRoorkeeIndia

Personalised recommendations