Abstract
Telugu text is composed of aksharas (characters). The presence of split and connected aksharas in Telugu document images causes segmentation difficulties and the performance of the Telugu OCR systems is affected. Our novel approach to solve this problem is using an implicit segmentation for recognizing words. The implicit segmentation approach does not need prior segmentation of the words into aksharas before they are recognized. Since the Hidden Markov models (HMM) are successfully applied for phoneme recognition with no prior segmentation of the speech into phonemes in the automatic speech recognition applications. In this paper, we report on the use of continuous density Hidden Markov Models for representing the shape of aksharas to build Telugu text recognition system. The sliding window method is used for computing simple statistical features and 450 akshara HMMs are trained. We use word bigram language model as contextual information. The word recognition relies on akshara models and contextual information of words. The word recognition involves finding the maximum likelihood sequence of akshara models that matches against the feature vector sequence. Our system recognizes words with split and connected aksharas. The performance of the system is encouraging.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aas, K., Eikvil, L., Andersen, T.: Text recognition from grey level images using hidden markov models. In: Hlaváč, V., Šára, R. (eds.) Computer Analysis of Images and Patterns. Lecture Notes in Computer Science, vol. 970, pp. 503–508. Springer, Heidelberg (1995)
Bazzi, I., Schwartz, R., Makhoul, J.: An omnifont open-vocabulary ocr system for english and arabic. IEEE Transactions on Pattern Analysis and Machine Intelligence 21(6), 495–504 (1999)
Bose, C., Kuo, S.S.: Connected and degraded text recognition using hidden markov model. In: Proceedings of the 11th IAPR International Conference on Pattern Recognition, 1992, Conference B: Pattern Recognition Methodology and Systems, vol. II, pp. 116–119 (1992)
Dutta, S., Sankaran, N., Sankar, K., Jawahar, C.: Robust recognition of degraded documents using character n-grams. In: 2012 10th IAPR International Workshop on Document Analysis Systems (DAS), pp. 130–134 (2012)
Elms, A.: A connected character recogniser using level building of hmms. In: Proceedings of the 12th IAPR International Conference on Pattern Recognition, 1994, Conference B: Computer Vision amp; Image Processing, vol. 2, pp. 439–441 (1994)
Elms, A., Procter, S., Illingworth, J.: The advantage of using an hmm-based approach for faxed word recognition. International Journal on Document Analysis and Recognition 1(1), 18–36 (1998)
Khorsheed, M.S.: Offline recognition of omnifont arabic text using the hmm toolkit (htk). Pattern Recogn. Lett. 28(12), 1563–1571 (2007)
Krishnan, P., Sankaran, N., Singh, A., Jawahar, C.: Towards a robust OCR system for indic scripts. In: 2014 11th IAPR International Workshop on Document Analysis Systems (DAS), pp. 141–145 (2014)
Kumar, P.P., Bhagvati, C., Agarwal, A.: On performance analysis of end-to-end OCR systems of indic scripts. In: Proceeding of the Workshop on Document Analysis and Recognition, pp. 132–138. ACM, New York (2012)
Natarajan, P., MacRostie, E., Decerbo, M.: The BBN byblos hindi OCR system. In: Govindaraju, V., Setlur, S.R. (eds.) Guide to OCR for Indic Scripts, Advances in Pattern Recognition, pp. 173–180. Springer, London (2010)
Negi, A., Bhagvati, C., Krishna, B.: An OCR system for telugu. In: ICDAR, pp. 1110–1114. IEEE Computer Society (2001)
Rabiner, L.: A tutorial on hidden markov models and selected applications in speech recognition. Proceedings of the IEEE 77(2), 257–286 (1989)
Roy, P., Roy, S., Pal, U.: Multi-oriented text recognition in graphical documents using hmm. In: 2014 11th IAPR International Workshop on Document Analysis Systems (DAS), pp. 136–140 (2014)
Tesseract: http://code.google.com/p/tesseract-ocr/
Young, S.: The HTK Hidden Markov Model Toolkit: Design and Philosophy (1993)
Young, S., Evermann, G., Gales, M., Hain, T., Kershaw, D., Liu, X.A., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P.: The HTK Book (for HTK Version 3.4). Cambridge University Engineering Department (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Koteswara Rao, D., Negi, A. (2016). An Implicit Segmentation Approach for Telugu Text Recognition Based on Hidden Markov Models. In: Thampi, S., Bandyopadhyay, S., Krishnan, S., Li, KC., Mosin, S., Ma, M. (eds) Advances in Signal Processing and Intelligent Recognition Systems. Advances in Intelligent Systems and Computing, vol 425. Springer, Cham. https://doi.org/10.1007/978-3-319-28658-7_54
Download citation
DOI: https://doi.org/10.1007/978-3-319-28658-7_54
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28656-3
Online ISBN: 978-3-319-28658-7
eBook Packages: EngineeringEngineering (R0)