Abstract
This paper shows how the nowadays prevalent technology used in HTR borrows concepts and methods from the field of ASR; i.e. those based on Hidden Markov Models (HMMs). Additionally, it will be described a HTR approach based on employing Bernoulli distributions rather than Gaussian-Mixture distributions for the HMM-state emission probability of observations. Finally, handwritten text recognition evaluation results are reported for several corpora involving different characteristics and languages.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Likforman-Sulem, L., Zahour, A., Taconet, B.: Text line segmentation of historical documents: a survey. International Journal on Document Analysis and Recognition 9, 123–138 (2007)
Wong, K.Y., Wahl, F.M.: Document analysis system. IBM Journal of Research and Development 26, 647–656 (1982)
Jelinek, F.: Statistical methods for speech recognition. MIT Press (1998)
Katz, S.M.: Estimation of probabilities from sparse data for the language model component of a speech recognizer. In: Proceedings of the IEEE Transactions on Acoustics, Speech and Signal Processing (ICASSP 1987), vol. ASSP-35, pp. 400–401(March 1987)
Kneser, R., Ney, H.: Improved backing-off for n-gram language modeling. In: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. 1, pp. 181–184 (1995)
Bazzi, I., Schwartz, R., Makhoul, J.: An omnifont open-vocabulary OCR system for English and Arabic. IEEE Transactions on Pattern Analysis and Machine Intelligence 21(6), 495–504 (1999)
Toselli, A.H., Juan, A., Keysers, D., González, J., Salvador, I., Ney, H., Vidal, E., Casacuberta, F.: Integrated handwriting recognition and interpretation using finite-state models. International Journal of Pattern Recognition and Artificial Intelligence 18(4), 519–539 (2004)
Rabiner, L., Juang, B.H.: Fundamentals of speech recognition. Prentice-Hall, Englewood Cliffs (1993)
Giménez, A., Juan, A.: Embedded bernoulli mixture hmms for handwritten word recognition. In: Proceedings of the 10th International Conference on Document Analysis and Recognition, Barcelona, Spain, pp. 896–900. IEEE Computer Society (July 2009)
Toselli, A., Juan, A., Vidal, E.: Spontaneous handwriting recognition and classification. In: Proceedings of the International Conference on Pattern Recognition (ICPR 2004), Cambridge, United Kingdom, vol. 1, pp. 433–436 (August 2004)
Marti, U.V., Bunke, H.: The IAM-database: an English sentence database for off-line handwriting recognition. International Journal on Document Analysis and Recognition (IJDAR) 5(1), 39–46 (2002)
Romero, V., Toselli, A.H., Rodríguez, L., Vidal, E.: Computer Assisted Transcription for Ancient Text Images. In: Kamel, M.S., Campilho, A. (eds.) ICIAR 2007. LNCS, vol. 4633, pp. 1182–1193. Springer, Heidelberg (2007)
Pérez, D., Tarazón, L., Serrano, N., Castro, F.M., Ramos-Terrades, O., Juan, A.: The germana database. In: Proceedings of the 10th International Conference on Document Analysis and Recognition, Barcelona, Spain, pp. 301–305. IEEE Computer Society (July 2009)
Serrano, N., Juan, A.: The rodrigo database. In: Proceedings of the The Seventh International Conference on Language Resources and Evaluation (LREC 2010), Malta, May 19-21 (2010)
Pechwitz, M., Maddouri, S.S., Magn̈er, V., Ellouze, N., Amiri, H.: IFN/ENIT-database of handwritten Arabic words. In: Proc. of the Colloque International Francophone sur l’Ecrit et le Document (CIFED), Hammmamet, Tunisia (October 2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Toselli, A.H., Serrano, N., Giménez-Pastor, A., Khoury, I., Juan, A., Vidal, E. (2012). Language Technology for Handwritten Text Recognition. In: Torre Toledano, D., et al. Advances in Speech and Language Technologies for Iberian Languages. Communications in Computer and Information Science, vol 328. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35292-8_19
Download citation
DOI: https://doi.org/10.1007/978-3-642-35292-8_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35291-1
Online ISBN: 978-3-642-35292-8
eBook Packages: Computer ScienceComputer Science (R0)