Language Technology for Handwritten Text Recognition

Toselli, Alejandro H.; Serrano, Nicolás; Giménez-Pastor, Adrià; Khoury, Ihab; Juan, Alfons; Vidal, Enrique

doi:10.1007/978-3-642-35292-8_19

Alejandro H. Toselli⁷,
Nicolás Serrano⁷,
Adrià Giménez-Pastor⁷,
Ihab Khoury⁷,
Alfons Juan⁷ &
…
Enrique Vidal⁷

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 328))

749 Accesses

Abstract

This paper shows how the nowadays prevalent technology used in HTR borrows concepts and methods from the field of ASR; i.e. those based on Hidden Markov Models (HMMs). Additionally, it will be described a HTR approach based on employing Bernoulli distributions rather than Gaussian-Mixture distributions for the HMM-state emission probability of observations. Finally, handwritten text recognition evaluation results are reported for several corpora involving different characteristics and languages.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Likforman-Sulem, L., Zahour, A., Taconet, B.: Text line segmentation of historical documents: a survey. International Journal on Document Analysis and Recognition 9, 123–138 (2007)
Article Google Scholar
Wong, K.Y., Wahl, F.M.: Document analysis system. IBM Journal of Research and Development 26, 647–656 (1982)
Article Google Scholar
Jelinek, F.: Statistical methods for speech recognition. MIT Press (1998)
Google Scholar
Katz, S.M.: Estimation of probabilities from sparse data for the language model component of a speech recognizer. In: Proceedings of the IEEE Transactions on Acoustics, Speech and Signal Processing (ICASSP 1987), vol. ASSP-35, pp. 400–401(March 1987)
Google Scholar
Kneser, R., Ney, H.: Improved backing-off for n-gram language modeling. In: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. 1, pp. 181–184 (1995)
Google Scholar
Bazzi, I., Schwartz, R., Makhoul, J.: An omnifont open-vocabulary OCR system for English and Arabic. IEEE Transactions on Pattern Analysis and Machine Intelligence 21(6), 495–504 (1999)
Article Google Scholar
Toselli, A.H., Juan, A., Keysers, D., González, J., Salvador, I., Ney, H., Vidal, E., Casacuberta, F.: Integrated handwriting recognition and interpretation using finite-state models. International Journal of Pattern Recognition and Artificial Intelligence 18(4), 519–539 (2004)
Article Google Scholar
Rabiner, L., Juang, B.H.: Fundamentals of speech recognition. Prentice-Hall, Englewood Cliffs (1993)
Google Scholar
Giménez, A., Juan, A.: Embedded bernoulli mixture hmms for handwritten word recognition. In: Proceedings of the 10th International Conference on Document Analysis and Recognition, Barcelona, Spain, pp. 896–900. IEEE Computer Society (July 2009)
Google Scholar
Toselli, A., Juan, A., Vidal, E.: Spontaneous handwriting recognition and classification. In: Proceedings of the International Conference on Pattern Recognition (ICPR 2004), Cambridge, United Kingdom, vol. 1, pp. 433–436 (August 2004)
Google Scholar
Marti, U.V., Bunke, H.: The IAM-database: an English sentence database for off-line handwriting recognition. International Journal on Document Analysis and Recognition (IJDAR) 5(1), 39–46 (2002)
Article MATH Google Scholar
Romero, V., Toselli, A.H., Rodríguez, L., Vidal, E.: Computer Assisted Transcription for Ancient Text Images. In: Kamel, M.S., Campilho, A. (eds.) ICIAR 2007. LNCS, vol. 4633, pp. 1182–1193. Springer, Heidelberg (2007)
Chapter Google Scholar
Pérez, D., Tarazón, L., Serrano, N., Castro, F.M., Ramos-Terrades, O., Juan, A.: The germana database. In: Proceedings of the 10th International Conference on Document Analysis and Recognition, Barcelona, Spain, pp. 301–305. IEEE Computer Society (July 2009)
Google Scholar
Serrano, N., Juan, A.: The rodrigo database. In: Proceedings of the The Seventh International Conference on Language Resources and Evaluation (LREC 2010), Malta, May 19-21 (2010)
Google Scholar
Pechwitz, M., Maddouri, S.S., Magn̈er, V., Ellouze, N., Amiri, H.: IFN/ENIT-database of handwritten Arabic words. In: Proc. of the Colloque International Francophone sur l’Ecrit et le Document (CIFED), Hammmamet, Tunisia (October 2002)
Google Scholar

Download references

Author information

Authors and Affiliations

DSIC/PRHLT, Universitat Politècnica de València, Camí de Vera, s/n, 46022, València, Spain
Alejandro H. Toselli, Nicolás Serrano, Adrià Giménez-Pastor, Ihab Khoury, Alfons Juan & Enrique Vidal

Authors

Alejandro H. Toselli
View author publications
You can also search for this author in PubMed Google Scholar
Nicolás Serrano
View author publications
You can also search for this author in PubMed Google Scholar
Adrià Giménez-Pastor
View author publications
You can also search for this author in PubMed Google Scholar
Ihab Khoury
View author publications
You can also search for this author in PubMed Google Scholar
Alfons Juan
View author publications
You can also search for this author in PubMed Google Scholar
Enrique Vidal
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Escuela Politecnica Superior, Universidad Autonoma de Madrid. C/ Francisco, Tomas y Valiente 11, 28049, Madrid, Spain
Doroteo Torre Toledano
Centro Politécnico Superior, Edificio Ada Byron, C/ María de Luna nº 1, 50018, Zaragoza, Spain
Alfonso Ortega Giménez
Universidade de Aveiro, Campus Universitário Aveiro, 3810-193, Aveiro, Portugal
António Teixeira
Escuela Politecnica Superior, Universidad Autonoma de Madrid, C/ Francisco, Tomas y Valiente 11, 28049, Madrid, Spain
Joaquín González Rodríguez
E.T.S.I.Telecomunicacion, Universidad Politécnica de Madrid, Ciudad Universitaria s/n, 28040, Madrid, Spain
Luis Hernández Gómez & Rubén San Segundo Hernández &
Escuela Politecnica Superior, Universidad Autonoma de Madrid, C/ Francisco, Tomas y Valiente 11, 28049, Madrid, Spain
Daniel Ramos Castro

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Toselli, A.H., Serrano, N., Giménez-Pastor, A., Khoury, I., Juan, A., Vidal, E. (2012). Language Technology for Handwritten Text Recognition. In: Torre Toledano, D., et al. Advances in Speech and Language Technologies for Iberian Languages. Communications in Computer and Information Science, vol 328. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35292-8_19

Download citation

DOI: https://doi.org/10.1007/978-3-642-35292-8_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35291-1
Online ISBN: 978-3-642-35292-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics