Aligning Transcripts to Automatically Segmented Handwritten Manuscripts

Rothfeder, Jamie; Manmatha, R.; Rath, Toni M.

doi:10.1007/11669487_8

Jamie Rothfeder¹⁸,
R. Manmatha¹⁸ &
Toni M. Rath¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3872))

Included in the following conference series:

International Workshop on Document Analysis Systems

1628 Accesses
17 Citations

Abstract

Training and evaluation of techniques for handwriting recognition and retrieval is a challenge given that it is difficult to create large ground-truthed datasets. This is especially true for historical handwritten datasets. In many instances the ground truth has to be created by manually transcribing each word, which is a very labor intensive process. Sometimes transcriptions are available for some manuscripts. These transcriptions were created for other purposes and hence correspondence at the word, line, or sentence level may not be available. To be useful for training and evaluation, a word level correspondence must be available between the segmented handwritten word images and the ASCII transcriptions. Creating this correspondence or alignment is challenging because the segmentation is often errorful and the ASCII transcription may also have errors in it. Very little work has been done on the alignment of handwritten data to transcripts. Here, a novel Hidden Markov Model based automatic alignment algorithm is described and tested. The algorithm produces an average alignment accuracy of about 72.8% when aligning whole pages at a time on a set of 70 pages of the George Washington collection. This outperforms a dynamic time warping alignment algorithm by about 12% previously reported in the literature and tested on the same collection.

Download to read the full chapter text

Chapter PDF

Transcript Alignment for Historical Handwritten Documents: The MiM Algorithm

Continuous Handwritten Script Recognition

Dense Correspondences and Ancient Texts

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Deng, Y., Byrne, W.: Hmm word and phrase alignment for statistical machine translation. In: Proceedings of HLT-EMNLP (2005)
Google Scholar
Durbin, R., Eddy, S., Krogh, A., Mitchison, G.: Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids. Cambridge University Press, Cambridge (2001)
Google Scholar
Hobby, J.D.: Matching document images with ground truth. International Journal on Document Analysis and Recognition 1(1), 52–61 (1997)
Google Scholar
Kay, M., Roscheisen, M.: Text-translation alignment. Computational Linguistics 19(1), 121–142 (1993)
Google Scholar
Kornfield, E.M., Manmatha, R., Allan, J.: Text alignment with handwritten documents. In: Proceedings of Document Image Analysis for Libraries (DIAL), pp. 23–24 (2004)
Google Scholar
Lavrenko, V., Rath, T.M., Manmatha, R.: Holistic word recognition for handwritten historical documents. In: Proceedings of the Workshop on Document Image Analysis for Libraries DIAL 2004, pp. 278–287 (2004)
Google Scholar
Malfrère, F., Deroo, O., Dutoit, T.: Phonetic alignment: Speech synthesis based vs. hybrid hmm/ann. In: Proceedings of the ICSLP, pp. 1571–1574 (1998)
Google Scholar
Manmatha, R., Rothfeder, J.L.: A scale space approach for automatically segmenting words from historical handwritten documents. IEEE Transactions on PAMI 28(8), 1212–1225 (2005)
Google Scholar
Manmatha, R., Srimal, N.: Scale space technique for word segmentation in handwritten manuscripts. In: Proc. of the Second Int’l Conf. on Scale-Space Theories in Computer Vision, Corfu, Greece, September 26-27, pp. 22–33 (1999)
Google Scholar
Marti, U.V., Bunke, H.: A full English sentence database for off-line handwriting recognition. In: Proc. of the 5th Int. Conf. on Document Analysis and Recognition, Gangalore, India, pp. 705–708 (1999)
Google Scholar
Marti, U.-V., Bunke, H.: Using a statistical language model to improve the performance of an HMM-based cursive handwriting recognition system. Int’l Journal of Pattern Recognition and Artifical Intelligence 15(1), 65–90 (2001)
Article Google Scholar
Jang, P.J., Hauptmann, A.G.: Learning to recognize speech by watching television. IEEE Intelligent Systems 14(5), 51–58 (1999)
Article Google Scholar
Rath, T.M., Lavrenko, V., Manmatha, R.: A search engine for historical manuscript images. In: Proceedings of ACM SIGIR 2004, pp. 369–376 (2004)
Google Scholar
Rath, T.M., Rothfeder, J.L., Lvin, V.B.: The BoxModify tool, Computer program (2004)
Google Scholar
Roy, D.K., Malamud, C.: Speaker identification based text to audio alignment for an audio retrieval system. In: ICASSP 1997, Munich, Germany, pp. 1099–1102 (1997)
Google Scholar
Tomai, C.I., Zhang, B., Govindaraju, V.: Transcript mapping for historic handwritten document images. In: Proc. of the 8th Int’l Workshop on Frontiers in Handwriting Recognition, Niagara-on-the-Lake, ON, August 6-8, pp. 413–418 (2002)
Google Scholar
Vinciarelli, A., Bengio, S., Bunke, H.: Offline recognition of unconstrained handwritten texts using hmms and statistical language models. IEEE Trans. Pattern Anal. Mach. Intelligence 26(6), 709–720 (2004)
Article Google Scholar
Xu, Y., Nagy, G.: Prototype extraction and adaptive ocr. IEEE Trans. PAMI 21(12), 1280–1296 (1999)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Massachusetts Amherst, Amherst, MA, 01003, USA
Jamie Rothfeder, R. Manmatha & Toni M. Rath

Authors

Jamie Rothfeder
View author publications
You can also search for this author in PubMed Google Scholar
R. Manmatha
View author publications
You can also search for this author in PubMed Google Scholar
Toni M. Rath
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Computer Science and Applied Mathematics, University of Bern, Neubrückstrasse 10, CH-3012, Bern, Switzerland
Horst Bunke
DocRec Ltd, 34 Strathaven Place, 7001, Atawhai, Nelson, New Zealand
A. Lawrence Spitz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rothfeder, J., Manmatha, R., Rath, T.M. (2006). Aligning Transcripts to Automatically Segmented Handwritten Manuscripts. In: Bunke, H., Spitz, A.L. (eds) Document Analysis Systems VII. DAS 2006. Lecture Notes in Computer Science, vol 3872. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11669487_8

Download citation

DOI: https://doi.org/10.1007/11669487_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32140-8
Online ISBN: 978-3-540-32157-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)

Aligning Transcripts to Automatically Segmented Handwritten Manuscripts

Abstract

Chapter PDF

Similar content being viewed by others

Transcript Alignment for Historical Handwritten Documents: The MiM Algorithm

Continuous Handwritten Script Recognition

Dense Correspondences and Ancient Texts

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Aligning Transcripts to Automatically Segmented Handwritten Manuscripts

Abstract

Chapter PDF

Similar content being viewed by others

Transcript Alignment for Historical Handwritten Documents: The MiM Algorithm

Continuous Handwritten Script Recognition

Dense Correspondences and Ancient Texts

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation