Towards the interactive transcription of handwritings: anytime anywhere document analysis

Gottfried, Björn; Wegner, Marius; Lawo, Mathias

doi:10.1007/s10032-014-0234-7

Towards the interactive transcription of handwritings: anytime anywhere document analysis

Original Paper
Published: 02 November 2014

Volume 18, pages 31–45, (2015)
Cite this article

International Journal on Document Analysis and Recognition (IJDAR) Aims and scope Submit manuscript

Björn Gottfried¹,
Marius Wegner¹ &
Mathias Lawo²

523 Accesses
1 Citation
Explore all metrics

Abstract

This paper introduces the anytime anywhere document analysis methodology applied in the context of computer-aided transcription. Its utility is revealed for documents which are difficult to analyse, as in the case of handwritten texts. A special focus lies on the glyph separation problem which turns out to be particularly complicated. As automatic methods show fundamental limitations, a number of interactive methods are proposed which are based on the interplay between user and machine. These methods get along without any assumptions concerning underlying languages or appearances of texts. An evaluation in the context of palaeography and applied to a well-established data set illustrates how well handwritings are dealt with, although they offer distinct differences in their regularity and shape.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

$$\textit{TexT}$$ - Text Extractor Tool for Handwritten Document Transcription and Annotation

Continuous Handwritten Script Recognition

A Historical Document Handwriting Transcription End-to-end System

References

Brockhaus: Graphem. In: Der neue Brockhaus, vol. 2, p. 436. F. A. Brockhaus Wiesbaden (1979)
Casey, R.G., Lecolinet, E.: A survey of methods and strategies in character segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 18(7), 690–706 (1996)
Article Google Scholar
Chazalon, J., Coüasnon, B.: Iterative analysis of document collections enables efficient human-initiated interaction. In: Viard-Gaudin, C., Zanibbi, R. (eds.) Document Recognition and Retrieval XIX—DRR 2012, 19th Document Recognition and Retrieval Conference, Part of the IS&T-SPIE Electronic Imaging Symposium, Burlingame, California, USA, 25 Jan 2012, Proceedings, SPIE Proceedings, vol. 8297. SPIE (2012)
Chazalon, J., Coüasnon, B., Lemaitre, A.: Iterative analysis of pages in document collections for efficient user interaction. In: International Conference on Document Analysis and Recognition, ICDAR 2011, Beijing, China, 18–21 Sept 2011, pp. 503–507. IEEE (2011)
Clavier, E., Masini, G., Delalandre, M., Rigamonti, M., Tombre, K., Gardes, J.: Docmining: a cooperative platform for heterogeneous document interpretation according to user-defined scenarios. In: Lladós, J., Kwon, Y.B. (eds.) Graphics Recognition, Recent Advances and Perspectives, 5th InternationalWorkshop, GREC 2003, Barcelona, Spain, 30–31 July 2003. Revised Selected Papers, Lecture Notes in Computer Science, vol. 3088, pp. 13–24. Springer (2004)
Fischer, A., Frinken, V., Fornés, A., Bunke, H.: Transcription alignment of latin manuscripts using hidden markov models. In: Proceedings of the 2011 Workshop on Historical Document Imaging and Processing. HIP ’11, pp. 29–36. ACM, New York, NY, USA (2011)
Fischer, A., Indermühle, E., Bunke, H., Viehhauser, G., Stolz, M.: Ground truth creation for handwriting recognition in historical documents. In: Doermann, D.S., Govindaraju, V., Lopresti, D.P., Natarajan, P. (eds.) The Ninth IAPR International Workshop on Document Analysis Systems, DAS 2010, 9–11 June 2010, Boston, Massachusetts, USA, ACM International Conference Proceeding Series, pp. 3–10. ACM (2010)
Fischer, A., Keller, A., Frinken, V., Bunke, H.: Lexicon-free handwritten word spotting using character hmms. Pattern Recogn. Lett. 33(7), 934–942 (2012)
Article Google Scholar
Gottfried, B.: Qualitative similarity measures—the case of two-dimensional outlines. Comput. Vis. Image Underst. 110(1), 117–133 (2008)
Article Google Scholar
Gottfried, B., Meyer-Lerbs, L.: Towards the processing of historic documents. In: Bernadi, R. (ed.) Advanced Technologies for Digital Libraries, LNCS, pp. 15–28. Springer, Berlin (2011)
Chapter Google Scholar
He, L., Chao, Y., Suzuki, K.: A run-based two-scan labeling algorithm. IEEE Trans. Image Process. 17(5), 749–756 (2008)
Article MathSciNet Google Scholar
Hofmeister, W., Hofmeister-Winter, A.: Schriftzüge unter der High-Tech-Lupe. Theoretische Grundlagen und erste praktische Ergebnisse des Grazer Pilotprojekts DAmalS. In: Internatiohnales Jahrbuch für Editionswissenschaft, vol. 22, pp. 90–117 (2008)
Kansal, H., Sanyal, S., Gupta, D.: Dewarping and deskewing of a document using affine transformation. In: Ranchordas, A., Araújo, H. (eds.) VISAPP (2), pp. 73–78. INSTICC Press, Setúbal (2009)
Kim, G., Govindaraju, V., Srihari, S.N.: An architecture for handwritten text recognition systems. IJDAR 2(1), 37–44 (1999)
Article Google Scholar
Lebourgeois, F., Emptoz, H.: DEBORA: Digital AccEss to BOoks of the RenAissance. IJDAR 9(2–4), 193–221 (2007)
Article Google Scholar
Lowe, K.A.: From quill to t-pen: palaeography, editing and their e-futures. Lit. Compass 9(12), 1004–1009 (2012)
Article Google Scholar
Moalla, I., Lebourgeois, F., Emptoz, H., Alimi, A.M.: Contribution to the discrimination of the medieval manuscript texts: application in the palaeography. In: Bunke, H., Spitz, A.L. (eds.) Document Analysis Systems, LNCS, pp. 25–37. Springer, Berlin (2006)
Chapter Google Scholar
Nagy, G.: Twenty years of document image analysis in pami. IEEE Trans. Pattern Anal. Mach. Intell. 22(1), 38–62 (2000)
Article Google Scholar
Ouwayed, N., Belaïd, A.: A general approach for multi-oriented text line extraction of handwritten documents. IJDAR 15(4), 297–314 (2012)
Article Google Scholar
Peck, A.: Beginning GIMP: From Novice to Professional. Apress Inc., New York (2006)
Philipps 1870, fol. 11r and fol. 144r. Staatsbibliothek zu Berlin, Preußischer Kulturbesitz, Department of manuscripts, (c. 1100)
Plötz, T., Fink, G.A.: Markov models for offline handwriting recognition: a survey. IJDAR 12(4), 269–298 (2009)
Article Google Scholar
Ramel, J.Y., Sidére, N., Rayar, F.: Interactive layout analysis, content extraction and transcription of historical printed books using pattern redundancy analysis. Lit. Linguist. Comput. 28(2), 301–314 (2013)
Romero, V., Toselli, A.H., Rodríguez, L., Vidal, E.: Computer assisted transcription for ancient text images. In: Kamel, M.S., Campilho, A.C. (eds.) ICIAR, LNCS, pp. 1182–1193. Springer, Berlin (2007)
Google Scholar
Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. Pattern Recogn. 33, 225–236 (2000)
Article Google Scholar
Serrano, N., Gimnez, A., Civera, J., Sanchis, A., Juan, A.: Interactive handwriting recognition with limited user effort. IJDAR 17(1), 47–59 (2013)
Serrano, N., Tarazón, L., Pérez, D., Terrades, O.R., Juan, A.: The gidoc prototype. In: Fred, A.L.N. (ed.) Pattern Recognition in Information Systems, Proceedings of the 10th International Workshop on Pattern Recognition in Information Systems, PRIS 2010, In Conjunction with ICEIS 2010, Funchal, Madeira, Portugal, June 2010, pp. 82–89. SciTePress (2010)
Smith, R.: A simple and efficient skew detection algorithm via text row accumulation. In: ICDAR, pp. 1145–1148. IEEE Computer Society (1995)
Worch, J.H., Lawo, M., Gottfried, B.: Glyph spotting for mediaeval handwritings by template matching. In: Concolato, C., Schmitz, P. (eds.) ACM Symposium on Document Engineering, pp. 213–216. ACM, New York (2012)
Google Scholar
Wüthrich, M., Liwicki, M., Fischer, A., Indermühle, E., Bunke, H., Viehhauser, G., Stolz, M.: Language model integration for the recognition of handwritten medieval documents. In: ICDAR, pp. 211–215. IEEE Computer Society (2009)
Yacoub, S.M., Saxena, V., Sami, S.N.: Perfectdoc: A ground truthing environment for complex documents. In: Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), 29 Aug–1 Sept 2005, Seoul, Korea, pp. 452–457. IEEE Computer Society (2005)

Download references

Acknowledgments

The described methods have been realised within the Diptychon system which has been implemented to a large extent by Jan-Hendrik Worch. We are thankful for his valuable contributions to this project. Moreover, we acknowledge the support of Prof. Dr. Michael Menzel, Lehrstuhl für Mittelalterliche Geschichte, Humboldt-Universität zu Berlin, and Prof. Dr. Michael Lawo, Centre for Computing and Communication Technologies, University of Bremen. Finally, we appreciate the valuable feedback provided by the reviewers.

Author information

Authors and Affiliations

Centre for Computing and Communication Technologies, University of Bremen, Bremen, Germany
Björn Gottfried & Marius Wegner
Monumenta Germaniae Historica, Berlin-Brandenburgische Akademie der Wissenschaften, Berlin, Germany
Mathias Lawo

Authors

Björn Gottfried
View author publications
You can also search for this author in PubMed Google Scholar
Marius Wegner
View author publications
You can also search for this author in PubMed Google Scholar
Mathias Lawo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Björn Gottfried.

Additional information

This work was funded by the German Research Foundation (DFG) under Grant Numbers {GO 2023/4-1, LA 3066/4-2} and LA 3007/1-1 under the project name Diptychon and partially under Grant Number GO 2023/3-2.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gottfried, B., Wegner, M. & Lawo, M. Towards the interactive transcription of handwritings: anytime anywhere document analysis. IJDAR 18, 31–45 (2015). https://doi.org/10.1007/s10032-014-0234-7

Download citation

Received: 25 September 2013
Revised: 02 July 2014
Accepted: 07 October 2014
Published: 02 November 2014
Issue Date: March 2015
DOI: https://doi.org/10.1007/s10032-014-0234-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Towards the interactive transcription of handwritings: anytime anywhere document analysis

Abstract

Access this article

Similar content being viewed by others

$$\textit{TexT}$$ - Text Extractor Tool for Handwritten Document Transcription and Annotation

Continuous Handwritten Script Recognition

A Historical Document Handwriting Transcription End-to-end System

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Towards the interactive transcription of handwritings: anytime anywhere document analysis

Abstract

Access this article

Similar content being viewed by others

$$\textit{TexT}$$ - Text Extractor Tool for Handwritten Document Transcription and Annotation

Continuous Handwritten Script Recognition

A Historical Document Handwriting Transcription End-to-end System

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation