Abstract
This paper introduces the anytime anywhere document analysis methodology applied in the context of computer-aided transcription. Its utility is revealed for documents which are difficult to analyse, as in the case of handwritten texts. A special focus lies on the glyph separation problem which turns out to be particularly complicated. As automatic methods show fundamental limitations, a number of interactive methods are proposed which are based on the interplay between user and machine. These methods get along without any assumptions concerning underlying languages or appearances of texts. An evaluation in the context of palaeography and applied to a well-established data set illustrates how well handwritings are dealt with, although they offer distinct differences in their regularity and shape.
Similar content being viewed by others
References
Brockhaus: Graphem. In: Der neue Brockhaus, vol. 2, p. 436. F. A. Brockhaus Wiesbaden (1979)
Casey, R.G., Lecolinet, E.: A survey of methods and strategies in character segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 18(7), 690–706 (1996)
Chazalon, J., Coüasnon, B.: Iterative analysis of document collections enables efficient human-initiated interaction. In: Viard-Gaudin, C., Zanibbi, R. (eds.) Document Recognition and Retrieval XIX—DRR 2012, 19th Document Recognition and Retrieval Conference, Part of the IS&T-SPIE Electronic Imaging Symposium, Burlingame, California, USA, 25 Jan 2012, Proceedings, SPIE Proceedings, vol. 8297. SPIE (2012)
Chazalon, J., Coüasnon, B., Lemaitre, A.: Iterative analysis of pages in document collections for efficient user interaction. In: International Conference on Document Analysis and Recognition, ICDAR 2011, Beijing, China, 18–21 Sept 2011, pp. 503–507. IEEE (2011)
Clavier, E., Masini, G., Delalandre, M., Rigamonti, M., Tombre, K., Gardes, J.: Docmining: a cooperative platform for heterogeneous document interpretation according to user-defined scenarios. In: Lladós, J., Kwon, Y.B. (eds.) Graphics Recognition, Recent Advances and Perspectives, 5th InternationalWorkshop, GREC 2003, Barcelona, Spain, 30–31 July 2003. Revised Selected Papers, Lecture Notes in Computer Science, vol. 3088, pp. 13–24. Springer (2004)
Fischer, A., Frinken, V., Fornés, A., Bunke, H.: Transcription alignment of latin manuscripts using hidden markov models. In: Proceedings of the 2011 Workshop on Historical Document Imaging and Processing. HIP ’11, pp. 29–36. ACM, New York, NY, USA (2011)
Fischer, A., Indermühle, E., Bunke, H., Viehhauser, G., Stolz, M.: Ground truth creation for handwriting recognition in historical documents. In: Doermann, D.S., Govindaraju, V., Lopresti, D.P., Natarajan, P. (eds.) The Ninth IAPR International Workshop on Document Analysis Systems, DAS 2010, 9–11 June 2010, Boston, Massachusetts, USA, ACM International Conference Proceeding Series, pp. 3–10. ACM (2010)
Fischer, A., Keller, A., Frinken, V., Bunke, H.: Lexicon-free handwritten word spotting using character hmms. Pattern Recogn. Lett. 33(7), 934–942 (2012)
Gottfried, B.: Qualitative similarity measures—the case of two-dimensional outlines. Comput. Vis. Image Underst. 110(1), 117–133 (2008)
Gottfried, B., Meyer-Lerbs, L.: Towards the processing of historic documents. In: Bernadi, R. (ed.) Advanced Technologies for Digital Libraries, LNCS, pp. 15–28. Springer, Berlin (2011)
He, L., Chao, Y., Suzuki, K.: A run-based two-scan labeling algorithm. IEEE Trans. Image Process. 17(5), 749–756 (2008)
Hofmeister, W., Hofmeister-Winter, A.: Schriftzüge unter der High-Tech-Lupe. Theoretische Grundlagen und erste praktische Ergebnisse des Grazer Pilotprojekts DAmalS. In: Internatiohnales Jahrbuch für Editionswissenschaft, vol. 22, pp. 90–117 (2008)
Kansal, H., Sanyal, S., Gupta, D.: Dewarping and deskewing of a document using affine transformation. In: Ranchordas, A., Araújo, H. (eds.) VISAPP (2), pp. 73–78. INSTICC Press, Setúbal (2009)
Kim, G., Govindaraju, V., Srihari, S.N.: An architecture for handwritten text recognition systems. IJDAR 2(1), 37–44 (1999)
Lebourgeois, F., Emptoz, H.: DEBORA: Digital AccEss to BOoks of the RenAissance. IJDAR 9(2–4), 193–221 (2007)
Lowe, K.A.: From quill to t-pen: palaeography, editing and their e-futures. Lit. Compass 9(12), 1004–1009 (2012)
Moalla, I., Lebourgeois, F., Emptoz, H., Alimi, A.M.: Contribution to the discrimination of the medieval manuscript texts: application in the palaeography. In: Bunke, H., Spitz, A.L. (eds.) Document Analysis Systems, LNCS, pp. 25–37. Springer, Berlin (2006)
Nagy, G.: Twenty years of document image analysis in pami. IEEE Trans. Pattern Anal. Mach. Intell. 22(1), 38–62 (2000)
Ouwayed, N., Belaïd, A.: A general approach for multi-oriented text line extraction of handwritten documents. IJDAR 15(4), 297–314 (2012)
Peck, A.: Beginning GIMP: From Novice to Professional. Apress Inc., New York (2006)
Philipps 1870, fol. 11r and fol. 144r. Staatsbibliothek zu Berlin, Preußischer Kulturbesitz, Department of manuscripts, (c. 1100)
Plötz, T., Fink, G.A.: Markov models for offline handwriting recognition: a survey. IJDAR 12(4), 269–298 (2009)
Ramel, J.Y., Sidére, N., Rayar, F.: Interactive layout analysis, content extraction and transcription of historical printed books using pattern redundancy analysis. Lit. Linguist. Comput. 28(2), 301–314 (2013)
Romero, V., Toselli, A.H., Rodríguez, L., Vidal, E.: Computer assisted transcription for ancient text images. In: Kamel, M.S., Campilho, A.C. (eds.) ICIAR, LNCS, pp. 1182–1193. Springer, Berlin (2007)
Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. Pattern Recogn. 33, 225–236 (2000)
Serrano, N., Gimnez, A., Civera, J., Sanchis, A., Juan, A.: Interactive handwriting recognition with limited user effort. IJDAR 17(1), 47–59 (2013)
Serrano, N., Tarazón, L., Pérez, D., Terrades, O.R., Juan, A.: The gidoc prototype. In: Fred, A.L.N. (ed.) Pattern Recognition in Information Systems, Proceedings of the 10th International Workshop on Pattern Recognition in Information Systems, PRIS 2010, In Conjunction with ICEIS 2010, Funchal, Madeira, Portugal, June 2010, pp. 82–89. SciTePress (2010)
Smith, R.: A simple and efficient skew detection algorithm via text row accumulation. In: ICDAR, pp. 1145–1148. IEEE Computer Society (1995)
Worch, J.H., Lawo, M., Gottfried, B.: Glyph spotting for mediaeval handwritings by template matching. In: Concolato, C., Schmitz, P. (eds.) ACM Symposium on Document Engineering, pp. 213–216. ACM, New York (2012)
Wüthrich, M., Liwicki, M., Fischer, A., Indermühle, E., Bunke, H., Viehhauser, G., Stolz, M.: Language model integration for the recognition of handwritten medieval documents. In: ICDAR, pp. 211–215. IEEE Computer Society (2009)
Yacoub, S.M., Saxena, V., Sami, S.N.: Perfectdoc: A ground truthing environment for complex documents. In: Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), 29 Aug–1 Sept 2005, Seoul, Korea, pp. 452–457. IEEE Computer Society (2005)
Acknowledgments
The described methods have been realised within the Diptychon system which has been implemented to a large extent by Jan-Hendrik Worch. We are thankful for his valuable contributions to this project. Moreover, we acknowledge the support of Prof. Dr. Michael Menzel, Lehrstuhl für Mittelalterliche Geschichte, Humboldt-Universität zu Berlin, and Prof. Dr. Michael Lawo, Centre for Computing and Communication Technologies, University of Bremen. Finally, we appreciate the valuable feedback provided by the reviewers.
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was funded by the German Research Foundation (DFG) under Grant Numbers {GO 2023/4-1, LA 3066/4-2} and LA 3007/1-1 under the project name Diptychon and partially under Grant Number GO 2023/3-2.
Rights and permissions
About this article
Cite this article
Gottfried, B., Wegner, M. & Lawo, M. Towards the interactive transcription of handwritings: anytime anywhere document analysis. IJDAR 18, 31–45 (2015). https://doi.org/10.1007/s10032-014-0234-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10032-014-0234-7