Overview of the Virtual Transcription Laboratory Usage Scenarios and Architecture

  • Adam Dudczak
  • Michał Dudziński
  • Cezary Mazurek
  • Piotr Smoczyk
Chapter
Part of the Studies in Computational Intelligence book series (SCI, volume 541)

Abstract

This chapter outlines findings from the final stage of development of the Virtual Transcription Laboratory (http://wlt.synat.pcss.pl, VTL) prototype. VTL is a crowdsourcing platform developed to support creation of the searchable representation of historic textual documents from Polish digital libraries. This chapter describes identified usage scenarios and shows how they were implemented in the data model and architecture of the prototype. Last chapter presents current usage of the portal and the results of basic benchmarks conducted in order to assess performance of transcription process in VTL.

Keywords

OCR Digitisation Post correction Digital libraries 

References

  1. 1.
    Dudczak, A., Kmieciak, M., Werla, M.: Creation of textual versions of historical documents from polish digital libraries. Lecture Notes in Computer Science, vol. 7489, pp. 89–94. Springer, (2012)Google Scholar
  2. 2.
    Dudczak, A., Kmieciak, M., Mazurek, C., Stroiński, M., Werla, M., Weglarz, J.: Improving the workflow for creation of textual versions of polish historical documents. In: Bembenik, R., Skonieczny, Ł., Rybiński, H., Kryszkiewicz, M., Niezgódka, M. (eds.) Intelligent Tools for Building a Scientific Information Platform: Advanced Architectures and Solutions, pp. 187–198. Springer, Berlin Heidelberg (2013)Google Scholar
  3. 3.
    Holley, R.: Many Hands Make Light Work: Public Collaborative OCR Text Correction in Australian Historic Newspapers, pp. 1–28. National Library of Australia Staff Papers, Canberra (2009)Google Scholar
  4. 4.
    Lewandowska, A., Werla, M.: PIONIER network digital libraries federation—interoperability of advanced network services implemented on a country scale. Comput. Methods Sci. Technol. 119–124 (2010)Google Scholar
  5. 5.
    Breuel, T.: The hOCR microformat for OCR workflow and results. In: 9th International Conference on Document Analysis and Recognition ICDAR 2007, vol 2, pp. 1063–1067. (2007)Google Scholar
  6. 6.
    Górny, M., Mazurek, J.: Key users of polish digital libraries. Electron. Libr. 30(4), (2012)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Adam Dudczak
    • 1
  • Michał Dudziński
    • 1
  • Cezary Mazurek
    • 1
  • Piotr Smoczyk
    • 1
  1. 1.Poznan Supercomputing and Networking CenterPoznanPoland

Personalised recommendations