The IAM-database: an English sentence database for offline handwriting recognition

Marti, U.-V.; Bunke, H.

doi:10.1007/s100320200071

The IAM-database: an English sentence database for offline handwriting recognition

Original Research Paper
Published: November 2002

Volume 5, pages 39–46, (2002)
Cite this article

International Journal on Document Analysis and Recognition Aims and scope Submit manuscript

U.-V. Marti¹ &
H. Bunke¹

3176 Accesses
910 Citations
3 Altmetric
Explore all metrics

Abstract.

In this paper we describe a database that consists of handwritten English sentences. It is based on the Lancaster-Oslo/Bergen (LOB) corpus. This corpus is a collection of texts that comprise about one million word instances. The database includes 1,066 forms produced by approximately 400 different writers. A total of 82,227 word instances out of a vocabulary of 10,841 words occur in the collection. The database consists of full English sentences. It can serve as a basis for a variety of handwriting recognition tasks. However, it is expected that the database would be particularly useful for recognition tasks where linguistic knowledge beyond the lexicon level is used, because this knowledge can be automatically derived from the underlying corpus. The database also includes a few image-processing procedures for extracting the handwritten text from the forms and the segmentation of the text into lines and words.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Author information

Authors and Affiliations

Department of Computer Science, University of Bern, Neubrückstrasse 10, 3011 Bern, Switzerland; e-mail: {marti,bunke}@iam.unibe.ch , , , , , , CH
U.-V. Marti & H. Bunke

Authors

U.-V. Marti
View author publications
You can also search for this author in PubMed Google Scholar
H. Bunke
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

Received September 28, 2001 / Revised October 10, 2001

Rights and permissions

Reprints and permissions

About this article

Cite this article

Marti, UV., Bunke, H. The IAM-database: an English sentence database for offline handwriting recognition. IJDAR 5, 39–46 (2002). https://doi.org/10.1007/s100320200071

Download citation

Issue Date: November 2002
DOI: https://doi.org/10.1007/s100320200071

Keywords: Handwriting recognition – Database – Unconstrained English sentences – Corpus – Linguistic knowledge

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The IAM-database: an English sentence database for offline handwriting recognition

Abstract.

Access this article

Similar content being viewed by others

A survey on semi-supervised learning

Visualizing and Understanding Convolutional Networks

Siamese Neural Networks: An Overview

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Navigation

The IAM-database: an English sentence database for offline handwriting recognition

Abstract.

Access this article

Similar content being viewed by others

A survey on semi-supervised learning

Visualizing and Understanding Convolutional Networks

Siamese Neural Networks: An Overview

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation