Extraction and recognition of artificial text in multimedia documents

Wolf, C.; Jolion, J.-M.

doi:10.1007/s10044-003-0197-7

ORIGINAL ARTICLE
Published: February 2004

Volume 6, pages 309–326 (2004)

Formal Pattern Analysis & Applications

C. Wolf¹ & J.-M. Jolion¹

An Erratum to this article was published on 16 June 2004

Abstract

The systems currently available for content-based image and video retrieval work without semantic knowledge, i.e. they use image processing methods to extract low-level features of the data. The similarity obtained by these approaches does not always correspond to the similarity a human user would expect. One way to include more semantic knowledge in the indexing process is to use the text included in the images and video sequences. It is rich in information yet easy to use, e.g. by keyword-based queries. In this paper we present an algorithm to localise artificial text in images and videos using a measure of accumulated gradients and morphological processing. The quality of the localised text is improved by robust multiple frame integration. A new technique for the binarisation of the text boxes, based on a criterion maximising local contrast, is proposed. Finally, detection and OCR results for a commercial OCR system are presented, justifying the choice of the binarisation technique.
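
As a rough illustration of the localisation idea named in the abstract (accumulated gradients followed by morphological processing), the sketch below may help. It is not the authors' implementation: the abstract gives no parameters, so the function names, the window size, the fixed relative threshold, and the horizontal structuring element are all assumptions chosen for readability. It assumes a greyscale frame stored as a 2-D NumPy array that is wider than the accumulation window.

```python
import numpy as np

def accumulated_gradients(gray, window=9):
    """Sum absolute horizontal differences over a horizontal window.

    Hypothetical helper: rows of closely spaced vertical strokes, as in
    text, give a high accumulated response.
    """
    gray = np.asarray(gray, dtype=np.float64)
    # Backward difference along each row; the first column gets 0.
    grad = np.abs(np.diff(gray, axis=1, prepend=gray[:, :1]))
    kernel = np.ones(window)
    # Box-filter every row (assumes row length >= window).
    return np.apply_along_axis(
        lambda row: np.convolve(row, kernel, mode="same"), 1, grad
    )

def dilate(mask, width):
    """Binary dilation with a 1 x width horizontal element (width odd)."""
    half = width // 2
    padded = np.pad(mask, ((0, 0), (half, half)), constant_values=False)
    out = np.zeros_like(mask)
    for dx in range(width):
        out |= padded[:, dx:dx + mask.shape[1]]
    return out

def erode(mask, width):
    """Binary erosion with a 1 x width horizontal element (width odd)."""
    half = width // 2
    # Pad with True so the image border itself is not eroded away.
    padded = np.pad(mask, ((0, 0), (half, half)), constant_values=True)
    out = np.ones_like(mask)
    for dx in range(width):
        out &= padded[:, dx:dx + mask.shape[1]]
    return out

def localise_text(gray, window=9, rel_threshold=0.5, close_width=5):
    """Return a boolean mask of candidate text pixels.

    rel_threshold is an assumed fixed fraction of the maximum response;
    the paper's actual thresholding rule is not stated in the abstract.
    """
    acc = accumulated_gradients(gray, window)
    mask = acc > rel_threshold * acc.max()
    # Morphological closing (dilate, then erode) bridges small gaps
    # between neighbouring characters.
    return erode(dilate(mask, close_width), close_width)

# Synthetic frame: a band of thin vertical strokes on a flat background.
frame = np.zeros((20, 60))
frame[8:12, 20:40:2] = 255
text_mask = localise_text(frame)
```

On the synthetic frame, the mask is set inside the striped band and clear on the flat background. A real pipeline would go on to extract bounding boxes from the mask and apply the frame integration and binarisation steps the abstract mentions, which this sketch does not attempt.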

Author information

Authors and Affiliations

Lyon Research Center for Images and Intelligent Information Systems, INSA de Lyon, Bât. Verne, 20, Av. Albert Einstein, 69621, Villeurbanne cedex, France
C. Wolf & J.-M. Jolion

Corresponding author

Correspondence to C. Wolf.

Additional information

An erratum to this article can be found at http://dx.doi.org/10.1007/s10044-004-0216-3

About this article

Cite this article

Wolf, C., Jolion, J.-M. Extraction and recognition of artificial text in multimedia documents. Formal Pattern Analysis & Applications 6, 309–326 (2004). https://doi.org/10.1007/s10044-003-0197-7

Received: 25 February 2002
Accepted: 18 July 2003
Issue Date: February 2004
DOI: https://doi.org/10.1007/s10044-003-0197-7
