Self Adaptable Recognizer for Document Image Collections

Meshesha, Million; Jawahar, C. V.

doi:10.1007/978-3-540-77046-6_69

Million Meshesha¹ &
C. V. Jawahar¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4815))

Included in the following conference series:

International Conference on Pattern Recognition and Machine Intelligence

2176 Accesses
2 Citations

Abstract

This paper presents an architecture that enables the recognizer to learn incrementally and, thereby adapt to document image collections for performance improvement. We argue that the recognition scheme for a book could be considerably different from that designed for isolated pages. We employ learning procedures to capture the relevant information available online, and feed it back to update the knowledge of the system. Experimental results show the effectiveness of our design for improving the performance on-the-fly.

Download to read the full chapter text

Chapter PDF

Image indexing and content analysis in children’s picture books using a large-scale database

Article 05 March 2019

Chengwei Huang & Hao Jiang

Online Task-free Continual Learning with Dynamic Sparse Distributed Memory

An Innovative Character Recognition for Ancient Book and Archival Materials: A Segmentation and Self-learning Based Approach

References

Feng, S., Manmatha, R.: A hierarchical, HMM-based automatic evaluation of OCR accuracy for a digital library of books. In: Joint Conference on Digital Libraries (JCDL), pp. 109–118 (2006)
Google Scholar
Sankar, P., et al.: Digitizing a million books: Challenges for document analysis. In: Proc. of the Seventh IAPR Workshop on Document Analysis Systems, pp. 425–436 (2006)
Google Scholar
Lin, X.: DRR research beyond COTS OCR software: A survey. In: SPIE Conference on Document Recognition and Retrieval XII, San Jose, CA, pp. 16–20 (2005)
Google Scholar
Xu, Y., Nagy, G.: Prototype extraction and adaptive OCR. IEEE Transactions on Pattern Analysis and Machine Intelligence 21, 1280–1296 (1999)
Article Google Scholar
Hastie, T., Tibshirani, R., Friedman, J.: The elements of statistical learning. Springer, Heidelberg (2001)
Google Scholar
Nagy, G.: Twenty years of document image analysis in PAMI. IEEE Transactions on Pattern Analysis and Machine Intelligence 22, 38–62 (2000)
Article MathSciNet Google Scholar
Kahan, S., Pavlidis, T., Baird, H.S.: On the recognition of printed characters of any font and size. IEEE Transactions on Pattern Analysis and Machine Intelligence 9, 274–288 (1987)
Article Google Scholar
Rawat, S., et al.: A semi-automatic adaptive OCR for digital libraries. In: Proc. of the Seventh IAPR Workshop on Document Analysis Systems, pp. 13–24 (2006)
Google Scholar
Ivanov, Y., Blumberg, B., Pentland, A.: Expectation maximization for weakly labeled data. In: Proc. of the Int. Conf. on Machine Learning, pp. 218–225 (2001)
Google Scholar
Iyengar, V.S., Apte, C., Zhang, T.: Active learning using adaptive resampling. In: Sixth Int. Conference on Knowledge Discovery and Data Mining, pp. 92–98 (2000)
Google Scholar
Diehl, C., Cauwenberghs, G.: SVM incremental learning, adaptation and optimization. In: Proc. IEEE Int. Joint Conf. Neural Networks, pp. 2685–2690 (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Center for Visual Information Technology, International Institute of Information Technology, Hyderabad - 500 032, India
Million Meshesha & C. V. Jawahar

Authors

Million Meshesha
View author publications
You can also search for this author in PubMed Google Scholar
C. V. Jawahar
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Ashish Ghosh Rajat K. De Sankar K. Pal

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Meshesha, M., Jawahar, C.V. (2007). Self Adaptable Recognizer for Document Image Collections. In: Ghosh, A., De, R.K., Pal, S.K. (eds) Pattern Recognition and Machine Intelligence. PReMI 2007. Lecture Notes in Computer Science, vol 4815. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77046-6_69

Download citation

DOI: https://doi.org/10.1007/978-3-540-77046-6_69
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77045-9
Online ISBN: 978-3-540-77046-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)

Self Adaptable Recognizer for Document Image Collections

Abstract

Chapter PDF

Similar content being viewed by others

Image indexing and content analysis in children’s picture books using a large-scale database

Online Task-free Continual Learning with Dynamic Sparse Distributed Memory

An Innovative Character Recognition for Ancient Book and Archival Materials: A Segmentation and Self-learning Based Approach

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Self Adaptable Recognizer for Document Image Collections

Abstract

Chapter PDF

Similar content being viewed by others

Image indexing and content analysis in children’s picture books using a large-scale database

Online Task-free Continual Learning with Dynamic Sparse Distributed Memory

An Innovative Character Recognition for Ancient Book and Archival Materials: A Segmentation and Self-learning Based Approach

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation