A Nom historical document recognition system for digital archiving

Van Phan, Truyen; Cong Nguyen, Kha; Nakagawa, Masaki

doi:10.1007/s10032-015-0257-8

A Nom historical document recognition system for digital archiving

Original Paper
Published: 12 December 2015

Volume 19, pages 49–64, (2016)
Cite this article

International Journal on Document Analysis and Recognition (IJDAR) Aims and scope Submit manuscript

Truyen Van Phan¹,
Kha Cong Nguyen¹ &
Masaki Nakagawa¹

518 Accesses
14 Citations
Explore all metrics

Abstract

A Nom historical document recognition system is being developed for digital archiving that uses image binarization, character segmentation, and character recognition. It incorporates two versions of off-line character recognition: one for automatic recognition of scanned and segmented character patterns (7660 categories) and the other for user handwritten input (32,695 categories). This separation is used since including less frequently appearing categories in automatic recognition increases the misrecognition rate without reliable statistics on the Nom language. Moreover, a user must be able to check the results and identify the correct categories from an extended set of categories, and a user can input characters by hand. Both versions use the same recognition method, but they are trained using different sets of training patterns. Recursive X–Y cut and Voronoi diagrams are used for segmentation; k–d tree and generalized learning vector quantization are used for coarse classification; and the modified quadratic discriminant function is used for fine classification. The system provides an interface through which a user can check the results, change binarization methods, rectify segmentation, and input correct character categories by hand. Evaluation done using a limited number of Nom historical documents after providing ground truths for them showed that the two stages of recognition along with user checking and correction improved the recognition results significantly.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

An Automated Pipeline for Robust Image Processing and Optical Character Recognition of Historical Documents

HDPA: historical document processing and analysis framework

Article 20 May 2020

Ladislav Lenc, Jiří Martínek, … Vincent Christlein

Text Segmentation for Document Recognition

References

Kim, M.S., Jang, M.D., Choi, H.I., Rhee, T.H., Kim, J.H., Kwag, H.K.: Digitalizing scheme of handwritten Hanja historical documents. In: Proceedings of the 1st International Workshop on Document Image Analysis for Libraries, USA, pp. 321–327, Jan. 2004
Shih, V.J., Chu, T.L.: The Han Nom Digital Library. In: The International Nom Conference, The National Library of Vietnam, Hanoi, pp. 12–14, Nov. 2004
Phan, T.V., Zhu, B., Nakagawa, M.: Development of Nom character segmentation for collecting patterns from historical document pages. In: Proceedings of 1st International Workshop on Historical Document Imaging and Processing, China, pp. 133–139, Sep. 2011
Phan, T.V., Zhu, B., Nakagawa, M.: Collecting handwritten Nom character patterns from historical document pages. In: Proceedings of 10th IAPR International Workshop on Document Analysis Systems, Australia, pp. 344–348, Mar. 2012
Su, B., Lu, S., Tan, C.L.: Binarization of historical handwritten document images using local maximum and minimum filter. In: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems, USA, pp. 159–165, Jun. 2010
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)
Kittler, J., Illingworth, J.: Threshold selection based on a simple image statistics. Comput. Vis. Graphics Image Process. 30, 125–147 (1985)
Article Google Scholar
Schindelin, J., Arganda-Carreras, I., Frise, E., Kaynig, V., Longair, M., Pietzsch, T., Cardona, A.: Fiji: an open-source platform for biological-image analysis. Nat. Methods. 9(7), 676–682 (2012)
Article Google Scholar
Tsukumo, J., Tanaka, H.: Classification of handprinted Chinese characters using non-linear normalization and correlation methods. In: Proceedings of the 9th International Conference on Pattern Recognition, Italy, pp. 168–171 (1988)
Liu, C.L.: Normalization-cooperated gradient feature extraction for handwritten character recognition. Pattern Anal. Mach. Intell. IEEE Trans. 29(8), 1465–1469 (2007)
Article Google Scholar
Kawamura, A., Yura, K., Hayama, T., Hidai, Y., Minamikawa, T., Tanaka, A., Masuda, S.: Online recognition of freely handwritten Japanese characters using directional feature densities. In: Proceedings of the 11th International Conference on Pattern Recognition, Netherlands, 2, pp. 183–186 (1992)
Fukunaga, K.: Introduction to Statistical Pattern Recognition, 2nd edn. Academic Press, San Diego (1990)
MATH Google Scholar
Kimura, F., Takashina, K., Tsuruoka, S., Miyake, Y.: Modified quadratic discriminant functions and the application to Chinese character recognition. IEEE Trans. PAMI 9(1), pp. 149–153 (1987)
Kohonen, T., Hynninen, J., Kangas, J., Laaksonen, J., Torkkola, K.: LVQ PAK: The learning vector quantization program package. In: Technical Report, Laboratory of Computer and Information Science Rakentajanaukio 2(C), pp. 1991–1992 (1996)
Sato, A., Yamada, K.: Generalized learning vector quantization. In: Proceedings of the 1995 Conference on Advances in Neural Information Processing Systems, vol 8, pp 423–429. MIT Press, Cambridge, USA (1996)
Juang, B.-H., Katagiri, S.: Discriminative learning for minimum error classification. Signal Process. IEEE Trans. 40(12), 3043–3054 (1992)
Article MATH Google Scholar
Liu, C.L., Nakagawa, M.: Evaluation of prototype learning algorithms for nearest-neighbor classifier in application to handwritten character recognition. Pattern Recognit. 34(3), 601–615 (2001)
Article MATH Google Scholar
Fukumoto, T., Wakabayashi, T., Kimura, F., Miyake, Y.: Accuracy improvement of handwritten character recognition by GLVQ. In: Proceedings of the 7th International Workshop on Frontiers in handwriting recognition, pp. 687–692. The Netherlands (2000)
Bentley, J.L.: Multidimensional binary search trees used for associative searching. Commun. ACM 18(9), 509–517 (1975)
Article MathSciNet MATH Google Scholar
Phan, T.V., Nakagawa, M., Baba, H., Watanabe, A.: MokkAnnotator - A system for archiving Mokkan images. In: Proceedings of the 16th Biennial Conference of the International Graphonomics Society, Japan, pp. 54–57, Jun. 2013
Nakagawa, M., Matsumoto, K.: Collection of on-line handwritten Japanese character pattern databases and their analysis. Doc. Anal. Recognit. 7(1), 69–81 (2004)
Chen, B., Zhu, B., Nakagawa, M.: Effects of generating a large amount of artificial patterns for on-line handwritten Japanese character recognition. In: Proceedings of the 11th International Conference on Document Analysis and Recognition, China, pp. 663–667, Sep. 2011
Leung, K.C., Leung, C.H.: Recognition of handwritten Chinese characters by combining regularization, Fisher’s discriminant and transformation sample generation. In: Proceedings of the 10th International Conference of Document Analysis and Recognition, Spain, pp. 1026–1030 (2009)

Download references

Acknowledgments

We thank the National Library of Vietnam and the Vietnamese Nom Preservation Foundation for providing the Nom historical document pages. This research is being supported by Grant-in-Aid for Scientific Research from the Japan Society for the Promotion of Science (JSPS) (contract numbers (B) 24300095 and (S) 25220401).

Author information

Authors and Affiliations

Department of Information and Communication Engineering, Tokyo University of Agriculture and Technology, Tokyo, 184-8588, Japan
Truyen Van Phan, Kha Cong Nguyen & Masaki Nakagawa

Authors

Truyen Van Phan
View author publications
You can also search for this author in PubMed Google Scholar
Kha Cong Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Masaki Nakagawa
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Masaki Nakagawa.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Van Phan, T., Cong Nguyen, K. & Nakagawa, M. A Nom historical document recognition system for digital archiving. IJDAR 19, 49–64 (2016). https://doi.org/10.1007/s10032-015-0257-8

Download citation

Received: 12 December 2014
Revised: 31 October 2015
Accepted: 11 November 2015
Published: 12 December 2015
Issue Date: March 2016
DOI: https://doi.org/10.1007/s10032-015-0257-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

A Nom historical document recognition system for digital archiving

Abstract

Access this article

Similar content being viewed by others

An Automated Pipeline for Robust Image Processing and Optical Character Recognition of Historical Documents

HDPA: historical document processing and analysis framework

Text Segmentation for Document Recognition

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A Nom historical document recognition system for digital archiving

Abstract

Access this article

Similar content being viewed by others

An Automated Pipeline for Robust Image Processing and Optical Character Recognition of Historical Documents

HDPA: historical document processing and analysis framework

Text Segmentation for Document Recognition

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation