Learning confidence transformation for handwritten Chinese text recognition

Wang, Da-Han; Liu, Cheng-Lin

doi:10.1007/s10032-013-0214-3

Learning confidence transformation for handwritten Chinese text recognition

Original Paper
Published: 05 November 2013

Volume 17, pages 205–219, (2014)
Cite this article

International Journal on Document Analysis and Recognition (IJDAR) Aims and scope Submit manuscript

Da-Han Wang¹ &
Cheng-Lin Liu²

450 Accesses
4 Citations
Explore all metrics

Abstract

Handwritten text recognition systems commonly combine character classification confidence scores and context models for evaluating candidate segmentation-recognition paths, and the classification confidence is usually optimized at character level. In this paper, we investigate into different confidence-learning methods for handwritten Chinese text recognition and propose a string-level confidence-learning method, which estimates confidence parameters by directly optimizing the performance of character string recognition. We first compare the performances of parametric (class-dependent and class-independent parameters) and nonparametric (isotonic regression) confidence-learning methods. Then, we propose two regularized confidence estimation methods and particularly, a string-level confidence-learning method under the minimum classification error criterion. In experiments of online handwritten Chinese text recognition, the string-level confidence-learning method is shown to effectively improve the string recognition performance. Using three character classifiers, the character correct rates are improved from 92.39, 90.24 and 88.69 % to 92.76, 90.91 and 89.93 %, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Handwritten Character Recognition Based on Weighted Integral Image and Probability Model

One-Model Ensemble-Learning for Text Recognition of Historical Printings

Handwritten Chinese character recognition using fuzzy image alignment

Article Open access 17 November 2015

References

Liu, C.-L., Jaeger, S., Nakagawa, M.: Online handwritten Chinese character recognition: the state of the art. IEEE Trans. Pattern Anal. Mach. Intell. 26(2), 198–213 (2004)
Article Google Scholar
Cheriet, M., Kharma, N., Liu, C.-L., Suen, C.Y.: Character Recognition Systems: A Guide for Students and Practitioners. Wiley, New Jersey (2007)
Book Google Scholar
Liu, C.-L.: Classifier combination based on confidence transformation. Pattern Recognit. 38(1), 11–28 (2005)
Article MATH Google Scholar
Li, Y.X., Tan, C.L., Ding, X.: A hybrid post-processing system for offline handwritten Chinese script recognition. Pattern Anal. Appl. 8, 272–286 (2005)
Article MathSciNet Google Scholar
Jiang, Y., Ding, X., Fu, Q., Ren, Z.: Context driven Chinese string segmentation and recognition. Struct. Struct. Syntactic Stat. Pattern Recognit. 4109, 127–135 (2006)
Article Google Scholar
Wang, Q.-F., Yin, F., Liu, C.-L.: Improving handwritten Chinese text recognition by confidence transformation. In: Proceedings of the 11th ICDAR, pp. 518–522 (2011)
Lin, X., Ding, X., Chen, M., Zhang, R., Wu, Y.: Adaptive confidence transform based on classifier combination for Chinese character recognition. Pattern Recognit. Lett. 19(10), 975–988 (1998)
Article Google Scholar
Gillick, L., Ito, Y., Young, J.: A probabilistic approach to confidence estimation and evaluation. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Munich, Germany, pp. 879–882 (1997)
Platt, J.: Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In: Smola, A.J., Bartlett, P., Schölkpf, D., Schuurmanns, D. (eds.) Advances in Large Margin Classifiers, pp. 61–74. MIT Press, Cambridge, MA (1999)
Google Scholar
Schürmann, J.: Pattern Classification: A Unified View of Statistical and Neural Approaches. Wiley, New York (1996)
Google Scholar
Barnett, J.A.: Computational methods for a mathematical theory of evidence. In: Proceedings of the 7th IJCAI, pp. 868–875 (1981)
Liu, C.-L., Sako, H., Fujisawa, H.: Effects of classifier structures and training regimes on integrated segmentation and recognition of handwritten numeral strings. IEEE Trans. Pattern Anal. Mach. Intell. 26(11), 1395–1407 (2004)
Article Google Scholar
Zadrozny, B., Elkan, C.: Learning and making decisions when costs and probabilities are both unknown. In: Proceedings of the 7th ACM SIGKDD, pp. 204–213 (2001)
Robertson, T., Wright, F., Dykstra, R.: Order restricted statistical inference, chap. 1. Wiley, New York (1988)
Google Scholar
Zadrozny, B., Elkan, C.: Transforming classifier scores into accurate multiclass probability estimates. In: Proceedings of the 8th SIGKDD (2002)
Ayer, M., Brunk, H., Ewing, G., Reid, W., Silverman, E.: An empirical distribution function for sampling with incomplete information. Ann. Math. Stat. 26(4), 641–647 (1955)
Article MATH MathSciNet Google Scholar
Juang, B.-H., Chou, W., Lee, C.-H.: Minimum classification error rate methods for speech recognition. IEEE Trans. Speech Audio Process. 5(3), 257–265 (1997)
Article Google Scholar
Liu, C.-L., Yin, F., Wang, D.-H., Wang, Q.-F.: CASIA online and offline Chinese handwriting databases. In: Proceedings of the 11th ICDAR, pp. 37–41 (2011)
Wang, D.-H., Liu, C.-L.: String-level learning of confidence transformation for Chinese handwritten text recognition. In: Proceedings of the 21th ICPR, pp. 3208–3211 (2012)
Wang, Q.-F., Yin, F., Liu, C.-L.: Handwritten Chinese text recognition by integrating multiple contexts. IEEE Trans. Pattern Anal. Mach. Intell. 34(8), 1469–1481 (2012)
Article Google Scholar
Chen, M.-Y., Kundu, A., Srihari, S.N.: Variable duration hidden Markov model and morphological segmentation for handwritten word recognition. IEEE Trans. Image Process. 4(12), 1675–1688 (1995)
Article Google Scholar
Robbins, H., Monro, S.: A stochastic approximation method. Ann. Math. Stat. 22, 400–407 (1951)
Article MATH MathSciNet Google Scholar
Chou, W.: Discriminant-function-based minimum recognition error pattern-recognition approach to speech recognition. Proc. IEEE 88(8), 1201–1223 (2000)
Article Google Scholar
Chen, W.-T., Gader, P.: Word level discriminative training for handwritten word recognition. In: Proceedings of the 7th IWFHR, Amsterdam, pp. 393–402 (2000)
Liu, C.-L., Marukawa, K.: Handwritten numeral string recognition: character-level training versus string-level training. In: Proceedings of the 17th ICPR, Cambridge, UK, pp. 405–408 (2004)
Biem, A.: Minimum classification error training for online handwriting recognition. IEEE Trans. Pattern Anal. Mach. Intell. 28(7), 1041–1051 (2006)
Article Google Scholar
Wang, D.-H., Liu, C.-L., Zhou, X.-D.: An approach for real-time recognition of online Chinese handwritten sentences. Pattern Recognit. 45(10), 3661–3675 (2012)
Article Google Scholar
Liu, C.-L., Fujisawa, H.: Classification and learning in character recognition: advances and remaining problems. In: Marinai, S., Fujisawa, H. (eds.) Machine Learning in Document Analysis and Recognition, pp. 139–161. Springer, Berlin (2008)
Chapter Google Scholar
Kimura, F., Takashina, K., Tsuruoka, S., Miyake, Y.: Modified quadratic discriminant functions and the application to Chinese character recognition. IEEE Trans. Pattern Anal. Mach. Intell. 9(1), 149–153 (1987)
Article Google Scholar
Jin, X.-B., Liu, C.-L., Hou, X.: Regularized margin-based conditional log-likelihood loss for prototype learning. Pattern Recognit. 43(7), 2428–2438 (2010)
Article MATH Google Scholar
Liu, C.-L.: One-vs-all training of prototype classifier for pattern classification and retrieval. In: Proceedings of the 20th ICPR, pp. 3328–3331 (2010)
Liu, C.-L., Zhou, X.-D.: Online Japanese character recognition using trajectory-based normalization and direction feature extraction. In: Proceedings of the 10th IWFHR, pp. 217–222 (2006)
Yin, F., Wang, Q.-F., Liu, C.-L.: Integrating geometric context for text alignment of handwritten Chinese documents. In: Proceedings of the 12th ICFHR, pp. 7–12 (2010)
Rabiner, L.R.: A tutorial on hidden Markov models and selective applications in speech recognition. Proc. IEEE 77, 257–286 (1989)
Article Google Scholar
Su, T.-H., Zhang, T.-W., Guan, D.-J., Huang, H.-J.: Off-line recognition of realistic Chinese handwriting using segmentation-free strategy. Pattern Recognit. 42(1), 167–182 (2009)
Article MATH Google Scholar
Zhou, X.-D., Yu, J.-L., Liu, C.-L., Nagasaki, T., Marukawa, K.: Online handwritten Japanese character string recognition incorporating geometric context. In: Proceedings of the 9th ICDAR, Curitiba, Brazil, pp. 48–52 (2007)

Download references

Acknowledgments

This work was supported by the National Natural Science Foundation of China (NSFC) Grant 60933010. The authors would like to thank Xu-Yao Zhang for helpful discussions. The work of Da-Han Wang was partly accomplished at the Institute of Automation of Chinese Academy of Sciences.

Author information

Authors and Affiliations

Center for Pattern Analysis and Machine Intelligence, School of Information Science and Engineering, Xiamen University, Fujian, 361005, China
Da-Han Wang
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, 95 Zhongguan East Road, Beijing, 100190, China
Cheng-Lin Liu

Authors

Da-Han Wang
View author publications
You can also search for this author in PubMed Google Scholar
Cheng-Lin Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Cheng-Lin Liu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, DH., Liu, CL. Learning confidence transformation for handwritten Chinese text recognition. IJDAR 17, 205–219 (2014). https://doi.org/10.1007/s10032-013-0214-3

Download citation

Received: 06 October 2012
Revised: 23 June 2013
Accepted: 21 October 2013
Published: 05 November 2013
Issue Date: September 2014
DOI: https://doi.org/10.1007/s10032-013-0214-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Learning confidence transformation for handwritten Chinese text recognition

Abstract

Access this article

Similar content being viewed by others

Handwritten Character Recognition Based on Weighted Integral Image and Probability Model

One-Model Ensemble-Learning for Text Recognition of Historical Printings

Handwritten Chinese character recognition using fuzzy image alignment

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Learning confidence transformation for handwritten Chinese text recognition

Abstract

Access this article

Similar content being viewed by others

Handwritten Character Recognition Based on Weighted Integral Image and Probability Model

One-Model Ensemble-Learning for Text Recognition of Historical Printings

Handwritten Chinese character recognition using fuzzy image alignment

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation