MCMC Based Generative Adversarial Networks for Handwritten Numeral Augmentation

Zhang, He; Luo, Chunbo; Yu, Xingrui; Ren, Peng

doi:10.1007/978-981-10-6571-2_327

He Zhang³⁸,
Chunbo Luo³⁸,
Xingrui Yu³⁹ &
…
Peng Ren³⁹

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 463))

Included in the following conference series:

International Conference in Communications, Signal Processing, and Systems

101 Accesses

Abstract

In this paper, we propose a novel data augmentation framework for handwritten numerals by incorporating the probabilistic learning and the generative adversarial learning. First, we simply transform numeral images from spatial space into vector space. The Gaussian based Markov probabilistic model is then developed for simulating synthetic numeral vectors given limited handwritten samples. Next, the simulated data are used to pre-train the generative adversarial networks (GANs), which initializes their parameters to fit the general distribution of numeral features. Finally, we adopt the real handwritten numerals to fine-tune the GANs, which greatly increases the authenticity of generated numeral samples. In this case, the outputs of the GANs can be employed to augment original numeral datasets for training the follow-up inference models. Considering that all simulation and augmentation are operated in 1-D vector space, the proposed augmentation framework is more computationally efficient than those based on 2-D images. Extensive experimental results demonstrate that our proposed augmentation framework achieves improved recognition accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Hardcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Ha, T.M., Bunke, H.: Off-Line, handwritten numeral recognition by perturbation method. IEEE Trans. Pattern Anal. Mach. Intell. 19, 535–539 (1997)
Google Scholar
Wu, Y.C., Yin, F., Liu, C.L.: Evaluation of geometric context models for handwritten numeral string recognition. In: 14th ICFHR, pp. 193–198. IEEE Press, Greece (2014)
Google Scholar
Gelfand, A.E., Smith, A.F.: Sampling-based approaches to calculating marginal densities. J. Am. Stat. Assoc. 85, 398–409 (1990)
Google Scholar
Andrieu, C., de Freitas, N., Doucet, A., Jordan, M.I.: An introduction to MCMC for machine learning. Mach. Learn. 50, 5–43 (2003)
Google Scholar
Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., Bengio, Y.: Generative adversarial nets. In: 27th NIPS, pp. 2672–2680. Curran Associates Inc., Canada (2014)
Google Scholar
Denton, E.L., Chintala, S., Fergus, R.: Deep generative image models using a laplacian pyramid of adversarial networks. In: 28th NIPS, pp. 1486–1494. Curran Associates Inc., Canada (2015)
Google Scholar
Metropolis, N., Ulam, S.: The Monte Carlo method. J. Am. Stat. Assoc. 44, 335–341 (1949)
Google Scholar
Geman, S., Geman, D.: Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell. 6, 721–741 (1984)
Google Scholar
Pathak, D., Krahenbuhl, P., Donahue, J., Darrell, T., Efros, A. A.: Context encoders: feature learning by inpainting. In: 2016 CVPR, pp. 2536–2544. IEEE Press, America (2016)
Google Scholar
Dai, B., Lin, D., Urtasun, R., Fidler, S.: Towards Diverse and Natural Image Descriptions via a Conditional GAN. arXiv preprint arXiv:1703.06029 (2017)
Arjovsky, M., Bottou, L.: Towards Principled Methods for Training Generative Adversarial Networks. arXiv preprint arXiv:1701.04862 (2017)
Arjovsky, M., Chintala, S., Bottou, L.: Wasserstein GAN. arXiv preprint arXiv:1703.06029 (2017)
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Wiley, New York (2012)
Google Scholar
Kingma, D., Ba, J.: Adam: A Method for Stochastic Optimization. arXiv preprint arXiv:1412.6980 (2014)
Sutskever, I., Martens, J., Dahl, G.E., Hinton, G.E.: On the importance of initialization and momentum in deep learning. In: 30th ICML, pp. 1139–1147. PMLR, America (2013)
Google Scholar

Download references

Acknowledgments

This work was supported by grants from the Chinese Scholarship Council (CSC) program.

Author information

Authors and Affiliations

College of Engineering, Mathematics and Physical Sciences, University of Exeter, Exeter, EX4 4QF, UK
He Zhang & Chunbo Luo
College of Information and Control Engineering, China University of Petroleum (East China), Qingdao, 266580, China
Xingrui Yu & Peng Ren

Authors

He Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Chunbo Luo
View author publications
You can also search for this author in PubMed Google Scholar
Xingrui Yu
View author publications
You can also search for this author in PubMed Google Scholar
Peng Ren
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to He Zhang or Chunbo Luo .

Editor information

Editors and Affiliations

Department of Electrical Engineering, University of Texas at Arlington, Arlington, Texas, USA
Qilian Liang
College of Electronic and Communication Engineering, Tianjin Normal University, Tianjin, China
Jiasong Mu
Harbin Institute of Technology, Harbin, Heilongjiang, China
Min Jia
College of Electronic and Communication Engineering, Tianjin Normal University, Tianjin, China
Wei Wang
College of Electronic and Communication Engineering, Tianjin Normal University, Tianjin, China
Xuhong Feng
College of Electronic and Communication Engineering, Tianjin Normal University, Tianjin, China
Baoju Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, H., Luo, C., Yu, X., Ren, P. (2019). MCMC Based Generative Adversarial Networks for Handwritten Numeral Augmentation. In: Liang, Q., Mu, J., Jia, M., Wang, W., Feng, X., Zhang, B. (eds) Communications, Signal Processing, and Systems. CSPS 2017. Lecture Notes in Electrical Engineering, vol 463. Springer, Singapore. https://doi.org/10.1007/978-981-10-6571-2_327

Download citation

DOI: https://doi.org/10.1007/978-981-10-6571-2_327
Published: 07 June 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6570-5
Online ISBN: 978-981-10-6571-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics